Patent classifications
G10L2021/02161
Systems and methods for generating a cleaned version of ambient sound
A first electronic device is provided. While a media content item provided by a media-providing service is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives, from the media-providing service, data that includes an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound, which includes: using the data received from the media-providing service to align the audio stream with the ambient sound; and performing a subtraction operation to subtract the audio stream from the ambient sound. The first electronic device detects a voice command in the cleaned version of the ambient sound.
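The align-then-subtract step in the abstract above can be illustrated with a minimal sketch. It assumes that alignment is done by a brute-force cross-correlation search over integer sample delays and that the playback reaches the microphone at unit gain; neither assumption comes from the abstract. The names `best_lag`, `clean_ambient`, `max_lag`, and `gain` are hypothetical.

```python
def best_lag(ambient, reference, max_lag):
    """Return the delay (in samples) at which `reference` best matches `ambient`.

    Assumption: a plain cross-correlation search over 0..max_lag.
    """
    best, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        overlap = min(len(reference), len(ambient) - lag)
        score = sum(ambient[i + lag] * reference[i] for i in range(overlap))
        if score > best_score:
            best, best_score = lag, score
    return best


def clean_ambient(ambient, reference, max_lag, gain=1.0):
    """Align `reference` to `ambient`, then subtract it sample by sample.

    `gain` is a hypothetical scale for the playback level at the microphone.
    Returns the estimated lag and the cleaned signal.
    """
    lag = best_lag(ambient, reference, max_lag)
    cleaned = list(ambient)
    for i, sample in enumerate(reference):
        if i + lag < len(cleaned):
            cleaned[i + lag] -= gain * sample
    return lag, cleaned
```

A real room adds reverberation and level changes, so in practice the subtraction would likely need an adaptive filter rather than a single delay and gain; the sketch only shows the two steps the abstract names.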
Audio signal processing method and device, and storage medium
An audio signal processing method includes: acquiring audio signals from at least two sound sources respectively through at least two microphones (MICs) to obtain respective original noisy signals of the at least two MICs in a time domain; for each frame in the time domain, using a first asymmetric window to perform a windowing operation on the respective original noisy signals of the at least two MICs to acquire windowed noisy signals; performing time-frequency conversion on the windowed noisy signals to acquire respective frequency-domain noisy signals of the at least two sound sources; acquiring frequency-domain estimated signals of the at least two sound sources according to the frequency-domain noisy signals; and obtaining audio signals produced respectively by the at least two sound sources according to the frequency-domain estimated signals.
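The windowing and time-frequency conversion steps above can be sketched as follows. The abstract does not say what shape the asymmetric window takes, so this example assumes a long rising half-Hann followed by a short falling half-Hann, and it uses a naive DFT in place of whatever transform the method actually uses. The names `asymmetric_window`, `window_frame`, `dft`, and `peak` are hypothetical.

```python
import cmath
import math


def asymmetric_window(length, peak):
    """Build a window that rises over `peak` samples and falls over the rest.

    Assumption: half-Hann segments on both sides of the peak.
    """
    if not 0 < peak < length:
        raise ValueError("peak must lie strictly inside the window")
    window = []
    for n in range(length):
        if n < peak:
            window.append(0.5 * (1 - math.cos(math.pi * n / peak)))
        else:
            fall = length - peak
            window.append(0.5 * (1 + math.cos(math.pi * (n - peak) / fall)))
    return window


def window_frame(frame, window):
    """Apply the window to one time-domain frame from one microphone."""
    return [x * w for x, w in zip(frame, window)]


def dft(frame):
    """Naive discrete Fourier transform, O(n^2); fine for short frames."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
```

Each microphone's frame would be passed through `window_frame` and then `dft` to obtain its frequency-domain noisy signal; the source separation stage that follows is not sketched here.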
Apparatus and method for reducing noise in an audio signal
An apparatus for processing an audio signal includes an audio signal analyzer and a filter. The audio signal analyzer is configured to analyze an audio signal to determine a plurality of noise suppression filter values for a plurality of bands of the audio signal, wherein the analyzer is configured to determine a noise suppression filter value so that a noise suppression filter value is greater than or equal to a minimum noise suppression filter value and so that the minimum noise suppression value depends on a characteristic of the audio signal. The filter is configured for filtering the audio signal, wherein the filter is adjusted based on the noise suppression filter values.
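The idea of a per-band gain that never drops below a signal-dependent minimum can be sketched as below. The abstract does not name the gain rule or the signal characteristic, so this example assumes a Wiener-style gain and uses the mean band power as the characteristic. The function names, the two floor values, and the threshold are all hypothetical.

```python
def min_gain_from_level(band_power, quiet_floor=0.1, loud_floor=0.3, threshold=1.0):
    """Pick the minimum gain from a signal characteristic.

    Assumption: the characteristic is the mean band power, and louder
    signals get a higher floor. All numeric values are illustrative.
    """
    mean_power = sum(band_power) / len(band_power)
    return loud_floor if mean_power >= threshold else quiet_floor


def suppression_gains(band_power, noise_power):
    """Return one gain per band, each clamped to the signal-dependent floor."""
    floor = min_gain_from_level(band_power)
    gains = []
    for power, noise in zip(band_power, noise_power):
        # Wiener-style gain: estimated clean power over observed power.
        wiener = max(power - noise, 0.0) / power if power > 0 else 0.0
        gains.append(max(wiener, floor))
    return gains
```

For band powers `[4.0, 1.0, 0.5]` with a flat noise estimate of `1.0`, the mean power exceeds the threshold, so the floor is `0.3` and the two noise-dominated bands are held at that floor instead of being zeroed.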
Suppressing or reducing effects of wind turbulence
A method of operation of a device includes receiving an input signal at the device. The input signal is generated using at least one microphone. The input signal includes a first signal component having a first amount of wind turbulence noise and a second signal component having a second amount of wind turbulence noise that is greater than the first amount of wind turbulence noise. The method further includes generating, based on the input signal, an output signal at the device. The output signal includes the first signal component and a third signal component that replaces the second signal component. A first frequency response of the input signal corresponds to a second frequency response of the output signal.
Spectral blending with interior microphone
A headphone can include a plurality of exterior microphones that generate corresponding exterior microphone signals; an accelerometer that generates an accelerometer signal; and an interior microphone, not directly exposed to the environment, that generates an interior microphone signal. A processor of the headphone can be configured to generate an audio signal containing the voice of a user, based on a) the accelerometer signal, b) the interior microphone signal, and c) the plurality of exterior microphone signals.
Adaptive null forming and echo cancellation for selective audio pick-up
Audio pickup systems and methods are provided to enhance an audio signal by removing noise components related to an acoustic environment. The systems and methods receive a primary signal and one or more reference signals from various microphones. Adaptive filtering and combining minimize the energy content of a resulting output signal, e.g., to form a substantially null output when the system is in a static acoustic environment. When the system is a playback sound source, one or more echo cancellers may contribute to removing content from the output signal. A change in the acoustic environment, such as a new sound source, causes content to appear in the output signal until the adaptive filtering adapts to the new environment. In some examples, a desired content such as a wake-up word is detected and adaptation is stopped.
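The behaviour described above (adapt until the output is nearly null, then freeze adaptation when desired content is detected) can be sketched with a normalized LMS filter. The abstract does not name the adaptive algorithm, so NLMS is an assumption here, as are the function name `nlms_cancel` and the parameters `taps`, `mu`, `adapt`, `weights`, and `eps`.

```python
def nlms_cancel(primary, reference, taps=4, mu=0.5, adapt=True, weights=None, eps=1e-8):
    """Subtract a filtered copy of `reference` from `primary`.

    Assumption: normalized LMS with a short FIR filter. Pass adapt=False,
    together with previously learned `weights`, to freeze the filter, for
    example after a wake-up word has been detected.
    Returns the output signal and the final filter weights.
    """
    w = list(weights) if weights is not None else [0.0] * taps
    output = []
    for n in range(len(primary)):
        # Most recent `taps` reference samples, newest first, zero-padded.
        x = [reference[n - k] if n - k >= 0 else 0.0 for k in range(taps)]
        estimate = sum(wi * xi for wi, xi in zip(w, x))
        error = primary[n] - estimate  # this is the output signal
        output.append(error)
        if adapt:
            norm = sum(xi * xi for xi in x) + eps
            w = [wi + mu * error * xi / norm for wi, xi in zip(w, x)]
    return output, w
```

With adaptation running in a static environment, the output energy decays toward zero. Once the weights are frozen, a new sound source stays in the output instead of being adapted away, which is the point of stopping adaptation when a wake-up word is detected.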
Outputting notifications using device groups
A system that determines that devices are co-located in an acoustic region and selects a single device to which to send incoming notifications for the acoustic region. The system may group devices into separate acoustic regions based on selection data that selects between similar audio data received from multiple devices. The system may select the best device for each acoustic region based on a frequency that the device was selected previously, input/output capabilities of the device, a proximity to a user, or the like. The system may send a notification to a single device in each of the acoustic regions so that a user receives a single notification instead of multiple unsynchronized notifications. The system may also determine that acoustic regions are associated with different locations and select acoustic regions to which to send a notification based on location.
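The per-region selection described above can be sketched as a simple scoring pass. The abstract lists selection frequency, input/output capabilities, and proximity as possible criteria but gives no formula, so the `Device` fields, the `score` weighting, and the function `pick_per_region` below are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Device:
    name: str
    region: str          # acoustic region the device was grouped into
    times_selected: int  # how often it was selected previously
    has_screen: bool     # stand-in for input/output capabilities
    distance_m: float    # estimated distance to the user


def score(device):
    """Illustrative score only; the weights are assumptions, not from the source."""
    return device.times_selected + (2.0 if device.has_screen else 0.0) - device.distance_m


def pick_per_region(devices):
    """Return the name of the best-scoring device for each acoustic region."""
    best = {}
    for device in devices:
        current = best.get(device.region)
        if current is None or score(device) > score(current):
            best[device.region] = device
    return {region: device.name for region, device in best.items()}
```

A notification would then be sent once per region, to the device named in the returned mapping.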
Open active noise cancellation system
Embodiments of the present disclosure set forth a method of reducing noise in an audio signal. The method includes determining, based on sensor data acquired from a first set of sensors, a first position of a user in an environment. The method also includes acquiring, via the first set of sensors, one or more audio signals associated with sound in the environment and identifying one or more noise elements in the one or more audio signals. The method also includes generating a first directional audio signal based on the one or more noise elements. When the first directional audio signal is outputted by a first speaker, the first speaker produces a first acoustic field that attenuates the one or more noise elements at the first position.
Voice interface and vocal entertainment system
A system and method that enhances spoken utterances and provides entertainment by capturing one or more microphone signals containing echo and decomposing the one or more microphone signals into a plurality of signal paths through a synthesizer that adds or makes non-linear modifications to some of the captured one or more microphone signals. The system and method estimates multiple echo paths from each of the one or more microphones. The system and method processes the captured microphone signals in response to the estimated plurality of echo paths by subtracting the echo contributions of each of the plurality of echo paths from the captured one or more microphone signals. The system and method also provides signal separation and post-processing functions that render speech recognition gaming applications.
Voice acquisition control method and device, and TWS earphones
A method for speech collection control applied to a master earphone is provided. The method includes: activating a microphone of the master earphone to collect noise and transmitting an activating instruction to a slave earphone, when a user speech is detected, so that the slave earphone controls a microphone of the slave earphone to collect noise in response to the activating instruction; determining an earphone located in an environment with lower noise based on noise data collected by the master earphone and noise data collected by the slave earphone; and controlling a microphone of the earphone located in the environment with lower noise to collect the user speech. A method for speech collection control applied to a slave earphone is further provided.
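The selection step above can be sketched as a comparison of noise levels. The abstract does not say how noise is measured, so this example assumes the root-mean-square level of each earphone's noise samples; `rms` and `pick_quieter` are hypothetical names, and ties go to the master earphone by assumption.

```python
import math


def rms(samples):
    """Root-mean-square level of a block of noise samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def pick_quieter(master_noise, slave_noise):
    """Return which earphone's microphone should collect the user speech.

    Assumption: lower RMS noise wins, and a tie favours the master.
    """
    return "master" if rms(master_noise) <= rms(slave_noise) else "slave"
```

For example, `pick_quieter([0.1, -0.1], [0.5, -0.5])` selects the master earphone because its noise level is lower.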