H04R2227/009

Testing device capture performance for multiple speakers

Systems and methods are described for measuring capture performance of multiple voice signals. A first speech signal is applied to a device, and measured at far-end at a far-end of a testing environment. A second speech signal is separately applied to the device, and is also measured at the far end. The measured speech signals are added, and a quality assessment model is applied to the first far-end combined signal to obtain a first quality metric. The first speech signal and the second speech signal are then both applied at the same time to the device and measured at the far-end. The quality assessment model is applied to the second far-end combined signal to obtain a second quality metric. The quality metric for the second far-end combined signal is normalized, based on the first quality metric, to obtain a performance index for the device.

Linear Filtering for Noise-Suppressed Speech Detection
20190355384 · 2019-11-21 ·

Systems and methods for suppressing noise and detecting voice input in a multi-channel audio signal captured by a plurality of microphones include (i) capturing a first audio signal via a first microphone and a second audio signal via a second microphone, wherein the first and second audio signals respectively comprises first and second noise content from a noise source; (ii) identifying the first noise content in the first audio signal; (iii) using the identified first noise content to determine an estimated noise content captured by the plurality of microphones; (iv) using the estimated noise content to suppress the first and second noise content in the first and second audio signals; (v) combining the suppressed first and second audio signals into a third audio signal; and (vi) determining that the third audio signal includes a voice input comprising a wake word.

Hearing-assist systems and methods for audio quality enhancements in performance venues

To enable a patron of a venue attending a performance to feel as if he or she is front and center even if located much farther away or angled from a stage of the venue, system and methods that enhance audio quality of audio signals especially over the voice frequency spectrum to generate a processed audio signal including an increased voice frequency spectrum are provided. The processed audio signal may be communicated to a receiving system with a headset or other transducer (e.g., loudspeakers) that allows for ambient sounds to be heard by a patron and for the patron to control volume of the processed audio signal, thereby mixing the processed audio signal with the ambient sounds. The receiving system may be rented and an operator may control usage for both performance or long-term rentals using various control features of the processed audio signals and/or receiving system.

AUDIO PICKUP AND PLAY CIRCUIT AND SYSTEM, AND METHOD FOR SWITCHING AUDIO PICKUP AND PLAY
20190327555 · 2019-10-24 ·

The present disclosure relates to a circuit and system for audio pickup and play, and a method for switching audio pickup and play. The circuit includes a microphone array for collecting a first audio signal, a first wireless audio transceiver for receiving a second audio signal from a remote wireless microphone, an audio processor, a first audio player, a controller for outputting a first control signal and a second control signal, an audio input switcher for inputting the first audio signal or the second audio signal into the audio processor based on the first control signal. The audio processor is configured for processing the received first audio signal and second audio signal and outputting an audio play signal. The circuit includes an audio output switcher for outputting the audio play signal to the first audio player or the first wireless audio transceiver based on the second control signal.

Noise mitigation using machine learning

This disclosure relates to solutions for eliminating undesired audio artifacts, such as background noises, on an audio channel. A process for implementing the technology can include receiving a set of audio segments, analyzing the segments using a first ML model to identify a first probability of unwanted background noises in the segments, and if the first probability exceeds a threshold, analyzing the segments using a second ML model to determine a second probability that the one or more background features exist in the segments. In some aspects, the process can include attenuating audio artifacts in the segments, if the second probability exceeds a second threshold. In some implementations, dynamic time stretching and shrinking can be applied to the noise attenuation. Systems and machine-readable media are also provided.

Dynamic Player Selection for Audio Signal Processing
20190304481 · 2019-10-03 ·

A set of signal measures is sent, wherein each signal measure in the set of signal measures corresponds to a respective audio signal received by a playback device in a media playback system and is processed based on a first set of audio processing algorithms. A plurality of signal measures is identified in the set of signal measures. Audio signals corresponding to the identified plurality of signal measures are processed by one or more devices in the media playback system to improve a signal measure of each of the audio signals. The audio signals are processed based on a second set of audio processing algorithms. The processed audio signals are combined into a combined audio signal.

PROCESSING VOICE

A method and an apparatus for processing voice are provided. The method is applied to a decision-making device in communication with a distributed microphone array and the distributed microphone array comprises a plurality of sub-arrays. The method comprises: obtaining, for each sub-array, an awakening voice signal received by each microphone of the sub-array; determining, for each sub-array, a frequency domain signal corresponding to each awakening voice signal of the sub-array, and a first cross-correlation function between every two frequency domain signals; determining an awakened sub-array based on each first cross-correlation function for each sub-array.

Systems and methods for enhancing a signal-to-noise ratio
10388265 · 2019-08-20 · ·

Provided are methods and apparatus for enhancing a signal-to-noise ratio. In an example, provided is an apparatus configured to modify audio to better match the way the human brain processes audio by modifying the audio to a form which takes advantage of human echolocation capabilities. When humans listen to audio, they subconsciously listen for an echo and thus subconsciously focus on listening to, and for, meaningful information in audio. The focus causes humans to ignore noise in the audio, which results in enhancing a signal-to-noise ratio. In an example, the provided apparatus compensates for shortcomings of a device to which the apparatus is coupled by adjusting a respective amplitude of at least one constituent audio frequency of an output digital audio stream of the apparatus.

VEHICULAR SOUND PROCESSING SYSTEM
20190246225 · 2019-08-08 ·

A vehicular sound processing system includes a plurality of interior microphones that detect interior sound emanating from within the interior cabin of the vehicle, and a plurality of exterior microphones that detect exterior sound emanating from exterior the vehicle. The interior microphones also detect exterior sound. A sound processor processes exterior microphone signals to determine exterior sound detected by the exterior microphones. The sound processor processes interior microphone signals to distinguish voices of occupants present within the interior cabin from non-vocal sound emanating from within the interior cabin and from exterior sound emanating from exterior the vehicle. The sound processor processes the exterior microphone signals to determine a sound of interest emanating from exterior of the vehicle. Responsive to determination of the sound of interest, the sound of interest is played by loudspeakers so that a driver of the vehicle can hear the sound of interest.

AUDIO OUTPUT CONTROL

Systems and methods for audio output control are disclosed. Audio may be output via a speaker of a communal device associated with a first portion of an environment. A user may provide a user utterance indicating an intent to add another device in a second portion of the environment to the audio-output session, and/or an intent to move the audio-output session from the first device to the second device, and/or an intent to remove a device from an audio-output session. Based on this determined intent, audio-session queues may be associated and dissociated from devices and device states may be altered to effectuate the intent of the user utterance.