G10L21/0208

Sound Processing Apparatus and Sound Processing Method
20230238013 · 2023-07-27 ·

A sound processing apparatus includes sound collection circuity that collects a sound and generates a first sound signal, and processing circuitry that estimates an estimated noise, controls a gain of the first sound signal and outputs a second sound signal based on the estimated noise, performs filter processing to reduce a component of a predetermined frequency band of the second sound signal based at least in part on the estimated noise.

Voice Filtering Other Speakers From Calls And Audio Messages
20230005480 · 2023-01-05 · ·

A method includes receiving a first instance of raw audio data corresponding to a voice-based command and receiving a second instance of the raw audio data corresponding to an utterance of audible contents for an audio-based communication spoken by a user. When a voice filtering recognition routine determines to activate voice filtering for at least the voice of the user, the method also includes obtaining a respective speaker embedding of the user and processing, using the respective speaker embedding, the second instance of the raw audio data to generate enhanced audio data for the audio-based communication that isolates the utterance of the audible contents spoken by the user and excludes at least a portion of the one or more additional sounds that are not spoken by the user The method also includes executing.

Voice controlled assistant with coaxial speaker and microphone arrangement
11521624 · 2022-12-06 · ·

A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).

Voice controlled assistant with coaxial speaker and microphone arrangement
11521624 · 2022-12-06 · ·

A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).

Audio communication in a vehicle

An audio communication system for communication between vehicle occupants in a vehicle, including an image capturing device configured to monitor a first vehicle occupant, a processor configured to receive an image of the first vehicle occupant from the image capturing device and determine whether the first vehicle occupant is attracting attention from a second vehicle occupant, a first microphone associated to the first vehicle occupant configured to receive an audio input from the first vehicle occupant in response to the determination of the first vehicle occupant attracting the second vehicle occupant's attention, and a first speaker associated to the second vehicle occupant configured to activate an audio augmentation of the received audio input in the first speaker and output the augmented audio input.

Encoding parameter adjustment method and apparatus, device, and storage medium

An encoding parameter adjustment method is performed at a computer device. The method includes: obtaining a first audio signal, and determining a psychoacoustic masking threshold within a service frequency band in the first audio signal; obtaining a second audio signal, and determining a background environmental noise estimation value of the frequency within the service frequency band in the second audio signal; determining a masking tag corresponding to the service frequency band according to the psychoacoustic masking threshold of the first audio signal and the background environmental noise estimation value of the second audio signal; determining a masking rate of the service frequency band according to the masking tag corresponding to the frequency within the service frequency band; determining a first reference bit rate according to the masking rate of the service frequency band; and configuring an encoding bit rate of an audio encoder based on the first reference bit rate.

Self-voice adaptation
11715483 · 2023-08-01 · ·

Aspects of the subject technology relate to a device including a microphone, a filter and a processor. The filter receives an audio signal including ambient noise and a voice of a user of the device from the microphone. At least a portion of ambient noise is filtered from the audio signal. The processor determines a level of the ambient noise in the received audio signal and dynamically adjusts a gain applied to the filtered audio signal based on the level of the ambient noise.

Self-voice adaptation
11715483 · 2023-08-01 · ·

Aspects of the subject technology relate to a device including a microphone, a filter and a processor. The filter receives an audio signal including ambient noise and a voice of a user of the device from the microphone. At least a portion of ambient noise is filtered from the audio signal. The processor determines a level of the ambient noise in the received audio signal and dynamically adjusts a gain applied to the filtered audio signal based on the level of the ambient noise.

FILTER ADAPTATION STEP SIZE CONTROL FOR ECHO CANCELLATION

In some embodiments, an echo cancellation method which includes adaptation of at least one prediction filter, with adaptation step size controlled using gradient descent on a set of filter coefficients of the filter, where control of the adaptation step size is based at least in part on a direction of adaptation and a predictability of a gradient of adaptation (e.g., a gradient vector). Other aspects of embodiments of the invention include systems, methods, and computer program products for controlling adaptation step size of adaptive (e.g., low-complexity adaptive) echo cancellation. In some embodiments, adaptation step size control is based on a normalized, scaled gradient of adaptation, or includes smoothing of a normalized gradient of adaptation

FILTER ADAPTATION STEP SIZE CONTROL FOR ECHO CANCELLATION

In some embodiments, an echo cancellation method which includes adaptation of at least one prediction filter, with adaptation step size controlled using gradient descent on a set of filter coefficients of the filter, where control of the adaptation step size is based at least in part on a direction of adaptation and a predictability of a gradient of adaptation (e.g., a gradient vector). Other aspects of embodiments of the invention include systems, methods, and computer program products for controlling adaptation step size of adaptive (e.g., low-complexity adaptive) echo cancellation. In some embodiments, adaptation step size control is based on a normalized, scaled gradient of adaptation, or includes smoothing of a normalized gradient of adaptation