Patent classifications
G10L21/034
COMPENSATION FOR FACE COVERINGS IN CAPTURED AUDIO
The technology disclosed herein enables compensation for attenuation caused by face coverings in captured audio. In a particular embodiment, a method includes determining that a face covering is positioned to cover the mouth of a user of a user system. The method further includes receiving audio that includes speech from the user and adjusting amplitudes of frequencies in the audio to compensate for the face covering.
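The frequency adjustment described in this abstract can be illustrated with a minimal sketch. It assumes the audio has already been split into frequency bins and that a per-band attenuation profile for the face covering is known. The function name `compensate_bins`, the `MASK_ATTENUATION_DB` table, and every attenuation value in it are hypothetical placeholders and are not taken from the patent.

```python
# Hypothetical attenuation profile: (low_hz, high_hz, loss_db).
# These numbers are illustrative assumptions, not measured values.
MASK_ATTENUATION_DB = [
    (0.0, 2000.0, 0.0),
    (2000.0, 4000.0, 3.0),
    (4000.0, 8000.0, 6.0),
]


def db_to_linear(db):
    """Convert a gain in decibels to a linear amplitude factor."""
    return 10.0 ** (db / 20.0)


def compensate_bins(bins, covering_detected, profile=MASK_ATTENUATION_DB):
    """Boost each (frequency_hz, amplitude) bin by the loss assumed for its band.

    If no face covering was detected, the bins are returned unchanged.
    Bins outside every band in the profile also receive no boost.
    """
    if not covering_detected:
        return list(bins)
    adjusted = []
    for freq, amp in bins:
        gain_db = 0.0
        for low, high, loss_db in profile:
            if low <= freq < high:
                gain_db = loss_db
                break
        adjusted.append((freq, amp * db_to_linear(gain_db)))
    return adjusted
```

In this sketch a bin at 3000 Hz would be scaled by the linear equivalent of 3 dB, while a bin at 1000 Hz would pass through unchanged.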
HEARING AID WITH VOICE RECOGNITION
A system for selectively amplifying audio signals may include a microphone configured to capture sounds from an environment of a user. The system may also include a processor programmed to: receive audio signals representative of the sounds captured by the microphone; cause selective conditioning of at least one audio signal received by the microphone from a region associated with the recognized individual; and cause transmission of the at least one conditioned audio signal to a hearing interface device configured to provide sound to an ear of the user.
Systems and methods for generating a volume-based response for multiple voice-operated user devices
Systems and methods are provided herein for responding to a voice command at a volume level based on a volume level of the voice command. For example, a media guidance application may detect, through a first voice-operated user device of a plurality of voice-operated user devices, a voice command spoken by a user. The media guidance application may determine a first volume level of the voice command. Based on the volume level of the voice command, the media guidance application may determine that a second voice-operated user device of the plurality of voice-operated user devices is closer to the user than any of the other voice-operated user devices. The media guidance application may generate an audible response, through the second voice-operated user device, at a second volume level that is set based on the first volume level of the voice command.
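One way to picture the selection and volume logic in this abstract is the sketch below. It assumes that each device reports the level at which it heard the command and that the device with the loudest capture is the closest one. The abstract does not spell out that rule, so treat it as an assumption. The name `plan_response` and the clamping bounds are likewise hypothetical.

```python
def plan_response(first_device, levels_db, min_db=40.0, max_db=75.0):
    """Pick a responding device and a response level.

    levels_db maps a device id to the command level (in dB) that the
    device measured. The loudest capture is assumed to mark the nearest
    device. The response level follows the level measured at the first
    device, clamped to a hypothetical comfortable range.
    """
    nearest = max(levels_db, key=levels_db.get)
    command_level = levels_db[first_device]
    return nearest, max(min_db, min(max_db, command_level))
```

For example, if the kitchen device detects the command at 52 dB and the living-room device hears it at 61 dB, this sketch would answer through the living-room device at 52 dB.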
Method, apparatus and computer-readable media utilizing positional information to derive AGC output parameters
Method and apparatus for automatic gain control utilizing sound source position information in a shared space having a plurality of microphones and a plurality of sound sources. Sound signals are received from the microphones. One or more processors locate position information corresponding to each of the sound sources. The processor(s) determine the distance to each of the sound sources from each of the microphones. The processor(s) define a predetermined gain weight adjustment for each of the microphones. The processor(s) apply the defined weight adjustments to the microphones to achieve a consistent volume of the desired plurality of sound sources. The processor(s) maintain a consistent ambient sound level regardless of the position of the sound sources and the applied gain weight adjustments. The processor(s) output a summed signal of the sound sources at a consistent volume with a constant ambient sound level across the plurality of sound source positions.
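The distance-to-gain step can be sketched as follows, assuming free-field propagation in which amplitude falls off as the inverse of distance. The abstract does not state which gain law is used, so the inverse-distance rule, the function name `gain_weights`, and the `ref_distance` parameter are assumptions. The sketch does not attempt the ambient-level normalization that the abstract also mentions.

```python
import math


def gain_weights(source_pos, mic_positions, ref_distance=1.0):
    """Return one linear gain weight per microphone for a single source.

    Under the assumed inverse-distance law, a source at distance d is
    heard at 1/d of its level at the reference distance, so a weight of
    d / ref_distance restores a consistent level.
    """
    weights = []
    for mic in mic_positions:
        d = max(math.dist(source_pos, mic), 1e-6)  # avoid a zero weight
        weights.append(d / ref_distance)
    return weights
```

With a source at the origin and microphones 1 m and 2 m away, the far microphone would receive twice the weight of the near one.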
Speech pre-processing in a voice interactive intelligent personal assistant
A method for speech level adaptation includes: (A) receiving, by a voice interactive intelligent personal assistant, multiple input audio signals that include first and second groups of input audio signals; (B) buffering the multiple input audio signals; (C) searching for a voice trigger in the first group of input audio signals; and, when the voice trigger is found, (D) determining a linear gain factor to be applied to the second group of buffered audio signals; (E) applying the linear gain factor to the second group of buffered audio signals to provide output audio signals; and (F) applying a speech recognition process to the output audio signals to detect an audio command embedded in the output audio signals.
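Steps (D) and (E) can be sketched as below, assuming the gain is derived from the RMS level of the detected trigger segment. The abstract does not say how the factor is computed, so the RMS rule, the names `linear_gain_factor` and `apply_gain`, and the `target_rms` and `max_gain` values are hypothetical. Samples are assumed to be floats in the range -1.0 to 1.0.

```python
import math


def rms(samples):
    """Root-mean-square level of a list of float samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def linear_gain_factor(trigger_samples, target_rms=0.1, max_gain=10.0):
    """Step (D): one linear factor that would bring the trigger to target_rms.

    A silent or empty trigger yields a neutral gain of 1.0, and the
    factor is capped at max_gain so quiet input is not boosted without limit.
    """
    level = rms(trigger_samples)
    if level == 0.0:
        return 1.0
    return min(max_gain, target_rms / level)


def apply_gain(buffered_samples, gain):
    """Step (E): scale the buffered second group, clipping to [-1.0, 1.0]."""
    return [max(-1.0, min(1.0, gain * s)) for s in buffered_samples]
```

Because the factor is a single multiplier applied to the whole buffered command, the relative dynamics of the speech are preserved before it reaches the recognizer.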
Identifying information and associated individuals
A hearing aid system for individual identification may include a wearable camera, a microphone, and at least one processor. The processor may be programmed to receive a plurality of images captured by the wearable camera; receive audio signals representative of sounds captured by the microphone; and identify a first audio signal, from among the received audio signals, representative of a voice of a first individual. The processor may transcribe and store, in a memory, text corresponding to speech associated with the voice of the first individual and determine whether the first individual is a recognized individual. If the first individual is a recognized individual, the processor may associate an identifier of the first recognized individual with the stored text corresponding to the speech associated with the voice of the first individual.