Patent classifications
G10L21/02
Context-aware signal conditioning for vehicle exterior voice assistant
A vehicle includes a plurality of microphones to obtain speech from a person outside the vehicle as an input signal and a sensor system to determine a location and orientation of the person relative to the vehicle. The vehicle also includes a controller to determine characteristics of the input signal and to determine whether to perform speech enhancement on the input signal based on one or more of the characteristics and the location and orientation of the person.
ENHANCING MUSICAL SOUND DURING A NETWORKED CONFERENCE
Dynamic adjustment of audio characteristics for enhancing musical sound during a networked conference is disclosed. In an embodiment, a method is provided for sound enhancement performed by a device coupled to a network. The method includes receiving an audio signal to be transmitted over the network, detecting when musical content is present in the audio signal, processing the audio signal to enhance voice characteristics to generate an enhanced audio signal when the musical content is not detected, processing the audio signal to enhance music characteristics to generate the enhanced audio signal when the musical content is detected, and transmitting the enhanced audio signal over the network.
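The branching in this method can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract does not say how musical content is detected or how either enhancement works, so `detect_music`, `enhance_voice`, `enhance_music`, and `transmit` are hypothetical callables supplied by the caller.

```python
from typing import Callable, List

Audio = List[float]  # assumption: one frame of mono samples


def process_and_transmit(
    audio: Audio,
    detect_music: Callable[[Audio], bool],    # hypothetical music detector
    enhance_voice: Callable[[Audio], Audio],  # hypothetical voice enhancement
    enhance_music: Callable[[Audio], Audio],  # hypothetical music enhancement
    transmit: Callable[[Audio], None],        # hypothetical network send
) -> Audio:
    """Pick the enhancement path from the detector result, then transmit."""
    if detect_music(audio):
        enhanced = enhance_music(audio)
    else:
        enhanced = enhance_voice(audio)
    transmit(enhanced)
    return enhanced
```

Passing the detector and both enhancers as parameters keeps the sketch independent of any particular signal-processing choice.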
AUDIO CAPTURE USING ROOM IMPULSE RESPONSES
The disclosed technology is generally directed to audio capture. In one example of the technology, recorded sounds are received, where the recorded sounds were emitted from multiple locations in an environment and can be converted to room impulse responses. The room impulse responses are generated from the recorded sounds. Location information that is associated with the multiple locations is received. At least the room impulse responses and the location information are used to generate at least one environment-specific model. Audio captured in the environment is received. An output is generated by processing the captured audio with the at least one environment-specific model such that the output includes at least one adjustment of the captured audio based on at least one acoustical property of the environment.
Trigger word detection with multiple digital assistants
Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for selecting a digital assistant from among multiple digital assistants. An embodiment operates by receiving a voice input containing a trigger word at a first voice adapter associated with a digital assistant that generates a first confidence score for the trigger word. The embodiment further receives the voice input at a second voice adapter that generates a second confidence score for the trigger word. The embodiment determines the first confidence score is higher than the second confidence score. The embodiment selects the digital assistant based on the determining.
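The selection step described above can be sketched as follows, under stated assumptions: the `VoiceAdapter` class, its `score_trigger` callable, and the 0-to-1 confidence range are hypothetical, since the abstract does not specify how a voice adapter computes its confidence score.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class VoiceAdapter:
    """Hypothetical adapter pairing an assistant name with a trigger-word scorer."""
    assistant: str
    score_trigger: Callable[[str], float]  # assumed to return a confidence in [0, 1]


def select_assistant(voice_input: str, adapters: Sequence[VoiceAdapter]) -> Optional[str]:
    """Return the assistant whose adapter reports the highest confidence score."""
    if not adapters:
        return None
    best = max(adapters, key=lambda a: a.score_trigger(voice_input))
    return best.assistant


# Toy scorers standing in for real trigger-word detectors.
adapters = [
    VoiceAdapter("alpha", lambda text: 0.9 if "alpha" in text else 0.1),
    VoiceAdapter("beta", lambda text: 0.8 if "beta" in text else 0.2),
]
chosen = select_assistant("hey alpha, what time is it", adapters)
```

With two adapters this reduces to the comparison in the abstract: the first assistant is selected when its score is higher than the second's.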
Noise reduction device
A noise reduction device includes a processor that converts a noise signal collected by a microphone disposed in a control space into a noise signal in a frequency domain, a storage that stores the converted noise signal in the frequency domain as a reference signal, and a signal generator that generates a noise reduction signal for reducing the noise signal collected by the microphone at a control position of the control space. The processor determines whether or not the noise signal is non-stationary noise based on a frequency characteristic of the converted noise signal in the frequency domain and a frequency characteristic of the reference signal. When it is determined that the noise signal is the non-stationary noise, the processor controls the signal generator so as to cancel generation of the noise reduction signal.
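A minimal sketch of this decision follows. The relative spectral distance, the `threshold` value, and the anti-phase output are all assumptions chosen for illustration; the abstract states only that the frequency characteristics of the current noise and of the stored reference are compared, and that generation is cancelled for non-stationary noise.

```python
import cmath
import math
from typing import List, Optional


def magnitude_spectrum(frame: List[float]) -> List[float]:
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def is_non_stationary(spectrum: List[float], reference: List[float],
                      threshold: float = 0.5) -> bool:
    """Assumed criterion: summed bin-wise deviation relative to the reference energy."""
    diff = sum(abs(s - r) for s, r in zip(spectrum, reference))
    total = sum(reference) or 1.0  # avoid dividing by zero for a silent reference
    return diff / total > threshold


def noise_reduction_signal(frame: List[float], reference: List[float],
                           threshold: float = 0.5) -> Optional[List[float]]:
    """Return an anti-phase signal, or None when generation is cancelled."""
    if is_non_stationary(magnitude_spectrum(frame), reference, threshold):
        return None  # non-stationary noise: cancel generation
    return [-x for x in frame]  # assumption: simple phase inversion


# A steady hum serves as the stored reference; a single click is the transient.
hum = [math.sin(2 * math.pi * 2 * t / 32) for t in range(32)]
reference = magnitude_spectrum(hum)
click = [0.0] * 32
click[5] = 1.0
```

Here `noise_reduction_signal(hum, reference)` yields the inverted hum, while `noise_reduction_signal(click, reference)` yields `None` because the click's flat spectrum differs sharply from the reference.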
POST-MIXING ACOUSTIC ECHO CANCELLATION SYSTEMS AND METHODS
Acoustic echo cancellation systems and methods are provided that can cancel and suppress acoustic echo from the output of a mixer that has mixed audio signals from a plurality of acoustic sources, such as microphones. The microphones may have captured speech and sound from a remote location or far end, such as in a conferencing environment. The acoustic echo cancellation may generate an echo-cancelled mixed audio signal based on a mixed audio signal from a mixer, information gathered from the audio signal from each of the plurality of acoustic sources, and a remote audio signal. The systems and methods may be computationally efficient and resource-friendly.
AUDIO IMPROVEMENT USING CLOSED CAPTION DATA
Methods and systems are described herein for improving audio for hearing-impaired content consumers. An example method may comprise determining a content asset. Closed caption data associated with the content asset may be determined. At least a portion of the closed caption data may be determined based on a user setting associated with a hearing impairment. Compensating audio comprising a frequency translation associated with at least the portion of the closed caption data may be generated. The content asset may be caused to be output with audio content comprising the compensating audio and the original audio.