Patent classifications
H04R3/04
MATCHED AND EQUALIZED MICROPHONE OUTPUT OF AUTOMOTIVE MICROPHONE SYSTEMS
A vehicle microphone system may include at least two microphones forming a microphone array, at least one loudspeaker configured to emit audio signals, and a processor coupled to a memory and programmed to receive incoming audio signals from the microphone array, determine at least one parameter for each channel of the microphone array, determine at least one filter to apply to at least one channel based on a difference between the parameters of each channel, and store the at least one filter in the memory.
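The channel-matching idea in this abstract can be illustrated with a minimal sketch. It assumes the per-channel parameter is a broadband RMS level and the "filter" is a single gain per channel; the abstract does not say which parameter or filter type is used, and the function names below are hypothetical.

```python
import numpy as np

def channel_levels_db(frames):
    """RMS level of each channel in dB; frames has shape (channels, samples)."""
    rms = np.sqrt(np.mean(np.square(frames), axis=1))
    return 20.0 * np.log10(np.maximum(rms, 1e-12))  # floor avoids log(0)

def matching_gains(frames, reference=0):
    """Linear gain per channel that brings its level to the reference channel's.

    Assumption: the parameter difference is a level difference in dB and the
    correction is a flat gain, not a frequency-dependent filter.
    """
    levels = channel_levels_db(frames)
    diff_db = levels[reference] - levels
    return 10.0 ** (diff_db / 20.0)

# Synthetic two-microphone recording: the second channel is 6 dB less sensitive.
rng = np.random.default_rng(0)
source = rng.standard_normal(16000)
frames = np.stack([source, 0.5 * source])

gains = matching_gains(frames)        # stand-in for the stored filters
matched = frames * gains[:, None]     # apply the correction to each channel
```

A frequency-dependent version would compute the level per band and derive one gain per band, but the comparison against a reference channel would follow the same pattern.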
Joint Acoustic Echo Cancelation, Speech Enhancement, and Voice Separation for Automatic Speech Recognition
A method for automatic speech recognition using joint acoustic echo cancellation, speech enhancement, and voice separation includes receiving, at a contextual frontend processing model, input speech features corresponding to a target utterance. The method also includes receiving, at the contextual frontend processing model, at least one of a reference audio signal, a contextual noise signal including noise prior to the target utterance, or a speaker embedding including voice characteristics of a target speaker that spoke the target utterance. The method further includes processing, using the contextual frontend processing model, the input speech features and the at least one of the reference audio signal, the contextual noise signal, or the speaker embedding to generate enhanced speech features.
EFFICIENT SEAMLESS SWITCHING OF SIGMA-DELTA MODULATORS
A digital microphone includes at least one integrator; a state detection and parameter control component directly coupled to an output of the integrator; and a signal processing component coupled to an output of the state detection and parameter control component, wherein a parameter of the signal processing component includes a first value in a first operational mode and a second value in a second operational mode different from the first operational mode.
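As a rough illustration of a parameter taking different values in two operational modes, the sketch below simulates a first-order, 1-bit sigma-delta modulator whose integrator gain changes once. Everything specific here is an assumption rather than something stated in the abstract: the choice of integrator gain as the switched parameter, the rule of delaying the switch until the integrator output is near zero, and all names and numeric values.

```python
import numpy as np

def sigma_delta(x, gain_a=1.0, gain_b=0.5, request_at=1000, threshold=0.25):
    """First-order 1-bit sigma-delta modulator with a two-mode integrator gain.

    All parameter names and the switching rule are hypothetical.
    Returns the +/-1 bitstream and the sample index at which the gain
    changed (None if it never changed).
    """
    state = 0.0
    gain = gain_a
    switched_at = None
    bits = np.empty(len(x))
    for n, sample in enumerate(x):
        # State detection: once a switch is requested, wait until the
        # integrator output is close to zero before changing the parameter.
        if switched_at is None and n >= request_at and abs(state) < threshold:
            gain = gain_b
            switched_at = n
        bit = 1.0 if state >= 0.0 else -1.0   # 1-bit quantizer
        bits[n] = bit
        state += gain * (sample - bit)        # integrator with feedback
    return bits, switched_at

# Constant input: the bitstream average should track it across the mode change.
x = np.full(4000, 0.3)
bits, switched_at = sigma_delta(x)
```

Switching while the integrator state is small keeps the accumulated error bounded across the mode change, which is one plausible reading of "seamless"; the actual mechanism may differ.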
PROCESSING DEVICE AND PROCESSING METHOD
A processing device according to this embodiment includes: a frequency characteristics acquisition unit configured to acquire frequency characteristics of an input signal; an extreme value extraction unit configured to extract an extreme value of spectral data; a kurtosis calculation unit configured to: calculate an evaluation value from spectral data; and calculate a kurtosis of a peak or a dip based on a plurality of evaluation values calculated by changing a calculation width, the evaluation value being used for evaluating the peak or the dip corresponding to the extreme value; a determination unit configured to determine whether to suppress the peak or the dip according to a comparison result between the kurtosis and a threshold value; and a suppression unit configured to suppress the peak or the dip with the extreme value that is determined to be suppressed.
MULTI-MODAL AUDIO AMPLIFIER AND RELATED SYSTEM
Various aspects include audio amplifiers for driving at least one speaker. In some cases, the amplifier includes: a controller for amplifying at least one input signal to provide an amplified audio output signal, the controller configured to operate the amplifier in at least two modes, including: a first mode including a dedicated connection to the at least one speaker; and a second mode including a direct physical connection with an additional audio amplifier and the dedicated connection to the at least one speaker, where in the first mode and the second mode the amplifier is configured to provide the amplified audio output signal to drive the at least one speaker, and in the second mode the amplifier is configured to forward the at least one input signal to enable the additional audio amplifier to control audio output at an additional speaker.
PROCESSING DEVICE AND PROCESSING METHOD
A processing device according to an embodiment includes: a frequency characteristics acquisition unit configured to acquire frequency characteristics of at least one sound pickup signal; a smoothing processing unit configured to perform smoothing processing so as to generate second spectral data smoother than first spectral data based on the frequency characteristics; a first compression unit configured to calculate a first difference value corresponding to a difference between the second spectral data and the first spectral data in a first band, and to compress the second spectral data based on the first difference value; and a filter generation unit configured to generate a filter, based on the second spectral data.
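The smoothing and difference steps can be sketched as follows. This assumes the spectral data is a log-magnitude spectrum in dB and that smoothing is a plain moving average over frequency bins; neither is stated in the abstract. The compression rule is not described there either, so it is left out, and the function names are hypothetical.

```python
import numpy as np

def smooth_spectrum(spectrum_db, width):
    """Moving average over `width` bins (odd), edge-padded to keep the length."""
    pad = width // 2
    padded = np.pad(spectrum_db, pad, mode="edge")
    kernel = np.ones(width) / width
    return np.convolve(padded, kernel, mode="valid")

def band_difference(first_db, second_db, lo_bin, hi_bin):
    """Mean of (second - first) over bins lo_bin..hi_bin-1."""
    return float(np.mean(second_db[lo_bin:hi_bin] - first_db[lo_bin:hi_bin]))

# First spectral data: flat 0 dB with a narrow 12 dB dip at bin 50.
first = np.zeros(128)
first[50] = -12.0

second = smooth_spectrum(first, width=9)            # smoother second spectral data
diff = band_difference(first, second, 49, 52)       # first difference value
```

A positive difference in the band means the smoothed curve sits above the raw data there, which is the kind of quantity a later compression step could use to decide how far to pull the smoothed curve back.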
Personalized headphone EQ based on headphone properties and user geometry
Audio processing for a headworn device can include obtaining ear geometry of a user. A frequency response or transfer function can be determined, based on the ear geometry of the user and a model of the headworn device, where the frequency response or transfer function characterizes an effect of a path between a speaker of the headworn device and an ear canal entrance of the user on sound. An equalization filter profile can be generated based on the frequency response or transfer function. The equalization filter profile can be applied to an audio signal, and the audio signal can be used to drive the speaker of the headworn device.
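One simple way to turn a frequency response into an equalization profile is to invert it toward a target curve and limit the correction. The sketch below assumes a magnitude response in dB per FFT bin, a flat target, and a 12 dB correction limit; the abstract does not specify any of these, and the function names and the example response are hypothetical.

```python
import numpy as np

def eq_profile_db(response_db, target_db=0.0, max_gain_db=12.0):
    """Per-bin correction in dB that moves the response toward the target,
    clipped so the EQ never boosts or cuts by more than max_gain_db."""
    return np.clip(target_db - response_db, -max_gain_db, max_gain_db)

def apply_eq(signal, eq_db):
    """Apply the profile to one block by scaling its rfft bins.

    eq_db must have len(signal) // 2 + 1 entries. Block-wise FFT scaling is
    circular convolution; a real-time design would use overlap-add or an
    equivalent time-domain filter.
    """
    spectrum = np.fft.rfft(signal)
    return np.fft.irfft(spectrum * 10.0 ** (eq_db / 20.0), n=len(signal))

n = 512
# Hypothetical speaker-to-ear-canal response: a 6 dB bump over bins 24..40.
response_db = np.zeros(n // 2 + 1)
response_db[24:41] = 6.0

eq_db = eq_profile_db(response_db)

# Test tone centered on bin 32, inside the bump.
tone = np.sin(2 * np.pi * 32 * np.arange(n) / n)
equalized = apply_eq(tone, eq_db)
```

In this example the tone falls inside the bump, so the equalized block is the same tone attenuated by 6 dB, cancelling the assumed response at that frequency.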