Patent classifications
G10L21/034
HEARING SYSTEM INCLUDING A HEARING INSTRUMENT AND METHOD FOR OPERATING THE HEARING INSTRUMENT
A hearing system includes a hearing instrument for capturing a sound signal from an environment of the hearing instrument. The captured sound signal is processed, and the processed sound signal is output to a user of the hearing instrument. In a speech recognition step, the captured sound signal is analyzed to recognize speech intervals, in which the captured sound signal contains speech. In a speech enhancement procedure performed during recognized speech intervals, the amplitude of the processed sound signal is periodically varied according to a temporal pattern that is consistent with a stress rhythmic pattern of the user. A method for operating the hearing instrument is also provided.
HEARING SYSTEM INCLUDING A HEARING INSTRUMENT AND METHOD FOR OPERATING THE HEARING INSTRUMENT
A hearing system includes a hearing instrument for capturing a sound signal from an environment of the hearing instrument. The captured sound signal is processed, and the processed sound signal is output to a user of the hearing instrument. In a speech recognition step, the captured sound signal is analyzed to recognize speech intervals, in which the captured sound signal contains speech. In a speech enhancement procedure performed during recognized speech intervals, the amplitude of the processed sound signal is periodically varied according to a temporal pattern that is consistent with a stress rhythmic pattern of the user. A method for operating the hearing instrument is also provided.
SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
A speech enhancement method includes: determining a glottal parameter corresponding to a target speech frame according to a frequency domain representation of the target speech frame; determining a gain corresponding to the target speech frame according to a gain corresponding to a historical speech frame of the target speech frame; determining an excitation signal corresponding to the target speech frame according to the frequency domain representation of the target speech frame; and synthesizing the glottal parameter corresponding to the target speech frame, the gain corresponding to the target speech frame, and the excitation signal corresponding to the target speech frame, to obtain an enhanced speech signal corresponding to the target speech frame.
SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
A speech enhancement method includes: determining a glottal parameter corresponding to a target speech frame according to a frequency domain representation of the target speech frame; determining a gain corresponding to the target speech frame according to a gain corresponding to a historical speech frame of the target speech frame; determining an excitation signal corresponding to the target speech frame according to the frequency domain representation of the target speech frame; and synthesizing the glottal parameter corresponding to the target speech frame, the gain corresponding to the target speech frame, and the excitation signal corresponding to the target speech frame, to obtain an enhanced speech signal corresponding to the target speech frame.
Method for adjusting sound playback and portable device thereof
A method for adjusting sound playback of a portable device for constancy notwithstanding different environments outputs from the portable device detectable audio signals inaudible to user and the device receives reflected audio before the portable device is actually commanded to play an audio file. A list of volume weightings for reflected audio is calculated. Before commencing playback of the audio file, the portable device obtains reference volume weightings from a list according to the current volume setting, and calculates adjustment coefficients for different frequency bands based on weightings of the reference volume list and of the reflected audio list. The audio signals of the audio file are output after adjustment. A portable device is also disclosed.
METHOD AND SYSTEM FOR DYNAMIC VOICE ENHANCEMENT
The present disclosure provides a method and system for voice enhancement. The method and system of the present disclosure may simultaneously perform signal processing of two paths on an input signal. The first path signal processing includes receiving an audio source input and performing dynamic loudness balancing on the audio source input based on a first gain control parameter. The second path signal processing includes: performing voice detection on the audio source input and calculating a detection confidence; and calculating a second gain control parameter based on the detection confidence. The first path signal processing and the second path signal processing may be synchronous or asynchronous. The method of the present disclosure also includes updating the first gain control parameter with the second gain control parameter calculated by a second processing path and performing the first path signal processing based on the updated first gain control parameter.
METHOD AND SYSTEM FOR DYNAMIC VOICE ENHANCEMENT
The present disclosure provides a method and system for voice enhancement. The method and system of the present disclosure may simultaneously perform signal processing of two paths on an input signal. The first path signal processing includes receiving an audio source input and performing dynamic loudness balancing on the audio source input based on a first gain control parameter. The second path signal processing includes: performing voice detection on the audio source input and calculating a detection confidence; and calculating a second gain control parameter based on the detection confidence. The first path signal processing and the second path signal processing may be synchronous or asynchronous. The method of the present disclosure also includes updating the first gain control parameter with the second gain control parameter calculated by a second processing path and performing the first path signal processing based on the updated first gain control parameter.
ADJUSTING AUDIO AND NON-AUDIO FEATURES BASED ON NOISE METRICS AND SPEECH INTELLIGIBILITY METRICS
Some implementations involve determining a noise metric and/or a speech intelligibility metric and determining a compensation process corresponding to the noise metric and/or the speech intelligibility metric. The compensation process may involve altering a processing of audio data and/or applying a non-audio-based compensation method. In some examples, altering the processing of the audio data does not involve applying a broadband gain increase to the audio signals. Some examples involve applying the compensation process in an audio environment. Other examples involve determining compensation metadata corresponding to the compensation process and transmitting an encoded content stream that includes encoded compensation metadata, encoded video data and encoded audio data from a first device to one or more other devices.
ADJUSTING AUDIO AND NON-AUDIO FEATURES BASED ON NOISE METRICS AND SPEECH INTELLIGIBILITY METRICS
Some implementations involve determining a noise metric and/or a speech intelligibility metric and determining a compensation process corresponding to the noise metric and/or the speech intelligibility metric. The compensation process may involve altering a processing of audio data and/or applying a non-audio-based compensation method. In some examples, altering the processing of the audio data does not involve applying a broadband gain increase to the audio signals. Some examples involve applying the compensation process in an audio environment. Other examples involve determining compensation metadata corresponding to the compensation process and transmitting an encoded content stream that includes encoded compensation metadata, encoded video data and encoded audio data from a first device to one or more other devices.
TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.