G10L25/06

LINEAR PREDICTION ANALYSIS DEVICE, METHOD, PROGRAM, AND STORAGE MEDIUM

An autocorrelation calculation unit 21 calculates an autocorrelation R.sub.O(i) from an input signal. A prediction coefficient calculation unit 23 performs linear prediction analysis by using a modified autocorrelation R′.sub.O(i) obtained by multiplying a coefficient w.sub.O( ) by the autocorrelation R.sub.O(i). It is assumed here, for each order i of some orders i at least, that the coefficient w.sub.O(i) corresponding to the order i is in a monotonically increasing relationship with an increase in a value that is negatively correlated with a fundamental frequency of the input signal of the current frame or a past frame.

NOISE SUPRESSION FOR SPEECH ENHANCEMENT

A noise suppression method includes transforming a time-domain input signal into an input spectrum that is the spectrum of the input signal, the input signal comprising speech components and noise components, and the input spectrum comprising a speech spectrum that is the spectrum of the speech components and a noise spectrum that is the spectrum of the noise components, smoothing magnitudes of the input spectrum to provide a smoothed-magnitude input spectrum, and estimating basic suppression filter coefficients from the input spectrum and the smoothed input spectrum. The method further includes determining noise suppression filter coefficients from the estimated basic suppression filter coefficients and a spectral correlation factor, the spectral correlation factor indicating whether speech is present in the input signal or not, filtering the input spectrum based on the noise suppression filter coefficients to generate an output spectrum; and transforming the output spectrum into a time-domain output signal.

NOISE SUPRESSION FOR SPEECH ENHANCEMENT

A noise suppression method includes transforming a time-domain input signal into an input spectrum that is the spectrum of the input signal, the input signal comprising speech components and noise components, and the input spectrum comprising a speech spectrum that is the spectrum of the speech components and a noise spectrum that is the spectrum of the noise components, smoothing magnitudes of the input spectrum to provide a smoothed-magnitude input spectrum, and estimating basic suppression filter coefficients from the input spectrum and the smoothed input spectrum. The method further includes determining noise suppression filter coefficients from the estimated basic suppression filter coefficients and a spectral correlation factor, the spectral correlation factor indicating whether speech is present in the input signal or not, filtering the input spectrum based on the noise suppression filter coefficients to generate an output spectrum; and transforming the output spectrum into a time-domain output signal.

Audio-based link generation

First and second speech data can be received from respective first and second devices. The first and second speech data can be determined to be from a same dialog. A link can be generated based on the dialog.

METHOD FOR PROCESSING AN AUDIO SIGNAL, SIGNAL PROCESSING UNIT, BINAURAL RENDERER, AUDIO ENCODER AND AUDIO DECODER
20230032120 · 2023-02-02 ·

A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation has generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.

METHOD FOR PROCESSING AN AUDIO SIGNAL, SIGNAL PROCESSING UNIT, BINAURAL RENDERER, AUDIO ENCODER AND AUDIO DECODER
20230032120 · 2023-02-02 ·

A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation has generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.

DETECTION OF ATTACHMENT PROBLEM OF APPARATUS BEING WORN BY USER
20230091735 · 2023-03-23 · ·

Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.

DETECTION OF ATTACHMENT PROBLEM OF APPARATUS BEING WORN BY USER
20230091735 · 2023-03-23 · ·

Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.

Audio fingerprinting for meeting services

The present technology can receive audio segments from sources within one or more conference room, and can create audio fingerprints from the sources. The audio fingerprints are optimized for audio in conference room environments, which include distortions from room impulse responses, and various encoding used by telecommunication networks. In some embodiments, when two audio segments are matched, a user equipment can be instructed to mute its speakers to avoid feedback. In some embodiments, when two audio segments are matched, a user equipment can be given instructions to join a conference taking place in the room in when the audio segment originated.

Audio fingerprinting for meeting services

The present technology can receive audio segments from sources within one or more conference room, and can create audio fingerprints from the sources. The audio fingerprints are optimized for audio in conference room environments, which include distortions from room impulse responses, and various encoding used by telecommunication networks. In some embodiments, when two audio segments are matched, a user equipment can be instructed to mute its speakers to avoid feedback. In some embodiments, when two audio segments are matched, a user equipment can be given instructions to join a conference taking place in the room in when the audio segment originated.