G10L21/01

Audio processor and method for processing an audio signal using horizontal phase correction

An audio processor for processing an audio signal includes an audio signal phase measure calculator configured for calculating a phase measure of an audio signal for a time frame, a target phase measure determiner for determining a target phase measure for the time frame, and a phase corrector configured for correcting phases of the audio signal for the time frame using the calculated phase measure and the target phase measure to obtain a processed audio signal.
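
The measure, target, and correct flow in the abstract above can be illustrated with a minimal sketch. It assumes, purely for illustration, that the phase measure is the per-bin phase advance between two consecutive time frames; the abstract does not define the measure. The function names `wrap` and `correct_frame` are hypothetical.

```python
import cmath
import math

def wrap(phase):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (phase + math.pi) % (2 * math.pi) - math.pi

def correct_frame(prev_bins, cur_bins, target_advance):
    """Rotate each bin of the current frame so that its phase advance
    relative to the previous frame equals the target (assumed measure)."""
    corrected = []
    for prev, cur, target in zip(prev_bins, cur_bins, target_advance):
        # Calculated phase measure: phase advance from the previous frame.
        measured = wrap(cmath.phase(cur) - cmath.phase(prev))
        # Deviation of the calculated measure from the target measure.
        error = wrap(target - measured)
        # Rotating by the error changes only the phase, not the magnitude.
        corrected.append(cur * cmath.exp(1j * error))
    return corrected
```

Because each bin is only rotated, magnitudes are left untouched and only the phases of the frame change.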

Audio processing method and audio processing device for expanding or compressing audio signals
10891966 · 2021-01-12

An audio processing device includes a feature extraction unit and a signal generating unit. The feature extraction unit is configured to extract a feature quantity of a first audio signal for each of a plurality of periods. The signal generating unit is configured to generate a second audio signal by time-axis expanding/compressing either a section of the first audio signal in which the feature quantity is steadily maintained for a period of time, or a section of the first audio signal in which a fluctuation of the feature quantity is repeated, while excluding from the time-axis expanding/compressing any section of the first audio signal in which the fluctuation of the feature quantity is not similar to that of other sections of the first audio signal.
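
As a rough illustration of the selection rule described above, the sketch below covers only the "steadily maintained" case: it finds runs of periods whose feature value stays within a tolerance and repeats only those periods. The names `steady_runs` and `stretch`, the tolerance, the minimum run length, and the integer repetition factor are all assumptions made for the example; the repeated-fluctuation case is not modelled.

```python
def steady_runs(features, tol, min_len):
    """Return (start, end) index pairs of runs in which the feature stays
    within tol of the run's first value for at least min_len periods."""
    runs = []
    start = 0
    for i in range(1, len(features) + 1):
        run_ended = i == len(features) or abs(features[i] - features[start]) > tol
        if run_ended:
            if i - start >= min_len:
                runs.append((start, i))
            start = i
    return runs

def stretch(periods, features, factor, tol=0.05, min_len=3):
    """Repeat each period inside a steady run `factor` times and copy all
    other periods once, so non-steady sections keep their duration."""
    steady = set()
    for begin, end in steady_runs(features, tol, min_len):
        steady.update(range(begin, end))
    out = []
    for i, period in enumerate(periods):
        out.extend([period] * (factor if i in steady else 1))
    return out
```

With features `[1.0, 1.01, 1.02, 3.0, 5.0, 5.0, 5.0, 5.01]` and a factor of 2, the isolated period at index 3 is copied once while the two steady runs are doubled.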

SOUND PROCESSING METHOD, SOUND PROCESSING APPARATUS, AND RECORDING MEDIUM
20200402525 · 2020-12-24

A method obtains a first sound signal representative of a first sound, including a first spectrum envelope contour and a first reference spectrum envelope contour; obtains a second sound signal, representative of a second sound differing in sound characteristics from the first sound, including a second spectrum envelope contour and a second reference spectrum envelope contour; generates a synthesis spectrum envelope contour by transforming the first spectrum envelope contour based on a first difference between the first spectrum envelope contour and the first reference spectrum envelope contour at a first time point of the first sound signal, and a second difference between the second spectrum envelope contour and the second reference spectrum envelope contour at a second time point of the second sound signal; and generates a third sound signal representative of the first sound that has been transformed using the generated synthesis spectrum envelope contour.
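
The abstract states only that the synthesis contour is "based on" the two differences. One hypothetical way to combine them, shown here as a sketch rather than the claimed method, is to subtract a weighted share of the first sound's own deviation and add a weighted share of the second sound's deviation. The function name and the weights `alpha` and `beta` are assumptions; each argument is a list of per-band envelope values taken at the relevant time point.

```python
def synthesize_envelope(first, first_ref, second, second_ref, alpha=1.0, beta=1.0):
    """Combine two envelope deviations into a synthesis envelope.

    alpha and beta are hypothetical weights, not terms from the abstract.
    """
    out = []
    for e1, r1, e2, r2 in zip(first, first_ref, second, second_ref):
        d1 = e1 - r1  # first difference (first sound, first time point)
        d2 = e2 - r2  # second difference (second sound, second time point)
        out.append(e1 - alpha * d1 + beta * d2)
    return out
```

With `alpha = beta = 1` this reduces to the first reference contour plus the second sound's deviation, which transfers the second sound's expression onto the first sound's baseline.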

Audio processor and method for processing an audio signal using vertical phase correction

An audio processor for processing an audio signal includes a target phase measure determiner for determining a target phase measure for the audio signal in a time frame, a phase error calculator for calculating a phase error using a phase of the audio signal in the time frame and the target phase measure, and a phase corrector configured for correcting the phase of the audio signal in the time frame using the phase error.

Global frequency-warping transformation estimation for voice timbre approximation
10706867 · 2020-07-07

A method and system for converting a source voice to a target voice is disclosed. The method comprises: recording source voice data and target voice data; extracting spectral envelope features from the source voice data and target voice data; time-aligning pairs of frames based on the extracted spectral envelope features; converting each pair of frames into a frequency domain; generating a plurality of frequency-warping factor candidates, wherein each of the plurality of frequency-warping factor candidates is associated with one of the pairs of frames; generating a single global frequency-warping factor based on the candidates; acquiring source speech; converting the source speech to target speech based on the global frequency-warping factor; generating a waveform comprising the target speech; and playing the waveform comprising the target speech to a user.
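
The candidate-then-global step in the abstract can be sketched as follows. The example assumes a linear warping of the frequency axis, a grid search over candidate factors, and the median as the way to reduce the candidates to one global factor; none of these choices is stated in the abstract, and the names `warp`, `best_factor`, and `global_factor` are hypothetical.

```python
from statistics import median

def warp(spectrum, alpha):
    """Linearly warp a magnitude spectrum: output bin k reads input
    position k / alpha, using linear interpolation between bins."""
    n = len(spectrum)
    out = []
    for k in range(n):
        pos = min(k / alpha, n - 1)
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        out.append((1 - frac) * spectrum[lo] + frac * spectrum[hi])
    return out

def best_factor(source, target, grid):
    """Candidate factor for one aligned frame pair: the grid value whose
    warped source spectrum is closest to the target in squared error."""
    def error(alpha):
        return sum((w - t) ** 2 for w, t in zip(warp(source, alpha), target))
    return min(grid, key=error)

def global_factor(pairs, grid):
    """Reduce the per-pair candidates to one factor (median is an assumption)."""
    return median(best_factor(src, tgt, grid) for src, tgt in pairs)
```

The median is used here because it is robust to a few badly aligned frame pairs; a mean or a histogram peak would be equally plausible readings of "based on the candidates".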

Calculator and method for determining phase correction data for an audio signal

A calculator for determining phase correction data for an audio signal includes a variation determiner for determining a variation of a phase of the audio signal in a first and a second variation mode, a variation comparator for comparing a first variation determined using the first variation mode and a second variation determined using the second variation mode, and a correction data calculator for calculating the phase correction data in accordance with the first variation mode or the second variation mode based on a result of the comparing.
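
The compare-and-select logic above can be illustrated with a short sketch. It assumes that the two variation modes are variation of the phase difference along time and along frequency, and that variation is measured as circular variance; the abstract names neither, so both are assumptions, as are the names `circular_variance` and `choose_mode`.

```python
import cmath

def circular_variance(angles):
    """1 - |mean unit phasor|: near 0 when the angles agree, up to 1 when
    they are spread around the circle. Wrapping is handled by exp()."""
    mean = sum(cmath.exp(1j * a) for a in angles) / len(angles)
    return 1.0 - abs(mean)

def choose_mode(phases):
    """phases[t][k] is the phase of frame t in band k.

    Returns (mode, variation_time, variation_frequency), where mode is the
    direction with the smaller variation.
    """
    frames, bands = len(phases), len(phases[0])
    # First variation mode (assumed): phase differences between frames.
    dt = [phases[t][k] - phases[t - 1][k]
          for t in range(1, frames) for k in range(bands)]
    # Second variation mode (assumed): phase differences between bands.
    df = [phases[t][k] - phases[t][k - 1]
          for t in range(frames) for k in range(1, bands)]
    var_t, var_f = circular_variance(dt), circular_variance(df)
    return ("time" if var_t <= var_f else "frequency"), var_t, var_f
```

A signal whose phase advances by the same amount from frame to frame in every band yields near-zero variation along time, so the sketch selects the time mode for it.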

AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM AND PROGRAM PRODUCT
20240038251 · 2024-02-01

An audio data processing method is provided. The method includes: obtaining human voice audio data to be adjusted and reference human voice audio data; performing framing on the human voice audio data to be adjusted and the reference human voice audio data respectively so as to obtain a first audio frame set and a second audio frame set respectively; recognizing a pronunciation unit corresponding to each audio frame respectively; determining, based on a timestamp of each audio frame, a timestamp of each pronunciation unit in the human voice audio data to be adjusted and the reference human voice audio data respectively; and adjusting the timestamp of at least one pronunciation unit to make the timestamp of the pronunciation unit in the human voice audio data to be adjusted to be consistent with the timestamp of the corresponding pronunciation unit in the reference human voice audio data.
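
The framing, labelling, and timestamp-adjustment steps above can be sketched as follows. The example assumes that each frame already carries a recognized pronunciation-unit label, that both recordings contain the same unit sequence, and that timestamps are integers in milliseconds; the names `unit_spans` and `alignment_plan` are hypothetical, and the actual time-stretching of the audio is left out.

```python
from itertools import groupby

def unit_spans(frame_units, frame_times, frame_len):
    """Collapse per-frame unit labels into (unit, start, end) spans,
    using each frame's start timestamp and a fixed frame length."""
    spans = []
    index = 0
    for unit, group in groupby(frame_units):
        count = len(list(group))
        start = frame_times[index]
        end = frame_times[index + count - 1] + frame_len
        spans.append((unit, start, end))
        index += count
    return spans

def alignment_plan(to_adjust, reference):
    """For each unit, return (unit, new_start, new_end, stretch_ratio) so
    that its timestamps match the corresponding reference unit."""
    plan = []
    for (unit, start, end), (ref_unit, ref_start, ref_end) in zip(to_adjust, reference):
        if unit != ref_unit:
            raise ValueError("unit sequences differ: %r vs %r" % (unit, ref_unit))
        ratio = (ref_end - ref_start) / (end - start)
        plan.append((unit, ref_start, ref_end, ratio))
    return plan
```

A stretch ratio above 1 means the unit must be lengthened to match the reference, and a ratio below 1 means it must be shortened.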