G10L21/043

Systems and methods for generating a graphical representation of audio signal data during time compression or expansion

Systems and methods for generating a graphical representation of audio signal data during time compression or expansion are provided. The system may include a processor that performs a method including displaying a waveform during audio-signal playback at a first speed by scrolling the waveform from aright portion of a display to a left portion of the display. The method includes receiving a command to increase or decrease the audio-signal playback speed and horizontally expanding or horizontally contracting the waveform in response to receiving the command to increase or decrease the audio-signal playback speed.

Playback sound provision device

A playback sound provision device includes: a surrounding information detection device configured to detect detection information including information on a three-dimensional object or a planar display around the vehicle; and a control device configured to determine a playback method for a playback sound based on a music piece based on the detection information when a predetermined target is included in the detection information, and provide the playback sound based on the playback method.

Playback sound provision device

A playback sound provision device includes: a surrounding information detection device configured to detect detection information including information on a three-dimensional object or a planar display around the vehicle; and a control device configured to determine a playback method for a playback sound based on a music piece based on the detection information when a predetermined target is included in the detection information, and provide the playback sound based on the playback method.

AUDIO DATA PROCESSING METHOD, APPARATUS AND DEVICE, AND STORAGE MEDIUM
20220020389 · 2022-01-20 · ·

Provided are an audio data processing method and apparatus, a device and a storage medium. The method includes: acquiring audio data to be processed and a variable-speed rate of at least one audio frame in the audio data; sequentially using the at least one audio frame as a current audio frame to be processed, and converting the current audio frame to a frequency domain; determining a target phase signal of the current audio frame according to a variable-speed rate of the current audio frame and a variable-speed rate of a previous audio frame; and performing, according to the target phase signal, time domain conversion on the current audio frame converted to the frequency domain to obtain a processed current audio frame.

AUDIO DATA PROCESSING METHOD, APPARATUS AND DEVICE, AND STORAGE MEDIUM
20220020389 · 2022-01-20 · ·

Provided are an audio data processing method and apparatus, a device and a storage medium. The method includes: acquiring audio data to be processed and a variable-speed rate of at least one audio frame in the audio data; sequentially using the at least one audio frame as a current audio frame to be processed, and converting the current audio frame to a frequency domain; determining a target phase signal of the current audio frame according to a variable-speed rate of the current audio frame and a variable-speed rate of a previous audio frame; and performing, according to the target phase signal, time domain conversion on the current audio frame converted to the frequency domain to obtain a processed current audio frame.

Method and device for processing, playing and/or visualizing audio data, preferably based on AI, in particular decomposing and recombining of audio data in real-time

The present invention relates to a method for processing and playing audio data comprising the steps of receiving mixed input data and playing recombined output data. Furthermore, the invention relates to a device for processing and playing audio data, preferably DJ equipment, comprising an audio input unit for receiving a mixed input signal, a recombination unit and a playing unit for playing recombined output data. In addition, the present invention relates to a method and a device for representing audio data, i.e. on a display.

TIME-DOMAIN GAIN MODELING IN THE QMF DOMAIN
20250232783 · 2025-07-17 · ·

A method of processing audio is provided. The method includes determining modulated filter bank, MFB, domain broad band gains for fading an audio signal in accordance with a time domain target gain, so that application of the broad band gains in the MFB domain emulates application of the target gain in the time domain. Determining the broad band gains includes computing the broad band gains using the target gain, an MFB analysis prototype filter, and an MFB synthesis prototype filter. Also provided are corresponding apparatus, programs, and computer-readable storage media.

TIME-DOMAIN GAIN MODELING IN THE QMF DOMAIN
20250232783 · 2025-07-17 · ·

A method of processing audio is provided. The method includes determining modulated filter bank, MFB, domain broad band gains for fading an audio signal in accordance with a time domain target gain, so that application of the broad band gains in the MFB domain emulates application of the target gain in the time domain. Determining the broad band gains includes computing the broad band gains using the target gain, an MFB analysis prototype filter, and an MFB synthesis prototype filter. Also provided are corresponding apparatus, programs, and computer-readable storage media.

Method for changing speed and pitch of speech and speech synthesis system
11776528 · 2023-10-03 · ·

This application relates to a method of synthesizing a speech of which a speed and a pitch are changed. In one aspect, the method includes a spectrogram may be generated by performing a short-time Fourier transformation on a first speech signal based on a first hop length and a first window length, and speech signals of sections having a second window length at the interval of a second hop length from the spectrogram. A ratio between the first hop length and the second hop length may be set to be equal to the value of a playback rate and a ratio between the first window length and the second window length may be set to be equal to the value of a pitch change rate, thereby generating a second speech signal of which the speed and the pitch are changed.

Method for changing speed and pitch of speech and speech synthesis system
11776528 · 2023-10-03 · ·

This application relates to a method of synthesizing a speech of which a speed and a pitch are changed. In one aspect, the method includes a spectrogram may be generated by performing a short-time Fourier transformation on a first speech signal based on a first hop length and a first window length, and speech signals of sections having a second window length at the interval of a second hop length from the spectrogram. A ratio between the first hop length and the second hop length may be set to be equal to the value of a playback rate and a ratio between the first window length and the second window length may be set to be equal to the value of a pitch change rate, thereby generating a second speech signal of which the speed and the pitch are changed.