Patent classifications
G10L21/01
AUDIO DATA PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, MEDIUM AND PROGRAM PRODUCT
An audio data processing method is provided. The method includes: obtaining human voice audio data to be adjusted and reference human voice audio data; performing framing on the human voice audio data to be adjusted and the reference human voice audio data respectively so as to obtain a first audio frame set and a second audio frame set respectively; recognizing a pronunciation unit corresponding to each audio frame respectively; determining, based on a timestamp of each audio frame, a timestamp of each pronunciation unit in the human voice audio data to be adjusted and the reference human voice audio data respectively; and adjusting the timestamp of at least one pronunciation unit to make the timestamp of the pronunciation unit in the human voice audio data to be adjusted to be consistent with the timestamp of the corresponding pronunciation unit in the reference human voice audio data.
MUSIC SYNTHESIS METHOD, SYSTEM, TERMINAL AND COMPUTER-READABLE STORAGE MEDIUM
A music synthesis method, a system, a terminal and a computer-readable storage medium are provided. The method includes: receiving a track selected by a user; obtaining a text; receiving speech data recorded by the user on the basis of the text; and forming a music file in accordance with the selected track and the speech data. The speech of a user can be combined with the track through the music synthesis method of the present application and an optimal effect of music can be simulated such that the user can participate in the singing and presentation of a music, thereby making music more entertaining.
MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION
A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.
MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION
A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.
Measuring and compensating for jitter on systems running latency-sensitive audio signal processing
A system and method receives one or more captured signals through a captured audio path and produces one or more playback signals through a playback audio path. The system and method executes one or more signal processing functions and measures the delays within the playback audio path and captured audio path during operation of the one or more signal processing functions. The system and method stores the measured delays in a memory and compensates the one or more signal processing functions for the playback delay and the capture delay.
AUDIO PROCESSOR AND METHOD FOR PROCESSING AN AUDIO SIGNAL USING HORIZONTAL PHASE CORRECTION
An audio processor for processing an audio signal includes an audio signal phase measure calculator configured for calculating a phase measure of an audio signal for a time frame, a target phase measure determiner for determining a target phase measure for the time frame, and a phase corrector configured for correcting phases of the audio signal for the time frame using the calculated phase measure and the target phase measure to obtain a processed audio signal.
Audio processor and method for processing an audio signal using vertical phase correction
An audio processor for processing an audio signal includes a target phase measure determiner for determining a target phase measure for the audio signal in a time frame, a phase error calculator for calculating a phase error using a phase of the audio signal in the time frame and the target phase measure, and a phase corrector configured for correcting the phase of the audio signal in the time frame using the phase error.
Audio processor and method for processing an audio signal using vertical phase correction
An audio processor for processing an audio signal includes a target phase measure determiner for determining a target phase measure for the audio signal in a time frame, a phase error calculator for calculating a phase error using a phase of the audio signal in the time frame and the target phase measure, and a phase corrector configured for correcting the phase of the audio signal in the time frame using the phase error.
Electronic apparatus and method for controlling the electronic apparatus
An electronic apparatus is disclosed. The electronic apparatus includes an input unit configured to receive a user input, a storage configured to store a recognition model for recognizing the user input, a sensor configured to sense a surrounding circumstance of the electronic apparatus, and a processor configured to control to recognize the received user input based on the stored recognition model and to perform an operation corresponding to the recognized user input, and update the stored recognition model in response to determining that the performed operation is caused by a misrecognition based on a user input recognized after performing the operation and the sensed surrounding circumstance.
AUDIO PROCESSOR AND METHOD FOR PROCESSING AN AUDIO SIGNAL USING VERTICAL PHASE CORRECTION
An audio processor for processing an audio signal includes a target phase measure determiner for determining a target phase measure for the audio signal in a time frame, a phase error calculator for calculating a phase error using a phase of the audio signal in the time frame and the target phase measure, and a phase corrector configured for correcting the phase of the audio signal in the time frame using the phase error.