G10H2210/325

Crowd-sourced technique for pitch track generation

Digital signal processing and machine learning techniques can be employed in a vocal capture and performance social network to computationally generate vocal pitch tracks from a collection of vocal performances captured against a common temporal baseline such as a backing track or an original performance by a popularizing artist. In this way, crowd-sourced pitch tracks may be generated and distributed for use in subsequent karaoke-style vocal audio captures or other applications. Large numbers of performances of a song can be used to generate a pitch track. Computationally determined pitch trackings from individual audio signal encodings of the crowd-sourced vocal performance set are aggregated and processed as an observation sequence of a trained Hidden Markov Model (HMM) or other statistical model to produce an output pitch track.
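The aggregation step can be illustrated with a small Viterbi decoder. This is only a sketch under stated assumptions: the function name `decode_pitch_track`, the Laplacian emission score, and the hand-set `scale` and `switch_penalty` parameters are hypothetical stand-ins for the trained HMM the abstract mentions, and pitch estimates are assumed to be given in MIDI note units with `None` marking an unvoiced frame.

```python
from typing import List, Optional, Sequence


def decode_pitch_track(
    frames: Sequence[Sequence[Optional[float]]],
    notes: Sequence[int],
    scale: float = 0.5,
    switch_penalty: float = 2.0,
) -> List[int]:
    """Viterbi-decode one note per frame from crowd-sourced pitch estimates.

    frames[t] holds the pitch estimates of all performances at frame t.
    The parameters are hand-set assumptions, not trained values.
    """
    if not frames:
        return []

    def emission(frame: Sequence[Optional[float]], note: int) -> float:
        # Laplacian log-likelihood (up to a constant), summed over performances.
        # Absolute deviation keeps a single off-key singer from dominating.
        return sum(-abs(p - note) / scale for p in frame if p is not None)

    def step_cost(i: int, j: int) -> float:
        # Staying on the same note is free; changing notes costs a fixed penalty.
        return 0.0 if i == j else switch_penalty

    n = len(notes)
    score = [emission(frames[0], note) for note in notes]
    back: List[List[int]] = []
    for frame in frames[1:]:
        new_score, pointers = [], []
        for j, note in enumerate(notes):
            best_i = max(range(n), key=lambda i: score[i] - step_cost(i, j))
            new_score.append(score[best_i] - step_cost(best_i, j) + emission(frame, note))
            pointers.append(best_i)
        score = new_score
        back.append(pointers)

    # Trace the best final state back to the first frame.
    state = max(range(n), key=lambda i: score[i])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return [notes[i] for i in path]
```

With three performances in which one singer is briefly four semitones sharp, the decoded track follows the majority rather than the outlier, which is the practical benefit of pooling many performances.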

TRANSITION FUNCTIONS OF DECOMPOSED SIGNALS

A device including: first and second input units providing first and second input signals of first and second audio tracks, a decomposition unit to decompose the first input audio signal to obtain decomposed signals, a playback unit to start playback of a first output signal obtained from recombining at least first and second decomposed signals at first and second volume levels, respectively, and a transition unit for performing a transition between playback of the first output signal and playback of a second output signal obtained from the second input signal. The transition unit is adapted for reducing the first and second volume levels according to first and second transition functions, respectively. The device includes an analyzing unit to analyze an audio signal to determine a song part junction between two song parts. The transition time interval of at least one of the transition functions is set so as to include the song part junction.
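A minimal sketch of per-stem transition functions follows. It assumes the decomposed signals are already available as equal-length sample lists; the names `transition`, `linear_fade_out`, and `early_cut`, and the linear fade-in of the incoming track, are illustrative assumptions and are not taken from the abstract.

```python
from typing import Callable, List, Sequence


def linear_fade_out(t: float) -> float:
    """Volume falls steadily from 1 to 0 over the transition interval."""
    return 1.0 - t


def early_cut(t: float) -> float:
    """Volume stays at 1 for the first half of the interval, then drops to 0."""
    return 1.0 if t < 0.5 else 0.0


def transition(
    stem_a: Sequence[float],
    stem_b: Sequence[float],
    incoming: Sequence[float],
    fade_a: Callable[[float], float],
    fade_b: Callable[[float], float],
) -> List[float]:
    """Mix two stems of the outgoing track, each with its own transition
    function, while the incoming track fades in linearly (an assumption)."""
    n = len(incoming)
    if not (len(stem_a) == len(stem_b) == n):
        raise ValueError("all signals must have the same length")
    out = []
    for i in range(n):
        t = i / (n - 1) if n > 1 else 1.0  # normalized position in the interval
        outgoing = fade_a(t) * stem_a[i] + fade_b(t) * stem_b[i]
        out.append(outgoing + t * incoming[i])
    return out
```

At the start of the interval both volume levels are 1, so the recombined stems reproduce the outgoing track; a step-like function such as `early_cut` could be placed so that its drop coincides with a detected song part junction.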

METHOD, DEVICE AND SOFTWARE FOR APPLYING AN AUDIO EFFECT
20210390938 · 2021-12-16

The present invention provides a method for processing music audio data, comprising the steps of providing input audio data representing a first piece of music containing a mixture of predetermined musical timbres, decomposing the input audio data to generate at least a first audio track representing a first musical timbre selected from the predetermined musical timbres, and a second audio track representing a second musical timbre selected from the predetermined musical timbres, applying a predetermined first audio effect to the first audio track, applying no audio effect or a predetermined second audio effect, which is different from the first audio effect, to the second audio track, and obtaining recombined audio data by recombining the first audio track with the second audio track.
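The effect-and-recombine step might look like the sketch below. The decomposition itself, which would typically come from a trained source-separation model, is assumed to have already produced two equal-length tracks; `echo` and `recombine` are hypothetical helper names, and a single-tap echo stands in for the "predetermined first audio effect".

```python
from typing import List, Sequence


def echo(signal: Sequence[float], delay: int, gain: float) -> List[float]:
    """Add one delayed, attenuated copy of the signal to itself."""
    if delay < 1:
        raise ValueError("delay must be at least one sample")
    out = list(signal)
    for i in range(delay, len(signal)):
        out[i] += gain * signal[i - delay]
    return out


def recombine(*tracks: Sequence[float]) -> List[float]:
    """Sum equal-length tracks sample by sample."""
    if len({len(track) for track in tracks}) > 1:
        raise ValueError("all tracks must have the same length")
    return [sum(samples) for samples in zip(*tracks)]


# Effect on the first track only; the second track is left untouched.
vocals = [1.0, 0.0, 0.0, 0.0]
drums = [0.5, 0.5, 0.5, 0.5]
mixed = recombine(echo(vocals, delay=2, gain=0.5), drums)
```

Because the effect is applied before recombination, the echo is heard only on the first timbre while the second plays back unchanged.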

SOUND SIGNAL GENERATION METHOD, GENERATIVE MODEL TRAINING METHOD, SOUND SIGNAL GENERATION SYSTEM, AND RECORDING MEDIUM
20210383816 · 2021-12-09

A computer-implemented sound signal generation method includes: obtaining a first sound source spectrum of a sound signal to be generated; obtaining a first spectral envelope of the sound signal; and estimating fragment data representative of samples of the sound signal based on the obtained first sound source spectrum and the obtained first spectral envelope.

SOUND SIGNAL SYNTHESIS METHOD, GENERATIVE MODEL TRAINING METHOD, SOUND SIGNAL SYNTHESIS SYSTEM, AND RECORDING MEDIUM
20210375248 · 2021-12-02

A computer-implemented sound signal synthesis method includes: generating, based on first control data representative of a plurality of conditions of a sound signal to be generated, (i) first data representative of a sound source spectrum of the sound signal, and (ii) second data representative of a spectral envelope of the sound signal; and synthesizing the sound signal based on the sound source spectrum indicated by the first data and the spectral envelope indicated by the second data.
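One common way to combine a source spectrum with a spectral envelope is the classical source-filter approach: multiply them bin by bin and transform back to the time domain. The sketch below assumes that approach, which the abstract does not confirm; `synthesize_frame` is a hypothetical name, and both inputs are assumed to be one-sided spectra (bins 0 to N/2) of a real frame of even length N.

```python
import cmath
import math
from typing import List, Sequence


def synthesize_frame(
    source_spectrum: Sequence[complex], envelope: Sequence[float]
) -> List[float]:
    """Shape a source spectrum with an envelope, then inverse-DFT to samples."""
    if len(source_spectrum) != len(envelope):
        raise ValueError("spectrum and envelope must have the same number of bins")
    if len(source_spectrum) < 2:
        raise ValueError("need at least two bins")
    # Per-bin product: the envelope scales each component of the source.
    half = [complex(s) * e for s, e in zip(source_spectrum, envelope)]
    n = 2 * (len(half) - 1)
    # Rebuild the full conjugate-symmetric spectrum of a real signal.
    full = half + [half[k].conjugate() for k in range(len(half) - 2, 0, -1)]
    # Direct inverse DFT; O(n^2), which is fine for a short illustration.
    return [
        sum(full[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]
```

For example, a source with energy only in bin 1 and an envelope value of 0.5 at that bin yields a single cosine whose amplitude is halved relative to the unshaped source.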

Integrated Musical Instrument Systems
20220208160 · 2022-06-30

A system suitable for use as a musical instrument system is provided. The system includes at least one sensor. The system also includes at least one control surface configured to interface with the at least one sensor. Further, the system includes at least one controller configured to interface with the at least one sensor. Additionally, the system includes at least one program module configured to interface with the at least one sensor. The system includes an enclosure. The at least one sensor and the at least one control surface are positionable on the base. The system also includes at least one data processor configured to interface with the at least one sensor, the at least one control surface, and the at least one program module arranged to function as a musical instrument system.

METHOD, DEVICE AND SOFTWARE FOR CONTROLLING TRANSPORT OF AUDIO DATA
20220199056 · 2022-06-23

A method for processing music audio data, including providing input audio data representing a first piece of music comprising a mixture of musical timbres. The method also includes decomposing the input audio data to generate at least first-timbre decomposed data representing a first timbre selected from the musical timbres of the first piece of music, and second-timbre decomposed data representing a second timbre selected from the musical timbres of the first piece of music. The method also includes applying a transport control to obtain transport controlled first-timbre decomposed data. The method also includes recombining audio data obtained from the transport controlled first-timbre decomposed data with audio data obtained from the second-timbre decomposed data to obtain recombined audio data.

ARTIFICIAL NEURAL NETWORK
20220180208 · 2022-06-09

A computer-implemented method of training an artificial neural network (ANN) by generating one or more learned parameters for use during a subsequent inference phase of the trained ANN, comprises providing training data representing first and second input signals, the second input signal exhibiting one or more transformations relative to the first signal selected from a set of transformations; using the ANN and in response to the one or more parameters, generating a magnitude and phase representation of each of the first and second input signals; and training the one or more parameters, in dependence upon a constraint which causes the magnitude representation of the first input signal and the magnitude representation of the second input signal to tend to become more similar to one another, the training step comprising: detecting an error signal; and updating the one or more parameters in dependence upon the error signal.
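The similarity constraint on the magnitude representations could be expressed as a loss term like the one sketched here. This is an assumption for illustration only: the abstract does not specify the form of the constraint, `magnitude_consistency_loss` is a hypothetical name, and each representation is modeled as a list of complex values whose absolute value is the magnitude and whose argument is the phase.

```python
from typing import Sequence


def magnitude_consistency_loss(
    rep_a: Sequence[complex], rep_b: Sequence[complex]
) -> float:
    """Mean squared difference between the magnitudes of two representations.

    The phases are ignored, so two representations that differ only in
    phase incur zero loss.
    """
    if len(rep_a) != len(rep_b) or not rep_a:
        raise ValueError("representations must be non-empty and equal in length")
    return sum((abs(a) - abs(b)) ** 2 for a, b in zip(rep_a, rep_b)) / len(rep_a)
```

Minimizing such a term during training would push the magnitude representation toward being unchanged under the chosen transformations, leaving the phase representation free to carry the difference between the two input signals.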

Acoustic device and acoustic control program

An acoustic device includes an audio recording playback unit, an analysis unit, and an acoustic effect imparting unit. The audio recording playback unit records and plays back string independent acoustic signals, handling each string independent acoustic signal separately. The string independent acoustic signals respectively correspond to different strings of a stringed instrument and are independent of each other. The analysis unit analyzes at least one string independent acoustic signal from among the recorded string independent acoustic signals. The acoustic effect imparting unit imparts an acoustic effect to the at least one string independent acoustic signal, separately for each such signal, based on a result of the analysis by the analysis unit.

Transition functions of decomposed signals

A device for processing audio signals, including: first and second input units providing first and second input signals of first and second audio tracks, a decomposition unit to decompose the first input audio signal to obtain a plurality of decomposed signals, a playback unit configured to start playback of a first output signal obtained from recombining at least a first decomposed signal at a first volume level with a second decomposed signal at a second volume level, such that the first output signal substantially equals the first input signal, and a transition unit for performing a transition between playback of the first output signal and playback of a second output signal obtained from the second input signal. The transition unit has a volume control section adapted for reducing the first and second volume levels according to first and second transition functions.