G10H7/10

LEARNING SINGING FROM SPEECH
20220343904 · 2022-10-27 · ·

A method, computer program, and computer system is provided for converting a singing voice of a first person associated with a first speaker to a singing voice of a second person using a speaking voice of the second person associated with a second speaker. A context associated with one or more phonemes corresponding to the singing voice of a first person is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes, the target acoustic frames, and a sample of the speaking voice of the second person. A sample corresponding to the singing voice of a first person is converted to a sample corresponding to the second singing voice using the generated mel-spectrogram features.

Timbre creation system

A timbre creation method, system, and computer program product include performing a timbre analysis of a sound from an input source to generate a digital fingerprint of the sound, performing deep learning to create a patch that matches the digital fingerprint, and generating a second patch for a synthesizer which reproduces a timbre that complements the digital fingerprint based on the patch.

LEARNING SINGING FROM SPEECH
20210248997 · 2021-08-12 · ·

A method, computer program, and computer system is provided for converting a singing voice of a first person associated with a first speaker to a singing voice of a second person using a speaking voice of the second person associated with a second speaker. A context associated with one or more phonemes corresponding to the singing voice of a first person is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes, the target acoustic frames, and a sample of the speaking voice of the second person. A sample corresponding to the singing voice of a first person is converted to a sample corresponding to the second singing voice using the generated mel-spectrogram features.

SINGING VOICE CONVERSION
20210256958 · 2021-08-19 · ·

A method, computer program, and computer system is provided for converting a singing first singing voice associated with a first speaker to a second singing voice associated with a second speaker. A context associated with one or more phonemes corresponding to the first singing voice is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes and target acoustic frames, and a sample corresponding to the first singing voice is converted to a sample corresponding to the second singing voice using the generated mel-spectrogram features.

TIMBRE CREATION SYSTEM

A timbre creation method, system, and computer program product include performing a timbre analysis of a sound from an input source to generate a digital fingerprint of the sound, performing deep learning to create a patch that matches the digital fingerprint, and generating a second patch for a synthesizer which reproduces a timbre that complements the digital fingerprint based on the patch.

Analog recall synthesizer having patch and knob recall

A sound generating analog synthesizer that is comprised of potentiometers, a switch or switches and a set of patch jacks has a control system that can be operated in three modes, a manual mode, an automatic mode, and a guided mode; wherein manual mode allows potentiometer and switch positions as well as patch cable connections to be set by hand; wherein automatic mode, automatically sets patch connections as on or off, as well as set potentiometer positions and switch states with electromechanical or electrical devices; and wherein the guided mode provides at least one visual information on how to change the potentiometer positions, switch states, and patch jack connections such that a previously obtained sound can be reproduced.

Analog recall synthesizer having patch and knob recall

A sound generating analog synthesizer that is comprised of potentiometers, a switch or switches and a set of patch jacks has a control system that can be operated in three modes, a manual mode, an automatic mode, and a guided mode; wherein manual mode allows potentiometer and switch positions as well as patch cable connections to be set by hand; wherein automatic mode, automatically sets patch connections as on or off, as well as set potentiometer positions and switch states with electromechanical or electrical devices; and wherein the guided mode provides at least one visual information on how to change the potentiometer positions, switch states, and patch jack connections such that a previously obtained sound can be reproduced.

Smart voice enhancement architecture for tempo tracking among music, speech, and noise
10762887 · 2020-09-01 · ·

Audio data describing an audio signal may be received and used to determine a set of frames of the audio signal. A plurality of note onsets in the set of frames may be identified based on spectral energy of the audio signal in the set of frames. One or more tempos may be computed based on the identified plurality of note onsets. The one or more tempos may be validated based on a tempo validation condition. One or more music states of the audio signal may be determined based on the validated one or more tempos. Audio enhancement of the audio signal may be modified based on the one or more determined states of the audio signal.

ANALOG RECALL SYNTHESIZER HAVING PATCH AND KNOB RECALL

A sound generating analog synthesizer that is comprised of potentiometers, a switch or switches and a set of patch jacks has a control system that can be operated in three modes, a manual mode, an automatic mode, and a guided mode; wherein manual mode allows potentiometer and switch positions as well as patch cable connections to be set by hand; wherein automatic mode, automatically sets patch connections as on or off, as well as set potentiometer positions and switch states with electromechanical or electrical devices; and wherein the guided mode provides at least one visual information on how to change the potentiometer positions, switch states, and patch jack connections such that a previously obtained sound can be reproduced.

ANALOG RECALL SYNTHESIZER HAVING PATCH AND KNOB RECALL

A sound generating analog synthesizer that is comprised of potentiometers, a switch or switches and a set of patch jacks has a control system that can be operated in three modes, a manual mode, an automatic mode, and a guided mode; wherein manual mode allows potentiometer and switch positions as well as patch cable connections to be set by hand; wherein automatic mode, automatically sets patch connections as on or off, as well as set potentiometer positions and switch states with electromechanical or electrical devices; and wherein the guided mode provides at least one visual information on how to change the potentiometer positions, switch states, and patch jack connections such that a previously obtained sound can be reproduced.