G10H2250/235

DEVICE, METHOD, AND MEDIUM FOR INTEGRATING AUDITORY BEAT STIMULATION INTO MUSIC
20220262332 · 2022-08-18 ·

A device, method, and medium for integrating monaural and binaural beats into music are provided. The music is analyzed to determine key, root tone, and spectral range. It is then remixed with monaural beats and/or binaural beats at frequencies based on a desired entrainment frequency and the root tone and lowest dominant frequency range of the music. Additional harmonics of the beats in higher octaves may be integrated into the music as well using mixing and/or equalization.

MUSICAL PIECE ANALYSIS DEVICE, PROGRAM, AND MUSICAL PIECE ANALYSIS METHOD
20220262331 · 2022-08-18 ·

The following abstract will replace all prior versions of the abstract in the A music piece analyzer includes: a key candidate determiner configured to analyze music data to determine a plurality of key candidates; and a key selector configured to extract each one of the plurality of key candidates, detect keys corresponding to related keys from among remaining ones of the plurality of key candidates supposing that the extracted one of the plurality of key candidates is a main key, on each extracted one of the plurality of key candidates, calculate a related key score in accordance with the number of the keys corresponding to the related keys, and select a key of a music piece in accordance with the related key score from among the plurality of key candidates.

Media content identification on mobile devices

A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

Complex linear projection for acoustic modeling

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using complex linear projection are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The method further includes generating frequency domain data using the audio data. The method further includes processing the frequency domain data using complex linear projection. The method further includes providing the processed frequency domain data to a neural network trained as an acoustic model. The method further includes generating a transcription for the utterance that is determined based at least on output that the neural network provides in response to receiving the processed frequency domain data.

Voice synthesis method, voice synthesis device, and storage medium

A voice synthesis method according to an embodiment includes altering a series of synthesis spectra in a partial period of a synthesis voice based on a series of amplitude spectrum envelope contours of a voice expression to obtain a series of altered spectra to which the voice expression has been imparted, and synthesizing a series of voice samples to which the voice expression has been imparted, based on the series of altered spectra.

AUDIO SIGNAL ANALYSIS METHOD, AUDIO SIGNAL ANALYSIS SYSTEM AND NON-TRANSITORY COMPUTER-READABLE MEDIUM
20220215820 · 2022-07-07 ·

An audio signal analysis system comprises an electronic controller that is configured to execute a plurality of modules including an acquisition module configured to acquire a first spectrum, which is a time average of a plurality of frequency spectra of an audio signal, a specification module configured to acquire a plurality of reference values corresponding to different pitches that follow a prescribed temperament and configured to specify, by a problem-solving search algorithm, a frequency difference corresponding to a second spectrum which includes a plurality of components each having a frequency difference with respect to each of the plurality of reference values, the second spectrum being similar to the first spectrum with a degree of similarity exceeding a prescribed threshold value, and a correction module configured to correct the frequency difference so as to reduce systematic error included in the frequency difference specified by the specification module.

SPIRAL CURVE TYPE MUSIC SHEET, APPARATUS AND METHOD FOR PROVIDING SPIRAL CURVE TYPE MUSIC SHEET

Disclosed are a spiral curve type music sheet in which different notes are displayed at different positions on a spiral curve based on the pitches of notes, and an apparatus and method for providing a spiral curve type music sheet. The apparatus for providing a spiral curve type music sheet may include a memory configured to store a spiral curve type music sheet in which different notes are displayed at different positions on a spiral curve based on the pitch of the note and note data, and a processor configured to determine the note symbol position related to the note data on the spiral curve in the spiral curve type music sheet based on the frequency of the note data.

Teaching vocal harmonies

Method of teaching a vocal harmony involves a computing device which automatically generates a plurality of audio presentations of a musical composition in a predetermined series. Each audio presentation in the series is different from the other audio presentations in the series and is configured to assist the user in progressively learning the selected vocal harmony part. Each of the plurality of audio presentations in the predetermined series is made different from others of the audio presentation in the predetermined series by selectively controlling (1) the particular ones of the plurality of vocal harmony parts that are included in each of the audio presentations, and/or (2) a magnitude of an audio volume that is applied to each of the plurality of vocal harmony parts that is included in each of the audio presentations.

Systems and methods for capturing and interpreting audio

A device is provided for capturing vibrations produced by an object such as a musical instrument such as a cymbal of a drum kit. The device comprises a detectable element, such as a ferromagnetic element, such as a metal shim and a sensor spaced apart from and located relative to the musical instrument. The detectable element is located between the sensor and the musical instrument. When the musical instrument vibrates, the sensor remains stationary and the detectable element is vibrated relative to the sensor by the musical instrument.

Timbre creation system

A timbre creation method, system, and computer program product include performing a timbre analysis of a sound from an input source to generate a digital fingerprint of the sound, performing deep learning to create a patch that matches the digital fingerprint, and generating a second patch for a synthesizer which reproduces a timbre that complements the digital fingerprint based on the patch.