G10H2210/051

Auto-generated accompaniment from singing a melody

A method for processing a voice signal by an electronic system to create a song is disclosed. The method comprises the steps in the electronic system of acquiring an input singing voice recording (11); estimating a musical key (15b) and a Tempo (15a) from the singing voice recording (11); defining a tuning control (16) and a timing control (17) able to align the singing voice recording (11) with the estimated musical key (15b) and Tempo (15a); applying the tuning control (16) and the timing control (17) to the singing voice recording (11) so that an aligned voice recording (20) is obtained. Next, the method comprises the step of generating an music accompaniment (23) as function of the estimated musical key (15b) and Tempo (15a) and an arrangement database (22) and mixing the aligned voice recording (20) and the music accompaniment (23) to obtain the song (12). A system a server and a device are also disclosed.

EVALUATING PERCUSSIVE PERFORMANCES
20230401975 · 2023-12-14 ·

Measures (for example, methods, systems and computer programs) are provided to evaluate a percussive performance. Percussive performance data captured by one or more sensors is received. The percussive performance data represents one or more impact waveforms of one or more hits on a performance surface. The one or more impact waveforms are analysed. The analysing comprises: (i) identifying one or more characteristics of the one or more impact waveforms; (ii) classifying the one or more hits as one or more percussive hit-types based on the one or more characteristics; and (iii) evaluating the one or more percussive hit-types against performance target data. Performance evaluation data is output based on said evaluating.

Systems and methods for generating a graphical representation of audio-file playback during playback manipulation

Systems and methods for generating a graphical representation of audio-file playback during playback manipulation are provided. The system may include a processor that performs a method including displaying a waveform during audio-file playback by scrolling the waveform from a right to a left portion of a display. The method includes receiving a command to manipulate the audio-file playback and displaying a first half of the waveform corresponding to the manipulated audio-file playback until a command is received to resume the audio-file playback. The first half of the waveform is a portion of the waveform adjacent to a horizontal or vertical axis. The method includes simultaneously displaying a second half of the waveform and displaying the first half of the waveform corresponding to the manipulated audio-file playback. The second half of the waveform is on an opposite side of the axis from the first half of the waveform.

BEAT DECOMPOSITION TO FACILITATE AUTOMATIC VIDEO EDITING
20210151018 · 2021-05-20 ·

The disclosed technology relates to a process for detecting musical artifacts within a musical composition. The detection of musical artifacts is based on analyzing the energy and frequency of the digital signal of the musical composition. The identification of musical artifacts within a musical composition would be used in connection with audio-video editing.

Systems and methods for generating a visual color display of audio-file data

Systems and methods for generating a visual color display of audio-file data are provided. The system includes a processor that performs a method including receiving audio-file data; generating filtered-audio data by processing the audio-file data by frequency-band filters. The frequency band filters have different frequency bands. The method includes generating one or more waveforms corresponding to the filtered-audio data and displaying the waveforms superimposed in unique color relative to one another. The method includes downsampling the waveforms. The method includes processing the waveforms through an envelope detector. The method includes processing the waveforms through an expander and applying a gain factor. The waveforms have transparency levels at sections that are proportional or inversely proportional to amplitudes at the sections.

Media content identification on mobile devices

A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

SYSTEMS AND METHODS FOR GENERATING A VISUAL COLOR DISPLAY OF AUDIO-FILE DATA

Systems and methods for generating a visual color display of audio-file data are provided. The system includes a processor that performs a method including receiving audio-file data; generating filtered-audio data by processing the audio-file data by frequency-band filters. The frequency band filters have different frequency bands. The method includes generating one or more waveforms corresponding to the filtered-audio data and displaying the waveforms superimposed in unique color relative to one another. The method includes downsampling the waveforms. The method includes processing the waveforms through an envelope detector. The method includes processing the waveforms through an expander and applying a gain factor. The waveforms have transparency levels at sections that are proportional or inversely proportional to amplitudes at the sections.

System for creating, practicing and sharing of musical harmonies

Collaboratively creating musical harmonies includes receiving a user selection of a particular harmony. In response to this selection, there is displayed on a display screen of a computing device a plurality of musical note indicators or notes to specify a first harmony part of a musical piece to be performed. Real-time pitch detection is used to determine a pitch of each note which is voiced by a person, and a graphic indication of the actual pitch which is sung is displayed in conjunction with the musical note indicators.

Beat decomposition to facilitate automatic video editing
10916229 · 2021-02-09 · ·

The disclosed technology relates to a process for detecting musical artifacts within a musical composition. The detection of musical artifacts is based on analyzing the energy and frequency of the digital signal of the musical composition. The identification of musical artifacts within a musical composition would be used in connection with audio-video editing.

COMPUTING ORDERS OF MODELED EXPECTATION ACROSS FEATURES OF MEDIA

A method implemented by a determination engine is provided. The determination engine receives a media dataset comprising target piece music information, target piece audience information, corpus music information, corpus audience information, and corpus preference data. The determination engine determines a subset of the corpus music and preference information and determines at least one surprise factor of the subset of the corpus music and preference information across features at one of a plurality of orders. The determination engine learns a model that estimates a likelihood that time-varying surprise trends across the features achieves a preference level. The determination engine determines at least one surprise factor of the target piece music information across the features at the one of the plurality of orders and predicts, using the model, preference information using the time-varying surprise trends for the target piece music information across the features.