G10H2220/011

Densification in Music Search and Recommendation

Disclosed herein are computer-implemented method, system, and computer-readable storage-medium embodiments for implementing densification in music search. An embodiment includes processor(s) configured to obtain a first feature set extracted from a first audio recording, and a first fingerprint of the first audio recording; and evaluate, using at least one first machine-learning algorithm, a similarity index corresponding to the first audio recording with respect to at least one second audio recording, considering: the first feature set extracted from the first audio recording, and a second feature set extracted from the at least one second audio recording; or the first fingerprint of the first audio recording, and at least one second fingerprint of the at least one second audio recording. Further embodiments include defining arrangement group(s) including the first audio recording and the at least one second audio recording with similarity index within a predetermined range, outputting densified response(s) to a search query.

Systems and methods for generating audible versions of text sentences from audio snippets

A method is performed at a server system of a media-providing service. The server system has one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a text sentence including a plurality of words from a device of a first user and extracting a plurality of audio snippets from one or more audio tracks. A respective audio snippet in the plurality of audio snippets corresponds to one or more words in the plurality of words of the text sentence. The method also includes assembling the plurality of audio snippets in a first order to produce an audible version of the text sentence. The method further includes providing, for playback at the device of the first user, the audible version of the text sentence including the plurality of audio snippets in the first order.

Method and apparatus for generating music

A terminal for generating music may identify, based on execution of scenario recognition, scenarios for images previously received by the terminal. The terminal may generate respective description texts for the scenarios. The terminal may execute keyword-based rhyme matching based on the respective description texts. The terminal may generate respective rhyming lyrics corresponding to the images. The terminal may convert the respective rhyming lyrics corresponding to the images into a speech. The terminal may synthesize the speech with preset background music to obtain image music.

Music notation using a disproportionate correlated scale
11289057 · 2022-03-29 ·

Methods and systems of music notation for visually representing music that provide a visual scale representing a range of an auditory scale of a portion of a musical composition spanning at least four and a half steps. The visual scale may comprise a plurality of whole-step segments each representing one whole step in the auditory scale. Each whole-step segment may be approximately a first height. The visual scale may also comprise one or more half-step segments each representing one half step in the auditory scale. Each half-step segment may be approximately a second height. A first ratio representing the first height divided by the second height may be significantly greater than a second ratio representing the whole step divided by the half step.

Method and device for processing multimedia information, electronic equipment and computer-readable storage medium

The present application discloses a method and device for processing multimedia information, an electronic equipment, and a computer-readable storage medium. The method for processing multimedia information includes: detecting whether multimedia configuration parameters have changed during a process of recording multimedia information; and recording the multimedia information based on the changed multimedia configuration parameters when detecting that the multimedia configuration parameters have changed. According to the embodiments of the present application, multimedia configuration parameters of the special effects such as stickers, make-up, filters, and mixing can be added during the recording of multimedia information, which improves the user experience.

ELECTRONIC MUSICAL INSTRUMENT, METHOD, AND STORAGE MEDIUM

An electronic musical instrument includes: a plurality of keys that include at least first keys corresponding to a first pitch range and second keys corresponding to a second pitch range; and at least one processor, configured to perform the following: in accordance with a key operation in the first pitch range, determining a syllable position contained in a phrase; and in accordance with a key operation in the second pitch rang, instructing a sound production of a digitally synthesized sound corresponding to the determined syllable position.

ELECTRONIC MUSICAL INSTRUMENT, METHOD, AND STORAGE MEDIUM

An electronic musical instrument includes: a plurality of keys that include at least first keys corresponding to a first pitch range and second keys corresponding to a second pitch range; and at least one processor, configured to perform the following: causing a syllable position in a phrase that is digitally synthesized for output to be not advanced no matter how the second keys in the second pitch range are operated while a key operation in the first pitch range is being continuously maintained; and causing the syllable position to advance every time a key operation in the second pitch rang is performed while none of the first keys in the first pitch range are being operated.

METHODS AND SYSTEMS FOR INTERACTIVE LYRIC GENERATION
20210335334 · 2021-10-28 ·

Various embodiments of an apparatus, methods, systems and computer program products described herein are directed to a Lyric Engine. In various embodiments, the Lyric Engine receives, at a user interface, a selection of at least one song criteria. The Lyric Engine receives a first set of suggested song lyrics that correspond to the selected song criteria. The Lyric Engine presents, in the user interface, the first set of suggested song lyrics. The Lyric Engine receives, at the user interface, a selection of one or more of the suggested song lyrics in the first set. The Lyric Engine receives a second set of suggested song lyrics that correspond to the selected song criteria and the selected song lyrics. The Lyric Engine concurrently presents, in the user interface, the selected song lyrics and the second set of suggested song lyrics.

Automated generation of coordinated audiovisual work based on content captured geographically distributed performers

Vocal audio of a user together with performance synchronized video is captured and coordinated with audiovisual contributions of other users to form composite duet-style or glee club-style or window-paned music video-style audiovisual performances. In some cases, the vocal performances of individual users are captured (together with performance synchronized video) on mobile devices, television-type display and/or set-top box equipment in the context of karaoke-style presentations of lyrics in correspondence with audible renderings of a backing track. Contributions of multiple vocalists are coordinated and mixed in a manner that selects for presentation, at any given time along a given performance timeline, performance synchronized video of one or more of the contributors. Selections are in accord with a visual progression that codes a sequence of visual layouts in correspondence with other coded aspects of a performance score such as pitch tracks, backing audio, lyrics, sections and/or vocal parts.

METHOD AND SYSTEM FOR INTERACTIVE SONG GENERATION
20210312897 · 2021-10-07 ·

A method and system may provide for interactive song generation. In one aspect, a computer system may present options for selecting a background track. The computer system may generate suggested lyrics based on parameters entered by the user. User interface elements allow the computer system to receive input of lyrics. As the user inputs lyrics, the computer system may update its suggestions of lyrics based on the previously input lyrics. In addition, the computer system may generate proposed melodies to go with the lyrics and the background track. The user may select from among the melodies created for each portion of lyrics. The computer system may optionally generate a computer-synthesized vocal(s) or capture a vocal track of a human voice singing the song. The background track, lyrics, melodies, and vocals may be combined to produce a complete song without requiring musical training or experience by the user.