Patent classifications
G10H2240/075
SERVER SIDE CROSSFADING FOR PROGRESSIVE DOWNLOAD MEDIA
Systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service. Effects or crossfade processing can occur on the broadcast, publisher or server-side, but can still be personalized to a specific user, in a manner that minimizes processing on the downstream side or client device. The cross-fade can be implemented after decoding, processing, re-encoding, and rechunking the relevant chunks of each component clip. Alternatively, the cross-fade or other effect can be implemented on the relevant chunks in the compressed domain, thus obviating any loss of quality by re-encoding. A large scale personalized content delivery service can limit the processing to essentially the first and last chunks of any file, there being no need to process the full clip.
Systems and methods for embedding data in media content
An electronic device modifies a first media content item by superimposing a first set of data over a first accented musical event. The first accented musical event has a first audio profile. The first set of data has a second audio profile configured to be masked by the first audio profile during playback of the first media content item. The electronic device transmits, to a second electronic device, the modified first media content item.
Modular automated music production server
A music production system comprises: a computer interface comprising at least one input for receiving an external request for a piece of music and at least one output for transmitting a response to the external request which comprises or indicates a piece of music incorporating first music data; a first music production component configured to process second music data according to at least a first input setting so as to generate the first music data; a second music production component configured to receive via the computer interface an internal request, and provide the second music data based on at least a second input setting denoted by the internal request; and a controller configured to determine in response to the external request the first and second input settings, and instigate the internal request via the computer interface.
SOUND SIGNAL DATABASE GENERATION APPARATUS, SOUND SIGNAL SEARCH APPARATUS, SOUND SIGNAL DATABASE GENERATION METHOD, SOUND SIGNAL SEARCH METHOD, DATABASE GENERATION APPARATUS, DATA SEARCH APPARATUS, DATABASE GENERATION METHOD, DATA SEARCH METHOD, AND PROGRAM
To provide database generation techniques that can accurately and efficiently generate a database useable in text-based sound signal search. A sound signal database generation apparatus includes: a latent variable generation unit that generates, from a sound signal, a latent variable corresponding to the sound signal using a sound signal encoder; a data generation unit that generates a natural language representation corresponding to the sound signal from the latent variable and a condition concerning an index for a natural language representation using a natural language representation decoder; and a sound signal database generation unit that generates a record including the natural language representation corresponding to the sound signal and the sound signal from the natural language representation corresponding to the sound signal and the sound signal, and generates a sound signal database made up of the record.
METHOD AND APPARATUS FOR IDENTIFYING MUSIC IN CONTENT
The present invention relates to an apparatus and method for identifying music in a content, The present invention includes extracting and storing a fingerprint of an original audio in an audio fingerprint DB; extracting a first fingerprint of a first audio in the content; and searching for a fingerprint corresponding to the fingerprint of the first audio in the audio fingerprint DB, wherein the first audio is audio data in a music section detected from the content.
Generative composition with texture groups
A computer-implemented method of generating a musical composition containing a plurality of musical texture groups is disclosed. The method includes assembling musical texture groups from musical instrument components and associating therewith a tag expressing emotional textural connotation. The instrument components have musical textural classifiers selected from a set of pre-defined textural classifiers such that different instrument components may have a different subset of pre-defined textural classifiers. The textural classifiers within a texture group possess either no musical feature attribute or a single musical feature attribute and any number of musical accompaniment attributes. The method then generates at least one chord scheme to a narrative brief, to provide an emotional connotation to a series of events, the chord scheme generated by selecting and assembling Form Atoms. The final step includes applying a texture to the chord scheme to generate the musical composition reflecting the narrative brief.
SYSTEMS AND METHODS FOR EMBEDDING DATA IN MEDIA CONTENT
A method is provided for modifying a first media content item by superimposing a first set of data over a first audio event having an amplitude that satisfies a first threshold. The first audio event has a first audio profile, the first set of data has a second audio profile, playback of the second audio profile is configured to be masked by the first audio profile during playback of the first media content item, and the first set of data includes playlist information. The method includes transmitting, to a second electronic device, the modified first media content item.
System and methods for automatically generating a musical composition having audibly correct form
A generative composition system reduces existing musical artefacts to constituent elements termed “Form Atoms”. These Form Atoms may each be of varying length and have musical properties and associations that link together through Markov chains. To provide myriad new composition, a set of heuristics ensures that musical textures between concatenated musical sections follow a supplied and defined briefing narrative for the new composition whilst contiguous concatenated Form Atoms are also automatically selected to see that similarities in respective and identified attributes of musical textures for those musical sections are maintained to maintain good musical form. Within the composition work, chord spacing and control are practiced to maintain musical sense in the new composition and a primitive heuristics structure maintains pitch and permits key transformation. The system provides signal analysis and music generation by allowing emotional connotations to be specified and reproduced from cross-referenced Form-Atoms.
GENERATIVE COMPOSITION WITH DEFINED FORM ATOM HEURISTICS
A generative composition system reduces existing musical artefacts to constituent elements termed “Form Atoms”. These Form Atoms may each be of varying length and have musical properties and associations that link together through Markov chains. To provide myriad new composition, a set of heuristics ensures that musical textures between concatenated musical sections follow a supplied and defined briefing narrative for the new composition whilst contiguous concatenated Form Atoms are also automatically selected to see that similarities in respective and identified attributes of musical textures for those musical sections are maintained to support maintenance of musical form. Independent aspects of the disclosure further ensure that, within the composition work, such as a media product or a real-time audio stream, chord spacing determination and control are practiced to maintain musical sense in the new composition. Further, a structuring of primitive heuristics operates to maintain pitch and permit key transformation. The system and its functionality provides signal analysis and music generation through allowing emotional connotations to be specified and reproduced from cross-referenced Form-Atoms.
SYSTEM AND METHOD FOR AI/XI BASED AUTOMATIC SONG FINDING METHOD FOR VIDEOS
According to a first embodiment, one method presented herein involves methods of finding the best fitting song from a large audio database for a selected video production. The content of these songs utilized in the instant invention has been tagged by emotion tags describing the energy, the emotion of these songs over time—meaning that each song can contain a plurality of, even overlapping, emotions.