Patent classifications
G10H2240/131
AUDIO TRANSLATOR
An audio translation system includes a feature extractor and a style transfer machine learning model. The feature extractor generates, for each of a plurality of source voice files, one or more source voice parameters encoded as a collection of source feature vectors, and generates, for each of a plurality of target voice files, one or more target voice parameters encoded as a collection of target feature vectors. The style transfer machine learning model is trained on the collection of source feature vectors for the plurality of source voice files and the collection of target feature vectors for the plurality of target voice files to generate a style transformed feature vector.
SYSTEM AND METHOD FOR CREATING A SENSORY EXPERIENCE BY MERGING BIOMETRIC DATA WITH USER-PROVIDED CONTENT
Systems and methods for generating sensory outputs (e.g., tactile, scent, and/or flavor) based on biometric/neurometric user data are provided. One exemplary method comprises receiving an incoming signal from a bio-generated data sensing device worn by a user; receiving an input signal representing sensory content experienced by the user in association with generation of the incoming signal; populating a common vocabulary with one or more values determined based on the input signal; determining a set of output values based on the incoming signal, the common vocabulary, which comprises a list of possible output values, and a parameter file, which comprises a set of instructions for applying the common vocabulary to the incoming signal to derive the set of output values; generating an output array comprising the set of output values; and providing the output array to an output delivery system configured to render the output array as a sensory output.
Systems and methods for generating audible versions of text sentences from audio snippets
A method is performed at a server system of a media-providing service. The server system has one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a text sentence including a plurality of words from a device of a first user and extracting a plurality of audio snippets from one or more audio tracks. A respective audio snippet in the plurality of audio snippets corresponds to one or more words in the plurality of words of the text sentence. The method also includes assembling the plurality of audio snippets in a first order to produce an audible version of the text sentence. The method further includes providing, for playback at the device of the first user, the audible version of the text sentence including the plurality of audio snippets in the first order.
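The assembly step in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the `Snippet` type, the `word_index` lookup table, and the track IDs and timestamps are all hypothetical, and the sketch simplifies to one snippet per word even though the abstract allows a snippet to cover several words.

```python
from typing import Dict, List, NamedTuple


class Snippet(NamedTuple):
    """Hypothetical reference to a span of audio inside a track."""
    track_id: str
    start_s: float
    end_s: float


def assemble_snippets(sentence: str, word_index: Dict[str, Snippet]) -> List[Snippet]:
    """Return one snippet per word, in the order the words appear in the sentence."""
    words = sentence.lower().split()
    missing = [w for w in words if w not in word_index]
    if missing:
        # Assumption: a sentence is only rendered if every word has a snippet.
        raise KeyError(f"no snippet for: {missing}")
    return [word_index[w] for w in words]


# Hypothetical index built beforehand from the audio tracks.
index = {
    "hello": Snippet("track_a", 12.0, 12.4),
    "sunny": Snippet("track_b", 45.1, 45.6),
    "world": Snippet("track_a", 30.2, 30.7),
}
playlist = assemble_snippets("Hello sunny world", index)
```

Playing the entries of `playlist` back to back would give the audible version of the sentence; how the snippets are located in the tracks is outside this sketch.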
System and method for generating an audio file
A system and method for synchronizing an audio or MIDI file with a video file are provided. The method includes receiving a first audio or MIDI file, receiving a video file, and operating an audio synchronization module to perform steps of synchronizing the first audio or MIDI file with the video file, marking an event in the video file at a point on a timeline, detecting a first musical key for the event, retrieving a musical stinger or swell from a library, in which the musical stinger or swell is a second audio or MIDI file and is tagged with a second musical key, and the second musical key is relevant to the first musical key, and placing the musical stinger or swell at the point of the timeline marked for the event.
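As a rough sketch of the retrieval and placement steps described above, the example below picks a stinger whose key tag is related to the key detected at a marked event and places it on the timeline. Several things are assumptions, not details from the abstract: the `Stinger` type, the library contents, major keys only, and the rule that "relevant" means the same key or a key a perfect fourth or fifth away. Key detection itself is not implemented; the detected key is passed in.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

NOTES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


@dataclass(frozen=True)
class Stinger:
    """Hypothetical library entry: a short audio or MIDI file tagged with a key."""
    name: str
    key: str  # tonic note name; major mode assumed


def related(key_a: str, key_b: str) -> bool:
    """Assumed relevance rule: same tonic, or a fourth/fifth apart."""
    interval = (NOTES.index(key_b) - NOTES.index(key_a)) % 12
    return interval in (0, 5, 7)


def place_stinger(
    timeline: List[Tuple[float, Stinger]],
    event_time_s: float,
    event_key: str,
    library: List[Stinger],
) -> Optional[Stinger]:
    """Place the first related stinger at the marked time; return it, or None."""
    for stinger in library:
        if related(event_key, stinger.key):
            timeline.append((event_time_s, stinger))
            timeline.sort(key=lambda item: item[0])  # keep the timeline in time order
            return stinger
    return None


library = [Stinger("brass_hit", "F#"), Stinger("string_swell", "G")]
timeline: List[Tuple[float, Stinger]] = []
picked = place_stinger(timeline, 42.5, "C", library)
```

Here the event is in C, so the F# stinger (a tritone away) is skipped and the G stinger is placed at 42.5 seconds.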
Variations audio playback
A method for controlling a playback tempo of an audio track to be presented at an audio output, the audio track comprising a plurality of audio components, a first audio component of the plurality of audio components being associated with a plurality of sets of audio data, wherein each set of audio data in the plurality of sets of audio data is associated with a respective playback tempo range, the method comprising receiving a playback tempo for presenting the audio track at the audio output, selecting, from the plurality of sets of audio data, a set of audio data that has an associated playback tempo range comprising the received playback tempo, and allocating the selected set of audio data to the first audio component for presenting the audio track at the audio output.
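The selection step described above amounts to a range lookup. The following is a minimal sketch under stated assumptions: the `AudioDataSet` type, the set names, and the tempo boundaries are hypothetical, and ranges are treated as half-open so that adjacent ranges do not overlap.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class AudioDataSet:
    """Hypothetical recording of one audio component for a tempo range."""
    name: str
    min_bpm: float  # inclusive lower bound
    max_bpm: float  # exclusive upper bound


def select_audio_data(sets: Sequence[AudioDataSet], tempo_bpm: float) -> Optional[AudioDataSet]:
    """Return the set whose tempo range contains the requested tempo, if any."""
    for candidate in sets:
        if candidate.min_bpm <= tempo_bpm < candidate.max_bpm:
            return candidate
    return None


# Three hypothetical sets of audio data for the same drum component.
drum_sets = [
    AudioDataSet("drums_slow", 60, 100),
    AudioDataSet("drums_mid", 100, 140),
    AudioDataSet("drums_fast", 140, 200),
]
chosen = select_audio_data(drum_sets, 120)
```

With a requested tempo of 120 BPM, `chosen` is the `drums_mid` set, which would then be allocated to the drum component for playback.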
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND PROGRAM
An information processing system including a content generation unit that generates content with use of one or more pieces of material data generated from original content, and a metadata addition unit that adds, to the content, content metadata including material information associated with the material data used for generation of the content and generation information associated with generation of the content.
INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND PROGRAM
An information processing system including a selection unit that selects one or more pieces of content in a use status that meets a predetermined condition in a specific period, an extraction unit that extracts, as a feature value of the content, information associated with one or more pieces of material data used at the time of generation of the content, on the basis of metadata added to each piece of the selected content, and a generation unit that generates support information for a user on the basis of the extracted feature value.
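One way to picture the selection, extraction, and generation units described above is the sketch below. The record layout (`plays_in_period`, `metadata`, `material_ids`), the play-count condition, and the choice of "most frequently used materials" as the support information are all assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, List, Tuple


def most_used_materials(contents: List[Dict], min_plays: int, top_n: int = 3) -> List[Tuple[str, int]]:
    """Select well-used content, then count the materials named in its metadata."""
    # Selection: keep content whose use status meets the (assumed) condition.
    selected = [c for c in contents if c["plays_in_period"] >= min_plays]
    # Extraction: read material IDs from each selected item's metadata.
    counts = Counter(m for c in selected for m in c["metadata"]["material_ids"])
    # Generation: the top materials serve as simple support information.
    return counts.most_common(top_n)


# Hypothetical content records with metadata added at generation time.
contents = [
    {"title": "remix_1", "plays_in_period": 900,
     "metadata": {"material_ids": ["kick_01", "vocal_07"]}},
    {"title": "remix_2", "plays_in_period": 40,
     "metadata": {"material_ids": ["pad_03"]}},
    {"title": "remix_3", "plays_in_period": 500,
     "metadata": {"material_ids": ["kick_01", "bass_02"]}},
]
support_info = most_used_materials(contents, min_plays=100, top_n=1)
```

Only `remix_1` and `remix_3` pass the condition, and `kick_01` is the one material they share, so it is reported as the most used.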
System and method for creating a sensory experience by merging biometric data with user-provided content
Systems and methods are provided for using a common “vocabulary,” predefined or dynamically generated based on user-provided content, to transform biometric and/or neurometric data collected from one or more people into a coherent audio and/or visual result. One method comprises receiving a first incoming signal from a bio-generated data sensing device worn by a first user; determining a first set of output values based on the first incoming signal, a common vocabulary comprising a list of possible output values, and a parameter file comprising a set of instructions for applying the common vocabulary to the first incoming signal to derive the first set of output values; generating a first output array comprising the first set of output values; and providing the first output array to an output delivery system configured to render the first output array as a first audio and/or visual output.
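The derivation of output values from an incoming signal, a common vocabulary, and a parameter file can be sketched as a simple quantization. This is only one possible reading: the parameter keys `signal_min` and `signal_max`, the note names used as vocabulary entries, and the linear mapping are assumptions, not details given in the abstract.

```python
from typing import Dict, List, Sequence


def derive_output_array(
    signal: Sequence[float],
    vocabulary: Sequence[str],
    params: Dict[str, float],
) -> List[str]:
    """Map each signal sample to one entry of the vocabulary."""
    lo, hi = params["signal_min"], params["signal_max"]
    last = len(vocabulary) - 1
    output = []
    for sample in signal:
        clamped = min(max(sample, lo), hi)  # keep out-of-range samples usable
        index = round((clamped - lo) / (hi - lo) * last)
        output.append(vocabulary[index])
    return output


# Hypothetical vocabulary of possible output values (here, note names).
vocabulary = ["C4", "D4", "E4", "G4", "A4"]
# Hypothetical parameter file contents, already parsed into a dict.
params = {"signal_min": 0.0, "signal_max": 1.0}
output_array = derive_output_array([0.0, 0.5, 1.2, 0.26], vocabulary, params)
```

The resulting `output_array` would then be handed to whatever output delivery system renders it as audio or visuals.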
SYSTEM AND METHOD FOR GENERATING AN AUDIO FILE
A system and method for synchronizing an audio or MIDI file with a video file are provided. The method includes receiving a first audio or MIDI file, receiving a video file, and operating an audio synchronization module to perform steps of synchronizing the first audio or MIDI file with the video file, marking an event in the video file at a point on a timeline, detecting a first musical key for the event, retrieving a musical stinger or swell from a library, in which the musical stinger or swell is a second audio or MIDI file and is tagged with a second musical key, and the second musical key is relevant to the first musical key, and placing the musical stinger or swell at the point of the timeline marked for the event.