Patent classifications
G06F16/634
SONG GENERATION BASED ON A TEXT INPUT
The disclosure provides a method and an apparatus for song generation. A text input may be received. A topic and an emotion may be extracted from the text input. A melody may be determined according to the topic and the emotion. Lyrics may be generated according to the melody and the text input. A song may be generated at least according to the melody and the lyrics.
SOUND SIGNAL SEARCH APPARATUS, SOUND SIGNAL SEARCH METHOD, DATA SEARCH APPARATUS, DATA SEARCH METHOD, AND PROGRAM
Provided are sound signal search techniques that can search for sound signals without tagging them with text data. A sound signal search apparatus includes: a recording unit that records a sound signal database made up of records, each including a sound signal and a latent variable generated from that sound signal with a sound signal encoder; a latent variable generation unit that uses a natural language representation encoder to generate, from an input natural language representation, a latent variable corresponding to that representation; and a search unit that uses the sound signal database to determine, from the latent variable corresponding to the input natural language representation, the sound signals corresponding to that representation as a search result.
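The search step described in this abstract can be pictured as a nearest-neighbor lookup in a shared latent space. The following is a minimal sketch, not the patented method: it assumes both encoders already exist and map into the same vector space, it uses cosine similarity as the ranking measure (the abstract does not name one), and the function names, file names, and vectors are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search_sound_signals(query_latent, database, top_k=1):
    """Return the ids of the top_k records whose latent is closest to the query.

    database is a list of (latent, sound_id) records; in the abstract each
    latent would come from the sound signal encoder.
    """
    ranked = sorted(
        database,
        key=lambda record: cosine_similarity(query_latent, record[0]),
        reverse=True,
    )
    return [sound_id for _, sound_id in ranked[:top_k]]


# Hypothetical toy records standing in for encoder outputs.
database = [
    ([1.0, 0.0], "dog_bark.wav"),
    ([0.0, 1.0], "rain.wav"),
    ([0.7, 0.7], "thunder.wav"),
]

# Stand-in for the natural language encoder's output for a query such as "rain".
query_latent = [0.1, 0.9]
print(search_sound_signals(query_latent, database, top_k=2))
```

No text tags are stored in the records; the query text only enters through its latent vector, which is the point the abstract emphasizes.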
System and method for acoustic vehicle location tracking
Techniques for acoustic vehicle location tracking are presented. In one embodiment, a processor receives, from a radio-frequency positioning system, measurements of a candidate location for a plurality of vehicles and receives, from a plurality of acoustic sensors, acoustic signals associated with a detected vehicle. The acoustic signals are compared to an acoustic vehicle signature library that includes acoustic information associated with the vehicles. Upon determining that the acoustic signals match acoustic information associated with a vehicle in the acoustic vehicle signature library, the measurements of the candidate location of the detected vehicle based on the radio-frequency signals are compared with a location of the vehicle based on the acoustic signals. Upon determining that the measurements of the candidate location are within a predetermined area of the location of the detected vehicle, the measurements are provided to a vehicle location database.
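The two checks in this abstract (signature match, then location agreement) can be sketched as below. This is an illustrative simplification under stated assumptions: locations are planar coordinates in meters, the "predetermined area" is modeled as a circle of radius `radius_m`, the signature comparison has already been reduced to a boolean, and every name is hypothetical.

```python
import math


def within_area(candidate, acoustic_location, radius_m):
    """True if the RF candidate lies within radius_m of the acoustic location."""
    return math.dist(candidate, acoustic_location) <= radius_m


def validate_measurement(vehicle_id, rf_candidate, acoustic_location,
                         signature_matches, radius_m, location_db):
    """Store the RF measurement only if both checks from the abstract pass."""
    if not signature_matches:
        # The acoustic signals did not match any entry in the signature library.
        return False
    if not within_area(rf_candidate, acoustic_location, radius_m):
        # RF and acoustic location estimates disagree by too much.
        return False
    location_db[vehicle_id] = rf_candidate
    return True


# Hypothetical usage: the RF and acoustic estimates are about 5 m apart.
location_db = {}
accepted = validate_measurement(
    "vehicle-17", (100.0, 200.0), (103.0, 204.0),
    signature_matches=True, radius_m=10.0, location_db=location_db,
)
print(accepted, location_db)
```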
HIERARCHICAL MULTI-TIER LANGUAGE PROCESSING PLATFORM
Aspects of the disclosure relate to systems and methods for increasing the speed, accuracy, and efficiency of language processing systems. A provided method may include storing a plurality of modules in a database. The method may include configuring the plurality of modules in a multi-tier tree architecture. The method may include receiving an utterance. The method may include processing the utterance via a natural language processing (NLP) engine. The method may include routing the utterance. The routing may include identifying a highest tier module that matches a predetermined portion of the utterance. The method may include compiling a result set of modules. The method may include transmitting the result set of modules to the system user. The result set of modules may include a comprehensive and narrowly tailored response to the user request.
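One way to picture the routing step is a walk down a module tree that starts at the highest tier and collects each matching module into the result set. The sketch below rests on several assumptions: plain keyword matching stands in for the NLP engine, each module matches on a single hypothetical keyword, and the banking-style module names are invented for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class Module:
    name: str
    keyword: str  # hypothetical stand-in for the "predetermined portion" to match
    children: list = field(default_factory=list)


def route(utterance, top_tier):
    """Walk the tree from the highest tier down, collecting matching modules."""
    tokens = utterance.lower().split()
    result = []
    tier = top_tier
    while tier:
        match = next((m for m in tier if m.keyword in tokens), None)
        if match is None:
            break
        result.append(match.name)
        tier = match.children  # descend one tier below the matched module
    return result


# Hypothetical two-tier tree.
tree = [
    Module("accounts", "account", [
        Module("balance", "balance"),
        Module("close_account", "close"),
    ]),
    Module("cards", "card", [
        Module("report_lost", "lost"),
    ]),
]

print(route("What is my account balance", tree))
```

Matching at the highest tier first prunes every subtree that cannot apply, which is one plausible reading of how the tiered layout could improve speed.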
Methods and apparatus to identify media that has been pitch shifted, time shifted, and/or resampled
Methods, apparatus, systems and articles of manufacture are disclosed to identify media that has been pitch shifted, time shifted, and/or resampled. An example method includes: generating, by executing an instruction with a processor, a fingerprint from an audio signal; transmitting the fingerprint and adjusting instructions to a central facility to facilitate a query, the adjusting instructions identifying at least one of a pitch shift or a time shift; receiving a response including an identifier for the audio signal and information corresponding to how the audio signal was adjusted; and storing information indicative of the identifier and the information in a database.
Methods and apparatus to identify media
Methods, apparatus, systems and articles of manufacture are disclosed to identify media. An example method includes: in response to a query, generating an adjusted sample media fingerprint by applying an adjustment to a sample media fingerprint; comparing the adjusted sample media fingerprint to a reference media fingerprint; and in response to the adjusted sample media fingerprint matching the reference media fingerprint, transmitting information associated with the reference media fingerprint and the adjustment.
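The adjust-then-compare loop in this abstract can be illustrated with a toy fingerprint. This sketch assumes a fingerprint is a list of `(time_frame, frequency_bin)` pairs and that a pitch shift moves every frequency bin by a fixed integer; both are simplifications chosen for illustration, and exact equality stands in for whatever tolerance-based matching a real system would use. All names are hypothetical.

```python
def apply_adjustment(fingerprint, bin_shift):
    """Shift every frequency bin of the fingerprint by bin_shift."""
    return [(t, f + bin_shift) for t, f in fingerprint]


def identify(sample_fp, references, candidate_shifts):
    """Return (media_id, shift) for the first adjusted sample that matches a reference."""
    for shift in candidate_shifts:
        adjusted = apply_adjustment(sample_fp, shift)
        for media_id, reference_fp in references.items():
            if adjusted == reference_fp:
                # Report both the matched media and the adjustment that produced the match.
                return media_id, shift
    return None


# Hypothetical reference library with a single entry.
references = {"song_a": [(0, 10), (1, 12), (2, 15)]}

# The sample is "song_a" pitched up by two bins, so a shift of -2 undoes it.
sample_fp = [(0, 12), (1, 14), (2, 17)]
print(identify(sample_fp, references, candidate_shifts=[0, -1, -2, 1, 2]))
```

Returning the shift alongside the identifier mirrors the abstract's requirement that the response carry both the reference information and the adjustment.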
METHOD FOR SEARCHING AND DEVICE THEREOF
Provided are a method and an apparatus for searching for and acquiring information in a computing environment. The apparatus includes: at least one input device configured to receive a first query input of a first query type and a second query input of a second query type; and a controller configured to output a query input window including a first display item corresponding to the first query input and a second display item corresponding to the second query input, to automatically switch the apparatus, in response to receiving the first query input, from a first state for receiving the first query input of the first query type to a second state for receiving the second query input of the second query type, and to obtain a search result according to a query based on the first query input and the second query input.
MUSIC STREAMING, PLAYLIST CREATION AND STREAMING ARCHITECTURE
A system and method for making categorized music tracks available to end user applications. The tracks may be categorized based on computer-derived rhythm, texture and pitch (RTP) scores, which are derived from high-level acoustic attributes that are in turn based on low-level data extracted from the tracks. RTP scores are stored in a universal database common to all of the music publishers so that the same track, once RTP scored, does not need to be re-RTP scored by other music publishers. End user applications access an API server to import collections of tracks published by publishers, to create playlists and initiate music streaming. Each end user application is sponsored by a single music publisher so that only tracks capable of being streamed by the music publisher are available to the sponsored end user application.
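The "score once, reuse everywhere" behavior of the universal database amounts to a shared cache keyed by track. The sketch below assumes tracks can be recognized across publishers by a common `track_id`, which the abstract does not specify, and it uses a placeholder scorer that returns fixed values; the actual RTP scoring is not reproduced here. All names are hypothetical.

```python
class UniversalRtpDatabase:
    """Shared store of RTP scores so each track is scored at most once."""

    def __init__(self, score_fn):
        self._score_fn = score_fn  # hypothetical scoring function
        self._scores = {}
        self.scoring_runs = 0      # counts how often scoring actually ran

    def get_scores(self, track_id, audio):
        if track_id not in self._scores:
            # First publisher to submit this track pays the scoring cost.
            self._scores[track_id] = self._score_fn(audio)
            self.scoring_runs += 1
        return self._scores[track_id]


def placeholder_scorer(audio):
    """Placeholder only: returns a fixed (rhythm, texture, pitch) triple."""
    return (3, 2, 4)


rtp_db = UniversalRtpDatabase(placeholder_scorer)
first = rtp_db.get_scores("track-001", b"audio bytes from publisher A")
second = rtp_db.get_scores("track-001", b"audio bytes from publisher B")
print(first, second, rtp_db.scoring_runs)
```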
Densification in Music Search and Recommendation
Disclosed herein are computer-implemented method, system, and computer-readable storage-medium embodiments for implementing densification in music search. An embodiment includes processor(s) configured to obtain a first feature set extracted from a first audio recording, and a first fingerprint of the first audio recording; and evaluate, using at least one first machine-learning algorithm, a similarity index corresponding to the first audio recording with respect to at least one second audio recording, considering: the first feature set extracted from the first audio recording, and a second feature set extracted from the at least one second audio recording; or the first fingerprint of the first audio recording, and at least one second fingerprint of the at least one second audio recording. Further embodiments include defining arrangement group(s) including the first audio recording and the at least one second audio recording with a similarity index within a predetermined range, and outputting densified response(s) to a search query.
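Grouping recordings whose similarity index falls within a predetermined range can be sketched as follows. Note the substitution: the abstract evaluates the similarity index with a machine-learning algorithm, whereas this sketch uses Jaccard similarity over feature sets purely as a simple stand-in. The feature labels, recording names, and range bounds are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections, used here as a stand-in similarity index."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def arrangement_group(first_id, features, low, high):
    """Group the first recording with every other recording whose index is in [low, high]."""
    first = features[first_id]
    group = [first_id]
    for recording_id, feature_set in features.items():
        if recording_id != first_id and low <= jaccard(first, feature_set) <= high:
            group.append(recording_id)
    return group


# Hypothetical feature sets for three recordings.
features = {
    "studio": {"key:C", "tempo:120", "vocals"},
    "live": {"key:C", "tempo:120", "vocals", "crowd"},
    "other": {"key:F", "tempo:90"},
}

print(arrangement_group("studio", features, low=0.5, high=1.0))
```

A search front end could then return one entry per group rather than one per recording, which is one plausible reading of a "densified" response.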
Hierarchical multi-tier language processing platform
Aspects of the disclosure relate to systems and methods for increasing the speed, accuracy, and efficiency of language processing systems. A provided method may include storing a plurality of modules in a database. The method may include configuring the plurality of modules in a multi-tier tree architecture. The method may include receiving an utterance. The method may include processing the utterance via a natural language processing (NLP) engine. The method may include routing the utterance. The routing may include identifying a highest tier module that matches a predetermined portion of the utterance. The method may include compiling a result set of modules. The method may include transmitting the result set of modules to the system user. The result set of modules may include a comprehensive and narrowly tailored response to the user request.