Patent classifications
G06F16/634
COORDINATION OF OVERLAPPING PROCESSING OF AUDIO QUERIES
Coordinating processing of audio queries is provided. A system receives a query. The system provides the query to a first digital assistant component and a second digital assistant component for processing. The system receives a first response to the query from the first digital assistant component, and a second response to the query from the second digital assistant component. The first digital assistant component can be authorized to access a database the second digital assistant component is prohibited from accessing. The system determines, based on a ranking decision function, to select the second response to the query from the second digital assistant component. The system provides, responsive to the selection, the second response from the second digital assistant to a computing device.
CONSOLIDATION OF RESPONSES FROM QUERIES TO DISPARATE DATA SOURCES
Consolidation of disparate data source queries is provided. An application intake component can receive an application package file, and process the file to identify one or more actions and entities for the application. A natural language processor component can receive an input audio signal to identify an action and an entity on which to perform the action. An application delivery component can identify applications that can perform the action and applications that have access to the entity, and then select an application that is configured to perform the action on the entity. The application delivery component can provide an indication of the application selected by the system to a client device.
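The selection step in this abstract can be illustrated with a minimal sketch. It assumes each application's package file has already been reduced to a set of supported actions and a set of accessible entities; the `manifests` layout, the `select_application` name, and the alphabetical tie-break are hypothetical and do not come from the patent.

```python
def select_application(action, entity, manifests):
    """Return an app that can perform `action` on `entity`, or None.

    `manifests` maps an app name to {"actions": set, "entities": set}
    (a hypothetical, simplified form of the parsed package file).
    """
    # Applications that can perform the requested action.
    can_act = {name for name, m in manifests.items() if action in m["actions"]}
    # Applications that have access to the requested entity.
    has_entity = {name for name, m in manifests.items() if entity in m["entities"]}
    # Only apps in both sets qualify; alphabetical order is an assumed tie-break.
    candidates = sorted(can_act & has_entity)
    return candidates[0] if candidates else None


manifests = {
    "music_app": {"actions": {"play", "pause"}, "entities": {"song", "album"}},
    "video_app": {"actions": {"play"}, "entities": {"movie"}},
    "notes_app": {"actions": {"create"}, "entities": {"note"}},
}

chosen = select_application("play", "movie", manifests)
missing = select_application("create", "song", manifests)
```

Here `chosen` is the only application that both supports "play" and has access to "movie", while `missing` is `None` because no single application covers that action and entity together.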
Method for searching and device thereof
Provided are a method and an apparatus for searching for and acquiring information under a computing environment. The apparatus includes: at least one input device configured to receive a first query input of a first query type and a second query input of a second query type; and a controller configured to output a query input window including a first display item corresponding to the first query input and a second display item corresponding to the second query input, to automatically switch, in response to receiving the first query input, the apparatus from a first state to receive the first query input of the first query type to a second state to receive the second query input of the second query type, and to obtain a search result according to a query based on the first query input and the second query input.
Systems, methods and computer program products for associating media content having different modalities
Systems, methods, and computer program products for associating a media content clip(s) with other media content clip(s) having a different modality by determining first embedding vectors of media content items of a first modality, receiving a media content clip of a second modality, determining a second embedding vector of the media content clip of the second modality, ranking the first embedding vectors based on a distance between the embedding vectors and the second embedding vector, and selecting one or more of the media content items of the first modality based on the ranking, thereby pairing media content clips based on emotion.
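The ranking step described here can be sketched as a nearest-neighbor search over embedding vectors. The sketch assumes the embeddings are already computed and uses Euclidean distance; the abstract does not name a distance metric, and the item names and two-dimensional vectors are made up for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_items(item_embeddings, query_embedding, top_k=1):
    """Return the names of the top_k items closest to the query embedding."""
    ranked = sorted(
        item_embeddings,
        key=lambda name: euclidean(item_embeddings[name], query_embedding),
    )
    return ranked[:top_k]


# Hypothetical embeddings for items of the first modality (audio tracks).
tracks = {
    "calm_piano": [0.1, 0.9],
    "upbeat_pop": [0.9, 0.2],
    "tense_strings": [0.5, 0.5],
}
# Hypothetical embedding of a clip of the second modality (video).
video_clip = [0.8, 0.3]

best = rank_items(tracks, video_clip, top_k=2)
```

With these toy vectors, `best` lists "upbeat_pop" first and "tense_strings" second, since those two lie closest to the clip embedding.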
AUDIO STEM IDENTIFICATION SYSTEMS AND METHODS
Methods, systems and computer program products are provided for determining acoustic feature vectors of query and target items in a first vector space, and mapping the acoustic feature vectors to a second vector space having a lower dimension. The distribution of vectors in the second vector space can then be used to identify items from the same songs, and/or items that are complementary. A mapping function is trained using a machine learning algorithm, such that complementary audio items are closer in the second vector space than the first, according to a given distance metric.
ERROR DETECTION AND CORRECTION FOR AUDIO CACHE
An audio file associated with a user voice query may be received at a user device. The audio file may be compared to a plurality of references, such as cache entries, corresponding to a plurality of other voice queries. Based on a determination that the voice query corresponds to one of the references, an operation associated with the voice query may be executed. An indication may be received that the operation was not an intended operation associated with the voice query. Based on receiving this indication, the incorrectly identified operation and its associated reference (e.g., the cached voice query) may be disabled for the user or the device. However, the cache entry may remain enabled for one or more of a plurality of other devices.
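A minimal sketch of the per-device disabling behavior follows. It assumes the audio comparison has already produced a fingerprint string that either matches a cache entry exactly or not at all; the real comparison is not specified in the abstract, and the class, method, and operation names are hypothetical.

```python
class VoiceQueryCache:
    """Shared cache of voice-query fingerprints with per-device opt-outs."""

    def __init__(self):
        self._entries = {}   # fingerprint -> operation name
        self._disabled = {}  # fingerprint -> set of device ids that opted out

    def add(self, fingerprint, operation):
        self._entries[fingerprint] = operation

    def lookup(self, fingerprint, device_id):
        """Return the cached operation, or None if absent or disabled for this device."""
        if device_id in self._disabled.get(fingerprint, set()):
            return None
        return self._entries.get(fingerprint)

    def report_error(self, fingerprint, device_id):
        """Disable the entry for one device only; other devices keep using it."""
        self._disabled.setdefault(fingerprint, set()).add(device_id)


cache = VoiceQueryCache()
cache.add("fp-123", "tune_to_channel_5")

before = cache.lookup("fp-123", "device-A")
cache.report_error("fp-123", "device-A")   # device A says the operation was wrong
after_a = cache.lookup("fp-123", "device-A")
after_b = cache.lookup("fp-123", "device-B")
```

After the error report, `after_a` is `None` while `after_b` still resolves to the cached operation, which mirrors the entry staying enabled for other devices.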
Systems and methods for voice-based initiation of custom device actions
Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from the device action data set based on content of the audio signal. The data processing system can then identify, from the device action data set, a command associated with the device action and send the command to the device for execution.
Contextual Indexing of Media Items
Example techniques relate to a sub-index of a media index. An example implementation may involve maintaining, on a mobile device, a first index of audio tracks associated with a particular user profile, the audio tracks indexed in the first index consisting of a particular subset of audio tracks that are indexed in a second index. Based on receiving input data indicating a search query, the mobile device searches, within the first index, for audio tracks corresponding to the search query. If the audio tracks corresponding to the search query are not found in the first index, the mobile device sends, to one or more servers of a cloud service, a request to search the second index for audio tracks corresponding to the search query.
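The local-first lookup with a cloud fallback can be sketched as below. The sketch assumes the first index is a plain list of track titles, that matching is a case-insensitive substring test, and that the cloud request is represented by a caller-supplied function; none of these details come from the abstract, and `fake_cloud_search` is a stand-in for a real network call.

```python
def search_tracks(query, local_index, remote_search):
    """Search the on-device index first; fall back to the cloud index on a miss.

    Returns (matching_titles, source) where source is "local" or "remote".
    """
    needle = query.lower()
    hits = [title for title in local_index if needle in title.lower()]
    if hits:
        return hits, "local"
    # Nothing found on the device: ask the cloud service to search the full index.
    return remote_search(query), "remote"


def fake_cloud_search(query):
    """Hypothetical stand-in for the request to the cloud service's servers."""
    full_index = ["Blue Monday", "Blue in Green", "Green Onions"]
    return [t for t in full_index if query.lower() in t.lower()]


local_index = ["Blue Monday"]  # subset of the cloud index tied to one profile

local_result = search_tracks("blue", local_index, fake_cloud_search)
remote_result = search_tracks("green", local_index, fake_cloud_search)
```

The first query is answered from the device without any remote call, while the second misses locally and is answered from the larger index.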
MEDIA CONTENT IDENTIFICATION AND PLAYBACK
Systems, devices, apparatuses, components, methods, and techniques for identifying and playing media content are provided. An example media-playback device for identifying and playing media content for a user traveling in a vehicle includes an audio identification engine and a media playback engine. Audio content is recorded and identified by comparison to media content databases. The audio content is identified and immediately played on the same device. Additional media content is selected for playback based on user listening preferences.
Tagging an Image with Audio-Related Metadata
In one aspect, an example method to be performed by a computing device includes (a) receiving a request to use a camera of the computing device; (b) in response to receiving the request, (i) using a microphone of the computing device to capture audio content and (ii) using the camera of the computing device to capture an image; (c) identifying reference audio content that has at least a threshold extent of similarity with the captured audio content; and (d) outputting an indication of the identified reference audio content while displaying the captured image.