G06F16/634

Karaoke query processing system
11366851 · 2022-06-21 · ·

Computer systems and methods are provided for processing audio queries. An electronic device receives an audio clip and performs a matching process on the audio clip. The matching process includes comparing at least a portion of the audio clip to a plurality of reference audio tracks and identifying, based on the comparing, a first portion of a particular reference track that corresponds to the audio sample. Upon identifying the matching portion, the electronic device provides a backing track for playback which corresponds to the particular reference track, and an initial playback position of the backing track.

Audio recognition-based industrial automation control

A system for performing industrial automation control may include an audio device that receives audio data from an element in an industrial automation system. The audio device may determine orientation data based on the audio data. In addition, the audio device may determine an automation command to control a machine in the industrial automation system based on the audio data and the orientation data. After determining the automation command, the audio device may implement a first control action for the machine based at least in part on the automation command, where the first control action causes the machine to adjust an operation.

Hierarchical multi-tier language processing platform

Aspects of the disclosure relate to systems and methods for increasing the speed, accuracy, and efficiency of language processing systems. A provided method may include storing a plurality of modules in a database. The method may include configuring the plurality of modules in a multi-tier tree architecture. The method may include receiving an utterance. The method may include processing the utterance via a natural language processing (NLP) engine. The method may include routing the utterance. The routing may include identifying a highest tier module that matches a predetermined portion of the utterance. The method may include compiling a result set of modules. The method may include transmitting the result set of modules to the system user. The result set of modules may include a comprehensive and narrowly tailored response to the user request.

INTERRUPTION DETECTION AND HANDLING BY DIGITAL ASSISTANTS

Systems and methods are described for managing digital assistant interaction. A query is received from a user, and a reply to the query is generated for output. An interruption for the user is detected, and subsequently an end of the interruption is detected. In response to detecting the end of the interruption, a predicted query related to the initial query is identified, and a prompt to provide a reply to the predicted query may be generated for output.

Method for searching and device thereof

Provided are a method and an apparatus for searching for and acquiring information under a computing environment. The apparatus includes: at least one input device configured to receive a first query input of a first query type and a second query input of a second query type; and a controller configured to output a query input window including a first display item corresponding to the first query input and a second display item corresponding to the second query input, to automatically switch, in response to receiving the first query input, the apparatus from a first state to receive the first query input of the first query type to a second state to receive the second query input of the second query type, and to obtain a search result according to a query based on the first query input and the second query input.

MUSIC STREAMING, PLAYLIST CREATION AND STREAMING ARCHITECTURE

A system and method for making categorized music tracks available to end user applications. The tracks may be categorized based on computer-derived rhythm, texture and pitch (RTP) scores for tracks derived from high-level acoustic attributes, which is based on low level data extracted from the tracks. RTP scores are stored in a universal database common to all of the music publishers so that the same track, once RTP scored, does not need to be re-RTP scored by other music publishers. End user applications access an API server to import collections of tracks published by publishers, to create playlists and initiate music streaming. Each end user application is sponsored by a single music publisher so that only tracks capable of being streamed by the music publisher are available to the sponsored end user application.

INTELLIGENT AUTOMATED ASSISTANT FOR MEDIA EXPLORATION

Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a first natural-language speech input indicative of a request for media, where the first natural-language speech input comprises a first search parameter; providing, by a digital assistant, a first media item identified based on the first search parameter. The method further includes, while providing the first media item, receiving a second natural-language speech input and determining whether the second input corresponds to a user intent of refining the request for media. The method further includes, in accordance with a determination that the second speech input corresponds to a user intent of refining the request for media: identifying, based on the first parameter and the second speech input, a second media item and providing the second media item.

Song similarity determination

Aspects of the technology described herein use acoustic features of a music track to capture information for a recommendation system. The recommendation can work without analyzing label data (e.g., genre, artist) or usage data for a track. For each audio track, a descriptor is generated that can be used to compare the track to other tracks. The comparisons between track descriptors result in a similarity measure that can be used to make a recommendation. In this process, the audio descriptors are used directly to form a track-to-track similarity measure between tracks. By measuring the similarity between a track that a user is known to like and an unknown track, a decision can be made whether to recommend the unknown track to the user.

RELATIONS BETWEEN MUSIC ITEMS
20230260492 · 2023-08-17 · ·

A method of determining relations between music items, the method comprising determining a first input representation for a symbolic representation of a first music item, mapping the first input representation onto to one or more subspaces derived from a vector space using a first model, wherein each subspace models a characteristic of the music items, determining a second input representation for music data representing a second music item, mapping the second input representation onto the one or more subspaces using a second model, determining a distance between the mappings of the first and second input representation in each subspace, wherein the distance represents the degree of relation between the first and second input representation with respect to the characteristic modelled by the subspace.

Method and apparatus for retrieving audio file, server, and computer-readable storage medium

A method for retrieving an audio file includes: collecting an audio segment in real time; and for every two chronologically adjacent audio frames in a plurality of audio frames of the audio segment, acquiring a difference value between spectral centroids of a sub-band corresponding to the two audio frames, to obtain a plurality of difference values; and obtaining an audio fingerprint corresponding to the two audio frames based on the plurality of difference values. A quantity of the plurality of difference values equaling a quantity of sub-bands of one of the two audio frames. Each bit of the audio fingerprint being determined based on a comparison between a difference value corresponding to the bit and a preset difference value threshold. The method also includes retrieving, in an audio file library based on audio fingerprints of the plurality of audio frames, a target audio file matching the audio segment.