G06F16/683

Phonetic comparison for virtual assistants

In an approach for optimizing an intelligent virtual assistant by using phonetic comparison to find a response stored in a local database, a processor receives an audio input on a computing device. A processor transcribes the audio input to text. A processor compares the text to a set of user queries and commands in a local database of the computing device using a phonetic algorithm. A processor determines whether a user query or command of the set of user queries and commands meets a pre-defined threshold of similarity. Responsive to determining that the user query or command meets the pre-defined threshold of similarity, a processor identifies an intention of a set of intentions stored in the local database corresponding to the user query or command. A processor identifies a response of a set of responses in the local database corresponding to the intention. A processor outputs the response audibly.
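The lookup flow in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes Soundex as the phonetic algorithm, a `difflib` ratio over per-word Soundex keys as the similarity measure, and a threshold of 0.8. The tables `QUERIES` and `RESPONSES` and the function names are hypothetical, and the response is returned as text rather than spoken.

```python
from difflib import SequenceMatcher

# Soundex digit for each consonant; vowels, y, h and w have no digit.
_CODES = {ch: digit
          for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                                 "4": "l", "5": "mn", "6": "r"}.items()
          for ch in letters}

def soundex(word):
    """Return the four-character Soundex key of a word ('' if no letters)."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    key = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = _CODES.get(ch, "")
        if code and code != prev:
            key += code
        prev = code  # a vowel resets prev, so a repeated digit is kept
    return (key + "000")[:4]

def phonetic_keys(text):
    """Phonetic key of every word in a phrase, in order."""
    return [k for k in (soundex(w) for w in text.split()) if k]

# Hypothetical local database: stored query -> intention -> response.
QUERIES = {"what time is it": "get_time", "turn on the lights": "lights_on"}
RESPONSES = {"get_time": "It is noon.", "lights_on": "Turning on the lights."}

def find_response(transcript, threshold=0.8):
    """Return the stored response for the closest query, or None below threshold."""
    keys = phonetic_keys(transcript)
    best_query, best_score = None, 0.0
    for query in QUERIES:
        score = SequenceMatcher(None, keys, phonetic_keys(query)).ratio()
        if score > best_score:
            best_query, best_score = query, score
    if best_query is None or best_score < threshold:
        return None
    return RESPONSES[QUERIES[best_query]]
```

With these assumptions, a mistranscription such as "wat thyme is it" yields the same key sequence as the stored query "what time is it" and so resolves to the `get_time` response, while an unrelated phrase falls below the threshold and returns `None`.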

Text-to-speech from media content item snippets

A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The portion of the track's audio that contains the matching lyric is located and extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form the audio output.
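One way to picture the combination step is as a playback plan that interleaves synthesized segments with a track excerpt. The sketch below is an assumption-laden illustration: the `TRACKS` table, the `AlignedWord` record, and the longest-exact-word-run matching rule are all hypothetical, and the plan only names the segments instead of producing audio.

```python
from typing import NamedTuple

class AlignedWord(NamedTuple):
    """One lyric word with start/end times (seconds) from forced alignment."""
    word: str
    start: float
    end: float

# Hypothetical forced-alignment data for one track.
TRACKS = {
    "track-1": [
        AlignedWord("here", 12.0, 12.3),
        AlignedWord("comes", 12.3, 12.7),
        AlignedWord("the", 12.7, 12.9),
        AlignedWord("sun", 12.9, 13.6),
    ],
}

def find_clip(words, min_len=2):
    """Find the longest run of input words that appears verbatim in a lyric.

    Returns (first_word_index, end_index, track_id, clip_start, clip_end)
    or None when no run of at least min_len words matches.
    """
    for size in range(len(words), min_len - 1, -1):
        for i in range(len(words) - size + 1):
            window = words[i:i + size]
            for track_id, aligned in TRACKS.items():
                lyric = [w.word for w in aligned]
                for j in range(len(lyric) - size + 1):
                    if lyric[j:j + size] == window:
                        return (i, i + size, track_id,
                                aligned[j].start, aligned[j + size - 1].end)
    return None

def build_plan(text):
    """Split text into ('tts', words) and ('clip', track, start, end) segments."""
    words = text.lower().split()
    match = find_clip(words)
    if match is None:
        return [("tts", " ".join(words))]
    start, end, track_id, t0, t1 = match
    plan = []
    if start > 0:
        plan.append(("tts", " ".join(words[:start])))
    plan.append(("clip", track_id, t0, t1))
    if end < len(words):
        plan.append(("tts", " ".join(words[end:])))
    return plan
```

For the input "Good morning here comes the sun", this plan synthesizes "good morning" and then plays the excerpt of `track-1` from 12.0 s to 13.6 s. A real engine would pass the `tts` segments to a synthesizer and cut the clip from the track's audio.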

Speech endpointing

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include determining a pause threshold based on that voice query log data. The actions further include receiving an utterance from the particular user. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include, based on that determination, processing the utterance as a voice query.
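The abstract does not say how the pause threshold is derived from the log data. As a minimal sketch, assume the log yields the user's past mid-query pause lengths in milliseconds and that the threshold is the longest such pause plus a fixed margin. The function names, the default, the margin, and the per-frame speech flags are all hypothetical.

```python
def pause_threshold_ms(logged_pauses_ms, default_ms=800, margin_ms=100):
    """Per-user pause threshold: longest logged mid-query pause plus a margin.

    Falls back to default_ms when the user has no logged pauses.
    """
    if not logged_pauses_ms:
        return default_ms
    return max(logged_pauses_ms) + margin_ms

def find_endpoint(speech_flags, frame_ms, threshold_ms):
    """Return the index of the frame at which the utterance is endpointed.

    speech_flags holds one boolean per audio frame (True = speech detected).
    The endpoint is the first frame where silence following speech has lasted
    at least threshold_ms; None means the user has not stopped long enough.
    """
    silence_ms = 0
    heard_speech = False
    for i, is_speech in enumerate(speech_flags):
        if is_speech:
            heard_speech = True
            silence_ms = 0  # any speech restarts the silence timer
        elif heard_speech:
            silence_ms += frame_ms
            if silence_ms >= threshold_ms:
                return i
    return None
```

A user whose logged pauses are 300, 450 and 400 ms gets a threshold of 550 ms under these assumptions, so a 200 ms hesitation mid-utterance does not end the query but 600 ms of trailing silence does.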

Systems and methods for digitally fetching music content
11709887 · 2023-07-25

Systems and methods for digitally fetching music content may include (1) providing a user of a device with a music service accessible via the device, (2) receiving a digital user request to (i) select a music composition based on a digital image corresponding to one or more music compositions and (ii) digitally fetch the music composition for use via the music service, and, in response to receiving the digital user request, (3) analyzing the digital image and selecting the music composition based on the analysis of the digital image, and (4) digitally fetching the music composition for use via the music service. Various other methods, systems, and computer-readable media are also disclosed.

SCALABLE ARCHITECTURES FOR REFERENCE SIGNATURE MATCHING AND UPDATING
20230239547 · 2023-07-27

Methods, apparatus, systems and articles of manufacture are disclosed for scalable architectures for reference signature matching and updating. An example method includes accessing site signatures to be compared to reference signatures from a first group of media sources; determining whether a first reference node is an owner of a first one of the site signatures; when the first reference node is the owner of the first site signature, comparing a neighborhood of site signatures including the first site signature to reference signatures in a first subset of reference signatures, the first subset of reference signatures stored in a first memory partition associated with the first reference node; and, when the first reference node is not the owner of the first site signature, not comparing the site signature to reference signatures.
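The ownership check can be illustrated with a small sketch. The abstract does not state how ownership is decided or how neighborhoods are compared, so the code assumes integer signatures, ownership by `signature % num_nodes`, and a match score equal to the number of signatures shared between the site neighborhood and the stored reference neighborhood. All names are hypothetical.

```python
def owner_of(signature, num_nodes):
    """Assumed ownership rule: a signature belongs to node signature % num_nodes."""
    return signature % num_nodes

def build_partition(node_id, num_nodes, references, radius=1):
    """Build one node's memory partition from {source: [signature, ...]}.

    For every reference signature the node owns, keep its media source and
    the neighboring reference signatures within radius positions.
    """
    partition = {}
    for source, sigs in references.items():
        for i, sig in enumerate(sigs):
            if owner_of(sig, num_nodes) == node_id:
                partition[sig] = (source, sigs[max(0, i - radius):i + radius + 1])
    return partition

def match_on_node(node_id, num_nodes, partition, site_sigs, index, radius=1):
    """Compare the neighborhood around site_sigs[index] on this node.

    Returns (source, shared_count), or None when this node is not the owner
    (no comparison is made) or holds no reference for the signature.
    """
    sig = site_sigs[index]
    if owner_of(sig, num_nodes) != node_id:
        return None  # another node owns this signature; skip the comparison
    if sig not in partition:
        return None
    source, ref_neighborhood = partition[sig]
    site_neighborhood = site_sigs[max(0, index - radius):index + radius + 1]
    shared = len(set(site_neighborhood) & set(ref_neighborhood))
    return source, shared
```

With two nodes, references `{"channel-a": [10, 11, 12, 13]}`, and site signatures `[11, 12, 13]`, node 0 owns signature 12 and reports three shared signatures against `channel-a`, while node 1 skips that signature entirely. Because each node only touches its own partition, adding nodes spreads both memory and comparison work.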

MEDIA SELECTION
20230004594 · 2023-01-05

Systems, devices, and techniques are disclosed for receiving a first taste profile of a first user based on at least a first profile criterion and receiving a second taste profile of a second user based on at least a second profile criterion. An area of interest may be determined based on the first and second taste profiles. A playlist containing one or more media items (e.g., audio, video, text, etc.) may be generated based on the determined area of interest.
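As a rough illustration of the two-profile flow, the sketch below assumes each taste profile is a mapping from genre to a weight between 0 and 1, and that the area of interest is the shared genre whose lower weight across the two users is highest. The profile format, the scoring rule, and the `CATALOG` table are hypothetical and do not come from the abstract.

```python
def area_of_interest(first_profile, second_profile):
    """Pick the shared genre that both users rate most highly.

    Each genre is scored by the lower of the two users' weights, so a genre
    only one user cares about cannot win. Returns None if nothing is shared.
    """
    shared = set(first_profile) & set(second_profile)
    if not shared:
        return None
    # sorted() makes ties resolve alphabetically instead of by set order.
    return max(sorted(shared),
               key=lambda g: min(first_profile[g], second_profile[g]))

# Hypothetical media catalog: (title, genre).
CATALOG = [
    ("Blue Hour", "jazz"),
    ("Loud Road", "rock"),
    ("Night Quartet", "jazz"),
    ("Pop Parade", "pop"),
]

def build_playlist(first_profile, second_profile, limit=10):
    """Return up to limit catalog titles from the shared area of interest."""
    area = area_of_interest(first_profile, second_profile)
    if area is None:
        return []
    return [title for title, genre in CATALOG if genre == area][:limit]
```

Given profiles `{"jazz": 0.9, "rock": 0.4, "folk": 0.2}` and `{"rock": 0.8, "jazz": 0.5, "pop": 0.7}`, jazz scores 0.5 and rock 0.4, so the playlist is drawn from the jazz items.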

Mood determination of a collection of media content items

Systems, methods, and computer-readable media for determining at least one valid mood for a collection of media content items of a media library are provided.