IPIQ

G06F16/683

Phonetic comparison for virtual assistants

11710480 · 2023-07-25 ·

International Business Machines Corporation

In an approach for optimizing an intelligent virtual assistant by using phonetic comparison to find a response stored in a local database, a processor receives an audio input on a computing device. A processor transcribes the audio input to text. A processor compares the text to a set of user queries and commands in a local database of the computing device using a phonetic algorithm. A processor determines whether a user query or command of the set of user queries and commands meets a pre-defined threshold of similarity. Responsive to determining that the user query or command meets the pre-defined threshold of similarity, a processor identifies an intention of a set of intentions stored in the local database corresponding to the user query or command. A processor identifies a response of a set of responses in the local database corresponding to the intention. A processor outputs the response audibly.

Text-to-speech from media content item snippets

11710474 · 2023-07-25 ·

Spotify Ab

A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.

Speech endpointing

11710477 · 2023-07-25 ·

Google Llc

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.

Systems and methods for digitally fetching music content

11709887 · 2023-07-25 ·

Meta Platforms, Inc.

Debashish Paul

Systems and methods for digitally fetching music content may include (1) providing a user of a device with a music service accessible via the device, (2) receiving a digital user request to (i) select a music composition based on a digital image corresponding to one or more music compositions and (ii) digitally fetch the music composition for use via the music service, and, in response to receiving the digital user request, (3) analyzing the digital image and selecting the music composition based on the analysis of the digital image, and (4) digitally fetching the music composition for use via the music service. Various other methods, systems, and computer-readable media are also disclosed.

Systems and methods for digitally fetching music content

11709887 · 2023-07-25 ·

Meta Platforms, Inc.

Debashish Paul

SCALABLE ARCHITECTURES FOR REFERENCE SIGNATURE MATCHING AND UPDATING

20230239547 · 2023-07-27 ·

Methods, apparatus, systems and articles of manufacture are disclosed for scalable architectures for reference signature matching and updating. An example method for scalable architectures for reference signature matching and updating includes accessing site signatures to be compared to reference signatures from a first group of media sources. Determining if a first reference node is an owner of a first one of the site signatures. Comparing a neighborhood of site signatures including the first site signature to reference signatures in a first subset of reference signatures when the first reference node is the owner of the first site signature, the first subset of references signatures stored in a first memory partition associated with the first reference node. Not comparing site signature to reference signatures when the first reference node is not the owner of the first one of the site signatures.

SCALABLE ARCHITECTURES FOR REFERENCE SIGNATURE MATCHING AND UPDATING

20230239547 · 2023-07-27 ·

MEDIA SELECTION

20230004594 · 2023-01-05 ·

Damian Franken Manning

Systems, device and techniques are disclosed for receiving a first taste profile of a first user based on at least a first profile criteria and receiving a second taste profile of a second user based on at least a second profile criteria. An area of interest may be determined based on the first and second taste profiles. A playlist containing one or more media items (e.g., audio, video, text, etc.) may be generated based on the determined area of interest.

MEDIA SELECTION

20230004594 · 2023-01-05 ·

Damian Franken Manning

Mood determination of a collection of media content items

11567985 · 2023-01-31 ·

Apple Inc.

Systems, methods, and computer-readable media for determining at least one valid mood for a collection of media content items of a media library are provided.

Patent classifications

G06F16/683