Patent classifications
G06F16/432
SYSTEMS AND METHODS FOR LEVERAGING ACOUSTIC INFORMATION OF VOICE QUERIES
The methods and systems described herein leveraging acoustic features of a user to generate and present a personalized content to a user. In one example, the method receives a voice query and determines that the query refers to either a first content item or a second content item. The first content item is associated with a first type assigned with a first score and the second content item is associated with a second entity type assigned with a second score. The method also determines whether the query is from the second entity type. The method ranks the first and the second content items based on this determination and generates for presentation of the first and the second content items based on the ranking. The method also changes the first or the second scores based on this determination and selects one of the first or the second content item for presentation.
System and method for augmenting element records associated with the elements of a distributed computing environment with user-defined content
A distributed computing environment data store management system includes a computer-based system for identifying a subset of element records in a data store associated with the elements of a distributed computing environment, receiving at least one user-defined data element from a user interface. Using the user-defined data element, the system adds the user-defined data element to each of the subset of element records, and stores each of the subset of element records and their associated user-defined data in the database.
ACCESSIBLE MULTIMEDIA CONTENT
A method of generating accessible content is described. Embodiments of the method identifies a plurality of channels for a multimedia communication session, generate a master timeline for the communication session, wherein the master timeline comprises a chronological ordering of events from each of the channels, and wherein each of the events is associated with event-specific audio data, and present the multimedia communication session to a user to enable the user to transition among the channels based on the master timeline.
Generating personalized clusters of multimedia content elements based on user interests
A system and method for generating personalized multimedia content element clusters. The method includes determining, based on at least one interest, at least one personalized concept, wherein each personalized concept represents one of the at least one user interest; obtaining at least one multimedia content element related to a user; generating at least one signature for the at least one multimedia content element, each generated signature representing at least a portion of the at least one multimedia content element; determining, based on the generated at least one signature, at least one multimedia content element cluster, wherein each cluster includes a plurality of multimedia content elements sharing a common concept of the at least one personalized concept; and creating at least one personalized multimedia content element cluster by adding, to each determined cluster, at least one of the at least one multimedia content element sharing the common concept of the cluster.
Automatic camera angle switching in response to low noise audio to create combined audiovisual file
A system and method are provided for automatically concatenating two or more audiovisual clips containing video input from multiple cameras, and producing a combined audiovisual file containing video that switches between the two video inputs. In some examples, two video inputs and an audio input are recorded synchronously and are synchronized. The audio input can be sampled to locate low-noise audio events. The audiovisual file contains video that switches between two or more camera angles at the low-noise audio events. In one aspect, pauses are automatically removed from the audiovisual files. In another aspect, the system detects switch-initiating events, and switches between camera angles in response to detecting a switch-initiating event.
SYSTEMS AND METHODS FOR SELECTING IMAGES FOR A MEDIA ITEM
A server system obtains a collection of images, each image in the collection of images being associated with a first set of text descriptors. The server system obtains a media item being associated with a second set of text descriptors. The server system selects a subset of the collection of images, including: selecting an initial subset of the collection of images, wherein the initial subset of the collection of images consists of images that share a text descriptor with the media item; obtaining a set of preferences for a user of the media-providing service; and selecting the subset of the collection of images from the initial subset of the collection of images based on the set of preferences for the user of the media-providing service. The server system concurrently presents: a respective image of the subset of the collection of images; and the media item.
METHODS AND SYSTEMS FOR PROVIDING AUDIOVISUAL MEDIA ITEMS
The various embodiments described herein include methods and systems for providing audiovisual media items. In one aspect, a method performed at a client device includes: (1) receiving one or more natural language inputs from a user; (2) identifying audio files by extracting one or more commands from the natural language inputs; (3) receiving one or more second natural language inputs from the user; (4) identifying visual media files by extracting one or more commands from the second natural language inputs; (5) obtaining a request to generate the media item, the media item corresponding to the visual media files and the audio files; and (6) in response to obtaining the request, sending, to a server system, a creation request to create the media item, the creation request including information identifying the audio files and the visual media files.
INTERACTIVE INFORMATION PROCESSING METHOD, DEVICE AND MEDIUM
Disclosed are an interactive information processing method, an electronic device and a storage medium. The method includes establishing a position correspondence between a display text generated based on a multimedia data stream and the multimedia data stream; and presenting the display text and the multimedia data stream corresponding to the display text based on the position correspondence.
Presentation Assistance Device for Calling Attention to Words that are Forbidden to Speak
To provide a presentation assistance device that can display keywords related to presentation materials and call attention by displaying an alert when words that are forbidden to speak are spoken, A presentation assistance device 1 comprises: a presentation material storage means 3; a keyword storage means 5 which stores a plurality of keywords related to presentation materials; a related word storage means 7 which stores one or a plurality of related words for each of the plurality of keywords; an NG word storage means 9 which stores one or a plurality of NG words for each of the plurality of keywords; a voice recognition means 11; a term determination means 15 which determines whether a voice recognition term corresponds to a related word or an NG word; and a keyword output means 17 which when the voice recognition is a related word, outputs a keyword related to the related word, and when the voice recognition term is an NG word, outputs an alert and a keyword related to the NG word.
Adapting search query processing according to locally detected video content consumption
A process adapts user-initiated search queries. The process executes at a client device with a microphone. The process downloads audio fingerprints from a remote server for a plurality of video programs, and downloads information that correlates the audio fingerprint to the video programs. The audio fingerprints are preselected according to relevancy criteria, including stored user preferences and prior search queries by the user. The audio fingerprints and correlating information are stored locally. The process detects ambient sound using the microphone and computes one or more sample audio fingerprints from the detected ambient sound. The process matches a sample audio fingerprint to a locally stored audio fingerprint and uses the correlating information to identify a first video program corresponding to the matched sample audio fingerprint. The process then receives user input to initiate a search query. The process provides auto-complete suggestions for the search query based on the first video program.