Patent classifications
G06F16/432
VOICE COMMAND SYSTEM AND VOICE COMMAND METHOD
A voice command system according to a first disclosure comprises a gateway apparatus having an interface configured to receive a voice command, and a controller configured to perform a registration process of registering a speaker permitted to receive the voice command. The controller is configured to perform an authentication process of rejecting a reception of the voice command when a speaker of the voice command is not registered, and permitting a reception of the voice command when a speaker of the voice command is registered. The controller is configured to perform the authentication process for each voice command.
SYSTEM AND METHOD FOR AUTOMATIC SYNCHRONIZATION OF VIDEO WITH MUSIC, AND GAMING APPLICATIONS RELATED THERETO
A computer system including a server having a processor and a memory, the memory having a video database and a music database, the video database storing at least one video file having a plurality of video file markers, and the music database storing at least one music file having a plurality of music file markers, wherein the server receives and decodes encoded data from computer readable code, identifies and retrieves from the music database a music file based on the decoded data, synchronizes the retrieved music file with one of the video files by aligning the video file markers of the video file with the music file markers for the retrieved music file to produce a synchronized video--music file, and transmits the synchronized video-music file to a display, wherein the video file markers are generated for each video file and the music file markers are generated for each music file.
System and method for identifying social trends
A method and system for identifying social trends are provided. The method includes collecting multimedia content from a plurality of data sources; gathering environmental variables related to the collected multimedia content; extracting visual elements from the collected multimedia content; generating at least one signature for each extracted visual element; generating at least one cluster of visual elements by clustering at least similar signatures generated for the extracted visual elements; correlating environmental variables related to visual elements in the at least one cluster; determining at least one social trend by associating the correlated environmental variables with the at least one cluster.
METHOD OF OBTAINING EVENT INFORMATION, ELECTRONIC DEVICE, AND STORAGE MEDIUM
A method of obtaining an event information, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of knowledge graph and deep learning technologies. A specific implementation solution of the method of obtaining the event information includes: determining, according to a query information in data to be processed, a first key information describing an event; determining, according to multimedia data in the data to be processed, a second key information describing an event, wherein the multimedia data includes data obtained by querying based on the query information; and fusing the first key information and the second key information, so as to obtain an event information of a target event described by the data to be processed.
METHOD OF OBTAINING EVENT INFORMATION, ELECTRONIC DEVICE, AND STORAGE MEDIUM
A method of obtaining an event information, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of knowledge graph and deep learning technologies. A specific implementation solution of the method of obtaining the event information includes: determining, according to a query information in data to be processed, a first key information describing an event; determining, according to multimedia data in the data to be processed, a second key information describing an event, wherein the multimedia data includes data obtained by querying based on the query information; and fusing the first key information and the second key information, so as to obtain an event information of a target event described by the data to be processed.
Behavioral curation of media assets
In some implementations, a computing device may create a semantic mapping that includes identified features that appear in a particular percentage of assets in a subset of assets of a media library. Also, the computing device may analyze assets of the media library using the semantic mapping to generate semantic scores, which may be used to determine a first tier of assets from the media library that rate highest for semantic score out of all assets. The computing device may present at least one of the first tier assets prominently in a user interface when viewing assets of the media library.
Associating a graphical element to media content item collections
Various embodiments provide for associating a collection of media items with a graphical element. For instance, a system can: generate corpus data from a set of features of a collection of media content items; determine a set of candidate graphical elements for the collection of media content items based on the corpus data and further based on a set of first mappings associating at least one graphical element and at least one n-gram; determine a set of prediction scores corresponding to the set of candidate graphical elements based on the set of features; determine a ranking for the set of candidate graphical elements based on the set of prediction stores; select a set of predicted graphical elements, from the set of candidate graphical elements, based on the ranking; and provide the set of predicted graphical elements in association with the collection of media content items.
Media browsing user interface with intelligently selected representative media items
- Graham R. CLARKE ,
- Kevin Aujoulet ,
- Kevin Bessiere ,
- Simon BOVET ,
- Eric M. G. CIRCLAEYS ,
- Lynne DEVINE ,
- Alan C. DYE ,
- Benedikt M. Hirmer ,
- Andreas KARLSSON ,
- Chelsea L. Burnette ,
- Matthieu LUCAS ,
- Behkish J. Manzari ,
- Nicole R. Ryan ,
- William A. Sorrentino, III ,
- Andre SOUZA DOS SANTOS ,
- Gregg SUZUKI ,
- Sergey TATARCHUK ,
- Justin S. Titi
The present disclosure generally relates to navigating a collection of media items. In accordance with one embodiment, in response to receiving an input, a device displays a first view of a collection of media items, including concurrently displaying a representation of a first time period and a representation of a second time period. In accordance with a determination that a current time is associated with a first recurring temporal event: the representation of the first time period includes a first representative media item and the representation of the second time period includes a second representative media item. In accordance with a determination that the current time is associated with a second recurring temporal event, the representation of the first time period includes a third representative media item and the representation of the second time period includes a fourth representative media item.
Analysis of objects of interest in sensor data using deep neural networks
Sensor data captured by one or more sensors may be received at an analysis system. A neural network may be used to detect an object in the sensor data. A plurality of polygons surrounding the object may be generated in one or more subsets of the sensor data. A prediction of a future position of the object may be generated based at least in part on the polygons. One or more commands may be provided to a control system based on the prediction of the future position.
PREDICTIVE QUERY EXECUTION
First audio data associated with a first portion of a voice query (e.g., an incomplete voice query) may be received (e.g., by a device or a server). A first transcript may be determined by a speech recognition engine and based on the first audio data. A plurality of predicted queries may be determined by applying a prediction process to the first transcript. A response for each of the plurality of predicted queries may be determined by processing the plurality of the predicted queries. Second audio data associated with a second portion of the voice query (e.g., a complete voice query) may be received. A second transcript may be determined by the speech recognition engine and based on the second audio data. Based on comparing the second transcript to one of the plurality of predicted queries, a response for the voice query may be returned.