G06F16/785

TERMINAL AND APPARATUS FOR PROVIDING SEARCH INFORMATION BASED ON COLOR INFORMATION
20200285671 · 2020-09-10 ·

The present disclosure relates to a terminal, an apparatus, and a method for providing search information based on color information, the method including, acquiring a search keyword; selecting one or more colors, correlated with the acquired keyword, based on a pre-stored keyword-color information correlation; searching for an object that matches with the selected color; and configuring an interface page information including information about the searched object.

Decomposition of a video stream into salient fragments

The disclosure includes a system and method for decomposing a video to salient fragments and synthesizing a video composition based on the salient fragments. A video decomposition application extracts non-salient portions of a video, extracts a plurality of salient fragments of the video, builds a database of the plurality of salient fragments, receives a query, retrieves, from the database of the plurality of salient fragments, a set of salient fragments based on the query, and synthesizes a video composition based on the set of salient fragments and the non-salient portions of the video.

Method and apparatus for delocalized management of video data

A method for managing video data in a storage system (10), the video data comprising frames, and a storage system (10) configured to perform the method are described. The storage system (10) comprises a first input (11) configured to receive (1) one or more frames for storage. A storage more frames unit (12) stores (2) the one or more frames, whereas a unique identifier generator (13) associates (3) a unique identifier to each of the one or more frames. The storage system (10) further comprises a processor (14) configured to generate (4) a modified frame by processing one or more frames or to receive a modified frame generated externally. The unique identifier generator (13) associates (5) a derived unique identifier to such a modified frame, which comprises references to the unique identifiers of the one or more processed frames.

Subsumption architecture for processing fragments of a video stream

The disclosure includes a system and method for distributing video segments of a video to one or more brokers based on topics and storing the video segments in a distributed commit log associated with the topics. A video processing application decomposes a video into fragments, groups the fragments into topics based on identifiers associated with the fragments, breaks the fragments into a sequence of segments, distributes the sequence of segments to one or more brokers based on the topics, and stores, by the one or more brokers, the sequence of segments associated with a topic in a distributed commit log while preserving a sequence order of the sequence of segments.

Methods, systems, and media for detecting a presentation of media content on a display device
10679542 · 2020-06-09 · ·

Methods, systems, and media for detecting a presentation of media content on a display device are provided. In accordance with some implementations, methods for detecting a presentation of media content on a display device are provided, the methods comprising: detecting, using a light sensor, light levels in the light sensor's surroundings; generating a signal representing the light levels; detecting, using a hardware processor, at least one variation in light levels indicative of a presentation of a video scene based on the signal; detecting at least one variation in light levels indicative of a scene change subsequent to the video scene based on the signal; and determining that media content is being presented on a display device in response to detecting the variation in light levels indicative of the presentation of the video scene and the variation in light levels indicative of the scene change.

IDENTIFYING AND CATEGORIZING CONTEXTUAL DATA FOR MEDIA

Systems and methods for identifying and associating contextual metadata across related media.

Systems and methods for identifying matching content
10650241 · 2020-05-12 · ·

Systems, methods, and non-transitory computer-readable media can generate at least one fingerprint based on a set of frames corresponding to a test content item, generate a set of distorted fingerprints using at least a portion of the fingerprint, and determine one or more reference content items using the set of distorted fingerprints, wherein the test content item is evaluated against at least one reference content item to identify matching content.

Partitioning videos

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for partitioning videos. In one aspect, a method includes obtaining a partition of a video into one or more shots. Features are generated for each shot, including visual features and audio features. The generated features for each shot are provided as input to a partitioning neural network that is configured to process the generated features to generate a partitioning neural network output. The partition of the video into one or more chapters is determined based on the partitioning neural network output, where a chapter is a sequence of consecutive shots that are determined to be taken at one or more locations that are semantically related.

Enriching audio with lighting

A method of generating a lighting effect based on metadata of an audio stream, the method comprising steps of: extracting metadata items from the audio stream; retrieving a first set of one or more images based on the metadata items; controlling a light source to generate a lighting effect based on said first set of one or more images.

METRIC-BASED RECOGNITION, SYSTEMS AND METHODS
20190332884 · 2019-10-31 · ·

Apparatus, methods and systems of object recognition are disclosed. Embodiments of the inventive subject matter generates map-altered image data according to an object-specific metric map, derives a metric-based descriptor set by executing an image analysis algorithm on the map-altered image data, and retrieves digital content associated with a target object as a function of the metric-based descriptor set.