Patent classifications
H04N21/23418
METHOD OF INSERTING AN OBJECT INTO A SEQUENCE OF IMAGES
The invention relates to a method of inserting an insertion object into a sequence of images. The insertion object may be an image, a video, or a three-dimensional model, which could possibly be animated. Particularly, but not exclusively, the invention relates to the insertion of advertisement images into video, such as videos of sporting events. A method comprises capturing a sequence of images, the sequence of images comprising in order a first image, a second image, and a third image; estimating a first homographic transform from the first image to the third image; deriving a second homographic transform from the first image to the second image based on the first homographic transform; transforming the insertion object using the first homographic transformation to form a first warped insertion image, and inserting the first warped insertion image into the third image of the sequence of images; and transforming the insertion object using the second homographic transformation to form a second warped insertion image, and inserting the second warped insertion image into the second image of the sequence of images.
NON-OCCLUDING VIDEO OVERLAYS
Methods, systems, and computer media provide for identifying exclusion zones in frames of a video, aggregating those exclusion zones for a specified duration or number of frames, defining a inclusion zone within which overlaid content is eligible for inclusion, and providing overlaid content for inclusion in the inclusion zone. The exclusion zones can include regions in which significant features are detected such as text, human features, objects from a selected set of object categories, or moving objects.
IMAGE PROCESSING DEVICE, OPERATION METHOD OF IMAGE PROCESSING DEVICE, AND OPERATION PROGRAM OF IMAGE PROCESSING DEVICE
Provided is an image processing device including at least one processor, in which the processor acquires a first performance evaluation value related to performance of a first imaging apparatus of a first user who is one of a plurality of users, acquires a second performance evaluation value related to performance of a second imaging apparatus of a second user different from the first user among the plurality of users, specifies an image of the second user corresponding to the second imaging apparatus having the second performance evaluation value, which is equal to or larger than the first performance evaluation value, from shared images which are limitedly shared by the plurality of users, and sets a presentation priority of a specific image, which is the specified image, to the first user to be higher than presentation priorities of other shared images.
METHOD FOR DETERMINING VIDEO CODING TEST SEQUENCE, ELECTRONIC DEVICE AND COMPUTER STORAGE MEDIUM
An method for determining a video coding test sequence, an electronic device, and a computer readable storage medium are provided. The method includes: determining a candidate video set including multiple candidate videos corresponding to a target service requirement; classifying the candidate videos by content categories to obtain a target distribution of content categories; clustering the candidate videos by values of a preset coding complexity to obtain multiple video classes; selecting from each of the video classes respectively a target class-representative video such that an actual distribution of content categories is consistent with the target distribution of content categories; and constructing a target video coding test sequence based on the target class-representative videos.
DEEP LEARNING SYSTEM FOR DETERMINING AUDIO RECOMMENDATIONS BASED ON VIDEO CONTENT
Embodiments are disclosed for determining an answer to a query associated with a graphical representation of data. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an unprocessed audio sequence and a request to perform an audio signal processing effect on the unprocessed audio sequence. The one or more embodiments further include analyzing, by a deep encoder, the unprocessed audio sequence to determine parameters for processing the unprocessed audio sequence. The one or more embodiments further include sending the unprocessed audio sequence and the parameters to one or more audio signal processing effects plugins to perform the requested audio signal processing effect using the parameters and outputting a processed audio sequence after processing of the unprocessed audio sequence using the parameters of the one or more audio signal processing effects plugins.
VIDEO PROCESSING OPTIMIZATION AND CONTENT SEARCHING
Techniques are disclosed for automatic scene detection and character extraction. In one example, audiovisual content with video frames, an audio recording, and timing information is received. A score, based on the frame's visual characteristics, is determined for a first frame and subsequent frames. The first frame's score and subsequent frame's scores are compared to determine if the difference between the scores is above a threshold. When the difference in scores is above a threshold, the subsequent frame is classified as a new scene. The audiovisual content is segmented into scenes and textual characters are identified in at least one frame from each scene. The characters are stored and indexed in a searchable database with the timing information for the scene where the characters were identified. The audio recording is transcribed and the transcribed words are stored and indexed in the searchable database with timing information.
Automatic content recognition (ACR) fingerprinting and video encoding
A processing device and method are disclosed to generate automatic content recognition (ACR) fingerprints. A device can include a processing device to execute a capture daemon. The capture daemon may include a capturer to receive media content from a content feed and to generate content frames of the media content. The capture daemon can also include ACR fingerprinter to generate ACR fingerprint information for the content frames. The ACR fingerprinter may communicate the ACR fingerprint information to an ACR server via a first communication channel. The processing device may determine when the ACR fingerprinter fails; cease communicating the ACR fingerprint information to the ACR server; and restart the ACR fingerprinter.
Systems and methods of dynamic optimization of data element utilization according to objectives
Systems and methods are disclosed for optimizing data element usage according to user-defined objectives, comprising receiving a plurality of user-defined objectives associated with a group of data elements; receiving one or more constraints associated with the group of data elements, wherein at least one of the constraints comprises resources apportionable to each data element in the group of data elements; apportioning at least a portion of the resources to each data element in the group of data elements in a manner that meets the one or more constraints; receiving metrics associated with the performance of the group of data elements in meeting the plurality of user-defined objectives; determining an effectiveness of each data element in the group of data elements for meeting the plurality of user-defined objectives; and automatically revising the at least a portion of resources associated with each data element in the group of data elements.
Video recommendation method and device, computer device and storage medium
A video recommendation method is provided, including: inputting a video to a first feature extraction network, performing feature extraction on at least one consecutive video frame in the video, and outputting a video feature of the video; inputting user data of a user to a second feature extraction network, performing feature extraction on the discrete user data, and outputting a user feature of the user; performing feature fusion based on the video feature and the user feature, and obtaining a recommendation probability of recommending the video to the user; and determining, according to the recommendation probability, whether to recommend the video to the user.
System and method for using multimedia content as search queries
There is provided a method for searching a plurality of information sources using a multimedia element, the method may include receiving at least one multimedia element; generating, by a signature generator, for the at least one multimedia element at least one signature that is unidirectional, and yields compression; generating at least one textual search query using the at least one signature; wherein the generating of the textual search query comprises: (a) searching for at least one matching stored signature that matches one or more of the at least one signature; and (b) using a mapping between stored signatures and textual search queries, selecting at least one textual search query mapped to at least one matching stored signature; searching the plurality of information sources using the at least one textual search query; and causing a display of search results retrieved from the plurality of information sources.