G06F16/7867

USING INTERPOLATION TO GENERATE A VIDEO FROM STATIC IMAGES

A media application selects, from a collection of images associated with a user account, candidate pairs of images, where each pair includes a first static image and a second static image from the user account. The media application applies a filter to select a particular pair of images from the candidate pairs of images. The media application generates, using an image interpolator, one or more intermediate images based on the particular pair of images. The media application generates a video that includes three or more frames arranged in a sequence, where a first frame of the sequence is the first static image, a last frame of the sequence is the second static image, and each of the one or more intermediate images is a corresponding intermediate frame of the sequence between the first frame and the last frame.

Systems and methods for controlling quality of content

Systems and methods for controlling quality of content is provided. A confidence tool of an automated quality control system may receive a request to analyze a tag indicating content to be presented by a content presentation service. The tag may be indicative of a link to the content and a tracking pixel associated with the content. The confidence tool may determine whether the tag meets criteria (e.g., pixel whitelisting criteria, specification of a content presentation service). The confidence tool may notify a user whether the tag meets the criteria to prevent problematic content from being presented by the content presentation service.

Methods, systems, and apparatuses to respond to voice requests to play desired video clips in streamed media based on matched close caption and sub-title text

Methods, Systems, and Apparatuses are described to implement voice search in media content for requesting media content of a video clip of a scene contained in the media content streamed to the client device; for capturing the voice request for the media content of the video clip to display at the client device wherein the streamed media content is a selected video streamed from a video source; for applying a NLP solution to convert the voice request to text for matching to a set of one or more words contained in at least close caption text of the selected video; for associating matched words to close caption text with a start index and an end index of the video clip contained in the selected video; and for streaming the video clip to the client device based on the start index and the end index associated with matched closed caption text.

PRODUCTION SYSTEM

An objective of the present invention is to provide a production system that can easily reference a video pertaining to a specific action or operation of a device. A production system (10) is provided with a log generation unit (51a), a video generation unit (51b), a recording unit (51c), and a control unit (51d). The log generation unit (51a) generates log information pertaining to an action of a device (20) implementing a process and/or log information pertaining to an operation of a worker operating the device (20). The video generation unit (51b) generates video data of an action of the device (20) and/or video data of an operation by the worker. The recording unit (51c) records the log information generated by the log generation unit (51a), in association with the video data generated by the video generation unit (51b). The control unit (51d) controls the recording unit (51c), and causes the recording unit (51c) to record the video data of a time zone in which the log information is used as a reference point, in association with the log information.

Manufacture of NFTs from film libraries

Methods and processes for manufacture of an image product from a digital image. An object in the digital image is detected and recognized. Object metadata is assigned to the object, the object metadata linking sound to the object in the digital image which produced the sound. At least one cryptographic hash of the object metadata is generated, and the hash is written to a node of a transaction processing network.

Method and system of pushing video viewfinder

The present disclosure describes techniques of pushing information associated with the at least one location that is associated with a video. The disclosed techniques comprises obtaining video data, wherein the video data comprise a plurality frames of a video and information associated with the video; determining at least one location associated with at least one frame among the plurality of frames of the video based on comparing the video data with data included in a database; determining information associated with the at least one location; and pushing the information associated with the at least one location to a first computing device based on a time point of playing the at least one frame among the plurality of frames of the video.

Text-driven video synthesis with phonetic dictionary

Presented herein are novel approaches to synthesize video of the speech from text. In a training phase, embodiments build a phoneme-pose dictionary and train a generative neural network model using a generative adversarial network (GAN) to generate video from interpolated phoneme poses. In deployment, the trained generative neural network in conjunction with the phoneme-pose dictionary convert an input text into a video of a person speaking the words of the input text. Compared to audio-driven video generation approaches, the embodiments herein have a number of advantages: 1) they only need a fraction of the training data used by an audio-driven approach; 2) they are more flexible and not subject to vulnerability due to speaker variation; and 3) they significantly reduce the preprocessing, training, and inference times.

SYSTEMS, METHODS, AND MEDIA FOR MEDIA SESSION CONCURRENCY MANAGEMENT WITH RECURRING LICENSE RENEWALS

The disclosed subject matter relates to systems, methods, and media for media session concurrency management with recurring license renewals. More particularly, the disclosed subject matter relates to using recurring license renewals for concurrent playback detection and concurrency limit enforcement for video delivery services and managing server resources for handling such recurring license renewals.

Adaptive search results for multimedia search queries

Certain embodiments involve adaptive search results for multimedia search queries to provide dynamic previews. For instance, a computing system receives a search query that includes a keyword. The computing system identifies, based on the search query, a video file having keyframes with content tags that match the search query. The computing system determines matching scores for respective keyframes of the identified video file. The computing system generates a dynamic preview from at least two keyframes having the highest matching scores.

LIVESTREAM VIDEO IDENTIFICATION

A computing system is described herein, where the computing system is configured to perform a search over a computer-readable index based upon a query for a user. The computer-readable index includes an identifier for a livestream video that is currently being livestreamed by way of a livestreaming service and values for respective attributes of the livestream video. The values for the respective attributes are updated as content of the livestream video alters over time. The livestream video is identified from amongst several livestream videos based upon the search, where the video is identified due to a set of values specified in the query corresponding to the values for the respective attributes in the computer-readable index. Upon the livestream video being identified, an identifier of the livestream video is transmitted to a client computing device of the user.