Patent classifications
G11B27/3081
Techniques for selecting resolutions for encoding different shot sequences
In various embodiments, an encoding optimization application encodes a source video sequence. The encoding optimization application partitions a source video sequence associated with a media title into multiple subsequences that include a first subsequence and a second subsequence. The encoding optimization application then generates a first encoded subsequence based on the first subsequence and a first resolution. The encoding optimization application generates a second encoded subsequence based on the second subsequence and a second resolution. Notably, the second resolution is higher than the first resolution. The encoding optimization application then aggregates the first encoded subsequence and the second encoded subsequence to generate an encoded video sequence. At least a portion of the encoded video sequence is subsequently streamed to an endpoint device during a playback of the media title.
SCENE AND ACTIVITY IDENTIFICATION IN VIDEO SUMMARY GENERATION
Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video clips selected from among sets of candidate video clips. Best scenes can also be identified by receiving an indication of an event of interest within video from a user during the capture of the video. Metadata patterns representing activities identified within video clips can be identified within other videos, which can subsequently be associated with the identified activities.
Layered 3-D images for augmented reality processing
A method for creating and storing a captured image and associated spatial data and augmented reality (AR) data in a file that allows subsequent manipulation and processing of AR objects is disclosed. In embodiments, one or more frames are extracted from a video stream, along with spatial information about the camera capturing the video stream. The one or more frames are analyzed in conjunction with the spatial information to calculate a point cloud of depth data. The one or more frames are stored in a file in a first layer, and the point cloud is stored in the file in a second layer. In some embodiments, one or more AR objects are stored in a third layer.
SYSTEM AND METHOD FOR SERVING INTERACTIVE ADVERTISING CONTENT WITH RESPONSIVE ANIMATIONS
One variation of a method for serving interactive advertisement content includes, at a visual element rendered within a window of a computing device: loading an interactive advertisement, including a digital advertisement and a mask overlaid the digital advertisement, into the visual element, the mask including a base layer and an aperture defining a second opacity less than a first opacity of the base layer; in response to a scroll event that moves the visual element a first position within the window, rendering the interactive advertisement within the visual element; and, in response to a scroll event that moves the visual element to a second position within the window, transforming the mask according to a sequence of transformations at a rate proportional to a scroll rate of the scroll event, the sequence of transformations corresponding to a sequence of vertical positions of the visual element within the window.
Scene and activity identification in video summary generation
Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video clips selected from among sets of candidate video clips. Best scenes can also be identified by receiving an indication of an event of interest within video from a user during the capture of the video. Metadata patterns representing activities identified within video clips can be identified within other videos, which can subsequently be associated with the identified activities.
Techniques for positioning key frames within encoded video sequences
In various embodiments, an encoding optimization application positions key frames within encoded video sequences based on shot changes. The encoding optimization application determines key frame location(s) based on shot change(s) included in a source video sequence associated with a media title. Each key frame location is associated with a different frame included in the source video sequence. For each of the key frame location(s), the encoding optimization application configures an encoding application to encode a frame of video content located at the key frame location as a key frame when performing encoding operations. Subsequently, the encoding optimization application causes the encoding application to perform encoding operation(s) on the source video sequence to generate a first encoded video sequence. During playback, the media title is switchable between a decoded version of the first encoded video sequence and a decoded version of a second encoded video sequence at the key frame location(s).
Methods for serving interactive content to a user
One variation of a method for serving interactive content to a user includes, at a visual element inserted into a document accessed by a computing device: loading a first frame from a digital video; in response to a scroll-down event that moves the visual element upward from a bottom of a window rendered on the computing device toward a top of the window, seeking from the first frame through a subset of frames in the digital video in a first direction at a rate corresponding to a scroll rate of the scroll-down event, the subset of frames spanning a duration of the digital video corresponding to a length of the scroll-down event; and, in response to termination of the scroll-down event with the visual element remaining in view within the window, playing the digital video forward from a last frame in the subset of frames in the digital video.
AGGREGATION OF RELATED MEDIA CONTENT
Systems and methods for media aggregation are disclosed herein. The system includes a media system that can transform media items into one aggregated media item. A synchronization component synchronizes media items with respect to time. The synchronized media items can be analyzed and transformed into an aggregated media item for storage and/or display. In one implementation, the aggregated media item is capable of being displayed in multiple ways to create an enhanced and customizable viewing and/or listening experience.
Method and device for displaying target object
Embodiments of the present application provide a method and apparatus for displaying objects. In the method, video frames that contain objects of predefined types are detected in pieces of video data to be displayed in a first display region. If such video frames are detected, the objects of the predefined types are extracted from the detected video frames. The extracted objects are then displayed in sub-regions of a second display region that is different from the first display region, with each object being displayed in one of the sub-regions. In this way, the objects that a user is interested in can be extracted from a video, and displayed in a way to be viewed with ease.
Video summarization using selected characteristics
Devices, systems and methods are disclosed for improving a playback of video data and generation of a video summary. For example, annotation data may be generated for individual video frames included in the video data to indicate content present in the individual video frames, such as faces, objects, pets, speech or the like. A video summary may be determined by calculating a priority metric for individual video frames based on the annotation data. In response to input indicating a face and a period of time, a video summary can be generated including video segments focused on the face within the period of time. The video summary may be directed to multiple faces and/or objects based on the annotation data.