H04N21/43072

System, method, and program product for generating graphical video clip representations associated with video clips correlated to electronic audio files
09848228 · 2017-12-19 · ·

Systems, methods, and program products for matching electronic audio files (such as songs) to associated electronic video work excerpts or electronic video clips from movies, televisions shows or advertisements in accordance with one or more sync licenses and generating and providing graphical representations of such video clips are disclosed.

Video Processing Method and Electronic Device
20230197115 · 2023-06-22 ·

A first audio timestamp of first audio is corrected based on a first latency corresponding to the first audio, to correct a correspondence between the first audio timestamp, the first audio, and a first image. In this way, a stored correspondence between the first image and the first audio is consistent with a correspondence between a picture corresponding to the first image and a sound corresponding to the first audio, thereby implementing audio and image synchronization.

SYSTEMS AND METHODS FOR PROVIDING OPTIMIZED TIME SCALES AND ACCURATE PRESENTATION TIME STAMPS

The disclosed computer-implemented method includes determining, for multiple different media items, a current time scale at which the media items are encoded for distribution, where at least two of the media items are encoded at different frame rates. The method then includes identifying, for the media items, a unified time scale that provides a constant frame interval for each of the media items. The method also includes changing at least one of the media items from the current time scale to the identified unified time scale to provide a constant frame interval for the changed media item(s). Various other methods, systems, and computer-readable media are also disclosed.

SYSTEMS AND METHODS FOR MATCHING AUDIO TO VIDEO PUNCHOUT
20230199243 · 2023-06-22 ·

An image capture device may capture multiple audio content during capture of visual content. A viewing window for the visual content and rotational position of the image capture device during capture of the visual content may be used to generate modified audio content from the multiple audio content. The modified audio content may provide sound for playback of a punchout of the visual content using the viewing window.

RECEIVING METHOD, RECEIVING DEVICE, AND TRANSMISSION AND RECEPTION SYSTEM
20170359611 · 2017-12-14 ·

A receiving method of receiving a first data unit in which data making up an encoded stream is stored and the first data unit stores a plurality of second data units. The receiving method includes: receiving the first data unit, first time information indicating a presentation time of the first data unit, second time information indicating, together with the first time information, a presentation time or a decoding time of each of the plurality of second data units, and identification information; calculating the presentation time or the decoding time of each of the plurality of second data units using the first time information and the second time information; and correcting the presentation time or the decoding time of each of the plurality of second data units based on the identification information.

Data processor and data processing method

The present invention relates to a data processor and data processing method that facilitate properly processing a stream. An input stream is formed by a plurality of packets. Each of the packets of the input stream is distributed to one of a plurality of channels and null packets (NP) are distributed to the other channels. This divides the input stream into divided streams on a plurality of channels including the packets of the input stream at a predetermined density. The present invention can be used, for example, for a channel bonding (CB) technique in which an input stream is divided into a plurality of channels and transmitted.

Audiovisual collaboration system and method with latency management for wide-area broadcast and social media-type user interface mechanics

Techniques have been developed to facilitate the livestreaming of group audiovisual performances. Audiovisual performances including vocal music are captured and coordinated with performances of other users in ways that can create compelling user and listener experiences. For example, in some cases or embodiments, duets with a host performer may be supported in a sing-with-the-artist style audiovisual livestream in which aspiring vocalists request or queue particular songs for a live radio show entertainment format. The developed techniques provide a communications latency-tolerant mechanism for synchronizing vocal performances captured at geographically-separated devices (e.g., at globally-distributed, but network-connected mobile phones or tablets or at audiovisual capture devices geographically separated from a live studio).

Quality of Media Synchronization

The play-out of a media stream by a play-out device may be synchronized by a synchronization subsystem with a reference play-out, thereby obtaining a degree of synchronicity with the reference play-out. To address a structural deficiency in the obtaining of the synchronicity with the reference play-out, the play-out device may report a quality of sync metric to a metric subsystem, which may then be analysed to identify the structural deficiency. A corrective action may then be scheduled to address the structural deficiency. Examples of corrective actions include re-clustering of play-out devices, adjusting priority and/or bandwidth in the streaming, etc. As such, the quality of experience of one or more users experiencing the play-out may be improved.

Systems and methods for generating a video clip and associated closed-captioning data

Disclosed herein are systems and methods for generating a video clip and associated closed-captioning (CC) data. An example method involves accessing a first video clip demarcated into frames; accessing CC data demarcated into CC blocks, identifying a starting frame from among the frames; determining a first set of frames that are within a range of the starting frame; determining a first set of CC blocks that correlate to the first set of frames; receiving a selection of a starting position from among the first set of CC blocks; identifying an ending frame among the frames; using the ending frame to identify an ending position; and generating a second video clip and associated CC data, wherein the second video clip includes the frames spanning from the starting frame to the ending frame, and wherein the generated CC data includes the CC blocks spanning from the starting position to the ending position.

Delivery of synchronised soundtracks for electronic media content

A method and system for streaming a soundtrack from a server to a remote user device for a reader of electronic media content. The soundtrack is defined by multiple audio regions. Each audio region defined by an audio track for playback in the audio region, a start position in the electronic media content corresponding to where the playback of the audio region is to begin, and a stop position in the electronic media content corresponding to where the playback of the audio region is to cease. The streaming of the soundtrack is based on control data generated by the remote user device.