H04N21/85406

CLOUD BASED VISION
20180007393 · 2018-01-04 ·

A method for receiving a real-time video feed of a region of interest includes generating, at a processor of a first device, a request for a real-time video stream of the region of interest. The request indicates a location of the region of interest. The method also includes transmitting the request to one or more other devices via a network to query whether another device is situated to capture a portion of the region of interest. The method also includes receiving the real-time video stream of the region of interest from a second device of the one or more other devices. The second device includes a camera having a field of view that includes at least a portion of the region of interest.

Systems and methods for encoding and playing back video at different frame rates using enhancement layers
11711552 · 2023-07-25 · ·

Systems and methods in accordance with embodiments of this invention provide for encoding and playing back video at different frame rates using enhancement layers. In a number of embodiments, video is encoded using temporal scalability involving the creation of a base layer at a first frame rate and an enhancement layer including additional frames enabling playback at a second higher frame rate. The second higher frame rate can also be referred to as an enhanced frame rate. In a number of embodiments, the base and enhancement layers are stored in one or more container files that contain metadata describing the enhancement layer. Based on the capabilities of a playback device, it can select the particular frame rate at which to playback encoded video.

Image processing apparatus and file generation apparatus

The present technology relates to an image processing apparatus and a file generation apparatus that make it possible to appropriately reproduce a BV content. An image processing apparatus includes: a file acquisition unit that acquires a file having a management region where information for management of a 3D object content is stored and a data region where a track in which streams included in the 3D object content are stored is stored, group information for selection, from a plurality of the streams included in the 3D object content, of the stream appropriate for reproduction of the 3D object content being stored in the management region; and a file processor that selects a plurality of the streams to be used for reproduction of the 3D object content on the basis of the group information. The present technology is applicable to a client apparatus.

V3C video component track alternatives
11711506 · 2023-07-25 · ·

An apparatus includes a communication interface and a processor operably coupled to the communication interface. The communication interface receives a plurality of tracks in a compressed bitstream. The processor identifies an atlas track corresponding to a point cloud compression (PCC) component and identifies a set of first component tracks that is referenced by the atlas track and at least one second component track, each of the at least one second component track is an alternative version of a first component track of the set of first component tracks. The processor also determines which of the set of first component tracks and the at least one second component track are appropriate versions of the PCC component. The processor further decodes the appropriate version of the PCC component from among the set of first component tracks and the at least one second component track.

Decoding capability information storage in video coding
11711518 · 2023-07-25 · ·

Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies that a type of a sample entry determines whether decoding capability information network abstraction layer units are included in either the sample entry of a video track in the visual media file or in a sample of the video track and the sample entry of the video track in the visual media file.

A METHOD AND APPARATUS FOR DECODING A 3D VIDEO

A method and an apparatus for decoding a volumetric video are disclosed. Such a method comprises receiving a data stream representative of a file comprising information for selecting, according to a rendering viewpoint, at least one atlas comprising color and depth data patches associated with a viewpoint in said volumetric video, said color and depth data patches being generated with respect to depth and color reference data acquired from a reference viewpoint in said volumetric video.

Indication of Tiles in a Video Picture

A method includes performing a conversion between a video including a video picture including one or more tiles and a bitstream of the video. The video picture refers to a picture parameter set, and the picture parameter set conforms to a format rule specifying that the picture parameter set includes a list of column widths for N tile columns, where N is an integer. An (N−1)-th tile column exists in the video picture and the (N−1)-th tile column has a width that is equal to an (N−1)-th entry in a list of explicitly included tile column widths plus one number of coding tree blocks.

Methods and systems for content control
11570503 · 2023-01-31 · ·

Methods and system for providing content are disclosed. One method can comprise receiving a request for data at a first time, wherein the data comprises a first time duration, determining a buffer size based upon the first time duration, and determining a playback window based upon the determined buffer size and the first time. The method can also comprise synchronizing presentation of content on a plurality of devices.

Method and apparatus for encapsulation of Motion Picture Experts Group Media Transport assets in International Organization for Standardization Base Media Files

An apparatus includes receive path circuitry configured to receive a Motion Picture Experts Group (MPEG) Media Transport (MMT) container and a processing device configured to identify locations of one or more media fragment units (MFUs) in the MMT container using a hint track within the MMT container. Another apparatus includes transmit path circuitry configured to transmit an MMT container and a processing device configured to identify locations of one or more MFUs in the MMT container using a hint track within the MMT container.

METHOD, DEVICE, AND COMPUTER PROGRAM FOR IMPROVING ENCAPSULATION OF MEDIA CONTENT

According to embodiments, the invention provides a method for encapsulating partitioned timed media data comprising timed samples, comprising in turn subsamples, the timed samples being grouped into groups, the method comprising: obtaining spatial tracks, each spatial track comprising at least one subsample of a first timed sample and one corresponding subsample of the other timed samples, the corresponding subsamples being located at the same spatial position in its own timed sample as the at least one subsample; creating a base track referencing at least some of the spatial tracks, the base track comprising reconstruction instructions, each of the reconstruction instructions being associated with a group of timed samples, enabling generating a portion of a bit-stream from sub-samples of spatial tracks, that belong to a same group of timed samples; and independently encapsulating each of the tracks in a least one media file.