H04N21/8451

Operating point entity group signaling in coded video
11671627 · 2023-06-06

Systems, methods and apparatus for generating or parsing a visual media file according to a file format include performing a conversion between visual media data and a visual media file that stores a bitstream of the visual media data according to a format rule. The visual media file stores multiple tracks that belong to an entity group of a specific type. The format rule specifies that, responsive to the multiple tracks having a track reference of a particular type to a group identifier, the multiple tracks either (A) omit a sample group of a specific type or (B) carry the sample group of the specific type such that the information in the sample group is consistent with that in the entity group of the specific type.
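
The format rule above can be pictured as a validation pass over the tracks. The following is a minimal sketch, not the patented method: the `Track` and `EntityGroup` classes, their field names, and the choice to model "consistent" as plain equality of two dictionaries are all hypothetical, since the abstract does not name the track reference type, the sample group type, or the information they carry.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EntityGroup:
    group_id: int
    info: dict  # hypothetical: information carried by the entity group


@dataclass
class Track:
    track_id: int
    # Hypothetical: group identifier targeted by the track reference, if any.
    ref_group_id: Optional[int] = None
    # Hypothetical: sample group information, or None when the group is omitted.
    sample_group_info: Optional[dict] = None


def satisfies_format_rule(tracks, group):
    """Return True if every track referencing the group obeys option (A) or (B)."""
    for track in tracks:
        if track.ref_group_id != group.group_id:
            continue  # the rule only concerns tracks that reference this group
        if track.sample_group_info is None:
            continue  # option (A): the sample group is omitted
        if track.sample_group_info != group.info:
            return False  # option (B) violated: sample group disagrees
    return True
```

A writer could run such a check before finalizing a file, and a parser could use it to reject inconsistent input.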

Information processing device, information processing method, playing device, playing method, and program
09781402 · 2017-10-03

The present invention relates to an information processing device, information processing method, playing device, playing method, and program that enable appropriate playing of stereo image (3D image) content. In the case that 3D playing using Base view video and D1 view video is instructed, “10000.ilvt” (ref_to_D1-B_interleaved_file_name) described in interleaved_file_info ( ) of the PlayList file is specified, and Base view video chunks and D1 view video chunks are read out from an optical disc through a UDF file system. The Base view video chunks and D1 view video chunks that are read out are supplied to a decoder unit and played. The present invention is applicable to a playing device compatible with the BD-ROM standard.

FRAMEWORK FOR EMBEDDING DATA IN ENCODED VIDEO
20170251283 · 2017-08-31

In one embodiment, a video stream is received, and a spatially distinct region of a frame in the received video stream to be modified is identified, the spatially distinct region of the frame being encoded separately from any other region in the frame. A segment of the spatially distinct region of the frame to be modified is extracted. The extracted segment of the spatially distinct region is modified. The extracted segment of the spatially distinct region is encoded into a single-segment spatially distinct region. A network abstraction layer (NAL) header is associated with the encoded single-segment spatially distinct region. The encoded single-segment spatially distinct region and its associated NAL header are inserted into the received video stream following the identified spatially distinct region of the frame to be modified. Related methods, systems and apparatus are also described.

METHOD, DEVICE, AND COMPUTER PROGRAM FOR ENCAPSULATING PARTITIONED TIMED MEDIA DATA

The invention relates to encapsulating tiled timed media data in a server and providing a timed media data bit-stream from encapsulated tiled timed media in a client device. The tiled timed media data comprises timed samples and each timed sample comprises a plurality of spatial subsamples. After having selected at least one spatial subsample from amongst the plurality of spatial subsamples of one of the timed samples, one tile track comprising the selected spatial subsample and one corresponding spatial subsample of each of the other timed samples is created for each selected spatial subsample. One composite track comprising at least one extractor identifying at least one of the created tile tracks is created. Each created track is independently encapsulated in at least one media segment file.
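
The track construction described above can be sketched in a few lines. This is only an illustration under simplifying assumptions: each timed sample is modeled as a list of spatial subsamples indexed by tile position, and an extractor is reduced to a dictionary that names the tile track it points to. The function name `build_tracks` and the `extractor_for` key are hypothetical.

```python
def build_tracks(samples, selected_tiles):
    """Build one tile track per selected tile, plus one composite track.

    samples: list of timed samples, each a list of spatial subsamples.
    selected_tiles: tile positions chosen from one of the timed samples.
    """
    # Each tile track gathers the subsample at the same position in every sample.
    tile_tracks = {
        tile: [sample[tile] for sample in samples] for tile in selected_tiles
    }
    # The composite track holds extractors that identify the tile tracks.
    composite_track = [{"extractor_for": tile} for tile in selected_tiles]
    return tile_tracks, composite_track
```

In the abstract, each resulting track is then encapsulated independently in at least one media segment file; that step is left out of the sketch.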

NON-TRANSFORM CODING

Techniques for selectively transforming one or more coding units when coding video content are described herein. The techniques may include determining whether or not to transform a particular coding unit. The determination may be based on a difference in pixel values of the particular coding unit and/or one or more predefined rate-distortion constraints. When it is determined to not perform a transform, the particular coding unit may be coded without transforming the particular coding unit.
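
One way to picture the per-unit decision is a threshold on the spread of pixel values. The sketch below is an assumption, not the claimed technique: the abstract says the decision may be based on a difference in pixel values but gives neither the measure nor the direction, so the max-minus-min spread, the `diff_threshold` parameter, and the rule "large spread means skip the transform" are all hypothetical. Rate-distortion constraints are not modeled.

```python
def should_skip_transform(block, diff_threshold):
    """Decide whether to code a unit without a transform.

    block: 2D list of pixel values for one coding unit.
    Assumption: a pixel-value spread above diff_threshold means skip.
    """
    pixels = [p for row in block for p in row]
    spread = max(pixels) - min(pixels)
    return spread > diff_threshold
```

An encoder built this way would evaluate each coding unit independently and apply the transform only where the function returns False.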

Broadcast receiver and video data processing method thereof
09736452 · 2017-08-15

A broadcast receiver and a method for processing video data are disclosed. The method for controlling a three dimensional (3D) video display output of a broadcast receiver includes receiving a broadcast signal including a video stream, wherein the video stream includes a plurality of video stream sections having different view points, acquiring view point information indicating corresponding view points of the video stream sections, and controlling a three dimensional (3D) video display output of the video stream according to the obtained view point information.

Full random access from clean random access pictures in video coding
09736476 · 2017-08-15

Techniques of this disclosure provide an indication of whether performing random access from a particular access unit in a bitstream requires fetching of parameter sets from previous access units. A clean random access (CRA) picture can be positioned at any point within a coded video sequence and does not clean a decoded picture buffer (DPB) of a video decoder. In order to perform random access decoding from the CRA picture, a video decoder may need to fetch one or more parameter sets included in unavailable access units that precede the CRA picture. The techniques provide an indication, for each CRA picture, that indicates whether parameter sets included in previous access units are needed to perform random access from the picture. When no parameter sets from previous access units are needed for random access from a particular CRA picture, a video decoder may determine to perform random access from that picture.
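
A decoder-side use of the per-picture indication might look like the sketch below. It is illustrative only: the `AccessUnit` class, its field names, and the policy of choosing the first suitable CRA picture at or after a seek target are hypothetical and are not taken from the disclosure or from any codec API.

```python
from dataclasses import dataclass


@dataclass
class AccessUnit:
    index: int
    is_cra: bool = False
    # Hypothetical flag mirroring the indication described in the abstract.
    needs_prior_parameter_sets: bool = False


def pick_random_access_point(units, target_index):
    """Return the index of the first CRA picture at or after target_index
    that needs no parameter sets from earlier access units, or None."""
    for unit in units:
        if unit.index < target_index:
            continue
        if unit.is_cra and not unit.needs_prior_parameter_sets:
            return unit.index
    return None
```

A decoder that can fetch earlier parameter sets out of band could relax the second condition; the sketch takes the conservative path.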

Fragment server directed device fragment caching
11252453 · 2022-02-15

The media stream delivery system encodes and fragments media streams into numerous media stream fragments maintained on fragment servers. Devices obtain fragments to reconstruct media streams including live real-time media streams for playback on the devices. A device may perform caching of media stream fragments so that particular fragments need not be accessed again from a fragment server. A fragment server or even a content provider can analyze and monitor characteristics of media streams, viewing behavior, content popularity, etc., to identify fragments for caching at the playback devices. Caching indicators along with time period indicators may be included in the media stream fragments.

Multiview video encoding method and device, and multiview video decoding method and device

A multi-view video encoding method multiplexes an encoded multi-view image by a predetermined data unit and adds three fields to a header of the predetermined data unit: a scalable extension type (SET) indicating whether the data included in the data unit relates to a basic view image or an additional view image, a depth flag indicating whether the data relates to a texture image or a depth map image, and a view ID of the data.
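
The three header fields can be pictured as a small bit field. The sketch below assumes a one-byte layout with a 1-bit SET, a 1-bit depth flag, and a 6-bit view ID; these widths, the field order, and the function names are hypothetical, since the abstract specifies none of them.

```python
# Hypothetical field widths; the abstract does not specify them.
SET_BITS, DEPTH_BITS, VIEW_ID_BITS = 1, 1, 6


def pack_header(set_type, depth_flag, view_id):
    """Pack SET, depth flag, and view ID into one header byte (assumed layout)."""
    fields = (
        (set_type, SET_BITS),
        (depth_flag, DEPTH_BITS),
        (view_id, VIEW_ID_BITS),
    )
    for value, bits in fields:
        if not 0 <= value < (1 << bits):
            raise ValueError("field value out of range")
    return (
        (set_type << (DEPTH_BITS + VIEW_ID_BITS))
        | (depth_flag << VIEW_ID_BITS)
        | view_id
    )


def unpack_header(header):
    """Recover (set_type, depth_flag, view_id) from a packed header byte."""
    view_id = header & ((1 << VIEW_ID_BITS) - 1)
    depth_flag = (header >> VIEW_ID_BITS) & ((1 << DEPTH_BITS) - 1)
    set_type = (header >> (DEPTH_BITS + VIEW_ID_BITS)) & ((1 << SET_BITS) - 1)
    return set_type, depth_flag, view_id
```

With this layout, a demultiplexer can route a data unit by reading a single byte, without parsing the payload.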

Multi-layer video stream decoding

Multi-layer dependencies are signaled in an efficient way for a multi-view video stream. Coding or decoding relationship information defining coding or decoding dependencies is represented in bit-efficient syntax code, preferably through usage of hierarchical layer dependencies using layer indices for representing layer dependencies.