H04N19/29

APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR OMNIDIRECTIONAL VIDEO

There are disclosed various methods, apparatuses and computer program products for video encoding and decoding. In some embodiments the method for video encoding comprises obtaining compressed volumetric video data representing a three-dimensional scene or object (71); capsulating the compressed volumetric video data into a data structure (72); obtaining data of a two-dimensional projection of at least a part of the three-dimensional scene as seen from a certain viewport (73); and including the data of the two-dimensional projection into the data structure (74).

Method and System for Encoding a 3D Scene
20220353530 · 2022-11-03 ·

A computer-implemented method for encoding a scene volume includes: (a) identifying features of a scene volume that are within a camera perspective range with respect to a default camera perspective; (b) converting the identified features into rendered features; and (c) sorting the rendered features into a plurality of scene layers, each including corresponding depth, color, and transparency maps for the respective rendered features. Further, (a), (b), and (c) may be repeated, operating on temporally ordered scene volumes, to produce and output a sequence encoding a video. Corresponding systems and non-transitory computer-readable media are disclosed for encoding a 3D scene and for decoding an encoded 3D scene. Efficient compression, transmission, and playback of video describing a 3D scene can be enabled, including for virtual reality displays with updates based on a changing perspective of a user viewer for variable-perspective playback.

Framework for video conferencing based on face restoration
11659193 · 2023-05-23 · ·

There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining video data, detecting at least one face from at least one frame of the video data, determining a set of facial landmark features of the at least one face from the at least one frame of the video data, and coding the video data at least partly by a neural network based on the determined set of facial landmark features.

Framework for video conferencing based on face restoration
11659193 · 2023-05-23 · ·

There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining video data, detecting at least one face from at least one frame of the video data, determining a set of facial landmark features of the at least one face from the at least one frame of the video data, and coding the video data at least partly by a neural network based on the determined set of facial landmark features.

VIRTUAL REALITY PANORAMIC VIDEO SYSTEM USING SCALABLE VIDEO CODING LAYERS
20170347084 · 2017-11-30 · ·

A virtual reality panoramic video system is described that uses scalable video coding layers. One example includes a buffer to receive a wide field of view video, a region extractor to extract regions from the wide field of view video, and a scalable multi-layer video encoder to encode the extracted regions as separate layers and to combine the layers to form an encoded video.

Decoded picture buffer operations for video coding

A method of decoding video data comprising partitioning a decoded picture buffer (DPB) into a plurality of sub-DPBs, receiving at least one indication of a sub-DPB size for the plurality of sub-DPBs for one or more operation points of a multi-layer video coding process, and allocating memory space for the plurality of sub-DPBs based on the at least one indication.

Decoded picture buffer operations for video coding

A method of decoding video data comprising partitioning a decoded picture buffer (DPB) into a plurality of sub-DPBs, receiving at least one indication of a sub-DPB size for the plurality of sub-DPBs for one or more operation points of a multi-layer video coding process, and allocating memory space for the plurality of sub-DPBs based on the at least one indication.

Support of non-HEVC base layer in HEVC multi-layer extensions
09794558 · 2017-10-17 · ·

An apparatus configured to code video information includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a bitstream. The apparatus further includes a processor in communication with the memory, the processor configured to determine whether a reference layer is included in the bitstream. The processor is further configured to determine, based upon whether the reference layer is included in the bitstream, whether or not to process an indication and to, if the reference layer is included in the bitstream, process, in a video bitstream, the indication. The processor is also configured to code the video information based at least in part on the processed indication.

Support of non-HEVC base layer in HEVC multi-layer extensions
09794558 · 2017-10-17 · ·

An apparatus configured to code video information includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a bitstream. The apparatus further includes a processor in communication with the memory, the processor configured to determine whether a reference layer is included in the bitstream. The processor is further configured to determine, based upon whether the reference layer is included in the bitstream, whether or not to process an indication and to, if the reference layer is included in the bitstream, process, in a video bitstream, the indication. The processor is also configured to code the video information based at least in part on the processed indication.

APPARATUS AND METHODS FOR VIDEO COMPRESSION USING MULTI-RESOLUTION SCALABLE CODING
20170223368 · 2017-08-03 ·

Apparatus and methods for digital video data compression via a scalable, multi-resolution approach. In one embodiment, the video content may be encoded using a multi-resolution and/or multi-quality scalable coding approach that reduces computational and/or energy load on a client device. In one implementation, a low fidelity image is obtained based on a first full resolution image. The low fidelity image may be encoded to obtain a low fidelity bitstream. A second full resolution image may be obtained based on the low fidelity bitstream. A portion of a difference image obtained based on the second full resolution image and the first full resolution may be encoded to obtain a high fidelity bitstream. The low fidelity bitstream and the high fidelity bitstream may be provided to e.g., a receiving device.