H04N19/31

Multi-model selection for neural network based tools in video coding

A method, computer program, and computer system is provided for video encoding and decoding. Video data including one or more frames is received. One or more quantization parameters associated with the received video data are determined for frame generation or enhancement of a target frame from among the one or more frames. The video data is decoded based on the determined quantization parameters.

Frame-rate scalable video coding

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Frame-rate scalable video coding

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Signaling for addition or removal of layers in scalable video

In one embodiment, a method of signaling individual layers in a transport stream includes: determining a plurality of layers in a transport stream, wherein each layer includes a respective transport stream parameter setting; determining an additional layer for the plurality of layers in the transport stream, wherein the additional layer enhances one or more of the plurality of layers including a base layer and the respective layer parameter settings for the plurality of layers do not take into account the additional layer; and determining an additional transport stream parameter setting for the additional layer, the additional transport stream parameter setting specifying a relationship between the additional layer and at least a portion of the plurality of layers, wherein the additional transport stream parameter setting is used to decode the additional layer and the at least a portion of the plurality of layers.

Signaling for addition or removal of layers in scalable video

In one embodiment, a method of signaling individual layers in a transport stream includes: determining a plurality of layers in a transport stream, wherein each layer includes a respective transport stream parameter setting; determining an additional layer for the plurality of layers in the transport stream, wherein the additional layer enhances one or more of the plurality of layers including a base layer and the respective layer parameter settings for the plurality of layers do not take into account the additional layer; and determining an additional transport stream parameter setting for the additional layer, the additional transport stream parameter setting specifying a relationship between the additional layer and at least a portion of the plurality of layers, wherein the additional transport stream parameter setting is used to decode the additional layer and the at least a portion of the plurality of layers.

METHOD FOR ENCODING AND DECODING VIDEO, AND APPARATUS USING SAME

The present invention relates to a technique for encoding and decoding video data, and more particularly, to a method for performing inter-prediction in an effective manner. The present invention combines an inter-prediction method using an AMVP mode and an inter-prediction method using a merge mode so as to propose a method for using the same candidate. The method for encoding video data proposed by the present invention comprises the following steps: receiving mode information on an inter-prediction method of a current block; determining, on the basis of the received mode information, whether the interprediction method to be applied to the current block is an AMVP mode or a merge mode; and selecting a candidate to derive motion information of the current block, wherein the candidate is selected in a left region, top region and corner region of the current block and in the same position block as the current block, and the AMVP mode and the merge mode are applied on the basis of the selected candidate.

METHOD FOR ENCODING AND DECODING VIDEO, AND APPARATUS USING SAME

The present invention relates to a technique for encoding and decoding video data, and more particularly, to a method for performing inter-prediction in an effective manner. The present invention combines an inter-prediction method using an AMVP mode and an inter-prediction method using a merge mode so as to propose a method for using the same candidate. The method for encoding video data proposed by the present invention comprises the following steps: receiving mode information on an inter-prediction method of a current block; determining, on the basis of the received mode information, whether the interprediction method to be applied to the current block is an AMVP mode or a merge mode; and selecting a candidate to derive motion information of the current block, wherein the candidate is selected in a left region, top region and corner region of the current block and in the same position block as the current block, and the AMVP mode and the merge mode are applied on the basis of the selected candidate.

REDUCING LATENCY IN VIDEO ENCODING AND DECODING
20230239509 · 2023-07-27 · ·

Techniques and tools for reducing latency in video encoding and decoding by constraining latency due to reordering of video frames, and by indicating the constraint on frame reordering latency with one or more syntax elements that accompany encoded data for the video frames. For example, a real-time communication tool with a video encoder sets a syntax element that indicates a constraint on frame reordering latency, which is consistent with inter-frame dependencies between multiple frames of a video sequence, then outputs the syntax element. A corresponding real-time communication tool with a video decoder receives the syntax element that indicates the constraint on frame reordering latency, determines the constraint on frame reordering latency based on the syntax element, and uses the constraint on frame reordering latency to determine when a reconstructed frame is ready for output (in terms of output order).

REDUCING LATENCY IN VIDEO ENCODING AND DECODING
20230239509 · 2023-07-27 · ·

Techniques and tools for reducing latency in video encoding and decoding by constraining latency due to reordering of video frames, and by indicating the constraint on frame reordering latency with one or more syntax elements that accompany encoded data for the video frames. For example, a real-time communication tool with a video encoder sets a syntax element that indicates a constraint on frame reordering latency, which is consistent with inter-frame dependencies between multiple frames of a video sequence, then outputs the syntax element. A corresponding real-time communication tool with a video decoder receives the syntax element that indicates the constraint on frame reordering latency, determines the constraint on frame reordering latency based on the syntax element, and uses the constraint on frame reordering latency to determine when a reconstructed frame is ready for output (in terms of output order).

IMAGE CODING METHOD, IMAGE DECODING METHOD, IMAGE CODING APPARATUS, RECEIVING APPARATUS, AND TRANSMITTING APPARATUS
20230239484 · 2023-07-27 ·

An image coding method of hierarchically coding a plurality of pictures to generate a bitstream, includes: coding each of the plurality of pictures, which belongs to any one of a plurality of hierarchical layers, with reference to a picture belonging to a hierarchical layer which is same as or lower than a hierarchical layer to which the picture belongs, and without reference to a picture belonging to a hierarchical layer which is higher than the hierarchical layer to which the picture belongs; and generating the bitstream by coding the coded pictures and time information indicating decoding times of the coded pictures. The time information indicates that the decoding times are set at equal intervals for low-layer pictures which are the plurality of pictures other than highest-layer pictures belonging to a highest layer among the plurality of hierarchical layers.