H04N19/51

Overlapped block motion compensation using spatial neighbors

Devices, systems and methods for digital video coding, which includes an overlapped block motion compensation (OBMC) process based on spatially neighboring blocks, are described. An exemplary method for video processing includes generating a motion information associated with a current video block; generating, based on a weighted sum of at least two temporary prediction blocks, a prediction block for the current video block, a first of the at least two temporary prediction blocks being based on the motion information, and a second of the at least two temporary prediction blocks being based on an intra prediction mode of a neighboring block; and performing, based on the prediction block, a conversion between the current video block and a bitstream representation of the current video block.

IMAGE DECODING METHOD AND APPARATUS ACCORDING TO BLOCK DIVISION STRUCTURE IN IMAGE CODING SYSTEM
20230217023 · 2023-07-06 ·

Provided is a video decoding method performed by a decoding apparatus, which includes: obtaining split information for a target block from a bitstream; splitting the target block into a first sub-block and a second sub-block based on a split boundary indicated by the split information; deriving a first motion information candidate list for the first sub-block and a second motion information candidate list for the second sub-block based on the split information for the target block; performing inter prediction of the first sub-block based on the first motion information candidate list; and performing inter prediction of the second sub-block based on the second motion information candidate list, in which the first sub-block and the second sub-block are non-rectangular blocks, and the first motion information candidate list for the first sub-block is different from the second motion information candidate list for the second sub-block.

IMAGE DECODING METHOD AND APPARATUS ACCORDING TO BLOCK DIVISION STRUCTURE IN IMAGE CODING SYSTEM
20230217023 · 2023-07-06 ·

Provided is a video decoding method performed by a decoding apparatus, which includes: obtaining split information for a target block from a bitstream; splitting the target block into a first sub-block and a second sub-block based on a split boundary indicated by the split information; deriving a first motion information candidate list for the first sub-block and a second motion information candidate list for the second sub-block based on the split information for the target block; performing inter prediction of the first sub-block based on the first motion information candidate list; and performing inter prediction of the second sub-block based on the second motion information candidate list, in which the first sub-block and the second sub-block are non-rectangular blocks, and the first motion information candidate list for the first sub-block is different from the second motion information candidate list for the second sub-block.

Method and apparatus for processing a video signal

The present invention discloses a method and apparatus for encoding or decoding a video signal. The method for processing a video signal according to the present invention uses a merging mode in which prediction information on a neighbor unit is used instead of transmitting prediction information on the present unit, so as to improve coding efficiency. In this case, the number of available candidate units for merging among the units in a predetermined position is determined, and information for the merging mode is acquired on the basis of the number of the available candidate units for merging. The unit to be merged is determined using the information for the merging mode, and prediction information on the unit to be merged is acquired. The prediction value for the present unit is acquired using the prediction information on the unit to be merged, and the present unit is restored using the acquired prediction value.

Method and apparatus for processing a video signal

The present invention discloses a method and apparatus for encoding or decoding a video signal. The method for processing a video signal according to the present invention uses a merging mode in which prediction information on a neighbor unit is used instead of transmitting prediction information on the present unit, so as to improve coding efficiency. In this case, the number of available candidate units for merging among the units in a predetermined position is determined, and information for the merging mode is acquired on the basis of the number of the available candidate units for merging. The unit to be merged is determined using the information for the merging mode, and prediction information on the unit to be merged is acquired. The prediction value for the present unit is acquired using the prediction information on the unit to be merged, and the present unit is restored using the acquired prediction value.

Method and device for encoding or decoding image

An image decoding method and apparatus according to an embodiment may extract, from a bitstream, a quantization coefficient generated through core transformation, secondary transformation, and quantization; generate an inverse-quantization coefficient by performing inverse quantization on the quantization coefficient; generate a secondary inverse-transformation coefficient by performing secondary inverse-transformation on a low frequency component of the inverse-quantization coefficient, the secondary inverse-transformation corresponding to the secondary transformation; and perform core inverse-transformation on the secondary inverse-transformation coefficient, the core inverse-transformation corresponding to the core transformation.

Method and device for encoding or decoding image

An image decoding method and apparatus according to an embodiment may extract, from a bitstream, a quantization coefficient generated through core transformation, secondary transformation, and quantization; generate an inverse-quantization coefficient by performing inverse quantization on the quantization coefficient; generate a secondary inverse-transformation coefficient by performing secondary inverse-transformation on a low frequency component of the inverse-quantization coefficient, the secondary inverse-transformation corresponding to the secondary transformation; and perform core inverse-transformation on the secondary inverse-transformation coefficient, the core inverse-transformation corresponding to the core transformation.

Applications for decoder-side modeling of objects identified in decoded video data

Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Applications for decoder-side modeling of objects identified in decoded video data

Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Image encoder using machine learning and data processing method of the image encoder
11694125 · 2023-07-04 · ·

An image encoder for outputting a bitstream by encoding an input image includes a predictive block, a machine learning based prediction enhancement (MLBE) block, and a subtractor. The predictive block is configured to generate a prediction block using data of a previous input block. The MLBE block is configured to transform the prediction block into an enhanced prediction block by applying a machine learning technique to the prediction block. The subtractor is configured to generate a residual block by subtracting pixel data of the enhanced prediction block from pixel data of a current input block.