H04N19/19

Method for inter prediction and device therefor, and method for motion compensation and device therefor

Provided are an inter prediction method and a motion compensation method. The inter prediction method includes: performing inter prediction on a current image by using a long-term reference image stored in a decoded picture buffer; determining residual data and a motion vector of the current image generated via the inter prediction; and determining least significant bit (LSB) information as a long-term reference index indicating the long-term reference image by dividing picture order count (POC) information of the long-term reference image into most significant bit (MSB) information and the LSB information.

Method and an apparatus for processing a video signal

An apparatus for processing a video signal and method thereof are disclosed. The present invention includes receiving prediction mode information, interpolating information and a residual of a current block, reconstructing an interpolating pixel using the interpolating information and a neighbor block, and reconstructing the current block using the interpolating pixel, the prediction mode information and the residual, wherein the interpolating information is generated based on a location of the current block. According to an apparatus and method for processing a video signal, high reconstruction rate can be obtained by improving the related art method having limited intra prediction modes available for a current block located on a boundary area of a picture in encoding in a manner of reconstructing and using an interpolating pixel based on interpolating information.

Method and an apparatus for processing a video signal

An apparatus for processing a video signal and method thereof are disclosed. The present invention includes receiving prediction mode information, interpolating information and a residual of a current block, reconstructing an interpolating pixel using the interpolating information and a neighbor block, and reconstructing the current block using the interpolating pixel, the prediction mode information and the residual, wherein the interpolating information is generated based on a location of the current block. According to an apparatus and method for processing a video signal, high reconstruction rate can be obtained by improving the related art method having limited intra prediction modes available for a current block located on a boundary area of a picture in encoding in a manner of reconstructing and using an interpolating pixel based on interpolating information.

METHODS, SYSTEMS, AND APPARATUSES FOR PROCESSING VIDEO BY ADAPTIVE RATE DISTORTION OPTIMIZATION
20200322643 · 2020-10-08 ·

Systems and methods are described herein for processing video. An encoder implementing the systems and methods described herein may receive video data comprising a plurality of frames and may partition each frame of the plurality of frames into a plurality of coding units. The encoder may then partition a coding unit into two or more prediction units. The encoder may determine, based on one or more coding parameters, a target bit rate, and characteristics of a human visual system (HVS), a coding mode for each of the two or more prediction units to minimize distortion in the encoded bitstream. The encoder may then determine a residual signal comprising a difference between each of the two or more prediction units and each of one or more corresponding prediction areas in a previously encoded frame and then generate an encoded bitstream comprising the residual signal.

Apparatus and method for motion estimation of three dimension video

Disclosed is a 3D video motion estimating apparatus and method. The 3D video motion estimating apparatus may enable a motion vector of a color image and a motion vector of a depth image refer to each other, thereby increasing a compression rate.

Apparatus and method for motion estimation of three dimension video

Disclosed is a 3D video motion estimating apparatus and method. The 3D video motion estimating apparatus may enable a motion vector of a color image and a motion vector of a depth image refer to each other, thereby increasing a compression rate.

METHOD AND APPARATUS FOR VARIABLE RATE COMPRESSION WITH A CONDITIONAL AUTOENCODER

A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.

METHOD AND APPARATUS FOR VARIABLE RATE COMPRESSION WITH A CONDITIONAL AUTOENCODER

A method and apparatus for variable rate compression with a conditional autoencoder is herein provided. According to one embodiment, a method includes training a conditional autoencoder using a Lagrange multiplier and training a neural network that includes the conditional autoencoder with mixed quantization bin sizes.

FACE REGION DETECTION BASED LIGHT FIELD VIDEO COMPRESSION
20200296419 · 2020-09-17 ·

A method of perceptual video coding based on face detection is provided. The method includes calculating a bit allocation scheme for coding a light field video based on a saliency map of the face, calculating an LCU level Lagrange multiplier for coding a light field video based on a saliency map of the face and calculating an LCU level quantization parameter for coding a light field video based on a saliency map of the face.

METHOD AND APPARATUS FOR ENCODING A PICTURE

A method and an apparatus for encoding a picture are disclosed. A first rate-distortion cost is determined (42) for a current block of said picture when said current block is not split into subblocks, from a distortion computed for each subblock of said current block, said distortion being scaled by an inverse of a lagrangian parameter determined (40) according to a quantization parameter assigned to said subblock. It is then determined (44) whether said current block is split or not according at least to the determined first rate-distortion cost, and the current block is encoded (45) according to the result of determining whether a current block is split or not into subblocks.