Patent classifications
H04N19/517
IMAGE PROCESSING APPARATUS AND METHOD
There is provided an image processing apparatus and method that make it possible to suppress degradation of the encoding efficiency. In the case where primary transform that is a transform process for a prediction residual that is a difference between an image and a prediction image of the image is to be skipped, also secondary transform, which is a transform process for a primary transform coefficient obtained by the primary transform of the prediction residual, is skipped. The present disclosure can be applied, for example, to an image processing apparatus, an image encoding apparatus, an image decoding apparatus and so forth.
IMAGE PROCESSING APPARATUS AND METHOD
There is provided an image processing apparatus and method that make it possible to suppress degradation of the encoding efficiency. In the case where primary transform that is a transform process for a prediction residual that is a difference between an image and a prediction image of the image is to be skipped, also secondary transform, which is a transform process for a primary transform coefficient obtained by the primary transform of the prediction residual, is skipped. The present disclosure can be applied, for example, to an image processing apparatus, an image encoding apparatus, an image decoding apparatus and so forth.
METHOD AND DEVICE FOR IMAGE CODING USING MOTION VECTOR
According to embodiments of the present document, symmetric motion vector differences (SMVDs) and merge mode motion vector differences (MMVDs) according to an inter prediction mode may be performed on the basis of reference picture types and, particularly, short term reference pictures may be used. Therefore, prediction performance and coding efficiency in the inter prediction mode can be improved.
DECODER SIDE MOTION DERIVATION USING SPATIAL CORRELATION
A method of decoding video data includes determining a plurality of hypotheses of a current block based on a plurality of motion vectors. Each of the plurality of motion vectors is associated with one of the plurality of hypotheses, and each of the plurality of hypotheses is based on a set of samples in a reference picture having a motion vector that identifies a top-left sample of the set of samples. The method includes determining one or more neighboring samples in the same picture as the current block, for each of the plurality of hypotheses, determining respective correlation values between at least one sample of a respective hypothesis and at least one sample of the one or more neighboring samples, determining the motion vector for the current block based on the determined respective correlation values, and reconstructing the current block based on the determined motion vector.
DYNAMIC INSERTION OF CONTENT VIA MACROBLOCK MODIFICATION
Systems, methods, and devices for inserting content into a video frame are disclosed herein. A frame of video data encoded to include a plurality of macroblocks is received. An insertion region of the frame for inserting content is defined, the insertion region spanning a subset of the macroblocks. The frame is augmented with a duplication region configured as a non-displayed region, the duplication region including duplicated macroblocks that duplicate the macroblocks of insertion region. The macroblocks of the insertion region are replaced with replacement macroblocks that encode replacement content.
DYNAMIC INSERTION OF CONTENT VIA MACROBLOCK MODIFICATION
Systems, methods, and devices for inserting content into a video frame are disclosed herein. A frame of video data encoded to include a plurality of macroblocks is received. An insertion region of the frame for inserting content is defined, the insertion region spanning a subset of the macroblocks. The frame is augmented with a duplication region configured as a non-displayed region, the duplication region including duplicated macroblocks that duplicate the macroblocks of insertion region. The macroblocks of the insertion region are replaced with replacement macroblocks that encode replacement content.
Inter prediction with refinement in video processing
A method for processing a video includes performing a conversion between a current block of visual media data and a corresponding coded representation of the visual media data, wherein the conversion of the current block includes determining whether a use of one or both of a bi-directional optical flow (BIO) technique or a decoder-side motion vector refinement (DMVR) technique to the current block is enabled or disabled, and wherein the determining the use of the BIO technique or the DMVR technique is based on a cost criterion associated with the current block.
Inter prediction with refinement in video processing
A method for processing a video includes performing a conversion between a current block of visual media data and a corresponding coded representation of the visual media data, wherein the conversion of the current block includes determining whether a use of one or both of a bi-directional optical flow (BIO) technique or a decoder-side motion vector refinement (DMVR) technique to the current block is enabled or disabled, and wherein the determining the use of the BIO technique or the DMVR technique is based on a cost criterion associated with the current block.
Virtual merge candidates
A video encoding method includes generating, for a conversion between a video block in a video picture and a bitstream representation of the video block, a list of merge candidates for the video block, wherein the list of merge candidates includes at least a first merge candidate that is a virtual merge candidate derived by scaling a motion vector of a second merge candidate to a different reference picture and modifying a reference picture of a second merge candidate; and performing, using the list of merge candidates, the conversion between the video block and the video bitstream.
Virtual merge candidates
A video encoding method includes generating, for a conversion between a video block in a video picture and a bitstream representation of the video block, a list of merge candidates for the video block, wherein the list of merge candidates includes at least a first merge candidate that is a virtual merge candidate derived by scaling a motion vector of a second merge candidate to a different reference picture and modifying a reference picture of a second merge candidate; and performing, using the list of merge candidates, the conversion between the video block and the video bitstream.