Patent classifications
H04N19/577
PICTURE PREDICTION METHOD AND APPARATUS, AND CODEC
A picture prediction method is provided, which includes: obtaining initial motion information of a current picture block; determining, based on a matching cost criterion, that positions of a pair of reference blocks are a position of a target forward reference block of the current picture block and a position of a target backward reference block of the current picture block, where positions of each pair of reference blocks include a position of a forward reference block and a position of a backward reference block; and for the positions of each pair of reference blocks, a first position offset and a second position offset are in a mirror relationship; and obtaining a predicted value of a pixel value of the current picture block based on a pixel value of the target forward reference block and a pixel value of the target backward reference block.
Illumination compensation flag in frame rate up-conversion with template matching
Inferring an illumination compensation flag during encoding or decoding of a video image signal using frame rate up conversion can save one bit and eliminate complexity. The illumination compensation flag can be derived from the corresponding flags of at least one bi-predictive or bi-directional prediction candidates. The flag can also be derived from some function of the flags from those candidates. Alternatively, several flags can be used for respective coding or decoding of blocks if there are more than one prediction candidate using illumination compensation.
SIGNALING DECODED PICTURE BUFFER SIZE IN MULTI-LOOP SCALABLE VIDEO CODING
A method for encoding a video sequence in a scalable video encoder to generate a scalable bitstream is provided that includes encoding the video sequence in a first layer encoder of the scalable video encoder to generate a first sub-bitstream, encoding the video sequence in a second layer encoder of the scalable video encoder to generate a second sub-bitstream, wherein portions of the video sequence being encoded in the second layer encoder are predicted using reference portions of the video sequence encoded in the first layer encoder, combining the first sub-bitstream and the second sub-bitstream to generate the scalable bitstream, and signaling in the scalable bitstream an indication of a maximum decoded picture buffer (DPB) size needed for decoding the second sub-bitstream and the first sub-bitstream when the second sub-bitstream is a target sub-bitstream for decoding.
SIGNALING DECODED PICTURE BUFFER SIZE IN MULTI-LOOP SCALABLE VIDEO CODING
A method for encoding a video sequence in a scalable video encoder to generate a scalable bitstream is provided that includes encoding the video sequence in a first layer encoder of the scalable video encoder to generate a first sub-bitstream, encoding the video sequence in a second layer encoder of the scalable video encoder to generate a second sub-bitstream, wherein portions of the video sequence being encoded in the second layer encoder are predicted using reference portions of the video sequence encoded in the first layer encoder, combining the first sub-bitstream and the second sub-bitstream to generate the scalable bitstream, and signaling in the scalable bitstream an indication of a maximum decoded picture buffer (DPB) size needed for decoding the second sub-bitstream and the first sub-bitstream when the second sub-bitstream is a target sub-bitstream for decoding.
Image decoding device, image decoding method, and image decoding program
A merge candidate list is generated, a merge candidate is selected from the merge candidate list as a merge candidate, a bitstream is decoded to derive a motion vector difference, and a corrected merge candidate is derived by adding the motion vector difference to a motion vector of the selected merge candidate for a first prediction without scaling and subtracting the motion vector difference from a motion vector of the selected merge candidate for a second prediction without scaling.
Interaction of asymmetric weighted merges and other coding tools
Devices, systems and methods for asymmetric weighted bi-predictive merging in video coding are described. In a representative aspect, a method for decoding video data includes determining that a current video block of the video data is predicted from an asymmetric bi-predictive merge candidate, wherein different weights are used for a first reference block and a second reference block, and making, based on the determination, a determination between enabling and disabling a coding tool for the current video block. In a case that the current video block is predicted from an asymmetric bi-predictive merge candidate, the coding tool is not applied to the current video block.
Inter prediction with refinement in video processing
Devices, systems and methods for digital video coding, which includes inter prediction with refinement, are described. An exemplary method of video processing includes determining to use, for a conversion between a current block of a video and a bitstream representation of the video, a first linear optimization model for the conversion using a first coding mode, the first linear optimization model being derived from a second linear optimization model that is used for the conversion using a second coding mode, and performing, based on the determining, the conversion. Another exemplary method of video processing includes determining to use, for a conversion between a current block of a video and a bitstream representation of the video, a gradient value computation algorithm for a bi-directional optical flow tool, and performing, based on the determining, the conversion.
ENCODER, DECODER, IMAGE PROCESSING APPARATUS, ENCODING METHOD, DECODING METHOD, IMAGE PROCESSING METHOD, BITSTREAM TRANSMITTING APPARATUS, AND NON-TRANSITORY MEDIUM
An encoder includes circuitry and memory coupled to circuitry. In operation, circuitry: designs each of subpictures included in a picture in such a manner that at least one pixel included in the subpicture is included in a conformance cropping window; encodes arrangement information indicating an arrangement of each of the subpictures; and encodes each of the subpictures.
DECODING METHODS AND DEVICES, AND ENCODING METHODS AND DEVICES
The present disclosure provides decoding methods, encoding methods, apparatuses and devices. A decoding method includes: obtaining index information of an enhanced temporal motion vector prediction mode from a bit stream of a current block when it is determined that an enhanced temporal motion vector prediction technology is enabled for the current block; determining one or more matching blocks of the current block; determining candidate enhanced temporal motion vector prediction modes based on the one or more matching blocks and one or more new matching blocks obtained by offsetting each of the one or more matching blocks, and establishing a temporal candidate mode list; determining the enhanced temporal motion vector prediction mode based on the index information; determining motion information of each sub-block in the current block based on the enhanced temporal motion vector prediction mode, and performing motion compensation on each sub-block in the current block.
DECODING METHODS AND DEVICES, AND ENCODING METHODS AND DEVICES
The present disclosure provides decoding methods, encoding methods, apparatuses and devices. A decoding method includes: obtaining index information of an enhanced temporal motion vector prediction mode from a bit stream of a current block when it is determined that an enhanced temporal motion vector prediction technology is enabled for the current block; determining one or more matching blocks of the current block; determining candidate enhanced temporal motion vector prediction modes based on the one or more matching blocks and one or more new matching blocks obtained by offsetting each of the one or more matching blocks, and establishing a temporal candidate mode list; determining the enhanced temporal motion vector prediction mode based on the index information; determining motion information of each sub-block in the current block based on the enhanced temporal motion vector prediction mode, and performing motion compensation on each sub-block in the current block.