Patent classifications
H04N19/57
Decoder-side motion vector derivation
Improved systems and methods for decoder-side motion vector derivation (DMVD) are described, for example, applying one or more constraints to motion information, such as a motion vector (MV) derived by DMVD and/or the MV difference between an initial MV and an MV derived by DMVD. These techniques may be applied to any existing video codec, such as HEVC (High Efficiency Video Coding), and/or may serve as an efficient coding tool in future video coding standards. In one example, the block size used for DMVD can be restricted. In another example, FRUC (frame rate up-conversion) bilateral matching can be simplified by not searching outside the reference blocks indicated by the original motion vector.
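One way to picture a constraint on the MV difference is a simple per-component clamp. The sketch below is illustrative only: the function name `clamp_mv_difference` and the `max_diff` limit are assumptions, not taken from the patent, which does not state the form or size of the constraint.

```python
def clamp_mv_difference(initial_mv, derived_mv, max_diff):
    """Limit each component of (derived_mv - initial_mv) to [-max_diff, max_diff].

    Hypothetical helper: the abstract only says a constraint may be applied
    to the MV difference; a symmetric clamp is one possible choice.
    """
    constrained = []
    for init, derived in zip(initial_mv, derived_mv):
        diff = derived - init
        # Clamp the difference, then rebuild the constrained motion vector.
        diff = max(-max_diff, min(max_diff, diff))
        constrained.append(init + diff)
    return tuple(constrained)


# The horizontal difference of 15 is clamped to 8; the vertical one (-1) is kept.
print(clamp_mv_difference((10, -4), (25, -5), max_diff=8))
```

A clamp of this kind bounds how far the derived MV can drift from the initial MV, which in turn bounds the reference area a decoder would have to fetch.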
Image processor
The codec processor includes an SRAM that holds a reference image read from an image storage, and a motion search unit that performs motion search based on the reference image held in the SRAM to generate a prediction block for a target block in an input image. The SRAM holds a reference image whose horizontal pixel count equals that of the input image and whose vertical pixel count is greater than or equal to the vertical motion search range.
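The sizing rule in this abstract reduces to simple arithmetic. The following sketch assumes the vertical motion search range is expressed as a number of pixel rows; the function names are hypothetical and do not come from the patent.

```python
def min_sram_pixels(image_width, vertical_search_range):
    """Smallest pixel count satisfying the rule: full image width times
    at least `vertical_search_range` rows (assumed to be given in rows)."""
    return image_width * vertical_search_range


def sram_satisfies_rule(sram_width, sram_rows, image_width, vertical_search_range):
    """Check both conditions from the abstract: equal width, enough rows."""
    return sram_width == image_width and sram_rows >= vertical_search_range


# A 1920-pixel-wide input with a 64-row vertical search range.
print(min_sram_pixels(1920, 64))
print(sram_satisfies_rule(1920, 64, 1920, 64))
```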
Use of extended samples during search in decoder-side motion refinement
A method for inter prediction of a block of a picture and corresponding apparatus are provided. The method comprises: determining a first prediction block based on an initial first motion vector of a current block in a first reference picture, so as to obtain a first extension block; obtaining from the first reference picture, one or more first reference blocks each of which is associated with a combination of the initial first motion vector and one of motion vector offsets; obtaining a target motion vector offset with a smallest cost among the motion vector offsets; determining for the current block, a refined first motion vector which is a combination of the initial first motion vector and the target motion vector offset; and obtaining a refined prediction block of the current block based on the refined first motion vector, thereby improving coherence of refined motion vectors to achieve higher coding efficiencies.
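The search step described above (evaluate each motion vector offset, keep the one with the smallest cost, add it to the initial motion vector) can be sketched as follows. This is a minimal illustration under stated assumptions: the cost is a sum of absolute differences (SAD) against a given target block, which the abstract does not specify, and the extension block is not modeled. All function names are hypothetical.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def get_block(picture, x, y, width, height):
    """Extract a width x height block whose top-left corner is (x, y)."""
    return [row[x:x + width] for row in picture[y:y + height]]


def refine_mv(reference, target, pos, initial_mv, offsets):
    """Return (refined_mv, best_offset, best_cost) over the candidate offsets.

    Assumption: cost is SAD between the candidate reference block and `target`.
    """
    x0, y0 = pos
    height, width = len(target), len(target[0])
    best_offset, best_cost = None, None
    for dx, dy in offsets:
        x = x0 + initial_mv[0] + dx
        y = y0 + initial_mv[1] + dy
        # Skip candidates that fall outside the reference picture.
        if x < 0 or y < 0 or x + width > len(reference[0]) or y + height > len(reference):
            continue
        cost = sad(get_block(reference, x, y, width, height), target)
        if best_cost is None or cost < best_cost:
            best_offset, best_cost = (dx, dy), cost
    if best_offset is None:
        raise ValueError("no candidate offset lies inside the reference picture")
    refined = (initial_mv[0] + best_offset[0], initial_mv[1] + best_offset[1])
    return refined, best_offset, best_cost


# Synthetic 6x6 reference picture with unique sample values.
reference = [[10 * y + x for x in range(6)] for y in range(6)]
target = get_block(reference, 3, 2, 2, 2)
offsets = [(dx, dy) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
print(refine_mv(reference, target, pos=(1, 1), initial_mv=(1, 1), offsets=offsets))
```

In this toy setup the initial motion vector points one sample to the left of the true match, so the search selects the offset (1, 0) with zero cost.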
Apparatus and methods for image encoding using spatially weighted encoding quality parameters
A method includes obtaining visual content comprising spatial portions; determining respective spatial qualities of the spatial portions, wherein the respective spatial qualities are based on locations of the spatial portions within the visual content; and encoding the spatial portions of the visual content based on the respective spatial qualities. An apparatus includes a camera, a display, and a processor. The processor is configured to identify, using facial recognition, a face of a user of the apparatus; identify a distance of the face of the user to the display; and render visual content on the display using a quality that is based on the distance.
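As a rough illustration of location-based spatial quality, the sketch below assigns each spatial portion a quantization parameter (QP) that worsens with distance from the image center. The center-weighted rule, the QP range, and the function names are all assumptions made for illustration; the abstract says only that quality depends on the location of each portion.

```python
import math


def spatial_weight(col, row, cols, rows):
    """Weight in [0, 1]: 1 at the grid center, 0 at the farthest corner."""
    cx, cy = (cols - 1) / 2, (rows - 1) / 2
    max_dist = math.hypot(cx, cy)
    if max_dist == 0:
        return 1.0  # a single portion is trivially at the center
    return 1.0 - math.hypot(col - cx, row - cy) / max_dist


def qp_map(cols, rows, best_qp=22, worst_qp=38):
    """Per-portion QP (lower means higher quality); values are hypothetical."""
    return [[round(worst_qp - spatial_weight(c, r, cols, rows) * (worst_qp - best_qp))
             for c in range(cols)]
            for r in range(rows)]


for qp_row in qp_map(3, 3):
    print(qp_row)
```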
Method and apparatus for encoding video using variable partitions for predictive encoding, and method and apparatus for decoding video using variable partitions for predictive encoding
A video encoding method and apparatus and a video decoding method and apparatus are provided. The video encoding method includes: prediction encoding in units of a coding unit as a data unit for encoding a picture, by using partitions determined based on a first partition mode and a partition level, so as to select a partition for outputting an encoding result from among the determined partitions; and encoding and outputting partition information representing a first partition mode and a partition level of the selected partition. The first partition mode represents a shape and directionality of a partition as a data unit for performing the prediction encoding on the coding unit, and the partition level represents a degree to which the coding unit is split into partitions for detailed motion prediction.
Methods and apparatus for adaptive geometric partitioning for video decoding
There are provided methods and apparatus for adaptive geometric partitioning for video encoding and decoding. An apparatus includes an encoder for encoding image data corresponding to pictures by adaptively partitioning at least portions of the pictures responsive to at least one parametric model. The at least one parametric model involves at least one of implicit and explicit formulation of at least one curve.
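To make the idea of an implicitly formulated curve concrete, the sketch below partitions a square block with a straight line written as f(x, y) = x cos(theta) + y sin(theta) - rho, measured from the block center; the sign of f decides which partition each sample belongs to. The choice of a straight line, the parameter names `angle_deg` and `rho`, and the function name are assumptions for illustration, not details from the abstract.

```python
import math


def partition_mask(size, angle_deg, rho):
    """Return a size x size mask of 0/1 labels split by an implicit line.

    Samples with f(x, y) > 0 get label 1, all others label 0. Coordinates
    are taken relative to the block center.
    """
    theta = math.radians(angle_deg)
    center = (size - 1) / 2
    mask = []
    for y in range(size):
        row = []
        for x in range(size):
            f = (x - center) * math.cos(theta) + (y - center) * math.sin(theta) - rho
            row.append(1 if f > 0 else 0)
        mask.append(row)
    return mask


# A vertical boundary through the center of a 4x4 block.
for mask_row in partition_mask(4, angle_deg=0, rho=0):
    print(mask_row)
```

Higher-order curves could be handled the same way by swapping in a different f, since only the sign test depends on the model.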