Patent classifications
H04N19/137
Motion vector prediction for video coding
Methods and systems for providing a bitstream comprising video data encoded by an encoder apparatus are described, wherein the method may include: a processor of the encoder apparatus determining a current motion vector of a current block of a current video frame of a sequence of video frames comprising video data, the current motion vector defining a spatial offset of the current block relative to a prediction block of a previously encoded reference video frame stored in a memory of the encoder apparatus; the processor determining or receiving motion information associated with the current video frame, the motion information signaling the processor whether at least part of the offset defined by the current motion vector is associated with non-uniform motion in the video data of the current video frame; the processor determining a motion vector predictor candidate based on the motion information, at least a first motion vector predicator algorithm and a second motion vector predictor algorithm; and, the processor determining a motion vector difference based on the selected motion vector predictor candidate and the current motion vector; and, the processor using an encoding process to encode a residual block, the motion vector difference, an indication of the selected motion vector predictor candidate, and at least part of the motion information into a bitstream, wherein the residual block defines a difference between the current block and the prediction block.
Motion vector prediction for video coding
Methods and systems for providing a bitstream comprising video data encoded by an encoder apparatus are described, wherein the method may include: a processor of the encoder apparatus determining a current motion vector of a current block of a current video frame of a sequence of video frames comprising video data, the current motion vector defining a spatial offset of the current block relative to a prediction block of a previously encoded reference video frame stored in a memory of the encoder apparatus; the processor determining or receiving motion information associated with the current video frame, the motion information signaling the processor whether at least part of the offset defined by the current motion vector is associated with non-uniform motion in the video data of the current video frame; the processor determining a motion vector predictor candidate based on the motion information, at least a first motion vector predicator algorithm and a second motion vector predictor algorithm; and, the processor determining a motion vector difference based on the selected motion vector predictor candidate and the current motion vector; and, the processor using an encoding process to encode a residual block, the motion vector difference, an indication of the selected motion vector predictor candidate, and at least part of the motion information into a bitstream, wherein the residual block defines a difference between the current block and the prediction block.
Triangulation-Based Adaptive Subsampling of Dense Motion Vector Fields
The present disclosure relates to an apparatus and a method for providing a plurality of motion vectors related to an image coded in a bitstream, comprising obtaining a set of sample positions within the image, obtaining respective motion vectors associated with the set of sample positions, deriving an additional motion vector based on information coded in the bitstream, determining an additional sample position located within a triangle, which is formed by three sample positions of the set of sample positions, based on the respective motion vectors associated with the three sample positions, the triangle not including any other sample position of the set, adding the additional sample position to the set of sample positions, and associating the derived additional motion vector with the additional sample position. Such method and apparatus provide for an efficient motion flow compression and reconstruction enabling to recover some sample positions without signaling them in the bitstream.
Triangulation-Based Adaptive Subsampling of Dense Motion Vector Fields
The present disclosure relates to an apparatus and a method for providing a plurality of motion vectors related to an image coded in a bitstream, comprising obtaining a set of sample positions within the image, obtaining respective motion vectors associated with the set of sample positions, deriving an additional motion vector based on information coded in the bitstream, determining an additional sample position located within a triangle, which is formed by three sample positions of the set of sample positions, based on the respective motion vectors associated with the three sample positions, the triangle not including any other sample position of the set, adding the additional sample position to the set of sample positions, and associating the derived additional motion vector with the additional sample position. Such method and apparatus provide for an efficient motion flow compression and reconstruction enabling to recover some sample positions without signaling them in the bitstream.
Coding concept allowing efficient multi-view/layer coding
Various concepts which further improve multi-view/layer coding concepts, are described.
Coding concept allowing efficient multi-view/layer coding
Various concepts which further improve multi-view/layer coding concepts, are described.
Adaptive motion vector precision for affine motion model based video coding
Systems and methods are described for video coding using affine motion models with adaptive precision. In an example, a block of video is encoded in a bitstream using an affine motion model, where the affine motion model is characterized by at least two motion vectors. A precision is selected for each of the motion vectors, and the selected precisions are signaled in the bitstream. In some embodiments, the precisions are signaled by including in the bitstream information that identifies one of a plurality of elements in a selected predetermined precision set. The identified element indicates the precision of each of the motion vectors that characterize the affine motion model. In some embodiments, the precision set to be used is signaled expressly in the bitstream; in other embodiments, the precision set may be inferred, e.g., from the block size, block shape or temporal layer.
Adaptive motion vector precision for affine motion model based video coding
Systems and methods are described for video coding using affine motion models with adaptive precision. In an example, a block of video is encoded in a bitstream using an affine motion model, where the affine motion model is characterized by at least two motion vectors. A precision is selected for each of the motion vectors, and the selected precisions are signaled in the bitstream. In some embodiments, the precisions are signaled by including in the bitstream information that identifies one of a plurality of elements in a selected predetermined precision set. The identified element indicates the precision of each of the motion vectors that characterize the affine motion model. In some embodiments, the precision set to be used is signaled expressly in the bitstream; in other embodiments, the precision set may be inferred, e.g., from the block size, block shape or temporal layer.
Method and system for picture segmentation using columns
Described is picture segmentation through columns and slices in video encoding and decoding. A video picture is divided into a plurality of columns, each column covering only a part of the video picture in a horizontal dimension. All coded tree blocks (“CTBs”) belonging to a slice may belong to one or more columns. The columns may be used to break the same or different prediction or in-loop filtering mechanisms of the video coding, and the CTB scan order used for encoding and/or decoding may be local to a column. Column widths may be indicated in a parameter set and/or may be adjusted at the slice level. At the decoder, column width may be parsed from the bitstream, and slice decoding may occur in one or more columns.
Method and system for picture segmentation using columns
Described is picture segmentation through columns and slices in video encoding and decoding. A video picture is divided into a plurality of columns, each column covering only a part of the video picture in a horizontal dimension. All coded tree blocks (“CTBs”) belonging to a slice may belong to one or more columns. The columns may be used to break the same or different prediction or in-loop filtering mechanisms of the video coding, and the CTB scan order used for encoding and/or decoding may be local to a column. Column widths may be indicated in a parameter set and/or may be adjusted at the slice level. At the decoder, column width may be parsed from the bitstream, and slice decoding may occur in one or more columns.