Patent classifications
H04N19/567
Encoding/decoding method, encoding apparatus, decoding apparatus, and system for video with forward and backward reference blocks
Encoding methods, decoding methods, encoding apparatuses, and decoding apparatuses are provided for a video image with forward and backward reference blocks. One encoding method includes: determining an optimal integrated neighboring block for a current block based on a motion vector integration technology; determining, based on a prediction direction of the optimal integrated neighboring block, a motion vector derivation mode that needs to be used by a decoder; correcting a motion vector of the current block based on the motion vector derivation mode; and determining a residual between a predicted value and an original value of the current block based on the corrected motion vector, thereby encoding the current block. According to the technical solutions, a more accurate predicted value is obtained by correcting the motion vector, and a smaller residual is generated.
Dynamic point cloud compression using inter-prediction
A system comprises an encoder configured to compress attribute information for a dynamic point cloud and/or a decoder configured to decompress compressed attribute information for a dynamic point cloud. The dynamic point cloud may include multiple versions of the point cloud at multiple moments in time. Attribute values for the point cloud may be compressed at a reference frame using an intra-prediction process and may be compressed at one or more target frames using an inter-prediction process that takes advantage of temporal relationships between different frames (e.g. versions) of the dynamic point cloud at the different moments in time.
TEMPLATE MATCHING IN VIDEO CODING
A video decoder may be configured to determine a motion vector and a motion vector precision for a current block; identify a current block template within the current picture; search within a search area for a final reference block template that corresponds to the current block template, wherein to search within the search area, the video decoder is further configured to: identify an initial reference block template based on the motion vector, search other reference block templates around the initial reference block template using a step size that is set to an initial step size, and iteratively reduce the step size from the initial step size until the step size is set to a final step size that equals the motion vector precision; and determine a prediction block for the current block based on the final reference block template.
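The coarse-to-fine search described in this abstract can be illustrated with a minimal Python sketch. It is not the claimed method: it assumes integer-sample positions only (no fractional interpolation), a sum-of-absolute-differences (SAD) cost, a four-neighbor cross pattern, and halving of the step size. The names `template_cost`, `refine_by_template`, `init_step`, `final_step`, and `search_range` are hypothetical, with `final_step` standing in for the motion vector precision.

```python
import numpy as np


def template_cost(ref, template, top, left):
    """SAD between the template and the same-sized patch of ref at (top, left)."""
    h, w = template.shape
    patch = ref[top:top + h, left:left + w].astype(np.int64)
    return int(np.abs(patch - template.astype(np.int64)).sum())


def refine_by_template(ref, template, init_pos, init_step, final_step, search_range):
    """Search around init_pos, shrinking the step until it equals final_step.

    Assumptions (not from the source): integer positions, SAD cost,
    cross-shaped neighborhood, step halved after each stage.
    Returns ((top, left), cost) of the best template position found.
    """
    h, w = template.shape
    best = tuple(init_pos)
    best_cost = template_cost(ref, template, *best)
    step = init_step
    while True:
        improved = True
        while improved:  # keep moving while a neighbor at this step is strictly better
            improved = False
            cy, cx = best
            for dy, dx in ((-step, 0), (step, 0), (0, -step), (0, step)):
                y, x = cy + dy, cx + dx
                # Stay inside the search area centered on the initial position.
                if abs(y - init_pos[0]) > search_range or abs(x - init_pos[1]) > search_range:
                    continue
                # Stay inside the reference picture.
                if y < 0 or x < 0 or y + h > ref.shape[0] or x + w > ref.shape[1]:
                    continue
                cost = template_cost(ref, template, y, x)
                if cost < best_cost:
                    best, best_cost, improved = (y, x), cost, True
        if step <= final_step:
            break  # final stage ran at the target precision
        step = max(final_step, step // 2)
    return best, best_cost
```

A caller would pass the reconstructed reference picture as `ref`, the current block template as `template`, and the position implied by the initial motion vector as `init_pos`; the returned position identifies the final reference block template. A local search of this kind can settle in a local minimum, which is one reason the starting step size and search area matter.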
SYSTEM AND METHOD FOR FRAME RATE UP-CONVERSION OF VIDEO DATA BASED ON A QUALITY RELIABILITY PREDICTION
According to one aspect of the disclosure, a computer-implemented method for performing frame rate up-conversion of video data including a sequence of image frames is provided. The method may include performing, by a video processor, an interpolation quality reliability prediction for a target image level based on a reliability metric. In response to the interpolation quality reliability prediction meeting a first reliability threshold condition associated with a first reliability threshold, the method may include performing, by the video processor, a motion-compensation interpolation at the target image level. In response to the interpolation quality reliability prediction not meeting the first reliability threshold, the method may include performing, by the video processor, a fallback interpolation at the target image level or performing a new interpolation quality reliability prediction for a new image level below the target image level.
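The decision flow in this abstract can be sketched as follows. This is an illustrative reading, not the disclosed implementation: it assumes the threshold condition is "score greater than or equal to the threshold", and that the method always descends to the next lower image level when one exists, using the fallback interpolation only at the lowest level. The function `plan_interpolation`, the `predict` callback, and the level names are hypothetical.

```python
from typing import Callable, Sequence, Tuple


def plan_interpolation(
    levels: Sequence[str],
    predict: Callable[[str], float],
    threshold: float,
) -> Tuple[str, str]:
    """Pick an interpolation strategy and the image level it applies to.

    levels    -- image levels ordered from the target level downward
                 (hypothetical example: ["frame", "block"])
    predict   -- returns a reliability score for a level (higher is better)
    threshold -- first reliability threshold; ">=" is an assumed condition
    """
    if not levels:
        raise ValueError("at least one image level is required")
    for level in levels:
        if predict(level) >= threshold:
            # Prediction is reliable enough: interpolate with motion compensation.
            return ("motion_compensated", level)
        # Otherwise try a new prediction at the next lower level, if any.
    # No level met the threshold: use the fallback interpolation at the last level tried.
    return ("fallback", levels[-1])
```

For instance, with per-level scores held in a dictionary `scores`, the call `plan_interpolation(["frame", "block"], scores.get, 0.7)` returns the strategy and the level at which it would be applied.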
Image coding method, image decoding method, image coding apparatus, and image decoding apparatus
According to an embodiment, an image coding method is for coding an image including a luminance component and color difference components. The method includes acquiring a reference image; and generating a predicted image by interpolating the luminance component and the color difference components in the reference image according to a motion vector. If a size of a block, which is designated as a unit of the interpolation, is equal to or smaller than a predetermined first threshold value, the generating includes inhibiting a bi-directional prediction, and performing only a uni-directional prediction to generate the predicted image according to the motion vector.
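The size-based restriction in this abstract amounts to a small guard applied before prediction. The sketch below assumes that block size is measured as area in samples and uses an arbitrary default threshold of 32; the abstract fixes neither choice. The name `restrict_prediction_mode` and the mode strings are hypothetical.

```python
def restrict_prediction_mode(width, height, requested_mode, size_threshold=32):
    """Return the prediction mode actually used for a block.

    Assumptions (not from the source): size is width * height in samples,
    and the default threshold of 32 samples is only a placeholder.
    """
    block_size = width * height
    if requested_mode == "bi" and block_size <= size_threshold:
        # Small block: bi-directional prediction is inhibited, use uni-directional.
        return "uni"
    return requested_mode
```

Under these assumptions, a 4x8 block that requests `"bi"` is downgraded to `"uni"`, while an 8x8 block keeps the requested mode.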
Video motion estimation method and apparatus, and storage medium
A terminal performs video motion estimation by dividing an image frame of a video into a plurality of prediction units, and dividing the plurality of prediction units into a first type of prediction units and a second type of prediction units. A motion vector of a prediction unit of the first type is then obtained according to a first search algorithm. A motion vector of a prediction unit of the second type is obtained according to a second search algorithm. The second search algorithm is different from the first search algorithm. Then sub-pixel motion estimation is performed on the image frame according to the motion vector of the prediction unit of the first type and the motion vector of the prediction unit of the second type to generate a motion estimation result.
BI-DIRECTIONAL OPTICAL FLOW IN VIDEO CODING
A method of decoding video data includes determining that bi-directional optical flow (BDOF) is enabled for a block of the video data; dividing the block into a plurality of sub-blocks based on the determination that BDOF is enabled for the block; determining, for each sub-block of one or more sub-blocks of the plurality of sub-blocks, respective distortion values; determining that one of per-pixel BDOF is performed or BDOF is bypassed for each sub-block of the one or more sub-blocks of the plurality of sub-blocks based on the respective distortion values; determining prediction samples for each sub-block of the one or more sub-blocks based on the determination of per-pixel BDOF being performed or BDOF being bypassed; and reconstructing the block based on the prediction samples.
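The per-sub-block decision in this abstract can be sketched as below. The abstract does not say how the distortion values are computed or compared, so this sketch assumes the distortion is the sum of absolute differences (SAD) between the two prediction signals of a sub-block and that BDOF is bypassed when the SAD falls below a per-sample threshold. The function `bdof_decisions`, the 8x8 sub-block size, and the threshold value are hypothetical, and the BDOF refinement itself is not implemented here.

```python
import numpy as np


def bdof_decisions(pred0, pred1, sub=8, threshold_per_sample=2):
    """Decide, per sub-block, whether to run per-pixel BDOF or bypass it.

    pred0, pred1 -- the two prediction blocks (same shape), one per reference
    sub          -- sub-block edge length in samples (assumed value)
    Returns a dict mapping each sub-block origin (y, x) to "per_pixel" or "bypass".
    """
    if pred0.shape != pred1.shape:
        raise ValueError("prediction blocks must have the same shape")
    h, w = pred0.shape
    decisions = {}
    for y in range(0, h, sub):
        for x in range(0, w, sub):
            a = pred0[y:y + sub, x:x + sub].astype(np.int64)
            b = pred1[y:y + sub, x:x + sub].astype(np.int64)
            sad = int(np.abs(a - b).sum())  # assumed distortion measure
            # Small distortion: the two predictions already agree, so skip refinement.
            if sad < threshold_per_sample * a.size:
                decisions[(y, x)] = "bypass"
            else:
                decisions[(y, x)] = "per_pixel"
    return decisions
```

A decoder following this sketch would then form the prediction samples of each bypassed sub-block directly from the two predictions and apply per-pixel refinement only to the remaining sub-blocks.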
No Reference Image Quality Assessment Based Decoder Side Inter Prediction
A decoder may determine first motion information, from a plurality of motion information, for a block. The decoder may generate, for each respective motion information of the plurality of motion information, a reconstructed block. The decoder may generate the reconstructed block, for each respective motion information of the plurality of motion information, based on: a prediction block generated for the respective motion information; and a reconstructed residual block based on the first motion information. The decoder may determine second motion information, from the plurality of motion information, for the block based on a visual quality of each of the reconstructed blocks. The decoder may signal, based on the second motion information, the first motion information in a bit stream.
Visual Quality Assessment-based Affine Transformation
A decoder may receive, for a block and from a bit stream, an indication of a decoder-side affine transform, a prediction mode, and a residual block. The decoder may generate a compensated prediction of the block. For example, the decoder may generate the compensated prediction of the block based on the residual block and the prediction mode. The decoder may generate, based on the indication and for each of a plurality of affine transform parameters, an affine transformation of the compensated prediction. The decoder may determine an affine transform parameter, from the plurality of affine transform parameters, based on a visual quality of each of the affine transformations of the compensated prediction.