Patent classifications
H04N19/70
Predictive tree coding for point cloud coding
A method and device for decoding a point cloud using octree partitioning and a predictive tree include obtaining the point cloud. A bounding box of the point cloud is determined. Octree nodes are generated by partitioning the bounding box using octree partitioning. The predictive tree is generated for points in at least one octree node of the octree nodes. A transform is applied to the predictive tree. The points in the at least one octree node are decoded using the predictive tree.
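The partition-then-predict flow in this abstract can be illustrated with a minimal sketch. It is not the claimed method: it performs a single level of octree partitioning, replaces the predictive tree with a simple chain in which each point is predicted from the previous one, and omits the transform step. All function names are hypothetical.

```python
from typing import Dict, List, Tuple

Point = Tuple[int, int, int]


def bounding_box(points: List[Point]) -> Tuple[Point, Point]:
    """Axis-aligned bounding box of the point cloud as (mins, maxs)."""
    mins = tuple(min(p[i] for p in points) for i in range(3))
    maxs = tuple(max(p[i] for p in points) for i in range(3))
    return mins, maxs


def split_octants(points: List[Point], mins: Point, maxs: Point) -> Dict[int, List[Point]]:
    """One level of octree partitioning: bucket points into up to 8 child nodes."""
    mid = tuple((lo + hi) // 2 for lo, hi in zip(mins, maxs))
    nodes: Dict[int, List[Point]] = {}
    for p in points:
        # Bit i of the child index is set when the point lies above the midpoint on axis i.
        idx = sum((p[i] > mid[i]) << i for i in range(3))
        nodes.setdefault(idx, []).append(p)
    return nodes


def encode_chain(points: List[Point]) -> List[Point]:
    """Residual of each point against the previous one (assumed chain-shaped predictive tree)."""
    prev = (0, 0, 0)
    residuals = []
    for p in points:
        residuals.append(tuple(a - b for a, b in zip(p, prev)))
        prev = p
    return residuals


def decode_chain(residuals: List[Point]) -> List[Point]:
    """Rebuild the points of one node by accumulating residuals along the chain."""
    prev = (0, 0, 0)
    points = []
    for r in residuals:
        prev = tuple(a + b for a, b in zip(prev, r))
        points.append(prev)
    return points
```

Decoding a node then amounts to calling `decode_chain` on that node's residuals; a real predictive tree would allow several children per point and more than one prediction mode.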
Method and apparatus for video coding
Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video decoding includes receiving circuitry and processing circuitry. For example, the processing circuitry decodes partition information from a coded video bitstream. The partition information is indicative of a minimum allowed quaternary tree (QT) leaf node size for an intra coded (I) slice. The minimum allowed QT leaf node size for the I slice is constrained by a threshold that is lower than a coding tree unit (CTU) size. Further, the processing circuitry partitions a coding tree block in the I slice into coding blocks based on the minimum allowed QT leaf node size, and reconstructs the coding blocks respectively from the coded video bitstream.
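As a rough illustration of how a minimum QT leaf node size bounds the partitioning, the sketch below recursively splits a square block into four quadrants and stops once the minimum size is reached. The names are hypothetical, and the `want_split` callback stands in for split decisions that a real decoder would read from the bitstream.

```python
from typing import Callable, Iterator, Tuple

Block = Tuple[int, int, int]  # (x, y, size)


def is_valid_min_qt_size(min_qt_size: int, ctu_size: int, threshold: int) -> bool:
    """Check the constraint from the abstract: the threshold is below the CTU size
    and the minimum QT leaf size does not exceed the threshold."""
    return threshold < ctu_size and min_qt_size <= threshold


def partition_qt(x: int, y: int, size: int, min_qt_size: int,
                 want_split: Callable[[int, int, int], bool]) -> Iterator[Block]:
    """Yield the leaf blocks of a quaternary-tree partition of one coding tree block."""
    if size > min_qt_size and want_split(x, y, size):
        half = size // 2
        for dy in (0, half):
            for dx in (0, half):
                yield from partition_qt(x + dx, y + dy, half, min_qt_size, want_split)
    else:
        # Either the split was declined or the minimum leaf size was reached.
        yield (x, y, size)
```

For example, a 128x128 block with a minimum leaf size of 64 and a callback that always splits yields four 64x64 leaves.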
Method and apparatus for low-complexity bidirectional intra prediction in video encoding and decoding
A method for video decoding is disclosed. A directional intra prediction mode is decoded for a block of a picture, the directional intra prediction mode having a direction. Based on the directional intra prediction mode, a first predictor and a second predictor for a sample in the current block are accessed, the first and second predictors being on a line at least approximating the direction. A sample value of the sample is predicted by interpolation using the first and second predictors, responsive to a difference between the second and first predictors, wherein the difference is scaled by at least a ratio that is based on a location of the sample in the block, and wherein a denominator of the ratio is a power of two regardless of the location of the sample. The sample of the block is reconstructed based on the predicted sample value.
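The benefit of a power-of-two denominator is that the scaling needs no division, only a multiply and a shift. The sketch below shows this under assumptions of its own: the function names are hypothetical, and the mapping from sample position to ratio numerator (`y + 1` over the block height) is only an illustrative choice, not the one claimed.

```python
from typing import List


def interpolate_sample(p1: int, p2: int, numerator: int, shift: int) -> int:
    """Compute p1 + (numerator / 2**shift) * (p2 - p1) using integer arithmetic.

    `shift` must be at least 1; the denominator 2**shift is a power of two,
    so the scaling is a multiply followed by a right shift.
    """
    delta = p2 - p1
    rounding = 1 << (shift - 1)  # half the denominator, so the shift rounds to nearest
    return p1 + ((numerator * delta + rounding) >> shift)


def predict_column(p1: int, p2: int, height: int) -> List[int]:
    """Predict one column of samples between two predictors.

    Assumes `height` is a power of two (at least 2) and uses the hypothetical
    ratio (y + 1) / height for the sample in row y.
    """
    shift = height.bit_length() - 1  # log2(height)
    return [interpolate_sample(p1, p2, y + 1, shift) for y in range(height)]
```

With predictors 100 and 120 and a block height of 4, `predict_column(100, 120, 4)` steps evenly from the first predictor toward the second.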
Video encoding and decoding method based on entry point information in a slice header, and apparatus using same
A video encoding method includes deriving entry point information specifying an entry point of a substream for a picture. The entry point information may include a number syntax element representing a number of offset syntax elements in a slice header; an offset syntax element representing an entry point offset in bytes between two entry points; and a length syntax element representing a bit length of the offset syntax element. A value of the length syntax element plus one corresponds to the bit length of the offset syntax element and the length syntax element is signaled when the number of the offset syntax elements is larger than 0. A value of the number syntax element corresponds to the number of offset syntax elements in the slice header, and the offset syntax element is signaled when the number of the offset syntax elements is larger than 0.
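The conditional signaling described here can be sketched from the decoder side. The field widths below (8 bits for the number element, 5 bits for the length element) and all names are assumptions made for illustration, since the abstract does not state how these elements are coded.

```python
from typing import List


class BitReader:
    """Reads unsigned integers MSB-first from a bytes object."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current position in bits

    def read(self, nbits: int) -> int:
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


def parse_entry_points(reader: BitReader) -> List[int]:
    """Return the entry point offsets (in bytes) signaled in a slice header.

    Assumed layout (hypothetical):
      num_offsets       : 8 bits
      offset_len_minus1 : 5 bits, present only when num_offsets > 0
      offset[i]         : offset_len_minus1 + 1 bits each, present only when num_offsets > 0
    """
    num_offsets = reader.read(8)
    if num_offsets == 0:
        # Neither the length element nor any offset element is signaled.
        return []
    offset_len = reader.read(5) + 1  # value of the length element plus one
    return [reader.read(offset_len) for _ in range(num_offsets)]
```

Coding the length as "minus one" means a value of 0 still describes a usable 1-bit offset field, so no code point is wasted on a zero-width field.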
Method and apparatus for processing video signals using reduced transform
Provided is a method for decoding a video signal based on a reduced transform, which includes: checking whether a transform skip is applied to a current block; obtaining a transform index indicating a transform kernel of the current block from the video signal when the transform skip is not applied to the current block; determining a region where a primary transform is applied to the current block based on the transform kernel indicated by the transform index and a size of the current block; and performing an inverse primary transform on the region to which the primary transform is applied by using the transform kernel indicated by the transform index.
Arithmetic coding information for parallel octree coding
Aspects of the disclosure include a method for point cloud coding. The method determines whether decoding of occupancy codes of nodes in a range of octree partition depths in an octree partitioning structure of a point cloud reaches a minimum octree partition depth at which parallel decoding is enabled. When the decoding reaches the minimum octree partition depth, arithmetic coding information for decoding the occupancy codes of the nodes in the minimum octree partition depth is stored. The parallel decoding is performed on occupancy codes of the nodes in each of at least one remaining octree partition depth based on the stored arithmetic coding information. The point cloud is reconstructed based on the occupancy codes of the nodes in the range of octree partition depths in the octree partitioning structure.
Reducing latency in video encoding and decoding
Techniques and tools for reducing latency in video encoding and decoding by constraining latency due to reordering of video frames, and by indicating the constraint on frame reordering latency with one or more syntax elements that accompany encoded data for the video frames. For example, a real-time communication tool with a video encoder sets a syntax element that indicates a constraint on frame reordering latency, which is consistent with inter-frame dependencies between multiple frames of a video sequence, then outputs the syntax element. A corresponding real-time communication tool with a video decoder receives the syntax element that indicates the constraint on frame reordering latency, determines the constraint on frame reordering latency based on the syntax element, and uses the constraint on frame reordering latency to determine when a reconstructed frame is ready for output (in terms of output order).
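To make the decoder-side use of the constraint concrete, the sketch below buffers decoded frames and releases them in output order. It relies on a simplified model that is an assumption of this example, not the signaled semantics: once a frame has been followed by `max_latency` frames in coded order, no frame that precedes it in output order is still to arrive. The function name and parameters are hypothetical.

```python
import heapq
from typing import Iterable, Iterator, List, Tuple


def output_frames(coded: Iterable[int], max_latency: int) -> Iterator[int]:
    """Yield output-order indices as soon as the latency constraint allows.

    `coded` lists the output-order index of each frame in coded (decoding) order.
    """
    pending: List[Tuple[int, int]] = []  # min-heap of (output_index, coded_position)
    for pos, out_idx in enumerate(coded):
        heapq.heappush(pending, (out_idx, pos))
        # Emit while the earliest pending frame (in output order) has waited long enough.
        while pending and pos - pending[0][1] >= max_latency:
            yield heapq.heappop(pending)[0]
    # End of sequence: flush whatever is still buffered, in output order.
    while pending:
        yield heapq.heappop(pending)[0]
```

With coded order `[0, 2, 1, 4, 3]` and `max_latency=1`, frames come out as 0, 1, 2, 3, 4 without waiting for the end of the sequence. A decoder that lacks such a bound has to hold frames until its buffer fills, which is the latency the constraint is meant to avoid.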