Patent classifications
H04N19/88
Reducing latency in video encoding and decoding
Techniques and tools are described for reducing latency in video encoding and decoding by constraining latency due to reordering of video frames, and by indicating the constraint on frame reordering latency with one or more syntax elements that accompany encoded data for the video frames. For example, a real-time communication tool with a video encoder sets a syntax element that indicates a constraint on frame reordering latency, which is consistent with inter-frame dependencies between multiple frames of a video sequence, then outputs the syntax element. A corresponding real-time communication tool with a video decoder receives the syntax element that indicates the constraint on frame reordering latency, determines the constraint on frame reordering latency based on the syntax element, and uses the constraint on frame reordering latency to determine when a reconstructed frame is ready for output (in terms of output order).
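The decoder-side use of the constraint can be illustrated with a minimal sketch. It assumes the signaled limit is a frame count, here called `max_latency_frames`, and that once that many later frames have been decoded after a given frame, no frame still to come precedes it in output order. The function name, the parameter name, and this exact reading of the limit are assumptions for illustration and are not taken from the abstract.

```python
import heapq


def output_schedule(coded_order, max_latency_frames):
    """Return (output_index, frames_decoded_when_emitted) pairs.

    coded_order lists each frame's output index in the order frames are decoded.
    max_latency_frames is a hypothetical name for the signaled limit.
    """
    pending = []   # min-heap of (output_index, decode_position)
    schedule = []
    for decode_pos, output_index in enumerate(coded_order):
        heapq.heappush(pending, (output_index, decode_pos))
        # Assumed rule: as soon as any buffered frame has waited for
        # max_latency_frames later frames, emit the earliest frame in
        # output order, and repeat until no buffered frame has waited that long.
        while any(decode_pos - pos >= max_latency_frames for _, pos in pending):
            frame, _ = heapq.heappop(pending)
            schedule.append((frame, decode_pos + 1))
    # End of sequence: flush whatever is still buffered, in output order.
    while pending:
        frame, _ = heapq.heappop(pending)
        schedule.append((frame, len(coded_order)))
    return schedule


if __name__ == "__main__":
    # Frames decoded in the order 0, 2, 1, 4, 3 with a limit of one frame.
    print(output_schedule([0, 2, 1, 4, 3], 1))
```

With a limit of one frame, frame 0 is released as soon as the second frame has been decoded, so the decoder does not have to wait for a larger, worst-case reordering buffer to fill.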
Multifocal display devices and methods
An image transmission device of a multifocal display system is described, for transmitting a set of N composite images based on a set of N primary images to an image reception device. N is greater than or equal to 2, and each of the N primary images has a focus distance associated therewith. The image transmission device includes processing circuitry configured to partition each of the N primary images into image portions and generate the N composite images by placing each image portion of each of the N primary images into one of the N composite images such that each of the N composite images comprises image portions from two or more of the N primary images. A communication interface transmits the N composite images to the image reception device.
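One possible partition-and-placement scheme is sketched below. It assumes each image is a list of rows, that the portions are N horizontal strips, and that strip p of composite j is taken from primary (p + j) mod N. The abstract does not specify how portions are chosen or placed, so the strip layout, the rotation rule, and all function names are hypothetical.

```python
def strip_bounds(height, n):
    """Row ranges of n horizontal strips covering an image of the given height."""
    return [(p * height // n, (p + 1) * height // n) for p in range(n)]


def make_composites(primaries):
    """Build N composites from N equally sized primary images (lists of rows)."""
    n = len(primaries)
    if n < 2:
        raise ValueError("at least two primary images are required")
    height = len(primaries[0])
    composites = []
    for j in range(n):
        rows = []
        for p, (top, bottom) in enumerate(strip_bounds(height, n)):
            source = (p + j) % n  # assumed rotation rule, not from the abstract
            rows.extend(primaries[source][top:bottom])
        composites.append(rows)
    return composites


def recover_primaries(composites):
    """Receiver side: invert the rotation to rebuild the primary images."""
    n = len(composites)
    height = len(composites[0])
    primaries = []
    for i in range(n):
        rows = []
        for p, (top, bottom) in enumerate(strip_bounds(height, n)):
            # Strip p of primary i was placed in composite (i - p) mod n.
            rows.extend(composites[(i - p) % n][top:bottom])
        primaries.append(rows)
    return primaries


if __name__ == "__main__":
    # Three 6-row primaries; every pixel holds the index of its primary image.
    primaries = [[[i] * 4 for _ in range(6)] for i in range(3)]
    composites = make_composites(primaries)
    print([sorted({row[0] for row in c}) for c in composites])
    print(recover_primaries(composites) == primaries)
```

Under this rule every composite carries strips from all N primaries, provided the image height is at least N, and each strip of each primary lands in exactly one composite.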
Dynamic insertion of content via macroblock modification
Systems, methods, and devices for inserting content into a video frame are disclosed herein. A frame of video data encoded to include a plurality of macroblocks is received. An insertion region of the frame for inserting content is defined, the insertion region spanning a subset of the macroblocks. The frame is augmented with a duplication region configured as a non-displayed region, the duplication region including duplicated macroblocks that duplicate the macroblocks of the insertion region. The macroblocks of the insertion region are replaced with replacement macroblocks that encode replacement content.
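The sequence of steps can be sketched on a toy model in which a frame is a grid of opaque macroblock labels. The duplication region is modeled as extra rows appended below the displayed area. That placement, the `SKIP` filler value, and the function name are assumptions made for illustration; a real implementation would operate on encoded macroblock data.

```python
def insert_content(frame, region, replacement):
    """Return (augmented_frame, displayed_rows).

    frame: grid (list of rows) of macroblock stand-ins.
    region: (top, left, height, width) of the insertion region, in macroblocks.
    replacement: height x width grid of replacement macroblocks.
    """
    top, left, height, width = region
    displayed_rows = len(frame)
    cols = len(frame[0])
    augmented = [row[:] for row in frame]

    # Augment the frame with a duplication region below the displayed rows
    # (hypothetical layout); it holds copies of the insertion region's macroblocks.
    for dr in range(height):
        dup_row = ["SKIP"] * cols  # filler for positions outside the copied span
        dup_row[left:left + width] = frame[top + dr][left:left + width]
        augmented.append(dup_row)

    # Replace the macroblocks of the insertion region with the new content.
    for dr in range(height):
        for dc in range(width):
            augmented[top + dr][left + dc] = replacement[dr][dc]

    return augmented, displayed_rows


if __name__ == "__main__":
    frame = [[f"mb{r}{c}" for c in range(4)] for r in range(3)]
    ads = [["ad0", "ad1"], ["ad2", "ad3"]]
    augmented, shown = insert_content(frame, (1, 1, 2, 2), ads)
    for row in augmented:
        print(row)
    print("displayed rows:", shown)
```

The returned `displayed_rows` value stands in for whatever signaling marks the appended rows as non-displayed.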
Apparatus and method for processing point cloud data
According to embodiments, a method for processing point cloud data can encode and transmit point cloud data. According to embodiments, a method for processing point cloud data can also receive and decode point cloud data.
Transform method, encoder, decoder, and storage medium
A transform method includes: determining a prediction mode parameter of a current block; determining a matrix-based intra prediction (MIP) parameter when the prediction mode parameter indicates that MIP is used for the current block to determine an intra prediction value; determining the intra prediction value of the current block according to the MIP parameter, and calculating a residual value between the current block and the intra prediction value; performing a first transform on the residual value to obtain a first coefficient matrix; determining a scanning order of low-frequency non-separable transform (LFNST) coefficients used for the current block according to the MIP parameter when an LFNST is used for the current block; constructing an input coefficient matrix of the LFNST based on the first coefficient matrix according to the scanning order of LFNST coefficients; and performing LFNST processing on the input coefficient matrix to obtain a transform coefficient matrix of the current block.
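The step of choosing a scanning order from the MIP parameter and building the LFNST input can be sketched as follows. The sketch assumes the MIP parameter is a single transpose flag, that a set flag selects a column-first scan of the top-left 4x4 coefficients and a cleared flag a row-first scan, and it uses an identity matrix as a placeholder kernel. None of these choices comes from the abstract, and the kernel is not a standardized LFNST matrix.

```python
def lfnst_input_vector(first_coeffs, mip_transposed, size=4):
    """Scan the top-left size x size primary coefficients into a vector.

    mip_transposed is a hypothetical stand-in for the MIP parameter.
    """
    block = [row[:size] for row in first_coeffs[:size]]
    if mip_transposed:
        # Assumed mapping: column-first (vertical) scan when the flag is set.
        return [block[r][c] for c in range(size) for r in range(size)]
    # Otherwise row-first (horizontal) scan.
    return [block[r][c] for r in range(size) for c in range(size)]


def apply_kernel(kernel, vector):
    """Non-separable transform as a plain matrix-vector product."""
    return [sum(k * v for k, v in zip(row, vector)) for row in kernel]


if __name__ == "__main__":
    # Toy first-transform output: an 8x8 block whose entries encode their position.
    first_coeffs = [[r * 8 + c for c in range(8)] for r in range(8)]
    identity = [[1 if i == j else 0 for j in range(16)] for i in range(16)]
    for flag in (False, True):
        vector = lfnst_input_vector(first_coeffs, flag)
        print(flag, apply_kernel(identity, vector))
```

Swapping the scan direction is equivalent to transposing the sub-block before it is flattened, which is why a transpose-style parameter is a natural candidate for driving the choice.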
Fast projection method in video-based point cloud compression codecs
An encoding device and a method for point cloud encoding are disclosed. The method for encoding includes segmenting an area including points representing a three-dimensional (3D) point cloud into multiple voxels. The method also includes identifying a normal score for each of the points of the 3D point cloud and a smoothing score for each of the multiple voxels that include at least one of the points of the 3D point cloud. The method further includes grouping each point of the 3D point cloud to one of multiple projection planes based on the normal score and the smoothing score to generate refined patches that represent the 3D point cloud. Additionally, the method includes generating frames that include pixels that represent the refined patches. The method also includes encoding the frames to generate a bitstream and transmitting the bitstream.
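A rough sketch of the grouping step follows. It assumes six axis-aligned projection planes, a normal score equal to the dot product of a point's normal with each plane normal, and a per-voxel smoothing score equal to the fraction of that voxel's points initially assigned to each plane. The scoring formulas, the `weight` parameter, and the function names are illustrative assumptions, and patch generation, frame packing, and encoding are omitted.

```python
from collections import defaultdict

# Normals of six axis-aligned projection planes (assumed set).
PLANES = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (-1, 0, 0), (0, -1, 0), (0, 0, -1)]


def voxel_of(point, voxel_size):
    """Integer voxel coordinates containing the point."""
    return tuple(int(coord // voxel_size) for coord in point)


def normal_scores(normal):
    """Dot product of a point normal with each plane normal."""
    return [sum(n * p for n, p in zip(normal, plane)) for plane in PLANES]


def assign_planes(points, normals, voxel_size=2.0, weight=0.5):
    """Return one plane index per point from normal and smoothing scores."""
    scores = [normal_scores(n) for n in normals]
    initial = [max(range(len(PLANES)), key=s.__getitem__) for s in scores]

    # Smoothing score per occupied voxel: how many of its points initially
    # picked each plane (normalized to a fraction when it is used below).
    votes = defaultdict(lambda: [0] * len(PLANES))
    for point, plane in zip(points, initial):
        votes[voxel_of(point, voxel_size)][plane] += 1

    refined = []
    for point, s in zip(points, scores):
        v = votes[voxel_of(point, voxel_size)]
        total = sum(v)
        combined = [s[k] + weight * v[k] / total for k in range(len(PLANES))]
        refined.append(max(range(len(PLANES)), key=combined.__getitem__))
    return refined


if __name__ == "__main__":
    points = [(0.1, 0.2, 0.3), (0.5, 0.5, 0.5), (1.0, 1.0, 1.0)]
    normals = [(0, 0, 1), (0, 0, 1), (0.6, 0, 0.55)]
    print(assign_planes(points, normals, weight=0.0))
    print(assign_planes(points, normals, weight=0.5))
```

In this toy input the third point's normal slightly favors the +x plane, but the smoothing term pulls it onto the plane chosen by the other points in its voxel. Computing the smoothing term once per voxel rather than once per point neighborhood is the kind of saving the title's "fast projection" suggests, though the abstract does not spell out the details.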
Video coding method and apparatus utilizing group of encoding units
A decoding method comprises the steps of: combining two or more encoding units of maximum size into a single encoding unit group; acquiring encoding data corresponding to the combined single encoding unit group; and decoding, according to a decoding order, the two or more encoding units of maximum size contained in the single encoding unit group. Also disclosed is a block partitioning structure used for encoding and decoding video.
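The grouping and ordering idea can be illustrated with a small sketch that lists maximum-size encoding units group by group. It assumes rectangular groups visited in raster order, with raster order inside each group as well. The abstract fixes neither the group shape nor the decoding order, so both are assumptions, as is the function name.

```python
def group_decoding_order(cols, rows, group_w, group_h):
    """Return a list of groups, each a list of (x, y) unit positions.

    cols, rows: picture size in maximum-size encoding units.
    group_w, group_h: assumed rectangular group size, in units.
    """
    groups = []
    for gy in range(0, rows, group_h):
        for gx in range(0, cols, group_w):
            group = [
                (x, y)
                for y in range(gy, min(gy + group_h, rows))
                for x in range(gx, min(gx + group_w, cols))
            ]
            groups.append(group)
    return groups


if __name__ == "__main__":
    # A picture of 4 x 2 maximum-size units, combined into 2 x 2 groups.
    for group in group_decoding_order(4, 2, 2, 2):
        print(group)
```

Groups at the right and bottom picture edges are simply clipped in this sketch when the picture size is not a multiple of the group size.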