Patent classifications
H04N19/187
Processing of Filler Data Units in Video Streams
A method of video processing includes performing a conversion between a video and a bitstream of the video according to a rule, wherein the rule specifies that a sub-bitstream extraction process is implemented to generate a sub-bitstream for decoding, wherein the sub-bitstream extraction process is configured to extract, from the bitstream, a sub-bitstream with a target highest temporal identifier, and wherein, the rule specifies that, during the extracting, upon removing a video coding layer (VCL) network abstraction layer (NAL) unit, filler data units and filler supplemental enhancement information (SEI) messages in SEI NAL units that are associated with the VCL NAL unit are also removed.
Processing of Filler Data Units in Video Streams
A method of video processing includes performing a conversion between a video and a bitstream of the video according to a rule, wherein the rule specifies that a sub-bitstream extraction process is implemented to generate a sub-bitstream for decoding, wherein the sub-bitstream extraction process is configured to extract, from the bitstream, a sub-bitstream with a target highest temporal identifier, and wherein, the rule specifies that, during the extracting, upon removing a video coding layer (VCL) network abstraction layer (NAL) unit, filler data units and filler supplemental enhancement information (SEI) messages in SEI NAL units that are associated with the VCL NAL unit are also removed.
Optimization of multi-sink Wi-Fi display with intelligent multi-session encoding
Systems, apparatuses, and methods may provide for multi-session encoding to optimize multiple encoding sessions on Wi-Fi display (WFD) source devices when the WFD source devices are connected to multiple sink devices. The multiple encoding sessions may be optimized with encoding hints that are generated by a compositor and transmitted to a pre-encoding checking device. The encoding session that has the highest encoding resolution is subjected to hierarchical motion estimation (HME) processing, and the encoding sessions that have lower resolutions are optimized based on a motion vector prediction hint generated by the encoding session that has the highest encoding resolution and a scaling factor.
Optimization of multi-sink Wi-Fi display with intelligent multi-session encoding
Systems, apparatuses, and methods may provide for multi-session encoding to optimize multiple encoding sessions on Wi-Fi display (WFD) source devices when the WFD source devices are connected to multiple sink devices. The multiple encoding sessions may be optimized with encoding hints that are generated by a compositor and transmitted to a pre-encoding checking device. The encoding session that has the highest encoding resolution is subjected to hierarchical motion estimation (HME) processing, and the encoding sessions that have lower resolutions are optimized based on a motion vector prediction hint generated by the encoding session that has the highest encoding resolution and a scaling factor.
Data delivery device, system, method, and recording medium
[Problem] In order to improve the likelihood of being able to continue delivery even during degradation of communication quality and improve the efficiency of use of the capability of a transmission path, even with hierarchical encoding having a small number of hierarchies, the present invention comprises: measuring the communication quality of the transmission path with a data receiving device; determining a hierarchical structure of hierarchical data on the basis of the results of measuring the communication quality; generating the hierarchical data of the hierarchical structure by hierarchical encoding of input data; and delivering delivery data of at least some hierarchies of the hierarchical data to the data receiving device.
Subpicture layout and partial output with layers
There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining video data, parsing a video parameter set (VPS) syntax of the video data, determining whether a value of a syntax element of the VPS syntax indicates a picture order count (POC) value of an access unit (AU) of the video data, and setting at least one of a plurality of pictures, slices, and tiles of the video data to the AU based on the value of the syntax element.
Subpicture layout and partial output with layers
There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining video data, parsing a video parameter set (VPS) syntax of the video data, determining whether a value of a syntax element of the VPS syntax indicates a picture order count (POC) value of an access unit (AU) of the video data, and setting at least one of a plurality of pictures, slices, and tiles of the video data to the AU based on the value of the syntax element.
ADAPTIVE QUANTIZATION FOR ENHANCEMENT LAYER VIDEO CODING
Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels.
ADAPTIVE QUANTIZATION FOR ENHANCEMENT LAYER VIDEO CODING
Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels.
Indication of max sublayer numbers in multilayered video stream
A method and device identify a maximum number of temporal sublayers that is allowed in a coded video sequence referring to a parameter set. A coded video sequence is decoded based on the identified maximum number of temporal sublayers that is allowed in the coded video sequence referring to the parameter set.