Patent classifications
H04N19/30
IMAGE CODING METHOD ON BASIS OF ENTRY POINT-RELATED INFORMATION IN VIDEO OR IMAGE CODING SYSTEM
According to the disclosures of the present document, entry point-related information/parameter can be signaled by means of upper-level syntax, thereby allowing the amount of data necessary to be signaled for video/image coding to be reduced and coding efficiency to be increased.
CONTENT-ADAPTIVE ONLINE TRAINING FOR DNN-BASED CROSS COMPONENT PREDICTION WITH SCALING FACTORS
A method and apparatus for neural network based cross component prediction with scaling factors during encoding or decoding of an image frame or a video sequence, which may include training a deep neural network (DNN) cross component prediction (CCP) model with at least one or more scaling factors, wherein the at least one or more scaling factors are learned by optimizing a rate-distortion loss based on an input video sequence comprising a luma component, and reconstructing a chroma component based on the luma component using the trained DNN CCP model with the at least one or more scaling factors for chroma prediction. The trained DNN CCP may be updated for chroma prediction of the input video sequence using the one or more scaling factors, and performing chroma prediction of the input video sequence using the updated DNN CCP model with the one or more scaling factors.
CONTENT-ADAPTIVE ONLINE TRAINING FOR DNN-BASED CROSS COMPONENT PREDICTION WITH SCALING FACTORS
A method and apparatus for neural network based cross component prediction with scaling factors during encoding or decoding of an image frame or a video sequence, which may include training a deep neural network (DNN) cross component prediction (CCP) model with at least one or more scaling factors, wherein the at least one or more scaling factors are learned by optimizing a rate-distortion loss based on an input video sequence comprising a luma component, and reconstructing a chroma component based on the luma component using the trained DNN CCP model with the at least one or more scaling factors for chroma prediction. The trained DNN CCP may be updated for chroma prediction of the input video sequence using the one or more scaling factors, and performing chroma prediction of the input video sequence using the updated DNN CCP model with the one or more scaling factors.
Picture coding and decoding
A picture with multiple slices is encoded by generating a coded slice representation for each of the slices. A slice flag is set to a first value for the first slice in the picture and corresponding slice flags of the remaining slices are set to a second defined value. A respective slice address is generated for each remaining slice to enable identification of the slice start position within the picture for the slice. A coded picture representation of the picture comprises the coded slice representations, the slice addresses and the slice flags. The slice flags enable differentiation between slices for which slice addresses are required and the slice per picture for which no slice address is needed to identify its slice start position.
Picture coding and decoding
A picture with multiple slices is encoded by generating a coded slice representation for each of the slices. A slice flag is set to a first value for the first slice in the picture and corresponding slice flags of the remaining slices are set to a second defined value. A respective slice address is generated for each remaining slice to enable identification of the slice start position within the picture for the slice. A coded picture representation of the picture comprises the coded slice representations, the slice addresses and the slice flags. The slice flags enable differentiation between slices for which slice addresses are required and the slice per picture for which no slice address is needed to identify its slice start position.
Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device
A three-dimensional data encoding method includes: (i) when a number of three-dimensional points included in point cloud data to be encoded is n that is greater than a predetermined number, n being an integer greater than or equal to 2, calculating an encoding coefficient by generating a hierarchical structure in which each of n pieces of attribute information on the three-dimensional points is sorted into one of a higher frequency component and a lower frequency component to be layered, and generating a bitstream including the encoding coefficient calculated in the calculating; and (ii) when a number of three-dimensional points included in the point cloud data is m that is smaller than or equal to the predetermined number, m being an integer greater than or equal to 1, generating a bitstream in accordance with m pieces of attribute information on the three-dimensional points without generating a hierarchy structure.
IMAGE ENCODING DEVICE AND METHOD AND IMAGE DECODING DEVICE AND METHOD
The present disclosure relates to an image encoding device and method and an image decoding device and method, which are capable of suppressing an increase in encoding or decoding workload. A current layer of image data including a plurality of layers is encoded and/or decoded with reference to encoding-related information of some areas, of another layer encoded for each of a plurality of certain areas obtained by dividing a picture, according to control of control information used to control the certain area in which the encoding-related information of the other layer is referred to regarding the current layer of the image data. The present disclosure can be applied to image processing devices such as an image encoding device for performing scalable coding on image data and an image decoding device for decoding an encoded data obtained by performing scalable coding on image data
IMAGE ENCODING DEVICE AND METHOD AND IMAGE DECODING DEVICE AND METHOD
The present disclosure relates to an image encoding device and method and an image decoding device and method, which are capable of suppressing an increase in encoding or decoding workload. A current layer of image data including a plurality of layers is encoded and/or decoded with reference to encoding-related information of some areas, of another layer encoded for each of a plurality of certain areas obtained by dividing a picture, according to control of control information used to control the certain area in which the encoding-related information of the other layer is referred to regarding the current layer of the image data. The present disclosure can be applied to image processing devices such as an image encoding device for performing scalable coding on image data and an image decoding device for decoding an encoded data obtained by performing scalable coding on image data
SUBPICTURE INFORMATION SIGNALING IN VIDEO BITSTREAMS
Methods, apparatus, systems for video processing, including video encoding or video decoding are described. One example method includes performing a conversion between a video including a video picture that includes one or more subpictures and a bitstream of the video. The bitstream conforms to a format rule specifying that a first syntax element indicating whether an operation is performed across a boundary of a subpicture in the coded layer video sequence is selectively included responsive to a number of subpictures in the video picture.
SUBPICTURE INFORMATION SIGNALING IN VIDEO BITSTREAMS
Methods, apparatus, systems for video processing, including video encoding or video decoding are described. One example method includes performing a conversion between a video including a video picture that includes one or more subpictures and a bitstream of the video. The bitstream conforms to a format rule specifying that a first syntax element indicating whether an operation is performed across a boundary of a subpicture in the coded layer video sequence is selectively included responsive to a number of subpictures in the video picture.