H04N19/136

Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device

A three-dimensional data encoding method includes encoding information of a current node included in an N-ary tree structure of three-dimensional points included in three-dimensional data, where N is an integer greater than or equal to 2. In the encoding, first information is encoded, the first information indicating a range for one or more referable neighboring nodes among neighboring nodes spatially neighboring the current node, and the current node is encoded with reference to a neighboring node within the range.

ARTIFICIAL INTELLIGENCE-BASED IMAGE ENCODING APPARATUS AND METHOD AND DECODING APPARATUS AND METHOD

A method of decoding an image based on cross-channel prediction using artificial intelligence (AI) includes obtaining cross-channel prediction information by applying feature data for cross-channel prediction to a neural-network-based cross-channel decoder, obtaining a predicted image of a chroma image by performing cross-channel prediction based on a reconstructed luma image and the cross-channel prediction information, obtaining a residual image of the chroma image by applying feature data of the chroma image to a neural-network-based chroma residual decoder, and reconstructing the chroma image based on the predicted image and the residual image.

ARTIFICIAL INTELLIGENCE-BASED IMAGE ENCODING APPARATUS AND METHOD AND DECODING APPARATUS AND METHOD

A method of decoding an image based on cross-channel prediction using artificial intelligence (AI) includes obtaining cross-channel prediction information by applying feature data for cross-channel prediction to a neural-network-based cross-channel decoder, obtaining a predicted image of a chroma image by performing cross-channel prediction based on a reconstructed luma image and the cross-channel prediction information, obtaining a residual image of the chroma image by applying feature data of the chroma image to a neural-network-based chroma residual decoder, and reconstructing the chroma image based on the predicted image and the residual image.

Top palette colors selection using sorting for palette mode in video encoding

An encoding method is provided which includes receiving a plurality of images, obtaining values of elements in a portion of the images, sorting the elements according to different values of the elements, sorting the elements according to a number of occurrences of the different values and encoding the elements using a subset of the different values having corresponding numbers of occurrences that are higher than corresponding numbers of occurrences of other values. Examples also include a processing device and method for use with palette mode encoding in which the elements are a portion of pixels in images and the values are color values of the portion of pixels in the images.

Top palette colors selection using sorting for palette mode in video encoding

An encoding method is provided which includes receiving a plurality of images, obtaining values of elements in a portion of the images, sorting the elements according to different values of the elements, sorting the elements according to a number of occurrences of the different values and encoding the elements using a subset of the different values having corresponding numbers of occurrences that are higher than corresponding numbers of occurrences of other values. Examples also include a processing device and method for use with palette mode encoding in which the elements are a portion of pixels in images and the values are color values of the portion of pixels in the images.

Event/object-of-interest centric timelapse video generation on camera device with the assistance of neural network input
11594254 · 2023-02-28 · ·

An apparatus including an interface and a processor. The interface may be configured to receive pixel data generated by a capture device. The processor may be configured to generate video frames in response to the pixel data, perform computer vision operations on the video frames to detect objects, perform a classification of the objects detected based on characteristics of the objects, determine whether the classification of the objects corresponds to a user-defined event and generate encoded video frames from the video frames. The encoded video frames may be communicated to a cloud storage service. The encoded video frames may comprise a first sample of the video frames selected at a first rate when the user-defined event is not detected and a second sample of the video frames selected at a second rate while the user-defined event is detected. The second rate may be greater than the first rate.

Method and apparatus for storage and signaling of sub-sample entry descriptions

Described are methods, apparatuses and computer program products for signaling and storing compressed point clouds. Sub-sample entries associated with sequences of sub-samples within sequences of samples may indicate whether sequences of sub-samples were encapsulated alone in a track, without other sub-samples or additional header data. Sub-sample entry types can be indexed at track-level sub-sample description boxes. Point cloud compression coded bitstream component types may be signaled by including respective point cloud unit header information in a codec-specific parameters-related field of track level sub-sample description boxes. Sub-sample information boxes may indicate sub-sample entry indices for respective sub-samples. A flag in such information boxes may indicate the presence of sub-sample description entry indexes. Description index boxes can contain sub-sample description entry indexes in the same container as sub-sample information boxes. Track fragment header boxes can include sub-sample description entry indices that apply to samples of a track fragment.

Method and apparatus for storage and signaling of sub-sample entry descriptions

Described are methods, apparatuses and computer program products for signaling and storing compressed point clouds. Sub-sample entries associated with sequences of sub-samples within sequences of samples may indicate whether sequences of sub-samples were encapsulated alone in a track, without other sub-samples or additional header data. Sub-sample entry types can be indexed at track-level sub-sample description boxes. Point cloud compression coded bitstream component types may be signaled by including respective point cloud unit header information in a codec-specific parameters-related field of track level sub-sample description boxes. Sub-sample information boxes may indicate sub-sample entry indices for respective sub-samples. A flag in such information boxes may indicate the presence of sub-sample description entry indexes. Description index boxes can contain sub-sample description entry indexes in the same container as sub-sample information boxes. Track fragment header boxes can include sub-sample description entry indices that apply to samples of a track fragment.

Method and apparatus of transform coefficient coding with TB-level constraint
11595656 · 2023-02-28 · ·

A method and apparatus for encoding or decoding transform coefficients in a video coding system are disclosed. According to this method, a region after zero-outis determined. A remaining number of context-coded bins is determined for the current TB based on a size of the region after zero-out. The current TB is encoded or decoded using context-based coding with a constraint based on the remaining number of context-coded bins. According to another method, a remaining number of context-coded bins is determined for the current TB based on the last significant CG index.

Method and device for encoding/decoding images

A method and a device for encoding/decoding images are disclosed. The method for encoding images comprises the steps of: deriving a scan type of a residual signal for a current block according to whether or not the current block is a transform skip block; and applying the scan type to the residual signal for the current block, wherein the transform skip block is a block to which transform for the current block is not applied and is specified on the basis of information indicating whether or not transform for the current block is to be applied.