H04N19/33

Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device

A three-dimensional data encoding method includes: (i) when a number of three-dimensional points included in point cloud data to be encoded is n that is greater than a predetermined number, n being an integer greater than or equal to 2, calculating an encoding coefficient by generating a hierarchical structure in which each of n pieces of attribute information on the three-dimensional points is sorted into one of a higher frequency component and a lower frequency component to be layered, and generating a bitstream including the encoding coefficient calculated in the calculating; and (ii) when a number of three-dimensional points included in the point cloud data is m that is smaller than or equal to the predetermined number, m being an integer greater than or equal to 1, generating a bitstream in accordance with m pieces of attribute information on the three-dimensional points without generating a hierarchy structure.

VIDEO ENCODING AND DECODING METHOD, APPARATUS AND COMPUTER DEVICE
20220394283 · 2022-12-08 ·

A video encoding/decoding method and apparatus, a computer device, and a storage medium. The method comprises: for each non-key frame in a video frame sequence, dividing a current non-key frame into a plurality of sub-image blocks according to information of an object in the current non-key image, and determining an importance level of each sub-image block; according to a pre-stored first correlation between different importance levels and different resolutions, performing conversion to make each sub-image block in each non-key frame have a resolution corresponding to the importance level of the sub-image block, wherein in the first correlation, a higher importance level corresponds to a higher resolution, and the highest importance level corresponds to a target highest resolution; and performing video encoding on the video frame sequence to obtain encoded video data. By means of the method, memory and bandwidth occupied by a video can be reduced.

VIDEO ENCODING AND DECODING METHOD, APPARATUS AND COMPUTER DEVICE
20220394283 · 2022-12-08 ·

A video encoding/decoding method and apparatus, a computer device, and a storage medium. The method comprises: for each non-key frame in a video frame sequence, dividing a current non-key frame into a plurality of sub-image blocks according to information of an object in the current non-key image, and determining an importance level of each sub-image block; according to a pre-stored first correlation between different importance levels and different resolutions, performing conversion to make each sub-image block in each non-key frame have a resolution corresponding to the importance level of the sub-image block, wherein in the first correlation, a higher importance level corresponds to a higher resolution, and the highest importance level corresponds to a target highest resolution; and performing video encoding on the video frame sequence to obtain encoded video data. By means of the method, memory and bandwidth occupied by a video can be reduced.

CANVAS SIZE SCALABLE VIDEO CODING

Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.

CANVAS SIZE SCALABLE VIDEO CODING

Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.

INFORMATION PROCESSING APPARATUS, CONTROL METHOD, STORAGE MEDIUM, AND INFORMATION PROCESSING SYSTEM
20220385921 · 2022-12-01 ·

An information processing apparatus that communicates with a storage device that stores, at a plurality of resolutions, tile images obtained by dividing a picked-up image into a plurality of predetermined regions and to which encoding processing including intra-frame encoding and inter-frame encoding has been performed comprises: a designation unit for designating a predetermined region of the picked-up image for a preset tour in advance; a request unit for requesting the storage device to transmit the tile image including the predetermined region with higher resolution from among the plurality of resolutions; and a display control unit for cutting out the predetermined region from the tile image and cause a display unit to display the cut-out predetermined region, wherein, when the predetermined region is designated in advance, wherein, when the predetermined region is designated in advance, the request unit requests the storage device to transmit the tile image including the predetermined region.

Processing of motion information in multidimensional signals through motion zones and auxiliary information through auxiliary zones

Computer processor hardware receives zone information specifying multiple elements of a rendition of a signal belonging to a zone. The computer processor hardware also receives motion information associated with the zone. The motion information can be encoded to indicate to which corresponding element in a reference signal each of the multiple elements in the zone pertains. For each respective element in the zone as specified by the zone information, the computer processor hardware utilizes the motion information to derive a corresponding location value in the reference signal; the corresponding location value indicates a location in the reference signal to which the respective element pertains.

Processing of motion information in multidimensional signals through motion zones and auxiliary information through auxiliary zones

Computer processor hardware receives zone information specifying multiple elements of a rendition of a signal belonging to a zone. The computer processor hardware also receives motion information associated with the zone. The motion information can be encoded to indicate to which corresponding element in a reference signal each of the multiple elements in the zone pertains. For each respective element in the zone as specified by the zone information, the computer processor hardware utilizes the motion information to derive a corresponding location value in the reference signal; the corresponding location value indicates a location in the reference signal to which the respective element pertains.

Method and apparatus for video-encoding/decoding using filter information prediction

Provided is a scalable video-decoding method based on multiple layers. The scalable video-decoding method according to the present invention comprises: a step of predicting first filter information of a video to be filtered using the information contained in an object layer and/or information contained in another layer, and generating second filter information in accordance with the prediction; and a step of filtering the video to be filtered using the second filter information. According to the present invention, the amount of information being transmitted is reduced, and video compression performance is improved.

Deep learning based on image encoding and decoding
11593632 · 2023-02-28 · ·

A deep learning based compression (DLBC) system trains multiple models that, when deployed, generates a compressed binary encoding of an input image that achieves a reconstruction quality and a target compression ratio. The applied models effectively identifies structures of an input image, quantizes the input image to a target bit precision, and compresses the binary code of the input image via adaptive arithmetic coding to a target codelength. During training, the DLBC system reconstructs the input image from the compressed binary encoding and determines the loss in quality from the encoding process. Thus, the models can be continually trained to, when applied to an input image, minimize the loss in reconstruction quality that arises due to the encoding process while also achieving the target compression ratio.