Patent classifications
H04N19/20
HYBRID CODING ORDER FOR POINT CLOUD CODING
An apparatus for point cloud coding, includes processing circuitry that receives a coded bitstream for a point cloud. The coded bitstream includes encoded data for nodes in an octree structure for the point cloud corresponding to three dimensional (3D) partitions of a space of the point cloud, node sizes of the nodes being associated with sizes of the corresponding 3D partitions of the nodes. The processing circuitry decodes, from the coded bitstream, a first set of occupancy codes for a first set of nodes in the nodes using a first coding order and a second set of occupancy codes for a second set of nodes in the nodes using a second coding order that is different from the first coding order. Further, the processing circuitry reconstructs the octree structure based on at least the first set of occupancy codes and the second set of occupancy codes.
IMAGE PROCESSING METHOD, ELECTRONIC DEVICE, AND IMAGE DISPLAY SYSTEM
An image processing method includes: acquiring first image data of a first image, the first image data including pixel values of a plurality of pixels in the first image; a first compression-allowed region existing in the first image, obtaining region expression information of the first compression-allowed region, the first compression-allowed region including a region where a plurality of first pixels are located, and a difference between pixel values of any two first pixels in the plurality of first pixels being within a preset range; determining a region pixel value of the first compression-allowed region according to a pixel value of at least one first pixel in the first compression-allowed region; and generating second image data of the first image, the second image data including region expression information and the region pixel value of the first compression-allowed region.
HAPTIC ATLAS CODING AND DECODING FORMAT
Methods and devices for encoding and decoding a data stream representative of a 3D volumetric scene comprising haptic features associated with objects of the 3D scene are disclosed. At the encoding, haptic features are associated with objects of the scene, for instance as haptic maps. Haptic components are stored in points of the 3D scene as color may be. These components are projected onto patch pictures which are packed in atlas images. At the decoding, haptic components are un-projected onto reconstructed points as color may be according to the depth component of pixels of the decoded atlases.
HAPTIC ATLAS CODING AND DECODING FORMAT
Methods and devices for encoding and decoding a data stream representative of a 3D volumetric scene comprising haptic features associated with objects of the 3D scene are disclosed. At the encoding, haptic features are associated with objects of the scene, for instance as haptic maps. Haptic components are stored in points of the 3D scene as color may be. These components are projected onto patch pictures which are packed in atlas images. At the decoding, haptic components are un-projected onto reconstructed points as color may be according to the depth component of pixels of the decoded atlases.
RANK INFORMATION IN IMMERSIVE MEDIA PROCESSING
Methods, apparatus, and systems for providing consistent immersive media viewing experiences to user while reducing bandwidth consumption are disclosed. In one example aspect, a method for processing multimedia content includes determining, for a conversion between a frame of panoramic media content comprising multiple segments and a bitstream representation of the frame of panoramic media content, multiple sets of rank information associated with the frame. Each set of the rank information indicates a priority level for processing a segment of the frame of panoramic media content. The method also includes performing the conversion based on the multiple sets of rank information.
RANK INFORMATION IN IMMERSIVE MEDIA PROCESSING
Methods, apparatus, and systems for providing consistent immersive media viewing experiences to user while reducing bandwidth consumption are disclosed. In one example aspect, a method for processing multimedia content includes determining, for a conversion between a frame of panoramic media content comprising multiple segments and a bitstream representation of the frame of panoramic media content, multiple sets of rank information associated with the frame. Each set of the rank information indicates a priority level for processing a segment of the frame of panoramic media content. The method also includes performing the conversion based on the multiple sets of rank information.
Generative adversarial neural network assisted reconstruction
A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
Generative adversarial neural network assisted reconstruction
A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
Method and device for coding the geometry of a point cloud
The present principles relate to a method and device method for encoding depth values of orthogonally projected points of a point cloud onto a projection plane. The present principles also relate to a method and device for decoding a point cloud, a computer readable program and a video signal.
Apparatus and method of using AI metadata related to image quality
An image providing apparatus configured to generate, by using a first artificial intelligence (AI) network, AI metadata including class information and at least one class map, in which the class information includes at least one class corresponding to a type of an object among a plurality of predefined objects included in a first image and the at least one class map indicates a region corresponding to each class in the first image, generate an encoded image by encoding the first image, and output the encoded image and the AI metadata through the output interface.