H04N19/23

METHOD FOR COMPRESSING A SEQUENCE OF IMAGES DISPLAYING SYNTHETIC GRAPHICAL ELEMENTS OF NON-PHOTOGRAPHIC ORIGIN

Method for compressing a sequence of images comprising a first image and a second image, the method comprising the steps of: generating a first descriptor comprising parameters for displaying a computer-generated graphical element in the first image, the graphical element being of non-photographic origin, and the display parameters not comprising pixel values; processing the second image so as to determine an event which gave rise to a potential variation in the parameters for displaying the graphical element between the first image and the second image; generating a second descriptor comprising an event code indicating the determined event.
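
The two descriptors in this abstract can be pictured with a minimal sketch. It assumes the display parameters of the graphical element in the second image have already been extracted; the abstract does not say how the second image is processed. The field names, the `EventCode` values, and `detect_event` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from enum import IntEnum


class EventCode(IntEnum):
    # Hypothetical event codes; the abstract does not enumerate any.
    NONE = 0
    MOVE = 1
    RESIZE = 2


@dataclass(frozen=True)
class FirstDescriptor:
    # Display parameters only: no pixel values are stored.
    element_id: int
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class SecondDescriptor:
    # Carries an event code instead of a full parameter set.
    element_id: int
    event: EventCode


def detect_event(first: FirstDescriptor, observed: FirstDescriptor) -> SecondDescriptor:
    """Compare parameters observed in the second image with the first descriptor."""
    if (observed.width, observed.height) != (first.width, first.height):
        event = EventCode.RESIZE
    elif (observed.x, observed.y) != (first.x, first.y):
        event = EventCode.MOVE
    else:
        event = EventCode.NONE
    return SecondDescriptor(first.element_id, event)
```

In this sketch the second image costs one small event code per graphical element rather than a block of pixel data, which is the saving the abstract points at.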

METHOD AND APPARATUS FOR PROCESSING NON-SEQUENTIAL POINT CLOUD MEDIA, DEVICE, AND STORAGE MEDIUM
20230048474 · 2023-02-16

This application provides a method and apparatus for processing non-sequential point cloud media, a device, and a storage medium. The method includes: processing non-sequential point cloud data of a static object using a Geometry-based Point Cloud Compression (GPCC) coding scheme to obtain a GPCC bitstream; encapsulating the GPCC bitstream to generate an item of at least one GPCC region; encapsulating the item of the at least one GPCC region to generate at least one piece of non-sequential point cloud media of the static object; transmitting media presentation description (MPD) signaling of the at least one piece of non-sequential point cloud media; receiving a first request message transmitted by a video playback device; and transmitting first non-sequential point cloud media, the item of the GPCC region being used to represent a GPCC component of a three-dimensional (3D) spatial region corresponding to the GPCC region, and the non-sequential point cloud media including: an identifier of the static object, so that a user can purposefully request non-sequential point cloud media of a same static object a plurality of times, thereby improving the user experience.

PICTURE ENCODING/DECODING METHOD AND RELATED APPARATUS

A picture encoding/decoding method and a related apparatus are provided. The picture decoding method includes: obtaining a current picture; selecting, from a knowledge base, K reference pictures of the current picture, where K is an integer greater than or equal to 1 and at least one picture in the knowledge base does not belong to the random access segment in which the current picture is located; and decoding the current picture according to the K reference pictures.
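
The selection step can be illustrated with a toy sketch. The abstract does not state how the K reference pictures are chosen, so this example assumes each picture is summarized by a short feature vector and picks the K knowledge-base entries with the smallest sum of absolute differences. `select_references`, the feature vectors, and the picture identifiers are all hypothetical.

```python
from typing import List, Sequence, Tuple

# A knowledge-base entry: (picture identifier, feature vector).
Entry = Tuple[str, Sequence[float]]


def select_references(current: Sequence[float],
                      knowledge_base: List[Entry],
                      k: int) -> List[str]:
    """Return the identifiers of the k entries closest to the current picture."""
    if k < 1:
        raise ValueError("K must be an integer greater than or equal to 1")

    def distance(entry: Entry) -> float:
        # Assumed similarity measure: sum of absolute feature differences.
        return sum(abs(a - b) for a, b in zip(current, entry[1]))

    ranked = sorted(knowledge_base, key=distance)
    return [picture_id for picture_id, _ in ranked[:k]]
```

Because the knowledge base is searched as a whole, a selected reference may come from outside the random access segment that contains the current picture.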

INFORMATION PROCESSING DEVICE AND METHOD

The present disclosure relates to an information processing device and a method capable of more easily reproducing 3D data using spatial scalability.

The device encodes 2D data that is obtained by two-dimensionally converting a point cloud (a set of points representing an object having a three-dimensional shape) and that corresponds to spatial scalability; generates a bitstream including a sub-bitstream obtained by encoding the point cloud corresponding to one or more layers of the spatial scalability; generates spatial scalability information regarding the spatial scalability of the sub-bitstream; and generates a file that stores the generated bitstream and the generated spatial scalability information. The present disclosure can be applied to, for example, an information processing device, an information processing method, or the like.

METHOD AND DEVICE FOR GENERATING SPEECH MOVING IMAGE
20220398793 · 2022-12-15

A device for generating a speech moving image according to an embodiment includes: a first encoder that receives a person background image, which is the video part of a speech moving image of a person and in which the portion related to the person's speech is covered with a mask, extracts an image feature vector from the person background image, and compresses the extracted image feature vector; a second encoder that receives a speech audio signal, which is the audio part of the speech moving image, extracts a voice feature vector from the speech audio signal, and compresses the extracted voice feature vector; a combination unit that generates a combination vector from the compressed image feature vector and the compressed voice feature vector; and an image reconstruction unit that reconstructs the speech moving image of the person with the combination vector as an input.

Encoding apparatus, decoding apparatus, encoding method, decoding method, and storage medium
11516503 · 2022-11-29

An encoding apparatus includes a memory and a processor coupled to the memory and configured to: perform intra-screen prediction encoding on an image of a region of a still image cut out from a first decoded image corresponding to a screen image before a predetermined region is switched from a video to a still image; generate a second decoded image by decoding the information on which the intra-screen prediction encoding was performed; and, when a first screen image after the predetermined region is switched from a video to a still image is received, perform inter-screen prediction encoding on an image of a region of a still image cut out from that first screen image, by referring to the generated second decoded image.

VIDEO ENCODING METHOD, VIDEO ENCODING APPARATUS AND COMPUTER PROGRAM

A video encoding method includes: a provisional image generation step of generating one provisional image from a plurality of frames to be coded; a transformation step of transforming the generated provisional image to a transformed image having the same number of pixels as that of each of the plurality of frames to be coded; and a prediction image generation step of generating a prediction image for each of the frames to be coded, using the transformed image as a reference image.
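
The three steps can be walked through with a toy sketch on grayscale frames stored as nested lists. The abstract does not specify how the provisional image is built or transformed, so this example assumes a pixel-wise mean for the provisional image and a nearest-neighbor resize for the transformation (an identity when the sizes already match). All function names are hypothetical.

```python
from typing import List

Image = List[List[float]]


def make_provisional(frames: List[Image]) -> Image:
    """Assumed rule: pixel-wise mean of equally sized frames."""
    n = len(frames)
    h, w = len(frames[0]), len(frames[0][0])
    return [[sum(f[y][x] for f in frames) / n for x in range(w)]
            for y in range(h)]


def resize_nearest(img: Image, out_h: int, out_w: int) -> Image:
    """Transform an image to out_h x out_w pixels by nearest-neighbor sampling."""
    in_h, in_w = len(img), len(img[0])
    return [[img[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
            for y in range(out_h)]


def residuals(frames: List[Image], reference: Image) -> List[Image]:
    """Use the reference directly as the prediction image for each frame."""
    return [[[f[y][x] - reference[y][x] for x in range(len(f[0]))]
             for y in range(len(f))]
            for f in frames]


frames = [[[0, 2], [4, 6]], [[2, 4], [6, 8]]]
provisional = make_provisional(frames)
reference = resize_nearest(provisional, len(frames[0]), len(frames[0][0]))
diffs = residuals(frames, reference)
```

A real encoder would apply motion-compensated prediction against the reference image rather than a direct per-pixel subtraction; the subtraction here only shows that a single shared reference serves every frame in the group.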
