H04N19/167

Triangulation-Based Adaptive Subsampling of Dense Motion Vector Fields

The present disclosure relates to an apparatus and a method for providing a plurality of motion vectors related to an image coded in a bitstream, comprising obtaining a set of sample positions within the image, obtaining respective motion vectors associated with the set of sample positions, deriving an additional motion vector based on information coded in the bitstream, determining an additional sample position located within a triangle, which is formed by three sample positions of the set of sample positions, based on the respective motion vectors associated with the three sample positions, the triangle not including any other sample position of the set, adding the additional sample position to the set of sample positions, and associating the derived additional motion vector with the additional sample position. Such method and apparatus provide for an efficient motion flow compression and reconstruction enabling to recover some sample positions without signaling them in the bitstream.

Triangulation-Based Adaptive Subsampling of Dense Motion Vector Fields

The present disclosure relates to an apparatus and a method for providing a plurality of motion vectors related to an image coded in a bitstream, comprising obtaining a set of sample positions within the image, obtaining respective motion vectors associated with the set of sample positions, deriving an additional motion vector based on information coded in the bitstream, determining an additional sample position located within a triangle, which is formed by three sample positions of the set of sample positions, based on the respective motion vectors associated with the three sample positions, the triangle not including any other sample position of the set, adding the additional sample position to the set of sample positions, and associating the derived additional motion vector with the additional sample position. Such method and apparatus provide for an efficient motion flow compression and reconstruction enabling to recover some sample positions without signaling them in the bitstream.

RIO-BASED VIDEO CODING METHOD AND DEIVICE

A video recording method and a video recording device are provided. The method includes: obtaining video data to be recorded; dividing, based on the video data, each frame of the video data into a region of interest and a background region by using a preset neural network model; and encoding the region of interest of the video data based on a first encoding bit rate, and the background region based on a second bit rate, and storing the encoded video data into a storage device through a video buffer.

RIO-BASED VIDEO CODING METHOD AND DEIVICE

A video recording method and a video recording device are provided. The method includes: obtaining video data to be recorded; dividing, based on the video data, each frame of the video data into a region of interest and a background region by using a preset neural network model; and encoding the region of interest of the video data based on a first encoding bit rate, and the background region based on a second bit rate, and storing the encoded video data into a storage device through a video buffer.

METHOD AND APPARATUS FOR PROCESSING NON-SEQUENTIAL POINT CLOUD MEDIA, DEVICE, AND STORAGE MEDIUM
20230048474 · 2023-02-16 ·

This application provides a method and apparatus for processing non-sequential point cloud media, a device, and a storage medium. The method includes: processing non-sequential point cloud data of a static object using a Geometry-based Point Cloud Compression (GPCC) coding scheme to obtain a GPCC bitstream; encapsulating the GPCC bitstream to generate an item of at least one GPCC region; encapsulating the item of the at least one GPCC region to generate at least one piece of non-sequential point cloud media of the static object; transmitting media presentation description (MPD) signaling of the at least one piece of non-sequential point cloud media; receiving a first request message transmitted by a video playback device; and transmitting first non-sequential point cloud media, the item of the GPCC region being used to represent a GPCC component of a three-dimensional (3D) spatial region corresponding to the GPCC region, and the non-sequential point cloud media including: an identifier of the static object, so that a user can purposefully request non-sequential point cloud media of a same static object a plurality of times, thereby improving the user experience.

METHOD AND APPARATUS FOR PROCESSING NON-SEQUENTIAL POINT CLOUD MEDIA, DEVICE, AND STORAGE MEDIUM
20230048474 · 2023-02-16 ·

This application provides a method and apparatus for processing non-sequential point cloud media, a device, and a storage medium. The method includes: processing non-sequential point cloud data of a static object using a Geometry-based Point Cloud Compression (GPCC) coding scheme to obtain a GPCC bitstream; encapsulating the GPCC bitstream to generate an item of at least one GPCC region; encapsulating the item of the at least one GPCC region to generate at least one piece of non-sequential point cloud media of the static object; transmitting media presentation description (MPD) signaling of the at least one piece of non-sequential point cloud media; receiving a first request message transmitted by a video playback device; and transmitting first non-sequential point cloud media, the item of the GPCC region being used to represent a GPCC component of a three-dimensional (3D) spatial region corresponding to the GPCC region, and the non-sequential point cloud media including: an identifier of the static object, so that a user can purposefully request non-sequential point cloud media of a same static object a plurality of times, thereby improving the user experience.

Using morphological operations to process frame masks in video content

A computer implemented method can decode a frame of video data comprising an array of pixels to obtain decoded luma values and decoded chroma values corresponding to the array of pixels, and extract a frame mask based on the decoded luma values. The frame mask can include an array of mask values respectively corresponding to the array of pixels. A mask value indicates whether a corresponding pixel is in foreground or background of the frame. The method can perform a morphological operation to the frame mask to change one or more mask values to indicate their corresponding pixels are removed from the foreground and added to the background of the frame. The method can also identify foreground pixels after performing the morphological operation to the frame mask, and render a foreground image for display based on the decoded luma values and decoded chroma values of the foreground pixels.

Using morphological operations to process frame masks in video content

A computer implemented method can decode a frame of video data comprising an array of pixels to obtain decoded luma values and decoded chroma values corresponding to the array of pixels, and extract a frame mask based on the decoded luma values. The frame mask can include an array of mask values respectively corresponding to the array of pixels. A mask value indicates whether a corresponding pixel is in foreground or background of the frame. The method can perform a morphological operation to the frame mask to change one or more mask values to indicate their corresponding pixels are removed from the foreground and added to the background of the frame. The method can also identify foreground pixels after performing the morphological operation to the frame mask, and render a foreground image for display based on the decoded luma values and decoded chroma values of the foreground pixels.

Method for alignment across layers in coded video stream

A method, computer program, and computer system is provided for aligning across layers in a coded video stream. A video bitstream having multiple layers is decoded. One or more subpicture regions are identified from among the multiple layers of the decoded video bitstream, the subpicture regions including a background region and one or more foreground subpicture regions. An enhanced subpicture is decoded and displayed based on a determination that a foreground subpicture region is selected. The background region is decoded and displayed based on a determination that a foreground subpicture region was not selected.

Method for alignment across layers in coded video stream

A method, computer program, and computer system is provided for aligning across layers in a coded video stream. A video bitstream having multiple layers is decoded. One or more subpicture regions are identified from among the multiple layers of the decoded video bitstream, the subpicture regions including a background region and one or more foreground subpicture regions. An enhanced subpicture is decoded and displayed based on a determination that a foreground subpicture region is selected. The background region is decoded and displayed based on a determination that a foreground subpicture region was not selected.