H04N19/59

ENCODING METHOD AND APPARATUS, AND DECODING METHOD AND APPARATUS

An encoding apparatus for encoding an image includes: a communicator configured to receive, from a device, device information related to the device; and a processor configured to encode the image by using image information of the image and the device information, wherein the processor is further configured to process the image according to at least one of the device information and the image information, determine a non-encoding region, a block-based encoding region, and a pixel-based encoding region of the image according to at least one of the device information and the image information, performs block-based encoding on the block-based encoding region by using a quantization parameter determined according to at least one of the device information and the image information, perform pixel-based encoding on the pixel-based encoding region, generates an encoded image by entropy encoding a symbol determined by the block-based encoding or the pixel-based encoding, and generate a bitstream comprising the encoded image, region information of the block-based encoding region and the pixel-based encoding region, and quantization information of the quantization parameter, and wherein the communicator is further configured to transmit the bitstream to the device.

SUPER-TRANSFORM VIDEO CODING
20180014031 · 2018-01-11 ·

Super-transform coding may include identifying a plurality of sub-blocks for prediction coding a current block, determining whether to encode the current block using a super-transform, and super-prediction coding the current block. Super-prediction coding may include generating a super-prediction block for the current block by generating a prediction block for each unpartitioned sub-block of the current block, generating a super-prediction block for each partitioned sub-block of the current block by super-prediction coding the sub-block, and including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block. Including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block may include filtering at least a portion of each prediction block and each super-prediction block based on a spatially adjacent prediction block. Super-transform coding may include transforming the super-prediction block for the current block using a corresponding super-transform.

SUPER-TRANSFORM VIDEO CODING
20180014031 · 2018-01-11 ·

Super-transform coding may include identifying a plurality of sub-blocks for prediction coding a current block, determining whether to encode the current block using a super-transform, and super-prediction coding the current block. Super-prediction coding may include generating a super-prediction block for the current block by generating a prediction block for each unpartitioned sub-block of the current block, generating a super-prediction block for each partitioned sub-block of the current block by super-prediction coding the sub-block, and including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block. Including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block may include filtering at least a portion of each prediction block and each super-prediction block based on a spatially adjacent prediction block. Super-transform coding may include transforming the super-prediction block for the current block using a corresponding super-transform.

Adaptive chroma downsampling and color space conversion techniques

A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image and estimating a distortion for each filter chose the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.

Adaptive chroma downsampling and color space conversion techniques

A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image and estimating a distortion for each filter chose the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.

CODING AND DECODING OF AN OMNIDIRECTIONAL VIDEO
20230239503 · 2023-07-27 ·

A process for coding an image of a view from among a plurality of views, including the following steps: selecting a first or a second coding method to code image data from the image; generating a data signal containing information indicating whether it is the first or the second coding method that has been selected, and, if it is the first coding method, coding the original image data so as to provide coded original data, and, if it is the second coding method, coding processed image data from the image obtained by image processing of the original image data so as to provide coded processed data; and coding information describing the image processing which has been applied.

SIMPLIFICATIONS OF CROSS-COMPONENT LINEAR MODEL
20230239482 · 2023-07-27 ·

A computing device performs a method of decoding video data by reconstructing a luma block corresponding to a chroma block; searching a sub-group of a plurality of reconstructed neighboring luma samples in a predefined order to identify a maximum luma sample and a minimum luma sample; computing a down-sampled maximum luma sample corresponding to the maximum luma sample; computing a down-sampled minimum luma sample corresponding to the minimum luma sample; generating a linear model using the down-sampled maximum luma sample, the down-sampled minimum luma sample, the first reconstructed chroma sample, and the second reconstructed chroma sample; computing down-sampled luma samples from luma samples of the reconstructed luma block, wherein each down-sampled luma sample corresponds to a chroma sample of the chroma block; and predicting chroma samples of the chroma block by applying the liner model to the corresponding down-sampled luma samples.

SIMPLIFICATIONS OF CROSS-COMPONENT LINEAR MODEL
20230239482 · 2023-07-27 ·

A computing device performs a method of decoding video data by reconstructing a luma block corresponding to a chroma block; searching a sub-group of a plurality of reconstructed neighboring luma samples in a predefined order to identify a maximum luma sample and a minimum luma sample; computing a down-sampled maximum luma sample corresponding to the maximum luma sample; computing a down-sampled minimum luma sample corresponding to the minimum luma sample; generating a linear model using the down-sampled maximum luma sample, the down-sampled minimum luma sample, the first reconstructed chroma sample, and the second reconstructed chroma sample; computing down-sampled luma samples from luma samples of the reconstructed luma block, wherein each down-sampled luma sample corresponds to a chroma sample of the chroma block; and predicting chroma samples of the chroma block by applying the liner model to the corresponding down-sampled luma samples.

METHOD FOR SIGNALING OF REFERENCE PICTURE RESAMPLING WITH RESAMPLING PICTURE SIZE INDICATION IN VIDEO BITSTREAM
20230239476 · 2023-07-27 · ·

A method, device, and computer-readable medium for decoding an encoded video bitstream using at least one processor, including obtaining a flag indicating that a conformance window is not used for reference picture resampling; based on the flag indicating that the conformance window is not used for the reference picture resampling, determining whether a resampling picture size is signaled; based on determining that the resampling picture size is signaled, determining a resampling ratio based on the resampling picture size; based on determining that the resampling picture size is not signaled, determining the resampling ratio based on an output picture size; and performing the reference picture resampling on a current picture using the resampling ratio.

Pixel-Level Video Prediction with Improved Performance and Efficiency

One aspect provides a machine-learned video prediction model configured to receive and process one or more previous video frames to generate one or more predicted subsequent video frames, wherein the machine-learned video prediction model comprises a convolutional variational auto encoder, and wherein the convolutional variational auto encoder comprises an encoder portion comprising one or more encoding cells and a decoder portion comprising one or more decoding cells.