H04N19/645

IMAGE DATA ENCODING AND DECODING

Apparatus comprises a data encoder configured to derive, from an array of sample values, sample range flags each indicative of whether one or more sample values of the array of sample values lie in a predetermined range of sample values, the data encoder being configured in a first encoding mode to encode the array of sample values, at least in part, by encoding the sample range flags to an output data stream; a predictor configured to predict the state of a group of the sample range flags for a given array of sample values, the group being at least a subset of the sample range flags; and a comparator configured to compare the predicted state of the group of sample range flags with the actual state of the respective sample range flags for the given array of sample values; the data encoder being configured, in response to the comparator, to encode the given array of samples values in a second encoding mode in which the encoder is configured to encode to the output data stream a predetermined number of indicator flags, fewer than the group of sample range flags, to indicate whether the predicted state of the group of sample range flags is the same as the actual state of the group of sample range flags for the given array of sample values.

Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor

The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.

Temporal prediction shifting for scalable video coding
11323734 · 2022-05-03 · ·

A method includes receiving an input video stream and scaling the input video stream into two or more spatial layers. For each spatial layer, the method also includes generating a temporal layer prediction pattern by: obtaining a temporal base layer for a corresponding spatial layer; identifying, based on the temporal base layer, a plurality of temporal layers and a plurality of temporal time slots during a temporal period; and aligning the temporal base layer for the corresponding spatial layer with one of the temporal time slots during the temporal period. Each temporal time slot is associated with one of the temporal base layer or one of the plurality of temporal layers for the corresponding spatial layer. The temporal base layer for each corresponding spatial layer is aligned with a different temporal time slot than each other temporal base layer for each other corresponding spatial layer.

Temporal prediction shifting for scalable video coding
11323734 · 2022-05-03 · ·

A method includes receiving an input video stream and scaling the input video stream into two or more spatial layers. For each spatial layer, the method also includes generating a temporal layer prediction pattern by: obtaining a temporal base layer for a corresponding spatial layer; identifying, based on the temporal base layer, a plurality of temporal layers and a plurality of temporal time slots during a temporal period; and aligning the temporal base layer for the corresponding spatial layer with one of the temporal time slots during the temporal period. Each temporal time slot is associated with one of the temporal base layer or one of the plurality of temporal layers for the corresponding spatial layer. The temporal base layer for each corresponding spatial layer is aligned with a different temporal time slot than each other temporal base layer for each other corresponding spatial layer.

Non-transform coding

Techniques for selectively transforming one or more coding units when coding video content are described herein. The techniques may include determining whether or not to transform a particular coding unit. The determination may be based on a difference in pixel values of the particular coding unit and/or one or more predefined rate-distortion constraints. When it is determined to not perform a transform, the particular coding unit may be coded without transforming the particular coding unit.

Non-transform coding

Techniques for selectively transforming one or more coding units when coding video content are described herein. The techniques may include determining whether or not to transform a particular coding unit. The determination may be based on a difference in pixel values of the particular coding unit and/or one or more predefined rate-distortion constraints. When it is determined to not perform a transform, the particular coding unit may be coded without transforming the particular coding unit.

Method, apparatus and system for encoding or decoding video data of precincts by using wavelet transform

A method of encoding video data into a video bitstream having a plurality of precincts. The method comprises generating a plurality of coding cost estimates for a current precinct by testing a corresponding candidate coefficient truncation level for the current precinct, each of the coding cost estimates being an over estimate of an encoded data size for coding the current precinct at the candidate truncation level and being determined using a most significant bit plane index, wherein each of the coding cost estimates is independent of a value of coefficient bits in the current precinct. The method includes selecting one of the candidate truncation levels according to the corresponding coding cost estimate and a budgeted coding cost for the current precinct, the budgeted coding cost representing an allowable size of encoding the precinct; and encoding the current precinct of video data into the video bitstream to generate the video bitstream.

Method, apparatus and system for encoding or decoding video data of precincts by using wavelet transform

A method of encoding video data into a video bitstream having a plurality of precincts. The method comprises generating a plurality of coding cost estimates for a current precinct by testing a corresponding candidate coefficient truncation level for the current precinct, each of the coding cost estimates being an over estimate of an encoded data size for coding the current precinct at the candidate truncation level and being determined using a most significant bit plane index, wherein each of the coding cost estimates is independent of a value of coefficient bits in the current precinct. The method includes selecting one of the candidate truncation levels according to the corresponding coding cost estimate and a budgeted coding cost for the current precinct, the budgeted coding cost representing an allowable size of encoding the precinct; and encoding the current precinct of video data into the video bitstream to generate the video bitstream.

Temporal prediction shifting for scalable video coding
11818382 · 2023-11-14 · ·

A method includes receiving an input video stream and scaling the input video stream into two or more spatial layers. For each spatial layer, the method also includes generating a temporal layer prediction pattern by: obtaining a temporal base layer for a corresponding spatial layer; identifying, based on the temporal base layer, a plurality of temporal layers and a plurality of temporal time slots during a temporal period; and aligning the temporal base layer for the corresponding spatial layer with one of the temporal time slots during the temporal period. Each temporal time slot is associated with one of the temporal base layer or one of the plurality of temporal layers for the corresponding spatial layer. The temporal base layer for each corresponding spatial layer is aligned with a different temporal time slot than each other temporal base layer for each other corresponding spatial layer.

Temporal prediction shifting for scalable video coding
11818382 · 2023-11-14 · ·

A method includes receiving an input video stream and scaling the input video stream into two or more spatial layers. For each spatial layer, the method also includes generating a temporal layer prediction pattern by: obtaining a temporal base layer for a corresponding spatial layer; identifying, based on the temporal base layer, a plurality of temporal layers and a plurality of temporal time slots during a temporal period; and aligning the temporal base layer for the corresponding spatial layer with one of the temporal time slots during the temporal period. Each temporal time slot is associated with one of the temporal base layer or one of the plurality of temporal layers for the corresponding spatial layer. The temporal base layer for each corresponding spatial layer is aligned with a different temporal time slot than each other temporal base layer for each other corresponding spatial layer.