Patent classifications
H04N19/615
Affine motion compensation with low bandwidth
An example device for coding video data is configured to determine that a block of the video data includes a plurality of sub-blocks, each of the sub-blocks having respective motion information referring to respective reference blocks in a reference picture in a memory, determine a single reference block of the reference picture, the single reference block including each of the respective reference blocks, wherein determining the single reference block comprises: determine four corner sub-blocks of the block included in the plurality of sub-blocks; and determine the single reference block according to the respective motion information for the four corner sub-blocks such that corners of the single reference block correspond to corners of the respective reference blocks of the four corner sub-blocks, retrieve data of the single reference block from the reference picture, and predict the sub-blocks from the respective reference blocks using the data of the single reference block.
Video intra prediction using hybrid recursive filters
An example method of decoding video data includes obtaining, from a coded video bitstream and for a current block of the video data, an indication of an intra-prediction mode that identifies an initial predictive block; filtering, in parallel, samples in a current line of a plurality of lines of the initial predictive block based on filtered values of samples in a preceding line of the plurality of lines and unfiltered values of samples in the current line to generate filtered values for samples for the current line; and reconstructing, using intra prediction, values of samples of the current block based on the filtered values of the samples of the current initial predictive block and residual data for the current block that represents a difference between the filtered values of the samples of the current initial predictive block and the values of samples of the current block.
Video intra prediction using hybrid recursive filters
An example method of decoding video data includes obtaining, from a coded video bitstream and for a current block of the video data, an indication of an intra-prediction mode that identifies an initial predictive block; filtering, in parallel, samples in a current line of a plurality of lines of the initial predictive block based on filtered values of samples in a preceding line of the plurality of lines and unfiltered values of samples in the current line to generate filtered values for samples for the current line; and reconstructing, using intra prediction, values of samples of the current block based on the filtered values of the samples of the current initial predictive block and residual data for the current block that represents a difference between the filtered values of the samples of the current initial predictive block and the values of samples of the current block.
Using motion compensated temporal filter (MCTF) statistics for scene change detection when a fade, dissolve or cut occurs
A method is provided to better detect a scene change to provide a prediction to an encoder to enable more efficient encoding. The method uses a Motion Compensated Temporal Filter (MCTF) that provides motion estimation and is located prior to an encoder. The MCTF provides a Motion Compensated Residual (MCR) used to detect the scene change transition. When a scene is relatively stable, the MCR score is also relatively stable. However, when a scene transition is in process, the MCR score behavior changes, Algorithmically, the MCR score is used by comparing the sliding mean of the MCR score to the sliding median. This comparison highlights the transition points. In the case of a scene cut, the MCR score exhibits a distinct spike. In the case of a fade or dissolve, the MCR score exhibits a transitional period of degradation followed by recovery. By implementing the above detection using the MCR, the location of the I-pictures in the downstream encoding process can be accurately determined for the encoder.
Using motion compensated temporal filter (MCTF) statistics for scene change detection when a fade, dissolve or cut occurs
A method is provided to better detect a scene change to provide a prediction to an encoder to enable more efficient encoding. The method uses a Motion Compensated Temporal Filter (MCTF) that provides motion estimation and is located prior to an encoder. The MCTF provides a Motion Compensated Residual (MCR) used to detect the scene change transition. When a scene is relatively stable, the MCR score is also relatively stable. However, when a scene transition is in process, the MCR score behavior changes, Algorithmically, the MCR score is used by comparing the sliding mean of the MCR score to the sliding median. This comparison highlights the transition points. In the case of a scene cut, the MCR score exhibits a distinct spike. In the case of a fade or dissolve, the MCR score exhibits a transitional period of degradation followed by recovery. By implementing the above detection using the MCR, the location of the I-pictures in the downstream encoding process can be accurately determined for the encoder.
Scalable video coding using subblock-based coding of transform coefficient blocks in the enhancement layer
A subblock-based coding of transform coefficient blocks of the enhancement layer is rendered more efficient. To this end, the subblock subdivision of the respective transform coefficient block is controlled on the basis of the base layer residual signal or the base layer signal. In particular, by exploiting the respective base layer hint, the subblocks may be made longer along a spatial frequency axis transverse to edge extensions observable from the base layer residual signal or the base layer signal.
Scalable video coding using subblock-based coding of transform coefficient blocks in the enhancement layer
A subblock-based coding of transform coefficient blocks of the enhancement layer is rendered more efficient. To this end, the subblock subdivision of the respective transform coefficient block is controlled on the basis of the base layer residual signal or the base layer signal. In particular, by exploiting the respective base layer hint, the subblocks may be made longer along a spatial frequency axis transverse to edge extensions observable from the base layer residual signal or the base layer signal.
Scalable Video Coding Using Derivation Of Subblock Subdivision For Prediction From Base Layer
Scalable video coding is rendered more efficient by deriving/selecting a subblock subdivision to be used for enhancement layer prediction, among a set of possible subblock subdivisions of an enhancement layer block by evaluating the spatial variation of the base layer coding parameters over the base layer signal. By this measure, less of the signalization overhead has to be spent on signaling this subblock subdivision within the enhancement layer data stream, if any. The subblock subdivision thus selected may be used in predictively coding/decoding the enhancement layer signal.
Scalable Video Coding Using Derivation Of Subblock Subdivision For Prediction From Base Layer
Scalable video coding is rendered more efficient by deriving/selecting a subblock subdivision to be used for enhancement layer prediction, among a set of possible subblock subdivisions of an enhancement layer block by evaluating the spatial variation of the base layer coding parameters over the base layer signal. By this measure, less of the signalization overhead has to be spent on signaling this subblock subdivision within the enhancement layer data stream, if any. The subblock subdivision thus selected may be used in predictively coding/decoding the enhancement layer signal.
PROCESSING A VIDEO FRAME HAVING SLICES AND TILES
A method for decoding video includes receiving a frame of the video that includes at least one slice and at least one time. Each of the at least one slice and the at least one tile are not all aligned with one another. Each of the at least one slice is characterized that it is decoded independently of the other the at least one slice. Each of the at least one tile is characterized that it is a rectangular region of the frame and having coding units for the decoding arranged in a raster scan order. The at least one tile of the frame are collectively arranged in a raster scan order of the frame.