H04N19/39

Apparatus, a method and a computer program for video coding and decoding

A method comprising: encoding an input picture into a coded constituent picture; reconstructing, as a part of said encoding, a decoded constituent picture corresponding to the coded constituent picture; encoding a spatial region into a coded tile, the encoding comprising: determining a horizontal offset and a vertical offset indicative of a region-wise anchor position of the spatial region within the decoded constituent picture; encoding the horizontal offset and the vertical offset; determining that a prediction unit at position of a first horizontal coordinate and a first vertical coordinate of the coded tile is predicted relative to the region-wise anchor position, wherein the first horizontal coordinate and the first vertical coordinate are horizontal and vertical coordinates, respectively, within the spatial region; indicating that the prediction unit is predicted relative to a prediction-unit anchor position that is relative to the horizontal and vertical offset; deriving a prediction-unit anchor position equal to sum of the first horizontal coordinate and the horizontal offset, and the first vertical coordinate and the vertical offset, respectively; and determining a motion vector for the prediction unit; and applying the motion vector relative to the prediction-unit anchor position to obtain a prediction block.

Apparatus, a method and a computer program for video coding and decoding

A method comprising: encoding an input picture into a coded constituent picture; reconstructing, as a part of said encoding, a decoded constituent picture corresponding to the coded constituent picture; encoding a spatial region into a coded tile, the encoding comprising: determining a horizontal offset and a vertical offset indicative of a region-wise anchor position of the spatial region within the decoded constituent picture; encoding the horizontal offset and the vertical offset; determining that a prediction unit at position of a first horizontal coordinate and a first vertical coordinate of the coded tile is predicted relative to the region-wise anchor position, wherein the first horizontal coordinate and the first vertical coordinate are horizontal and vertical coordinates, respectively, within the spatial region; indicating that the prediction unit is predicted relative to a prediction-unit anchor position that is relative to the horizontal and vertical offset; deriving a prediction-unit anchor position equal to sum of the first horizontal coordinate and the horizontal offset, and the first vertical coordinate and the vertical offset, respectively; and determining a motion vector for the prediction unit; and applying the motion vector relative to the prediction-unit anchor position to obtain a prediction block.

Support of non-HEVC base layer in HEVC multi-layer extensions
09794558 · 2017-10-17 · ·

An apparatus configured to code video information includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a bitstream. The apparatus further includes a processor in communication with the memory, the processor configured to determine whether a reference layer is included in the bitstream. The processor is further configured to determine, based upon whether the reference layer is included in the bitstream, whether or not to process an indication and to, if the reference layer is included in the bitstream, process, in a video bitstream, the indication. The processor is also configured to code the video information based at least in part on the processed indication.

Support of non-HEVC base layer in HEVC multi-layer extensions
09794558 · 2017-10-17 · ·

An apparatus configured to code video information includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a bitstream. The apparatus further includes a processor in communication with the memory, the processor configured to determine whether a reference layer is included in the bitstream. The processor is further configured to determine, based upon whether the reference layer is included in the bitstream, whether or not to process an indication and to, if the reference layer is included in the bitstream, process, in a video bitstream, the indication. The processor is also configured to code the video information based at least in part on the processed indication.

Apparatus, a method and a computer program for video coding and decoding
11671625 · 2023-06-06 · ·

A method comprising: encoding at least four bitstream versions of a same content divided into segments of independently coded tile sets representing a plurality of spatial regions, wherein a first and a second bitstream comprise independently coded tile sets encoded at a first quality, and a third and a fourth bitstream comprise independently coded tile sets encoded at a second quality, wherein the first and the third bitstream have first random access picture interval and the second and the fourth bitstream have second random access picture interval, which is an integer multiple of the first random access picture interval; grouping the independently coded tile sets of all four bitstreams representing a common spatial region into a plurality of groups of collocated sub-picture tracks, wherein only one of said tile sets per group is intended to be received and/or decoded per any segment; and generating at least one instruction for merging tile sets of different spatial locations into at least one coded picture, the at least one instruction causing a tile set originating from a random access picture to be decoded as a tile set originating from anon-random-access picture when merged with a tile set originating from a non-random-access picture.

Signaling of information indicating transform kernel set in image coding

An image decoding method according to the present document comprises a step of generating residual samples of a current block on the basis of residual information, wherein the residual information comprises a multiple transform selection (MTS) index and information regarding transform coefficients, the residual samples are generated from transform coefficients according to the information regarding the transform coefficients by using a transform kernel set, the transform kernel set is determined by the MTS index from among transform kernel set candidates, at least one of bins of a bin string of the MTS index is derived on the basis of context coding, the context coding is performed based on a value of a context index with respect to the MTS index.

VIDEO IMAGE TRANSMISSION METHOD, DEVICE, INTERACTIVE INTELLIGENT TABLET AND STORAGE MEDIUM

The present application relates to a video image transmission method, device, an interactive intelligent tablet and a storage medium. The method comprises: acquiring a video image captured by a first video communication end; acquiring semantic information in the video image; and sending the semantic information to a second video communication end, wherein the semantic information is used to reconstruct a reconstruction image of the video image at the second video communication end.

HANDLING OF END OF BITSTREAM NAL UNITS IN L-HEVC FILE FORMAT AND IMPROVEMENTS TO HEVC AND L-HEVC TILE TRACKS
20170237999 · 2017-08-17 ·

In various implementations, techniques are described herein to resolve issues related to the location of an EOB NAL unit in an ISO base media file. In various implementations, these techniques include relaxing the requirement that a file include only one EOB NAL unit. These techniques further include providing the EOB NAL unit from the highest layer when reconstructing the bitstream. This EOB NAL unit can be added to a last access unit in the reconstructed bitstream. When the EOB NAL unit from the highest layer is not present, an EOB NAL unit from a lower layer can be provided instead.

HANDLING OF END OF BITSTREAM NAL UNITS IN L-HEVC FILE FORMAT AND IMPROVEMENTS TO HEVC AND L-HEVC TILE TRACKS
20170237999 · 2017-08-17 ·

In various implementations, techniques are described herein to resolve issues related to the location of an EOB NAL unit in an ISO base media file. In various implementations, these techniques include relaxing the requirement that a file include only one EOB NAL unit. These techniques further include providing the EOB NAL unit from the highest layer when reconstructing the bitstream. This EOB NAL unit can be added to a last access unit in the reconstructed bitstream. When the EOB NAL unit from the highest layer is not present, an EOB NAL unit from a lower layer can be provided instead.

Video encoding method and method for reducing file size of encoded video
11457228 · 2022-09-27 · ·

A video encoding method comprises encoding a series of images of original video data into an encoded video stream comprising key frames and delta frames, wherein the delta frames are organized in a hierarchical prediction pattern comprising a plurality of temporal layers. The video encoding method further comprises adding to the encoded video stream a hidden delta frame for at least some of the key frames. Each hidden delta frame corresponds to a key frame, is based on same original video data as the corresponding key frame and is referring to a previous key frame in the encoded video stream. Also, a method of reducing a file size of the video stream encoded according to the video encoding method is disclosed.