H04N19/134

IMAGE CODING METHOD, IMAGE CODING APPARATUS, IMAGE DECODING METHOD, IMAGE DECODING APPARATUS, AND IMAGE CODING AND DECODING APPARATUS

A method for transmitting a bitstream via a network is provided. The bitstream is for coding a current block and is generated by: deriving a first candidate from a first motion vector that has been used to code a first block; coding a first index identifying a reference picture to be used for coding the current block; deriving a second candidate having a second motion vector that includes a non-zero value, which is assigned to the reference picture; and coding a second index identifying a selected candidate to be used for coding the current block. The second motion vector is not derived by coding a block adjacent to the current block, and the non-zero value is assigned by offset values which include an OffsetX and an OffsetY and which are located in a picture header.

IMAGE CODING METHOD, IMAGE CODING APPARATUS, IMAGE DECODING METHOD, IMAGE DECODING APPARATUS, AND IMAGE CODING AND DECODING APPARATUS

A method for transmitting a bitstream via a network is provided. The bitstream is for coding a current block and is generated by: deriving a first candidate from a first motion vector that has been used to code a first block; coding a first index identifying a reference picture to be used for coding the current block; deriving a second candidate having a second motion vector that includes a non-zero value, which is assigned to the reference picture; and coding a second index identifying a selected candidate to be used for coding the current block. The second motion vector is not derived by coding a block adjacent to the current block, and the non-zero value is assigned by offset values which include an OffsetX and an OffsetY and which are located in a picture header.

VIDEO PROCESSING DEVICE, VIDEO PROCESSING METHOD, VIDEO GENERATION DEVICE, VIDEO GENERATION METHOD, AND RECORDING MEDIUM
20230076845 · 2023-03-09 ·

A video processing device includes an acquirer that acquires video data via a predetermined transmission line, the video data including video and metadata that indicates a first frequency band that is a spatial frequency range in which the video is present; an adjuster that makes sharpness gain adjustment to video such that, among a plurality of regions of the video included in the video data acquired by the acquirer, a sharpness gain for a first region that belongs to the first frequency band indicated by the metadata exceeds a sharpness gain for a second region that belongs to a second frequency band that is a range outside the first frequency band; and an output device that outputs video adjusted by the adjuster.

CONFIGURABLE NAL AND SLICE CODE POINT MECHANISM FOR STREAM MERGING

Video decoder configured to decode a video comprising a plurality of pictures from a video data stream by decoding each picture from one or more video coding units within an access unit of the video data stream which is associated with the respective picture; read a substitute coding unit type from a parameter set unit of the video data stream; for each predetermined video coding unit, read a coding unit type identifier (100) from the respective video coding unit; check whether the coding unit identifier identifies a coding unit type out of a first subset of one or more coding unit types (102) or out of a second subset of coding unit types (104), if the coding unit identifier identifies a coding unit type out of the first subset of one or more coding unit types, attribute the respective video coding unit to the substitute coding unit type; if the coding unit identifier identifies a coding unit type out of the second subset of coding unit types, attribute the respective video coding unit to the coding unit type out of the second subset of coding unit types identified by the coding unit identifier.

Method and apparatus for prediction and transform for small blocks
11665359 · 2023-05-30 · ·

A method of video decoding for a video decoder includes determining whether a block size of a chroma block is less than or equal to a block size threshold. The method further includes, in response to a determination that the block size of the chroma block is greater than the block size threshold, selecting an intra prediction mode for the chroma block from a plurality of intra prediction modes. The method further includes, in response to a determination that the block size of the chroma block is less than or equal to the block size threshold, selecting the intra prediction mode for the chroma block from a subset of the plurality of intra prediction modes. The method further includes performing intra prediction for the chroma block based on a chroma sample obtained with the selected intra prediction mode to encode the chroma block.

Methods and apparatuses for performing artificial intelligence encoding and artificial intelligence decoding on image

Provided is an artificial intelligence (AI) decoding apparatus includes: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory, the processor is configured to: obtain AI data related to AI down-scaling an original image to a first image; obtain image data corresponding to an encoding result on the first image; obtain a second image corresponding to the first image by performing a decoding on the image data; obtain deep neural network (DNN) setting information among a plurality of DNN setting information from the AI data; and obtain, by an up-scaling DNN, a third image by performing the AI up-scaling on the second image, the up-scaling DNN being configured with the obtained DNN setting information, wherein the plurality of DNN setting information comprises a parameter used in the up-scaling DNN, the parameter being obtained through joint training of the up-scaling DNN and a down-scaling DNN, and wherein the down-scaling DNN is used to obtain the first image from the original image.

SIGNALING BLOCK PARTITIONING OF IMAGE AND VIDEO

A video system that applies constraints on block partitioning is provided. The system receives a partitioning control parameter from a bitstream specifying a maximum block size for enabling ternary-tree split that is constrained to be 64 or smaller. The system receives data from a bitstream for a block of pixels to be decoded as a current block of a current picture of a video. The system splits the current block into one or more partitions recursively, wherein ternary split is disallowed for a partition of the current block unless the partition is less than or equal to the maximum block size. The system reconstructs the one or more partitions of the current block.

METHOD AND APPARATUS FOR ADAPTIVE IMAGE PREPROCESSING AND RECONSTRUCTION

Disclosed herein is a method for adaptive image preprocessing and reconstruction. The method includes preprocessing an input image, encoding and decoding the preprocessed image, and reconstructing the encoded and decoded image. Here, preprocessing the input image may be performed using a preprocessing kernel generated based on a control parameter indicating a weight for human vision and machine vision.

Video coding method and apparatus utilizing group of encoding units

A decoding method comprises the steps of: combining two or more encoding units of maximum size into a single encoding unit group; acquiring encoding data corresponding to the combined single encoding unit group; and decoding, according to a decoding order, the two or more encoding units of maximum size contained in the single encoding unit group. Also disclosed is a block partitioning structure used for encoding and decoding video.

Video coding method and apparatus utilizing group of encoding units

A decoding method comprises the steps of: combining two or more encoding units of maximum size into a single encoding unit group; acquiring encoding data corresponding to the combined single encoding unit group; and decoding, according to a decoding order, the two or more encoding units of maximum size contained in the single encoding unit group. Also disclosed is a block partitioning structure used for encoding and decoding video.