H04N19/46

MULTI-DOMAIN CONVOLUTIONAL NEURAL NETWORK

In one embodiment, an apparatus comprises a memory and a processor. The memory is to store visual data associated with a visual representation captured by one or more sensors. The processor is to: obtain the visual data associated with the visual representation captured by the one or more sensors, wherein the visual data comprises uncompressed visual data or compressed visual data; process the visual data using a convolutional neural network (CNN), wherein the CNN comprises a plurality of layers, wherein the plurality of layers comprises a plurality of filters, and wherein the plurality of filters comprises one or more pixel-domain filters to perform processing associated with uncompressed data and one or more compressed-domain filters to perform processing associated with compressed data; and classify the visual data based on an output of the CNN.

UPSAMPLING FOR SIGNAL ENHANCEMENT CODING
20230239505 · 2023-07-27 ·

There is disclosed a method of encoding an input signal, the method comprising: receiving a base encoded signal, the base encoded signal being generated by feeding an encoder with a down-sampled version of an input signal; producing a first residual signal by: decoding the base encoded signal to produce a first decoded signal; and using a difference between the base decoded signal and the down-sampled version of the input signal to produce the first residual signal; producing a second residual signal by: correcting the base decoded signal using the residual signal to create a corrected decoded version; up-sampling the corrected decoded version; and using a difference between the up-sampled corrected decoded signal and the input signal to produce the second residual signal; wherein the up-sampling is one of bilinear or bicubic up-sampling. A corresponding decoding method is also disclosed.

UPSAMPLING FOR SIGNAL ENHANCEMENT CODING
20230239505 · 2023-07-27 ·

There is disclosed a method of encoding an input signal, the method comprising: receiving a base encoded signal, the base encoded signal being generated by feeding an encoder with a down-sampled version of an input signal; producing a first residual signal by: decoding the base encoded signal to produce a first decoded signal; and using a difference between the base decoded signal and the down-sampled version of the input signal to produce the first residual signal; producing a second residual signal by: correcting the base decoded signal using the residual signal to create a corrected decoded version; up-sampling the corrected decoded version; and using a difference between the up-sampled corrected decoded signal and the input signal to produce the second residual signal; wherein the up-sampling is one of bilinear or bicubic up-sampling. A corresponding decoding method is also disclosed.

METHOD AND APPARATUS FOR ENCODING/DECODING IMAGE

Disclosed herein are an image encoding method and an image decoding method. The image decoding method includes determining an initial motion vector of a current block using a motion vector of a reconstructed region, searching for the motion vector of the current block based on the initial motion vector, and generating a prediction sample of the current block using the motion vector. The initial motion vector includes a motion vector in a past direction and a motion vector in a future direction.

SETTING MOTION VECTOR PRECISION FOR INTRA PREDICTION WITH MOTION VECTOR DIFFERENCE
20230239485 · 2023-07-27 · ·

A method for video encoding includes setting a motion vector precision associated with a current block to be encoded in a current picture, and determining a motion vector for encoding the current block based on the motion vector precision. The method also includes determining a motion vector difference for the current block based on (i) the determined motion vector for encoding the current block, (ii) a predicted motion vector of the current block in inter prediction mode, and (iii) the motion vector precision. The method further includes encoding the current block according to the determined motion vector, and generating a coded video bitstream including the encoded current block and including prediction information indicating that the current block is coded in inter prediction mode and indicating the determined motion vector difference for the current block.

SETTING MOTION VECTOR PRECISION FOR INTRA PREDICTION WITH MOTION VECTOR DIFFERENCE
20230239485 · 2023-07-27 · ·

A method for video encoding includes setting a motion vector precision associated with a current block to be encoded in a current picture, and determining a motion vector for encoding the current block based on the motion vector precision. The method also includes determining a motion vector difference for the current block based on (i) the determined motion vector for encoding the current block, (ii) a predicted motion vector of the current block in inter prediction mode, and (iii) the motion vector precision. The method further includes encoding the current block according to the determined motion vector, and generating a coded video bitstream including the encoded current block and including prediction information indicating that the current block is coded in inter prediction mode and indicating the determined motion vector difference for the current block.

METHOD AND DEVICE FOR PROCESSING IMAGE INFORMATION FOR IMAGE/VIDEO CODING
20230239478 · 2023-07-27 ·

A video decoding method performed by a video decoding device according to the present document may comprise the steps of: acquiring image information from a bitstream, the image information including a picture header associated with the current picture including a plurality of slices; parsing, from the picture header, at least one of a first flag indicating whether information necessary for an inter-prediction operation for a decoding process is present in the picture header, or a second flag indicating whether information necessary for an intra-prediction operation for the decoding process is present in the picture header; generating prediction samples by performing at least one of intra-prediction or inter-prediction for the slices in the current picture on the basis of at least one of the first flag or the second flag; and generating reconstructed samples on the basis of the prediction samples.

METHOD AND DEVICE FOR PROCESSING IMAGE INFORMATION FOR IMAGE/VIDEO CODING
20230239478 · 2023-07-27 ·

A video decoding method performed by a video decoding device according to the present document may comprise the steps of: acquiring image information from a bitstream, the image information including a picture header associated with the current picture including a plurality of slices; parsing, from the picture header, at least one of a first flag indicating whether information necessary for an inter-prediction operation for a decoding process is present in the picture header, or a second flag indicating whether information necessary for an intra-prediction operation for the decoding process is present in the picture header; generating prediction samples by performing at least one of intra-prediction or inter-prediction for the slices in the current picture on the basis of at least one of the first flag or the second flag; and generating reconstructed samples on the basis of the prediction samples.

IMAGE/VIDEO CODING METHOD AND DEVICE BASED ON WEIGHTED PREDICTION
20230239463 · 2023-07-27 ·

A video decoding method performed by a video decoding device according to this document comprises the steps of: parsing prediction weighted table syntax from a bitstream; parsing information on the number of reference pictures in a reference picture list from the prediction weighted table syntax; deriving weights for weighted prediction on the basis of the information on the number of reference pictures; deriving prediction samples of a current block by performing weighted prediction for the current block on the basis of the weights; and reconstructing a current picture on the basis of the prediction samples, wherein the prediction weighted table syntax may be parsed from a picture header of the bitstream.

IMAGE/VIDEO CODING METHOD AND DEVICE BASED ON WEIGHTED PREDICTION
20230239463 · 2023-07-27 ·

A video decoding method performed by a video decoding device according to this document comprises the steps of: parsing prediction weighted table syntax from a bitstream; parsing information on the number of reference pictures in a reference picture list from the prediction weighted table syntax; deriving weights for weighted prediction on the basis of the information on the number of reference pictures; deriving prediction samples of a current block by performing weighted prediction for the current block on the basis of the weights; and reconstructing a current picture on the basis of the prediction samples, wherein the prediction weighted table syntax may be parsed from a picture header of the bitstream.