Patent classifications
H04N19/124
Method and device for sharing a candidate list
The present invention relates to a method and device for sharing a candidate list. A method of generating a merging candidate list for a predictive block may include: producing, on the basis of a coding block including a predictive block on which a parallel merging process is performed, at least one of a spatial merging candidate and a temporal merging candidate of the predictive block; and generating a single merging candidate list for the coding block on the basis of the produced merging candidate. Thus, it is possible to increase processing speeds for coding and decoding by performing inter-picture prediction in parallel on a plurality of predictive blocks.
Video encoding and decoding method based on entry point information in a slice header, and apparatus using same
A video encoding method includes deriving entry point information specifying an entry point of a substream for a picture. The entry point information may include a number syntax element representing a number of offset syntax elements in a slice header; an offset syntax element representing an entry point offset between in bytes two entry points; and a length syntax element representing a bits length of the offset syntax element. A value of the length syntax element plus one corresponds to the bits length of the offset syntax element and the length syntax element is signaled when the number of the offset syntax elements is larger than 0. A value of the number syntax element corresponds to the number of offset syntax elements in the slice header, and the offset syntax element is signaled when the number of the offset syntax elements is larger than 0.
Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus, and moving picture coding and decoding apparatus
A moving picture coding apparatus includes an intra-inter prediction unit which calculates a second motion vector by performing a scaling process on a first motion vector of a temporally neighboring corresponding block, when selectively adding, to a list, a motion vector of each of one or more corresponding blocks each of which is either a block included in a current picture to be coded and spatially neighboring a current block to be coded or a block included in a picture other than the current picture and temporally neighboring the current block, determines whether the second motion vector has a magnitude that is within a predetermined magnitude or not within the predetermined magnitude, and adds the second motion vector to the list when the intra-inter prediction unit determines that the second motion vector has a magnitude that is within the predetermined magnitude range.
Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus, and moving picture coding and decoding apparatus
A moving picture coding apparatus includes an intra-inter prediction unit which calculates a second motion vector by performing a scaling process on a first motion vector of a temporally neighboring corresponding block, when selectively adding, to a list, a motion vector of each of one or more corresponding blocks each of which is either a block included in a current picture to be coded and spatially neighboring a current block to be coded or a block included in a picture other than the current picture and temporally neighboring the current block, determines whether the second motion vector has a magnitude that is within a predetermined magnitude or not within the predetermined magnitude, and adds the second motion vector to the list when the intra-inter prediction unit determines that the second motion vector has a magnitude that is within the predetermined magnitude range.
MULTI-DOMAIN CONVOLUTIONAL NEURAL NETWORK
In one embodiment, an apparatus comprises a memory and a processor. The memory is to store visual data associated with a visual representation captured by one or more sensors. The processor is to: obtain the visual data associated with the visual representation captured by the one or more sensors, wherein the visual data comprises uncompressed visual data or compressed visual data; process the visual data using a convolutional neural network (CNN), wherein the CNN comprises a plurality of layers, wherein the plurality of layers comprises a plurality of filters, and wherein the plurality of filters comprises one or more pixel-domain filters to perform processing associated with uncompressed data and one or more compressed-domain filters to perform processing associated with compressed data; and classify the visual data based on an output of the CNN.
Spatial Layer Rate Allocation
A method includes receiving transform coefficients corresponding to a scaled video input signal, the scaled video input signal including a plurality of spatial layers that include a base layer. The method also includes determining a spatial rate factor based on a sample of frames from the scaled video input signal. The spatial rate factor defines a factor for bit rate allocation at each spatial layer of an encoded bit stream formed from the scaled video input signal. The spatial rate factor is represented by a difference between a rate of bits per transform coefficient of the base layer and an average rate of bits per transform coefficient. The method also includes reducing a distortion for the plurality of spatial layers by allocating a bit rate to each spatial layer based on the spatial rate factor and the sample of frames.
Spatial Layer Rate Allocation
A method includes receiving transform coefficients corresponding to a scaled video input signal, the scaled video input signal including a plurality of spatial layers that include a base layer. The method also includes determining a spatial rate factor based on a sample of frames from the scaled video input signal. The spatial rate factor defines a factor for bit rate allocation at each spatial layer of an encoded bit stream formed from the scaled video input signal. The spatial rate factor is represented by a difference between a rate of bits per transform coefficient of the base layer and an average rate of bits per transform coefficient. The method also includes reducing a distortion for the plurality of spatial layers by allocating a bit rate to each spatial layer based on the spatial rate factor and the sample of frames.
UPSAMPLING FOR SIGNAL ENHANCEMENT CODING
There is disclosed a method of encoding an input signal, the method comprising: receiving a base encoded signal, the base encoded signal being generated by feeding an encoder with a down-sampled version of an input signal; producing a first residual signal by: decoding the base encoded signal to produce a first decoded signal; and using a difference between the base decoded signal and the down-sampled version of the input signal to produce the first residual signal; producing a second residual signal by: correcting the base decoded signal using the residual signal to create a corrected decoded version; up-sampling the corrected decoded version; and using a difference between the up-sampled corrected decoded signal and the input signal to produce the second residual signal; wherein the up-sampling is one of bilinear or bicubic up-sampling. A corresponding decoding method is also disclosed.
UPSAMPLING FOR SIGNAL ENHANCEMENT CODING
There is disclosed a method of encoding an input signal, the method comprising: receiving a base encoded signal, the base encoded signal being generated by feeding an encoder with a down-sampled version of an input signal; producing a first residual signal by: decoding the base encoded signal to produce a first decoded signal; and using a difference between the base decoded signal and the down-sampled version of the input signal to produce the first residual signal; producing a second residual signal by: correcting the base decoded signal using the residual signal to create a corrected decoded version; up-sampling the corrected decoded version; and using a difference between the up-sampled corrected decoded signal and the input signal to produce the second residual signal; wherein the up-sampling is one of bilinear or bicubic up-sampling. A corresponding decoding method is also disclosed.
METHOD OF CODING TRANSFORM COEFFICIENT BASED ON HIGH FREQUENCY ZEROING AND APPARATUS THEREOF
An image decoding method performed by a decoding apparatus according to the present disclosure includes receiving a bitstream including residual information; deriving quantized transform coefficients for a current block based on the residual information included in the bitstream; deriving residual samples for the current block based on the quantized transform coefficients; and generating a reconstructed picture based on the residual samples for the current block.