Patent classifications
H04N19/90
Method and system for optimized delta encoding
Methods and systems are provided for image processing. A plurality of correlation parameters representing degrees of correlation between two or more images of a plurality of images may be produced. An optimized correlation dependency graph may be produced according to the plurality of correlation parameters. The plurality of images may then be delta encoded according to the optimized correlation dependency graph. For example, the optimized correlation dependency graph may be used for performing a correlation encoding operation. The plurality of correlation parameters may be produced, for example, in accordance with one or more correlation metrics associated with the correlation encoding operation.
Method and system for optimized delta encoding
Methods and systems are provided for image processing. A plurality of correlation parameters representing degrees of correlation between two or more images of a plurality of images may be produced. An optimized correlation dependency graph may be produced according to the plurality of correlation parameters. The plurality of images may then be delta encoded according to the optimized correlation dependency graph. For example, the optimized correlation dependency graph may be used for performing a correlation encoding operation. The plurality of correlation parameters may be produced, for example, in accordance with one or more correlation metrics associated with the correlation encoding operation.
METHOD FOR IMAGE COMPRESSION AND APPARATUS FOR IMPLEMENTING THE SAME
A method for encoding image data of an image divided into a plurality of pixel blocks using a machine learning algorithm, is proposed, which comprises, by a computing platform comprising a processor configured for implementing the machine learning algorithm, for a block of the pixel blocks: obtaining a block neighborhood set of at least one pixel of the image, wherein the at least one pixel is located outside the block in a spatial neighborhood of a corresponding pixel located on an edge of the pixel block, and generating, by the machine learning algorithm configured for performing end-to-end image compression, a bitstream representing the encoded pixel block, by encoding the pixel block based on input data comprising the pixel block and the block neighborhood set.
PREDICTED FRAME GENERATION BY DEFORMABLE CONVOLUTION FOR VIDEO CODING
A method of video coding at a video coding device includes performing a deformable convolution through a deformable convolutional deep neural network (DNN) to generate one or more first feature maps based on a set of one or more previously reconstructed reference frames, generating a predicted frame based on the one or more first feature maps, and reconstructing a current frame based on the predicted frame. In an embodiment, a set of one or more second feature maps corresponding to the one or more previously reconstructed reference frames can be generated based on a feature extraction DNN. One or more offset maps corresponding to the one or more second feature maps can be generated, respectively, using an offset generation DNN.
Transmission and detection of multi-channel signals in reduced channel format
Embodiments of the invention are generally directed to transmission and detection of multi-channel signals in reduced channel format. An embodiment of a method for transmitting data includes determining whether a first type or a second type of content data is to be transmitted, where the first type of content data is to be transmitted at a first multiple of a base frequency and the second type of data is to be transmitted at a second multiple of the base frequency. The method further includes selecting one or more channels from a plurality of channels based on the type of content data, clocking a frequency on the first or second multiple of the base frequency according to the type of content data in the selected channels, modifying the content data to fit within a single output channel, and transmitting the modified data via a single output channel at the chosen multiple of the base frequency.
Transmission and detection of multi-channel signals in reduced channel format
Embodiments of the invention are generally directed to transmission and detection of multi-channel signals in reduced channel format. An embodiment of a method for transmitting data includes determining whether a first type or a second type of content data is to be transmitted, where the first type of content data is to be transmitted at a first multiple of a base frequency and the second type of data is to be transmitted at a second multiple of the base frequency. The method further includes selecting one or more channels from a plurality of channels based on the type of content data, clocking a frequency on the first or second multiple of the base frequency according to the type of content data in the selected channels, modifying the content data to fit within a single output channel, and transmitting the modified data via a single output channel at the chosen multiple of the base frequency.
ENCODING APPARATUS, DECODING APPARATUS, ENCODING SYSTEM, LEARNING METHOD AND PROGRAM
An encoding apparatus encodes an input image and includes: a provisional encoded data acquisition unit configured to obtain provisional encoded data which has a size greater than a target size of encoded data which is data obtained by encoding the image based on the image and a parameter for determining the target size; and an encoded data acquisition unit configured to obtain the encoded data by converting data within a data range outside of a data range corresponding to the target size in the provisional encoded data into a predetermined value. The provisional encoded data acquisition unit obtains the provisional encoded data so that features for determining the image are contained more within the data range corresponding to the target size than in the data range outside of the data range corresponding to the target size.
Dictionary-based data compression
Approaches to dictionary-based data compression are disclosed, including those in which a Zbuffer is used to identify phrases that represent uncompressed data and/or to generate decompressed data by combining a set of overlapping occluded phrases.
Systems and methods for spatial prediction
Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.
Systems and methods for spatial prediction
Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.