Patent classifications
H04N19/149
GENERATING QUANTIZATION TABLES FOR IMAGE COMPRESSION
Methods, systems, and computer programs encoded on a computer storage medium, that relate to generating quantization tables that are used during digital image compression of a digital image. Multiple training images are obtained. A model can be trained using the training images to generate a quantization table that can be used during encoding of an input image. For each training image, a quantization table can be obtained using the model. Using the quantization table, an encoded digital image is obtained for the training image. Using the encoded digital image and the training image, an image quality loss and a compression loss can be determined. An overall loss of the model can be determined by combining the image quality loss and the compression loss for the training image. The model can be updated based on the overall loss.
Systems and methods for player input motion compensation by anticipating motion vectors and/or caching repetitive motion vectors
Systems and methods for reducing latency through motion estimation and compensation techniques are disclosed. The systems and methods include a client device that uses transmitted lookup tables from a remote server to match user input to motion vectors, and tag and sum those motion vectors. When a remote server transmits encoded video frames to the client, the client decodes those video frames and applies the summed motion vectors to the decoded frames to estimate motion in those frames. In certain embodiments, the systems and methods generate motion vectors at a server based on predetermined criteria and transmit the generated motion vectors and one or more invalidators to a client, which caches those motion vectors and invalidators. The server instructs the client to receive input from a user, and use that input to match to cached motion vectors or invalidators. Based on that comparison, the client then applies the matched motion vectors or invalidators to effect motion compensation in a graphic interface. In other embodiments, the systems and methods cache repetitive motion vectors at a server, which transmits a previously generated motion vector library to a client. The client stores the motion vector library, and monitors for user input data. The server instructs the client to calculate a motion estimate from the input data and instructs the client to update the stored motion vector library based on the input data, so that the client applies the stored motion vector library to initiate motion in a graphic interface prior to receiving actual motion vector data from the server. In this manner, latency in video data streams is reduced.
Setting intra-block copy virtual buffer based on virtual pipeline data unit
A method of video processing is described. The method includes determining, for a conversion between a current video block of a video picture of a video and a coded representation of the video, whether or how to reset reference sample values of a reference region of the video picture used for predicting the current video block according to a rule; and performing the conversion based on the determining.
Multimedia Distribution System
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, a complete index that can be used to locate each data chunk in each of these tracks and an abridged index that can enable the location of a subset of the data chunks in each track, data that can be used to generate a menu interface to access the contents of the file and ‘meta data’ concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and ‘meta data’ external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames, a first index that includes information indicative of the location within the file and characteristics of each encoded video frame and a separate second index that includes information indicative of the location within the file of a subset of the encoded video frames.
Multimedia Distribution System
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, a complete index that can be used to locate each data chunk in each of these tracks and an abridged index that can enable the location of a subset of the data chunks in each track, data that can be used to generate a menu interface to access the contents of the file and ‘meta data’ concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and ‘meta data’ external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames, a first index that includes information indicative of the location within the file and characteristics of each encoded video frame and a separate second index that includes information indicative of the location within the file of a subset of the encoded video frames.
A METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR VIDEO ENCODING AND VIDEO DECODING
The embodiments relate to a method for encoding and a decoding, and apparatuses for the same. The method for encoding comprises receiving a block of a video frame for encoding (1510); making a decision on whether or not a learning-based model is to be applied as a processing step for encoding the block (1520); applying the learning-based model for said input block according to the decision, where the learning-based model has been selectively fine-tuned according to information relating to activation of the learning-based model of previously-decoded blocks (1530); encoding a signal corresponding to the decision on usage of the learning-based model into a bitstream (1540); and encoding the block into a bitstream with an information whether the block is to be used for finetuning (1550).
A METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR VIDEO ENCODING AND VIDEO DECODING
The embodiments relate to a method for encoding and a decoding, and apparatuses for the same. The method for encoding comprises receiving a block of a video frame for encoding (1510); making a decision on whether or not a learning-based model is to be applied as a processing step for encoding the block (1520); applying the learning-based model for said input block according to the decision, where the learning-based model has been selectively fine-tuned according to information relating to activation of the learning-based model of previously-decoded blocks (1530); encoding a signal corresponding to the decision on usage of the learning-based model into a bitstream (1540); and encoding the block into a bitstream with an information whether the block is to be used for finetuning (1550).
IMAGE COMPRESSION METHOD AND APPARATUS THEREOF
An image compression method includes: obtaining a target image and a target code rate corresponding to the target image; determining a first code rate parameter corresponding to the target code rate; and inputting the target image and the first code rate parameter into an image compression model, that has been trained, for processing to obtain a compressed image with the target code rate, wherein the image compression model is obtained by training an initial image compression model based on multiple code rate parameters including the first code rate parameter.
IMAGE COMPRESSION METHOD AND APPARATUS THEREOF
An image compression method includes: obtaining a target image and a target code rate corresponding to the target image; determining a first code rate parameter corresponding to the target code rate; and inputting the target image and the first code rate parameter into an image compression model, that has been trained, for processing to obtain a compressed image with the target code rate, wherein the image compression model is obtained by training an initial image compression model based on multiple code rate parameters including the first code rate parameter.
IMAGE SIGNAL PROCESSING PIPELINES FOR HIGH DYNAMIC RANGE SENSORS
Apparatuses, systems, and techniques to receive, at one or more processor associated with an image signal processing (ISP) pipeline, a compressed image generated by an image sensor, wherein the compressed image is captured at a first bit-depth associated with the image sensor and is compressed to a second bit-depth that is lower than the first bit-depth, and wherein the ISP is associated with a third bit-depth that is lower than the first bit-depth and higher than the second bit-depth; and decompress the compressed image according to a power curve to generate a partially decompressed image having the third bit-depth, wherein a plurality of regions of the partially decompressed image are decompressed at separate decompression amounts based on a corresponding pixel value of each region of the plurality of regions.