Patent classifications
H04N19/59
Pixel-Level Video Prediction with Improved Performance and Efficiency
One aspect provides a machine-learned video prediction model configured to receive and process one or more previous video frames to generate one or more predicted subsequent video frames, wherein the machine-learned video prediction model comprises a convolutional variational auto encoder, and wherein the convolutional variational auto encoder comprises an encoder portion comprising one or more encoding cells and a decoder portion comprising one or more decoding cells.
Video Encoding or Decoding Methods and Apparatuses with Scaling Ratio Constraint
Video processing methods and apparatuses for processing a current block in a current picture by reference picture resampling include receiving input data of the current block, determining a scaling window of the current picture and a scaling window of a reference picture. The current picture and reference picture may have different scaling window sizes. A ratio between a scaling window width, height, or size of the current picture and a scaling window width, height, or size of the reference picture is constrained to be within a ratio constraint. A reference block is generated from the reference picture according to the ratio, and used to encode or decode the current block.
IMAGE TRANSMISSION METHOD AND APPARATUS
Embodiments of this application provide an image transmission method and apparatus. The method includes: converting a first high-resolution image into a first low-resolution image, where first resolution of the first high-resolution image is higher than second resolution of the first low-resolution image; encoding the first low-resolution image to obtain a first bitstream; obtaining a second high-resolution image, where third resolution of the second high-resolution image is higher than the second resolution, and the second high-resolution image includes high-frequency information of the first high-resolution image and excludes low-frequency information of the first high-resolution image; obtaining an image residual between the first high-resolution image and the second high-resolution image, where the image residual is used to reflect the low-frequency information of the first high-resolution image; encoding the image residual to obtain a second bitstream; and sending the first bitstream and the second bitstream.
Switchable chroma sampling for wireless display
Methods, apparatuses and systems may provide for a video transmitter that generates a primary bitstream based on a video signal, wherein the primary bitstream is encoded with subsampled chroma information, and detects a static condition with respect to the video signal. Additionally, a plurality of auxiliary bitstreams may be generated, in response to the static condition, based on the video signal. Each of the plurality of auxiliary bitstreams may be encoded with full resolution chroma information. In one example, a video receiver may detect that the auxiliary bitstreams are associated with the primary bitstream, decode the primary bitstream and the plurality of auxiliary bitstreams to obtain luma information and the full resolution chroma information, and multiplex the luma information with the full resolution chroma information.
Image processing device and method of pre-processing images of a video stream before encoding
An image processing device 300, a non-transitory computer readable storage medium, a monitoring camera 200 and a method 100 of pre-processing images of a video stream before encoding the video stream are disclosed. The images are obtained S110, wherein the obtained images have a first resolution. The obtained images are subsampled S120 to intermediate images having a second resolution lower than the first resolution and lower than a third resolution. The intermediate images are upsampled S130 to output images having the third resolution, wherein the third resolution is the same for all images of the video stream.
Image processing device and method of pre-processing images of a video stream before encoding
An image processing device 300, a non-transitory computer readable storage medium, a monitoring camera 200 and a method 100 of pre-processing images of a video stream before encoding the video stream are disclosed. The images are obtained S110, wherein the obtained images have a first resolution. The obtained images are subsampled S120 to intermediate images having a second resolution lower than the first resolution and lower than a third resolution. The intermediate images are upsampled S130 to output images having the third resolution, wherein the third resolution is the same for all images of the video stream.
Data preprocessing and data augmentation in frequency domain
Methods and systems are provided for implementing preprocessing operations and augmentation operations upon image datasets transformed to frequency domain representations, including decoding images of an image dataset to generate a frequency domain representation of the image dataset; performing a resizing operation based on resizing factors on the image dataset in a frequency domain representation; performing a reshaping operation based on reshaping factors on the image dataset in a frequency domain representation; and performing a cropping operation on the image dataset in a frequency domain representation. The methods and systems may further include performing an augmentation operation on the image dataset in a frequency domain representation. Methods and systems of the present disclosure may free learning models from computational overhead caused by transforming image datasets into frequency domain representations. Furthermore, computational overhead caused by inverse transformation operations is also alleviated.
Camera module, image processing device and image compression method
A camera module includes a compressor configured to divide a plurality of pixels included in image data, into a plurality of pixel groups, with respect to each of the plurality of pixel groups into which the plurality of pixels is divided, calculate a representative pixel value of a corresponding pixel group, based on pixel values of multiple pixels included in the corresponding pixel group, generate first compressed data, based on the calculated representative pixel value of each of the plurality of pixel groups, with respect to each of the plurality of pixel groups into which the plurality of pixels is divided, calculate residual values representing differences between the pixel values of the multiple pixels included in the corresponding pixel group and the representative pixel value of the corresponding pixel group, and generate second compressed data, based on the calculated residual values of each of the plurality of pixel groups.
Video decoding apparatus using parameter decoding circuit to derive weight coefficients
A video decoding apparatus is provided. The video decoding apparatus includes a parameter decoding circuit, a prediction parameter derivation circuit, a motion compensation circuit, and a weighted prediction circuit to derive weight coefficients.
Video decoding apparatus using parameter decoding circuit to derive weight coefficients
A video decoding apparatus is provided. The video decoding apparatus includes a parameter decoding circuit, a prediction parameter derivation circuit, a motion compensation circuit, and a weighted prediction circuit to derive weight coefficients.