Patent classifications
H04N19/98
TECHNIQUES FOR CONSTRAINT FLAG SIGNALING FOR RANGE EXTENSION WITH EXTENDED PRECISION
Aspects of the disclosure provide methods and apparatuses for video data processing. In some examples, an apparatus for video data processing includes processing circuitry. The processing circuitry determines a first syntax element for coding control in a first scope of coded video data in a bitstream. The first syntax element is associated with a coding tool for processing transform coefficients with a dynamic range that is extended from a predetermined dynamic range. The dynamic range is associated with an extended precision. Then, in response to the first syntax element being a first value indicative of disabling of the coding tool in the first scope, the processing circuitry decodes the first scope of coded video data in the bitstream that includes one or more second scopes of coded video data without invoking the coding tool.
Integrated image reshaping and video coding
Given a sequence of images in a first codeword representation, methods, processes, and systems are presented for integrating reshaping into a next generation video codec for encoding and decoding the images, wherein reshaping allows part of the images to be coded in a second codeword representation which allows more efficient compression than using the first codeword representation. A variety of architectures are discussed, including: an out-of-loop reshaping architecture, an in-loop-for intra pictures only reshaping architecture, an in-loop architecture for prediction residuals, and a hybrid in-loop reshaping architecture. Syntax methods for signaling reshaping parameters, and image-encoding methods optimized with respect to reshaping are also presented.
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
Information processing device, information processing method, image capturing device, and program
In an information compression unit 311, in an image capturing unit 121 (221), among a plurality of pixel output units that receives object light that enters without passing through any of an image capturing lens and a pinhole, pixel outputs of at least two of the pixel output units have incident angle directivity modulated into different incident angle directivity according to an incident angle of the object light. The information compression unit 311 performs compression processing to reduce an amount of data of pixel output information generated by the image capturing unit 121 (221). For example, by computation of a difference between set reference value information and the pixel output information, linear calculation of a set calculation parameter and the pixel output information, and the like, the information compression unit 311 reduces a word length of the pixel output information, and reduces the amount of data of the pixel output information generated according to the object light that enters without passing through any of an image capturing lens and a pinhole.
SIGNALING CHROMA OFFSET PRESENCE IN VIDEO CODING
A video coding system handling at least a block of at least an image of a video comprises an encoding process and decoding process respectively providing or using signaling information related to the video, wherein the signaling information comprises at least an information representative of the presence of chroma offset values, wherein when chroma is present and does not use separate color planes, the information representative of the presence of chroma offset values is set and the signaling information further comprises information representative of a chroma offsets values and wherein when chroma is not present or uses separate color planes, the information representative of the presence of chroma offset values is reset and no information representative of a chroma offsets values is further signaled.
ITERATIVE OPTIMIZATION OF RESHAPING FUNCTIONS IN SINGLE-LAYER HDR IMAGE CODEC
A method, for generating (a) a forward reshaping function for compressing an input high-dynamic range (HDR) image into a reshaped standard-dynamic-range (SDR) image and (b) a backward reshaping function for decompressing the reshaped SDR image into a reconstructed HDR image, includes (i) optimizing the forward reshaping function to minimize a deviation between the reshaped SDR image and an input SDR image corresponding to the input HDR image, (ii) optimizing the backward reshaping function to minimize a deviation between the reconstructed HDR image and the input HDR image, and (iii) until a termination condition is met, applying a correction to the input SDR image and reiterating, based on the input SDR image as corrected, the steps of optimizing the forward and backward reshaping functions.
ITERATIVE OPTIMIZATION OF RESHAPING FUNCTIONS IN SINGLE-LAYER HDR IMAGE CODEC
A method, for generating (a) a forward reshaping function for compressing an input high-dynamic range (HDR) image into a reshaped standard-dynamic-range (SDR) image and (b) a backward reshaping function for decompressing the reshaped SDR image into a reconstructed HDR image, includes (i) optimizing the forward reshaping function to minimize a deviation between the reshaped SDR image and an input SDR image corresponding to the input HDR image, (ii) optimizing the backward reshaping function to minimize a deviation between the reconstructed HDR image and the input HDR image, and (iii) until a termination condition is met, applying a correction to the input SDR image and reiterating, based on the input SDR image as corrected, the steps of optimizing the forward and backward reshaping functions.
Network video streaming with trick play based on separate trick play files
Network services encode multimedia content, such as video, into multiple adaptive bitrate streams of encoded video and a separate trick play stream of encoded video to support trick play features. The trick play stream is encoded at a lower encoding bitrate and frame rate than each of the adaptive bitrate streams. The adaptive bitrate streams and the trick play stream are stored in the network services. During normal content streaming and playback, a client device downloads a selected one of the adaptive bitrate streams from network serviced for playback at the client device. To implement a trick play feature, the client device downloads the trick play stream from the network services for trick play playback.
Network video streaming with trick play based on separate trick play files
Network services encode multimedia content, such as video, into multiple adaptive bitrate streams of encoded video and a separate trick play stream of encoded video to support trick play features. The trick play stream is encoded at a lower encoding bitrate and frame rate than each of the adaptive bitrate streams. The adaptive bitrate streams and the trick play stream are stored in the network services. During normal content streaming and playback, a client device downloads a selected one of the adaptive bitrate streams from network serviced for playback at the client device. To implement a trick play feature, the client device downloads the trick play stream from the network services for trick play playback.