H04N19/59

Methods for generating video-and audience-specific encoding ladders with audio and video just-in-time transcoding

A method including: populating an encoding ladder with a subset of bitrate-resolution pairs, from a set of bitrate-resolution pairs, based on a distribution of audience bandwidths; receiving a first request for a first playback segment, at a first bitrate-resolution pair in a encoding ladder, in the video from a first device; in response to determining an absence of video segments, at the first bitrate-resolution pair and corresponding to the segment, in a first rendition cache: identifying a first set of mezzanine segments, in the video, corresponding to the first playback segment; assigning the first set of mezzanine segments to a set of workers for transcoding into a first set of video segments according to the first bitrate-resolution pair; storing the first set of video segments in the first rendition cache; and based on the first request, releasing the first set of video segments to the first device.

Video coding device and method
11695944 · 2023-07-04 · ·

A video encoding method includes (i) determining a current bit rate of a communication channel between a destination device and a source device that stores an input video frame, and (ii) generating a current reconstructed frame and an encoded bitstream at least in part via inter-frame coding of a current input video frame of a sequence of input video frames using a previously-generated reconstructed frame generated at least in part via inter-frame coding of a previous input video frame. The current reconstructed frame is a compressed version of the current input video frame. When both (i) a subsequent bit rate, determined after said inter-frame coding, is less than a threshold and (ii) the current bit rate exceeds the threshold, the method includes: (a) generating a downscaled reconstructed frame at least in part by downscaling the current reconstructed frame; and (b) appending the encoded bitstream with a bit sequence representing the downscaled reconstructed frame.

Per-Title Encoding Using Spatial and Temporal Resolution Downscaling
20230007298 · 2023-01-05 · ·

Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.

Per-Title Encoding Using Spatial and Temporal Resolution Downscaling
20230007298 · 2023-01-05 · ·

Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.

Coding of a spatial sampling of a two-dimensional information signal using sub-division

Coding schemes for coding a spatially sampled information signal using sub-division and coding schemes for coding a sub-division or a multitree structure are described, wherein representative embodiments relate to picture and/or video coding applications.

Coding of a spatial sampling of a two-dimensional information signal using sub-division

Coding schemes for coding a spatially sampled information signal using sub-division and coding schemes for coding a sub-division or a multitree structure are described, wherein representative embodiments relate to picture and/or video coding applications.

Data stream encoder configuration

A media encoder for encoding a stream of media data blocks has an encoder pipeline including a sequence of processing modules for processing a stream of media data blocks, and a pipeline configurator configured effect a switch in the encoder pipeline from one or more first encode parameters to one or more second encode parameters. The first processing module of the pipeline can be configured to associate a trigger value with at least a first media data block processed at the first processing module in accordance with second encode parameters, the trigger value passing to subsequent modules so as to cause those modules to adopt the second encode parameters.

Data stream encoder configuration

A media encoder for encoding a stream of media data blocks has an encoder pipeline including a sequence of processing modules for processing a stream of media data blocks, and a pipeline configurator configured effect a switch in the encoder pipeline from one or more first encode parameters to one or more second encode parameters. The first processing module of the pipeline can be configured to associate a trigger value with at least a first media data block processed at the first processing module in accordance with second encode parameters, the trigger value passing to subsequent modules so as to cause those modules to adopt the second encode parameters.

METHOD FOR DECODING IMAGE ON BASIS OF CCLM PREDICTION IN IMAGE CODING SYSTEM, AND DEVICE THEREFOR

A video decoding method performed by a decoding apparatus according to the present disclosure includes deriving one of a plurality of cross-component linear model (CCLM) prediction mode as a CCLM prediction mode of the current chroma block, deriving a sample number of neighboring chroma samples of the current chroma block based on the CCLM prediction mode of the current chroma block, a size of the current chroma block, and a specific value; deriving the neighboring chroma samples of the sample number, calculating CCLM parameters based on the neighboring chroma samples and the down sampled neighboring luma samples, deriving prediction samples for the current chroma block based on the CCLM parameters and the down sampled luma samples and generating reconstructed samples for the current chroma block based on the prediction samples, wherein the specific value is derived as 2.

Foveated video link for VR with gaze tracking

Video stream data is selectively scaled so that sections within regions of interest (ROI) maintain high resolution while areas not within the region of interest are down-scaled to reduce bandwidth cost of transmission. A low compression encoder compresses sections of a video frame corresponding to one or more ROI without motion search or prediction mode decision to generate low-compression section data. The video frame is downscaled and a high compression encoder compresses the resulting downscaled video frame with prediction mode decision to generate high-compression frame data.