H04N19/177

Method, An Apparatus and a Computer Program Product for Video Encoding and Video Decoding

The embodiments relate to a method for encoding including receiving a sequence of volumetric video frames including a volumetric visual object being defined with a mesh of interconnected vertices; selecting one or more reference frames from the sequence of volumetric video frames for a group of pictures; clustering a mesh of the one or more reference frames into patches, each patch being associated with a corresponding bounding volume; creating matching patches in frames dependent on the reference frame; estimating scaling and rotation parameters for each individual patch in the dependent frame; applying the estimated scaling and rotation parameters to the bounding volume of a patch of the dependent frames; packing the patches to an atlas bitstream of a volumetric video stream; and including into a bitstream the estimated rotation parameter alongside the bounding volume of a patch. The embodiments also relate to a method for decoding, and corresponding equipment.

Re-encoding predicted picture frames in live video stream applications
11265599 · 2022-03-01 · ·

In various examples, a media stream may be received by a re-encode system that may leverage a recode engine to convert (e.g., at an interval, based on a request, etc.) an inter-frame associated with the media stream to an intra-frame. The intra-frame may be converted from the inter-frame using parameters or other information associated with and received with the media stream. The converted intra-frame may be merged into an updated segment of the media stream in place of the original inter-frame to enable storage of the updated segment—or a portion thereof—for later use.

Error resilience for interactive real-time multimedia applications

An error resilience method comprising: using a computer, creating and storing, in computer memory, one or more FEC filter tables for use by the FEC filter for selectively forwarding a FEC packet; selectively forwarding a request for the FEC packet through a FEC filter based on the FEC table and a dynamic packet loss level at a receiver; limiting a re-transmission request for a particular packet through the FEC filter based on a number of re-transmission requests for the particular packet; and selectively skipping a key frame request based on a number of key frame requests received from a plurality of receiver devices, wherein the method is performed by one or more special-purpose computing devices.

METHOD AND SYSTEM FOR OPTIMIZED DELTA ENCODING
20220060724 · 2022-02-24 ·

Methods and systems are provided for image processing. A plurality of correlation parameters representing degrees of correlation between two or more images of a plurality of images may be produced. An optimized correlation dependency graph may be produced according to the plurality of correlation parameters. The plurality of images may then be delta encoded according to the optimized correlation dependency graph. For example, the optimized correlation dependency graph may be used for performing a correlation encoding operation. The plurality of correlation parameters may be produced, for example, in accordance with one or more correlation metrics associated with the correlation encoding operation.
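
The abstract does not say how the dependency graph is optimized. As a rough illustration only, the sketch below assumes a sum-of-absolute-differences cost as the correlation metric and uses a minimum spanning tree (Prim's algorithm) to decide which image each other image is delta encoded against. The function names `pair_cost`, `build_dependency_tree`, `delta_encode` and `delta_decode` are hypothetical and not taken from the application; images are modelled as flat lists of integer samples.

```python
def pair_cost(a, b):
    """Assumed correlation metric: sum of absolute differences (lower = more correlated)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def build_dependency_tree(images):
    """Build a minimum spanning tree over the images with Prim's algorithm.

    Returns parent[i], the index of the image that image i is encoded
    against; the root (image 0) has parent None and is stored as-is.
    """
    n = len(images)
    parent = [None] * n
    best = [float("inf")] * n
    in_tree = [False] * n
    best[0] = 0
    for _ in range(n):
        # Pick the cheapest image that is not yet in the tree.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        for v in range(n):
            if not in_tree[v]:
                cost = pair_cost(images[u], images[v])
                if cost < best[v]:
                    best[v] = cost
                    parent[v] = u
    return parent


def delta_encode(images, parent):
    """Store the root verbatim and every other image as a difference from its parent."""
    return [
        list(img) if p is None else [x - y for x, y in zip(img, images[p])]
        for img, p in zip(images, parent)
    ]


def delta_decode(encoded, parent):
    """Invert delta_encode by resolving each image after its parent."""
    decoded = [None] * len(encoded)

    def resolve(i):
        if decoded[i] is None:
            p = parent[i]
            if p is None:
                decoded[i] = list(encoded[i])
            else:
                decoded[i] = [d + r for d, r in zip(encoded[i], resolve(p))]
        return decoded[i]

    return [resolve(i) for i in range(len(encoded))]
```

With four small images in which the first two and the last two are nearly identical, the tree chains each image to its closest already-selected neighbour, so most deltas contain only small values. A real encoder would use a metric matched to its actual delta coder rather than this stand-in.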

OPTIMIZATION USING MULTI-THREADED PARALLEL PROCESSING FRAMEWORK
20170310983 · 2017-10-26 · ·

Systems, methods, and instrumentalities are disclosed for encoder and/or decoder optimization using a multi-threaded parallel processing framework. An encoding and/or decoding device may receive a video sequence that includes a plurality of first-temporal level pictures associated with a first temporal level and a plurality of second-temporal level pictures associated with a second temporal level. The encoding and/or decoding device may allocate a first number of parallel processing threads for encoding and/or decoding the first-temporal level pictures and a second number of parallel processing threads for encoding and/or decoding the second-temporal level pictures. The device may perform this allocation based on temporal level priority, for example. The encoding and/or decoding device may encode and/or decode the first-temporal level pictures and the second-temporal level pictures. This encoding and/or decoding may be based on the allocation of the first number of parallel processing threads and the second number of parallel processing threads.
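
One way to picture the allocation step is sketched below. It assumes each temporal level carries a positive integer priority weight, gives every level at least one thread, and splits the remaining threads in proportion to the weights. The helper names `allocate_threads`, `encode_picture` and `encode_by_level` are hypothetical, `encode_picture` is a placeholder rather than a real encoder call, and for simplicity the levels are processed one after another instead of concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def allocate_threads(total_threads, priorities):
    """Split total_threads across temporal levels by priority weight.

    priorities maps temporal level -> positive integer weight (an assumed
    representation). Every level gets at least one thread.
    """
    levels = sorted(priorities)
    if total_threads < len(levels):
        raise ValueError("need at least one thread per temporal level")
    spare = total_threads - len(levels)
    weight_sum = sum(priorities.values())
    alloc = {lvl: 1 + spare * priorities[lvl] // weight_sum for lvl in levels}
    # Threads lost to integer rounding go to the highest-priority level.
    top = max(levels, key=lambda lvl: priorities[lvl])
    alloc[top] += total_threads - sum(alloc.values())
    return alloc


def encode_picture(picture_id):
    """Placeholder standing in for a real per-picture encode."""
    return f"encoded:{picture_id}"


def encode_by_level(pictures, alloc):
    """Encode (level, picture_id) pairs, one thread pool per temporal level."""
    results = {}
    for lvl in sorted(alloc):
        ids = [pid for level, pid in pictures if level == lvl]
        with ThreadPoolExecutor(max_workers=alloc[lvl]) as pool:
            results[lvl] = list(pool.map(encode_picture, ids))
    return results
```

For example, `allocate_threads(8, {0: 3, 1: 1})` assigns six threads to temporal level 0 and two to level 1 under these assumptions.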

SIGNALING FOR REFERENCE PICTURE RESAMPLING
20220060712 · 2022-02-24 ·

Devices, systems and methods for digital video coding, which includes reference picture resampling, are described. An example method for video processing includes performing a conversion between a video comprising one or more video segments comprising one or more video units and a bitstream representation of the video, wherein the bitstream representation conforms to a format rule and comprises information related to an adaptive resolution conversion (ARC) process, wherein the format rule specifies the applicability of the ARC process to a video segment, wherein an indication that the one or more video units of the video segment are coded with different resolutions is included in the bitstream representation in a syntax structure that is different from a header syntax structure, a decoder parameter set, a video parameter set, a picture parameter set, a sequence parameter set, and an adaptation parameter set.

EVENT/OBJECT-OF-INTEREST CENTRIC TIMELAPSE VIDEO GENERATION ON CAMERA DEVICE WITH THE ASSISTANCE OF NEURAL NETWORK INPUT
20220059132 · 2022-02-24 ·

An apparatus including an interface and a processor. The interface may be configured to receive pixel data generated by a capture device. The processor may be configured to generate video frames in response to the pixel data, perform computer vision operations on the video frames to detect objects, perform a classification of the objects detected based on characteristics of the objects, determine whether the classification of the objects corresponds to a user-defined event and generate encoded video frames from the video frames. The encoded video frames may be communicated to a cloud storage service. The encoded video frames may comprise a first sample of the video frames selected at a first rate when the user-defined event is not detected and a second sample of the video frames selected at a second rate while the user-defined event is detected. The second rate may be greater than the first rate.
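
The two-rate selection described above can be sketched as follows, assuming the detector has already produced one boolean per frame saying whether the user-defined event is present. The function name `select_frames` and the parameters `idle_step` and `event_step` are hypothetical; a smaller step means a higher selection rate, so `event_step` is expected to be smaller than `idle_step`.

```python
def select_frames(event_flags, idle_step=4, event_step=1):
    """Return indices of frames to keep for the timelapse.

    event_flags: iterable of booleans, True while the event is detected.
    idle_step:   keep one frame in every idle_step while no event is detected.
    event_step:  keep one frame in every event_step while the event is detected.
    """
    selected = []
    since_last = None  # frames elapsed since the last kept frame
    for index, event_active in enumerate(event_flags):
        step = event_step if event_active else idle_step
        if since_last is None or since_last >= step:
            selected.append(index)
            since_last = 1
        else:
            since_last += 1
    return selected
```

With eight idle frames, four event frames and four more idle frames, the defaults keep frames 0 and 4 from the first idle run, all of frames 8 to 11 during the event, and frame 15 afterwards.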

Apparatus, a method and a computer program for video coding and decoding
11671625 · 2023-06-06 · ·

A method comprising: encoding at least four bitstream versions of a same content divided into segments of independently coded tile sets representing a plurality of spatial regions, wherein a first and a second bitstream comprise independently coded tile sets encoded at a first quality, and a third and a fourth bitstream comprise independently coded tile sets encoded at a second quality, wherein the first and the third bitstream have a first random access picture interval and the second and the fourth bitstream have a second random access picture interval, which is an integer multiple of the first random access picture interval; grouping the independently coded tile sets of all four bitstreams representing a common spatial region into a plurality of groups of collocated sub-picture tracks, wherein only one of said tile sets per group is intended to be received and/or decoded per any segment; and generating at least one instruction for merging tile sets of different spatial locations into at least one coded picture, the at least one instruction causing a tile set originating from a random access picture to be decoded as a tile set originating from a non-random-access picture when merged with a tile set originating from a non-random-access picture.