H04N19/115

SYSTEMS AND METHODS FOR REGION-OF-INTEREST TONE REMAPPING
20230039194 · 2023-02-09 ·

Systems and methods are described for providing viewers of streamed video content with the option to view alternative video content in which alternative tone maps are applied to respective regions of interest. In some embodiments, the streamed video content is divided into slices, and alternative tone maps are applied to respective regions of interest within the slices. When a server receives a request from a client for alternative tone mappings of different regions, slices with the appropriate mapping may be assembled on demand and delivered to the client in a single video stream. Tone mappings may be used, for example, to highlight particular objects, such as players in a sporting event captured in the streamed video content.
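The on-demand assembly described above can be pictured with a minimal sketch. It assumes each slice has been pre-encoded once per tone map and stored under a hypothetical `(slice_index, tone_map_id)` key; the names `SliceStore`, `assemble_frame`, and the `"default"` and `"highlight"` identifiers are illustrative and do not come from the application.

```python
from typing import Dict, List, Tuple

# Hypothetical store: (slice_index, tone_map_id) -> encoded slice payload.
SliceStore = Dict[Tuple[int, str], bytes]


def assemble_frame(store: SliceStore, num_slices: int,
                   requested: Dict[int, str],
                   default_map: str = "default") -> List[bytes]:
    """Pick, for each slice, the variant whose tone map the client requested."""
    frame = []
    for idx in range(num_slices):
        tone_map = requested.get(idx, default_map)
        # Fall back to the default variant if the requested one is not stored.
        frame.append(store.get((idx, tone_map), store[(idx, default_map)]))
    return frame


store = {
    (0, "default"): b"s0-default",
    (1, "default"): b"s1-default",
    (1, "highlight"): b"s1-highlight",
    (2, "default"): b"s2-default",
}
# The client asks for the highlight tone map on slice 1 only.
frame = assemble_frame(store, 3, {1: "highlight"})
```

The selected slices would then be concatenated into a single stream for delivery; that step depends on the codec and container and is left out here.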

METHODS AND APPARATUS OF VIDEO CODING FOR TRIANGLE PREDICTION

A method for video coding is provided. The method includes: partitioning video pictures into a plurality of coding units (CUs), at least one of which is further partitioned into two prediction units (PUs) including at least one triangular-shaped PU with a partitioning orientation in one of: from top-left corner to bottom-right corner, and from top-right corner to bottom-left corner; constructing a uni-prediction motion vector candidate list; determining whether a current CU is coded in triangle prediction mode according to coded information; signaling a partition orientation flag indicating the partitioning orientation; and signaling index values that indicate selected entries in the constructed uni-prediction motion vector candidate list.
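As a rough illustration of the signaling step, the sketch below lists the syntax elements an encoder might emit for a CU coded in triangle prediction mode. The element names, the flag convention (0 for top-left to bottom-right), and the `TriangleCU` type are assumptions made for this example; they are not taken from the application or from any codec specification, and entropy coding is omitted.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TriangleCU:
    top_left_to_bottom_right: bool  # partitioning orientation of the CU
    cand_idx0: int                  # candidate list entry chosen for the first PU
    cand_idx1: int                  # candidate list entry chosen for the second PU


def signal_triangle(cu: TriangleCU, list_size: int) -> List[Tuple[str, int]]:
    """Return the (name, value) syntax elements for one triangle-mode CU."""
    for idx in (cu.cand_idx0, cu.cand_idx1):
        if not 0 <= idx < list_size:
            raise ValueError("index outside the uni-prediction candidate list")
    return [
        # Assumed convention: 0 = top-left to bottom-right, 1 = top-right to bottom-left.
        ("partition_orientation_flag", 0 if cu.top_left_to_bottom_right else 1),
        ("cand_idx0", cu.cand_idx0),
        ("cand_idx1", cu.cand_idx1),
    ]


elements = signal_triangle(TriangleCU(True, 0, 2), list_size=5)
```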

IMAGE DATA TRANSFER APPARATUS, IMAGE DISPLAY SYSTEM, AND IMAGE DATA TRANSFER METHOD
20230099405 · 2023-03-30 ·

At a server 400, a rate control map storage section 430 of a compressing/encoding section 422 stores a rate control map on which an image quality deterioration tolerance is set for each compression unit. A status acquiring section 432 acquires a distribution of image resolutions and positional coordinates of a point of gaze of a user, and a map generating section 434 generates or corrects the rate control map on the basis of the acquired information. On the basis of the rate control map, a compression/encoding processing section 436 sets a target value of a data size, and compresses and encodes image data while performing feedback control. A communication section 426 transmits the data obtained after the compression, in units of partial images formed by splitting a frame.
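One way to picture a gaze-driven rate control map is the sketch below, which assigns each compression unit a deterioration tolerance that grows with its distance from the point of gaze. The linear falloff, the grid layout, and the function name `gaze_rate_control_map` are assumptions for illustration; the abstract does not specify how the map is computed.

```python
import math
from typing import List, Tuple


def gaze_rate_control_map(cols: int, rows: int, gaze: Tuple[float, float],
                          max_tolerance: float = 1.0) -> List[List[float]]:
    """Tolerance is 0 at the gaze point and max_tolerance at the farthest unit."""
    gx, gy = gaze
    dist = [[math.hypot(c - gx, r - gy) for c in range(cols)]
            for r in range(rows)]
    # Guard against a zero divisor when the map has a single unit at the gaze point.
    farthest = max(max(row) for row in dist) or 1.0
    return [[max_tolerance * d / farthest for d in row] for row in dist]


# 4 x 3 grid of compression units, user looking at the top-left unit.
tolerance_map = gaze_rate_control_map(4, 3, gaze=(0, 0))
```

An encoder following this sketch would spend more bits on units with low tolerance and fewer on units far from the gaze point.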

METHODS AND APPARATUSES FOR ENCODING, DECODING AND RENDERING 6DOF CONTENT FROM 3DOF+ COMPOSED ELEMENTS

Volumetric content is encoded as a set of clusters by an encoder and transmitted to a decoder, which retrieves the volumetric content. Clusters common to different viewpoints are identified and shared among those viewpoints. Clusters are projected onto 2D images and encoded as independent video streams. This reduces visual artefacts and the amount of data needed for storage and streaming.

VIDEO ENCODING OPTIMIZATION FOR MACHINE LEARNING CONTENT CATEGORIZATION
20230095541 · 2023-03-30 ·

Systems, apparatuses, and methods for performing machine learning content categorization leveraging video encoding pre-processing are disclosed. A system includes at least a motion vector unit and a machine learning (ML) engine. The motion vector unit pre-processes a frame to determine if there is temporal locality with previous frames. If the objects of the scene have not changed by a threshold amount, then the ML engine does not process the frame, saving computational resources that would typically be used. Otherwise, if there is a change of scene or other significant changes, then the ML engine is activated to process the frame. The ML engine can then generate a QP map and/or perform content categorization analysis on this frame and a subset of the other frames of the video sequence.
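The gating logic can be sketched as follows. The change metric (mean motion-vector magnitude), the threshold semantics, and the names `scene_changed` and `process_frame` are assumptions chosen for this example; the abstract only states that the ML engine is skipped when the scene has not changed by a threshold amount.

```python
import math
from typing import Any, Callable, Iterable, Tuple


def scene_changed(motion_vectors: Iterable[Tuple[float, float]],
                  threshold: float) -> bool:
    """Return True when the mean motion-vector magnitude exceeds the threshold."""
    magnitudes = [math.hypot(dx, dy) for dx, dy in motion_vectors]
    if not magnitudes:
        return False  # no motion data: treat the frame as unchanged
    return sum(magnitudes) / len(magnitudes) > threshold


def process_frame(frame: Any, motion_vectors, threshold: float,
                  ml_engine: Callable[[Any], Any], cached_result: Any) -> Any:
    """Run the (hypothetical) ML engine only when the frame differs enough."""
    if scene_changed(motion_vectors, threshold):
        return ml_engine(frame)
    return cached_result  # reuse the previous result and save the ML pass
```

For example, `process_frame(frame, [(3.0, 4.0)], 1.0, engine, previous)` would call `engine`, because the mean magnitude of 5.0 exceeds the threshold of 1.0.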

TRANSMISSION APPARATUS AND TRANSMISSION METHOD
20230034162 · 2023-02-02 ·

A transmission apparatus includes a video encoder that encodes each piece of frame data of an image, and a transmission processing unit. While transmitting image data encoded by the video encoder, the transmission processing unit lowers the encoding rate of the video encoder according to the transmission delay to the reception-side device, and executes delay decrease processing that reduces the delay amount of the transmission data for the frame data of one or more target frames.

REMOTE IMAGE PROCESSING METHOD AND APPARATUS
20230033785 · 2023-02-02 ·

A remote image processing method, applied to a remote server, includes: obtaining a recommended bit rate, where the recommended bit rate matches an environment parameter of the network in which the remote server is located, and the environment parameter represents the amount of data the network can transmit per unit time; and generating, based on the recommended bit rate, adjustment parameters corresponding to different regions in a to-be-processed image, and processing the corresponding regions by using the adjustment parameters, to obtain a single-frame image used for display, so that the amount of data in the single-frame image matches the recommended bit rate.

OPTIMIZED REDUCED BITRATE ENCODING FOR TITLES AND CREDITS IN VIDEO CONTENT

Embodiments include systems, methods, and computer-readable media for optimized reduced bitrate encoding for text-based content in video frames. Example methods may include determining that a first segment of video content includes a content scene, determining that a second segment of the video content includes text, and determining a first encoder configuration to encode the first segment of video content, where the first encoder configuration includes a first encoding parameter setting. Example methods may include determining a second encoder configuration to encode the second segment of the video content, where the second encoder configuration includes a second encoding parameter setting, encoding the first segment using the first encoder configuration, and encoding the second segment using the second encoder configuration. The first segment may be encoded at a first bitrate that is greater than a second bitrate at which the second segment is encoded.
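A minimal sketch of the per-segment configuration choice follows. The `EncoderConfig` type, the two presets, and their bitrate values are placeholders invented for this example; the abstract only requires that the content segment be encoded at a higher bitrate than the text segment.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class EncoderConfig:
    name: str
    bitrate_kbps: int


# Illustrative presets: text such as titles and credits tolerates a lower bitrate.
CONTENT_CONFIG = EncoderConfig("content", 4000)
TEXT_CONFIG = EncoderConfig("text", 800)


def plan_encodes(segments: List[Tuple[str, bool]]) -> List[Tuple[str, EncoderConfig]]:
    """Map each (segment_id, is_text) pair to an encoder configuration."""
    return [(seg_id, TEXT_CONFIG if is_text else CONTENT_CONFIG)
            for seg_id, is_text in segments]


plan = plan_encodes([("scene-1", False), ("credits", True)])
```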

CONTENT ADAPTIVE ENCODING
20230085928 · 2023-03-23 ·

The described technology is generally directed towards developing an adaptive bitrate stack (ladder) on a per-title basis. Variable bitrate encodings are used to obtain complexity information for a title and per-frame scores for the encodings; another encoding provides scene data. The complexity information is analyzed and processed based on the scene data to determine scene-based (e.g., objective and/or subjective quality) scores, which are used to determine scores for the encodings. The results are used to derive a candidate stack, comprising various resolutions and bitrates that provide desirable results. The candidate stack is evaluated by encoding the title using the candidate stack. These encodings are evaluated to select one resolution from any duplicate resolutions for a bitrate (e.g., based on relative quality), resulting in a pruned, final ladder that is associated with the title as the adaptive bitrate stack to be used for streaming that title's content.
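The final pruning step can be sketched as below, assuming each candidate encoding carries a single quality score where higher is better. The `Rung` type, its field names, and the sample scores are hypothetical; the abstract does not say which quality metric is used.

```python
from typing import Dict, List, NamedTuple


class Rung(NamedTuple):
    bitrate_kbps: int
    height: int      # vertical resolution in pixels
    quality: float   # hypothetical score, higher is better


def prune_ladder(candidates: List[Rung]) -> List[Rung]:
    """Keep, for each bitrate, only the resolution with the best quality score."""
    best: Dict[int, Rung] = {}
    for rung in candidates:
        current = best.get(rung.bitrate_kbps)
        if current is None or rung.quality > current.quality:
            best[rung.bitrate_kbps] = rung
    return sorted(best.values(), key=lambda r: r.bitrate_kbps)


candidates = [
    Rung(1500, 480, 78.0),
    Rung(1500, 720, 74.5),   # same bitrate, lower score: pruned
    Rung(3000, 720, 88.0),
    Rung(3000, 1080, 90.2),  # same bitrate, higher score: kept
]
ladder = prune_ladder(candidates)
```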