Patent classifications
H04N19/103
Prioritizing encoding of video data received by an online system to maximize visual quality while accounting for fixed computing capacity
An online system receives video data items from users and encodes the video data items using various codecs. To account for different computational resources used for encoding using different codecs, the online system ranks combinations of video data items by ratios of encoding video data items with different codecs to computational costs of encoding different video data items with different codecs. The benefit of encoding a video data item with a codec is based on a compression efficiency of the codec and a predicted aggregate amount of the video data item displayed to various users of the online system. Encoding video data items with codecs based on the determined ratios allows the online system to optimize a duration of video data having at least a threshold video quality to users.
Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device
A three-dimensional data encoding method encodes a plurality of three-dimensional points, and includes: selecting one prediction mode from two or more prediction modes for calculating a predicted value of an attribute information item of a first three-dimensional point, in accordance with attribute information items of one or more second three-dimensional points in vicinity of the first three-dimensional point; calculating the predicted value by the selected prediction mode; calculating, as a prediction residual, a difference between a value of the attribute information item of the first three-dimensional point and the calculated predicted value; and generating a first bit stream that includes the selected prediction mode, the prediction residual, and a number of the two or more prediction modes.
Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device
A three-dimensional data encoding method encodes a plurality of three-dimensional points, and includes: selecting one prediction mode from two or more prediction modes for calculating a predicted value of an attribute information item of a first three-dimensional point, in accordance with attribute information items of one or more second three-dimensional points in vicinity of the first three-dimensional point; calculating the predicted value by the selected prediction mode; calculating, as a prediction residual, a difference between a value of the attribute information item of the first three-dimensional point and the calculated predicted value; and generating a first bit stream that includes the selected prediction mode, the prediction residual, and a number of the two or more prediction modes.
Method and apparatus for processing video signal
A method for decoding a video according to the present invention may comprise: determining a first prediction mode for a first sub-block in a current block and a second intra prediction mode for a second sub-block, performing a first prediction for the first sub-block based on the first intra prediction mode, performing a second prediction for the second sub-block based on the second intra prediction mode, and obtaining a prediction sample of the current block according to a result of the first prediction and the second prediction.
IMAGE ENCODING METHOD AND IMAGE DECODING METHOD
An image encoding method for encoding an image includes: a prediction image generation step of generating a synthesis-prediction image by performing synthesis processing of synthesizing an inter-prediction image and an intra-prediction image to an encoding-target block; and an encoding step of encoding a difference between the prediction image generated in the prediction image generation step and a pixel value of an image of the encoding-target block, the synthesis processing includes weighting processing performed to the inter-prediction image and the intra-prediction image, a weighting parameter in the weighting processing is determined in accordance with a type of the intra prediction of the encoding-target block and a combination of prediction modes of a plurality of adjacent blocks adjacent to the encoding-target block, and types of the intra prediction of the encoding-target block include a matrix weighting intra prediction.
IMAGE ENCODING METHOD AND IMAGE DECODING METHOD
An image encoding method for encoding an image includes: a prediction image generation step of generating a prediction image of a synthesis prediction by performing synthesis processing of synthesizing a prediction image of an in-screen block copy prediction and a prediction image of an intra prediction different from the in-screen block copy prediction, to an encoding-target block; and an encoding step of encoding a difference between the prediction image generated in the prediction image generation step and a pixel value of an image of the encoding-target block, the synthesis processing includes weighting processing performed to the prediction image of the in-screen block copy prediction and the prediction image of the intra prediction that is different from the in-screen block copy prediction, and a weighting parameter of the prediction image of the intra prediction in the weighting processing is determined by information of an adjacent block adjacent to the encoding-target block.
METHODS AND APPARATUS OF VIDEO CODING USING PALETTE MODE
An electronic apparatus performs a method of decoding video data. The method comprises: receiving, from the bitstream, a plurality of syntax elements associated with a coding unit, wherein the plurality of syntax elements indicate the coding tree type of the coding unit, and whether the local dual tree mode is enabled for the coding unit; and in accordance with the determination that the coding tree type of the coding unit is a single tree, and the local dual tree mode is enabled for the coding unit: disabling the palette mode for the coding unit when the coding unit has the size equal to or less than a predefined threshold. In some embodiments, disabling palette mode for the coding unit includes: disabling the palette mode for both the luma component and chroma component of the coding unit, or disabling the palette mode for only the chroma component of the coding unit.
METHOD AND SYSTEM FOR OPTIMIZING IMAGE AND VIDEO COMPRESSION FOR MACHINE VISION
A method and a system described herein provide optimizing image and/or video compression for machine perception. According to an aspect, the method comprises receiving a raw image frame from a camera sensor; detecting a predefined object in the raw image frame and marking a region around the predefined object within the raw image frame as ROI. Based on the ROI, a partitioning scheme, a prediction mode, and quantization parameter are determined for improving coding efficiency. Machine perception efficiency is improved by selecting a quantization parameter table used for compressing and encoding the raw image or video frame based on a selected machine vision task. The selection of the quantization parameter table is based on training of the selected machine vision task using cost function optimization.
SYSTEM AND METHOD FOR PROCESSING A STREAM OF IMAGES
A method for processing a stream of images including the steps of obtaining coding information from the stream of images to determine one or more bitrate/distortion models representative of the bitrate/distortion relationship of the stream of images, determining a set of coding parameters arranged for use to encode a stream of images with the one or more bitrate/distortion models, reformulating the bitrate/distortion relationship into a decoupled relationship arranged to be applied to a subset of the stream of images, and using the decoupled relationship and the set of coding parameters to generate an adaptive quantization parameter for encoding the stream of images with the bitrate/distortion relationship.
SYSTEM AND METHOD FOR PROCESSING A STREAM OF IMAGES
A method for processing a stream of images including the steps of obtaining coding information from the stream of images to determine one or more bitrate/distortion models representative of the bitrate/distortion relationship of the stream of images, determining a set of coding parameters arranged for use to encode a stream of images with the one or more bitrate/distortion models, reformulating the bitrate/distortion relationship into a decoupled relationship arranged to be applied to a subset of the stream of images, and using the decoupled relationship and the set of coding parameters to generate an adaptive quantization parameter for encoding the stream of images with the bitrate/distortion relationship.