Patent classifications
H04N19/507
Methods and devices for encoding and decoding using parameter sets, and electronic equipment
Provided are methods and devices for encoding and decoding using parameter sets, and electronic equipment. In the method for encoding, an encoder determines parameter sets and/or virtual parameter sets for a slice, wherein the virtual parameter set is a data structure which is generated by loading information acquired from a bitstream into a syntax structure of an existing parameter set and/or a preset syntax structure and includes tool parameters and/or control parameters; and the encoder writes identification number (ID) (s) of the parameter sets and/or virtual parameter sets into a bitstream. Using the method, encoding and decoding efficiency is improved.
Methods and devices for encoding and decoding using parameter sets, and electronic equipment
Provided are methods and devices for encoding and decoding using parameter sets, and electronic equipment. In the method for encoding, an encoder determines parameter sets and/or virtual parameter sets for a slice, wherein the virtual parameter set is a data structure which is generated by loading information acquired from a bitstream into a syntax structure of an existing parameter set and/or a preset syntax structure and includes tool parameters and/or control parameters; and the encoder writes identification number (ID) (s) of the parameter sets and/or virtual parameter sets into a bitstream. Using the method, encoding and decoding efficiency is improved.
Efficient coding of source video sequences partitioned into tiles
Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content to be skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.
Efficient coding of source video sequences partitioned into tiles
Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content to be skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.
Method and system for producing streams of image frames
Methods, systems and computer program products, for producing streams of image frames. Image frames in streaming video are segmented into background segments and instance segments. A background image frame containing the background segments is created. At least some of the instance segments are classified into movable objects of interest and movable objects of non-interest. During a background update time period, the background image frame is updated when a movable object of non-interest has moved to reveal a background area, to include the revealed background area in the background image frame. A foreground image containing the movable objects of interest is created. Blocks of pixels of the updated background and foreground image frames are encoded. A stream of encoded foreground image frames having a first frame rate is produced. A stream of encoded updated background image frames a second, lower frame rate is produced.
MERGE CANDIDATE LIST FOR GRADUAL DECODING REFRESH
A method, apparatus, and computer program product provide for solving merge candidate list mismatches. In the context of a method, the method accesses a current coding unit of a picture. The method can also generate a merge candidate list comprising a plurality of coding unit representations in a predefined order. The method also determines whether a particular coding unit representation within the merge candidate list is representative of a coding unit within a dirty area of the picture and prevents the particular coding unit representation and one or more coding unit representations following the particular coding unit in the predefined order from being selected as a merge candidate for the current coding unit.
Intra Prediction For Multi-Hypothesis
A video decoder that decodes a current block of pixels by using multi-hypothesis combined prediction mode is provided. The video decoder generates a first prediction of the current block based on an inter prediction mode. The video decoder enables the combined prediction mode for the current block based on a block size of the current block determined according to a width and a height of the current block. The combined prediction mode is disabled when the width of or the height of the current block is greater than a threshold length. When the combined prediction mode is enabled, the video decoder generates a second prediction based on an intra prediction mode that is inferred to be a planar mode, and subsequently a combined prediction for the current block based on the first prediction and the second prediction. The video decoder reconstructs the current block by using the combined prediction.
Intra Prediction For Multi-Hypothesis
A video decoder that decodes a current block of pixels by using multi-hypothesis combined prediction mode is provided. The video decoder generates a first prediction of the current block based on an inter prediction mode. The video decoder enables the combined prediction mode for the current block based on a block size of the current block determined according to a width and a height of the current block. The combined prediction mode is disabled when the width of or the height of the current block is greater than a threshold length. When the combined prediction mode is enabled, the video decoder generates a second prediction based on an intra prediction mode that is inferred to be a planar mode, and subsequently a combined prediction for the current block based on the first prediction and the second prediction. The video decoder reconstructs the current block by using the combined prediction.
DIFFERENTIAL IMAGE PROCESSING
An image signal processor (ISP) processes received image frame data. In order to reduce power consumption caused by the processing of substantially similar image tiles of received image frames, differential processing may be used to determine which image tiles of a received image frame should be processed by the ISP, based upon a comparison between the received image tiles and previously received image tile data. In addition, differential processing may be automatically disabled in response to a determination that the image frames are too different from each other to realize any savings in processing resources or memory usage, and reenabled in response to a determination the received image frames are similar enough such that power savings can be realized through differential processing.
Method and program for producing multi reactive video, and generate meta data to make multi reactive video, and analyze into interaction data to understand human act
Disclosed is a multi-reactive video generating method and program that performs various condition playbacks depending on a user's manipulation, based on a video database (e.g., a basic video) in which a general video or a plurality of image frames are stored. According to an embodiment of the inventive concept, various actions (i.e., reactions) may be applied as the multi-reactive video generation file is played with a general video or a combination of a plurality of image frames.