Patent classifications
H04N19/132
ACTION SYNCHRONIZATION FOR TARGET OBJECT
A method for synchronizing an action of a target object with source audio is provided. Facial parameter conversion is performed on an audio parameter of the source audio at different time periods to obtain source parameter information of the source audio at the respective time periods. Parameter extraction is performed on a target video that includes the target object to obtain target parameter information of the target video. Image reconstruction is performed on the target object in the target video based on the source parameter information of the source audio and the target parameter information of the target video, to obtain a reconstructed image. Further, a synthetic video is generated based on the reconstructed image, the synthetic video including the target object, and the action of the target object being synchronized with the source audio.
Configuring luma-dependent chroma residue scaling for video coding
A method for video processing is provided to include: performing a conversion between a current video block of a video that is a chroma block and a coded representation of the video, wherein, during the conversion, the current video block is constructed based on a first domain and a second domain, and wherein the conversion further includes applying a forward reshaping process and/or an inverse reshaping process to one or more chroma components of the current video block.
Configuring luma-dependent chroma residue scaling for video coding
A method for video processing is provided to include: performing a conversion between a current video block of a video that is a chroma block and a coded representation of the video, wherein, during the conversion, the current video block is constructed based on a first domain and a second domain, and wherein the conversion further includes applying a forward reshaping process and/or an inverse reshaping process to one or more chroma components of the current video block.
Video coding method and system based on multiple reference line intra prediction
A video decoding method includes: obtaining a bitstream including a plurality of coded frames of a video signal; decoding each of the plurality of coded frames into a plurality of super blocks and each of the plurality of super blocks into a plurality of residual blocks; recovering a coded block (CB) for each of the plurality of residual blocks based on multiple reference line intra prediction (MRLP) flags and reference samples included in each coded frame, wherein multiple reference lines are divided into above-side reference lines and left-side reference lines and one above-side reference line and one left-side reference line are selected for intra prediction; reconstructing each frame of the video signal by storing the recovered CB for each of the plurality of residual blocks in a frame buffer; and continuously outputting the reconstructed frames to restore the video signal.
Video coding method and system based on multiple reference line intra prediction
A video decoding method includes: obtaining a bitstream including a plurality of coded frames of a video signal; decoding each of the plurality of coded frames into a plurality of super blocks and each of the plurality of super blocks into a plurality of residual blocks; recovering a coded block (CB) for each of the plurality of residual blocks based on multiple reference line intra prediction (MRLP) flags and reference samples included in each coded frame, wherein multiple reference lines are divided into above-side reference lines and left-side reference lines and one above-side reference line and one left-side reference line are selected for intra prediction; reconstructing each frame of the video signal by storing the recovered CB for each of the plurality of residual blocks in a frame buffer; and continuously outputting the reconstructed frames to restore the video signal.
Prediction refinement for combined inter intra prediction mode
A method for processing a video includes performing a conversion between a current block of visual media data and a corresponding coded representation of the visual media data, wherein the conversion of the current block includes determining whether a use of one or both of a bi-directional optical flow (BIO) technique or a decoder-side motion vector refinement (DMVR) technique to the current block is enabled or disabled, and wherein the determining the use of the BIO technique or the DMVR technique is based on a cost criterion associated with the current block.
Prediction refinement for combined inter intra prediction mode
A method for processing a video includes performing a conversion between a current block of visual media data and a corresponding coded representation of the visual media data, wherein the conversion of the current block includes determining whether a use of one or both of a bi-directional optical flow (BIO) technique or a decoder-side motion vector refinement (DMVR) technique to the current block is enabled or disabled, and wherein the determining the use of the BIO technique or the DMVR technique is based on a cost criterion associated with the current block.
METHOD FOR ENCODING IMMERSIVE IMAGE AND METHOD FOR DECODING IMMERSIVE IMAGE
Disclosed herein is a method for encoding an immersive image. The method includes detecting a non-diffuse surface in a first texture image of a first view, generating an additional texture image from the first texture image based on the detected non-diffuse surface, performing pruning on the additional texture image based on a second texture image of a second view, generating a texture atlas based on the pruned additional texture image, and encoding the texture atlas.
Contour mode prediction
A video decoder and method for supporting a prediction mode for predicting blocks of a video is configured to predict each of the blocks by extrapolating a neighborhood of the respective block into the block along a direction which varies across the respective block.
Contour mode prediction
A video decoder and method for supporting a prediction mode for predicting blocks of a video is configured to predict each of the blocks by extrapolating a neighborhood of the respective block into the block along a direction which varies across the respective block.