H04N19/21

IMAGE DECODING METHOD, IMAGE CODING METHOD, IMAGE DECODING APPARATUS, IMAGE CODING APPARATUS, AND IMAGE CODING AND DECODING APPARATUS

The image decoding method includes determining a context for use in a current block to be processed, from among a plurality of contexts, wherein in the determining: the context is determined under a condition that control parameters of a left block and an upper block are used, when the signal type is a first type; and the context is determined under a third condition that the control parameter of the upper block is not used and a hierarchical depth of a data unit to which the control parameter of the current block belongs is used, when the signal type is a third type, and the third type is one or more of (i) “merge_flag”, (ii) “ref_idx_l0” or “ref_idx_l1”, (iii) “inter_pred_flag”, (iv) “mvd_l0” or “mvd_l1”, (v) “intra_chroma_pred_mode”, (vi) “cbf_luma”, and (vii) “cbf_cb” or “cbf_cr”.

Efficient coding of depth transitions in 3D video

A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.

Efficient coding of depth transitions in 3D video

A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.

IMAGE ALIGNMENT METHOD AND DEVICE THEREFOR

Provided is a method for automatically performing image alignment without a user input. An image alignment method performed by an image alignment device, according to one embodiment of the present invention, can comprise the steps of: recognizing at least one person in an inputted image; determining a person-of-interest among the recognized persons; and performing image alignment, on the basis of the person-of-interest, on the inputted image, wherein the image alignment is performed without an input of a user of the image alignment deice for the image alignment.

Segmenting generic foreground objects in images and videos

A method, system and computer program product for segmenting generic foreground objects in images and videos. For segmenting generic foreground objects in videos, an appearance stream of an image in a video frame is processed using a first deep neural network. Furthermore, a motion stream of an optical flow image in the video frame is processed using a second deep neural network. The appearance and motion streams are then joined to combine complementary appearance and motion information to perform segmentation of generic objects in the video frame. Generic foreground objects are segmented in images by training a convolutional deep neural network to estimate a likelihood that a pixel in an image belongs to a foreground object. After receiving the image, the likelihood that the pixel in the image is part of the foreground object as opposed to background is then determined using the trained convolutional deep neural network.

IMAGE DECODING METHOD, IMAGE CODING METHOD, IMAGE DECODING APPARATUS, IMAGE CODING APPARATUS, AND IMAGE CODING AND DECODING APPARATUS

The image decoding method includes determining a context for use in a current block to be processed, from among a plurality of contexts, wherein in the determining: the context is determined under a condition that control parameters of a left block and an upper block are used, when the signal type is a first type; and the context is determined under a third condition that the control parameter of the upper block is not used and a hierarchical depth of a data unit to which the control parameter of the current block belongs is used, when the signal type is a third type, and the third type is one or more of (i) “merge_flag”, (ii) “ref_idx_I0” or “ref_idx_I1”, (iii) “inter_pred_flag”, (iv) “mvd_I0” or “mvd_I1”, (v) “intra_chroma_pred_mode”, (vi) “cbf_luma”, and (vii) “cbf_cb” or “cbf_cr”.

IMAGE DECODING METHOD, IMAGE CODING METHOD, IMAGE DECODING APPARATUS, IMAGE CODING APPARATUS, AND IMAGE CODING AND DECODING APPARATUS

The image decoding method includes determining a context for use in a current block to be processed, from among a plurality of contexts, wherein in the determining: the context is determined under a condition that control parameters of a left block and an upper block are used, when the signal type is a first type; and the context is determined under a third condition that the control parameter of the upper block is not used and a hierarchical depth of a data unit to which the control parameter of the current block belongs is used, when the signal type is a third type, and the third type is one or more of (i) “merge_flag”, (ii) “ref_idx_I0” or “ref_idx_I1”, (iii) “inter_pred_flag”, (iv) “mvd_I0” or “mvd_I1”, (v) “intra_chroma_pred_mode”, (vi) “cbf_luma”, and (vii) “cbf_cb” or “cbf_cr”.

FACE-BASED FRAME RATE UPSAMPLING FOR VIDEO CALLS
20220222968 · 2022-07-14 · ·

A method includes receiving a set of video frames that correspond to a video, including a first video frame and a second video frame that each include a face, wherein the second video frame is subsequent to the first video frame. The method further includes performing face tracking on the first video frame to identify a first face resampling keyframe and performing face tracking on the second video frame to identify a second face resampling keyframe. The method further includes deriving an interpolation amount. The method further includes determining a first interpolated face frame based on the first face resampling keyframe and the interpolation amount. The method further includes determining a second interpolated face frame based on the second face resampling keyframe and the interpolation amount. The method further includes rendering an interpolated first face and an interpolated second face. The method further includes displaying a final frame.

EFFICIENT CODING OF DEPTH TRANSITIONS IN 3D VIDEO

A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.

EFFICIENT CODING OF DEPTH TRANSITIONS IN 3D VIDEO

A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.