Patent classifications
H04N19/553
Video frame rate conversion using streamed metadata
A video server generates metadata representative of interpolation parameters for portions of a first frame representative of a scene in a stream of frames including the first frame. The interpolation parameters are used to generate at least one interpolated frame representative of the scene subsequent to the first frame and prior to a second frame in the stream of frames. The video server incorporates the metadata into the stream and transmits the stream including the multiplexed metadata. A video client receives the first frame representative the stream of frames including the metadata. The video client generates one or more interpolated frames representative of the scene subsequent to the first frame and prior to a second frame in the stream of frames based on the first frame and the metadata. The video client displays the first frame, the one or more interpolated frames, and the second frame.
Apparatuses and Methods for Encoding and Decoding a Video Coding Block of a Video Signal
A decoding apparatus partitions a video coding block based on coding information into two or more segments including a first segment and a second segment. The coding information comprises a first segment motion vector associated with the first segment and a second segment motion vector associated with the second segment. A co-located first segment in a first reference frame is determined based on the first segment motion vector and a co-located second segment in a second reference frame is determined based on the second segment motion vector. A predicted video coding block is generated based on the co-located first segment and the co-located second segment. A divergence measure is determined based on the first segment motion vector and the second segment motion vector and a first or second filter is applied depending on the divergence measure to the predicted video coding block.
VIDEO MOTION PROCESSING INCLUDING STATIC DETERMINATION, OCCLUSION DETECTION, FRAME RATE CONVERSION, AND ADJUSTING COMPRESSION RATIO
Systems and methods may provide for occlusion detection in frame rate conversion. Detecting the occlusion allows frame rate conversion to be more accurately performed. In some embodiments, one or more stereoscopic depth cameras may be used to determine the depth of a moving object to more accurately determine the occlusion. In some embodiments, the compression ratio may be adjusted to balance the frame rate and power to help ensure compliance with a power budget. In at least some embodiments, the motion of a camera may be passed from a 3D render pipe to an encoder to avoid motion calculation and thereby saving power.
VIDEO MOTION PROCESSING INCLUDING STATIC DETERMINATION, OCCLUSION DETECTION, FRAME RATE CONVERSION, AND ADJUSTING COMPRESSION RATIO
Systems and methods may provide for occlusion detection in frame rate conversion. Detecting the occlusion allows frame rate conversion to be more accurately performed. In some embodiments, one or more stereoscopic depth cameras may be used to determine the depth of a moving object to more accurately determine the occlusion. In some embodiments, the compression ratio may be adjusted to balance the frame rate and power to help ensure compliance with a power budget. In at least some embodiments, the motion of a camera may be passed from a 3D render pipe to an encoder to avoid motion calculation and thereby saving power.
Image processing method and apparatus
An image processing method includes obtaining multiple video frames, where the multiple video frames are collected from a same scene at different angles and determining a depth map of each video frame according to corresponding pixels among the multiple video frames; supplementing background missing regions of the multiple video frames according to depth maps of the multiple video frames, to obtain supplemented video frames of the multiple video frames and depth maps of the multiple supplemented video frames. The method also includes generating an alpha image of each video frame according to an occlusion relationship between each of the multiple video frames and a supplemented video frame of each video frame in a background missing region and generating a browsing frame at a specified browsing angle according to the multiple video frames, the supplemented video frames of the multiple video frames, and alpha images of the multiple video frames.
Image processing method and apparatus
An image processing method includes obtaining multiple video frames, where the multiple video frames are collected from a same scene at different angles and determining a depth map of each video frame according to corresponding pixels among the multiple video frames; supplementing background missing regions of the multiple video frames according to depth maps of the multiple video frames, to obtain supplemented video frames of the multiple video frames and depth maps of the multiple supplemented video frames. The method also includes generating an alpha image of each video frame according to an occlusion relationship between each of the multiple video frames and a supplemented video frame of each video frame in a background missing region and generating a browsing frame at a specified browsing angle according to the multiple video frames, the supplemented video frames of the multiple video frames, and alpha images of the multiple video frames.
MULTI-VIEW CODING WITH EFFECTIVE HANDLING OF RENDERABLE PORTIONS
A proposed intermediate way of handling the renderable portion of the first view results in more efficient coding. Instead of omitting the coding of the renderable portion completely, even more efficient coding of multi-view signals entails merely suppressing the coding of the residual signal within the renderable portion, whereas the prediction parameter coding still takes place from the non-renderable portion of the multi-view signal across the renderable portion so that prediction parameters for the renderable portion may be exploited for predicting parameters for the non-renderable portion. The additional coding rate for transmitting the prediction parameters for the renderable portion may be kept low as this merely aims at forming a continuation of the parameter history across the renderable portion to serve as a basis for prediction parameters of other portions of the multi-view signal.
MULTI-VIEW CODING WITH EFFECTIVE HANDLING OF RENDERABLE PORTIONS
A proposed intermediate way of handling the renderable portion of the first view results in more efficient coding. Instead of omitting the coding of the renderable portion completely, even more efficient coding of multi-view signals entails merely suppressing the coding of the residual signal within the renderable portion, whereas the prediction parameter coding still takes place from the non-renderable portion of the multi-view signal across the renderable portion so that prediction parameters for the renderable portion may be exploited for predicting parameters for the non-renderable portion. The additional coding rate for transmitting the prediction parameters for the renderable portion may be kept low as this merely aims at forming a continuation of the parameter history across the renderable portion to serve as a basis for prediction parameters of other portions of the multi-view signal.
P frame-based multi-hypothesis motion compensation method
A P frame-based multi-hypothesis motion compensation method includes: taking an encoded image block adjacent to a current image block as a reference image block and obtaining a first motion vector of the current image block by using a motion vector of the reference image block, the first motion vector pointing to a first prediction block; taking the first motion vector as a reference value and performing joint motion estimation on the current image block to obtain a second motion vector of the current image block, the second motion vector pointing to a second prediction block; and performing weighted averaging on the first prediction block and the second prediction block to obtain a final prediction block of the current image block. The method increases the accuracy of the obtained prediction block of the current image block without increasing the code rate.
P frame-based multi-hypothesis motion compensation method
A P frame-based multi-hypothesis motion compensation method includes: taking an encoded image block adjacent to a current image block as a reference image block and obtaining a first motion vector of the current image block by using a motion vector of the reference image block, the first motion vector pointing to a first prediction block; taking the first motion vector as a reference value and performing joint motion estimation on the current image block to obtain a second motion vector of the current image block, the second motion vector pointing to a second prediction block; and performing weighted averaging on the first prediction block and the second prediction block to obtain a final prediction block of the current image block. The method increases the accuracy of the obtained prediction block of the current image block without increasing the code rate.