Patent classifications
H04N19/587
Video frame pulldown based on frame analysis
The described technology is directed towards generating a new video image sequence (e.g., for playback at 30 frames per second) based on an existing video image sequence (e.g., originated for playback at 24 frames per second). The technology is based on processing frames, e.g., adjacent pairs of frames in a four-frame sequence, to obtain candidate frames for selecting a similar candidate frame to insert into the original sequence to create the new sequence (e.g., a five-frame sequence). Aspects include selecting a repeated frame to insert or creating a new frame from existing frames to insert, to generate the new sequence based on a difference/scoring comparison.
DIRECTED INTERPOLATION AND DATA POST-PROCESSING
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
DIRECTED INTERPOLATION AND DATA POST-PROCESSING
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
Transmission device, transmitting method, reception device, and receiving method
In the case where image data of a super-high definition service is transmitted without scalable coding, image data suitable to own display capability in a receiver not supporting the super-high definition service can be easily obtained. A container in a predetermined format having a video stream including encoded image data is transmitted. Auxiliary information for downscaling a spatial and/or temporal resolution of the image data is inserted into the video stream. For example, the auxiliary information indicates a limit of accuracy for a motion vector included in the encoded image data. Further, for example, the auxiliary information identifies a picture to be selected at the time of downscaling the temporal resolution at a predetermined ratio.
Transmission device, transmitting method, reception device, and receiving method
In the case where image data of a super-high definition service is transmitted without scalable coding, image data suitable to own display capability in a receiver not supporting the super-high definition service can be easily obtained. A container in a predetermined format having a video stream including encoded image data is transmitted. Auxiliary information for downscaling a spatial and/or temporal resolution of the image data is inserted into the video stream. For example, the auxiliary information indicates a limit of accuracy for a motion vector included in the encoded image data. Further, for example, the auxiliary information identifies a picture to be selected at the time of downscaling the temporal resolution at a predetermined ratio.
Method and apparatus of matrix based intra prediction in image and video processing
An image or video processing method comprises receiving input data of a current block, mapping an MIP mode index to a default mode, and encoding or decoding the current block by referencing the default mode. Another image or video processing method comprises receiving input data of a current block coded or to be coded by an MIP mode, and parsing or signaling a transpose flag for the current block indicating whether transposing is applied in coding the current block. Reference samples of the current block are prepared from neighboring boundary samples according to the transpose flag. A matrix is selected to be multiplied to the reference samples to generate predicted samples. A transposing process is adaptively applied to the current block according to the transpose flag, and the predicted samples are used to encode or decode the current block.
CODING AND DECODING OF INTERLEAVED IMAGE DATA
Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
Methods for generating video-and audience-specific encoding ladders with audio and video just-in-time transcoding
A method including: populating an encoding ladder with a subset of bitrate-resolution pairs, from a set of bitrate-resolution pairs, based on a distribution of audience bandwidths; receiving a first request for a first playback segment, at a first bitrate-resolution pair in a encoding ladder, in the video from a first device; in response to determining an absence of video segments, at the first bitrate-resolution pair and corresponding to the segment, in a first rendition cache: identifying a first set of mezzanine segments, in the video, corresponding to the first playback segment; assigning the first set of mezzanine segments to a set of workers for transcoding into a first set of video segments according to the first bitrate-resolution pair; storing the first set of video segments in the first rendition cache; and based on the first request, releasing the first set of video segments to the first device.
Per-Title Encoding Using Spatial and Temporal Resolution Downscaling
Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.
IMAGE PROCESSING APPARATUS AND METHOD
An image of a fixed subpicture being a subpicture in which a position of a reference pixel is fixed in a time direction, in subpictures that are partial areas obtained by dividing a picture, is encoded with a resolution variable in a time direction. Furthermore, coded data obtained by encoding an image of a fixed subpicture being a subpicture in which a position of a reference pixel is fixed in a time direction, in subpictures that are partial areas obtained by dividing a picture, with a resolution variable in a time direction is decoded to generate the image of the resolution of the fixed subpicture. The present disclosure can be applied to, for example, an image processing apparatus, an image encoding apparatus, an image decoding apparatus, an information processing apparatus, an image processing method, an information processing method, or the like.