H04N19/16

Encoding device and encoding method

An encoding method includes: determining video format information; (i) setting each frame or each field included in the video as a picture, regardless of whether the video format is interlaced or progressive; (ii) assigning to each of the set pictures, one by one, a POC (picture order count) indicating display order, the POCs being different from one another; and encoding a picture to be encoded, which is a frame or a field, with reference to a previously encoded picture. In the encoding, the video is encoded with a syntax structure that does not depend on the video format, the video format information is encoded in a header of a sequence, which is a unit of the video, and an encoded bit stream is generated.
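Steps (i) and (ii) can be illustrated with a minimal sketch. The `Picture` class, the `set_pictures` function, the top-field-first order, and the POC step of 1 are all assumptions made for illustration; the abstract only requires that every frame or field becomes a picture and that each picture receives a distinct POC.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Picture:
    poc: int           # picture order count (display order), unique per picture
    source_frame: int  # index of the frame this picture came from
    kind: str          # "frame", "top_field", or "bottom_field"


def set_pictures(num_frames: int, interlaced: bool) -> List[Picture]:
    """Treat every frame (progressive) or every field (interlaced) as a picture.

    Assumption: fields are emitted top field first, and the POC increases
    by 1 per picture. The same loop handles both video formats.
    """
    kinds = ("top_field", "bottom_field") if interlaced else ("frame",)
    pictures: List[Picture] = []
    for frame_index in range(num_frames):
        for kind in kinds:
            pictures.append(
                Picture(poc=len(pictures), source_frame=frame_index, kind=kind)
            )
    return pictures
```

In this sketch an interlaced video of three frames yields six pictures and a progressive one yields three, while the code path that assigns the POC is identical for both formats.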

VIDEO DECODING IMPLEMENTATIONS FOR A GRAPHICS PROCESSING UNIT

Video decoding innovations for multithreading implementations and graphics processor unit (“GPU”) implementations are described. For example, for multithreaded decoding, a decoder uses innovations in the areas of layered data structures, picture extent discovery, a picture command queue, and/or task scheduling for multithreading. Or, for a GPU implementation, a decoder uses innovations in the areas of inverse transforms, inverse quantization, fractional interpolation, intra prediction using waves, loop filtering using waves, memory usage and/or performance-adaptive loop filtering. Innovations are also described in the areas of error handling and recovery, determination of neighbor availability for operations such as context modeling and intra prediction, CABAC decoding, computation of collocated information for direct mode macroblocks in B slices, reduction of memory consumption, implementation of trick play modes, and picture dropping for quality adjustment.
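The idea of processing "using waves" can be sketched as a wavefront schedule. The abstract does not define how waves are formed, so the dependency pattern below is an assumption: each macroblock is taken to depend on its left, top-left, top, and top-right neighbors, which gives the wave index `x + 2*y`. The function name `intra_waves` is hypothetical.

```python
from typing import Dict, List, Tuple


def intra_waves(mb_cols: int, mb_rows: int) -> List[List[Tuple[int, int]]]:
    """Group macroblocks (x, y) into waves that can be processed in parallel.

    Assumption: a macroblock depends on its left, top-left, top, and
    top-right neighbors. All of those have a smaller value of x + 2*y,
    so macroblocks sharing a wave index are independent of each other.
    """
    waves: Dict[int, List[Tuple[int, int]]] = {}
    for y in range(mb_rows):
        for x in range(mb_cols):
            waves.setdefault(x + 2 * y, []).append((x, y))
    # Return the waves in the order they must be executed.
    return [waves[index] for index in sorted(waves)]
```

On a GPU, each wave in this sketch would map to one batch of parallel work, with a synchronization point between consecutive waves.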

CODING AND DECODING OF INTERLEAVED IMAGE DATA

Sampled data is packaged in a checkerboard format for encoding and decoding. The sampled data may be quincunx-sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image, which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame and reformats the images for a display device. A 3D video may be encoded using the most advantageous interleaving format such that a preferred quality and compression ratio are reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
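A minimal sketch of checkerboard packaging for two views follows. It assumes images are plain nested lists of equal size and that the first view occupies positions where `row + col` is even; the function names and the `fill` placeholder for missing samples are hypothetical, and no encoder or decoder is involved.

```python
def checkerboard_interleave(left, right):
    """Build one frame from two equally sized images.

    Assumption: the left view supplies samples where (row + col) is even,
    the right view supplies the remaining samples (quincunx sampling).
    """
    return [
        [l if (r + c) % 2 == 0 else rt
         for c, (l, rt) in enumerate(zip(left_row, right_row))]
        for r, (left_row, right_row) in enumerate(zip(left, right))
    ]


def checkerboard_deinterleave(frame, fill=None):
    """Split a checkerboard frame back into two quincunx-sampled views.

    Positions that belong to the other view are set to `fill`; a real
    formatter would interpolate them before display.
    """
    left = [[v if (r + c) % 2 == 0 else fill for c, v in enumerate(row)]
            for r, row in enumerate(frame)]
    right = [[v if (r + c) % 2 == 1 else fill for c, v in enumerate(row)]
             for r, row in enumerate(frame)]
    return left, right
```

Each view keeps half of its samples, so the packed frame has the same dimensions as a single input image and can be passed through a standard codec unchanged.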

Method and apparatus for thumbnail generation for a video device

Systems and processes are provided for generating a thumbnail image. A system includes: a tuner configured to receive and demodulate a video signal that includes a video packet having a packet header and a video frame; a processor configured to generate a modified packet header by modifying the packet header to set a number of reference frames to zero and to designate the video frame as unused for reference, to decode the video frame in response to the modified packet header to extract the video frame, and to generate the thumbnail image in response to the video frame; a memory configured to store the thumbnail image; and a video output configured to couple the thumbnail image to a display in response to a user video search request.
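The header modification step can be sketched as follows. The `PacketHeader` class and its two fields are hypothetical stand-ins; the abstract does not name the actual header fields or the bitstream format, and decoding itself is out of scope here.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class PacketHeader:
    # Hypothetical fields; real header layouts depend on the codec.
    num_ref_frames: int
    used_for_reference: bool


def modify_header_for_thumbnail(header: PacketHeader) -> PacketHeader:
    """Return a copy of the header that lets the frame be decoded alone.

    The number of reference frames is set to zero and the frame is marked
    as unused for reference, as described in the abstract. The original
    header is left untouched.
    """
    return replace(header, num_ref_frames=0, used_for_reference=False)
```

Returning a modified copy rather than editing in place keeps the original header available for normal playback of the same stream.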

Controlling a pan-tilt-zoom camera
11539909 · 2022-12-27

Methods and apparatus, including computer program products, for controlling a Pan-Tilt-Zoom (PTZ) camera. A video stream is provided by the PTZ camera. The video stream includes a main image and a smaller picture-in-picture overlay. The main image and the overlay show the same view. The overlay has both a higher frame rate and a lower resolution compared to the main image. A movement control instruction is received, by the PTZ camera, in response to an evaluation of at least one of the main image and the overlay.

Video encoding and decoding method, apparatus and system

The present disclosure provides a video encoding and decoding method, apparatus, and system. The video encoding method includes: determining an independently decodable view in a to-be-encoded picture according to a configuration file corresponding to the video; dividing the picture into at least two tiles, where an area corresponding to one or more tiles that cover the independently decodable view is an independently decodable area; generating an auxiliary message corresponding to the picture, where the auxiliary message includes a location identifier of the independently decodable area; and encoding all tiles included in the picture to form an encoded video bitstream that includes the auxiliary message. Decoding this encoded video bitstream can lower the performance required of a decoder and improve decoding efficiency.
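Finding the tiles that cover the independently decodable view can be sketched as a rectangle overlap test. The uniform tile grid, the raster-order tile ids, the function names, and the dictionary used as the auxiliary message are all assumptions for illustration; the abstract does not specify the tile layout or the message syntax.

```python
from typing import Dict, List, Tuple


def tiles_covering_view(pic_w: int, pic_h: int, tile_cols: int,
                        tile_rows: int,
                        view: Tuple[int, int, int, int]) -> List[int]:
    """Return raster-order ids of the tiles that overlap the view rectangle.

    Assumption: tiles form a uniform grid. `view` is (x, y, width, height)
    in pixels.
    """
    x, y, w, h = view
    col_edges = [pic_w * i // tile_cols for i in range(tile_cols + 1)]
    row_edges = [pic_h * i // tile_rows for i in range(tile_rows + 1)]
    ids = []
    for r in range(tile_rows):
        for c in range(tile_cols):
            overlaps_x = col_edges[c] < x + w and x < col_edges[c + 1]
            overlaps_y = row_edges[r] < y + h and y < row_edges[r + 1]
            if overlaps_x and overlaps_y:
                ids.append(r * tile_cols + c)
    return ids


def build_auxiliary_message(tile_ids: List[int]) -> Dict[str, List[int]]:
    """Hypothetical auxiliary message carrying the location identifier."""
    return {"independently_decodable_tile_ids": tile_ids}
```

A decoder that only needs the independently decodable view could, under these assumptions, read the message and decode just the listed tiles instead of the whole picture.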