Patent classifications
H04N19/33
Upscaling lower resolution image data for processing
In an example method and system, image data to an image processing module. Image data is read from memory into a down-scaler, which down-scales the image data to a first resolution, which is stored in a first buffer. A region of image data which the image processing module will request is predicted, and image data corresponding to at least part of the predicted region of image data is stored in a first buffer, in a second resolution, higher than the first. When a request for image data is received, it is then determined whether image data corresponding to the requested image data is in the second buffer, and if so, then image data is provided to the image processing module from the second buffer. If not, then image data from the first buffer is up-scaled, and the up-scaled image data is provided to the image processing module.
Upscaling lower resolution image data for processing
In an example method and system, image data to an image processing module. Image data is read from memory into a down-scaler, which down-scales the image data to a first resolution, which is stored in a first buffer. A region of image data which the image processing module will request is predicted, and image data corresponding to at least part of the predicted region of image data is stored in a first buffer, in a second resolution, higher than the first. When a request for image data is received, it is then determined whether image data corresponding to the requested image data is in the second buffer, and if so, then image data is provided to the image processing module from the second buffer. If not, then image data from the first buffer is up-scaled, and the up-scaled image data is provided to the image processing module.
Techniques for bitstream extraction for subpicture in coded video stream
A method, computer program, and computer system are provided for video coding. Video data having one or more subpictures is received. Resampling parameters and spatial scalability parameters corresponding to the subpictures are extracted. The resampling and spatial scalability parameters correspond to one or more flags signaled in a parameter set associated with the video data The video data is decoded based on the extracted resampling and spatial scalability parameters.
Multi-person pose recognition method and apparatus, electronic device, and storage medium
In a multi-person pose recognition method, a to-be-recognized image is obtained, and a circuitous pyramid network is constructed. The circuitous network pyramid includes parallel phases, and each phase includes downsampling network layers, upsampling network layers, and a first residual connection layer to connect the downsampling and upsampling network layers. The phases are interconnected by a second residual connection layer. The circuitous pyramid network is traversed, by extracting a feature map for each phase, and the feature map of the last phase is determined to be the feature map of the to-be-recognized image. Multi-pose recognition is then performed on the to-be-recognized image according to the feature map to obtain a pose recognition result for the to-be-recognized image.
USE OF TRANSFORMED COEFFICIENTS TO PROVIDE EMBEDDED SIGNALLING FOR WATERMARKING
Examples described herein relate to decoding and encoding signals. A method of performing signal decoding operations on one or more portions of a signal is presented. The performing is based at least in part on information embedded in one or more values received in one or more encoded data layers transmitted within a stream of encoded data, wherein said values are associated with transformed coefficients intended to be processed by a decoder for deriving elements of the signal, and wherein said information comprises an indication of watermarking information associated with the signal.
USE OF TRANSFORMED COEFFICIENTS TO PROVIDE EMBEDDED SIGNALLING FOR WATERMARKING
Examples described herein relate to decoding and encoding signals. A method of performing signal decoding operations on one or more portions of a signal is presented. The performing is based at least in part on information embedded in one or more values received in one or more encoded data layers transmitted within a stream of encoded data, wherein said values are associated with transformed coefficients intended to be processed by a decoder for deriving elements of the signal, and wherein said information comprises an indication of watermarking information associated with the signal.
Inference method, inference device and display
An inference method, an inference device, and a display are provided. The method includes: receiving an input signal through a first inference device or a second inference device; performing a first inference operation according to the input signal through the first inference device to obtain first inference information; performing a second inference operation according to the input signal through the second inference device to obtain second inference information; and providing an output signal according to the input signal, the first inference information and the second inference information through the second inference device.
Scalable video coding using inter-layer prediction of spatial intra prediction parameters
The coding efficiency of scalable video coding is increased by substituting missing spatial intra prediction parameter candidates in a spatial neighborhood of a current block of the enhancement layer by use of intra prediction parameters of a co-located block of the base layer signal. By this measure, the coding efficiency for coding the spatial intra prediction parameters is increased due to the improved prediction quality of the set of intra prediction parameters of the enhancement layer, or, more precisely stated, the increased likelihood, that appropriate predictors for the intra prediction parameters for an intra predicted block of the enhancement layer are available thereby increasing the likelihood that the signaling of the intra prediction parameter of the respective enhancement layer block may be performed, on average, with less bits.
Scalable video coding using inter-layer prediction of spatial intra prediction parameters
The coding efficiency of scalable video coding is increased by substituting missing spatial intra prediction parameter candidates in a spatial neighborhood of a current block of the enhancement layer by use of intra prediction parameters of a co-located block of the base layer signal. By this measure, the coding efficiency for coding the spatial intra prediction parameters is increased due to the improved prediction quality of the set of intra prediction parameters of the enhancement layer, or, more precisely stated, the increased likelihood, that appropriate predictors for the intra prediction parameters for an intra predicted block of the enhancement layer are available thereby increasing the likelihood that the signaling of the intra prediction parameter of the respective enhancement layer block may be performed, on average, with less bits.
Overlay processing method in 360 video system, and device thereof
A 360 image data processing method performed by a 360 video receiving device, according to the present invention, comprises the steps of: receiving 360 image data; acquiring information and metadata on an encoded picture from the 360 image data; decoding the picture on the basis of the information on the encoded picture; and rendering the decoded picture and an overlay on the basis of the metadata, wherein the metadata includes overlay-related metadata, the overlay is rendered on the basis of the overlay-related metadata, and the overlay-related metadata includes information on a region of the overlay.