Patent classifications
H04N19/635
Processing of motion information in multidimensional signals through motion zones and auxiliary information through auxiliary zones
Computer processor hardware receives zone information specifying multiple elements of a rendition of a signal belonging to a zone. The computer processor hardware also receives motion information associated with the zone. The motion information can be encoded to indicate to which corresponding element in a reference signal each of the multiple elements in the zone pertains. For each respective element in the zone as specified by the zone information, the computer processor hardware utilizes the motion information to derive a corresponding location value in the reference signal; the corresponding location value indicates a location in the reference signal to which the respective element pertains.
Method and apparatus for video-encoding/decoding using filter information prediction
Provided is a scalable video-decoding method based on multiple layers. The scalable video-decoding method according to the present invention comprises: a step of predicting first filter information of a video to be filtered using the information contained in an object layer and/or information contained in another layer, and generating second filter information in accordance with the prediction; and a step of filtering the video to be filtered using the second filter information. According to the present invention, the amount of information being transmitted is reduced, and video compression performance is improved.
Method and apparatus for video-encoding/decoding using filter information prediction
Provided is a scalable video-decoding method based on multiple layers. The scalable video-decoding method according to the present invention comprises: a step of predicting first filter information of a video to be filtered using the information contained in an object layer and/or information contained in another layer, and generating second filter information in accordance with the prediction; and a step of filtering the video to be filtered using the second filter information. According to the present invention, the amount of information being transmitted is reduced, and video compression performance is improved.
Image compression
A method of processing image data for transmittal to a display device involves receiving a frame of image data, the frame being divided into tile groups composed of tiles of pixels, each having a number of colour component values of a first colour space. Each tile includes a number of colour component planes of the first colour space having the colour component values for the pixels forming the tile. Each tile group is processed in an execution unit, formed by arithmetic logic units (ALUs) and a local shared memory, where each ALU includes dedicated register space for use solely by the ALU, and each tile of each tile group is processed by a number of the ALUs of the execution unit. Each ALU performs a reversible colour transformation (SI) on the colour component values from the first colour space to a second colour space and discards the remaining colour component values and then performs a discrete wavelet transformation (S2) on the colour component values of one colour component plane of the second colour space to produce wavelet coefficients, which are quantized (S3) and entropy encoded (S4) into variable length codes. The variable length codes for all the tiles of the tile group are assembled together for transmittal to a display device. Each ALU stores the data at each stage of the processing in its dedicated register space but not in the local shared memory of the execution unit.
Image compression
A method of processing image data for transmittal to a display device involves receiving a frame of image data, the frame being divided into tile groups composed of tiles of pixels, each having a number of colour component values of a first colour space. Each tile includes a number of colour component planes of the first colour space having the colour component values for the pixels forming the tile. Each tile group is processed in an execution unit, formed by arithmetic logic units (ALUs) and a local shared memory, where each ALU includes dedicated register space for use solely by the ALU, and each tile of each tile group is processed by a number of the ALUs of the execution unit. Each ALU performs a reversible colour transformation (SI) on the colour component values from the first colour space to a second colour space and discards the remaining colour component values and then performs a discrete wavelet transformation (S2) on the colour component values of one colour component plane of the second colour space to produce wavelet coefficients, which are quantized (S3) and entropy encoded (S4) into variable length codes. The variable length codes for all the tiles of the tile group are assembled together for transmittal to a display device. Each ALU stores the data at each stage of the processing in its dedicated register space but not in the local shared memory of the execution unit.
IMAGE COMPRESSION
A method of processing image data for transmittal to a display device involves receiving a frame of image data, the frame being divided into tile groups composed of tiles of pixels, each having a number of colour component values of a first colour space. Each tile includes a number of colour component planes of the first colour space having the colour component values for the pixels forming the tile. Each tile group is processed in an execution unit, formed by arithmetic logic units (ALUs) and a local shared memory, where each ALU includes dedicated register space for use solely by the ALU, and each tile of each tile group is processed by a number of the ALUs of the execution unit. Each ALU performs a reversible colour transformation (S1) on the colour component values from the first colour space to a second colour space and discards the remaining colour component values and then performs a discrete wavelet transformation (S2) on the colour component values of one colour component plane of the second colour space to produce wavelet coefficients, which are quantized (S3) and entropy encoded (S4) into variable length codes. The variable length codes for all the tiles of the tile group are assembled together for transmittal to a display device. Each ALU stores the data at each stage of the processing in its dedicated register space but not in the local shared memory of the execution unit.
IMAGE COMPRESSION
A method of processing image data for transmittal to a display device involves receiving a frame of image data, the frame being divided into tile groups composed of tiles of pixels, each having a number of colour component values of a first colour space. Each tile includes a number of colour component planes of the first colour space having the colour component values for the pixels forming the tile. Each tile group is processed in an execution unit, formed by arithmetic logic units (ALUs) and a local shared memory, where each ALU includes dedicated register space for use solely by the ALU, and each tile of each tile group is processed by a number of the ALUs of the execution unit. Each ALU performs a reversible colour transformation (S1) on the colour component values from the first colour space to a second colour space and discards the remaining colour component values and then performs a discrete wavelet transformation (S2) on the colour component values of one colour component plane of the second colour space to produce wavelet coefficients, which are quantized (S3) and entropy encoded (S4) into variable length codes. The variable length codes for all the tiles of the tile group are assembled together for transmittal to a display device. Each ALU stores the data at each stage of the processing in its dedicated register space but not in the local shared memory of the execution unit.
DEBLOCKING OF IMPLICIT TRANSFORM UNIT BOUNDARIES
In one aspect there is disclosed a method of applying deblocking on implicit vertical TU boundaries when the CU width is larger than the maximum TU width and applying deblocking on implicit horizontal TU boundaries when the CU height is larger than the maximum TU height. Some exemplary embodiments include HEVC deblocking and deblocking using longer filters.
DEBLOCKING OF IMPLICIT TRANSFORM UNIT BOUNDARIES
In one aspect there is disclosed a method of applying deblocking on implicit vertical TU boundaries when the CU width is larger than the maximum TU width and applying deblocking on implicit horizontal TU boundaries when the CU height is larger than the maximum TU height. Some exemplary embodiments include HEVC deblocking and deblocking using longer filters.
Perceptually motivated video pre-filter
Techniques for providing perceptually motivated video pre-filtering are described. According to some embodiments, a computer-implemented method includes receiving a request at a content delivery service to encode a video, performing a discrete cosine transform (DCT) on a first pixel block of a frame of the video to generate a first DCT block, and on a second spatial pixel block of the frame, spatially offset from and overlapping with the first pixel block, to generate a second DCT block, performing a wavelet transform on the first DCT block and on the second DCT block to generate wavelet coefficients, performing a filtering on the wavelet coefficients to generate filtered wavelet coefficients, performing an inverse wavelet transform on the filtered wavelet coefficients to generate a filtered DCT block, performing an inverse discrete cosine transform on the filtered DCT block to generate a filtered pixel block, encoding the filtered pixel block to generate an encoded video, and transmitting the encoded video to a viewer device or to a storage location.