Patent classifications
H04N19/48
Multimedia distribution system
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, a complete index that can be used to locate each data chunk in each of these tracks and an abridged index that can enable the location of a subset of the data chunks in each track, data that can be used to generate a menu interface to access the contents of the file and ‘meta data’ concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and ‘meta data’ external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames, a first index that includes information indicative of the location within the file and characteristics of each encoded video frame and a separate second index that includes information indicative of the location within the file of a subset of the encoded video frames.
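The two-index arrangement in the last sentence of this abstract can be illustrated with a small sketch. It is not the file format defined by the patent: the `FrameEntry` fields, the choice of key frames as the indexed subset, and the function names `build_indexes` and `seek` are all assumptions made for illustration.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class FrameEntry:
    offset: int    # byte offset of the encoded frame within the file
    size: int      # encoded size in bytes
    is_key: bool   # hypothetical "characteristic" stored per frame


def build_indexes(frames):
    """frames: iterable of (size, is_key) pairs in file order.

    Returns (full_index, abridged_index). The full index has one entry per
    frame; the abridged index lists (frame_number, offset) for a subset only.
    Assumption: the subset is the key frames.
    """
    full, abridged, offset = [], [], 0
    for number, (size, is_key) in enumerate(frames):
        full.append(FrameEntry(offset, size, is_key))
        if is_key:
            abridged.append((number, offset))
        offset += size
    return full, abridged


def seek(abridged, target_frame):
    """Return the last abridged entry at or before target_frame."""
    numbers = [number for number, _ in abridged]
    pos = bisect_right(numbers, target_frame) - 1
    if pos < 0:
        raise ValueError("no indexed frame at or before the target")
    return abridged[pos]
```

A player could use the short abridged index for a coarse seek and consult the full index only when it needs per-frame detail.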
Scalable Video Coding Using Inter-Layer Prediction of Spatial Intra Prediction Parameters
The coding efficiency of scalable video coding is increased by substituting missing spatial intra prediction parameter candidates in a spatial neighborhood of a current block of the enhancement layer with intra prediction parameters of a co-located block of the base layer signal. This improves the prediction quality of the set of intra prediction parameters of the enhancement layer: appropriate predictors for the intra prediction parameters of an intra predicted enhancement layer block are more likely to be available, so the intra prediction parameter of that block can be signaled, on average, with fewer bits.
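The substitution step can be sketched as follows. This is a simplified illustration rather than the method claimed in the patent: the use of exactly two spatial neighbors (left and above), the integer mode numbers, the fallback `default` mode, and the function name `candidate_modes` are assumptions.

```python
def candidate_modes(left, above, base_colocated, default=0):
    """Build the intra mode candidate list for an enhancement layer block.

    Each argument is an intra prediction mode (int), or None when it is
    unavailable (for example, the neighbor is not intra coded).
    A missing spatial candidate is replaced by the mode of the co-located
    base layer block; `default` is a hypothetical last-resort fallback.
    """
    candidates = []
    for neighbor in (left, above):
        if neighbor is not None:
            candidates.append(neighbor)
        elif base_colocated is not None:
            candidates.append(base_colocated)
        else:
            candidates.append(default)
    return candidates
```

When the actual mode of the block matches one of the candidates, an encoder can signal a short candidate index instead of the full mode, which is where the bit savings described above would come from.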
Cascade convolutional neural network
In one embodiment, an apparatus comprises a communication interface and a processor. The communication interface is to communicate with a plurality of devices. The processor is to: receive compressed data from a first device, wherein the compressed data is associated with visual data captured by sensor(s); perform a current stage of processing on the compressed data using a current CNN, wherein the current stage of processing corresponds to one of a plurality of processing stages associated with the visual data, and wherein the current CNN corresponds to one of a plurality of CNNs associated with the plurality of processing stages; obtain an output associated with the current stage of processing; determine, based on the output, whether processing associated with the visual data is complete; if the processing is complete, output a result associated with the visual data; if the processing is incomplete, transmit the compressed data to a second device.
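The stage-by-stage control flow described above can be sketched with plain callables standing in for the CNNs. The confidence threshold used as the completion test, the `(label, confidence)` output shape, and the name `run_cascade` are assumptions; the abstract only states that completion is determined from the stage output.

```python
from typing import Callable, Optional, Sequence, Tuple

# A stage stands in for one device's CNN: it maps compressed data to
# (label, confidence). Real stages would run a neural network.
Stage = Callable[[bytes], Tuple[str, float]]


def run_cascade(
    compressed: bytes, stages: Sequence[Stage], threshold: float = 0.9
) -> Tuple[Optional[str], int]:
    """Run the stages in order until one reports a confident result.

    Returns (label, index_of_finishing_stage), or (None, len(stages)) if
    no stage was confident. Assumption: "processing is complete" means the
    stage's confidence reached `threshold`.
    """
    for index, stage in enumerate(stages):
        label, confidence = stage(compressed)
        if confidence >= threshold:
            return label, index
        # Otherwise the same compressed data is handed to the next device.
    return None, len(stages)
```

In a deployment each stage would run on a different device, and the loop's hand-off would be a network transmission rather than a local call.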
SCALAR QUANTIZER DECISION SCHEME FOR DEPENDENT SCALAR QUANTIZATION
When dependent scalar quantization is used, the choice of the quantizer depends on the decoding of the preceding transform coefficient, and the entropy decoding of a transform coefficient depends on the quantizer choice. To maintain high throughput in hardware implementations of transform coefficient entropy coding, several decision schemes for the scalar quantizer are proposed. In one implementation, the state transition and the context model selection are based only on regular coded bins. For example, the state transition can be based on the sum of the SIG, gt1 and gt2 flags, on the exclusive-or of the SIG, gt1 and gt2 flags, or on only the gt1 or gt2 flag. When a block of transform coefficients is coded, the regular mode bins can be coded first in one or more scan passes, and the remaining bypass coded bins are grouped together in one or more further scan passes.
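One of the variants named above, a state transition driven by the exclusive-or of the SIG, gt1 and gt2 flags, can be sketched as below. The four-state transition table, the mapping of states 0 and 1 to one quantizer and states 2 and 3 to the other, and the flag definitions (level magnitude greater than 0, 1 and 2) are assumptions based on common dependent quantization designs, not details taken from this abstract.

```python
# Assumed four-state machine: TRANSITIONS[state][bit] -> next state.
TRANSITIONS = ((0, 2), (2, 0), (1, 3), (3, 1))


def flags(level):
    """Return (SIG, gt1, gt2) for a quantized level (assumed definitions)."""
    magnitude = abs(level)
    return int(magnitude > 0), int(magnitude > 1), int(magnitude > 2)


def next_state(state, level):
    """Advance the state using only the XOR of the regular coded flags."""
    sig, gt1, gt2 = flags(level)
    return TRANSITIONS[state][sig ^ gt1 ^ gt2]


def quantizer_sequence(levels, state=0):
    """Return which quantizer (0 or 1) applies to each level in scan order."""
    chosen = []
    for level in levels:
        chosen.append(0 if state < 2 else 1)  # states 0,1 -> Q0; 2,3 -> Q1
        state = next_state(state, level)
    return chosen
```

Because the driving bit depends only on the three flags, the next state is known as soon as the regular coded bins of a coefficient are decoded, without waiting for the bypass coded remainder.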
Algorithm management blockchain
In one embodiment, an apparatus comprises a communication interface, a memory, and a processor. The communication interface is to communicate with one or more devices. The memory is to store a device identity blockchain. The processor is to: receive a device identity transaction from a first device, wherein the device identity transaction comprises a device identity; compute a hash of the device identity; determine, based on the hash, whether the device identity is registered in the device identity blockchain; and upon a determination that the device identity is not registered in the device identity blockchain, add the device identity transaction to the device identity blockchain.
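The registration check can be sketched as a minimal in-memory chain. The class name `DeviceIdentityChain`, the transaction layout (a dict with a `device_identity` string), the use of SHA-256, and the block structure are all assumptions for illustration; the abstract does not specify them.

```python
import hashlib
import json


class DeviceIdentityChain:
    """Toy append-only chain keyed by the hash of each device identity."""

    def __init__(self):
        self.blocks = []
        self._registered = set()  # hashes of identities already on the chain

    def submit(self, transaction):
        """Add the transaction if its identity is new; return True if added."""
        identity = transaction["device_identity"]
        identity_hash = hashlib.sha256(identity.encode("utf-8")).hexdigest()
        if identity_hash in self._registered:
            return False  # already registered, reject the duplicate
        previous = self.blocks[-1]["block_hash"] if self.blocks else "0" * 64
        body = {
            "previous": previous,
            "identity_hash": identity_hash,
            "transaction": transaction,
        }
        # Each block commits to its predecessor through the previous hash.
        encoded = json.dumps(body, sort_keys=True).encode("utf-8")
        body["block_hash"] = hashlib.sha256(encoded).hexdigest()
        self.blocks.append(body)
        self._registered.add(identity_hash)
        return True
```

A real system would add consensus among nodes and signature checks on the transaction; this sketch covers only the hash, lookup and append steps.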
LOSSLESS COMPRESSION OF DIGITAL IMAGES USING PRIOR IMAGE CONTEXT
Techniques for lossless compression of a digital image using prior image context.
SYSTEM AND METHOD TO ESTIMATE BLOCKINESS IN TRANSFORM-BASED VIDEO ENCODING
A method for estimating blockiness in a video frame of transform-based video encoding includes: obtaining a bitstream of a transform coded video signal, the signal being partitioned into video frames and all operations being performed on a per frame basis, wherein coefficients constituting transforms encoded in the bitstream of the video frames are read; averaging the coefficients of the transforms encoded in the bitstream into one averaged transform matrix per transform block size i; generating or making available one weighting matrix per averaged transform of block size i; computing intermediate weighted average transform matrices; processing all members of each weighted and averaged transform matrix into a single value per transform of block size i, to obtain intermediate signals; and computing a single value by weighting values of the intermediate signals according to an area in the respective video frame and adding up the weighted values of the intermediate signals.
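The sequence of steps above can be sketched in pure Python. Several details are assumptions, since the abstract does not fix them: coefficient magnitudes are averaged, each weighted matrix is reduced to a single value by summing its elements, the area weight is the share of the frame covered by blocks of a given size, and the weighting matrices are supplied by the caller.

```python
def average_matrix(blocks):
    """Element-wise mean of coefficient magnitudes over same-sized blocks."""
    size = len(blocks[0])
    count = len(blocks)
    return [
        [sum(abs(block[r][c]) for block in blocks) / count for c in range(size)]
        for r in range(size)
    ]


def blockiness(blocks_by_size, weights_by_size, frame_area):
    """Combine per-block-size values into one score for the frame.

    blocks_by_size: {size: [size x size coefficient matrices]}
    weights_by_size: {size: size x size weighting matrix} (hypothetical values)
    frame_area: frame area in pixels
    """
    score = 0.0
    for size, blocks in blocks_by_size.items():
        averaged = average_matrix(blocks)
        weights = weights_by_size[size]
        # Weighted averaged matrix, reduced to one value by summation.
        value = sum(
            averaged[r][c] * weights[r][c]
            for r in range(size)
            for c in range(size)
        )
        area_share = len(blocks) * size * size / frame_area
        score += area_share * value
    return score
```

For example, a frame made of two 2x2 blocks would be passed with `frame_area=8`, so the single block size contributes with an area share of 1.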