H04N19/65

Applications for decoder-side modeling of objects identified in decoded video data

Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, where the model data may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Temporal alignment of MPEG and GLTF media
11695932 · 2023-07-04

An apparatus includes at least one processor; and at least one memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: provide an animation timing extension; wherein the animation timing extension links a graphics library transmission format animation to timed metadata and a metadata track of the timed metadata; wherein the metadata track of the timed metadata is listed with an object associated with moving picture media; and align at least one timeline of the moving picture media with at least one timeline of the graphics library transmission format animation; wherein a sample of the metadata track is used to manipulate an animation event.

IMAGE TRANSMISSION DEVICE, IMAGE RECEPTION DEVICE AND COMPUTER READABLE MEDIUM

An image transmission device (100) includes a compression unit (16) to generate first compression images (5) which are obtained by irreversibly compressing divided images, and to generate second compression images (6) which are obtained by reversibly compressing each of the first compression images (5), a decompression unit (17) to decompress each of the first compression images (5) as decompression images (7), a division sum image generation unit (12) to generate a division sum image (imgSUM), an error sum image generation unit (13) to generate an error sum image (ΔimgSUM), a judgment unit (14) to generate judgment data (Dj), and a transmission unit (15) to transmit the second compression images (6), the division sum image (imgSUM), the error sum image (ΔimgSUM) and the judgment data (Dj).

Encoding digital videos using controllers of data storage devices

In some embodiments, an apparatus includes a memory configured to store data and a controller coupled to the memory. The controller is configured to receive, from a computing device coupled to the apparatus, one or more frames of a digital video. The controller is also configured to analyze one or more components of the memory. The controller is further configured to determine a set of states for the one or more components of the memory based on the analysis of the one or more components of the memory. The controller is further configured to determine a first encoding rate for the digital video from a plurality of encoding rates based on the set of states for the one or more components of the memory. The controller is further configured to encode the digital video based on the first encoding rate and to store the encoded digital video in the memory.
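
The rate selection described above can be sketched as follows. This is a minimal illustration, not the claimed method: the abstract does not say which component states are tracked or how they map to a rate, so the `BlockState` categories, the thresholds in `classify_block`, the `ENCODING_RATES_KBPS` values, and the policy in `select_encoding_rate` are all hypothetical.

```python
from enum import Enum
from typing import Iterable


class BlockState(Enum):
    """Hypothetical health states for one memory component (e.g. a block)."""
    HEALTHY = "healthy"
    WORN = "worn"
    FAILING = "failing"


# Hypothetical set of encoding rates the controller may choose from, ascending.
ENCODING_RATES_KBPS = (2_000, 4_000, 8_000)


def classify_block(erase_count: int, error_rate: float) -> BlockState:
    """Analyze one component; both thresholds are illustrative assumptions."""
    if error_rate > 1e-3:
        return BlockState.FAILING
    if erase_count > 3_000:
        return BlockState.WORN
    return BlockState.HEALTHY


def select_encoding_rate(states: Iterable[BlockState]) -> int:
    """Pick one rate from ENCODING_RATES_KBPS based on the set of states.

    Assumed policy: the smaller the healthy fraction, the lower the rate,
    so that less encoded data is written to the memory.
    """
    states = list(states)
    if not states:
        raise ValueError("at least one component state is required")
    healthy = sum(s is BlockState.HEALTHY for s in states) / len(states)
    if healthy >= 0.9:
        return ENCODING_RATES_KBPS[2]
    if healthy >= 0.5:
        return ENCODING_RATES_KBPS[1]
    return ENCODING_RATES_KBPS[0]
```

A controller following this sketch would call `classify_block` for each component, pass the resulting states to `select_encoding_rate`, and then encode the received frames at the returned rate before storing them.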

Layer ID signaling using extension mechanism

Signaling is described for the layer ID with which each packet of a multi-layered video signal is associated. In particular, this layer association is signaled efficiently while maintaining backward compatibility with codecs in which a certain value of the base layer-ID field, such as base layer-ID value 0, is restricted to be non-extendable. Instead of circumventing this restriction specifically for the non-extendable base layer-ID value, the layer ID of portions of the multi-layer data stream is signaled in an extendable manner by subdividing the base layer-ID field into a first sub-field and a second sub-field: whenever the first sub-field fulfills a predetermined criterion, an extension layer-ID field is provided; otherwise, the extension layer-ID field is omitted.
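
The sub-field mechanism can be illustrated with a small sketch. The abstract gives no field widths, no concrete criterion, and no rule for combining the fields, so everything below is an assumption made for illustration: a 6-bit base field split into a 1-bit first sub-field and a 5-bit second sub-field, an 8-bit extension field, the criterion "first sub-field is non-zero", and a layer ID formed by concatenating the second sub-field with the extension. The function names are hypothetical as well.

```python
from typing import Optional, Tuple

# Illustrative widths only; none of these values come from the abstract.
FIRST_BITS = 1    # first sub-field (most significant bits of the base field)
SECOND_BITS = 5   # second sub-field (remaining bits of the base field)
EXT_BITS = 8      # extension layer-ID field


def criterion(first: int) -> bool:
    """Assumed predetermined criterion: the first sub-field is non-zero."""
    return first != 0


def encode_layer_id(layer_id: int) -> Tuple[int, Optional[int]]:
    """Return (base field, extension field or None if it is omitted)."""
    if layer_id < 0:
        raise ValueError("layer_id must be non-negative")
    if layer_id < (1 << SECOND_BITS):
        # First sub-field stays 0, so the criterion fails and no extension
        # is sent. Base value 0 therefore remains non-extendable.
        return layer_id, None
    second, ext = divmod(layer_id, 1 << EXT_BITS)
    if second >= (1 << SECOND_BITS):
        raise ValueError("layer_id too large for the assumed field widths")
    base = (1 << SECOND_BITS) | second  # set first sub-field to 1
    return base, ext


def decode_layer_id(base: int, ext: Optional[int] = None) -> int:
    """Recover the layer ID from the base field and optional extension."""
    first = base >> SECOND_BITS
    second = base & ((1 << SECOND_BITS) - 1)
    if not criterion(first):
        return second  # extension field is absent
    if ext is None:
        raise ValueError("extension field expected but missing")
    return (second << EXT_BITS) | ext
```

Under these assumptions, layer ID 5 is carried in the base field alone, while layer ID 300 is carried as base field 33 plus extension 44. A legacy parser that reads only the base field still sees value 0 exactly when the layer ID is 0.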

VIDEO COMPRESSION TECHNIQUES FOR RELIABLE TRANSMISSION
20230034884 · 2023-02-02

Systems and methods are presented for reliable transmission of time-sensitive data. In particular, various embodiments provide for the generation of compressed sequential data, where individual instances of a sequence represent differentials from prior instances in that sequence. In order to reduce an amount of data that needs to be transmitted, instances of data (such as individual video frames) can be provided using a prior video frame as a reference, sending only data for those pixel locations where the pixel value differs from the reference frame. A reference frame can include a previously-received and successfully-decoded frame, in order to minimize the impact of dropped, incomplete, or corrupted frames. In order to further reduce data transmission requirements, a reference frame can be selected which is determined to be optimal for the current frame, such as may represent a least amount of data to be transmitted for a given frame.
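
A minimal sketch of the reference-frame idea follows, under stated assumptions: frames are flat lists of pixel values, the sender keeps a dictionary of frames that the receiver has acknowledged as successfully decoded, and "optimal" is taken to mean the reference that leaves the fewest differing pixels. The names `sparse_diff`, `choose_reference`, and `reconstruct` are hypothetical, and a real system would measure the encoded size rather than the pixel count.

```python
from typing import Dict, List, Tuple

Frame = List[int]  # flattened pixel values; an assumption for illustration


def sparse_diff(current: Frame, reference: Frame) -> Dict[int, int]:
    """Map pixel index -> new value, only where current differs from reference."""
    if len(current) != len(reference):
        raise ValueError("frames must have the same number of pixels")
    return {i: c for i, (c, r) in enumerate(zip(current, reference)) if c != r}


def choose_reference(current: Frame,
                     acked: Dict[int, Frame]) -> Tuple[int, Dict[int, int]]:
    """Pick the acknowledged frame that needs the fewest pixel updates.

    `acked` maps frame IDs to frames the receiver is assumed to have
    decoded successfully. Returns (reference frame ID, sparse differential).
    """
    if not acked:
        raise ValueError("no acknowledged reference frame available")
    diffs = {fid: sparse_diff(current, ref) for fid, ref in acked.items()}
    best_id = min(diffs, key=lambda fid: len(diffs[fid]))
    return best_id, diffs[best_id]


def reconstruct(reference: Frame, diff: Dict[int, int]) -> Frame:
    """Receiver side: apply the differential to its copy of the reference."""
    frame = list(reference)
    for index, value in diff.items():
        frame[index] = value
    return frame
```

Because the differential is always taken against a frame the receiver is known to hold, a dropped or corrupted intermediate frame does not prevent later frames from being reconstructed.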