Patent classifications
H04N21/85406
DIFFERENT ATLAS PACKINGS FOR VOLUMETRIC VIDEO
Methods, devices, and a stream are disclosed for encoding and decoding a scene (such as a point cloud) in the context of patch-based transmission of volumetric video content. Attributes of points of the scene are projected onto patches. Every point has a geometry attribute. For other attributes, such as transparency or displacement attributes, some points may have no value. According to the present principles, each attribute is encoded in a different atlas with its own layout. This saves pixel rate in the memory of the renderer.
IMAGE ENCODING/DECODING METHOD AND DEVICE BASED ON HYBRID NAL UNIT TYPE, AND RECORDING MEDIUM FOR STORING BITSTREAM
An image encoding/decoding method and apparatus are provided. The image decoding method includes obtaining, from a bitstream, network abstraction layer (NAL) unit type information of at least one NAL unit including coded image data, determining at least one NAL unit type of one or more slices in the current picture based on the obtained NAL unit type information, and decoding the current picture based on the determined NAL unit type. The current picture is determined to be a random access skipped leading (RASL) picture, based on the determined NAL unit type including a RASL picture NAL unit type (RASL_NUT). When an intra random access point (IRAP) picture associated with the RASL picture is a first picture in decoding order, the RASL picture is decoded, based on the RASL picture including one or more slices having a random access decodable leading (RADL) picture NAL unit type (RADL_NUT).
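The decision described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed method: the `NalType` enum, the function names, and the behaviour for cases the abstract does not cover (non-RASL pictures, and RASL pictures whose IRAP picture is not first in decoding order) are assumptions made for the example.

```python
from enum import Enum


class NalType(Enum):
    # Symbolic names only; numeric codes are deliberately left out.
    TRAIL = "TRAIL_NUT"
    RADL = "RADL_NUT"
    RASL = "RASL_NUT"


def is_rasl_picture(slice_types):
    """A picture is treated as RASL if any of its slices carries RASL_NUT."""
    return NalType.RASL in slice_types


def should_decode(slice_types, irap_is_first_in_decoding_order):
    """Return True if the picture is decoded under the sketched rule."""
    if not is_rasl_picture(slice_types):
        return True  # assumption: non-RASL pictures are always decoded
    if not irap_is_first_in_decoding_order:
        return True  # assumption: references are available, so decode
    # IRAP is first in decoding order: decode only a mixed RASL picture,
    # i.e. one that also contains at least one RADL_NUT slice.
    return NalType.RADL in slice_types
```

For example, `should_decode([NalType.RASL, NalType.RADL], True)` selects the mixed picture for decoding, while a picture made only of RASL_NUT slices is skipped in the same situation.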
TIME SIGNALING FOR MEDIA STREAMING
Systems and techniques are described for processing media content, such as samples of sparse media content whose sample durations can be unknown at the time of decoding the samples. The techniques include obtaining, at a current time instance, a current fragment comprising at least a current time component. From the current time component, a modified duration is obtained for at least one media sample, the modified duration indicating a time duration by which a presentation of a previous media sample of a previous fragment is to be extended or reduced in relation to the current time instance. The at least one media sample is presented for a time duration based on the modified duration.
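One possible reading of the modified duration is sketched below. The `Sample` type, the field names, and the interpretation that the previous sample's presentation ends at the current time instance plus the modified duration are assumptions for illustration; the abstract does not fix these details.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    start: float     # presentation start time, in seconds
    duration: float  # provisional duration signalled with the sample


def apply_modified_duration(prev: Sample, current_time: float, modified: float) -> Sample:
    """Extend (modified > 0) or reduce (modified < 0) the previous sample.

    Assumption: the new presentation end is current_time + modified.
    """
    new_end = current_time + modified
    if new_end < prev.start:
        raise ValueError("modified duration would end the sample before it starts")
    return Sample(start=prev.start, duration=new_end - prev.start)
```

With a previous sample starting at 10.0 s and a current time instance of 12.0 s, a modified duration of 3.0 gives a total presentation of 5.0 s, and a modified duration of -1.0 gives 1.0 s.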
INFORMATION PROCESSING DEVICE AND METHOD
A scene descriptive file describing a scene of 3D object content is generated. In the scene descriptive file, timed metadata identification information, which indicates that metadata of an associated external file changes in a time direction, is stored in an MPEG_media extension, and timed metadata access information, which associates a camera object with the metadata, is stored in the camera object. Furthermore, timed metadata that changes in the time direction is acquired on the basis of the timed metadata identification information and the timed metadata access information stored in the scene descriptive file, and a display image of the 3D object content is generated on the basis of the acquired timed metadata. The present disclosure is applicable to, for example, an information processing device, an information processing method, or the like.
PROVIDING SEMANTIC INFORMATION WITH ENCODED IMAGE DATA
A method (400) performed by a decoder. The method includes the decoder receiving (s402) a plurality of Network Abstraction Layer, NAL, units, wherein the plurality of NAL units comprises: i) one or more Video Coding Layer, VCL, NAL units comprising pixel data for one or more pictures and ii) a first non-VCL NAL unit, characterized in that the first non-VCL NAL unit comprises: i) at least a first syntax element identifying at least a first data type, DT1, and ii) semantic information that comprises at least a first feature for one or more machine vision tasks, wherein the first feature comprises at least first data of the first data type. The method also includes the decoder obtaining (s404) the first feature from the first non-VCL NAL unit.
INFORMATION PROCESSING APPARATUS AND METHOD
The present disclosure relates to an information processing apparatus and method capable of suppressing an increase in load of reproduction processing.
Tile management information is generated by using tile identification information that indicates a tile of a point cloud corresponding to a data unit of a bitstream of the point cloud, which expresses an object having a three-dimensional shape as a set of points. The tile management information is information for managing the tile corresponding to a subsample that includes one or more consecutive data units of the bitstream stored as a sample in a file. A file that stores the bitstream and the tile management information is then generated. The present disclosure can be applied to, for example, an information processing apparatus, an information processing method, or the like.
METHOD FOR DASH MEDIA STREAMING, ELECTRIC DEVICE AND STORAGE MEDIUM
Embodiments of the present disclosure relate to communication technology. A method for DASH media streaming, an electronic device and a storage medium are provided. A first MPD file, a media initialization file, and a first media slice file of a DASH media stream to be played are acquired from a server by sending at most two acquisition requests to the server. The media initialization file and the first media slice file are acquired through one of the at most two acquisition requests. Subsequent MPD files are requested from the server according to the first MPD file, and other media slice files in addition to the first media slice file are acquired from the server according to the subsequent MPD files.
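The start-up sequence in this abstract can be sketched with a stand-in server that counts requests. `FakeServer`, its `get` method, and the file names are hypothetical and exist only to show the request budget; a real client would issue HTTP requests, and the abstract does not say how the initialization file and first slice are bundled.

```python
class FakeServer:
    """Hypothetical stand-in for a DASH server; counts acquisition requests."""

    def __init__(self):
        self.requests = 0

    def get(self, names):
        # One call models one acquisition request, whatever it returns.
        self.requests += 1
        return {name: f"<{name}>" for name in names}


def start_playback(server):
    """Fetch the first MPD, the init file and the first slice in two requests."""
    files = {}
    files.update(server.get(["first.mpd"]))                # request 1: first MPD
    files.update(server.get(["init.mp4", "slice_1.m4s"]))  # request 2: init + first slice
    return files
```

The point of the sketch is that the initialization file and the first media slice share one request, so playback can begin after at most two round trips.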
Method and apparatus for generating media data
The present invention concerns a method for generating media files from video sequences, the method comprising, by a server: obtaining, from the video sequences, video data composed of a plurality of samples; generating video tracks based on the obtained video data, wherein each video track comprises samples of a video sequence and is associated with descriptive metadata, the descriptive metadata comprising: spatial information related to one or more samples of the associated video track; and composition information for organizing the generated video tracks to get a full picture when displayed by a client; and generating media files including the generated video tracks.
MP4 FILE VIRTUAL MSS FRAGMENTATION METHOD, MSS ADAPTER, MEDIA SERVER, AND STORAGE MEDIUM
An MSS chunking method for an MP4 file, an MSS adapter, and a non-transitory computer-readable storage medium are disclosed. The method may include: receiving an MSS download request for downloading at least one MSS chunk of a first MP4 file; determining, according to MSS chunk indexes and virtual chunk indexes of the first MP4 file, data segments in the first MP4 file corresponding to data of the at least one MSS chunk, each MSS chunk index containing a playing address of a respective MSS chunk of the first MP4 file, and each virtual chunk index containing a correspondence between the data of a respective MSS chunk and a respective one of the data segments of the first MP4 file; sending a data segment download request to a media server and receiving the data segments; and assembling contents of the data segments to generate the at least one MSS chunk.
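The lookup-and-assemble step can be sketched as below. The index layout (a chunk identifier mapped to a list of byte offset and length pairs), the function names, and the use of an in-memory `bytes` object in place of a media server are all assumptions for illustration.

```python
from typing import Dict, List, Tuple

Segment = Tuple[int, int]  # (byte offset, length) within the MP4 file


def fetch_segment(mp4: bytes, segment: Segment) -> bytes:
    """Stand-in for a data segment download request to the media server."""
    offset, length = segment
    return mp4[offset:offset + length]


def assemble_chunk(mp4: bytes, virtual_index: Dict[str, List[Segment]], chunk_id: str) -> bytes:
    """Concatenate the data segments that the virtual chunk index maps to a chunk."""
    return b"".join(fetch_segment(mp4, seg) for seg in virtual_index[chunk_id])
```

Given `mp4 = b"0123456789abcdef"` and an index entry `{"video/0": [(2, 3), (10, 2)]}`, `assemble_chunk(mp4, index, "video/0")` joins bytes 2 to 4 and bytes 10 to 11 into a single chunk without the MP4 file ever being re-fragmented on disk.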
Method and apparatus for signaling and storing grouping types in an image container file
A method, apparatus and computer program product are provided to store and signal pre-derivation properties in an image container file (24, 26). Relative to the construction of an image package comprising an image container file, the method, apparatus and computer program product assign a pre-derivation property identifier data structure identifying one or more pre-derivation properties of one or more pre-derived images (22). With respect to the processing of an image container file, the method, apparatus and computer program product permit an image container file and a pre-derivation property identifier data structure, identifying one or more pre-derivation properties of one or more pre-derived images in the image container file, to be processed to cause one or more pre-derived image items from the image container file to be rendered or edited and regenerated in accordance with the pre-derivation properties.