Patent classifications
H04N21/816
INFORMATION PROCESSING APPARATUS AND METHOD
The present disclosure relates to an information processing apparatus and method capable of suppressing an increase in load of reproduction processing.
To generate, by using tile identification information indicating a tile of a point cloud corresponding to a data unit of a bitstream of the point cloud expressing an object having a three-dimensional shape as a set of points, tile management information that is information for managing the tile corresponding to a subsample including a single or a plurality of consecutive data units of the bitstream stored as a sample in a file, and to generate the file that stores the bitstream and the tile management information. The present disclosure can be applied to, for example, an information processing apparatus, an information processing method, or the like.
Multi-viewpoint multi-user audio user experience
An apparatus including circuitry configured for receiving a spatial media content file including a plurality of viewpoints; circuitry configured for determining a first viewpoint from the plurality of viewpoints for a first user consuming the spatial media content file; circuitry configured for receiving an indication that affects an audio rendering of the first viewpoint for the first user, wherein the indication is associated with one or more actions of at least one second user consuming the spatial media content file; and circuitry configured for controlling the audio rendering of the first viewpoint for the first user in response to the receiving of the indication based on at least one of: a position and/or orientation of the first user, and the one or more actions of the second user.
SERVER APPARATUS, TERMINAL APPARATUS, INFORMATION PROCESSING SYSTEM, AND INFORMATION PROCESSING METHOD
There is provided a technology capable of reducing the processing load on a server apparatus side in cloud rendering. A server apparatus according to the present technology includes a controller. The controller groups terminal apparatuses each having a viewing position within an identical segment on the basis of viewing position information of each terminal apparatus within a viewing region including a plurality of segments, and transmits common video information to each of the grouped terminal apparatuses by multicasting.
Facilitating panoramic video streaming with brain-computer interactions
Aspects of the subject disclosure may include, for example, obtaining one or more signals, the one or more signals being based upon brain activity of a viewer while the viewer is viewing media content; predicting, based upon the one or more signals, a first predicted desired viewport of the viewer; obtaining head movement data associated with the media content; predicting, based upon the head movement data, a second predicted desired viewport of the viewer; comparing the first predicted desired viewport to the second predicted desired viewport, resulting in a comparison; and determining, based upon the comparison, to use the first predicted desired viewport to facilitate obtaining a first subsequent portion of the media content or to use the second predicted desired viewport to facilitate obtaining a second subsequent portion of the media content. Other embodiments are disclosed.
SYSTEM AND METHOD OF SERVER-SIDE DYNAMIC ADAPTATION FOR SPLIT RENDERING
The techniques described herein relate to methods, apparatus, and computer readable media configured to provide video data for immersive media implemented by a server in communication with a client device. A request to access a stream of media data associated with immersive content at a point in time the client is first accessing the stream of media data for the immersive content is received from the client device. In response to the request from the client, the server transmits a response indication whether it has rendered at least part of the stream of media data. The server may also determine, based on the request from the client, whether to render at least part of the stream of media data for delivery to the client device.
MULTIDIMENSIONAL METADATA FOR PARALLEL PROCESSING OF SEGMENTED MEDIA DATA
A method executed by at least one processor, the method comprising: segmenting a multidimensional media stream into a plurality of segments of multidimensional media in a multidimensional space; representing each segment of the plurality of segments of multidimensional media using a respective sequence vector, the respective sequence vector comprising one or more predefined multidimensional metadata, wherein the predefined multidimensional metadata includes one of a starting vector, a length vector, and a scaling vector, and a startcode; and deriving a network based media processing (NBMP) workflow based on the respective sequence vectors of each segment of the plurality of segments.
Image capturing device, image communication system, and method for display control
An image capturing device includes: an imaging device to capture a first image; and circuitry to receive a second image from another image capturing device, the second image being captured with the another image capturing device and having an angle of view wider than that of the first image, and control a display to sequentially display the first image, and an image of an area of the second image that is similar to the first image, the image similar to the first image having the degree of similarity to the first image that is equal to or greater than a threshold.
A METHOD AND APPARATUS FOR ENCODING AND DECODING VOLUMETRIC VIDEO
Methods, devices and stream are disclosed to encode and decode a volumetric content. At the encoding, the space of the volumetric content is divided in distinct sectors according to at least two different sectorizations. One atlas is generated for each sectorization or a single atlas is generated encoding all the sectorizations. At the decoding, a sectorization is selected according to the current direction and field of view, according to user's gaze navigation and according to prediction of the upcoming pose of the virtual camera controlled by the user. Sectors are selected according the selected sectorization and the current direction and field of view and only patches encoded in regions of the atlas associated with these sectors are accessed to generate the viewport image representative of the content seen from the current point of view.
REPRESENTING VOLUMETRIC VIDEO IN SALIENCY VIDEO STREAMS
Saliency regions are identified in a global scene depicted by volumetric video. Saliency video streams that track the saliency regions are generated. Each saliency video stream tracks a respective saliency region. A saliency stream based representation of the volumetric video is generated to include the saliency video streams. The saliency stream based representation of the volumetric video is transmitted to a video streaming client.
Low latency wireless virtual reality systems and methods
Virtual Reality (VR) processing devices and methods are provided for transmitting user feedback information comprising at least one of user position information and user orientation information, receiving encoded audio-video (A/V) data, which is generated based on the transmitted user feedback information, separating the A/V data into video data and audio data corresponding to a portion of a next frame of a sequence of frames of the video data to be displayed, decoding the portion of a next frame of the video data and the corresponding audio data, providing the audio data for aural presentation and controlling the portion of the next frame of the video data to be displayed in synchronization with the corresponding audio data.