Patent classifications
H04N13/178
IMMERSIVE MEDIA INTEROPERABILITY
Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus for media processing includes processing circuitry. The processing circuitry receives first 3 degrees of freedom (3 DoF) information associated with a first media content for a scene in a media application. The first 3 DoF information includes a first revolution orientation for describing the first media content on a first sphere centered at a user of the media application. The processing circuitry determines that a rendering platform for rendering the first media content is a six degrees of freedom (6 DoF) platform, and calculates, first spatial location information of the first media content based on the first revolution orientation and first parameters of the first sphere. The first spatial location information is used in first 6 DoF information associated with the first media content for rendering the first media content on the 6 DoF platform.
MULTI-TRACK BASED IMMERSIVE MEDIA PLAYOUT
Methods, apparatus, and systems for dynamically indicating a playout group in various scenarios are disclosed. In one example aspect, a method includes determining a playout group corresponding to a volumetric video in a three-dimensional space based on a playout group information, the playout group including a plurality of components, wherein each component of a certain type has a version that is different from another component of the certain type, obtaining a two-dimensional volumetric video content from a media track corresponding to the playout group, and reconstructing the volumetric video into a three-dimensional space based on the plurality of components with different versions.
MULTI-APERTURE RANGING DEVICES AND METHODS
Embodiments of systems and methods for multi-aperture ranging are disclosed. An embodiment of an image processing system includes at least one processor and memory configured to receive a multi-aperture image set that includes a high-resolution subaperture image and a low-resolution subaperture image, wherein the high-resolution subaperture image and the low-resolution subaperture image were captured simultaneously from a camera using dissimilar focal lengths, predict a high-resolution predicted disparity map from the high-resolution subaperture image using a neural network, predict a low-resolution predicted disparity map from the low-resolution subaperture image using the neural network, and generate an integrated range map from the high-resolution and low-resolution predicted disparity maps, wherein the integrated range map includes an array of range information that corresponds to the multi-aperture image set and that is generated by overlaying common points in both the high-resolution predicted disparity map and the low-resolution predicted disparity map.
MULTI-APERTURE RANGING DEVICES AND METHODS
Embodiments of systems and methods for multi-aperture ranging are disclosed. An embodiment of an image processing system includes at least one processor and memory configured to receive a multi-aperture image set that includes a high-resolution subaperture image and a low-resolution subaperture image, wherein the high-resolution subaperture image and the low-resolution subaperture image were captured simultaneously from a camera using dissimilar focal lengths, predict a high-resolution predicted disparity map from the high-resolution subaperture image using a neural network, predict a low-resolution predicted disparity map from the low-resolution subaperture image using the neural network, and generate an integrated range map from the high-resolution and low-resolution predicted disparity maps, wherein the integrated range map includes an array of range information that corresponds to the multi-aperture image set and that is generated by overlaying common points in both the high-resolution predicted disparity map and the low-resolution predicted disparity map.
Information processing apparatus, information processing method and storage medium
The technology disclosed herein is an information processing apparatus comprising: one or more memories storing instructions; and one or more processors executing the instructions to function as: an obtaining unit configured to obtain information for specifying a position of an object included in multi-viewpoint image data obtained by image capturing using a plurality of imaging apparatuses; and a generation unit configured to generate a virtual viewpoint path data to generate virtual viewpoint image data by inputting the information obtained by the obtaining unit to an output unit which is a learned model learned from the virtual viewpoint path data to be training data and at least information for specifying a position of an object to be input data corresponding to the virtual viewpoint path data and is configured to output virtual viewpoint data by receiving input of information for specifying a position of an object.
Information processing apparatus, information processing method and storage medium
The technology disclosed herein is an information processing apparatus comprising: one or more memories storing instructions; and one or more processors executing the instructions to function as: an obtaining unit configured to obtain information for specifying a position of an object included in multi-viewpoint image data obtained by image capturing using a plurality of imaging apparatuses; and a generation unit configured to generate a virtual viewpoint path data to generate virtual viewpoint image data by inputting the information obtained by the obtaining unit to an output unit which is a learned model learned from the virtual viewpoint path data to be training data and at least information for specifying a position of an object to be input data corresponding to the virtual viewpoint path data and is configured to output virtual viewpoint data by receiving input of information for specifying a position of an object.
SUGGESTED VIEWPORT INDICATION FOR PANORAMIC VIDEO
A suggested viewport indication is generated and sent for use in a panoramic video. In one example, a method includes receiving encoded video including viewport metadata, decoding the video, extracting the viewport metadata, generating a viewport of the decoded video based on the viewport metadata, and buffering the generated viewport for display.
SUGGESTED VIEWPORT INDICATION FOR PANORAMIC VIDEO
A suggested viewport indication is generated and sent for use in a panoramic video. In one example, a method includes receiving encoded video including viewport metadata, decoding the video, extracting the viewport metadata, generating a viewport of the decoded video based on the viewport metadata, and buffering the generated viewport for display.
Overlay processing method in 360 video system, and device thereof
A 360 image data processing method performed by a 360 video receiving device, according to the present invention, comprises the steps of: receiving 360 image data; acquiring information and metadata on an encoded picture from the 360 image data; decoding the picture on the basis of the information on the encoded picture; and rendering the decoded picture and an overlay on the basis of the metadata, wherein the metadata includes overlay-related metadata, the overlay is rendered on the basis of the overlay-related metadata, and the overlay-related metadata includes information on a region of the overlay.
Overlay processing method in 360 video system, and device thereof
A 360 image data processing method performed by a 360 video receiving device, according to the present invention, comprises the steps of: receiving 360 image data; acquiring information and metadata on an encoded picture from the 360 image data; decoding the picture on the basis of the information on the encoded picture; and rendering the decoded picture and an overlay on the basis of the metadata, wherein the metadata includes overlay-related metadata, the overlay is rendered on the basis of the overlay-related metadata, and the overlay-related metadata includes information on a region of the overlay.