H04N13/282

Methods and apparatus for using track derivations for network based media processing
11509878 · 2022-11-22 · ·

The techniques described herein relate to methods, apparatus, and computer readable media configured to perform media processing. A media processing entity includes at least one processor in communication with a memory, wherein the memory stores computer-readable instructions that, when executed by the at least one processor, cause the at least one processor to perform receiving, from a remote computing device, multi-view multimedia data comprising a hierarchical track structure comprising at least a first track comprising first media data at a first level of the hierarchical track structure, and metadata associated with a second track at a second level in the hierarchical track structure that is different than the first level of the first track. The instructions further cause the processor to perform processing the first media data of the first track based on the metadata associated with the second track to generate second media data for the second track.

Methods and apparatus for using track derivations for network based media processing
11509878 · 2022-11-22 · ·

The techniques described herein relate to methods, apparatus, and computer readable media configured to perform media processing. A media processing entity includes at least one processor in communication with a memory, wherein the memory stores computer-readable instructions that, when executed by the at least one processor, cause the at least one processor to perform receiving, from a remote computing device, multi-view multimedia data comprising a hierarchical track structure comprising at least a first track comprising first media data at a first level of the hierarchical track structure, and metadata associated with a second track at a second level in the hierarchical track structure that is different than the first level of the first track. The instructions further cause the processor to perform processing the first media data of the first track based on the metadata associated with the second track to generate second media data for the second track.

MULTI-DIMENSIONAL DATA CAPTURE OF AN ENVIRONMENT USING PLURAL DEVICES
20230058599 · 2023-02-23 ·

Embodiments of the invention describe apparatuses, systems, and methods related to data capture of objects and/or an environment. In one embodiment, a user can capture time-indexed three-dimensional (3D) depth data using one or more portable data capture devices that can capture time indexed color images of a scene with depth information and location and orientation data. In addition, the data capture devices may be configured to captured a spherical view of the environment around the data capture device.

A METHOD AND APPARATUS FOR ENCODING, TRANSMITTING AND DECODING VOLUMETRIC VIDEO
20220368879 · 2022-11-17 ·

Methods, devices and stream for encoding, decoding and transmitting a multi-views frame are disclosed. A non-pruned MVD frame is obtained and an acyclic graph representing pruning precedence relations between views is determined. The MVD is pruned by using these precedence relations. The pruned MVD and data representative of the graph are encoded in the data stream. At the decoding, the contribution of each view for a pixel of a viewport frame to generate is determined as a function of the decoded pruning graph.

A METHOD AND APPARATUS FOR ENCODING, TRANSMITTING AND DECODING VOLUMETRIC VIDEO
20220368879 · 2022-11-17 ·

Methods, devices and stream for encoding, decoding and transmitting a multi-views frame are disclosed. A non-pruned MVD frame is obtained and an acyclic graph representing pruning precedence relations between views is determined. The MVD is pruned by using these precedence relations. The pruned MVD and data representative of the graph are encoded in the data stream. At the decoding, the contribution of each view for a pixel of a viewport frame to generate is determined as a function of the decoded pruning graph.

METHOD AND SYSTEM FOR DRAPING A 3D GARMENT ON A 3D HUMAN BODY

This disclosure relates generally to method and system for draping a 3D garment on a 3D human body. Dressing digital humans in 3D have gained much attention due to its use in online shopping and draping 3D garments over the 3D human body has immense applications in virtual try-on, animations, and accurate fitment of the 3D garment is the utmost importance. The proposed disclosure is a single unified garment deformation model that learns the shared space of variations for a body shape, a body pose, and a styling garment. The method receives a plurality of human body inputs to construct a 3D skinned garments for the subject. The deep draper network trained using a plurality of losses provides efficient deep neural network based method that predicts fast and accurate 3D garment images. The method couples the geometric and multi-view perceptual constraints that efficiently learn the garment deformation's high-frequency geometry.

METHOD AND SYSTEM FOR DRAPING A 3D GARMENT ON A 3D HUMAN BODY

This disclosure relates generally to method and system for draping a 3D garment on a 3D human body. Dressing digital humans in 3D have gained much attention due to its use in online shopping and draping 3D garments over the 3D human body has immense applications in virtual try-on, animations, and accurate fitment of the 3D garment is the utmost importance. The proposed disclosure is a single unified garment deformation model that learns the shared space of variations for a body shape, a body pose, and a styling garment. The method receives a plurality of human body inputs to construct a 3D skinned garments for the subject. The deep draper network trained using a plurality of losses provides efficient deep neural network based method that predicts fast and accurate 3D garment images. The method couples the geometric and multi-view perceptual constraints that efficiently learn the garment deformation's high-frequency geometry.

Methods, systems, and media for generating and rendering immersive video content
11589027 · 2023-02-21 · ·

Methods, systems, and media for generating and rendering immersive video content are provided. In some embodiments, the method comprises: receiving information indicating positions of cameras in a plurality of cameras; generating a mesh on which video content is to be projected based on the positions of the cameras in the plurality of cameras, wherein the mesh is comprised of a portion of a faceted cylinder, and wherein the faceted cylinder has a plurality of facets each corresponding to a projection from a camera in the plurality of cameras; receiving video content corresponding to the plurality of cameras; and transmitting the video content and the generated mesh to a user device in response to receiving a request for the video content from the user device.

Methods, systems, and media for generating and rendering immersive video content
11589027 · 2023-02-21 · ·

Methods, systems, and media for generating and rendering immersive video content are provided. In some embodiments, the method comprises: receiving information indicating positions of cameras in a plurality of cameras; generating a mesh on which video content is to be projected based on the positions of the cameras in the plurality of cameras, wherein the mesh is comprised of a portion of a faceted cylinder, and wherein the faceted cylinder has a plurality of facets each corresponding to a projection from a camera in the plurality of cameras; receiving video content corresponding to the plurality of cameras; and transmitting the video content and the generated mesh to a user device in response to receiving a request for the video content from the user device.

IMAGE ACQUISITION

An image acquisition method and apparatus are provided. By controlling a motion device, at least one of an image acquisition device or a photographed target object moves under the driving of the motion device, so that a sample image including the target object can be acquired in a preset designated acquisition pose after movement, thereby improving the accuracy of a relative pose between the image acquisition device and the target object during acquisition, reducing human intervention during acquisition, improving the automation degree, and providing the possibility for subsequent services that need to be performed according to sample images captured with relatively high shooting pose accuracy.