Patent classifications
H04N13/275
Information processing apparatus, information processing method, and program for point cloud sample processing
The present disclosure relates to an information processing apparatus and an information processing method that enable processing to be performed simply, and a program. By converting a point cloud representing a three-dimensional structure into two dimensions, a geometry image and a texture image, and three-dimensional information metadata required for constructing the geometry image and the texture image in three dimensions are obtained. Then, one PC sample included in a Point Cloud displayed at a specific time is generated by storing the geometry image, the texture image, and the three-dimensional information metadata in accordance with a playback order required at a time of reproducing and playing back the geometry image and the texture image in three dimensions on the basis of the three-dimensional information metadata. It is possible to apply to a data generation device that generates data for distribution of a Point Cloud.
Information processing apparatus, information processing method, and program for point cloud sample processing
The present disclosure relates to an information processing apparatus and an information processing method that enable processing to be performed simply, and a program. By converting a point cloud representing a three-dimensional structure into two dimensions, a geometry image and a texture image, and three-dimensional information metadata required for constructing the geometry image and the texture image in three dimensions are obtained. Then, one PC sample included in a Point Cloud displayed at a specific time is generated by storing the geometry image, the texture image, and the three-dimensional information metadata in accordance with a playback order required at a time of reproducing and playing back the geometry image and the texture image in three dimensions on the basis of the three-dimensional information metadata. It is possible to apply to a data generation device that generates data for distribution of a Point Cloud.
METHOD FOR DISPLAYING OBJECTS, ELECTRONIC DEVICE, AND STORAGE MEDIUM
A method for displaying objects is provided. The method includes: acquiring a scene image and a plurality of object images of a target object; determining, based on a target viewing angle of the scene image, X reference viewing angles from the viewing angles corresponding to the plurality of object images, wherein each of the X reference viewing angles is adjacent to the target viewing angle in terms of spatial position; generating intermediate images corresponding to the X reference viewing angles by transforming, based on position difference information between the X reference viewing angles and the target viewing angle, positions of pixel points in object images corresponding to the X reference viewing angles, wherein each of the intermediate images indicates a display effect of the target object at the target viewing angle; and displaying the intermediate images superimposed on the scene image.
METHOD FOR DISPLAYING OBJECTS, ELECTRONIC DEVICE, AND STORAGE MEDIUM
A method for displaying objects is provided. The method includes: acquiring a scene image and a plurality of object images of a target object; determining, based on a target viewing angle of the scene image, X reference viewing angles from the viewing angles corresponding to the plurality of object images, wherein each of the X reference viewing angles is adjacent to the target viewing angle in terms of spatial position; generating intermediate images corresponding to the X reference viewing angles by transforming, based on position difference information between the X reference viewing angles and the target viewing angle, positions of pixel points in object images corresponding to the X reference viewing angles, wherein each of the intermediate images indicates a display effect of the target object at the target viewing angle; and displaying the intermediate images superimposed on the scene image.
Generating composite stereoscopic images
A system, method or compute program product for generating composite images. One of the systems includes a capture device to capture an image of a physical environment; and one or more storage devices storing instructions that are operable, when executed by one or more processors of the system, to cause the one or more processors to: obtain an image of the physical environment as captured by the capture device, identify a visually-demarked region on a surface in the physical environment as depicted in the image, process the image to generate a composite image of the physical environment that includes a depiction of a virtual object, wherein a location of the depiction of the virtual object in the composite image is based on a location of the depiction of the visually-demarked region in the image, and cause the composite image to be displayed for a user.
METHOD AND SYSTEM FOR DRAPING A 3D GARMENT ON A 3D HUMAN BODY
This disclosure relates generally to method and system for draping a 3D garment on a 3D human body. Dressing digital humans in 3D have gained much attention due to its use in online shopping and draping 3D garments over the 3D human body has immense applications in virtual try-on, animations, and accurate fitment of the 3D garment is the utmost importance. The proposed disclosure is a single unified garment deformation model that learns the shared space of variations for a body shape, a body pose, and a styling garment. The method receives a plurality of human body inputs to construct a 3D skinned garments for the subject. The deep draper network trained using a plurality of losses provides efficient deep neural network based method that predicts fast and accurate 3D garment images. The method couples the geometric and multi-view perceptual constraints that efficiently learn the garment deformation's high-frequency geometry.
METHOD AND SYSTEM FOR DRAPING A 3D GARMENT ON A 3D HUMAN BODY
This disclosure relates generally to method and system for draping a 3D garment on a 3D human body. Dressing digital humans in 3D have gained much attention due to its use in online shopping and draping 3D garments over the 3D human body has immense applications in virtual try-on, animations, and accurate fitment of the 3D garment is the utmost importance. The proposed disclosure is a single unified garment deformation model that learns the shared space of variations for a body shape, a body pose, and a styling garment. The method receives a plurality of human body inputs to construct a 3D skinned garments for the subject. The deep draper network trained using a plurality of losses provides efficient deep neural network based method that predicts fast and accurate 3D garment images. The method couples the geometric and multi-view perceptual constraints that efficiently learn the garment deformation's high-frequency geometry.
Methods, systems, and media for generating and rendering immersive video content
Methods, systems, and media for generating and rendering immersive video content are provided. In some embodiments, the method comprises: receiving information indicating positions of cameras in a plurality of cameras; generating a mesh on which video content is to be projected based on the positions of the cameras in the plurality of cameras, wherein the mesh is comprised of a portion of a faceted cylinder, and wherein the faceted cylinder has a plurality of facets each corresponding to a projection from a camera in the plurality of cameras; receiving video content corresponding to the plurality of cameras; and transmitting the video content and the generated mesh to a user device in response to receiving a request for the video content from the user device.
Methods, systems, and media for generating and rendering immersive video content
Methods, systems, and media for generating and rendering immersive video content are provided. In some embodiments, the method comprises: receiving information indicating positions of cameras in a plurality of cameras; generating a mesh on which video content is to be projected based on the positions of the cameras in the plurality of cameras, wherein the mesh is comprised of a portion of a faceted cylinder, and wherein the faceted cylinder has a plurality of facets each corresponding to a projection from a camera in the plurality of cameras; receiving video content corresponding to the plurality of cameras; and transmitting the video content and the generated mesh to a user device in response to receiving a request for the video content from the user device.
Methods and apparatus for using track derivations to generate new tracks for network based media processing applications
The techniques described herein relate to methods, apparatus, and computer readable media configured to perform media processing tasks. A media processing entity includes a processor in communication with a memory, wherein the memory stores computer-readable instructions that, when executed by the processor, cause the processor to perform receiving, from a remote computing device, multi-view multimedia data comprising a hierarchical track structure comprising at least a first track comprising first media data at a first level of the hierarchical track structure, and a second track comprising task instruction data at a second level in the hierarchical track structure that is different than the first level of the first track. The instructions further cause the processor to perform processing the first media data of the first track based on the task instruction data of the second track to generate modified media data and an output track that includes the modified media data.