Patent classifications
H04N13/282
IMAGE ACQUISITION
An image acquisition method and apparatus are provided. By controlling a motion device, at least one of an image acquisition device or a photographed target object moves under the driving of the motion device, so that a sample image including the target object can be acquired in a preset designated acquisition pose after movement, thereby improving the accuracy of a relative pose between the image acquisition device and the target object during acquisition, reducing human intervention during acquisition, improving the automation degree, and providing the possibility for subsequent services that need to be performed according to sample images captured with relatively high shooting pose accuracy.
Image processing apparatus, image processing method, and storage medium
An image processing apparatus acquires first shape information representing a three-dimensional shape about an object located within an image capturing region based on one or more images obtained by one or more imaging apparatuses for performing image capturing of the image capturing region from a plurality of directions, acquires second shape information representing a three-dimensional shape about an object located within the image capturing region based on one or more images obtained by one or more imaging apparatuses, acquires viewpoint information indicating a position and direction of a viewpoint, and generates a virtual viewpoint image corresponding to the position and direction of the viewpoint indicated by the acquired viewpoint information based on the acquired first shape information and the acquired second shape information, such that at least a part of the object corresponding to the second shape information is displayed in a translucent way within the virtual viewpoint image.
Image processing apparatus, image processing method, and storage medium
An image processing apparatus acquires first shape information representing a three-dimensional shape about an object located within an image capturing region based on one or more images obtained by one or more imaging apparatuses for performing image capturing of the image capturing region from a plurality of directions, acquires second shape information representing a three-dimensional shape about an object located within the image capturing region based on one or more images obtained by one or more imaging apparatuses, acquires viewpoint information indicating a position and direction of a viewpoint, and generates a virtual viewpoint image corresponding to the position and direction of the viewpoint indicated by the acquired viewpoint information based on the acquired first shape information and the acquired second shape information, such that at least a part of the object corresponding to the second shape information is displayed in a translucent way within the virtual viewpoint image.
Methods and apparatus for using track derivations to generate new tracks for network based media processing applications
The techniques described herein relate to methods, apparatus, and computer readable media configured to perform media processing tasks. A media processing entity includes a processor in communication with a memory, wherein the memory stores computer-readable instructions that, when executed by the processor, cause the processor to perform receiving, from a remote computing device, multi-view multimedia data comprising a hierarchical track structure comprising at least a first track comprising first media data at a first level of the hierarchical track structure, and a second track comprising task instruction data at a second level in the hierarchical track structure that is different than the first level of the first track. The instructions further cause the processor to perform processing the first media data of the first track based on the task instruction data of the second track to generate modified media data and an output track that includes the modified media data.
Methods and apparatus for using track derivations to generate new tracks for network based media processing applications
The techniques described herein relate to methods, apparatus, and computer readable media configured to perform media processing tasks. A media processing entity includes a processor in communication with a memory, wherein the memory stores computer-readable instructions that, when executed by the processor, cause the processor to perform receiving, from a remote computing device, multi-view multimedia data comprising a hierarchical track structure comprising at least a first track comprising first media data at a first level of the hierarchical track structure, and a second track comprising task instruction data at a second level in the hierarchical track structure that is different than the first level of the first track. The instructions further cause the processor to perform processing the first media data of the first track based on the task instruction data of the second track to generate modified media data and an output track that includes the modified media data.
Information processing apparatus, information processing method and storage medium
The technology disclosed herein is an information processing apparatus comprising: one or more memories storing instructions; and one or more processors executing the instructions to function as: an obtaining unit configured to obtain information for specifying a position of an object included in multi-viewpoint image data obtained by image capturing using a plurality of imaging apparatuses; and a generation unit configured to generate a virtual viewpoint path data to generate virtual viewpoint image data by inputting the information obtained by the obtaining unit to an output unit which is a learned model learned from the virtual viewpoint path data to be training data and at least information for specifying a position of an object to be input data corresponding to the virtual viewpoint path data and is configured to output virtual viewpoint data by receiving input of information for specifying a position of an object.
Information processing apparatus, information processing method and storage medium
The technology disclosed herein is an information processing apparatus comprising: one or more memories storing instructions; and one or more processors executing the instructions to function as: an obtaining unit configured to obtain information for specifying a position of an object included in multi-viewpoint image data obtained by image capturing using a plurality of imaging apparatuses; and a generation unit configured to generate a virtual viewpoint path data to generate virtual viewpoint image data by inputting the information obtained by the obtaining unit to an output unit which is a learned model learned from the virtual viewpoint path data to be training data and at least information for specifying a position of an object to be input data corresponding to the virtual viewpoint path data and is configured to output virtual viewpoint data by receiving input of information for specifying a position of an object.
IMAGE PROCESSING APPARATUS AND METHOD, AND IMAGE CAPTURING APPARATUS AND CONTROL METHOD THEREOF, AND STORAGE MEDIUM
An image processing apparatus comprises: an acquisition unit that acquires a plurality of different viewpoint images obtained by shooting a same scene from different viewpoints, and acquires at least one parallax image pair having parallax by pupil division; a first generator that generates a first distance image from the parallax image pair; a second generator that generates a second distance image from the plurality of different viewpoint images; and an integrator that integrates the first distance image and the second distance image and generates an integrated distance image.
IMAGE PROCESSING APPARATUS AND METHOD, AND IMAGE CAPTURING APPARATUS AND CONTROL METHOD THEREOF, AND STORAGE MEDIUM
An image processing apparatus comprises: an acquisition unit that acquires a plurality of different viewpoint images obtained by shooting a same scene from different viewpoints, and acquires at least one parallax image pair having parallax by pupil division; a first generator that generates a first distance image from the parallax image pair; a second generator that generates a second distance image from the plurality of different viewpoint images; and an integrator that integrates the first distance image and the second distance image and generates an integrated distance image.
Stereo viewing
The invention relates to creating and viewing stereo images, for example stereo video images, also called 3D video. At least three camera sources with overlapping fields of view are used to capture a scene so that an area of the scene is covered by at least three cameras. At the viewer, a camera pair is chosen from the multiple cameras to create a stereo camera pair that best matches the location of the eyes of the user if they were located at the place of the camera sources. That is, a camera pair is chosen so that the disparity created by the camera sources resembles the disparity that the user's eyes would have at that location. If the user tilts his head, or the view orientation is otherwise altered, a new pair can be formed, for example by switching the other camera. The viewer device then forms the images of the video frames for the left and right eyes by picking the best sources for each area of each image for realistic stereo disparity.