Patent classifications
H04N13/282
System and method for calibrating a plurality of 3D sensors with respect to a motion conveyance
This invention provides an easy-to-manufacture, easy-to-analyze calibration object which combines measurable and repeatable, but not necessarily accurate, 3D features (such as a two-sided calibration object/target in, e.g., the form of a frustum) with a pair of accurate and measurable features, more particularly parallel faces separated by a precise specified thickness, so as to provide for simple field calibration of opposite-facing DS sensors. Illustratively, a composite calibration object can be constructed, which includes the two-sided frustum that has been sandblasted and anodized (to provide measurable, repeatable features), with a flange whose upper and lower parallel surfaces have been ground to a precise specified thickness. The 3D corner positions of the two-sided frustum are used to calibrate the two sensors in X and Y, but cannot establish absolute Z without accurate information about the thickness of the two-sided frustum; the flange provides the absolute Z information.
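To illustrate how a known flange thickness can fix absolute Z between two opposite-facing sensors, here is a minimal sketch. It is not the patented method: the function names are hypothetical, and it assumes the common Z axis is that of the top sensor, that the bottom sensor's local Z axis is exactly anti-parallel to it, and that both sensors measure the flange faces without tilt.

```python
def bottom_sensor_z_offset(top_face_z, bottom_face_z_local, flange_thickness):
    """Return the offset that maps the bottom sensor's local Z into the common frame.

    Assumed model (illustrative only): z_common = offset - z_local for the
    bottom sensor, with the common frame taken from the top sensor.
    """
    # The lower flange face lies exactly one thickness below the upper face.
    bottom_face_z_common = top_face_z - flange_thickness
    # Solve bottom_face_z_common = offset - bottom_face_z_local for offset.
    return bottom_face_z_common + bottom_face_z_local


def bottom_to_common_z(z_local, offset):
    """Convert a bottom-sensor Z reading into the common frame."""
    return offset - z_local


# Example: top face measured at 50.0, bottom face at 30.0 (local), thickness 10.0.
offset = bottom_sensor_z_offset(50.0, 30.0, 10.0)
lower_face_in_common = bottom_to_common_z(30.0, offset)
```

Under these assumptions the lower flange face maps to the upper face height minus the thickness, which is the absolute Z constraint that the frustum corners alone cannot supply.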
Method and apparatus for overlay processing in 360 video system
Provided is a 360-degree image data processing method performed by a 360-degree video reception apparatus. The method includes receiving 360-degree image data, obtaining information on an encoded picture and metadata from the 360-degree image data, decoding a picture based on the information on the encoded picture, and rendering the decoded picture and an overlay based on the metadata, in which the metadata includes overlay related metadata, the overlay is rendered based on the overlay related metadata, the overlay related metadata includes information on an alpha plane of the overlay, and the information on the alpha plane of the overlay is included in an image item or a video track.
Image processing apparatus that performs processing concerning display of stereoscopic image, image processing method, and storage medium
An image processing apparatus performs processing concerning the display of a stereoscopic image so as to improve the convenience of a user in viewing a stereoscopic image. A head mount display displays a stereoscopic image using image data including a plurality of images having different viewpoints. The image data is processed based on metadata attached to the image data. In a case where information indicating that the image data is associated with a file format which cannot cause the head mount display to perform display is included in the metadata, the image data is converted into a file format which can cause the head mount display to perform display.
Method for acquiring distance from moving body to at least one object located in any direction of moving body by utilizing camera-view depth map and image processing device using the same
A method for acquiring a distance from a moving body to an object located in any direction of the moving body includes steps of: an image processing device (a) instructing a sweep network to project pixels of images, generated by cameras covering all directions of the moving body, onto main virtual geometries and apply 3D concatenation operation thereon to generate an initial 4D cost volume, (b) generating a final main 3D cost volume therefrom through a cost volume computation network, and (c) generating sub inverse distance indices corresponding to inverse values of sub separation distances between a sub reference point and sub virtual geometries, and main inverse distance indices corresponding to inverse values of main separation distances between a main reference point and the main virtual geometries, by using a sub cost volume and the final main 3D cost volume, to thereby acquire the distance to the object.
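As a rough illustration of how an inverse distance index can be turned back into a metric distance, the sketch below assumes the virtual geometries are spaced uniformly in inverse distance between a nearest and a farthest separation distance. That spacing, the function name, and all parameters are assumptions made for illustration; the abstract does not specify them.

```python
def inverse_distance_index_to_distance(index, num_geometries, min_distance, max_distance):
    """Map a (possibly fractional) inverse distance index to a distance.

    Assumed convention (illustrative only): index 0 is the farthest virtual
    geometry and index num_geometries - 1 is the nearest, with geometries
    spaced uniformly in inverse distance.
    """
    if num_geometries < 2:
        raise ValueError("need at least two virtual geometries")
    if not 0 <= index <= num_geometries - 1:
        raise ValueError("index out of range")
    if not 0 < min_distance < max_distance:
        raise ValueError("require 0 < min_distance < max_distance")

    inv_near = 1.0 / min_distance
    inv_far = 1.0 / max_distance
    # Linear interpolation in inverse-distance space.
    t = index / (num_geometries - 1)
    inverse_distance = inv_far + t * (inv_near - inv_far)
    return 1.0 / inverse_distance


# Example: five geometries between 1.0 and 10.0 distance units.
farthest = inverse_distance_index_to_distance(0, 5, 1.0, 10.0)
nearest = inverse_distance_index_to_distance(4, 5, 1.0, 10.0)
```

Sampling uniformly in inverse distance concentrates the virtual geometries near the reference point, where a fixed distance error matters most. A fractional index, such as one produced by a soft argmin over a cost volume, is handled by the same interpolation.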
Automated Spatial Indexing of Images to Video
A spatial indexing system receives a video that is a sequence of frames depicting an environment, such as a floor of a construction site, and performs a spatial indexing process to automatically identify the spatial location at which each of the images was captured. The spatial indexing system also generates an immersive model of the environment and provides a visualization interface that allows a user to view each of the images at its corresponding location within the model.
INFORMATION PROCESSING APPARATUS, DISPLAY RANGE DECISION METHOD, AND PROGRAM
An information processing apparatus according to the present technology includes a display range decision unit configured, in switching a display image as to multiple-viewpoint images capable of displaying a display target from multiple viewpoints, from a switching source viewpoint image corresponding to a switching source viewpoint to a switching destination viewpoint image corresponding to a switching destination viewpoint, to decide a display range of the switching destination viewpoint image on the basis of viewpoint position information of the switching destination viewpoint, specific target information which is information regarding a specific target in the display target, and line-of-sight direction information of an estimated orientation of a line-of-sight of a user to the display image.
Efficient Delivery of Multi-Camera Interactive Content
Techniques are disclosed relating to encoding recorded content for distribution to other computing devices. In various embodiments, a first computing device records content of a physical environment in which the first computing device is located, the content being deliverable to a second computing device configured to present a corresponding environment based on the recorded content and content recorded by one or more additional computing devices. The first computing device determines a pose of the first computing device within the physical environment and encodes the pose in a manifest usable to stream the content recorded by the first computing device to the second computing device. The encoded pose is usable by the second computing device to determine whether to stream the content recorded by the first computing device.
MULTIVIEW DISPLAY SYSTEM AND METHOD WITH ADAPTIVE BACKGROUND
An adaptive background multiview image display system and method provides improved multiview image quality. Systems and methods may involve generating crosstalk data that reduces crosstalk between a first view of a subject image and a second view of the subject image. The subject image may be a multiview image to be overlaid on a background image. A crosstalk violation may be detected in the subject image based on the crosstalk data. At least one of a color value or a brightness value of the background image is determined according to a degree of the crosstalk violation to generate the background image. The subject image may then be overlaid on the generated background image.