H04N2013/0085

Glasses-Free Determination of Absolute Motion

During operation, an electronic device may capture images using multiple image sensors having different fields of view and positions. Then, the electronic device may determine absolute motion of at least a portion of an individual along a direction between at least the portion of the individual and the electronic device, based at least in part on an apparent size of an anatomical feature in the images (such as an interpupillary distance) and a predefined or predetermined size of the anatomical feature. Moreover, the electronic device may compute absolute motion of at least the portion of the individual in a plane that is perpendicular to the direction, based at least in part on an estimated distance along the direction (corresponding to the apparent size and the predefined or predetermined size) and angular information associated with one or more objects in the images relative to the positions.
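
The range-from-size idea in this abstract can be illustrated with a pinhole-camera sketch. This is only an assumed model, not the claimed method: the function names, the 63 mm default interpupillary distance, and the focal length expressed in pixels are all hypothetical choices made for illustration.

```python
import math


def estimate_distance_mm(ipd_pixels, focal_length_pixels, ipd_mm=63.0):
    """Pinhole range estimate Z = f * S / s (assumed model).

    ipd_mm is the predefined feature size; 63 mm is an illustrative default.
    """
    if ipd_pixels <= 0:
        raise ValueError("apparent interpupillary distance must be positive")
    return focal_length_pixels * ipd_mm / ipd_pixels


def lateral_offset_mm(distance_mm, angle_rad):
    """Offset in the plane perpendicular to the viewing direction."""
    return distance_mm * math.tan(angle_rad)


def absolute_motion_mm(sample_a, sample_b, focal_length_pixels, ipd_mm=63.0):
    """Return (dx, dy, dz) between two samples.

    Each sample is (ipd_pixels, angle_x_rad, angle_y_rad).
    """
    positions = []
    for ipd_pixels, angle_x, angle_y in (sample_a, sample_b):
        z = estimate_distance_mm(ipd_pixels, focal_length_pixels, ipd_mm)
        positions.append(
            (lateral_offset_mm(z, angle_x), lateral_offset_mm(z, angle_y), z)
        )
    (xa, ya, za), (xb, yb, zb) = positions
    return (xb - xa, yb - ya, zb - za)
```

Under these assumptions, a feature whose apparent size doubles between two frames has halved its distance to the device, and the lateral terms scale with the estimated distance.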

Methods for automatic registration of 3D image data

A method for automatic registration of 3D image data, captured by a 3D image capture system having an RGB camera and a depth camera, includes capturing 2D image data with the RGB camera at a first pose; capturing depth data with the depth camera at the first pose; performing an initial registration of the RGB camera to the depth camera; capturing 2D image data with the RGB camera at a second pose; capturing depth data at the second pose; and calculating an updated registration of the RGB camera to the depth camera.

ELECTRONIC DEVICE AND OPERATION METHOD THEREFOR

Provided is an electronic device for stabilizing a 360-degree video, the electronic device including a memory storing one or more instructions, and a processor for executing the one or more instructions stored in the memory, wherein the processor is configured to execute the one or more instructions to: when a 360-degree video is reproduced, allocate probability values to a plurality of pixels included in a frame of the 360-degree video, based on a possibility that each of the plurality of pixels is included in a user's field of view (FOV), determine a three-dimensional (3D) rotation for the 360-degree video, based on the allocated probability values, and generate a stabilized 360-degree video by applying the 3D rotation to the 360-degree video.
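
As a rough illustration of weighting a rotation estimate by field-of-view probability, the sketch below reduces the problem to a single axis (yaw). This is a deliberate simplification rather than the full 3D rotation described in the abstract, and the function names and the per-pixel displacement inputs are hypothetical.

```python
import math


def weighted_yaw(displacements_rad, probabilities):
    """Probability-weighted circular mean of per-pixel yaw displacements."""
    s = sum(p * math.sin(d) for d, p in zip(displacements_rad, probabilities))
    c = sum(p * math.cos(d) for d, p in zip(displacements_rad, probabilities))
    if s == 0 and c == 0:
        return 0.0  # no usable weight, so apply no correction
    return math.atan2(s, c)


def stabilize_yaw(pixel_yaws_rad, correction_rad):
    """Counter-rotate pixel yaw angles, wrapping results into [-pi, pi)."""
    two_pi = 2.0 * math.pi
    return [
        ((yaw - correction_rad + math.pi) % two_pi) - math.pi
        for yaw in pixel_yaws_rad
    ]
```

Pixels that are unlikely to be viewed contribute little to the estimate, so the correction tracks the motion of the region the user is most likely looking at.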

Multi-overlay variable support and order kernel-based representation for image deformation and view synthesis
10742954 · 2020-08-11

A technique for displaying 3D videos creates a representation of image deformation, such as depth maps, in terms of a function for overlaying kernels with variable support and order. By optimizing the kernel type, order, and support, which are allowed to vary across different regions of the deformation, the technique approximates the image deformation in terms of the kernel parameters. Since the number of kernel parameters is usually much smaller than the number of pixels, this allows a significant reduction in the storage size of the image deformation.
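
To make the storage argument concrete, here is a minimal sketch that rebuilds one row of a depth map from a short list of kernels. The parametrization (center, support, order, amplitude) and the kernel shapes are assumptions chosen for illustration; the abstract does not specify them.

```python
def kernel_value(x, center, support, order):
    """Evaluate a unit kernel: order 0 is a box, order n >= 1 is (1 - u)**n."""
    u = abs(x - center) / support
    if u >= 1.0:
        return 0.0  # outside the kernel's support
    return 1.0 if order == 0 else (1.0 - u) ** order


def reconstruct_row(width, kernels):
    """Overlay kernels given as (center, support, order, amplitude) tuples."""
    return [
        sum(amp * kernel_value(x, c, s, o) for c, s, o, amp in kernels)
        for x in range(width)
    ]


def parameter_count(kernels):
    """Number of stored values, four per kernel in this sketch."""
    return 4 * len(kernels)
```

In this sketch a row of `width` depth values is stored as four numbers per kernel, so the saving grows with the size of the regions each kernel covers.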

Artificially rendering images using interpolation of tracked control points

Various embodiments of the present invention relate generally to systems and processes for artificially rendering images using interpolation of tracked control points. According to particular embodiments, a set of control points is tracked between a first frame and a second frame, where the first frame includes a first image captured from a first location and the second frame includes a second image captured from a second location. An artificially rendered image corresponding to a third location is then generated by interpolating individual control points for the third location using the set of control points and interpolating pixel locations using the individual control points. The individual control points are used to transform image data.
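
The two interpolation steps can be sketched as follows. Linear interpolation of the control points and inverse-distance weighting of pixel displacements are assumed choices, and both function names are hypothetical; the abstract does not say which interpolation scheme is used.

```python
def interpolate_control_points(points_a, points_b, t):
    """Linearly interpolate matched control points; t=0 is frame A, t=1 is frame B."""
    return [
        (xa + t * (xb - xa), ya + t * (yb - ya))
        for (xa, ya), (xb, yb) in zip(points_a, points_b)
    ]


def interpolate_pixel(pixel, points_a, points_t, eps=1e-9):
    """Move a frame-A pixel by inverse-distance-weighted control point displacements."""
    px, py = pixel
    weight_sum = dx = dy = 0.0
    for (xa, ya), (xt, yt) in zip(points_a, points_t):
        d2 = (px - xa) ** 2 + (py - ya) ** 2
        if d2 < eps:
            return (xt, yt)  # pixel coincides with a control point
        w = 1.0 / d2
        weight_sum += w
        dx += w * (xt - xa)
        dy += w * (yt - ya)
    if weight_sum == 0.0:
        return (px, py)  # no control points supplied
    return (px + dx / weight_sum, py + dy / weight_sum)
```

Here `t` stands in for the position of the third location between the first and second capture locations.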

Information processing device and information processing method

The present disclosure relates to an information processing device and an information processing method that are capable of estimating the self-position by accurately and continuously estimating the self-movement. The information processing device according to an aspect of the present disclosure includes a downward imaging section and a movement estimation section. The downward imaging section is disposed on the bottom of a moving object traveling on a road surface and captures an image of the road surface. The movement estimation section estimates the movement of the moving object in accordance with a plurality of images representing the road surface and captured at different time points by the downward imaging section. The present disclosure can be applied, for example, to a position sensor mounted in an automobile.
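
One simple way to estimate movement from two road-surface images is an exhaustive search for the pixel shift that best aligns them. The sketch below uses mean absolute difference over the overlapping region; this matching criterion, the function name, and the small search window are assumptions, not details taken from the disclosure.

```python
def estimate_shift(prev, curr, max_shift=2):
    """Return (dx, dy) such that curr[y + dy][x + dx] best matches prev[y][x].

    prev and curr are equally sized 2D lists of grayscale values.
    """
    h, w = len(prev), len(prev[0])
    best = None
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            total = 0
            count = 0
            # Visit only pixels whose shifted position stays inside the image.
            for y in range(max(0, -dy), min(h, h - dy)):
                for x in range(max(0, -dx), min(w, w - dx)):
                    total += abs(prev[y][x] - curr[y + dy][x + dx])
                    count += 1
            if count == 0:
                continue
            score = total / count
            if best is None or score < best[0]:
                best = (score, dx, dy)
    return best[1], best[2]
```

The returned shift describes how the road texture moved in the image; converting it to vehicle displacement would additionally need the ground resolution and camera orientation, which are outside this sketch.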

Virtual reality environment based manipulation of multi-layered multi-view interactive digital media representations

Various embodiments of the present disclosure relate generally to systems and methods for generating multi-view interactive digital media representations in a virtual reality environment. According to particular embodiments, a plurality of images is fused into a first content model and a first context model, both of which include multi-view interactive digital media representations of objects. Next, a virtual reality environment is generated using the first content model and the first context model. The virtual reality environment includes a first layer and a second layer. The user can navigate through and within the virtual reality environment to switch between multiple viewpoints of the content model via corresponding physical movements. The first layer includes the first content model and the second layer includes a second content model, wherein selection of the first layer provides access to the second layer with the second content model.

AUTOMATIC COMPOSITION OF COMPOSITE IMAGES OR VIDEOS FROM FRAMES CAPTURED WITH MOVING CAMERA
20200228730 · 2020-07-16

A processing device generates composite images from a sequence of images. The composite images may be used as frames of video. A foreground/background segmentation is performed at selected frames to extract a plurality of foreground object images depicting a foreground object at different locations as it moves across a scene. The foreground object images are stored to a foreground object list. The foreground object images in the foreground object list are overlaid onto subsequent video frames that follow the respective frames from which they were extracted, thereby generating a composite video.
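
The overlay loop described here can be sketched with small grayscale frames. Segmentation is replaced by a simple difference against a known background, and camera motion is ignored, so this is an assumed simplification with hypothetical function names rather than the described processing device.

```python
def extract_foreground(frame, background, threshold=10):
    """Stand-in segmentation: keep pixels that differ from a known background."""
    return {
        (x, y): value
        for y, row in enumerate(frame)
        for x, value in enumerate(row)
        if abs(value - background[y][x]) > threshold
    }


def compose(frames, background, selected, threshold=10):
    """Overlay foreground objects from selected frames onto later frames."""
    foreground_list = []  # objects extracted so far, oldest first
    composites = []
    for index, frame in enumerate(frames):
        composite = [row[:] for row in frame]  # copy so inputs stay untouched
        for pixels in foreground_list:
            for (x, y), value in pixels.items():
                composite[y][x] = value
        composites.append(composite)
        # Extract after compositing so an object appears only on later frames.
        if index in selected:
            foreground_list.append(
                extract_foreground(frame, background, threshold)
            )
    return composites
```

For example, with an object moving one pixel per frame and the first two frames selected, the third composite shows the object at all three positions.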