Patent classifications
H04N13/111
SYSTEMS AND METHODS FOR IMAGE REPROJECTION
An imaging system receives depth data (corresponding to an environment) from a depth sensor and first image data (a depiction of the environment) from an image sensor. The imaging system generates, based on the depth data, first motion vectors corresponding to a change in perspective of the depiction of the environment in the first image data. The imaging system generates, using grid inversion based on the first motion vectors, second motion vectors that indicate respective distances moved by respective pixels of the depiction of the environment in the first image data for the change in perspective. The imaging system generates second image data by modifying the first image data according to the second motion vectors. The second image data includes a second depiction of the environment from a different perspective than the first image data. Some image reprojection applications (e.g., frame interpolation) can be performed without the depth data.
SYSTEMS AND METHODS FOR IMAGE REPROJECTION
An imaging system receives depth data (corresponding to an environment) from a depth sensor and first image data (a depiction of the environment) from an image sensor. The imaging system generates, based on the depth data, first motion vectors corresponding to a change in perspective of the depiction of the environment in the first image data. The imaging system generates, using grid inversion based on the first motion vectors, second motion vectors that indicate respective distances moved by respective pixels of the depiction of the environment in the first image data for the change in perspective. The imaging system generates second image data by modifying the first image data according to the second motion vectors. The second image data includes a second depiction of the environment from a different perspective than the first image data. Some image reprojection applications (e.g., frame interpolation) can be performed without the depth data.
SYSTEM AND METHOD FOR GENERATING COMBINED EMBEDDED MULTI-VIEW INTERACTIVE DIGITAL MEDIA REPRESENTATIONS
Various embodiments describe systems and processes for capturing and generating multi-view interactive digital media representations (MIDMRs). In one aspect, a method for automatically generating a MIDMR comprises obtaining a first MIDMR and a second MIDMR. The first MIDMR includes a convex or concave motion capture using a recording device and is a general object MIDMR. The second MIDMR is a specific feature MIDMR. The first and second MIDMRs may be obtained using different capture motions. A third MIDMR is generated from the first and second MIDMRs, and is a combined embedded MIDMR. The combined embedded MIDMR may comprise the second MIDMR being embedded in the first MIDMR, forming an embedded second MIDMR. The third MIDMR may include a general view in which the first MIDMR is displayed for interactive viewing by a user on a user device. The embedded second MIDMR may not be viewable in the general view.
Advanced driver assist system and method of detecting object in the same
ADAS includes a processing circuit and a memory which stores instructions executable by the processing circuit. The processing circuit executes the instructions to cause the ADAS to receive, from a vehicle that is in motion, a video sequence, generate a position image including at least one object included in the stereo image, generate a second position information associated with the at least one object based on reflected signals received from the vehicle, determine regions each including at least a portion of the at least one object as candidate bounding boxes based on the stereo image and the position image, and selectively adjusting class scores of respective ones of the candidate bounding boxes associated with the at least one object based on whether a respective first position information of the respective ones of the candidate bounding boxes matches the second position information.
Advanced driver assist system and method of detecting object in the same
ADAS includes a processing circuit and a memory which stores instructions executable by the processing circuit. The processing circuit executes the instructions to cause the ADAS to receive, from a vehicle that is in motion, a video sequence, generate a position image including at least one object included in the stereo image, generate a second position information associated with the at least one object based on reflected signals received from the vehicle, determine regions each including at least a portion of the at least one object as candidate bounding boxes based on the stereo image and the position image, and selectively adjusting class scores of respective ones of the candidate bounding boxes associated with the at least one object based on whether a respective first position information of the respective ones of the candidate bounding boxes matches the second position information.
Intelligent vehicle systems and control logic for surround view augmentation with object model recognition
Presented are intelligent vehicle systems with networked on-body vehicle cameras with camera-view augmentation capabilities, methods for making/using such systems, and vehicles equipped with such systems. A method for operating a motor vehicle includes a system controller receiving, from a network of vehicle-mounted cameras, camera image data containing a target object from a perspective of one or more cameras. The controller analyzes the camera image to identify characteristics of the target object and classify these characteristics to a corresponding model collection set associated with the type of target object. The controller then identifies a 3D object model assigned to the model collection set associated with the target object type. A new “virtual” image is generated by replacing the target object with the 3D object model positioned in a new orientation. The controller commands a resident vehicle system to execute a control operation using the new image.
Intelligent vehicle systems and control logic for surround view augmentation with object model recognition
Presented are intelligent vehicle systems with networked on-body vehicle cameras with camera-view augmentation capabilities, methods for making/using such systems, and vehicles equipped with such systems. A method for operating a motor vehicle includes a system controller receiving, from a network of vehicle-mounted cameras, camera image data containing a target object from a perspective of one or more cameras. The controller analyzes the camera image to identify characteristics of the target object and classify these characteristics to a corresponding model collection set associated with the type of target object. The controller then identifies a 3D object model assigned to the model collection set associated with the target object type. A new “virtual” image is generated by replacing the target object with the 3D object model positioned in a new orientation. The controller commands a resident vehicle system to execute a control operation using the new image.
Method and system for generating a multiview stereoscopic image
A method and a system for generating a multiview stereoscopic image are provided. The method includes the following steps. An image capturing apparatus captures a real calibration panel to obtain multiple images, and a processor obtains a datum image and multiple images to be calibrated by analyzing the images including the real calibration panel. The processor respectively calculates a homography matrix of each of the images to be calibrated corresponding to the datum image according to the datum image and the images to be calibrated. The processor obtains a calibration matrix of the homography matrix by performing a matrix disassembly calculation on each of the homography matrices. The processor multiplies the images to be calibrated by the corresponding calibration matrix to obtain multiple calibrated images. The processor outputs the multiview stereoscopic image including the datum image and the calibrated images.
View synthesis for dynamic scenes
Apparatuses, systems, and techniques are presented to perform monocular view synthesis of a dynamic scene. Single and multi-view depth information can be determined for a collection of images of a dynamic scene, and a blender network can be used to combine image features for foreground, background, and missing image regions using fused depth maps inferred form the single and multi-view depth information.
Dual system optical alignment for separated cameras
Improved techniques for generating images are disclosed herein. A first image is generated by an integrated camera. The pose of the computer system is determined based on the image, and a timestamp is determined. A detached camera generates a second image. The second image is aligned with the first image. An overlaid image is generated by overlaying the second image onto the first image based on the alignment. A pose difference is then identified between a current pose of the computer and the initial pose. Consequently, late stage reprojection (LSR) is performed on the overlaid image to account for the pose difference. The LSR-corrected overlaid image is then displayed.