Patent classifications
G06T2210/22
VISION DISPLAY SYSTEM FOR VEHICLE
A vision display system for a vehicle includes a rear backup camera, a driver side camera, a passenger side camera, a front camera, and a display system including a video display screen. Rear backup video images derived from image data captured by the rear backup camera may be displayed by the video display screen for no more than ten seconds after shifting of the vehicle transmission of the vehicle out of reverse gear. Upon the engine of the vehicle being first started after initial ignition on of the vehicle, priority may be given to display by the video display screen of rear backup video images. Within two seconds after shifting of the vehicle transmission of the vehicle into reverse gear, rear backup video images may be displayed by the video display screen.
Modification of post-viewing parameters for digital images using image region or feature information
A method of generating one or more new digital images using an original digitally-acquired image including a selected image feature includes identifying within a digital image acquisition device one or more groups of pixels that correspond to the selected image feature based on information from one or more preview images. A portion of the original image is selected that includes the one or more groups of pixels. The technique includes automatically generating values of pixels of one or more new images based on the selected portion in a manner which includes the selected image feature within the one or more new images.
System and method for local three dimensional volume reconstruction using a standard fluoroscope
A system and method for constructing fluoroscopic-based three dimensional volumetric data from two dimensional fluoroscopic images including a computing device configured to facilitate navigation of a medical device to a target area within a patient and a fluoroscopic imaging device configured to acquire a fluoroscopic video of the target area about a plurality of angles relative to the target area. The computing device is configured to determine a pose of the fluoroscopic imaging device for each frame of the fluoroscopic video and to construct fluoroscopic-based three dimensional volumetric data of the target area in which soft tissue objects are visible using a fast iterative three dimensional construction algorithm.
Vehicular rear backup system with video display
A vehicular rear backup system includes a rear backup digital camera disposed at a vehicle and a display device having a video display screen disposed in an interior cabin of the vehicle and viewable by the driver of the vehicle. Rear backup video images that are derived from image data captured by the rear backup digital camera are displayed on the video display screen no later than two seconds after the driver of the vehicle first changes propulsion of the vehicle during a new ignition cycle to reverse mode to commence a first backup event. Until the first backup event is completed, the video display screen displays rear backup video images. During a backup event of the vehicle, image data captured by the rear backup digital camera is provided to and is processed by an image processor of the system to detect presence of an object rearward of the vehicle.
ENHANCING DOCUMENTS PORTRAYED IN DIGITAL IMAGES
The present disclosure is directed toward systems and methods that efficiently and effectively generate an enhanced document image of a displayed document in an image frame captured from a live image feed. For example, systems and methods described herein apply a document enhancement process to a displayed document in an image frame that result in an enhanced document image that is cropped, rectified, un-shadowed, and with dark text against a mostly white background. Additionally, systems and method described herein determine whether a stored digital content item includes a displayed document. In response to determining that a stored digital content item does include a displayed document, systems and methods described herein generate an enhanced document image of a displayed document included in the stored digital content item.
Method for reconstructing a 3D object based on dynamic graph network
The present invention provides a method for reconstructing a 3D object based on dynamic graph network, first, obtaining a plurality of feature vectors from 2D image I of an object; then, preparing input data: predefining an initial ellipsoid mesh, obtaining a feature input X by filling initial features and creating a relationship matrix A corresponding to the feature input X; then, inputting the feature input X and corresponding relationship matrix A to a dynamic graph network for integrating and deducing of each vertex's feature, thus new relationship matrix is obtained and used for the later graph convoluting, which improves the initial graph information and makes the initial graph information adapted to the mesh relation of the corresponding object, therefore the accuracy and the effect of 3D object reconstruction have been improved; last, regressing the position, thus the 3D structure of the object is deduced, and the 3D object reconstruction is completed.
DEEP NEURAL NETWORK FRAMEWORK FOR PROCESSING OCT IMAGES TO PREDICT TREATMENT INTENSITY
Systems and methods relate to processing optical tomography coherence (OCT) images to predict characteristics of a treatment to be administered to effectively treat age-related macular degeneration. The processing can include pre-processing the image by flattening and/or cropping the image and processing the pre-processed image using a neural network. The neural network can include a deep convolutional neural network. An output of the neural network can indicate a predicted frequency and/or interval at which a treatment (e.g., anti-vascular endothelial growth factor therapy) is to be administered so as to prevent leakage of vasculature in the eye.
HIERARCHICAL IMAGE GENERATION VIA TRANSFORMER-BASED SEQUENTIAL PATCH SELECTION
Systems and methods for image processing are described. Embodiments of the present disclosure identify a first image depicting a first object; identify a plurality of candidate images depicting a second object; select a second image from the plurality of candidate images depicting the second object based on the second image and a sequence of previous images including the first image using a crop selection network trained to select a next compatible image based on the sequence of previous images; and generate a composite image depicting the first object and the second object based on the first image and the second image.
Methods and apparatus for using wide area networks to support parking systems
A customer premises parking system including one or more cameras and wireless communications transmission capability is described. The presence of a vehicle with a license plate at an entrance or exit of a parking area is detected and one or more high resolution images are captured and time stamped. A license plate portion of the HD image is captured and stored. The license plate number and corresponding time is determined and communicated using a wide area wireless network. The HD license plate portion of the image is processed to produce a low resolution version of the license plate image which is segmented and processed into fragments which are transmitted via the wide area network using available transmission opportunities. A high resolution version of the license plate portion is stored at the premises where the parking facilitate is located and can be retrieved via a cellular wireless interface in the event of a dispute or discrepancy with regard to the detected and reported license plate number. From the time stamps associated with entrance and exit images of license plates the amount of time and/or date at which a vehicle was present at the parking garage is determined and the vehicle owner billed for the use of the parking facility.
Scalable FOV+ for VR 360 video delivery to remote end users
A distribution device for delivering a selected viewport stream of virtual reality (VR) data to each of a plurality of client devices, comprising a processor configured for receiving a plurality of extended viewport streams of a VR video file each comprising a sequence of extended field of view (EFOV) frames created for a respective one of a plurality of overlapping segments constituting a sphere defined in the VR video file and delivering a selected one of the plurality of extended viewport streams to each of a plurality of client devices by performing the following for each of the client devices in each of a plurality of iterations: (1) receiving a current orientation data of the respective client device; (2) selecting one of the plurality of extended viewport streams according to the current orientation data; and (3) transmitting the selected extended viewport stream to the respective client device.