Patent classifications
G06V10/16
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing apparatus acquires projection data obtained by dividing a subject into first and second divided areas and capturing the first and second divided areas, the projection data including first projection data obtained by capturing a dynamic state of the subject in a first capturing range including the first divided area and second projection data obtained by capturing the dynamic state of the subject in a second capturing range including the second divided area. The apparatus acquires similarity relating to the dynamic state of the subject on a basis of projection data of a first partial area and projection data of a second partial area, and acquires a first timing for reconstructing an image of the first divided area and a second timing for reconstructing an image of the second divided area, on a basis of the similarity.
Multi-angle object recognition
Methods, systems, and apparatus for controlling smart devices are described. In one aspect a method includes capturing, by a camera on a user device, a plurality of successive images for display in an application environment of an application executing on the user device, performing an object recognition process on the images, the object recognition process including determining that a plurality of images, each depicting a particular object, are required to perform object recognition on the particular object, and in response to the determination, generating a user interface element that indicates a camera operation to be performed, the camera option capturing two or more images, determining that a user, in response to the user interface element, has caused the indicated camera operation to be performed to capture the two or more images, and in response, determining whether a particular object is positively identified from the plurality of images.
METHOD FOR ASCERTAINING THE SUITABILITY OF A POSITION FOR A DEPLOYMENT FOR SURVEYING
One aspect of the invention relates to a fully automatic method for calculating the current, geo-referenced position and alignment of a terrestrial scan-surveying device in situ on the basis of a current panoramic image recorded by the surveying device and at least one stored, geo-referenced 3D scan panoramic image.
Face detection in spherical images
A face located along a stitch line in a spherical image is detected by rendering views of regions of the spherical image along the stitch line. The spherical image may be produced by combining first and second images. A first view of a projection of the spherical image is rendered. A scaling factor for rendering a second view of the projection is determined based characteristics of the first portion of the face. The second view is then rendered according to the scaling factor. The use of the scaling factor to render the second view causes a change in the depiction of the second portion of the face. For example, the scaling factor can indicate to change the resolution or expected size of the second portion of the face when rendering the second view. A face is then detected within the spherical image based on the rendered first and second views.
METHOD FOR STITCHING IMAGES OF CAPSULE ENDOSCOPE, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM
A method for stitching images of a capsule endoscope, an electronic device, and a readable storage medium are provided. The method comprises: performing image rectification and circular edge masking on original images to form pre-processed images, and performing image enhancement (S1); completing detection and pairing of feature points (S2); calculating a transformation model of all pre-processed images to a same optimal plane according to a set of the feature points (S3); performing projective transformation for each enhanced image to a same coordinate system (S4); and stitching images according to an obtained sequence to form a fused image for output (S5), thus expanding the field of view of a single image of the capsule endoscope.
CLOUD-BASED FRAMEWORK FOR PROCESSING, ANALYZING, AND VISUALIZING IMAGING DATA
Embodiments of the present disclosure provide methods, apparatus, systems, computing devices, computing entities, and/or the like for detecting objects located in an area of interest. In accordance with one embodiment, a method is provided comprising: receiving, via an interface provided through a general instance on a cloud environment, imaging data comprising raw images collected on the area of interest; upon receiving the images: activating a central processing unit (CPU) focused instance on the cloud environment and processing, via the image, the raw images to generate an image map of the area of interest; and after generating the image map: activating a graphical processing unit (GPU) focused instance on the cloud environment and performing object detection, via the image, on a region within the image map by applying one or more object detection algorithms to the region to identify locations of the objects in the region.
VIDEO GENERATION METHOD AND APPARATUS, STORAGE MEDIUM, AND COMPUTER DEVICE
A video generation method includes: performing video shooting in response to a trigger operation for a video duet option; obtaining a second video currently shot, where the second video corresponds to a video clip including a target character in a first video; and fusing the second video into video content of the first video based on one or more characters recognized in the first video, to obtain a duet video, the one or more characters including the target character.
METHOD, APPARATUS, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM FOR CONFIRMING A PERCEIVED POSITION OF A TRAFFIC LIGHT
A method, apparatus, and computer-readable medium for confirming a perceived position of a traffic light, by obtaining identifiers and results of a first perception of traffic lights associated with the identifiers, the results of the first perception including a first estimation of an ellipse encompassing each of the traffic lights, receiving results of a second perception of traffic lights associated with the identifiers, the results of the second perception including a second estimation of an ellipse encompassing each of the traffic lights, calculating, based on the first perception and the second perception, association parameters for each possible pair of estimated ellipses, selecting, based on the calculated association parameters for each possible pair of estimated ellipses, matching pairs of estimated ellipses, and fusing each matching pair of estimated ellipses.
APPEARANCE CAPTURE UNDER MULTIPLE LIGHTING CONDITIONS
One embodiment of the present invention sets forth a technique for performing appearance capture. The technique includes receiving a first sequence of images of an object, wherein the first sequence of images includes a first set of images interleaved with a second set of images, and wherein the first set of images is captured based on illumination of the object using a first lighting pattern and the second set of images is captured based on illumination of the object using one or more lighting patterns that are different from the first lighting pattern. The technique also includes generating a first set of appearance parameters associated with the object based on a first inverse rendering associated with the first sequence of images.
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, AND RECORDING MEDIUM
An image processing apparatus according to one aspect includes a hardware processor connected to a memory. The hardware processor performs a process including: acquiring a plurality of captured images whose capturing areas overlap with one another; determining whether an object is included in an overlap portion of adjacent ones of the plurality of captured images in a projected image, the projected image being obtained by projecting the plurality of captured images onto a reference projection plane, the reference projection plane being an image projection plane virtually disposed in a virtual space corresponding to a real space; and performing an adjustment process on an overlap area of the reference projection plane, the overlap area including the object on the reference projection plane and corresponding to the overlap portion including the object.