Patent classifications
G06V10/462
INVARIANT-BASED DIMENSIONAL REDUCTION OF OBJECT RECOGNITION FEATURES, SYSTEMS AND METHODS
A sensor data processing system and method is described. Contemplated systems and methods derive a first recognition trait of an object from a first data set that represents the object in a first environmental state. A second recognition trait of the object is then derived from a second data set that represents the object in a second environmental state. The sensor data processing systems and methods then identifies a mapping of elements of the first and second recognition traits in a new representation space. The mapping of elements satisfies a variance criterion for corresponding elements, which allows the mapping to be used for object recognition. The sensor data processing systems and methods described herein provide new object recognition techniques that are computationally efficient and can be performed in real-time by the mobile phone technology that is currently available.
LOCALIZATION AND MAPPING METHOD
A method comprising: obtaining a three-dimensional (3D) point cloud about an object; obtaining binary feature descriptors for feature points in a 2D image about the object; assigning a plurality of index values for each feature point as multiple bits of the corresponding binary feature descriptor; storing the binary feature descriptor in a table entry of a plurality of hash key tables of a database image; obtaining query binary feature descriptors for feature points in a query image; matching the query binary feature descriptors to the binary feature descriptors of the database image; reselecting one bit of the hash key of the matched database image; and re-indexing the feature points in the table entries of the hash key table of the database image.
DETERMINING A LOCATION AT WHICH A GIVEN FEATURE IS REPRESENTED IN MEDICAL IMAGING DATA
A computer implemented method and apparatus for determining a location at which a given feature is represented in medical imaging data is disclosed. A first descriptor for a first location in first medical imaging data is obtained. The first location is the location within the first medical imaging data at which the given feature is represented. A second descriptor for each of a plurality of candidate second locations in second medical imaging data is obtained. A similarity metric indicating a degree of similarity with the first descriptor is calculated for each of the plurality of candidate second locations. A candidate second location is selected from among the plurality of candidate second locations based on the calculated similarity metrics. The location at which the given feature is represented in the second medical imaging data is determined based on the selected candidate second location.
Techniques for optimizing the display of videos
The disclosed embodiments disclose techniques for optimizing the display of videos. During operation, a computing device receives a video stream to be displayed. The computing device determines a preferred orientation for the video stream, determines a present orientation for the computing device, and determines a mismatch between the preferred orientation and the present orientation. The computing device adjusts the video stream while displaying the video stream on the display. As the video stream plays, the computing device detects any rotation of the computing device, and if so, re-adjusts how the video stream is displayed.
Apparatus for real-time monitoring for construction object and monitoring method and computer program for the same
Disclosed herein is an apparatus for the real-time monitoring of construction objects. The apparatus for the real-time monitoring of construction objects includes: a communication unit configured to receive image data acquired by photographing a construction site, and to transmit safety information to an external device; and a monitoring unit provided with a prediction model pre-trained using binary image sequences of construction objects at the construction site as training data, and configured to detect a plurality of construction objects from image frames included in image data received via the communication unit and convert the detected construction objects into binary images, to generate future frames by inputting the resulting binary images to the prediction model, and to derive proximity between the construction objects by comparing the generated future frames with the resulting binary images and generate the safety information.
Co-heterogeneous and adaptive 3D pathological abdominal organ segmentation using multi-source and multi-phase clinical image datasets
The present disclosure describes a computer-implemented method for processing clinical three-dimensional image. The method includes training a fully supervised segmentation model using a labelled image dataset containing images for a disease at a predefined set of contrast phases or modalities, allow the segmentation model to segment images at the predefined set of contrast phases or modalities; finetuning the fully supervised segmentation model through co-heterogenous training and adversarial domain adaptation (ADA) using an unlabelled image dataset containing clinical multi-phase or multi-modality image data, to allow the segmentation model to segment images at contrast phases or modalities other than the predefined set of contrast phases or modalities; and further finetuning the fully supervised segmentation model using domain-specific pseudo labelling to identify pathological regions missed by the segmentation model.
Systems and methods for screenshot linking
A system for analyzing screenshots can include a computing device including a processor coupled to a memory and a display screen configured to display content. The system can include an application stored on the memory and executable by the processor. The application can include a screenshot receiver configured to access, from storage to which a screenshot of the content displayed on the display screen captured using a screenshot function of the computing device is stored, the screenshot including an image and a predetermined marker. The application can include a marker detector configured to detect the predetermined marker included in the screenshot. The application can include a link identifier configured to identify, using the predetermined marker, a link to a resource mapped to the image included in the screenshot, the resource accessible by the computing device via the link.
RUNTIME OPTIMISED ARTIFICIAL VISION
A method for creating artificial vision with an implantable visual stimulation device. The method comprises receiving image data comprising, for each of multiple points of an image, a depth value, performing a local background enclosure calculation on the image data to determine salient object information, and generating a visual stimulus to visualise the salient object information using the implantable visual stimulation device. Performing the local background enclosure calculation is based on a subset of the multiple points of the input image, and the subset of the multiple points is defined based on the depth value of the multiple points.
Real time region of interest (ROI) detection in thermal face images based on heuristic approach
Embodiments herein provide a method and system for real time ROI detection in thermal face images based on a heuristic approach. The ROI of the thermal images, once detected, is then further used to detect temperature of a subject corresponding to the ROI. Unlike state of the art techniques, the heuristic approach is computationally less intensive and provides fast and accurate ROI detection even in case of occluded faces in a crowd with a single thermal image having a plurality of subject being scanned. The heuristics applied does not focus on face detection but directly on point of interest detection. Once the point of interest (ROI) is detected, it may be used for plurality of applications such as subject tracking and the like, not limited to subject or object temperature sensing since the method disclosed herein is easily implementable on low power devices.
DESIGN OPTIMIZATION AND USE OF CODEBOOKS FOR DOCUMENT ANALYSIS
A method of generating and optimizing a codebooks for document analysis comprises: receiving a first set of document images; extracting a plurality of keypoint regions from each document image of the first set of document images; calculating local descriptors for each keypoint region of the extracted keypoint regions; clustering the local descriptors such that each center of a cluster of local descriptors corresponds to a respective visual word; generating a codebook containing a set of visual words; and optimizing the codebook by maximizing mutual information (MI) between a target field of a second set of document images and at least one visual word of the set of visual words.