Patent classifications
G06V10/87
Object recognition system and object recognition method
In a data driven object recognition system and object recognition method, a connection relationship between an object feature extraction unit and a plurality of task-specific identification units is stored in a connection switch according to a type of task. The connection relationship is changed based on the connection information to suppress the amount of labeling in constructing a learning data set.
Classifying individual elements of an infrastructure model
In example embodiments, techniques are provided to automatically classify individual elements of an infrastructure model by training one or more machine learning algorithms on classified infrastructure models, producing a classification model that maps features to classification labels, and utilizing the classification model to classify the individual elements of the infrastructure model. The resulting classified elements may then be readily subject to analytics, for example, enabling the display of dashboards for monitoring project performance and the impact of design changes. Such techniques enable classification of elements of new infrastructure models or in updates to existing infrastructure models.
Three-dimensional room analysis with audio input
System and methods are provided that generate a three-dimensional model from a physical space. While a user is scanning and/or recording the physical space with a user computing device, user speech describing the physical space is recorded. A transcript is generated from the audio captured during the scan and/or image recording of the physical space. Keywords from the transcript are used to improve computer-vision object identification, which is incorporated in the three-dimensional model.
Device and method for selecting a deep learning network for processing images
A method for selecting a deep learning network which is optimal for solving an image processing task obtaining a type of the image processing task, selecting a data set according to the type of problem, and dividing selected data set into training data and test data. Similarities between different training data are calculated, and a batch size of the training data is adjusted according to the similarities of the training data. A plurality of deep learning networks is selected according to the type of problem, and the plurality of deep learning networks is trained through the training data to obtain network models. Each of the network models is tested through the test data, and the optimal deep learning network with the best test result is selected from the plurality of deep learning networks appropriate for image processing.
Federated learning using local ground truth estimation
Various implementations disclosed herein include devices, systems, and methods that involve federated learning techniques that utilize locally-determined ground truth data that may be used in addition to, or in the alternative to, user-provided ground truth data. Some implementations provide an improved federated learning technique that creates ground truth data on the user device using a second prediction technique that differs from a first prediction technique/model that is being trained. The second prediction technique may be better but may be less suited for real time, general use than the first prediction technique.
MODEL COMBINING AND INTERACTION FOR MEDICAL IMAGING
This disclosure relates to the combining and interaction of multiple artificial intelligence (AI) models for medical image analysis. An example method includes obtaining AI models from model providers and organizing them to form associations. In response to a user request, base models are selected and provided. Additional models are further selected to combine with the base models, and medical image analysis results are presented based on applying a combination of the models to target medical image data.
Object recognition device, object recognition method, and object recognition program
An object recognition device 80 includes a scene determination unit 81, a learning-model selection unit 82, and an object recognition unit 83. The scene determination unit 81 determines, based on information obtained during driving of a vehicle, a scene of the vehicle. The learning-model selection unit 82 selects, in accordance with the determined scene, a learning model to be used for object recognition from two or more learning models. The object recognition unit 83 recognizes, using the selected learning model, an object in an image to be photographed during driving of the vehicle.
Vehicle listing image detection and alert system
An image error identification system retrieves an image associated with a vehicle listing and uses various machine learning models to classify the image and generate identification data that may include a vehicle make, model, trim level, and/or various features of the vehicle present in the image. The identification data is compared to the rest of the vehicle listing to detect a mismatch between the image and the vehicle listing. An alert is generated, when a mismatch is detected, indicating the one of the image or the data in the vehicle listing is incorrect.
ELECTRONIC DEVICE AND OPERATING METHOD THEREOF
According to an embodiment of the specification, disclosed is an electronic device a camera, a processor operatively connected to the camera and a memory operatively connected to the processor, wherein the memory stores instructions that, when executed, cause the processor to obtain an image by using the camera, identify an object-of-interest among a plurality of objects included in the image, determine a selected segmentation model among a plurality of segmentation models based on a size of the object-of-interest and apply the determined segmentation model to a region of interest (ROI) of the image containing the object-of-interest.
Systems and methods for stream recognition
The present disclosure provides systems and methods for providing augmented reality experiences. Consistent with disclosed embodiments, one or more machine-learning models can be trained to selectively process image data. A pre-processor can be configured to receive image data provided by a user device and trained to automatically determine whether to select and apply a preprocessing technique to the image data. A classifier can be trained to identify whether the image data received from the pre-processor includes a match to one of a plurality of triggers. A selection engine can be trained to select, based on a matched trigger and in response to the identification of the match, a processing engine. The processing engine can be configured to generate an output using the image data, and store the output or provide the output to the user device or a client system.