G06V10/778

Information processing apparatus, information processing method, and program
11710567 · 2023-07-25 · ·

Provided are an information processing apparatus, an information processing method, and a program capable of accumulating appropriate relearning data. An information processing apparatus includes an input unit that inputs input data to a learned model acquired in advance through machine learning using learning data, an acquisition unit that acquires output data output from the learned model through the input using the input unit, a reception unit that receives correction performed by a user for the output data acquired by the acquisition unit, and a storage controller that performs control for storing, as relearning data of the learned model, the input data and the output data that reflects the correction received by the reception unit in a storage unit in a case where a value indicating a correction amount acquired by performing the correction for the output data is equal to or greater than a threshold value.

Vehicle damage estimation

A computer, including a processor and a memory, the memory including instructions to be executed by the processor to train a generative adversarial network (GAN) to reconstruct a missing portion of an image by determining a reconstructed portion of the image based on data from portions of the image surrounding the missing portion and compare an acquired image with the reconstructed portion of the image to determine a damaged portion. The instructions further include instructions to determine estimated damage based on the damaged portion.

SYSTEMS AND METHODS FOR RAPID DEVELOPMENT OF OBJECT DETECTOR MODELS

A computer vision system configured for detection and recognition of objects in video and still imagery in a live or historical setting uses a teacher-student object detector training approach to yield a merged student model capable of detecting all of the classes of objects any of the teacher models is trained to detect. Further, training is simplified by providing an iterative training process wherein a relatively small number of images is labeled manually as initial training data, after which an iterated model cooperates with a machine-assisted labeling process and an active learning process where detector model accuracy improves with each iteration, yielding improved computational efficiency. Further, synthetic data is generated by which an object of interest can be placed in a variety of setting sufficient to permit training of models. A user interface guides the operator in the construction of a custom model capable of detecting a new object.

INTENTION DETECTION DEVICE, INTENTION DETECTION METHOD COMPUTER-READABLE STORAGE MEDIUM
20230230354 · 2023-07-20 · ·

An intention detection device 1X includes a preprocessor 21X, a motion pattern/object relation identifier 22X and a detector 23X. The preprocessor 21X is configured to generate preprocessed data associated with a human and a relevant object by processing a detection signal outputted by a sensor. The motion pattern/object relation identifier 22X is configured to identify a motion pattern of the human and a relation between the human and the object based on the preprocessed data. The detector 23X is configured to detect at least one of an activity, a gesture or a predicted step regarding the human based on the identified motion pattern and the identified relation able to integrate and provide lexical descriptions of the at least one of the activity, the gesture or the predicted step.

SEMI-SUPERVISED LEARNING VIA DIFFERENT MODALITIES
20230230350 · 2023-07-20 · ·

A method for semi-supervised learning via different modalities, the method may include obtaining a training sensed information units of a first modality that are associated with a certain pattern; obtaining multimodality information units that are untagged; wherein a multimodality information unit comprises a first modality portion and a second modality portion; searching for certain pattern related multimodality information units, wherein a certain pattern related multimodality information unit comprises a first modality portion that is related to the certain pattern; clustering the second portions of the certain pattern related multimodality information units to provide second portion clusters; generating certain pattern identifiers based on the second portion clusters; and responding to the generating of the certain pattern identifiers; wherein the responding comprises at least one out of storing the certain pattern identifiers, transmitting the certain pattern identifiers, and generating notifications to be sent once a signature of a query sensed information unit of the second modality comprises the certain pattern identifier.

SYSTEM AND METHODS FOR ACTIVE DOMAIN ADAPTATION

Systems and methods for machine learning are described. The systems and methods include receiving target training data including a training image and ground truth label data for the training image, generating source network features for the training image using a source network trained on source training data, generating target network features for the training image using a target network, generating at least one attention map for training the target network based on the source network features and the target network features using a guided attention transfer network, and updating parameters of the target network based on the attention map and the ground truth label data.

SYSTEM AND METHODS FOR ACTIVE DOMAIN ADAPTATION

Systems and methods for machine learning are described. The systems and methods include receiving target training data including a training image and ground truth label data for the training image, generating source network features for the training image using a source network trained on source training data, generating target network features for the training image using a target network, generating at least one attention map for training the target network based on the source network features and the target network features using a guided attention transfer network, and updating parameters of the target network based on the attention map and the ground truth label data.

COMPUTER VISION TECHNOLOGIES FOR RAPID DETECTION

A computing system includes a processor; and a memory having stored thereon an adjustment application comprising computer-executable instructions that, when executed, cause the computing system to: display a graphical user interface including a digital medical image of a patient; superimpose a bounding box; receive an adjustment of an area of interest; and provide an adjusted digital medical image. A non-transitory computer-readable medium includes computer-executable instructions that, when executed via one or more processors, cause a computer to: display a graphical user interface including a digital medical image of a patient; superimpose a bounding box; receive an adjustment of an area of interest; and provide an adjusted digital medical image. A computer-implemented method includes: displaying a graphical user interface including a digital medical image of a patient; superimposing a bounding box; receiving an adjustment of an area of interest; and providing an adjusted digital medical image.

Adaptive cyber-physical system for efficient monitoring of unstructured environments

The present disclosure provides a system for monitoring unstructured environments. A predetermined path can be determined according to an assignment of geolocations to one or more agronomically anomalous target areas, where the one or more agronomically anomalous target areas are determined according to an analysis of a plurality of first images that automatically identifies a target area that deviates from a determination of an average of the plurality of first images that represents an anomalous place within a predetermined area, where the plurality of first images of the predetermined area are captured by a camera during a flight over the predetermined area. A camera of an unmanned vehicle can capture at least one second image of the one or more agronomically anomalous target areas as the unmanned vehicle travels along the predetermined path.

Machine learning system and method for determining or inferring user action and intent based on screen image analysis
11704898 · 2023-07-18 · ·

System(s) and method(s) that analyze image data associated with a computing screen operated by a user, and learns the image data (e.g., using pattern recognition, historical information analysis, user implicit and explicit training data, optical character recognition (OCR), video information, 360°/panoramic recordings, and so on) to concurrently glean information regarding multiple states of user interaction (e.g., analyzing data associated with multiple applications open on a desktop, mobile phone or tablet). A machine learning model is trained on analysis of graphical image data associated with screen display to determine or infer user intent. An input component receives image data regarding a screen display associated with user interaction with a computing device. An analysis component employs the model to determine or infer user intent based on the image data analysis; and an action component provisions services to the user as a function of the determined or inferred user intent. In an implementation, a gaming component gamifies interaction with the user in connection with explicitly training the model.