Patent classifications
G06V10/7747
LANDING TRACKING CONTROL METHOD AND SYSTEM BASED ON LIGHTWEIGHT TWIN NETWORK AND UNMANNED AERIAL VEHICLE
A landing tracking control method comprises the following contents: a tracking model training stage and an unmanned aerial vehicle real-time tracking stage. The landing tracking control method extracts a network Snet by using a lightweight feature and makes modification, so that an extraction speed of the feature is increased to better meet a real-time requirement. Weight allocation on the importance of channel information is carried out to differentiate effective features more purposefully and utilize the features, so that the tracking precision is improved. In order to improve a training effect of the network, a loss function of an RPN network is optimized, a regression precision of a target frame is measured by using CIOU, and meanwhile, calculation of classified loss function is adjusted according to CIOU, and a relation between a regression network and classification network is enhanced.
METHOD FOR TRAINING NEURAL NETWORK BY USING DE-IDENTIFIED IMAGE AND SERVER PROVIDING SAME
The present invention relates to a neural network training method. The neural network training method using a de-identified image according to the present invention comprises the steps of: encoding a first image represented by a n-th dimensional vector into a predetermined p-th dimensional second image; decoding the second image into a q-th dimensional third image; inputting the third image to a neural network and extracting object information included in the third image; and training at least one parameter information used for computation in the neural network by using the extracted object information. According to the present invention, de-identified images are used for neural network training such that neural network training is made possible without using personal information included in images.
Weakly Supervised Action Selection Learning in Video
A video localization system localizes actions in videos based on a classification model and an actionness model. The classification model is trained to make predictions of which segments of a video depict an action and to classify the actions in the segments. The actionness model predicts whether any action is occurring in each segment, rather than predicting a particular type of action. This reduces the likelihood that the video localization system over-relies on contextual information in localizing actions in video. Furthermore, the classification model and the actionness model are trained based on weakly-labeled data, thereby reducing the cost and time required to generate training data for the video localization system.
Attenuating visual artifacts of image processing systems using adversarial networks on-the-fly
An apparatus, method, and a computer readable medium for attenuating visual artifacts in processed images. An annotated dataset of images to be processed by an image processing system is created. An adversarial control network is trained to operate as an image quality expert in classifying images. After the adversarial control network has been trained, the adversarial control network is used to supervise the image processing system on-the-fly.
Generative adversarial neural network assisted compression and broadcast
A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
NON-CONTACT, NON-INVASIVE, AND QUANTITATIVE AGRICULTURAL MONITORING DEVICES AND SYSTEMS
Methods and systems for agricultural monitoring are disclosed. The methods and systems include: projecting, by a light source, a coherent light output on at least a part of a plant material; acquiring, by an imaging device, one or more speckle images based on the coherent light output backscattered by at least the part of the plant material; providing the one or more speckle images to a trained machine learning algorithm; identifying one or more healthy status indications of at least the part of the plant material based on one or more outputs from the trained machine learning algorithm, the one or more outputs corresponding to the one or more speckle images; and providing plant health information based on the healthy status indication. Other aspects, embodiments, and features are also claimed and described.
EFFICIENT RETRIEVAL OF A TARGET FROM AN IMAGE IN A COLLECTION OF REMOTELY SENSED DATA
State of art techniques performing image labeling of remotely sensed data are computation intensive, consume time and resources. A method and system for efficient retrieval of a target in an image in a collection of remotely sensed data is disclosed. Image scanning is performed efficiently, wherein only a small percentage of pixels from the entire image are scanned to identify the target. One or more samples are intelligently identified based on sample selection criteria and are scanned for detecting presence of the target based on cumulative evidence score Plurality of sampling approaches comprising active sampling, distributed sampling and hybrid sampling are disclosed that either detect and localize the target or perform image labeling indicating only presence of the target.
APPROACH TO UNSUPERVISED DATA LABELING
Systems and methods for labelling data is provided. The method includes receiving data at a detector, and identifying a set of objects and features in the data using a neural network. The method further includes annotating the data based on the identified set of objects and features, and receiving a query from a user. The method further includes transforming the query into a representation that can be processed by a symbolic engine, and receiving the annotated data and a transformed query at the symbolic engine. The method further includes matching the transformed query with the annotated data, and presenting the annotated data that matches the transformed query to the user in a labelling interface. The method further includes applying new labels received from the user for the annotated data that matches the transformed query, recursively utilizing the newly annotated data to refine the detector.
Method for temporal stabilization of landmark localization
Various embodiments set forth systems and techniques for training a landmark model. The techniques include determining, using the landmark model, a first landmark in a set of first landmarks associated with a first image; performing, on the first image, a first perturbation to obtain a second image; determining, using the landmark model, a second landmark in a set of second landmarks associated with the second image; determining, based on a first distance between the first landmark and the second landmark, a first loss function; and updating, based on the first loss function, a first parameter of the landmark model.
METHOD FOR ASSESSING HAZARD ON FLOOD SENSITIVITY BASED ON ENSEMBLE LEARNING
A method for assessing a hazard on flood sensitivity based on an ensemble learning includes collecting such data as topography, hydrometeorology, soil vegetation in a research region as feature data, and standardizing the feature data; extracting the historical inundation points and non-inundation points in the research basin according to historical water level data and remote sensing data; selecting an optimal feature subset by using Laplace scores. The method includes dividing sample points into a training set and a testing set and training the ensemble learning model; and calculating the hazard on the flood sensitivity for the whole basin by using the trained model to generate a grade distribution map of the hazard on the flood sensitivity in the basin. In the present disclosure, each of the feature data in the research region is taken as an input, the ensemble learning model improves accuracy for assessing the flood in the basin.