G06V10/80

Techniques for determining a location of a mobile object

Techniques are disclosed for determining a location of an object based at least in part on a motion of the object. The techniques include generating a motion profile based at least in part on motion data received from a mobile device that is associated with the object. The techniques further include receiving, from a camera at a location, a plurality of images that identifies a candidate motion of a candidate object through at least a portion of the location. The techniques further include generating a candidate motion profile corresponding to the candidate motion of the candidate object based at least in part on the plurality of images. Based at least in part on a score generated by comparing the motion profile with the candidate motion profile, the techniques may determine that the candidate object is the object.

Landslide recognition method based on laplacian pyramid remote sensing image fusion

A landslide recognition method based on Laplacian pyramid remote sensing image fusion includes: performing original remote sensing image reconstruction based on extracted local features and global features of remote sensing images through a Laplacian pyramid fusion module to generate a fused image, constructing a deep learning semantic segmentation model through a semantic segmentation network, labeling the fused image to obtain a dataset of landslide disaster label map, and training the deep learning semantic segmentation model by the dataset, and then storing when a loss curve is fitted and a landslide recognition accuracy of remote sensing image of the deep learning semantics segmentation model meets a requirement by modifying a structure of the semantic segmentation network and adjusting parameters of the deep learning semantics segmentation model. Combined with the image fusion model based on Laplacian pyramid, the method can provide effective decision-making basis for prevention and mitigation of landslide disasters.

METHOD, APPARATUS, ELECTRONIC DEVICE AND MEDIUM FOR IMAGE SUPER-RESOLUTION AND MODEL TRAINING
20220383452 · 2022-12-01 ·

The embodiments of the present application provide method, apparatus, electronic device, and medium for image super-resolution and model training. The method includes: inputting the image to be processed into a first super-resolution network model and a second super-resolution network model trained in advance, respectively; the first super-resolution network model is a trained convolutional neural network; the second super-resolution network model is a generative network included in a trained generative adversarial network; obtaining a first image output from the first super-resolution network model and a second image output from the second super-resolution network model; fusing the first image and the second image to obtain a target image, wherein the resolution of the target image is greater than the resolution of the image to be processed.

METHOD, APPARATUS, ELECTRONIC DEVICE AND MEDIUM FOR IMAGE SUPER-RESOLUTION AND MODEL TRAINING
20220383452 · 2022-12-01 ·

The embodiments of the present application provide method, apparatus, electronic device, and medium for image super-resolution and model training. The method includes: inputting the image to be processed into a first super-resolution network model and a second super-resolution network model trained in advance, respectively; the first super-resolution network model is a trained convolutional neural network; the second super-resolution network model is a generative network included in a trained generative adversarial network; obtaining a first image output from the first super-resolution network model and a second image output from the second super-resolution network model; fusing the first image and the second image to obtain a target image, wherein the resolution of the target image is greater than the resolution of the image to be processed.

System and Method for Identity Preservative Representation of Persons and Objects Using Spatial and Appearance Attributes

A method is described, for processing images of persons or objects to generate an identity preservative feature descriptor learnt for each person or object. The method includes obtaining an image of a person or object, extracting at least one spatial attribute of the person or object from the obtained image, and extracting at least one appearance feature of the person or object from the image by using a mapping function to translate image pixels into appearance attributes represented by at least one numerical feature. The method also includes combining the at least one spatial attribute and the at least one appearance feature to generate the unique feature descriptor representing the person or the object, to assign the unique feature descriptor to the image to enable feature descriptors representing the same person or object to be compared to feature descriptors representing different people or objects given a predefined mathematical pseudo-distance metric according to a least a distance from each other.

IMAGE PROCESSING METHOD, MODEL TRAINING METHOD, RELEVANT DEVICES AND ELECTRONIC DEVICE
20220383626 · 2022-12-01 ·

An image processing method includes: obtaining a first categorical feature and M first image features corresponding to M first images respectively, each first image being associated with a task index, task indices associated with different first images being different from each other, M being a positive integer; fusing the M first image features with the first categorical feature respectively so as to obtain M first target features; performing feature extraction on the M first target features so as to obtain M second categorical features; selecting a second categorical feature corresponding to each task index from the M second categorical features, and performing regularization corresponding to the task index on the second categorical feature, to obtain a third categorical feature corresponding to the task index; and performing image processing in accordance with M third categorical features so as to obtain M first image processing results of the M first images.

Method and apparatus for processing image

Embodiments of the present disclosure disclose a method and apparatus for processing an image. A specific embodiment of the method includes: acquiring a feature map of a target image, where the target image contains a target object; determining a local feature map of a target size in the feature map; combining features of different channels in the local feature map to obtain a local texture feature map; and obtaining location information of the target object based on the local texture feature map.

Scene recognition method, training method and device based on pyramid attention

The present invention discloses a scene recognition method, a training method and a device based on pyramid attention, belonging to the field of computer vision. The method includes: pyramid layering a color feature map and a depth feature map respectively, calculating the corresponding attention map of each layer; taking the output of the attention of the last layer as the output; taking the attention output of the last layer as the final feature map, for the remaining layers, adding the result after upsampling of the final feature map of an upper layer to the attention output of this layer as the final feature map of this layer; scaling the attention map and the final feature map, using the average of two new attention maps as the final attention map, mapping the largest k position in the final attention map to the final feature map of this layer.

ACTION-MODEL GENERATION APPARATUS AND ACTION-MODEL GENERATION METHOD

A basic-action-model acquisition unit (103) acquires a basic-action model which has been generated by analyzing a state of each of a plurality of parts of a movable object when the movable object performs a basic action, and which is a model for recognizing the basic action, for each of a plurality of the basic actions. An advanced-action-model generation unit (104) combines two or more basic-action models among a plurality of basic-action models, and generates an advanced-action model which is a model for recognizing an advanced action which is an action more complex than the basic action.

ACTION-MODEL GENERATION APPARATUS AND ACTION-MODEL GENERATION METHOD

A basic-action-model acquisition unit (103) acquires a basic-action model which has been generated by analyzing a state of each of a plurality of parts of a movable object when the movable object performs a basic action, and which is a model for recognizing the basic action, for each of a plurality of the basic actions. An advanced-action-model generation unit (104) combines two or more basic-action models among a plurality of basic-action models, and generates an advanced-action model which is a model for recognizing an advanced action which is an action more complex than the basic action.