G06V10/806

METHOD AND DEVICE FOR DETECTING AN OBJECT IN AN IMAGE

A method for detecting an object in an image includes: obtaining an image to be detected; generating a plurality of feature maps based on the image to be detected by a plurality of feature extracting networks in a neural network model trained for object detection, in which the plurality of feature extracting networks are connected sequentially, and input data of a latter feature extracting network in the plurality of feature extracting networks is based on output data and input data of a previous feature extracting network; and generating an object detection result based on the plurality of feature maps by an object detecting network in the neural network model.

Systems and methods for predicting crop size and yield

A computer system obtains, using a camera, a first plurality of images of a canopy an agricultural plot. For each respective fruit of a plurality of fruit growing in the agricultural plot, the computer system identifies a first respective image in the first plurality of images that comprises the respective fruit. The first respective image has a corresponding first camera location. The computer system also identifies a second respective image in the first plurality of images that comprises the respective fruit. The second respective image has a corresponding second camera location. The computer system uses at least i) the first and second respective images and ii) a distance between the first and second camera locations to determine a corresponding size of the respective fruit.

Product defect detection method and apparatus, electronic device and storage medium

A product defect detection method and apparatus, an electronic device, and a storage medium are provided. A method includes: acquiring a multi-channel image of a target product; inputting the multi-channel image to a defect detection model, wherein the defect detection model includes a plurality of convolutional branches, a merging module and a convolutional headbranch; performing feature extraction on each channel in the multi-channel image by using the plurality of convolutional branches, to obtain a plurality of first characteristic information; merging the plurality of first characteristic information by using the merging module, to obtain second characteristic information; performing feature extraction on the second characteristic information by using the convolutional headbranch, to obtain third characteristic information to be output by the defect detection model; and determining defect information of the target product based on the third characteristic information.

Vehicle information detection method, electronic device and storage medium

A vehicle information detection method, an electronic device and a storage medium are provided, and relates to the technical field of artificial intelligence, in particular to the technical field of computer vision and deep learning. The method includes: determining a bird's-eye view of a target vehicle based on an image of the target vehicle; performing feature extraction on the image of the target vehicle and the bird's-eye view respectively, to obtain first feature information corresponding to the image of the target vehicle and second feature information corresponding to the bird's-eye view of the target vehicle; and determining three-dimensional information of the target vehicle based on the first feature information and the second feature information. According to embodiments of the disclosure, accurate detection of vehicle information can be realized based on a monocular image.

Automatic image annotations

A computer-implemented method for annotating images is disclosed. The computer-implemented method includes generating a saliency map corresponding to an input image, wherein the input image is an image that requires annotation, generating a behavior saliency map, wherein the behavior saliency map is a saliency map formed from an average of a plurality of objects contained within respective bounding boxes of a plurality of sample images, generating a historical saliency map, wherein the historical saliency map is a saliency map formed from an average of a plurality of tagged objects in the plurality of sample images, fusing the saliency map corresponding to the input image, the behavior saliency map, and the historical saliency map to form a fused saliency map, and generating, based on the fused saliency map, a bounding box around an object in the input image.

Image Description Method and Apparatus, Computing Device, and Storage Medium
20220351487 · 2022-11-03 ·

Disclosed is an image description method and apparatus, a computing device and a storage medium, an example method includes: performing feature extraction on a target image with a plurality of first feature extraction models to obtain image features generated by each of the first feature extraction models; performing fusion processing on the image features generated by the plurality of first feature extraction models to generate global image features corresponding to the target image; performing feature extraction on the target image with a second feature extraction model to obtain target detection features corresponding to the target image; inputting the global image features corresponding to the target image and the target detection features corresponding to the target image into a translation model to generate a translation sentence, and taking the translation sentence as a description sentence of the target image.

Electronic apparatus and method of controlling the same
11487975 · 2022-11-01 · ·

Disclosed is an electronic apparatus comprising, a memory configured to store instructions; and at least one processor connected to the memory, and configured to detect at least one object of a first-class object or a second-class object included in a target image by the electronic apparatus using an artificial intelligent algorithm to apply the target image to a learned neural network model, and identify and apply an image-quality processing method to be individually applied to at least one detected object, the neural network model is set to detect an object included in an image, as trained based on learning data such as an image, a class to which the image belongs, information about the first-class object included in the image, and information about the second-class object included in the image.

NEURAL NETWORK CONSTRUCTION METHOD AND APPARATUS
20230089380 · 2023-03-23 ·

A neural network construction method and apparatus in the field of artificial intelligence, to accurately and efficiently construct a target neural network. The constructed target neural network has high output accuracy, may be further applied to different application scenarios, and has a strong generalization capability. The method includes: obtaining a start point network, where the start point network includes a plurality of serial subnets; performing at least one time of transformation on the start point network based on a preset first search space to obtain a serial network, where the first search space includes a range of parameters used for transforming the start point network; and if the serial network meets a preset condition, training the serial network by using a preset dataset to obtain a trained serial network; and if the trained serial network meets a termination condition, obtaining a target neural network based on the trained serial network.

IMAGE PROCESSING METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT

An image processing method, including: obtaining an original image, the original image including biological features of a to-be-detected object, fusing a global frequency domain map corresponding to the original image with the original image to obtain a global fusion image, performing living body detection on the global fusion image to obtain a first detection result, determining that the original image does not pass the living body detection when the first detection result indicates that the original image belongs to a screen reproduced image, obtaining, when the original image does not belong to the screen reproduced image, a biological feature image based on the biological features, fusing a local frequency domain map with the biological feature image to obtain a local fusion image, performing living body detection on the local fusion image to obtain a second detection result, and determining a living body detection result corresponding to the original image.

FACE DETECTION DEVICE, METHOD AND FACE UNLOCK SYSTEM
20220351491 · 2022-11-03 ·

A face detection device based on a convolutional neural network is provided. The device includes a feature extractor assembly and a detector assembly. The feature extractor assembly includes a first feature extractor, a second feature extractor and a third feature extractor. The first feature extractor is used to apply a first set of convolution kernels on an input grayscale image thereby generate a set of basic feature maps. The second feature extractor is used to apply a second set of convolution kernels on the set of basic feature maps and thereby generate more than one set of intermediate feature maps, which are concatenated. The third feature extractor is used to perform at least one convolution operation on a concatenated layer. The detector assembly includes at least one detector whose input is derived from one of the second feature extractor and the third feature extractor.