Patent classifications
G06V10/806
ASSESSING RISK OF BREAST CANCER RECURRENCE
The subject disclosure presents systems and computer-implemented methods for assessing a risk of cancer recurrence in a patient based on a holistic integration of large amounts of prognostic information for said patient into a single comparative prognostic dataset. A risk classification system may be trained using information from a cohort of training slides from several patients, along with survival data for said patients. For example, a machine-learning-based binary classifier in the risk classification system may be trained using a set of granular image features computed from a plurality of slides corresponding to several cancer patients whose survival information is known and input into the system. The trained classifier may be used to classify image features from one or more test patients into a low-risk or a high-risk group.
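As an illustration of the training and classification flow, here is a minimal scikit-learn sketch on synthetic data; the logistic-regression classifier, the feature dimensionality, and the survival-time cutoff used to derive high/low-risk labels are all assumptions for illustration, not details from the patent.

```python
# Hypothetical sketch: train a binary risk classifier on slide-derived
# image features, labeling patients by a survival-time cutoff.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-ins: one feature vector per patient, aggregated over
# that patient's slides (the patent computes granular image features).
n_patients, n_features = 200, 32
X = rng.normal(size=(n_patients, n_features))
survival_months = rng.exponential(scale=60, size=n_patients)

# Assumed labeling rule: high risk = survival below a cutoff.
CUTOFF_MONTHS = 60
y = (survival_months < CUTOFF_MONTHS).astype(int)  # 1 = high risk

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Classify held-out "test patients" into low-risk (0) / high-risk (1).
print(clf.predict(X_test[:5]))
```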
Memory-Guided Video Object Detection
Systems and methods for detecting objects in a video are provided. A method can include inputting a video comprising a plurality of frames into an interleaved object detection model comprising a plurality of feature extractor networks and a shared memory layer. For each of one or more frames, the method can include selecting one of the plurality of feature extractor networks to analyze the one or more frames, analyzing the one or more frames with the selected feature extractor network to determine one or more features of the one or more frames, determining an updated set of features based at least in part on the one or more features and one or more previously extracted features from a previous frame stored in the shared memory layer, and detecting an object in the one or more frames based at least in part on the updated set of features.
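As an illustration of this interleaved design, the following minimal PyTorch sketch pairs a heavier and a lighter extractor with a GRU cell standing in for the shared memory layer; the fixed run-the-heavy-network-every-N-frames policy, the layer sizes, and the toy classification head are assumptions, not details from the patent.

```python
# Hypothetical sketch of an interleaved detector: a heavy and a light
# feature extractor share a recurrent memory that carries features
# across frames; a simple fixed policy picks the extractor per frame.
import torch
import torch.nn as nn

class InterleavedDetector(nn.Module):
    def __init__(self, feat_dim=64, num_classes=10):
        super().__init__()
        self.heavy = nn.Sequential(nn.Conv2d(3, feat_dim, 3, 2, 1), nn.ReLU(),
                                   nn.Conv2d(feat_dim, feat_dim, 3, 2, 1), nn.ReLU())
        self.light = nn.Sequential(nn.Conv2d(3, feat_dim, 3, 4, 1), nn.ReLU())
        self.memory = nn.GRUCell(feat_dim, feat_dim)   # shared memory layer
        self.head = nn.Linear(feat_dim, num_classes)   # toy detection head

    def forward(self, frames, heavy_every=4):
        state, outputs = None, []
        for t, frame in enumerate(frames):
            net = self.heavy if t % heavy_every == 0 else self.light  # selection
            feats = net(frame).mean(dim=(2, 3))        # pool to a feature vector
            state = self.memory(feats, state)          # fuse with stored features
            outputs.append(self.head(state))           # detect from updated features
        return outputs

frames = [torch.randn(1, 3, 64, 64) for _ in range(8)]
outs = InterleavedDetector()(frames)
print(len(outs), outs[0].shape)
```

A real detector head would predict boxes and classes per spatial location rather than one vector per frame; the sketch only shows how the shared memory fuses current and previously extracted features.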
DEEP LEARNING BASED AUXILIARY DIAGNOSIS SYSTEM FOR EARLY GASTROINTESTINAL CANCER AND INSPECTION DEVICE
A deep learning-based examination and diagnosis assistance system and apparatus for early digestive tract cancer, comprising a feature extraction network, an image classification model, an endoscope classifier, and an early cancer recognition model. The feature extraction network is used to perform initial feature extraction on endoscope images based on a neural network model; the image classification model is used to perform further feature extraction on the initial features to acquire image classification features; the endoscope classifier is used to perform feature extraction on the initial features to acquire endoscope classification features and to classify gastroscope/colonoscope images; the early cancer recognition model is used to splice (concatenate) the initial features, the endoscope classification features, and the image classification features to acquire the probability of early cancer lesions in white light images, electronic dye images, or chemical dye images of a corresponding site, or to acquire a flushing prompt or position recognition prompt for the corresponding site.
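A minimal PyTorch sketch of the branch-and-splice layout described above; the layer shapes, the pooling backbone, and treating "splicing" as channel-wise concatenation are assumptions for illustration.

```python
# Hypothetical sketch of the multi-branch layout: a shared backbone
# produces initial features; two heads produce image-classification and
# endoscope-classification features; all three feature sets are
# concatenated ("spliced") and fed to an early-cancer recognition head.
import torch
import torch.nn as nn

class AuxDiagnosisNet(nn.Module):
    def __init__(self, dim=64):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(3, dim, 3, 2, 1), nn.ReLU(),
                                      nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.image_cls = nn.Linear(dim, dim)       # image-classification features
        self.scope_cls = nn.Linear(dim, dim)       # endoscope-classification features
        self.scope_out = nn.Linear(dim, 2)         # gastroscope vs. colonoscope
        self.early_cancer = nn.Linear(3 * dim, 1)  # fused early-cancer head

    def forward(self, x):
        init = self.backbone(x)                        # initial feature extraction
        img_feat = torch.relu(self.image_cls(init))
        scope_feat = torch.relu(self.scope_cls(init))
        scope_logits = self.scope_out(scope_feat)
        fused = torch.cat([init, scope_feat, img_feat], dim=1)  # splice features
        p_lesion = torch.sigmoid(self.early_cancer(fused))      # lesion probability
        return scope_logits, p_lesion

scope_logits, p = AuxDiagnosisNet()(torch.randn(2, 3, 128, 128))
print(scope_logits.shape, p.shape)
```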
IMAGE ENHANCEMENT METHOD AND APPARATUS
This application relates to image enhancement technology in the field of computer vision within artificial intelligence, and provides an image enhancement method and apparatus. The method includes: adjusting a pixel value of a to-be-processed image to obtain K images, where the pixel values of the K images differ from one another and K is a positive integer greater than 1; extracting local features of the K images; extracting a global feature of the to-be-processed image; and performing image enhancement processing on the to-be-processed image based on the global feature and the local features, to obtain an image-enhanced output image. This method helps improve the effect of image enhancement processing.
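As a sketch of the K-image pipeline, the following PyTorch module makes K gain-adjusted copies, extracts local features from each, pools a global feature from the original, and fuses everything into a residual enhancement; the gain range, feature sizes, and residual formulation are assumed, not taken from the patent.

```python
# Hypothetical sketch: create K exposure-shifted copies, extract local
# features from each copy and a global feature from the original, then
# predict a per-pixel enhancement from the fused features.
import torch
import torch.nn as nn

class KExposureEnhancer(nn.Module):
    def __init__(self, K=3, dim=16):
        super().__init__()
        self.K = K
        self.local = nn.Conv2d(3, dim, 3, padding=1)        # local features
        self.glob = nn.Sequential(nn.Conv2d(3, dim, 3, 2, 1),
                                  nn.AdaptiveAvgPool2d(1))  # global feature
        self.fuse = nn.Conv2d(dim * K + dim, 3, 1)          # enhancement head

    def forward(self, x):
        # Adjust pixel values to obtain K images with different exposures.
        gains = torch.linspace(0.5, 1.5, self.K)
        locals_ = [self.local((x * g).clamp(0, 1)) for g in gains]
        g = self.glob(x).expand(-1, -1, x.shape[2], x.shape[3])
        fused = torch.cat(locals_ + [g], dim=1)
        return (x + self.fuse(fused)).clamp(0, 1)   # residual enhancement

out = KExposureEnhancer()(torch.rand(1, 3, 64, 64))
print(out.shape)
```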
OPTICAL SYSTEM AND OPTICAL IMAGE PROCESSING METHOD USING IMAGE RESTORATION
Disclosed is an optical system using image restoration, including a light source, a pinhole, a testing platform, an image sensor, and an image processing device. The pinhole is disposed on a light transmission path of the light source. The testing platform is disposed on the light transmission path of the light source, and the pinhole is located between the light source and the testing platform. The testing platform is used to place a testing sample. The image sensor is disposed below the testing platform and is used to sense the testing sample so as to output an optical diffraction signal. The image processing device is electrically connected to the image sensor and is used to perform signal processing and optical signal recognition on the optical diffraction signal of the testing sample so as to obtain a clear image of the testing sample.
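The abstract does not specify the restoration algorithm; a common choice for recovering a clear image from a lensless diffraction pattern is angular-spectrum back-propagation, sketched below. The wavelength, propagation distance, and pixel pitch are assumed values, and `backpropagate` is a hypothetical helper, not a function from the patent.

```python
# Hypothetical sketch: numerically propagate a recorded diffraction
# intensity back to the sample plane with the angular-spectrum method.
import numpy as np

def backpropagate(hologram, wavelength=532e-9, z=-1e-3, pixel=2e-6):
    n, m = hologram.shape
    fx = np.fft.fftfreq(m, d=pixel)          # spatial frequencies (x)
    fy = np.fft.fftfreq(n, d=pixel)          # spatial frequencies (y)
    FX, FY = np.meshgrid(fx, fy)
    arg = 1.0 / wavelength**2 - FX**2 - FY**2
    kz = 2j * np.pi * np.sqrt(np.maximum(arg, 0))  # evanescent terms dropped
    # Treat the square root of the intensity as the field amplitude.
    field = np.fft.ifft2(np.fft.fft2(np.sqrt(hologram)) * np.exp(kz * z))
    return np.abs(field)

restored = backpropagate(np.random.rand(256, 256))
print(restored.shape)
```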
Object height estimation from monocular images
Systems and methods for estimating the height of an object from a monocular image are described herein. Objects are detected in the image, each object being indicated by a region of interest. The image is then cropped for each region of interest, and each cropped image is scaled to a predetermined size. The cropped and scaled image is then input into a convolutional neural network (CNN), the output of which is an estimated height for the object. The height may be represented by the mean of a probability distribution over possible sizes, together with a standard deviation and a level of confidence. A location of the object may be determined based on the estimated height and the region of interest. A ground truth dataset for training the CNN may be generated by capturing a LIDAR sequence simultaneously with a monocular image sequence.
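A minimal PyTorch sketch of the crop-scale-regress step; the network depth, the fixed crop size, and parameterizing the output as (mean, log-std, confidence logit) are assumptions consistent with, but not specified by, the abstract.

```python
# Hypothetical sketch of the per-object height head: crop each region
# of interest, resize to a fixed size, and regress a height mean, a
# standard deviation, and a confidence score with a small CNN.
import torch
import torch.nn as nn
import torch.nn.functional as F

class HeightCNN(nn.Module):
    def __init__(self, dim=32):
        super().__init__()
        self.features = nn.Sequential(nn.Conv2d(3, dim, 3, 2, 1), nn.ReLU(),
                                      nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.out = nn.Linear(dim, 3)  # mean, log-std, confidence logit

    def forward(self, crop):
        mean, log_std, conf = self.out(self.features(crop)).unbind(dim=1)
        return mean, log_std.exp(), torch.sigmoid(conf)

def estimate_height(image, roi, size=64):
    x0, y0, x1, y1 = roi                           # region of interest (pixels)
    crop = image[:, :, y0:y1, x0:x1]               # crop the detected object
    crop = F.interpolate(crop, size=(size, size))  # scale to a fixed size
    return HeightCNN()(crop)

img = torch.rand(1, 3, 480, 640)
mean, std, conf = estimate_height(img, (100, 50, 200, 300))
print(float(mean), float(std), float(conf))
```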
Semantically-aware image-based visual localization
A method, apparatus, and system for visual localization include extracting appearance features of an image, extracting semantic features of the image, fusing the extracted appearance and semantic features, pooling and projecting the fused features into a semantic embedding space trained using fused appearance and semantic features of images having known locations, computing a similarity measure between the projected fused features and the embedded fused features of those images, and predicting the location of the image associated with the projected fused features. An image can include at least one image from a plurality of modalities, such as a Light Detection and Ranging image, a Radio Detection and Ranging image, or a 3D Computer Aided Design modeling image, together with an image from a different sensor, such as an RGB image sensor, captured from the same geo-location, which is used to determine the semantic features of the multi-modal image.
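The retrieval step can be sketched as follows in PyTorch: fused appearance and semantic features are pooled, projected, and L2-normalized, and the query's location is predicted as that of its most similar database embedding. The two convolutional stand-ins, the embedding size, and the cosine-similarity nearest neighbor are assumptions.

```python
# Hypothetical sketch: fuse appearance and semantic features, project
# into an embedding space, and predict the query's location as that of
# the most similar known-location embedding.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FusedEmbedder(nn.Module):
    def __init__(self, dim=32):
        super().__init__()
        self.appearance = nn.Conv2d(3, dim, 3, 2, 1)  # appearance features
        self.semantic = nn.Conv2d(3, dim, 3, 2, 1)    # semantic features (stand-in)
        self.project = nn.Linear(2 * dim, dim)        # pooling + projection

    def forward(self, img):
        fused = torch.cat([self.appearance(img), self.semantic(img)], dim=1)
        pooled = fused.mean(dim=(2, 3))               # global pooling
        return F.normalize(self.project(pooled), dim=1)

model = FusedEmbedder()
db_embeddings = F.normalize(torch.randn(100, 32), dim=1)  # known-location images
db_locations = torch.rand(100, 2)                         # lat/lon stand-ins

query = model(torch.rand(1, 3, 128, 128))
similarity = query @ db_embeddings.T                      # cosine similarity
print(db_locations[similarity.argmax()])                  # predicted location
```

In practice the semantic branch would consume a segmentation output rather than raw pixels; a plain convolution is used here only to keep the sketch self-contained.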
IMAGE PROCESSING METHOD AND APPARATUS, AND ELECTRONIC DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM
An image processing method includes: performing first segmentation processing on an image to be processed, and determining a segmentation region of a target in said image (S11); determining, according to the position of the center point of the segmentation region of the target, an image region where the target is located (S12); and performing second segmentation processing on the image region where each target is located, and determining the segmentation result of the target in said image (S13).
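As one concrete reading of this two-stage flow, here is a minimal Python sketch: the coarse segmenter (S11), the fine segmenter (S13), and the window size are trivial stand-ins, since the patent does not specify them.

```python
# Hypothetical sketch of the two-stage flow: a coarse mask (S11) yields
# per-target center points (S12); a local window around each center is
# re-segmented for the final result (S13).
import numpy as np
from scipy import ndimage

def coarse_segment(image):
    return image > image.mean()               # stand-in first segmentation

def fine_segment(patch):
    return patch > np.percentile(patch, 75)   # stand-in second segmentation

def two_stage(image, window=32):
    labels, n = ndimage.label(coarse_segment(image))
    results = []
    # Center point of each target's segmentation region (S12).
    for cy, cx in ndimage.center_of_mass(labels > 0, labels, range(1, n + 1)):
        y0, x0 = max(int(cy) - window, 0), max(int(cx) - window, 0)
        region = image[y0:y0 + 2 * window, x0:x0 + 2 * window]  # target's region
        results.append(((y0, x0), fine_segment(region)))        # final result (S13)
    return results

print(len(two_stage(np.random.rand(256, 256))))
```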
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING PROGRAM
An information processing apparatus comprising: at least one first processor configured to carry out a first process on data input from at least one sensor to produce first processed data; a selector configured to select, according to a first predetermined condition, at least one of a plurality of second processes; and at least one second processor configured to receive the first processed data from the at least one first processor and to carry out the selected at least one of the plurality of second processes on the first processed data to produce second processed data, each of the plurality of second processes having a lower processing load than the first process.
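A minimal Python sketch of the staged pipeline: one heavy first process followed by a selector that routes its output to one of several lighter second processes. The FFT first stage, the two toy second processes, and the energy-based condition are placeholders, not details from the abstract.

```python
# Hypothetical sketch: a heavy first process, then a selector that picks
# a lighter second process according to a predetermined condition.
import numpy as np

def first_process(sensor_data):
    # Heavy stage, e.g. full spectral analysis of the raw sensor data.
    return np.fft.rfft(sensor_data)

SECOND_PROCESSES = {
    "summarize": lambda x: float(np.abs(x).mean()),  # cheap aggregate
    "peak": lambda x: int(np.abs(x).argmax()),       # cheap peak pick
}

def selector(first_out):
    # Predetermined condition (assumed): route by spectral energy.
    return "peak" if np.abs(first_out).max() > 10 else "summarize"

sensor_data = np.random.randn(1024)
first_out = first_process(sensor_data)                        # first processed data
second_out = SECOND_PROCESSES[selector(first_out)](first_out) # second processed data
print(second_out)
```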
GAZE POINT ESTIMATION METHOD, DEVICE, AND ELECTRONIC DEVICE
The present application provides a gaze point estimation method, device, and electronic device. The method includes: acquiring user image data, the user image data including a facial image, a left-eye image, a right-eye image, and position data; acquiring a facial feature vector according to a preset first convolutional neural network and the facial image; acquiring a position feature vector according to a preset first fully connected network and the position data; acquiring a binocular fusion feature vector according to a preset eye feature fusion network, the left-eye image, and the right-eye image; and acquiring position information about a gaze point of a user according to a preset second fully connected network, the facial feature vector, the position feature vector, and the binocular fusion feature vector. In this technical solution, the relation between eye images and face images is utilized to achieve accurate gaze point estimation.
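As an illustration, a minimal PyTorch sketch of the four-branch layout: a face CNN, a position fully connected network, a shared per-eye encoder whose outputs are fused, and a final fully connected head that outputs a 2-D gaze point. All layer sizes, and the choice to share one encoder across both eyes, are assumptions.

```python
# Hypothetical sketch of the gaze network: face, position, and fused
# binocular features are concatenated and mapped to a gaze point.
import torch
import torch.nn as nn

class GazeNet(nn.Module):
    def __init__(self, dim=32):
        super().__init__()
        conv = lambda: nn.Sequential(nn.Conv2d(3, dim, 3, 2, 1), nn.ReLU(),
                                     nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.face_cnn = conv()                   # first convolutional network
        self.pos_fc = nn.Linear(4, dim)          # first fully connected network
        self.eye_cnn = conv()                    # shared per-eye encoder
        self.eye_fuse = nn.Linear(2 * dim, dim)  # eye feature fusion network
        self.head = nn.Linear(3 * dim, 2)        # second FC network -> (x, y)

    def forward(self, face, left_eye, right_eye, position):
        eyes = torch.cat([self.eye_cnn(left_eye), self.eye_cnn(right_eye)], dim=1)
        feats = torch.cat([self.face_cnn(face),
                           torch.relu(self.pos_fc(position)),
                           torch.relu(self.eye_fuse(eyes))], dim=1)
        return self.head(feats)                  # gaze point position

gaze = GazeNet()(torch.rand(1, 3, 96, 96), torch.rand(1, 3, 32, 32),
                 torch.rand(1, 3, 32, 32), torch.rand(1, 4))
print(gaze)
```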