Patent classifications
G06V10/454
RADIOMICS-BASED TREATMENT DECISION SUPPORT FOR LUNG CANCER
Two major treatment strategies employed in fighting non-small cell lung cancer (NSCLC) are tyrosine kinase inhibitors (TKIs) and immune checkpoint inhibitors (ICIs). The choice of strategy is based on heterogeneous biomarkers expressed by the lung tumor tissue. A major challenge for molecular testing of these biomarkers is the insufficiency of biopsy specimens from patients with advanced NSCLC. Disclosed herein is a method for predicting a response to immune-checkpoint blockade immunotherapy. The method generally involves imaging the subject with positron emission tomography with 2-deoxy-2-[fluorine-18] fluoro-D-glucose integrated with computed tomography to produce .sup.18F-FDG PET/CT images of the tumor, analyzing the images using PET, CT, and Kulbek Leibler Divergence statistical (KLD) features or, alternatively using deep leaning such as Neural Networks; generating a radiomic signature from the identified features or Network characteristics; and computing a radiomic score based on the radiomic signature that is predictive of responsiveness to ICIs or TKIs.
METHOD AND APPARATUS FOR AUTOMATIC COUGH DETECTION
A method for identifying cough sounds in an audio recording of a subject including: operating at least one electronic processor to identify potential cough sounds in the audio recording; operating the at least one electronic processor to transform one or more of the potential cough sounds into corresponding one or more image representations; operating the at least one electronic processor to apply the one or more image representations to a representation pattern classifier trained to confirm that a potential cough sound is a cough sound or is not a cough sound; and operating the at least one electronic processor to flag one or more of the potential cough sounds as confirmed cough sounds based on an output of the representation pattern classifier.
BEHAVIOR RECOGNITION METHOD AND SYSTEM, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
A behavior recognition method and system, including: dividing video data into a plurality of video clips, performing frame extraction processing on each video clip to obtain frame images, and performing optical flow extraction on the frame images to obtain optical flow images; performing feature extraction on the frame images and the optical flow images to obtain feature maps of the frame images and the optical flow images; performing spatio-temporal convolution processing on the feature maps of the frame images and the optical flow images, and determining a spatial prediction result and a temporal prediction result; fusing the spatial prediction results of all the video clips to obtain a spatial fusion result, and fusing the temporal prediction results of all the video clips to obtain a temporal fusion result; and performing two-stream fusion on the spatial fusion result and the temporal fusion result to obtain a behavior recognition result.
METHOD AND APPARATUS FOR DETECTING OBJECT IN IMAGE
An object detection method performed by an object detection apparatus, includes receiving an input image, obtaining, using an object detection model, a result of detecting a target candidate object from the input image, obtaining, using an error prediction model, a result of detecting an error object from the input image, and detecting a target object in the input image based on the result of detecting the target candidate object and the result of detecting the error object.
Systems and methods for polygon object annotation and a method of training an object annotation system
The present invention relates generally to object annotation, specifically to polygonal annotations of objects. Described are methods of annotating an object including steps of receiving an image depicting an object, generating a set of image features using a CNN encoder implemented on one or more computers, and producing a polygon object annotation via a recurrent decoder or a Graph Neural Network. The recurrent decoder may include a recurrent neural network, a graph neural network or a gated graph neural network. A system for annotating an object and a method of training an object annotation system are also described.
Domain adaptation and fusion using weakly supervised target-irrelevant data
Aspects include receiving a request to perform an image classification task in a target domain. The image classification task includes identifying a feature in images in the target domain. Classification information related to the feature is transferred from a source domain to the target domain. The transferring includes receiving a plurality of pairs of task-irrelevant images that each includes a task-irrelevant image in the source domain and in the target domain. The task-irrelevant image in the source domain has a fixed correspondence to the task-irrelevant image in the target domain. A target neural network is trained to perform the image classification task in the target domain. The training is based on the plurality of pairs of task-irrelevant images. The image classification task is performed in the target domain and includes applying the target neural network to an image in the target domain and outputting an identified feature.
Automatic image-based skin diagnostics using deep learning
There is shown and described a deep learning based system and method for skin diagnostics as well as testing metrics that show that such a deep learning based system outperforms human experts on the task of apparent skin diagnostics. Also shown and described is a system and method of monitoring a skin treatment regime using a deep learning based system and method for skin diagnostics.
Neural network processing for multi-object 3D modeling
Embodiments are directed to neural network processing for multi-object three-dimensional (3D) modeling. An embodiment of a computer-readable storage medium includes executable computer program instructions for obtaining data from multiple cameras, the data including multiple images, and generating a 3D model for 3D imaging based at least in part on the data from the cameras, wherein generating the 3D model includes one or more of performing processing with a first neural network to determine temporal direction based at least in part on motion of one or more objects identified in an image of the multiple images or performing processing with a second neural network to determine semantic content information for an image of the multiple images.
Method and an apparatus for predicting a future state of a biological system, a system and a computer program
An embodiment of a method 100 for predicting a future state of a biological system is provided. The method 100 comprises receiving 101a microscope image depicting the biological system at an associated time and receiving 102 metadata corresponding to the microscope image. The method 100 further comprises extracting 103 features from the microscope image having information on a state of the biological system and using 104 the features and the metadata to predict the future state of the biological system.
Continuous machine learning for extracting description of visual content
Aspects of the present disclosure relate to machine learning techniques for continuous implementation and training of a machine learning system for identifying the natural language meaning of visual content. A computer vision model or other suitable machine learning model can predict whether a given descriptor is associated with the visual content. A set of such models can be used to determine whether particular ones of a set of descriptors are associated with the visual content, with the determined descriptors representing a meaning of the visual content. This meaning can be refined based on a multi-armed bandit tracking and analyzing interactions between the visual content and users associated with certain personas related to the determined descriptors.