METHOD AND APPARATUS FOR TRAINING A NEURAL NETWORK CLASSIFIER TO CLASSIFY AN IMAGE DEPICTING ONE OR MORE OBJECTS OF A BIOLOGICAL SAMPLE
20210264130 · 2021-08-26
Assignee
Inventors
Cpc classification
G06V10/454
PHYSICS
International classification
Abstract
The disclosure relates to a method for training a neural network classifier (100) to classify a digital image depicting one or more objects of a biological sample into a specific class, which class belongs to a predefined set of classes (C1-C3), the method comprising: providing a training set of digital images (110a-s) originating 5 from a plurality of biological samples, each digital image (110a) of the training set being labeled with a specific class (C1) of the one or more objects of the digital image (110a), each digital image (110a) of the training set being associated with global data (114a) pertaining to the respective sample, training the neural network (100) using pixel data of each digital image (110a) from the training set of digital 10 images (110a-s) and the global data (114a) associated with said digital image (110a) as input, and using the specific class (C1) of the label of said digital image (110a) as a correct output from the neural network (100). The disclosure further relates to an analyzing apparatus (400).
Claims
1. A method for training a neural network classifier to classify a digital image depicting one or more objects of a biological sample into a specific class, which class belongs to a predefined set of classes, the method comprising: providing a training set of digital images originating from a plurality of biological samples, each digital image of the training set being labeled with a specific class of the one or more objects of the digital image, each digital image of the training set being associated with global data pertaining to the respective sample, training the neural network using image data pertaining to each digital image from the training set of digital images and the global data associated with said digital image as input, and using the specific class of the label of said digital image as a correct output from the neural network.
2. The method according to claim 1, wherein the one or more objects comprise one or more biological cells.
3. The method according to claim 1, wherein the method comprises: for each digital image of the training set of digital images: deriving a first set of input values, wherein each input value pertains to a specific feature of the digital image, deriving, based at least in part on said associated global data, a second set of input values, defining a feature vector comprising said first set of input values and said second set of input values, and the neural network classifier determining, based on the feature vector, a specific class to assign to the digital image.
4. The method according to claim 3, wherein the neural network classifier comprises a decision subnet being adapted to receive the feature vector and perform the step of determining the specific class to assign to the digital image.
5. The method according to claim 1, wherein the global data includes data pertaining to an average color of depicted objects of a set of digital images, wherein each digital image of the set depicts one or more objects of a specific biological sample.
6. The method according to claim 4, wherein the neural network classifier further comprises a feature extraction subnet adapted to receive the digital image of the training set, derive the first set of input values therefrom, and populate the feature vector with said first set of input values.
7. The method according to claim 6, wherein the neural network classifier is a convolutional neural network.
8. The method according to claim 7, wherein said second set of input values of the feature vector is derived by the feature extraction subnet.
9. The method according to claim 7, wherein at least a part of said associated global data is coded into pixel data of the respective digital image of the training set such that said at least a part of the global data is being input to the feature extraction subnet as a part of the digital image.
10. The method according to claim 9, wherein at least a part of said associated global data is coded into a coherent area of the pixel data of the respective digital image.
11. The method according to claim 10, wherein the coherent area of the pixel data is color coded so as to reflect a color information included in the global data.
12. The method according to claim 10, wherein said at least a part of said global data is coded into four coherent areas of the pixel data of the digital image, each coherent area being located in a respective corner of the digital image.
13. The method according to claim 1, wherein the neural network classifier consists of two or more fully connected layers.
14. The method according to claim 1, wherein the global data includes data acquired based on an analysis of at least one digital image depicting a calibration area of a microscope slide.
15. The method according to claim 14, wherein the microscope slide comprises one or more chemical reference patches disposed in the calibration area, wherein said chemical reference patches are arranged to react to substances in contact therewith.
16. The method according to claim 15, wherein each of the one or more chemical reference patches are arranged to react to a respective staining agent used for staining the biological sample.
17. The method according to claim 16, wherein each staining agent is at least one from the list of: hematoxylin, eosin, orange G, eosin Y, light green SF yellowish, Bismarck Brown Y, eosinate, methylene blue, azure B, azure A, and methylene violet.
18. The method according to claim 14, wherein the microscope slide comprises one or more imaging test targets disposed in the calibration area, wherein the data acquired based on said analysis pertains at least to the shape of the one or more imaging test targets.
19. The method according to claim 14, wherein the microscope slide is a calibration microscope slide adapted to be used to calibrate a device for capturing the digital image depicting one or more objects of a biological sample.
20. A method for classifying a digital image depicting one or more objects of a biological sample into a specific class, which class belongs to a predefined set of classes, the method comprising: feeding a digital image depicting one or more objects of a biological sample into a neural network classifier, the neural network classifier being trained for classifying digital images into one of the predefined set of classes using the method according to claim 1, feeding global data associated with the sample into the neural network classifier, and the neural network classifier determining, based on at least said digital image and said global data, a specific class to assign to the digital image.
21. An analyzing device for optical analysis of a biological sample, said analyzing device comprising: optics, including a camera, adapted to capture a digital image depicting one or more objects of a biological sample, a processor adapted to: determine global data pertaining to the sample based on an analysis of the digital image, and/or receiving global data from an external source, associate the digital image with the global data, and perform the method according to claim 14 on the digital image.
22. A computer-readable medium comprising computer code instructions which when executed by a device having processing capability are adapted to perform the method according to claim 1.
Description
BRIEF DESCRIPTIONS OF THE DRAWINGS
[0056] The invention will by way of example be described in more detail with reference to the appended schematic drawings, which shows presently preferred embodiments of the invention.
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
DETAILED DESCRIPTION
[0067] The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which currently preferred embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for thoroughness and completeness, and fully convey the scope of the invention to the skilled person.
[0068] The disclosure relates to methods of training, and using, neural networks for classifying a digital image depicting an object of a biological sample into a specific class. The object may for example be a biological cell, or a cell nucleus. The detailed description will describe the method using a series of example embodiments where different aspects of the method is discussed. The neural networks used in the different embodiments are sometimes different from each other.
[0069]
[0070] The method comprises providing a training set of digital images 110a-s originating from a plurality of biological samples. Each digital image of the training set is labeled with a specific class of the biological cell of the digital image, and each digital image of the training set is associated with global data pertaining to the respective sample. Thus, for the training set of digital images 110a-s used in the example of
[0071] The method further comprises training the neural network 10 using image data pertaining to each digital image 110a from the training set of digital images 110a-s and the global data 114a associated with said digital image 110a as input. In the training, the specific class C1 of the label of said digital image 110a will be used as a correct output from the neural network 10.
[0072] More specifically, for each digital image 110a of the training set of digital images 110a-s, the method comprises deriving a first set of input values F, wherein each input value F pertains to a specific feature of the digital image 110a. As previously mentioned this step may be carried out by the classifier but may, alternatively, be carried out by a preprocessing step.
[0073] The method further comprises the step of deriving, based at least in part on said associated global data 114a, a second set of input values G. The first set of input values F and the second set of input values G then together defines a feature vector 160. Thus, the feature vector will contain both data pertaining to extracted features of the images, and data pertaining to the global data. The neural network classifier 10 then determines, based on the feature vector 160, a specific class C1 to assign to the digital image 110a.
[0074] Training of the neural network is usually carried out using backpropagation. Referring now to
[0075] In the following detailed description, a number of example embodiments will be provided. The first examples discussed with reference to
[0076]
[0077] The feature extraction is carried out by convolution. Convolution preserves the spatial relationship between pixels by learning image features using small matrices of input data. These small matrices of input data are usually referred to as filters, and are slided over the input image and computed the dot product to provide an output matrix usually termed a feature map. Several filters may be used to detect different kind of features from the image. For example, filters may be arranged to sharpen the image, blur the image, detect edges of the image etc. During the training process, the convolutional neural network 100 adjusts the values of the filters as a part of the training process. For a given network architecture, input requested parameters are typically the number of filters and the size of the filters. The convolutional neural network 100 may have several convolutional layers, i.e. layers where a filter is convolved with the input to that layer. The convolutional neural network 100 may further comprise ReLU layers to introduce non-linearity in the data so as to better meet the non-linear nature of real-world data. Often, ReLU is carried out after each convolution step. The convolutional neural network 100 may further comprise pooling layers to perform down sampling of an input thereto. The purpose of the pooling layers is to progressively reduce the size of the input data. Finally, output data from the last layer of the convolutional neural network 100 is concatenated to form a feature vector 160 comprising a first set of input values F. Thus, the feature vector 160 will be populated with at least the first set of input values F derived by the feature extraction subnet 140.
[0078] Now, consider the image 110a shown in
[0079] The neural network classifier 100 further comprises a decision subnet 150 being adapted to receive the feature vector 160 and perform the step of determining the specific class C1 to assign to the digital image 110a. The decision subnet 150 may consist of fully connected layers, such as for example a multilayer perceptron that uses a softmax activation function in the output layer to provide an output vector with probability values Y. The output vector 170 from the decision subnet 150 consists of two or more cells (in the example: three cells), each cell comprising a probability value Y determined by the convolutional neural network 100 as the likelihood for the image belonging to the specific class associated with the cell.
[0080] Global data may alternatively be added after the feature extraction subnet 140. This is exemplified in
[0081] Global data may alternatively be added to an intermediate layer of the feature extraction subnet 140. This is exemplified in
[0082] The examples described so far has been based on using a convolutional neural network 100. However, the method is not limited to convolutional neural networks. One example of a conventional neural network is provided in
[0083] Once the training of the neural network is completed, the neural network may be used for image classification. This is carried out using forward propagation as already mentioned herein. The method for classifying a digital image 110 depicting a biological cell into a specific class, which class belongs to a predefined set of classes C, thus comprises the following steps: Firstly, a digital image 110 depicting a biological cell is fed into a neural network classifier 100, 200, the neural network classifier 100, 200 being trained for classifying digital images 110 into one of the predefined set of classes C using the method for training described hereinabove. Then, global data 116 associated with the sample is fed into the neural network classifier 100, 200, and the neural network classifier 100 determining, based on at least said digital image 110 and said global data 116, a specific class to assign to the digital image.
[0084] In the examples discussed herein, global data includes data pertaining to an average color of depicted biological cells of a set of digital images, wherein each digital image of the set depicts a biological cell from a specific biological sample.
[0085] Global data may be derived in other ways.
[0086] In an alternative embodiment illustrated in
[0087] The global data may pertain to more than an appearance of chemical reference patches. In an alternative embodiment illustrated in
[0088] Specific examples of global data have been provided herein. However, global data may comprise other kind of data acquired in other ways. For example, according to an embodiment, the global data further includes data pertaining to a patient from which the biological sample originates. This may be medical data, such as test results from medical tests previously performed on the patient, the age of the patient, the blood group of the patient, or the BMI of the patient.
[0089] The disclosure further relates to an analyzing device, an example embodiment of which is shown in
[0090] The person skilled in the art realizes that the present invention by no means is limited to the preferred embodiments described above. On the contrary, many modifications and variations are possible within the scope of the appended claims. Additionally, variations to the disclosed embodiments can be understood and effected by the skilled person in practicing the claimed invention, from a study of the drawings, the disclosure, and the appended claims.