A SYSTEM AND METHOD THEREOF FOR REAL-TIME AUTOMATIC LABEL-FREE HOLOGRAPHY-ACTIVATED SORTING OF CELLS
20230258554 · 2023-08-17
Inventors
- Natan Tzvi Shaked (Mazkeret Batya, IL)
- Itay Barnea (Petach Tikva, IL)
- Matan DUDAIE (Giv'atayim, IL)
- Noga NISSIM (Ramat-Hasharon, IL)
- Michael KIRSCHBAUM (Nuthetal, DE)
- Marten Tobias GERLING (Potsdam, DE)
Cpc classification
G03H2001/005
PHYSICS
G03H1/0866
PHYSICS
B01L2200/148
PERFORMING OPERATIONS; TRANSPORTING
B01L3/502715
PERFORMING OPERATIONS; TRANSPORTING
B01L2200/0652
PERFORMING OPERATIONS; TRANSPORTING
G06V20/52
PHYSICS
G01N2015/1454
PHYSICS
G03H1/0443
PHYSICS
B01L2300/0864
PERFORMING OPERATIONS; TRANSPORTING
B01L3/502761
PERFORMING OPERATIONS; TRANSPORTING
International classification
Abstract
The present invention relates to an automatic real-time label-free holography-activated sorting of the cell's technique. The technique provides high-discriminative power on the level of the individual cell. The technique includes rapid automated cell processing during cell visualization and flow, with high discriminative power on the level of the individual cell. The technique may be useful in detection of cancer and to identify different stages of oncogenesis.
Claims
1. A method comprising: performing a holographic imaging of a flow of a heterogeneous population of cells to enable label-free quantitative imaging of the flow of cells; automatically processing image data of the holographic imaging to identify a certain type of cells during the flow; and automatically sorting the certain type of cells during flow, thereby obtaining a real-time, automatic, label-free holography-activated sorting of the cells.
2. The method of claim 1, wherein performing a holographic imaging of the flow of cells comprises at least one of (i) performing a digital holographic microscopy and quantitative phase microscopy to measure the quantitative phase profile of the cell being indicative of the optical path delay (OPD) profile of the cell to enable label-free interferometric phase microscopy or (ii) obtaining a plurality of off-axis holograms and performing a quantitative phase reconstruction process.
3. The method of claim 2, wherein automatically processing image data comprises at least one of (i) reconstructing the OPD map for each cell individually; (ii) extracting from each OPD 2D and 3D morphological and quantitative features.
4. (canceled)
5. The method of claim 34, wherein automatically processing image data comprises performing classification based on 2D and 3D morphological quantitative features of the cells during the cell flow.
6. The method of claim 5, wherein automatically processing image data comprises at least one of (i) performing a real-time classification of each cell or (ii) performing classification of unlabeled cancer cells in blood to enable label-free imaging and sorting of cancer cells in blood.
7. The method of claim 6, wherein classifying the cells comprises performing machine learning.
8. The method of claim 6, wherein automatically processing image data comprises performing a sequence of classification, wherein each classification is capable of identifying different types of cells.
9. (canceled)
10. The method of claim 6, further comprising automatically classifying primary and metastatic cancer cells.
11. The method of claim 1, wherein performing a holographic imaging of the flow of cells comprises acquiring at least one single-cell hologram during flow.
12. (canceled)
13. The method of claim 1, wherein automatically sorting the certain type of cells comprises at least one of (i) isolating at least one certain type of cells from other cells in the flow or (ii) automatically sorting the certain type of cells during or following cell visualization.
14. (canceled)
15. The method of claim 1, further comprising at least one of (i) counting cells; or (ii) identifying in the certain type of cells at least one of DNA, RNA, protein or any other metabolite to provide a genetic metabolic profile of a patient.
16. (canceled)
17. The method of claim 15, further comprising analyzing the genetic metabolic profile to enable at least one of a diagnosis or an optimization of a treatment of the patient.
18. A system comprising: a holographic imaging module being configured and operable to image a flow of heterogeneous population of cell; a cell sorting module being configured and operable to sort the flow of cells; and a control unit being configured and operable to receive from said holographic imaging module image data being indicative of the flow of cells; automatically process the image data and identify a certain type of cells during the flow and upon identification of a certain type of cells activating said cell sorting module to enable real-time, automatic, label-free holography-activated sorting of the cells.
19. The system of claim 18, wherein said control unit is configured and operable to at least one of (i) process digital holograms of the cells dynamically and classify the cells during the flow or (ii) calculate an OPD map using a database and extract a plurality of features based on the image phase and amplitude.
20. The system of claim 19, wherein said control unit is configured and operable to classify the cells by using machine learning.
21. (canceled)
22. The system of claim 18, wherein said cell sorting microfluidic module is placed inside the holographic imaging module such that the cell sorting module is viewed through the holographic imaging module.
23. The system of claim 18, wherein said cell sorting module comprises a dielectrophoretic microfluidic module comprises an array of spaced-apart electrodes, when activating said dielectrophoretic microfluidic module comprises operating at least one electrode by alternatively switching on or off one or more relevant electrodes to direct the cells of interest.
24. The system of claim 23, wherein said array of spaced-apart electrodes are positioned on both sides of the dielectrophoretic microfluidic module to define a sorting trajectory for the flow of cells along the dielectrophoretic microfluidic module.
25. The system of claim 23, wherein at least one electrode of said plurality of electrodes is configured and operable to at least one of the following: center and direct the cells along the sorting trajectory into an imaging field of view and a holographic region of interest, push the cells to either side of the dielectrophoretic microfluidic module, increase the distance between cell streams.
26. The system of claim 18, wherein said holographic imaging module comprises a high- or low-coherence off-axis interferometric phase microscope and a microfluidic channel to quantitatively image cells during flow.
27. The system of claim 18, further comprising at least one (i) at least one container for collecting sorted-out cells; or (ii) a plurality of microfluidics pumps.
28. (canceled)
29. The system of claim 27, wherein at least one microfluidics pump of said plurality of microfluidics pumps is configured and operable to direct the sorted-out cells towards said at least one container.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0072] In order to better understand the subject matter that is disclosed herein and to exemplify how it may be carried out in practice, embodiments will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
DETAILED DESCRIPTION OF EMBODIMENTS
[0086]
[0087]
[0088] Automatically processing image data in 204 may comprise performing a sequence of classification, wherein each classification is capable of identifying another type of cells. Automatically sorting the certain type of cells in 206 may comprise isolating at least one certain type of cells from other cells in the flow.
[0089] The technique of the present invention is not limited to any type of cells to be sorted. The cells may include blood cells, cancer cells, stem cells. For example, the technique of the present invention may be used for the classification of unlabeled cancer cells in blood to enable label-free imaging and sorting of cancer cells in blood. Reference is made to
[0090] Reference is made to
[0091] Reference is made to
[0092] In this specific and non-limiting example, a fluorescence module (in the dashed box) wascombined to verify the label-free classification accuracy. Laser-driven light source (LDLS) is used as the fluorescence excitation source, and a CCD camera is used to record the fluorescence images. Low pass filter (LPF) prevents the fluorescence excitation light from getting to the CMOS camera used for holography. The epi-fluorescence system is composed of a laser-driven light source (Energetic, EQ-99 LDLS) and comprises a 4f system to collimate the light, a fluorescence filter cube (Zeiss, filter set 38) to fit to use with green fluorescence protein (GFP), and a digital CCD camera (Zeiss, AxioCam MRm). The fluorescence imaging system was built to have an external validation for the cell label-free classification results, where only the cancer cells emitted fluorescence. The fluorescence excitation light is focused by the objective to a spot in the FOV, located just before the sorting electrodes. Since the emission light is split by the second beam-splitter, an LPF (BrightLine, FF01-496\LP-25) was placed to remove the excitation light and receive the fluorescence emission light on the camera used for the holographic imaging module.
[0093] At the end of the process all the CTCs are concentrated in a single reservoir and may be analyzed genetically by a next generation PCR to provide patients with the most beneficial treatment for their disease. During the experiments, the following techniques were used: (1) preparation of blood sample using a filtration kit to remove small sized cells; (2) microfluidics system including pumps, microchannel and electrical components for directing the cells to designated reservoirs; (3) a holographic imaging module imaging cells (e.g. IPM); (4) a database that include geometrical and optical parameters for the recognition of different subsets of cells; (5) a control unit being configured and operable to obtain the captured pictures from the holographic imaging module, analyze the pictures for the formation of an OPD map using the database and activate the electrical components in the microfluidic system thus directing cells of interest to a reservoir. Later, the cells in the reservoir can be analyzed genetically.
[0094] In some embodiments, the system further comprising at least one communicable and readable database storing instructions which, when executed by at least one data processor, result in operations comprising: training a machine learning model to identify a certain type of cells in the communicable and readable database in order to generate information data being indicative of at least one cell; and, after the step of training, real time identify the cells by means of the trained machine learning model.
[0095] In some embodiments, control unit 406 comprises a data input utility including a communication module for receiving image data being indicative of the flow of cells, an optional data output utility for generating data relating to identified cell(s), a memory (i.e. non-volatile computer readable medium) for storing database i.e. preselected data indicative of different OPD maps, and a data processing utility adapted for identifying a certain type of cells during the flow. Data processing utility may operate as a classifier or may comprise a classifier module. Memory may be relayed via wireless or wired connection by an external unit to a central database. The database may be implemented with Microsoft Access, Cybase, Oracle, or other suitable commercial database systems.
[0096] In some embodiments control unit 406 is configured in a cloud-based configuration and/or utilize Internet based computing so that parts of processing utility, and/or memory may reside in multiple distinct geographic locations. Upon identification of certain cells, the data processing utility sends signals to the DEP module 430 to direct the cells along a certain trajectory and sort them. Data processing utility may transmit data regarding the activation of the DEP module via the data output utility, via a data communication (e.g. via cellular network) to a communication module of a central computer. The data processing utility may record the received image data in database in memory and/or may query/cross-reference the received image data with OPD data in the database to identify if the cell is a cell of interest. To this end, the preselected data stored in a database may be used to compare the image data with the OPD maps previously used for identifying cells and stored in the learning database. The memory mat thus be configured for storing a learning database i.e. preselected data indicative of cells correlated with OPD maps. The correspondence between the different OPD maps and the different type of cells may be predetermined. For example, a table of correspondence between the different OPD maps and the different type of cells may be stored in a database. Such table may be stored in the memory. Alternatively, storage may be separate from the server(s) (e.g. SAN storage). If separate, the location(s) of the storage may be in one physical location, or in multiple locations and connected through any type of wired or wireless communication infrastructure. The database may rely on any kind of methodology or platform for storing digital data. The database may include for example, traditional SQL databases such as Oracle and MS SQL Server, file systems, Big Data, NoSQL, in-memory database appliances, parallel computing (e.g. Hadoop clusters), etc. If memory is configured as the storage medium of the database, it may include any standard or proprietary storage medium, such as magnetic disks or tape, optical storage, semiconductor storage, etc.
[0097] The inventors conducted experiments as follows: a blood sample is taken from a cancer patient, and undergo CTC enrichment by filtration (e.g. through a ScreenCell Cyto kit), containing a microporous membrane of 6.5-8 μm pores. This commercial kit captures CTCs and removes smaller cells like erythrocytes and most nucleated blood cells. From each 1 mL of blood, this preliminary filtering results in 11,000 white blood cells and 1-10 CTCs within approximately 3 minutes, with an average CTC recovery rate of more than 90%. This enhanced blood sample is then diluted with 10% Nycodenz, a chemical used to increase the buffer density to help the cells flow smoother. The sample is poured into the DEP microfluidic module of and undergo the final sorting process: As illustrated below, the DEP microfluidic module may include a flow chamber containing several twin-electrodes that create between them an altering electrical field, to deflect (negative-DEP) the cells from the electrodes and direct them inside the dielectrophoretic microfluidic module during flow. The field is created through inducing altering voltage on the electrodes using the control unit (e.g. a computer-controlled generator). The voltage is of 3 Vpp and 1 MHz. All the cells are directed to the same region of interest (ROI), there their hologram is captured. If a cancer cell is detected, the control unit (e.g. a computer-based decision) activates the electrodes and deflect the cancer cell to a reservoir of CTCs, away from the rest of the non-cancer cells. Each cell of the parallel flow is directed to its own outlet port to be collected.
[0098] Three types of cancer cells and four types of blood cells were imaged and analyzed. For cancer cells, HT29-GFP cells were used. HT29-GFP are colon adenocarcinoma cells that have been transfected by adenovirus vector to express GFP and neomycin resistance gene. A stable clone was produced by growing the cells with medium supplemented by 600 μg ml.sup.−1 G418 (Sigma, SN. A1720). The other two cancer cell types are a pair of isogenic cancer cell lines: colon adenocarcinoma, SW-480 (CCL-228), and metastatic stage of colon adenocarcinoma from the lymph node, SW-620 (CCL-227). The growth medium used for the cancer cells was Dulbecco's Modified Eagle's Medium (DMEM) (BI, SN. 01-55-1A) supplemented with 10% fetal bovine serum (FBS) (BI, SN. 04-007-1A), 4 mM L-Glutamin (BI, SN. 03-020-1B) and 1% antibiotics (BI, SN. 03-033-1B). The cell lines were incubated under standard cell culture conditions at 37° C. and 5% CO.sub.2 in a humidified incubator until 80% confluence was achieved. Blood was used to isolate four types of blood cells: erythrocytes, lymphocytes, monocytes, and granulocytes. A dilution medium composed of phosphate-buffered saline (PBS) (BI, SN. 02-023-1A) was prepared supplemented with 2 m
[0099] To extract the quantitative phase maps from the acquired off-axis image holograms, the off-axis interferometry Fourier-based algorithm was used, including a digital 2-D Fourier transform, filtering one of the cross-correlation terms, and an inverse 2-D Fourier transform, where the argument of the resulting complex-wavefront matrix was the wrapped phase of the sample. Each off-axis hologram obtained by the holographic imaging module undergoes a quantitative phase reconstruction process before entering the classification algorithm. In other words, the ROI is determined inside the FOV manually before flowing starts, in relation to the sorting DEP electrode positions. For each off-axis imaging hologram, the ROI is cut, and the 2D Fourier transform of the ROI is calculated. One of the cross-correlation terms is cut and undergoes a 2D inverse Fourier transform, resulting in the complex wavefront of the light passing through the sample.
[0100] To compensate for stationary aberrations and field curvatures, a phase map was subtracted from the wrapped phase map of the sample that is extracted from a hologram acquired with no sample present. In other words, to remove aberrations and field curvatures, this complex wavefront is divided by the background wavefront, i.e., the complex wavefront obtained by the same reconstruction process but without the cell present in the ROI. The quantitative phase profile is the angle of the resulting complex wavefront. This phase may be wrapped around 2π. To resolve this phase ambiguity, an unweighted least-squares phase unwrapping algorithm was applied. The resulting unwrapped phase is multiplied by the wavelength and divided by 2π, resulting in the optical phase delay (OPD) map of the sample, and defined as follows:
OPD.sub.c(x, y)=[
[0101] where n.sub.m is the RI of the medium, h.sub.c is the thickness profile of the cell, and
[0102] In the resulting OPD profile, the cell area was isolated by a simple threshold, followed by a morphological dilation. In cases of frames with no cell, the classification process was not needed. Therefore, another threshold was applied for the minimum size of the connected component. A maximum size threshold was also applied in cases of attached cells that could not be classified as one object. Partial images of cells on the edges of the FOV were not classified as well. Using the above-described methods, a dataset containing the OPD information was created across the cell areas only and the different parameters that were based directly on the OPD map defined in Eq. (1) were calculated, without decoupling the cellular thickness profile from the refractive index as a prior stage.
[0103] The features that have been extracted from each OPD map divide into two categories: (1) 2D morphological features; and (2) 3D quantitative features. The 2D morphological features are based on the binary image indicating the cell area only. The 3D quantitative features are based on the OPD map across the cell area. These features are presented in Table 1. Table 1 below show 2D (left) and 3D (right) handcrafted features extracted in real time from the OPD profile of the flowing cells. The 3D quantitative features rely on previous works that demonstrate the ability to distinguish between the different stages of the cell lifecycle, as well as other biological phenomena (11,37,38).
TABLE-US-00001 TABLE 1 Cell 2D morphological features Cell 3D quantitative features 1 Diameter Mean 2 Area Energy 3 Major axis length Volume 4 Minor axis length Area to Volume ratio 5 Minor to Major ratio Dry mass 6 Convex Area Variance 7 Eccentricity Kurtosis 8 Circularity Skewness 9 Contrast 10 Entropy 11 Homogeneity 12 Correlation
[0104] The OPD map is an image containing quantitative values that represent the optical thickness of the sample. Since it is quantitative (i.e., contains meaningful optical thickness values on each of the spatial map points), it can be used to calculate both morphological and content-related features of the inspected cell. In earlier work [15, 5], these features were described and as well as the way to use them to discriminate between different types of cells. It should be noted that the algorithm works well for one cell in the ROI simultaneously. For the classification between cancer cells and blood cells, a support vector machine (SVM) algorithm, a common machine-learning algorithm, was used. A dataset from nearly 6,300 static and dynamic OPD images of different cell types (HT29-GFP, SW480, SW620 cancer cells, erythrocytes, monocytes, lymphocytes, and granulocytes) was created for training and testing the algorithm (80% of data was for training and the rest 20% for testing). This classifier receives the reconstructed unwrapped OPD image, extracts 20 features based on 2D morphological features and optical topology features. Eight 2D features are drawn from the binary image (area, diameters, eccentricity, and solidity), the rest of the features are OPD based (mean value, energy, volume, dry mass, variance, kurtosis, skewness, contrast, entropy, homogeneity, and correlation). The algorithm finds a discriminative hyperplane in the features space to distinguish and classify the data points or cells. A radial basis function kernel was used for one-class learning. For dimension reduction and for creating new highly discriminating features based on a linear summation of the extracted original features, principal component analysis (PCA) was then used. Principal component analysis (PCA) is a common method for dimension reduction and for finding highly discriminative features. The PCA method is based on projecting the data onto a lower-dimension subspace, and receiving new features, which are linear combinations of the original features. The first principal component has the largest possible variance of the data, and therefore enables better discrimination between the classes, the second principal component has the second largest variance of the data, and so on.
[0105] The DEP microfluidic module was configured to sort specific selected cells. The dielectrophoretic microfluidic module was designed to use the negative-DEP technique to deflect flowing cells from the electrodes and direct them left or right by applying a voltage at the correct time [9]. A square wave of 1 MHz and 3-5 Volts was applied on the electrodes to exert the DEP phenomenon. The electrodes were controlled by a computer-activated proprietarily developed generator designed for this type of DEP modules. Using these parameters, flow rates up to 20 μl hr.sup.−1 were controlled. The cells flow in and out of the dielectrophoretic microfluidic module may be performed by using four low-pressure pumps (Cetoni, neMESYS 290N) in operating rates of 0.5 μl hr.sup.−1 and up to 20,000 μl hr.sup.−1. In this specific and non-limiting example, one pump was used to insert the cells, one pump was used for washing, one pump was used for collecting the cancer cells, and one pump was used for collecting the non-cancer cells.
[0106] Reference is made to
[0107] The ability to analyze stain-free isolated cells is important for flow-cytometry via quantitative imaging of cells during flow.
[0108] For classification, the more complex task is classification of white blood cells and cancer cells, since red blood cells and platelets are much easier to detect, since they are very different than cancer cells. A data set of about 4,000 OPD maps of two types of colorectal cancer cells (SW480 and SW620) and four types of blood cells (granulocytes, lymphocytes, monocytes, and erythrocytes) was created. Reference is made to
[0109] Using the holographic imaging module and the DEP flow module of the present invention, electrodes (1), (2) and (3) were turned on, the DEP microfluidic module and syringes were filled with buffer solution (PBS+10% FBS) to remove air and loaded a remodeled blood sample spiked with SW480 cancer cells, which represent the CTCs, in the ratio of 3:5:2 (cancer: white blood: red blood cells). Total cell concentration was 300 cells μl.sup.−1. This low number of cells was used to decrease the probability of having more than one cell at a time in the ROI or clustering. This ratio was chosen to demonstrate the system sorting abilities and is much higher than the ratio of CTCs in blood. For this sorting system, the ROI size was approximately 60 μm×40 μm. The flow was set at rates between 4-7 μl hr.sup.−1, or 45-80 μm sec.sup.−1, per the DEP microfluidic module cross-section (700 μm×35 μm), giving us half to one image of cell per sec. The framerate for this experiment was 8 frames per second (FPS), so each cell had about 2-5 frames in the ROI. The OPD map was imaged and reconstructed of each cell individually and classified to cancer or non-cancer cell using the machine-learning classifier. This automatic decision activated the control unit controlling the electrodes to remove the detected cancer cell from the main flow. For validation only, the SW480 cancer cells were labeled using acridine-orange fluorescence dye, to show if the classifier is precise. The classification process was carried out for each frame in real-time (during cell flow). Electrode switching, once a cancer cell was detected, was active for a defined number of frames as follows: the distance between the ROI and the sorting area near electrode 3 was less than 60 μm or about 1-1.5 seconds for this cell velocity. Since the reaction time of the electrodes was 7 ms, and the images were acquired at 8 FPS, the electrodes were set to stay in the same state after command for 15-20 frames, to allow the cell to be sorted as required. Table 2 below shows the sorting process timing, from an image taken until the sorting electrodes switches. Average values are displayed. SVM features and classification happen only when a cell is present in the ROI.
TABLE-US-00002 TABLE 2 Step Time [ms] Total time [ms] Image acquiring 45 — Phase 2D FFT 26 71 retrieval Phase unwrapping 2 73 Cell Threshold 0.1 73.1 Segmentation Holes' filling 3.5 76.6 Dilation 5 81.6 Largest object 1.5 83.1 3D mask 3 86.1 No cell check Area size (big/small) 2.7 88.8 Edges 0.4 89.2 Feature's extraction 9 98.2 Classification 10 108.2 Total classification time 63.2 Electrode switching time 7 115.2 Saving frame 1 116.2 Total time 60-120 Frames per second [FPS] 8-16
[0110] Examples from experiments are shown in
[0111] The use of OPD for the classifier yields 12 more powerful discriminating features to those of brightfield. Although CTCs are usually larger than PBMC and erythrocytes and brightfield images would suffice, for comparison to granulocytes, the OPD based features have a higher impact on the discrimination. SVM classification based on the OPD maps can be used to discriminate between healthy and cancer cells. In addition, the full OPD image of the cell gives the option to the clinician with other quantitative parameters on the cell, such as its dry mass and phase volume [11]. These data have proven itself useful for detecting abnormalities or introducing more complex analysis methods such as cell tomography [8]. The holographic imaging module is able to produce off-axis holographic videos of up to 15 frames sec.sup.−1. Since one cell per frame is needed, with a flow rate of just a few μl hr.sup.−1, a throughput of 15 cells sec.sup.−1 may be obtained.
[0112] Preliminary purification of the blood based on the size of the cells may be first performed. This preliminary process takes only three minutes and from 1 ml of blood it leaves 1-10 CTC in 11,000 white blood cells. Then, the holographic classification method of the present invention further purifies the sample and detect the single cancer cells based on the OPD profiles. Since the processing rate of the sorting system of the present invention is about 15 cells per second, processing the entire pre-enriched sample may take about 12 minutes, and thus the entire processing of 1 ml of blood, including the pre-enrichment process, may take about 15 min. The framerate is mainly affected form the algorithm running time, but the total throughput is also a function of the maximum cell velocity the DEP allows. With 15 cells sec.sup.−1, flow rates up to 180 μl hr.sup.−1 with 300 cells μl.sup.−1 may be used, but for the electrode to affect the direction of the cells in the mentioned setup, flow rates lower than 10 μl hr.sup.−1 should be used. From Table 2 above, it can be seen that a blind sorting (i.e., without live video) can increase the framerate dramatically. Together with a DEP microfluidic module with a larger ROI and a dedicated set of electrodes, a much higher flow rate and higher cell concentration may be obtained to achieve higher throughput.
[0113] The classifier has been built using SVM, which is a common machine-learning classification algorithm that is based on features extraction. The goal of the SVM algorithm is to find a hyperplane in the features space that distinctly classifies the data points [26]. As shown in
[0114] The holographic imaging module 320 of
[0115] Prior to the analysis of the OPD maps, the segmentation image-processing procedure was applied to track the cell area during flow. Next, the twenty features, mentioned in Table 1, were extracted from the cell OPD area selected by the segmentation process. During the training process, the twenty hand-crafted features were used as an input for PCA analysis in order to extract the best combinations of these features, which were the most useful for classification between various cell types. The best classification results were obtained for the eight, six, ten and thirteen first principal components for SVM 1, SVM 2, SVM 3 and one-step SVM, respectively.
[0116] For comparison, all SVM models were trained separately based the: (1) 2D morphological features; and (2) 2D morphological and 3D quantitative features together. Table 3 below presents the accuracies of these two assays for all SVM models. More specifically, Table 3 below presents accuracy results when using the 2D features and the 3D features for all SVM models. The improvement obtained for each classification method when also using the 3D features in comparison to be using the 2D features only is indicated in bold. As can be seen, the accuracy is higher when considering both the 2D morphological features and the 3D quantitative features for all trained models, demonstrating the advantage of using quantitative phase images for classification rather than simple 2D imaging Table 4 below presents the precision of performing wrong and right classifications with all SVM models combined with PCA on the test set, considering both the 2D morphological features and the 3D quantitative features. Table 4 shows precision of wrong and right predictions for all SVM models on the test set. As seen in Table 3 and Table 4 below, the two-step classifier exhibits the best overall accuracy when examining it on the test set.
TABLE-US-00003 TABLE 3 Total accuracy 2D morphological 2D morphological + Model features 3D features SVM 1 92.88% 98.22%↑ 5.34% SVM 2 95.6% 96.98% ↑ 1.38% SVM 3 74.68% 91.77% ↑ 17.09% One-step SVM 78.46% 91.3% ↑ 12.84% Two-step SVM 77.6% 92.56% ↑ 14.96%
TABLE-US-00004 TABLE 4 Right Wrong Model Class prediction prediction SVM 1 Cancer 97.3% 2.7% Not Cancer 99.1% 0.9% SVM 2 SW480 97.5% 2.5% SW620 96.3% 3.7% SVM 3 Granulocyte .sup. 90% 10% Lymphocyte 97.8% 2.2% Erythrocyte 91.2% 8.8% Monocyte 82.6% 17.4% One-step SVM SW480 97.8% 2.2% SW620 88.6% 11.4% Granulocyte 88.8% 11.2% Lymphocyte 98.6% 1.4% Erythrocyte 86.2% 13.8% Monocyte 82.6% 17.4% Two-step SVM SW480 97.5% 2.5% SW620 90.1% 9.9% Granulocyte 87.6% 12.4% Lymphocyte 97.8% 2.2% Erythrocyte 91.2% 8.8% Monocyte 82.6% 17.4%
[0117] Next, the performance of the one-step and the two-step classifiers were on different samples during flow.
[0118] Next, an even amount of SW480 and SW620 cells was mixed and made them flow in the channel. Table 5 below presents the classification results of a sample of containing a 1:1 mixture of flowing SW480 and SW620 cells of three classifiers: one-step SVM, two-step SVM and SVM 2. As expected, SVM 2 achieved the best results for classifying between the two cancer classes only.
TABLE-US-00005 TABLE 5 One-step Two-steps Class SVM SVM SVM 2 SW480 42.21% 46.5% 50.26% SW620 10.3% .sup. 29% 49.74% Granulocyte .sup. 46% 21.75% 0% Lymphocyte 1% 2% 0% Erythrocyte 0.5% 0.5% 0% Monocyte 0% 0.25% 0% Total cancer cells 52.5% 75.5% 100% Total blood cells 47.5% 24.5% 0%
[0119] Next, a homogeneous sample of granulocytes was used and imaged during flow. Table 6 below shows the classification results of a homogeneous sample of flowing granulocytes of three classifiers: one-step SVM, two-step SVM and SVM 3. Unsurprisingly, SVM 3 achieved the best results, since it classified between blood cells only. Here as well, the two-step SVM achieved better results than the one-step SVM.
TABLE-US-00006 TABLE 6 One-step Two-steps Class SVM SVM SVM 3 SW480 0.59% 1.77% 0% SW620 4.12% 1.64% 0% Granulocyte 78.34% 80.12% 83.04% Lymphocyte 4.5% 3.29% 4.5%.sup. Erythrocyte 0.2% 0.3% 0.2%.sup. Monocyte 12.25% 12.88% 12.25% Total cancer 4.71% 3.41% 0% Total blood 95.29% 96.59% 100%
[0120] The average processing times for each step in the algorithm were as follows: (1) 0.028 sec for the reconstruction of the unwrapped phase profile, (2) 0.025 sec for cell segmentation and features extraction, (3) 0.01 sec and 0.095 sec for cell classification by the two-step SVM and the one-step SVM, respectively. Although the two-step classifier included two SVM models, while the one-step classifier included only one, the two-step SVM exhibited faster execution time. Combining all, the total processing times for each off-axis hologram containing 1 megapixel is 0.063 sec and 0.148 sec for the two-step SVM and the one-step SVM, respectively.
[0121] These results demonstrate the ability of the presented automatic algorithm to classify cancer in different cancer stages and white blood cells using flow-cytometry combined with machine learning, using OPD-map-based features. High classification rates for stain-free cells were obtained during real-time flow. The accuracy values and prediction precisions correspond with the separation between the groups presented in the 2D PCA space (see