System and method for the discrimination of tissues using a fast infrared cancer probe
11678802 · 2023-06-20
Assignee
Inventors
- James Coe (Worthington, OH, US)
- Heather Allen (Columbus, OH, US)
- Charles Hitchcock (Upper Arlington, OH, US)
- Edward W. Martin (Delaware, OH)
Cpc classification
G01J3/42
PHYSICS
A61B2576/00
HUMAN NECESSITIES
A61B5/444
HUMAN NECESSITIES
A61B5/0075
HUMAN NECESSITIES
A61B5/0022
HUMAN NECESSITIES
International classification
A61B5/00
HUMAN NECESSITIES
A61B90/00
HUMAN NECESSITIES
G01J3/42
PHYSICS
Abstract
Systems and methods utilize an infrared probe and discriminating software to rapidly discriminate normal tissue processes from normal tissue during surgery, physical examination of in-situ lesions, and in the assessment of biopsy and resected tissue specimens. Examples demonstrate discrimination of cancerous from noncancerous tissues. The discriminating software, i.e. the metrics, algorithms, calibrant spectra, and decision equations, allows tissue to be identified as abnormal or normal using a minimum of infrared (IR) wavelengths in order to be measured rapidly. The probe records IR metrics approximately 1000 times faster than current commercial instruments, i.e. on a timescale fast enough for clinical use. The probe uses a tunable mid-infrared laser with a small set of selected wavelengths that are optimized for detecting the chemical and molecular signatures of tissue specific lesions to include, but not limited to, cancer, preneoplasia, intracellular accumulations (e.g. steatosis), inflammation, and wound healing.
Claims
1. A method of discriminating tissue of a specimen, the method comprising: performing infrared (IR) spectroscopy on a specimen using a probe, wherein the IR spectroscopy is performed including by using only one single tunable Quantum Cascade Laser (QCL), having a specified output wavelength range, to a selected subset of IR wavelengths within the specified output wavelength range of the single tunable QCL for illuminating the specimen, wherein the selected subset of IR wavelengths is selected based on a correlation between IR absorbance data and histological data to discriminate normal tissue from abnormal tissue in the specimen; obtaining a portion of an IR spectrum from the specimen via the probe in response to the illuminating the specimen using the selected subset of IR wavelengths within the specified output wavelength range of the single tunable QCL; and evaluating the obtained portion of the IR spectrum using one or more metrics, wherein the one or more metrics discriminate normal tissue of the specimen from abnormal tissue of the specimen using response wavelengths in the obtained portion of the IR spectrum in response to the selected subset of IR wavelengths used for the illuminating the specimen.
2. The method of claim 1, wherein discriminating normal tissue of the specimen from abnormal tissue of the specimen comprises determining non-cancerous regions of the specimen from cancerous regions of the specimen using response wavelengths in the obtained portion of the IR spectrum in response to the selected subset of IR wavelengths used for the illuminating the specimen.
3. The method of claim 1, comprising using a plurality of QCLs having different specified output wavelength ranges for selecting the subset of IR wavelengths within the specified output wavelength range of the single tunable QCL.
4. The method of claim 1, further comprising identifying the normal tissue of the specimen from the abnormal tissue of the specimen using response wavelengths in the obtained portion of the IR spectrum in response to the selected subset of IR wavelengths used for the illuminating the specimen.
5. The method of claim 4, wherein the identifying, using response wavelengths in the obtained portion of the IR spectrum in response to the selected subset of IR wavelengths used for the illuminating the specimen, is used to provide an assessment of the surgical margin made in the operating room by a surgeon as the abnormal tissue is removed.
6. The method of claim 1, wherein the selected subset of IR wavelengths comprises 10 or fewer wavelengths.
7. The method of claim 6, wherein the selected subset of IR wavelengths comprises six wavelengths.
8. A system for discriminating tissue of a specimen, the system comprising: an infrared (IR) source; a probe in communication with the IR source via a fiber optic cable, wherein the probe is used to obtain a portion of an IR spectrum from the specimen in response to illuminations using a specified subset of IR wavelengths provided by the IR source over a first measurement time period; an IR detector, wherein the IR detector receives the portion of the IR spectrum from the probe via a fiber optic cable; and a computing device comprising a processor and a memory in communication with the processor, said memory comprising computer-executable instructions, wherein the computing device receives a signal representative of the detected portion of the IR spectrum from the IR detector and said computer-executable instructions cause the processor to evaluate the obtained portion of IR spectrum using one or more metrics, wherein the one or more metrics discriminate normal tissue of the specimen from abnormal tissue of the specimen using selected wavelengths that are pre-selected to characterize (1) one or more spectral response inflection points of the obtained portion of the IR spectrum over the same first measurement time period and (2) an absorbance vs. wavelength spectral response steepness of the same obtained portion of the IR spectrum obtained over the same first measurement time period in response to the same illuminations, to discriminate between the normal tissue and the abnormal tissue.
9. The system of claim 8, wherein the IR source comprises a single quantum cascade IR laser (QCL) that is a tunable mid-infrared laser with a selected subset of IR wavelengths, within the output wavelength range of the single QCL, that have been selected for detecting tissue specific chemical and molecular signatures to include, but not limited to, one or more of cancer, preneoplasia, steatosis or other intracellular accumulations, inflammation, and wound healing.
10. The system of claim 8, wherein the IR detector comprises a thermal microbolometer array detector.
11. The system of claim 8, wherein discriminating normal tissue of the specimen from abnormal tissue of the specimen comprises determining non-cancerous regions of the specimen from cancerous regions of the specimen using the selected wavelengths that are pre-selected to characterize (1) one or more spectral response inflection points and (2) a spectral response steepness, to discriminate between the normal tissue and the abnormal tissue.
12. The system of claim 8, further comprising the computing device executing computer-executable instructions stored in the memory that cause the processor to perform identifying the normal tissue of the specimen from the abnormal tissue of the specimen.
13. The system of claim 12, wherein the computing device includes or is coupled to a display that displays the identified normal tissue of the specimen and the identified abnormal tissue of the specimen to a surgeon in an operating room to provide an assessment of the surgical margin made in the operating room by the surgeon as the abnormal tissue is removed.
14. The system of claim 12, wherein the computing device includes or is coupled to a display that displays the identified normal tissue of the specimen or the identified abnormal tissue of the specimen to a surgeon in an operating room to provide an assessment of the surgical margin made in the operating room by the surgeon as the abnormal tissue is removed.
15. The system of claim 8, wherein the selected wavelengths include a subset of IR wavelengths that comprises 10 or fewer wavelengths.
16. The system of claim 15, wherein the selected subset of IR wavelengths comprises six or fewer wavelengths.
17. A non-transitory computer-readable medium comprising computer-executable code stored thereon for causing a computing device to perform the method of: receiving a portion of an IR spectrum of a specimen from a probe, wherein the probe has been used to perform infrared (IR) spectroscopy illumination of the specimen tuned using a specified subset of IR wavelengths that are selected from a set of IR wavelengths defined by an IR band of interest for spectroscopic discrimination of normal tissue from abnormal tissue, wherein the specified subset of IR wavelengths is selected by using a Support Vector Machine (SVM) supervised machine learning (ML) model trained using IR absorption spectra of specimens and corresponding histology training inputs from corresponding specimens, wherein the specified subset of IR wavelengths is selected based on a correlation between IR absorbance data and histological data to discriminate normal tissue from abnormal tissue in the specimen; and evaluating the obtained portion of the IR spectrum using one or more metrics, wherein the one or more metrics determine normal tissue of the specimen from abnormal tissue of the specimen.
18. The computer-readable medium of claim 17, wherein determining the normal tissue of the specimen from the abnormal tissue of the specimen comprises determining non-cancerous regions of the specimen from cancerous regions of the specimen.
19. The computer-readable medium of claim 17, wherein the method further comprises identifying the normal tissue of the specimen from the abnormal tissue of the specimen.
20. The computer-readable medium of claim 19, wherein the method further comprises using the identifying to provide an assessment of the surgical margin made in the operating room by a surgeon as the abnormal tissue is removed.
21. The computer-readable medium of claim 17, wherein the specified subset of IR wavelengths comprises 10 or fewer wavelengths.
22. The computer-readable medium of claim 21, wherein the specified subset of IR wavelengths comprises six or fewer wavelengths.
23. A method of discriminating tissue of a specimen, the method comprising: performing infrared (IR) spectroscopy on a specimen using a probe, wherein the IR spectroscopy is performed including by using a tunable Quantum Cascade Laser (QCL) to a selected subset of IR wavelengths for illuminating the specimen, the selected subset of IR wavelengths selected based on a correlation between IR absorbance data and histological data to discriminate normal tissue from abnormal tissue in the specimen; obtaining a portion of an IR spectrum from the specimen via the probe in response to the illuminating the specimen using the selected subset of IR wavelengths; and evaluating the obtained portion of the IR spectrum using one or more metrics, wherein the one or more metrics discriminate normal tissue of the specimen from abnormal tissue of the specimen using response wavelengths in the obtained portion of the IR spectrum in response to the selected subset of IR wavelengths used for the illuminating the specimen.
24. The method of claim 23, wherein the specified subset of IR wavelengths are selected to be limited to 10 or fewer wavelengths.
25. The method of claim 23, comprising tuning the tunable QCL having a specified output wavelength range to a specified subset of IR wavelengths selected to be fewer than 451 wavelengths.
26. The method of claim 23, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected to include one or more wavelengths in an amide I band.
27. The method of claim 23, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected to include one or more wavelengths in an amide II band.
28. The method of claim 23, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected using a Figure of Merit (FOM) based on a spread of a metric, based on a ratio of absorbance determined using a machine learning technique.
29. The method of claim 23, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected using a metric based on a spectral response steepness characteristic.
30. The method of claim 23, wherein the selected subset of IR wavelengths is selected using a ratio of absorbances at the selected IR wavelengths.
31. The method of claim 23, wherein the selected subset of IR wavelengths is selected using a composite of multiple different metrics for the selecting the subset of IR wavelengths.
32. The method of claim 23, wherein the selected subset of IR wavelengths is selected using a machine learning model trained using: (1) a difference spectral response between a first spectral response from a histologically-classified tumor specimen and a second spectral response from a histologically-classified non-tumor specimen; and (2) a metric based on ratios of absorbances at maxima in the difference spectral response.
33. The method of claim 23, wherein the selected subset of IR wavelengths includes one or more selected wavelengths that are: (1) offset from spectral response peaks associated with the amide I and amide II bands; and (2) selected using a metric based on a ratio of absorbances.
34. The method of claim 23, wherein the selected subset of IR wavelengths is selected using: (1) a spectral response steepness on the high wavenumber side of an amide II band of protein backbones; and (2) a local spectral response maxima associated with a glycogen or other polysaccharide.
35. The method of claim 23, comprising selecting the subset of IR wavelengths using machine learning SVM training based on the correlation between IR imaging data and dye stain pixel grid data.
36. The method of claim 23, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected using a metric based on a baseline corrected tissue spectra.
37. The method of claim 36, comprising tuning the tunable QCL to a specified subset of IR wavelengths selected using a metric based on a spectral response steepness characteristic of a side of an amide II band of protein backbones.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments and together with the description, serve to explain the principles of the methods and systems. The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
(33)
(34)
(35)
(36)
DETAILED DESCRIPTION
(37) Before the present methods and systems are disclosed and described, it is to be understood that the methods and systems are not limited to specific synthetic methods, specific components, or to particular compositions. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.
(38) As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Ranges may be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint.
(39) “Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
(40) Throughout the description and claims of this specification, the word “comprise” and variations of the word, such as “comprising” and “comprises,” means “including but not limited to,” and is not intended to exclude, for example, other additives, components, integers or steps. “Exemplary” means “an example of” and is not intended to convey an indication of a preferred or ideal embodiment. “Such as” is not used in a restrictive sense, but for explanatory purposes.
(41) Disclosed are components that can be used to perform the disclosed methods and systems. These and other components are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these components are disclosed that while specific reference of each various individual and collective combinations and permutation of these may not be explicitly disclosed, each is specifically contemplated and described herein, for all methods and systems. This applies to all aspects of this application including, but not limited to, steps in disclosed methods. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods.
(42) The present methods and systems may be understood more readily by reference to the following detailed description of preferred embodiments and the Examples included therein and to the Figures and their previous and following description.
(43) As will be appreciated by one skilled in the art, the methods and systems may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the methods and systems may take the form of a computer program product on a computer-readable storage medium having computer-readable program instructions (e.g., computer software) embodied in the storage medium. More particularly, the present methods and systems may take the form of web-implemented computer software. Any suitable computer-readable storage medium may be utilized including hard disks, CD-ROMs, optical storage devices, or magnetic storage devices.
(44) Embodiments of the methods and systems are described below with reference to block diagrams and flowchart illustrations of methods, systems, apparatuses and computer program products. It will be understood that each block of the block diagrams and flowchart illustrations, and combinations of blocks in the block diagrams and flowchart illustrations, respectively, can be implemented by computer program instructions. These computer program instructions may be loaded onto a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions which execute on the computer or other programmable data processing apparatus create a means for implementing the functions specified in the flowchart block or blocks.
(45) These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including computer-readable instructions for implementing the function specified in the flowchart block or blocks. The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer-implemented process such that the instructions that execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart block or blocks.
(46) Accordingly, blocks of the block diagrams and flowchart illustrations support combinations of means for performing the specified functions, combinations of steps for performing the specified functions and program instruction means for performing the specified functions. It will also be understood that each block of the block diagrams and flowchart illustrations, and combinations of blocks in the block diagrams and flowchart illustrations, can be implemented by special purpose hardware-based computer systems that perform the specified functions or steps, or combinations of special purpose hardware and computer instructions.
(47) Described herein is an infrared attenuated total reflection (ATR) probe and discriminating software that can rapidly (on the timescale of seconds) discriminate abnormal tissue processes from those of normal tissue during: surgery, physical examination of in-situ lesions, and in the assessment of biopsy and resected tissue specimens. Non-limiting examples provided herein demonstrate discrimination of cancerous from noncancerous tissues. The discriminating software, i.e. the metrics, algorithms, calibrant spectra, and decision equations, provide a determination of whether the tissue is abnormal or normal using a minimum of infrared (IR) wavelengths in order to be measured rapidly. The disclosed probe embodiments can record IR metrics approximately 1000 times faster than current commercial instruments, i.e. on a timescale fast enough for clinical use. In one aspect, the probe comprises a tunable mid-infrared laser with a small set of selected wavelengths that have been optimized for detecting the chemical and molecular signatures of tissue specific lesions to include, but not limited to, cancer, preneoplasia, intracellular accumulations (e.g. steatosis), inflammation, and wound healing.
(48)
(49) It currently takes approximately 15 hours of FTIR imaging on tissue slices to record the data from one cancer patient. IR laser-based, hyperspectral imaging will improve this, but such devices will not soon be ready for in-situ use. However, an FTIR-ATR probe such as illustrated in
(50)
(51) The system has been described above as comprised of units. One skilled in the art will appreciate that this is a functional description and that the respective functions can be performed by software, hardware, or a combination of software and hardware. A unit can be software, hardware, or a combination of software and hardware. The units can comprise software for discriminating tissue of a specimen. In one exemplary aspect, the units can comprise a computing device that comprises a processor 321 as illustrated in
(52)
(53) Processor 321 may include one or more processors, each configured to execute instructions and process data to perform one or more functions associated with a computer for discriminating tissue of a specimen. Processor 321 may be communicatively coupled to RAM 322, ROM 323, storage 324, database 325, I/O devices 326, and interface 327. Processor 321 may be configured to execute sequences of computer program instructions to perform various processes. The computer program instructions may be loaded into RAM 322 for execution by processor 321.
(54) RAM 322 and ROM 323 may each include one or more devices for storing information associated with operation of processor 321. For example, ROM 323 may include a memory device configured to access and store information associated with the computer, including information for identifying, initializing, and monitoring the operation of one or more components and subsystems. RAM 322 may include a memory device for storing data associated with one or more operations of processor 321. For example, ROM 323 may load instructions into RAM 322 for execution by processor 321.
(55) Storage 324 may include any type of mass storage device configured to store information that processor 321 may need to perform processes consistent with the disclosed embodiments. For example, storage 324 may include one or more magnetic and/or optical disk devices, such as hard drives, CD-ROMs, DVD-ROMs, or any other type of mass media device.
(56) Database 325 may include one or more software and/or hardware components that cooperate to store, organize, sort, filter, and/or arrange data used by the computer and/or processor 321. For example, database 325 may store the wavelengths used for performing infrared (IR) spectroscopy on a specimen using an attenuated total reflection (ATR) probe, wherein the IR spectroscopy is performed using a reduced set of IR wavelengths. The database 325 may also store the IR spectrum of the specimen is obtained from the ATR probe and the one or more metrics that are used to determine normal tissue of the specimen from abnormal tissue of the specimen. It is contemplated that database 325 may store additional and/or different information than that listed above.
(57) I/O devices 326 may include one or more components configured to communicate information with a user associated with computer. For example, I/O devices may include a console with an integrated keyboard and mouse to allow a user to maintain a database of IR spectrums, metrics and the like. I/O devices 326 may also include a display including a graphical user interface (GUI) for outputting information on a monitor. I/O devices 326 may also include peripheral devices such as, for example, a printer for printing information associated with the computer, a user-accessible disk drive (e.g., a USB port, a floppy, CD-ROM, or DVD-ROM drive, etc.) to allow a user to input data stored on a portable media device, a microphone, a speaker system, or any other suitable type of interface device.
(58) Interface 327 may include one or more components configured to transmit and receive data via a communication network, such as the Internet, a local area network, a workstation peer-to-peer network, a direct link network, a wireless network, or any other suitable communication platform. For example, interface 327 may include one or more modulators, demodulators, multiplexers, demultiplexers, network communication devices, wireless devices, antennas, modems, and any other type of device configured to enable data communication via a communication network.
EXAMPLES
(59) The following examples are set forth below to illustrate the methods and results according to the disclosed subject matter. These examples are not intended to be inclusive of all aspects of the subject matter disclosed herein, but rather to illustrate representative methods and results. These examples are not intended to exclude equivalents and variations of the present invention which are apparent to one skilled in the art.
(60) Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.) but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric. There are numerous variations and combinations of reaction conditions, e.g., component concentrations, temperatures, pressures and other reaction ranges and conditions that can be used to optimize the product purity and yield obtained from the described process.
Example 1—Proof of Concept with Experimental FTIR-ATR Probe Data for Cancer
(61) Infrared (IR) spectra were recorded by touching a diamond tip FTIR-ATR probe to liver tissues resected from patients with colorectal cancer metastatic to the liver, though probes having tips comprised of other materials are contemplated as being within the scope of this disclosure. A model based on the ratio of absorbances at selected IR wavelengths employed relative weightings that were optimized for separation of tumor and nontumor tissues. The model quantifies the contributions of each metric, enabling the performance of different metrics to be quantitatively compared. The model of this example also employs only 6 different wavelengths, so the prospect of even faster measurements arises by the measurement of absorption at just these wavelengths rather than wavelengths across the full IR spectral range.
(62) Portions of remnant liver tissue containing both cancer-bearing tissue and normal surrounding liver tissue were obtained from the Department of Pathology at The Ohio State University (Columbus, Ohio) at the time of the patient's planned surgical procedure.
(63) The process of discovering IR metrics started by collecting the individual nontumor spectra of both cases as rows in matrix X1 and the individual tumor spectra of both cases as rows in matrix X2. The average spectra of the tumor (red) and nontumor (blue) groups are shown in the bottom panel of
[α.sub.1I.sub.1556cm-1/I.sub.1572cm-1,α.sub.2I.sub.1158cm-1/I.sub.1182cm-1,α.sub.3I.sub.1032cm-1/I.sub.1000cm-1], (1)
where α.sub.j values are relative weights for each metric (to be determined by fitting) and I.sub.xcm-1 is the measured absorbance at a particular value x in cm.sup.−1. Only the relative values of the α.sub.j are important, so they were normalized such that Σ.sub.jα.sub.j.sup.2=1. The positions of the critical pairs of wavenumbers are labeled in the bottom panel of
(64) The figure of merit (FOM) for optimization involved comparing the average of the spreads of the nontumor (σ.sub.1) and tumor (σ.sub.2) metric values relative to the distance between the centroids (D.sub.12) of the nontumor and tumor groups:
(65)
where C1.sub.j and C2.sub.j are the centroids or average metric values for the nontumor and tumor groups, respectively, and j=1, 2, . . . m is an index for the metrics. The centroids are the averages down the columns of the Xr1 and Xr2 matrices. The sums are over the metrics. The
d1.sub.i1=√{square root over (Σ.sub.j=1.sup.m(Xr1.sub.i1,j−C1.sub.j).sup.2)} and d1.sub.i1=√{square root over (Σ.sub.j=1.sup.m(Xr2.sub.i2,j−C2.sub.j).sup.2)}, (3)
where the sums are over the metrics. Given a choice of metrics such as equation (1), the best fit values are determined by minimizing FOM with respect to α.sub.j, the metric weights, which is the same as maximizing the separation between the tumor and nontumor groups. This was accomplished by writing a routine using the “fminsearch” function. A function was written and called by “fminsearch” which calculated the weighted metrics, centroids, distances, and FOM given an initial set of metric weights. The result was an optimized and normalized set of α.sub.j. Many other metrics were tried besides the ones reported in Table 1, below; however, the others had α.sub.j values that were considerably smaller, so they were dropped from consideration. The wavenumber positions in the metric definitions were also varied manually, to minimize the FOM and maximize separation between tumor and nontumor. The resulting centroids and best fit metric weights are given in Table 1. Table 1 shows experimental average ratio of absorbances for nontumor and tumor groups (2nd and 3rd rows), optimized and normalized metric weights (4th row), and centroids based on weighted metrics (5th and 6th rows) associated with FOM=0.1488. The metrics are defined as α.sub.jI.sub.x1cm-1/I.sub.x2cm-1, where x1 and x2 are 1556 and 1572 for metric 1, 1158 and 1182 for metric 2, and 1032 and 1000 for metric 3, respectively, which were also optimized.
(66) Comparison of the first two numbers in a column for a metric gives the experimental difference in the metric measurements between nontumor and tumor groups which amounts to 0.0929, 0.1342, and 0.1588 for M1, M2, and M3 respectively. However, the figure of merit necessarily accounts for the spread or errors in these numbers leading to the optimized weights given as the third number in each column. After the optimized weights are determined, the difference between the weighted centroids, i.e. the fourth and fifth number in each column, are 0.0702, 0.0316, and 0.0971 for M1, M2, and M3, respectively. Now the size reveals the relative importance. These results are relative and can be normalized yielding 35.3%, 15.9% and 48.8% as the importance of each metric for M1, M2, and M3, respectively. Metric M3 is the most important metric in this set which was culled from 16 initial metrics.
(67) TABLE-US-00001 TABLE I Metric j M1 M2 M3 Exp. Avg. Nontumor Ratio of Absorbances 1.0507 1.1402 1.1317 Exp. Avg. Tumor Ratio of Absorbances 0.9578 1.0060 0.9729 Metric Weight, α.sub.j 0.7552 0.2253 0.6117 Nontumor Centroid, C1.sub.j 0.7935 0.2683 0.6923 Tumor Centroid, C2.sub.j 0.7233 0.2367 0.5952
(68) Projection of Hyperdimensional Metrics.
(69) Hyperdimensional data can be difficult to visualize. However in this case, the data came in the form of two groups, tumor and nontumor, so the hyperdimensional metric data was projected onto a line defined by the centroids of the nontumor (C1.sub.j) and tumor (C2.sub.j) groups. The projection yields a parameterized value or t value for the metrics of each individual spectrum. The t values for the nontumor (t1.sub.i1) and tumor groups (t2.sub.i2) are given by:
(70)
(71) The average nontumor group metric has a projected t value of 0, while the average tumor group has a projected t value of 1. The occurrence oft values (a unitless quantity) is shown as a histogram in
(72) The results in Table 1 can be used to predict whether any general ATR spectrum of liver tissue corresponds to tumor or nontumor. If one measures the ATR ratio of absorbances (I.sub.x1cm-1/I.sub.x2cm-1) at the pairs of wavenumbers specified by the x1 and x2 wavenumbers for each of the three metrics, then a nontumor/tumor prediction can be rendered using the results in Table 1. One multiplies each ratio of absorbances by the corresponding metric weight (α.sub.1) producing a three element row vector like Xr1.sub.i1,j (or Xr2.sub.i2,j) of equation (4). The measured values of Xr1.sub.i1,j (or Xr2.sub.i2,j) are combined with the C2.sub.j and C1.sub.j centroid values from Table 1 to determine a t value using equation (4). In this case, if t>0.60, then tumor has been detected and if t<0.60, then nontumor is detected. One should make as many measurements as time allows in order to get a good statistical assessment of a subject like a surgical margin.
(73) Simple FTIR-ATR probe measurements requiring approximately one minute of data acquisition time each have been shown to reliably determine whether liver tissues contain tumors for two cases of colorectal cancer metastatic to the liver. A method was developed which involved weighting metrics based on ratios of absorbances at selected wavelengths. Three metrics were found to be efficient at distinguishing tumor and nontumor. The first metric measures the steepness of the high wavenumber side of the amide II band of protein backbones, i.e. changes in protein between nontumor and tumor regions are readily apparent. Since albumin is a dominant protein in normal liver and since it is an α-helix dominated structure, it is likely that cancerous transformation will on average reduce α-helix and increase other protein secondary structures such as β-sheet in liver tissues. The second and third metrics measure peaks at 1158 and 1032 cm.sup.−1 which are reduced on average in the tumor spectra and may be related to polysaccharides like glycogen. One can assess their relative importance by calculating C2.sub.j−C1.sub.j for each metric using the data in Table 1 which gives the relative importance as 0.36, 0.15, and 0.49 for metrics 1, 2, and 3, respectively. In these particular cases, the peak at 1032 cm.sup.−1, which may have large contributions from glycogen, is quite important for discerning tumor. While the current model has been kept simple by only using three metrics, one can also use the method to weigh other metrics against the current set. New metrics can always be added to the model and their importance can be assessed. Since absorbances were only required at six different wavelengths with this particular model, there is the prospect of making much faster measurements with non-FTIR technologies, such as a with a tunable midIR laser, at only these six wavelengths. An increase in speed associated with direct measurements of the metrics rather than the whole spectra will render these methods useful in real-time during surgery or in a clinical setting.
(74) IR Metrics.
(75) There are many other metrics besides ratios of peak absorbances that can be used in practice to evaluate IR spectra of tissues including principal component scores, calibrant dot product scores, tissue scattering metrics, and baseline correction metrics. All of these metrics are extracted from matrix representations of the data, i.e. matrices are created in which each row contains an IR spectrum (either different spectral measurements with an IR probe or for each pixel in an IR imaging data set). We call this matrix the X matrix as shown in
(76) Peak Ratio Metrics.
(77) A peak ratio metric is the absorbance at one wavelength divided by the absorbance at another wavelength as shown in
(78) Principal Component Scores.
(79) Principal component analysis (PCA) finds the eigenvectors (W) of the matrix X.sup.T.Math.X, i.e. W=eigs(X.sup.T.Math.X). The eigenvectors in this case are an orthogonal set of vectors that look like IR spectra and are ordered by their contributions to variance. Libraries or X matrices are constructed for tissues of interest and
(80) Calibrant Dot Product Scores.
(81) Many biomolecules are known to be in tissues including many types of proteins, phospholipids, triglycerides, and polysaccharides. If a matrix C is constructed with rows containing the IR spectra of calibrant molecules, then scores (S.sub.cal) or metrics of each IR spectrum in the X matrix are obtained with S.sub.cal=X.Math.C.sup.T. In practice, it is useful to normalize all IR spectra before calculating these scores. Calibrant IR spectra useful for liver tissue are shown in
(82) Baseline Correction Metrics.
(83) The great abundance of IR work on tissues concerns absorption, but it turns out that tissues have very important scattering contributions because cells have changes in index of refraction associated with structures whose size matches the wavelength of probing IR light (approximately 2-20 μm). Scattering is typically ignored by a flattening of the baseline. One simple way to flatten the baseline of an IR spectrum is to pick three points that are isolated from the fundamental vibrations, for instance in this example we take the absorptions at 800, 1800, and 3900 cm.sup.−1. These three points determine exactly the three unknowns (A, B, C) in a parabolic equation, Abs=A{tilde over (v)}.sup.2+B{tilde over (v)}+C, whose values are in turn used to determine a more physically meaningful form, Abs=α({tilde over (v)}−β).sup.2+γ, where α=A (curvature), β=−B/(2A) (position of the minimum), and γ=C−B.sup.2/(4A) (absorbance offset at minimum) as illustrated in
Metric Examples
(84) Software has been developed that calculates all of the above types of metrics, then the user selects the metrics for comparison by optimization using weighted metrics. The following sections show some results of this metric work. There is an example using FTIR-ATR probe data and there are some examples with a library of full range IR spectra of colorectal cancer metastatic to the liver.
(85) Calibrant Scores with FTIR-ATR Probe Data.
(86) A recently published a paper [J. V. Coe, S. V. Nystrom, Z. Chen, R. Li, D. Verreault, C. L. Hitchcock, E. W. Martin Jr., and H. C. Allen, “Extracting Infrared Spectra of Protein Secondary Structures using a Library of Protein Spectra and the Ramachandran Plot”, J. Phys. Chem B, 119:41 13079-13092 (2015)], incorporated by reference, described extracting the IR spectra of pure α-helix and β-sheet from a library of protein IR spectra as shown in
(87) While these are good metrics and they lead to new biomolecular understanding, they each involve 301 wavelengths and so are not appropriate for use with a Fast IR Probe. By limiting the range to between 1500 and 1700 cm“.sup.1 and using only results spaced by 16 cm”.sup.1 through this range, the set is reduced to just 13 wavelengths. The reduction in wavelengths still exhibits separation of tumor 1502 and nontumor tissue 1504 as shown in
(88) Colorectal Cancer Metastatic to the Liver Library with FTIR Imaging Data.
(89) A unique library of IR imaging spectra for Colorectal Cancer Metastatic to the Liver (CCML Library) has been created by Professors Heather Allen and James Coe, in collaboration with several pathologists, Charles L. Hitchcock MD PhD and Dr. Tatiana Oberyszyn, and a team of surgical oncologists headed by Edward Martin, Jr. MD from The James, The Ohio State University Comprehensive Cancer Center. The liver is one of the most common sites for metastatic cancers. The library consists of 462,336 IR spectra from eight imaging experiments with 7 different patients. The tissues were collected with permission from patients having liver tumor resections (IRB #2011C0085, reapproved Aug. 21, 2014). The library is unique because the tissues are snap frozen and prepared for the IR microscope without the standard fixation (neutral buffered formalin solution, then dehydration with a sequence of graded ethanols, xylene, and finally paraffin). By only snap freezing, the natural fats are retained in the tissues, and thus denaturation of the proteins is significantly lessened. The IR imaging spectra have a 750-4000 cm.sup.−1 range, a spectral resolution of 4 cm.sup.−1, a spatial resolution of approximately six μm, and require about 15 hours of scanning to cover a 2 mm by 2 mm area of a tissue slice. The purpose of this library is to enable data mining for the development of the best IR metrics. The first 11 principal components of the CCML library were already displayed in
(90) Comparing Different Metrics with CCML Library.
(91) An example of the quantitative comparison used eight metrics chosen from the different types: 1) peak ratio at 1024 and 1080 cm.sup.−1, 2) peak ratio at 1032 and 1000 cm.sup.−1 which is M3, the best from the ATR-FTIR study described above, 3) the baseline correction curvature, 4) the norm after baseline correction, 5) principal component #6, 6) principle component #10, 7) the calibrant glycogen, and 8) the calibrant triglyceride. Previously, the probe experiments were performed on either a tumor or nontumor region; however, this is not known in advance in FTIR imaging microscope work. This process proceeds with a 25 cluster k-means cluster analysis on the whole CCML Library, which identifies each image pixel with one of 25 clusters along and then allows calculation of each cluster's IR spectrum. Several clusters were chosen and used to construct an X1 matrix for nontumor and several other cluster groups were used to construct an X2 matrix for tumor. These are subsets of the X matrix. The weights of each of the eight metrics were optimized for separation of tumor and nontumor in this example. It should be understood that the results pertain to the specific choice of clusters groups identified as tumor and nontumor. If one uses a different choice, one obtains different results. The scores of each metric were plotted as greyscale images for comparison to an H&E stain for a particular case (Case 8) in
(92) TABLE-US-00002 TABLE 2 Results of optimization with eight weighted metrics using the CCML Library C1.sub.j, C2.sub.j, Nontumor Tumor α.sub.j Weighted Weighted Nontumor Tumor Metric j Importance Weight Centroid Centroid Measured Measured 1 peak ratio 1024/1080 0.0000 0.0000 0.0000 0.0000 0.7576 0.3718 2 peak ratio 1032/1000 0.5181 0.0723 0.1962 0.3541 2.7147 4.8987 3 BC curvature 0.0000 −0.4355 0.0000 0.0000 0.0000 0.0000 4 norm after BC 0.2899 0.0510 0.3862 0.2979 7.5734 5.8413 5 PC #6 0.0267 0.0655 −0.0058 0.0023 −0.0883 0.0358 6 PC #10 0.0142 0.1118 −0.0046 −0.0003 −0.0410 −0.0023 7 calibrant glycogen 0.0845 −0.8389 −0.5127 −0.4869 0.6112 0.5805 8 calibrant triglyceride 0.0665 0.2863 0.1279 0.1076 0.4467 0.3760
(93) Note that these calculations pertain to whole CCML Library even though images are shown from only one of the cases.
(94) Beating the Curse of Dimensionality.
(95) The use of a large number of metrics helps with subtle discriminations between groups, but imposes great difficulty in visualization of the hyperdimensional results. The equations for projection of hyperdimensional metrics onto a line between the centroids of two chosen clusters were given above.
(96) Using the same case as with
(97) Metrics Across Individuals.
(98) Our methods enable one to search and identify metrics that are common to individuals, as well as those that vary with individuals. The CCML Library is a unique source of tissue information because the samples are not fixed. These samples still contain lipid and water and are likely to have less denatured protein. While many of the previous examples focus on one particular case, it is fruitful to extract information from all of the cases. We performed a 25 cluster preliminary k-means cluster analysis on the whole CCML Library and the same five peak ratios metrics (1744/1244, 2924/1544, 1016/1080, 1744/1162, and 1080/3290) as shown in
Example 2—SVM Results of Proof of Concept Probe Experiment on Two Cases of Colorectal Cancer Metastatic Two the Liver Showing Good Separation
(99) Referring back to
d=b+Σα.sub.i(S.sub.i,′|∫.sub.j(T.sub.j+o.sub.j))i
where i is an index over the support vectors, j is an index over the metrics, b is the bias constant, αi are the weights, S.sub.i,j′ are the scaled support vectors, f.sub.j is a multiplicative scaling factor, o.sub.j is an offset, and T.sub.j is the metric data to be tested. Test data is part of a group, such as nontumor or tumor, if d>0 (we call this a soft margin criteria, i.e. if data is on one side or other of the d=0 line). The SVM hard statistical margin extends from −1 to 1 as indicated with dotted vertical lines in
(100) Statistical aspects of this data analysis involve the number of: (i) cases, (ii) measurements per case, (iii) and wavelengths used to obtain a confident decision. Concerning (i), collection from more than two cases is essential and proposed. Concerning (ii), each of the 57 measurements required 2.0 min or 2.50 s per wavelength step. But a traditional FTIR system simultaneously makes measurements at all wavelengths, so there is no option to measure at a selected set of wavelengths. However, a QCL system could make measurements 10 times faster at the same S/N (Daylight Solutions) and at only 6 out of the effective 451 wavelengths. Together this is a possible gain in speed factor of ˜750. Measurements of 0.25 s per wavelength step for 6 wavelengths would take 1.5 s, hence the use of the word “Fast” in the disclosed invention. Concerning (iii), the
(101) However, there is a caveat that broadly tunable QCL systems currently contain four separate QCLs that program like one device—each with its own range, as shown in
(102) Recalling
(103) Proof-of-Concept Probe Experiment with Skin Cancer.
(104) 5.4 million cases of nonmelanoma skin cancer (basal and squamous cell carcinoma) were treated in 3.3 million people in the US in 2012. There are more new cases of skin cancer than the combined incidence of breast, prostate, lung and colon cancer, but they are not tracked by the central cancer registries since they are not as lethal. One in five Americans will develop skin cancer over their lifetime and the rates are increasing.
(105) The SKH1 mouse model of UV-induced cutaneous squamous cell carcinoma (SCC) is an accepted murine model for studying SCC development in that it very closely recapitulates the human disease. This disease involves multiple cell types including keratinocytes, fibroblasts, endothelial cells, and infiltrating immune and inflammatory cells and the interplay between these cell types during UV-mediated skin cancer initiation, promotion, and progression and it cannot be modeled ex vivo. Under the University PHS Welfare Assurance number, A3261-01, and our IACUC protocol (2010A00000083), we recorded FTIR probe spectra (4 cm.sup.−1 resolution, 700-1800 cm.sup.−1, 1101 steps of 1 cm.sup.−1, 25 scans, PE Spectrum 100) on one live SKH1 mouse with skin tumors in week 16 after 10 initial weeks of UV treatments as shown in
d=bias+Σαi
where i is an index over the support vectors, Tj are the metrics to be tested, and S.sub.i,j′, bias, α.sub.i, f.sub.j, and o.sub.j are outputs of the SVM program. The histogram of d values is given in
(106) Statistical aspects of this data analysis again involve the number of: (i) cases, (ii) measurements per case, (iii) and wavelengths used to obtain a confident decision. We have proposed a preclinical SKH1 mouse study which will get data on 5 different mice (with 5 controls) and likely more than 25 tumors. In this experiment (see
(107)
(108) The QCL tunable laser system offers flexibility enabling adaptation to harder problems. In proposed preclinical trials, we are devising an experiment where the probe can be tested on cancerous lesions, noncancerous lesions, and normal skin, i.e. there are three groups to discern, not two. Also, SKH1 mice can have tumors analyzed for whether they are benign or malignant. These discriminations might require more wavelengths and the QCL system affords that possibility.
(109) Merged IR and H&E Metrics.
(110) Since H&E imaging provides the most common standard for evaluating the surgeon's margin in liver resections, it is important to correlate our new IR metrics with those extracted from H&E imaging. This speaks to the medical community's acceptance of new IR metrics. We uniquely merge these metrics at input before performing SVM routines and our results correlate the IR with the H&E metrics. Experimentally this means performing tissue slice imaging on the exact same tissue with both H&E and IR. Such studies are proposed in parallel with the probe experiments mentioned in the previous two sections. With success, our new IR work will be connected to the current common standard for diagnosing cancer in liver resections. As an example, consider a region of tissue slice in
(111) Statistical considerations are different with tissue slice imaging as compared to probe measurements. If 10 consenting patients yield a 2 mm×2 mm tissue slice each, there will be 302 pixels by 320 pixel (at 6.25 μm per pixel) yielding 102,400 IR spectra. The H&E stain images of the same region will have at least twice as many pixels and this could be 20 times more (at 0.5 μm per pixel). So the libraries of IR spectra will involve more than a million IR spectra with typically 1626 wavelength steps with our instrument (PE Spotlight 300). The challenges here concern performing SVM on such large data sets and in obtaining classifications for training. Our desktop computers allow about 2000 metric sets in the group and 2000 out of the group which are chosen at random for SVM training. Then the resulting decision equation is run on the whole set of 102,400 pixels as shown in the histograms of
(112) While embodiments of the system and methods described herein are generally described in relation to aid in resections of cancer tissues, additional applications reside in detecting skin cancer in a clinical setting. Skin cancer is the most common of all cancer types according to the American Cancer Society.
(113) While the methods and systems have been described in connection with preferred embodiments and specific examples, it is not intended that the scope be limited to the particular embodiments set forth, as the embodiments herein are intended in all respects to be illustrative rather than restrictive.
(114) Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic with respect to arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; the number or type of embodiments described in the specification.
(115) Throughout this application, various publications may be referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which the methods and systems pertain.
(116) It will be apparent to those skilled in the art that various modifications and variations can be made without departing from the scope or spirit. Other embodiments will be apparent to those skilled in the art from consideration of the specification and practice disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit being indicated by the following claims.