EVALUATION OF BRAIN TISSUE AND MATERIAL BASED ON A FRACTION-PRODUCT AND OPTICAL SPECTROSCOPY
20230175955 · 2023-06-08
Inventors
Cpc classification
G01N21/31
PHYSICS
A61B5/4088
HUMAN NECESSITIES
A61B5/7264
HUMAN NECESSITIES
G16H50/00
PHYSICS
International classification
Abstract
The methods, apparatuses, computer-readable media, and systems described enable regions of electromagnetic spectra that may distinguish different biological specimens to be determined. Regions of electromagnetic spectra that distinguish known biological specimens then become candidates for methods to classify unknown specimens and/or make a medical diagnosis. A fraction-product, determined from two arrays associated with two groups, may be used to determine optimal discriminants for the two groups given numerical measurements of particular properties of the members of both groups.
Claims
1. A computer-implemented method comprising: receiving, by a computing device, a first array of numbers and a second array of numbers, wherein the first array and the second array are associated with a common index set, wherein the common index set is associated with a plurality of index elements; determining: a) at each index element of the plurality of index elements, for the first array and the second array, a median of numeric values of members of the respective array, b) at each index element of the plurality of index elements, based on an average of the two median values determined at (a), a taxonomic cut-off value, c) for the respective array associated with the greater median value between the two median values, a fraction of the members of the respective array whose numeric values exceed the taxonomic cut-off value determined at (b), and d) for the respective array associated with the lesser median value between the two median values, the fraction of the members of the respective array whose numeric values are less than the taxonomic cut-off value determined at (b); determining, at each index element of the plurality of index elements, a respective fraction-product value, wherein the respective fraction-product value is based on a product of the respective value determined at step (c) and the respective value determined at step (d); determining, based on each of the respective fraction-product values, a third array of numbers associated with the common index set; and determining, based on the third array, one or more optimal discriminants for separating a first group and a second group, wherein the first group is associated with the first array and the second group is associated with the second array, wherein the one or more optimal discriminants are equal to the largest value of the third array.
2. The computer-implemented method of claim 1, wherein determining the one or more optimal discriminants comprises: 1) selecting the largest fraction-product value of the fraction-product values associated with the third array, 2) selecting index elements of the third array associated with the largest fraction-product value; 3) assessing, based on the index elements of the third array selected at step (2), a separation of the first group and the second group; 4) if the separation is inadequate, determining a fraction-product value less than the largest fraction-product value; 5) selecting index elements of the third array associated with the fraction-product value less than the largest fraction-product value; 6) assessing, based on the index elements of the third array selected at step (5), a separation of the first group and the second group; and 7) if the separation is inadequate, repeating steps (4)-(7).
3. The computer-implemented method of claim 1, wherein the first array and the second array are associated with optical spectra.
4. The computer-implemented method of claim 1, further comprising: determining that the fraction-product value exceeds a value for one or more contiguous elements of the common index set; and selecting, based on the fraction-product value exceeding the value for the one or more contiguous elements of the common index set, a feature of optical spectra.
5. An apparatus comprising: one or more processors; and memory storing processor-executable instructions that, when executed by the one or more processors, cause the apparatus to: receive, a first array and a second array, wherein the first array and the second array are associated with a common index set, wherein the common index set is associated with a plurality of index elements; determine, at each index element of the plurality of index elements, a respective median value within the first array and a respective median value within the second array; determine, for each index element of the plurality of index elements, a respective cut-off value, wherein the respective cut-off value comprises an average value of the respective median values within the first array and the respective median value within the second array; determine: a) for either the first array or the second array, based on the higher value of the respective cut-off values, a fraction of each index element of the plurality of index elements with a respective value higher than the respective cut-off value, and b) for either the first array or the second array, based the lower value of the respective cut-off values, a fraction of each index element of the plurality of index elements with a respective value lower than the respective cut-off value, determine, at each index element of the plurality of index elements, a respective fraction-product value, wherein the respective fraction-product value is based on a product of the respective value determined at step (a) and the respective value determined at step (b); determine, based on each of the respective fraction-product values, a third array associated with the common index set; and determine, based on the third array, one or more optimal discriminants for separating a first group and a second group, wherein the first group is associated with the first array and the second group is associated with the second array, wherein the one or more optimal discriminants are equal to the largest value of the third array.
6. The apparatus of claim 3, wherein the processor-executable instructions that cause the apparatus to determine the one or more optimal discriminants, further cause the apparatus to: 1) select the largest fraction-product value of the fraction-product values associated with the third array, 2) select index elements of the third array associated with the largest fraction-product value; 3) assess, based on the index elements of the third array selected at step (2), a separation of the first group and the second group; 4) if the separation is inadequate, determine a fraction-product value less than the largest fraction-product value; 5) select index elements of the third array associated with the fraction-product value less than the largest fraction-product value; 6) assess, based on the index elements of the third array selected at step (5), a separation of the first group and the second group; and 7) if the separation is inadequate, repeat steps (4)-(7).
7. One or more computer-readable media storing processor-executable instructions that, when executed by at least one processor, cause at least one processor to: receive, a first array and a second array, wherein the first array and the second array are associated with a common index set, wherein the common index set is associated with a plurality of index elements; determine, at each index element of the plurality of index elements, a respective median value within the first array and a respective median value within the second array; determine, for each index element of the plurality of index elements, a respective cut-off value, wherein the respective cut-off value comprises an average value of the respective median value within the first array and the respective median value within the second array; determine: c) for either the first array or the second array, based on the higher value of the respective median values, a fraction of each index element of the plurality of index elements with a respective value higher than the respective cut-off value, and d) for either the first array or the second array, based on the lower value of the respective median values, a fraction of each index element of the plurality of index elements with a respective value lower than the respective cut-off value, determine, at each index element of the plurality of index elements, a respective fraction-product value, wherein the respective fraction-product value is based on a product of the respective value determined at step (a) and the respective value determined at step (b); determine, based on each of the respective fraction-product values, a third array associated with the common index set; and determine, based on the third array, one or more optimal discriminants for separating a first group and a second group, wherein the first group is associated with the first array and the second group is associated with the second array, wherein the one or more optimal discriminants are equal to a largest value of the third array.
8. The one or more computer-readable media of claim 5, wherein the processor- executable instructions that cause the at least one processor to determine the one or more optimal discriminants, further cause the at least one processor to: 1) select the largest fraction-product value of the fraction-product values associated with the third array, 2) select index elements of the third array associated with the largest fraction-product value; 3) assess, based on the index elements of the third array selected at step (2), a separation of the first group and the second group; 4) if the separation is inadequate, determine a fraction-product value less than the largest fraction-product value; 5) select index elements of the third array associated with the fraction-product value less than the largest fraction-product value; 6) assess, based on the index elements of the third array selected at step (5), a separation of the first group and the second group; and 7) if the separation is inadequate, repeat steps (4)-(7).
9. A computer-implemented method comprising: receiving, by a computing device, first optical spectra and second optical spectra; determining, based on an average of median values of spectral intensity for wavelengths present in each of the first optical spectra and each of the second optical spectra, a diagnostic cut-off value; determining, based on the diagnostic cut-off value, a first quantity of the first optical spectra and a second quantity of the second optical spectra; determining, based on the first quantity and the second quantity, a discriminant statistic; and selecting, based on discriminant statistic values, a threshold that selects index elements having data that satisfactorily separate a group of subjects corresponding to the first optical spectra and a second group of subjects corresponding to the second optical spectra, the index elements being candidate optical discriminants.
10. The computer-implemented method of claim 9, further comprising, determining a subset of the candidate optical discriminants; and determining a principal component analysis (PCA) transformation to a basis that reduces the dimensionality of the subject of the candidate optical discriminants; and reducing the dimensionality of the subject of the candidate optical discriminants.
11. The computer-implemented method of claim 10, further comprising, receiving an optical spectrum corresponding to a subject; and designating the subject as pertaining to a group of normal subjects or a group of non-normal subjects by applying, based on the reduced subset of the candidate optical discriminants, the PCA transformation to the optical spectrum.
12. The computer-implemented method of claim 9, wherein the first optical spectra are associated with one or more first specimens having a medical condition, and wherein the second optical spectra are associated with one or more second specimens not having the medical condition.
13. The computer-implemented method of claim 12, wherein the medical condition comprises Alzheimer's disease.
14. The computer-implemented method of claim 12, wherein the medical condition comprises one or more Lewy bodies in brain tissue of the one or more first specimens.
15. The computer-implemented method of claim 12, wherein the medical condition comprises Gulf War Illness.
16. The computer-implemented method of claim 9, further comprising: determining one or more wavelengths present in each of the first optical spectra and one or more wavelengths present in each of the second optical spectra; and determining, based on the one or more wavelengths present within each of the first optical spectra and the second optical spectra, the median value of spectral intensity for each of the first optical spectra and the second optical spectra.
17. The computer-implemented method of claim 9, wherein the first quantity of the first optical spectra comprises median values of spectral intensity that are less than or equal to the diagnostic cut-off value.
18. The computer-implemented method of claim 9, wherein the second quantity of the second optical spectra comprises median values of spectral intensity that are less than or equal to the diagnostic cut-off value.
19. The computer-implemented method of claim 9, wherein the threshold value comprises a product of the first quantity and the second quantity.
20. The computer-implemented method of claim 19, wherein the threshold value is 0.45.
21. An apparatus comprising: one or more processors; and memory storing processor-executable instructions that, when executed by the one or more processors, cause the apparatus to: receive first optical spectra and second optical spectra; determine, based on an average of median values of spectral intensity for wavelengths present in each of the first optical spectra and each of the second optical spectra, a diagnostic cut-off value; determine, based on the diagnostic cut-off value, a first quantity of the first optical spectra and a second quantity of the second optical spectra; determine, based on the first quantity and the second quantity, a discriminant statistic; and select, based on discriminant statistic values, a threshold that selects index elements having data that satisfactorily separate a group of subjects corresponding to the first optical spectra and a second group of subjects corresponding to the second optical spectra as candidate optical discriminants.
22. The apparatus of claim 21, wherein the first optical spectra are associated with one or more first specimens having a medical condition, and wherein the second optical spectra are associated with one or more second specimens not having the medical condition.
23. A computer-implemented method, comprising: receiving an optical spectrum corresponding to a subject; designating the subject as pertaining to a group of normal subjects or a group of non-normal subjects by applying a principal component analysis (PCA) transformation to the optical spectrum, the PCA transformation reduces a dimensionality of a set of candidate optical discriminants.
24. The computer-implemented method of claim 23, wherein the group of normal subjects comprises at least one subject not afflicted by a neurological medical condition, and wherein the group of non-normal subjects comprises at least one subject afflicted by the neurological medical condition.
25. The computer-implemented method of claim 23, further comprising, determining the set of the candidate optical discriminants; and determining the PCA transformation to a basis that reduces the dimensionality of the set of the candidate optical discriminants.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] The accompanying drawings, which are incorporated in and constitute a part of this specification, together with the description, serve to explain the principles of the methods and systems:
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
DETAILED DESCRIPTION
[0035] As used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Ranges may be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another configuration includes from the one /particular value and/or to the other particular value. When values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another configuration. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint.
[0036] “Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes cases where said event or circumstance occurs and cases where it does not.
[0037] Throughout the description and claims of this specification, the word “comprise” and variations of the word, such as “comprising” and “comprises,” means “including but not limited to,” and is not intended to exclude other components, integers or steps. “Exemplary” means “an example of” and is not intended to convey an indication of a preferred or ideal configuration. “Such as” is not used in a restrictive sense, but for explanatory purposes. Further, the words “diagnostic,” “classification,” and “taxonomic” are synonyms of one another and, thus, are used interchangeably.
[0038] It is understood that when combinations, subsets, interactions, groups, etc. of components are described that, while specific reference of each various individual and collective combinations and permutations of these may not be explicitly described, each is specifically contemplated and described herein. This applies to all parts of this application including, but not limited to, steps in described methods. Thus, if there are a variety of additional steps that may be performed it is understood that each of these additional steps may be performed with any specific configuration or combination of configurations of the described methods.
[0039] As will be appreciated by one skilled in the art, hardware, software, or a combination of software and hardware may be implemented. Furthermore, a computer program product on a computer-readable storage medium (e.g., non-transitory) having processor-executable instructions (e.g., computer software) embodied in the storage medium. Any suitable computer-readable storage medium may be utilized including hard disks, CD-ROMs, optical storage devices, magnetic storage devices, memresistors, Non-Volatile Random Access Memory (NVRAM), flash memory, or a combination thereof
[0040] Throughout this application reference is made to block diagrams and flowcharts. It will be understood that each block of the block diagrams and flowcharts, and combinations of blocks in the block diagrams and flowcharts, respectively, may be implemented by processor-executable instructions. These processor-executable instructions may be loaded onto a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the processor-executable instructions which execute on the computer or other programmable data processing apparatus create a device for implementing the functions specified in the flowchart block or blocks.
[0041] These processor-executable instructions may also be stored in a computer-readable memory that may direct a computer or other programmable data processing apparatus to function in a particular manner, such that the processor-executable instructions stored in the computer-readable memory produce an article of manufacture including processor-executable instructions for implementing the function specified in the flowchart block or blocks. The processor-executable instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer-implemented process such that the processor-executable instructions that execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart block or blocks.
[0042] Accordingly, blocks of the block diagrams and flowcharts support combinations of devices for performing the specified functions, combinations of steps for performing the specified functions and program instruction means for performing the specified functions. It will also be understood that each block of the block diagrams and flowcharts, and combinations of blocks in the block diagrams and flowcharts, may be implemented by special purpose hardware-based computer systems that perform the specified functions or steps, or combinations of special purpose hardware and computer instructions.
[0043] This detailed description may refer to a given entity performing some action. It should be understood that this language may in some cases mean that a system (e.g., a computer) owned and/or controlled by the given entity is actually performing the action.
[0044] Methods, apparatuses, computer-readable media, and systems for evaluating brain tissue and material based on a fraction-product are described. Regions of electromagnetic spectra may be determined and used to distinguish different biological specimens (e.g., a specimen with Alzheimer's disease vs. a normal (age-matched) specimen, a specimen with Lewy bodies vs. a specimen without Lewy bodies, a specimen affected by Gulf War Illness (GWI) vs. specimens unaffected by GWI, etc.). Regions of electromagnetic spectra that distinguish known biological specimens then become candidate discriminants for methods to classify unknown specimens, that is, to make a medical diagnosis. The methods described herein may be used to evaluate and process data (e.g., spectral data) to analyze brain tissue and material based on a fraction-product, such as brain tissue and material associated with different biological specimens (e.g., a specimen with Alzheimer's disease vs. a normal (age-matched) specimen, a specimen with Lewy bodies vs. a specimen without Lewy bodies, a specimen affected by Gulf War Illness (GWI) vs. specimens unaffected by GWI, etc.).
[0045]
[0046] At block 102, data records (and/or datasets) may first be collected to determine relevant variables. For example, data records for two groups (group A and group B) of specimens that are known to differ in a significant manner (e.g., disease vs. non-disease, Alzheimer's disease vs normal, Lewy bodies in brain tissue vs. no Lewy bodies in brain tissue, Gulf War illness identified in blood samples vs. no Gulf War illness identified in blood samples, etc.). Data records may be collected by any appropriate technique. For example, data records may be collected from specimens, services, an application, an entity, and/or the like.
[0047] At block 104, the data records may be pre-processed to remove obvious erroneous or inconsistent data records. Such pre-processing may be referred to as data normalization.
[0048] At block 106, pre-processed data may be provided to an algorithm, such as a fraction-product algorithm. The fraction-product algorithm enables the determination/identification of the features, such as features associated with optical spectra that may be useful for discriminating between two groups, and the adjustment of parameters of any model used afterward. For example, the fraction-product algorithm may be used to reduce any number of potential parameters to a desired subset of parameters. The reduced subset of variables may be used to create accurate data models.
[0049] At block 108, the reduced subset of variables may further be outputted to a data storage for later retrieval.
[0050] At block 110, the reduced subset of variables may be outputted to other application software programs to further analyze and/or model the data set. For example, three-dimensional (3D) and two-dimensional (2D) plots of spectral features may be determined, previously undiscoverable distinctions between two groups may be determined, and spectral aspects of brain tissue and material may be evaluated. The fraction-product described herein provides a statistical argument that reduces the need for “discovery” and “test” sets within volumes of data. For example, for optical spectra, different optical features entail distinct materials. Therefore, the use of the fraction-product described herein enables certain inferences without a test set. Application software programs may include any appropriate type of data processing software program. Blocks 102-110 of the flowchart 100 may be performed by one or more computer systems.
[0051]
[0052] The computing device 201 may comprise one or more processors 203, a system memory 212, and a bus 213 that couples various components of the computing device 201 including the one or more processors 203 to the system memory 212. In the case of multiple processors 203, the computing device 201 may utilize parallel computing.
[0053] The bus 213 may comprise one or more of several possible types of bus structures, such as a memory bus, memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.
[0054] The computing device 201 may operate on and/or comprise a variety of computer-readable media (e.g., non-transitory). Computer-readable media may be any available media that is accessible by the computing device 201 and comprises, non-transitory, volatile and/or non-volatile media, removable and non-removable media. The system memory 212 has computer-readable media in the form of volatile memory, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM). The system memory 212 may store data such as spectral analysis data 207 and/or program modules such as operating system 205 and fraction-product determination software 206 that are accessible to and/or are operated on (e.g., executed) by the one or more processors 203.
[0055] The computing device 201 may also comprise other removable/non-removable, volatile/non-volatile computer storage media. The mass storage device 204 may provide non-volatile storage of computer code, computer-readable instructions, data structures, program modules, and other data for the computing device 201. The mass storage device 204 may be a hard disk, a removable magnetic disk, a removable optical disk, magnetic cassettes or other magnetic storage devices, flash memory cards, CD-ROM, digital versatile disks (DVD) or other optical storage, random access memories (RAM), read-only memories (ROM), electrically erasable programmable read-only memory (EEPROM), and the like.
[0056] Any number of program modules may be stored on the mass storage device 204. An operating system 205 and fraction-product determination software 206 may be stored on the mass storage device 204. One or more of the operating system 205 and fraction-product determination software 206 (or some combination thereof) may comprise program modules and the fraction-product determination software 206. The fraction-product determination software 206 can include multiple components that, in response to being executed by one or more processors, can perform analyses of optical spectroscopic data in accordance with aspects of this disclosure. Analysis of the optical spectroscopic data results in spectral analysis data 207. Spectral analysis data 207 may also be stored on the mass storage device 204. Spectral analysis data 207 may be stored in any of one or more databases known in the art. The databases may be centralized or distributed across multiple locations within the network 215.
[0057] Optical spectroscopic data can be acquired in vivo using optical spectroscopy equipment coupled (optically and mechanically) to an optical probe device 230 attached to the subject 240. A room where the subject is located during optical measurements can be darkened during optical spectroscopy experiments. The optical probe device 230 is herein referred to as template probe and is attached to the temple region of a subject 240. The optical spectroscopy equipment 220 can include a light source device (such as a tungsten lamp), an optical spectrograph, and a light detector device. Optical coupling between the optical spectroscopy equipment 220 and the optical probe device 230 can be provided by a source optical fiber 224 and a readout optical fiber 226. The optical probe device 230 can have multiple openings, where a first opening of the multiple openings can receive or otherwise engage the source optical fiber 224 and a second opening of the multiple openings can receive or otherwise engage the readout optical fiber 226. By selecting the first opening and second opening, a particular source-readout (or source-detector) separation can be configured. Examples of source-readout separation include 10 mm, 15 mm, 20 mm, 25 mm, and 30 mm. In this disclosure, optical spectroscopic data (or optical spectroscopy data) also can be referred to as spectroscopic data. Several subjects 240 can be probed using optical spectroscopy in accordance with aspects described herein. In some scenarios, a first group of the several subjects 240 can be afflicted by a neurological medical condition (e.g., a neuropathologic condition, such as AD) and a second group of the several subjects 240 may be a control group of subjects not afflicted by the neurological medical condition.
[0058] In some cases, the computing device 201 can obtain the spectroscopic data directly from the optical spectroscopy equipment 220. In such cases, the fraction-product determination software 206 also can include one or more components that permit controlling the acquisition of optical spectroscopic data in vivo from the subject 240. In other cases, the optical spectroscopy equipment 220 can retain the spectroscopic data obtained from the subject 240 within one or more memory devices accessible to the computing device 201. For example, such spectroscopic data can be retained within memory device(s) hosted by one or more of remote computing devices 214a,b,c. In such other cases, the computing device 201 can download or otherwise obtain the optical spectroscopic data from the one or more memory devices that retain the optical spectroscopic data.
[0059] A user may enter commands and information into the computing device 201 via an input device (not shown). Such input devices comprise, but are not limited to, a keyboard, pointing device (e.g., a computer mouse, remote control), a microphone, a joystick, a scanner, tactile input devices such as gloves, and other body coverings, motion sensor, and the like These and other input devices may be connected to the one or more processors 203 via a human-machine interface 202 that is coupled to the bus 213, but can be connected by other interface and bus structures, such as a parallel port, game port, an IEEE 1394 Port (also known as a Firewire port), a serial port, network adapter 208, and/or a universal serial bus (USB).
[0060] A display device 211 may also be connected to the bus 213 via an interface, such as a display adapter 209. It is contemplated that the computing device 201 may have more than one display adapter 209 and the computing device 201 may have more than one display device 211. A display device 211 may be a monitor, an LCD (Liquid Crystal Display), a light-emitting diode (LED) display, a television, smart lens, smart glass, and/ or a projector. In addition to the display device 211, other output peripheral devices may comprise components such as speakers (not shown) and a printer (not shown) which may be connected to the computing device 201 via Input/Output Interface 210. Any step and/or result of the methods may be output (or caused to be output) in any form to an output device. Such output may be any form of visual representation, including, but not limited to, textual, graphical, animation, audio, tactile, and the like. The display 211 and computing device 201 may be part of one device, or separate devices.
[0061] The computing device 201 may operate in a networked environment using logical connections to one or more remote computing devices 214a,b,c. A remote computing device 214a,b,c may be a personal computer, computing station (e.g., workstation), portable computer (e.g., laptop, mobile phone, tablet device), smart device (e.g., smartphone, smartwatch, activity tracker, smart apparel, smart accessory), security and/or monitoring device, a server, a router, a network computer, a peer device, edge device or other common network nodes, and so on. Logical connections between the computing device 201 and a remote computing device 214a,b,c may be made via a network 215, such as a local area network (LAN) and/or a general wide area network (WAN). Such network connections may be through a network adapter 208. A network adapter 208 may be implemented in both wired and wireless environments. Such networking environments are conventional and commonplace in dwellings, offices, enterprise-wide computer networks, intranets, and the Internet.
[0062] Application programs and other executable program components such as the operating system 205 are shown herein as discrete blocks, although it is recognized that such programs and components may reside at various times in different storage components of the computing device 201, and are executed by the one or more processors 203 of the computing device 201. An implementation of fraction-product determination software 206 may be stored on or sent across some form of computer-readable media. Any of the disclosed methods may be performed by processor-executable instructions embodied on computer-readable media.
I. Classification of Human Neuropathology in vivo Using Near-Infrared Optical Spectroscopy
[0063] Diffuse light scattering in tissue can obscure structure within an optical spectrum, which has slowed the application of optical spectroscopy to medical diagnosis. The methods, apparatuses, computer-readable media, and systems described herein implement a novel discriminant statistic, called the “fraction-product,” that helps to discover regions of optical spectra that are useful for classifying subjects (taxonomic signal).
[0064] First-order analysis with the fraction-product reveals an optical spectroscopic feature near 861 nm that not only distinguishes those with and without a neuropathologic condition but also classifies them with 93% accuracy. The second-order analysis adds features at 677 and 809 nm. When these three limited regions of the spectrum are examined by principal component analysis, subjects are classified with 100% accuracy.
[0065] Using the fraction-product and optical spectroscopy, the methods, apparatuses, computer-readable media, and systems described herein provide the first successful classification of a human neuropathologic condition—e.g., the presence or absence of Lewy bodies as determined by autopsy—using optical spectra (e.g., near-infrared reflectance) obtained at a subject's temple while the subject is alive.
[0066] The ability to identify taxonomic signals in clinical optical spectra can enable the development of in vivo optical spectroscopy methods for neurological diagnoses, such as demonstrated here for the classification of patients based on the presence or absence of Lewy bodies in the living brain.
A. Introduction
[0067] Following the application of near-infrared (NIR) spectroscopy to the development of pulse oximetry, in 1977 Jobsis reported the measurement in vivo of hemoglobin saturation in a cat's brain with the source fiber on one of the cat's temples and the readout optical fiber on the other one. In that paper, Jobsis also included results with the same optical configuration applied to a human head, which, being so much broader, revealed only a change in blood volume with hyperventilation. Since then most applications of NIR spectroscopy in vivo have concerned estimates of hemoglobin saturation and blood flow, although there has also been work to classify tissues biochemically and to construct images. The problem posed by the thick human head may be mitigated by placing source and readout optical fibers at the same human temple. With such a reflectance configuration, photons diffuse from the source optical fiber into the tissues and scatter back to the readout optical fiber. Time-of-flight and modeling studies confirm the notion that the greater the source-readout separation, the deeper is the mean path of the diffusing photons.
[0068] In contrast to the above-mentioned existing work, the methods, apparatuses, computer-readable media, and systems described herein determine that NIR spectroscopy can distinguish Alzheimer's disease in autopsy samples of temporal isocortex. Efforts to extend this method to living subjects have been hampered by two factors: 1) the inherent tendency of overlying tissues to scatter light, which obscures optical spectroscopic features specific to the brain, and 2) the presence of multiple neuropathologic conditions in the same individual, which create confounding taxonomic signals.
[0069] The methods, apparatuses, computer-readable media, and systems described herein mitigate the problems of tissue scattering and multiple pathologies. The emphasis is placed on identifying regions of the clinical optical spectra useful for classification, rather than on identifying underlying chemistry or morphology (extraction of which may be intractable due to the convolution of scattering and absorption in overlying tissues).
[0070] The “fraction-product” (f.sub.p), described herein is a novel discriminant statistic that aids in the discovery of “taxonomic signal”, that is, regions of the optical spectrum most useful for classification of a subject (or a specimen corresponding to the subject) as pertaining to a group having a neuropathological condition or a group lacking the neuropathological condition. By limiting the analysis to these regions of the optical spectrum, the methods, apparatuses, computer-readable media, and systems described herein can mitigate the tissue scattering problem. The fraction-product is applied to optical spectra from subjects selected to minimize other pathology and to isolate as the major distinguishing factor Lewy bodies, neuropathologic structures whose presence or absence is assessed routinely at autopsy. These optical spectra (data/information) can be acquired at the subject's temple while the subject is alive. The methods, apparatuses, computer-readable media, and systems described herein provide the first successful classification by NIR spectroscopy of a human neuropathologic condition in vivo.
B. An Operational Definition of Fraction-Product f.SUB.p
[0071] The operational definition of the fraction-product f.sub.p, may be described as, wherein given optical spectra of samples from two distinct populations, samples 1 and 2, at each wavelength (represented by a pixel when using a digital light detector) performing the following algorithm/steps: [0072] 1. Determine for sample 1 the median value (M.sub.1) of the spectral feature (or optical discriminant). It is noted that each wavelength can be deemed to be an “atomic” feature that is an element of an optical feature. The optical feature is thus defined by satisfactory atomic features corresponding to respective adjacent pixels, In other words, a collection of adjacent pixels corresponding to respective atomic features forms an optical feature with linewidth. [0073] 2. Determine for sample 2 the median value (M.sub.2) of the spectral feature. [0074] 3. Assuming that M.sub.1 and M.sub.2 correspond to the population values, estimate a classification cutoff as (M.sub.1+M.sub.2)/2. As such, M.sub.1 and M.sub.2 are assumed to be the median values for this data set. By using those medians as estimates of the theoretical population value, it can be asserted that members of the population not included in this data set can be classified. [0075] 4. Determine the fraction of sample 1 on the correct side of the cut-off (such fraction denoted as f.sub.1). In other words, determined the fraction of data points of this class that are on the same side of the cut-off as their class median. [0076] 5. Determine the fraction of sample 2 on the correct side of the cut-off (such fraction denoted as f.sub.2). In other words, determined the fraction of data points of this class that are on the same side of the cut-off as their class median. [0077] 6. Calculate the fraction-product: f.sub.p=f.sub.1×f.sub.2.
[0078] Defining the fraction-product, the algorithm/steps (steps 1-6) presuppose a paradigm in which two distinct groups are studied to devise an optical spectroscopy method (also referred to as spectroscopic method) for classifying unknowns as belonging to one group or the other. A technique for exploratory data analysis is to display the optical spectra from each group in different colors and to look for regions where the colors separate. However, regions where the colors separate may be indistinguishable to the human eye. As such, in some cases, the fraction-product may be used when distinct features may not be apparent to the eye. Accordingly, not only is the application of the fraction-product superior to techniques that rely on human intervention, but the human mind is unable to determine adequate taxonomic signals by mere visual inspection of optical spectra measured in vivo for a subject's brain.
[0079] Although optical spectra are often displayed as intensities, as is used herein the term “feature” includes common transformations. For example, besides utilizing optical spectral intensity, the methods, apparatuses, computer-readable media, and systems described herein also may utilize (via the fraction-product determination software 206 (
[0080] Analysis reveals that when the two distributions at a pixel are completely separated, f.sub.1=f.sub.2=1 and f.sub.p=1. When the two distributions at a pixel essentially coincide, f.sub.1≈f.sub.2≈0.5, and f.sub.p≈0.25. Therefore, f.sub.p ranges from 0.25 to 1.0. Regions of the spectra where every pixel has an f.sub.p value close to 1 can be most useful in classifying subjects. Just as inferential statistics requires an element of judgment, e.g., choosing a level of significance, so will the criteria for choosing a threshold value of f.sub.p such that “f.sub.p>value” means “close to 1.”
[0081] Criteria to be considered are further described. Because the two groups of optical spectra are presumed to be very similar in appearance, embodiments of this disclosure can first determine (via the fraction-product determination software 206 (
[0082] Two important results of the simulations described later herein are the following: (1) for smaller numbers of subjects (N<15), a random simulation matching the physical experiment in pixel number and subject number must be done for comparison; (2) although the value of f.sub.p alone may be significant for N>15, the occurrence of values of f.sub.p close to 1 on contiguous pixels (linewidth) is far more useful for discovering taxonomic signal. Given these results, optical spectroscopic data are compared to a random simulation with corresponding numbers of subjects and pixels.
C. Application of the Discriminant Algorithm to Subjects With and Without Lewy Bodies
[0083] In collaboration with the Boston University Alzheimer's Disease Center, the use of NIR spectroscopy following the methods, apparatuses, computer-readable media, and systems described herein can be used as a tool for understanding neurodegenerative disease. As described later herein, SM3 contains a detailed description of the spectroscopic methods and subject demographics. In some embodiments, light from a tungsten lamp passes through an optical fiber to the subject's temple. For the measurements discussed herein, the readout optical fiber (e.g., readout optical fiber 226 (
[0084] All subjects were patients in the Dementia Special Care Unit of the Edith Nourse Rogers Memorial Veterans Hospital enrolled in a protocol approved by the institutional review board (IRB). All subjects reported came to autopsy, and Lewy bodies were demarcated by anti-synuclein antibodies. In the setting of advanced dementia, two points of background information come into play. First, as mentioned, it is expected that each subject likely has multiple neuropathologic findings, minimally those of Alzheimer's disease and vascular disease. Second, it is a principle of neuropathology that lesions in one area of the brain can cause changes in other areas through transneuronal degeneration. To isolate the presence of Lewy bodies as a factor and to address the issue of multiple pathologies, those subjects with frontotemporal lobar degeneration and obvious infarcts were eliminated. Further control for variations in Alzheimer's pathology was employed by selecting those subjects with the same classification, “high likelihood” by the Reagan criteria. In this selected subset, there are 7 spectra in the group without Lewy bodies and 9 spectra in the group with Lewy bodies, 2 with Lewy bodies in the temporal isocortex probed by the light field in our measurements, and 7 with Lewy bodies elsewhere in the brain. It is reasonable to assume that in advanced dementia, sufficient time has elapsed for transneuronal degeneration effects, if any, to have taken place in the temporal lobe due to Lewy bodies outside it. Therefore, to avoid confounding the effects of transneuronal degeneration with those due to the physical presence of Lewy bodies, the 2 subjects with Lewy bodies in the temporal isocortex are excluded from the analysis. The effect of removing these two subjects is discussed below and later herein (SM8).
[0085] To assess what range values of f.sub.p can assume by chance alone under experimental conditions, 1000 simulations were performed with the numbers of subjects and pixels matching those of the physical experiment and the values analogous to the slope variates generated randomly.
[0086]
Principal Component Analysis and Linear Discriminant Analysis
[0087]
[0088] Linear discriminant analysis (LDA) can be performed on the first two principal components using the fraction-product determination software 206 (
D. Discussion
[0089] In principle, when light from a single source illuminates two distinct materials, the resultant spectra will be distinct, a priori. The fraction-product enables the discovery of the optical spectroscopic features responsible for such distinctions.
[0090] Eight optical spectroscopic slope variates (starred entries in the p-value column in
[0091] Based on the methods, apparatuses, computer-readable media, and systems described herein, the fraction-product can be used (via the fraction-product determination software 206 (
[0092] Lewy bodies were originally described as eosinophilic, proteinaceous structures that were later associated with Parkinson's disease. Since alpha-synuclein was determined to be a major protein component of Lewy bodies, a more general concept of synucleinopathies has emerged, which also includes dementia with Lewy bodies and multiple system atrophy. It has been recognized for well over a century that neurodegeneration starting in one location can affect distant parts of the brain. The staging of Parkinson's disease proposed by Braak and colleagues mentions that the pathology in Stage 6 can affect the superior temporal gyrus; therefore, the region probed by our light field includes tissue known to be affected by the most common synucleinopathy. Other studies that document the effects of synucleinopathy on the temporal cortex are summarized in
[0093] Several observation can me made regarding the results yielded by the systems and methods of this disclosure. First, the successful classification is due in part to the elimination of subjects with other significant pathology such as infarcts and frontotemporal lobar degeneration. The rapid discovery of potentially useful regions of the spectra by the fraction-product leads to optimism that the extension to more complicated cases will be attainable. Second, the number of subjects involved is small. However, p-values for the features around 861 nm and 677 nm indicate statistical significance, and the correct classification of two subjects with Lewy bodies in the temporal isocortex that were not included in the determination of the principal components (see SM8) also supports the expectation that the method will apply to larger samples. The ability to extend this non-invasive approach to studies of the general population underscores its potential for use in clinical screening and, ultimately, diagnosis.
E. Simulations
Simulations 1 (SM1)—The Fraction-Product in the Analysis of Lewy Bodies
[0094] In this analysis, taxonomic signal was searched between one group that had Alzheimer's disease without Lewy bodies (the group consisting of 7 subjects) and another group that had Alzheimer's disease and Lewy bodies in areas of the brain outside the temporal isocortex (that other group also consisted of 7 subjects). As it can be gleaned from
[0095] As mentioned, when light from the same source illuminates two distinct materials, the two resultant spectra are distinct a priori. In practice, especially in biological systems, spectra from different specimens may appear to be so similar as to be indistinguishable. In the instant case, it is unknown in advance whether the parenchyma of the superior temporal gyrus would reveal whether Lewy bodies were present elsewhere in the brain; however, based on exploratory data analyses and the classical process of transneuronal degeneration, it was expected that the parenchyma would differ (see
[0096]
Simulations 2 (SM2)
[0097] To assess the taxonomic signal using the fraction-product, it is first determined how the variate's distribution is affected by chance alone. As mentioned, randomness enters this experimental paradigm through two different routes. First, via two groups that have been drawn from larger populations. Second, a distribution of values from each group occurs at each pixel, and the more pixels, the greater the opportunity for a rare event to occur. Simulations were used to determine how changes in the number of pixels and the number of optical spectra (or number of subjects, in some cases) affect the utility of the fraction-product. As also mentioned, these simulations treat two adjacent pixels as if the distributions at each are completely independent. However, real optical features have linewidth and extend over several contiguous pixels. Applying this fact to the interpretation of the fraction-product greatly increases its utility.
[0098] Simulations can be performed using the fraction-product determination software 206 (
Simulations 3 (SM3)—Behavior of the Fraction-Product for Small Sample Sizes
[0099] Because the estimated cut-off is the average of the two medians, the values of f.sub.1 and f.sub.2 must be greater than 0.5. Therefore, for small sample sizes, the f.sub.p can take on only a few values. For example, if the two groups have only 5 subjects each, then f.sub.1 and f.sub.2 can assume the values 0.6, 0.8, and 1.0 and f.sub.p can take on one of the six values: 0.36, 0.48, 0.6, 0.64, 0.8, 1.0. When the number of subjects is 10 or more, the range of f.sub.p is fine enough that this discreteness is no longer a significant issue. Because of this behavior, for small N it is best to perform simulations with the exact numbers of subjects if N<10. For the data presented here, the results of the simulation for N1=N2=7 are given in
[0100]
[0101] This behavior might limit the utility to use only the value of f.sub.p to assess how likely is a result due to chance. However, it has little impact on using contiguous pixels with f.sub.p above a particular value as an indication that the result is not due to chance.
Simulations 4 (SM4)—Application of the Fraction Product
[0102] The utility of the fraction product lies in its ability to find a taxonomic signal. As with any other inferential statistic, there is flexibility in the selection of cut-offs for significance.
Simulations 5 (SM5)—Subject Demographics
[0103] Subjects were recruited as part of an ongoing project to monitor the progression of neurodegenerative diseases, especially Alzheimer's disease. All subjects were recruited through a process approved by the Institutional Review Board; informed consent was obtained in all cases. Subjects clinically diagnosed with senile dementia of the Alzheimer's type were recruited from the inpatient dementia unit of the Edith Nourse Rogers Memorial Veterans Hospital. All autopsies were performed by one neuropathologist.
[0104] General demographic information is summarized in
Simulations 6 (SM6)—Near-Infrared Reflectance Measurements
[0105] In some embodiments, optical measurements can be performed through fiber optic cables made of silica with a low hydroxyl concentration, with a diameter of 600 μm (source optical fiber) and 200 μm (readout optical fiber) and with a numerical aperture (NA) of 0.22. The disclosure is not limited to optics having such attributes. Other optics can be utilized to collect spectroscopic data in vivo in accordance with aspects of this disclosure. Regardless of the optics utilized to collect the spectroscopic data in vivo, the spectroscopic data can be supplied to a computing device (e.g., computing device 201 (
[0106] At each source-readout separation two spectra were collected, one at each temple. The 25 mm source-detector separation spectra were analyzed. One reason to analyze such spectra is that such a separation is consistent with theory on light propagation in the human head indicating that at this source-detector separation the detected light has propagated through the temporal cortex, which is a site of early and extensive Alzheimer disease involvement. Each spectrum was corrected for acquisition time and background; correction for lamp output and detector response was achieved by a reference spectrum obtained by reflection from barium sulfate (first session) or a Spectralon low reflectance standard (Labsphere, North Sutton, N.H.) (second session). The average spectrum of the two temples was used for data analysis. To calculate the first derivative, the optical spectrum can be smoothed by boxcar averaging and the slope computed as a least-squares fit of a straight line through a region spanning n=11 pixels. The fraction-product determination software 206 (
[0107] It is noted that between the two sessions, there were different light sources, readout optical fiber cables, and methods of correcting for the reference spectrum. These differences support the conclusion that the results truly depend upon the acquired spectra rather than the technical features of the equipment used.
[0108] Simulations 7 (SM7)—Experimental Evidence for an Effect of Synucleinopathies on the Temporal Cortex
[0109] As previously described, it is a fundamental principle of neuropathology that disease in one area of the brain can lead to changes in other areas. Nonetheless, in the case of synucleinopathies, there is also empirical evidence that such effects occur in the temporal cortex regardless of where the pathology may be.
Simulations 8 (SM8)—Classification of Two Subjects Omitted from the Computation of PCA/LDA
[0110] Two subjects with Lewy bodies in the temporal isocortex were omitted from PCA/LDA so as not to confound the optical effects of actual Lewy bodies with those from transneuronal degeneration. When the optical spectra from these two subjects are analyzed (via the the fraction-product determination software 206 (
II. Alzheimer's Pathology Through the Near-Infrared Window
[0111] Near-infrared reflectance spectroscopy may be applied to the human temple. Using feature selection on the first derivative of the normalized optical intensity spectrum, regions around 860 nm and 895 nm that separate subjects who have autopsy-confirmed Alzheimer's disease without significant other pathology from age-matched controls may be determined. Principal component analysis demonstrates that these two wavelengths (or features) also separate mildly cognitively impaired subjects according to the degree of impairment. Linear discriminant analysis reveals that the 895 nm feature plays a greater role in separating mildly impaired subjects from controls (ratio of weights: 1.3), whereas the 860 nm feature is more important for distinguishing mildly impaired from Alzheimer's disease (ratio of weights: 8.2). Clinical trials may be used to validate/confirm the two features as useful for tracking disease progression and may be used to monitor potential therapeutic interventions early in the course of Alzheimer's disease.
[0112] Alzheimer's disease touches millions of families throughout the world and severely burdens all those affected. Research has improved understanding of the cellular pathology of the protein tau and amyloid precursor protein, which answer to the pathognomonic lesions: tangles and plaques. However, little progress has been made towards therapy because the insidious onset of symptoms masks the ongoing irreversible damage until a diagnosis can be established, a situation that leads to the stark observation that “Everyone knows a cancer survivor. Nobody knows an Alzheimer's survivor.”
[0113] The methods, apparatuses, computer-readable media, and systems described herein may be used to assess the effect of early interventions, which could greatly accelerate the development of effective treatment and prevention. Two features of near-infrared reflectance spectra, for example, acquired at the temple, can distinguish subjects with Alzheimer's disease from normal, age-matched controls. Moreover, these two features can classify subjects with mild cognitive impairment according to the degree of severity. This novel approach to analyzing the optical spectra may be applied in other areas, such as drug development, materials design, and semiconductor processing and packaging just to name a few examples. Clinical trials may be used to assess the utility of the methods, apparatuses, computer-readable media, and systems described herein.
[0114] Although most applications of near-infrared spectroscopy to the human head involve oximetry or blood flow, more recent advances have included imaging and co-registration studies. In contrast, the methods, apparatuses, computer-readable media, and systems described herein use optical spectroscopy as a non-invasive method to detect Alzheimer's disease (AD) that is suitable for widespread screening. It is demonstrated that reflectance spectroscopy distinguishes autopsy samples from brains with AD from those without. The approach is extended to living subjects by utilizing the relative transparency of biological tissue to light in the 700-1100 nm range, a region known as the near-infrared window. A standard reflectance configuration may be used with the optical fibers from the source with the same temple placed 25 mm apart from the readout optical fibers that transport reflected signal to a detector. With this configuration, the light field interrogated includes a portion of the superior temporal gyrus in addition to the overlying tissues.
[0115] A general principle, often left unstated, that motivates applications of optical spectroscopy to medical diagnosis is: if the same light source illuminates two distinct materials, the resultant optical spectra are a priori distinct. However, the scattering of light from biological tissues often renders it extremely difficult to discover spectral features that mark the difference. To mitigate this problem, algorithms from the field of feature selection are used to search for those regions of the optical spectra that best distinguishes two groups. Optical spectra from dementia subjects were acquired while they were alive, and only those for whom post mortem examination confirmed the diagnosis of AD were used. Autopsy reports raised another well-known problem: the brains of the elderly often have multiple morbidities such as infarcts and Lewy bodies. To mitigate this problem, seven subjects with AD (NIA-Reagan: high likelihood; Braak neurofibrillary stage: VI) but no other significant pathology were identified—the closest to “pure AD” in practice. Control subjects who volunteered for the Boston University Alzheimer's Disease Center's HOPE protocol, which evaluates the cognitive function of its subjects annually, were used. In some embodiments, the feature used was the first derivative of the normalized optical spectral intensity as a function of wavelength. This feature is herein referred to as the slope variate.
A. Results
[0116] Two feature selection algorithms can be applied to spectroscopic data corresponding to the pure AD and control groups. The fraction-product determination software 206 (
[0117]
[0118] Twelve subjects with mild cognitive impairment (MCI) were also studied and clinically subclassified as more and less severe.
[0119] Examining the MCI scores presented on panel 2020 in
B. Discussion
[0120] Starting with 5 control subjects and 5 subjects with pure AD, two spectral features that distinguish AD from control are determined. Excluding those AD subjects with other significant pathology from the search aided in identifying the features because the additional pathology caused many confounding optical signals. All of the 13 subjects subsequently plotted in
[0121] The fact that PCA shows two clusters of the MCI values that correspond to AD and control groups is important in potential applications. In a physical sense, it means that some MCI patients have brains that are similar to AD brains whereas others have brains similar to controls. These mathematical results are independent of clinical assessments but are consistent with the clinical assessments. Although
[0122] More certain is it that larger clinical studies on MCI patients can determine whether the optical spectral changes track the progression of the disease in a useful manner. The methods, apparatuses, computer-readable media, and systems described herein provide a safe, non-invasive technique for assessing response to treatments in real-time, as the treatment is implemented. An example application scenario is where the signal at 895 nm responds to an intervention that prevents the progression of the 860 nm signal. In that scenario, an Alzheimer's survivor may be identified.
C. Methods
Human Subjects
[0123] Subjects were recruited as part of an ongoing project to monitor the progression of neurodegenerative diseases, especially Alzheimer's disease. All subjects were recruited through a process approved by the Institutional Review Board; informed consent was obtained in all cases. Subjects clinically diagnosed with senile dementia of the Alzheimer's type (SDAT) were recruited from the inpatient dementia unit of the Edith Nourse Rogers Memorial Veterans Hospital. All autopsies were performed by the same neuropathologist. Control subjects and those with mild cognitive impairment had volunteered for the Health Outreach Program for the Elderly (HOPE) of the Boston University Alzheimer's Disease Center. Participants in the HOPE protocol are to undergo cognitive assessment yearly. The results are reviewed and an expert review panel assigns a consensus diagnosis. The control subjects were younger (mean age 76.4 years) than the Alzheimer's subjects (mean age 82.3 years) but it is not believed that this is clinically significant for these data
[0124] Exclusion criteria. 25 dementia subjects came to autopsy. Five were excluded from the analysis described herein. Two had Lewy bodies present in the temporal isocortex, which is included in the light field interrogated by the spectroscopic technique disclosed herein. This adds a confounding factor to the signal. The subjects with Lewy bodies in
Spectroscopy
[0125] Measurements can be made in two sessions, approximately two years apart, for example. The room where a subject being probed is located can be darkened to minimize ambient light. In some embodiments, optical measurements can be made through fiber optic cables made of silica with a low hydroxyl concentration, with a diameter of 600 μm and with an NA of 0.22. The disclosure is not limited to optics having such attributes. Other optics can be utilized to collect spectroscopic data in vivo in accordance with aspects of this disclosure. A plastic template probe (e.g., template probe 1400 (
[0126] The readout fiber conducted the reflected light to an imaging spectrograph (Kaiser Optical Systems, Ann Arbor, Mich., USA) that uses a camera cooled to −50° C. (Andor Technologies, South Windsor, Conn., USA) as light detector device. A 20 W tungsten lamp (Ocean Optics, Dunedin, Fla., USA) served as light source in the optical measurements. At each source-readout (or source-detector) separation, two optical spectra were collected, one at each of the subject's temple. The 25 mm source-detector separation optical spectra were analyzed. The disclosure, however, is not limited to analysis for such source-detector separation. Each spectrum can be corrected for acquisition time and background; correction for lamp output and detector response was achieved by a reference spectrum obtained by reflection from barium sulfate (first session) or a Spectralon low reflectance standard (second session). To calculate the first derivative of the spectral intensity at a particular pixel, the optical spectrum can be smoothed by boxcar averaging and a slope at the particular pixel was determined by performing a least-squares fit of a straight line through a region of 11 pixels about the particular pixel and then assigning the slope of the straight line to the slope at the particular pixel. That number of pixels is simply illustrative and more or fewer that 11 pixels can be considered. A computing device, e.g, the computing device 201 (
Feature Selection
[0127] Although the general principle is to select those features that maximally separate the groups of interest, various approaches can be utilized to achieve this. An approach is to maximize the Mahalanobis distance among the groups of interest, and for the univariate case with two groups, this reduces to the t-statistic calculated on the two means and variances. At each pixel, the t-statistic was determined for the pure AD group and control group; those features (or pixels) with the highest t-values became candidates to be used for classification. However, many high t-values were due to outliers and were rejected as useful features. To solve this problem, the methods, apparatuses, computer-readable media, and systems described herein implement a median-based approach in which the two group medians were determined and an estimated cut-off determined by the average of the two group medians. The fraction of each group correctly classified was determined and their product was taken as the measure of separation between the two groups. The median-based approach provides the same results as the t-statistics but with much more efficiency. Because optical phenomena have linewidth, features that show significant efficacy over several contiguous pixels were used/desired. The methods, apparatuses, computer-readable media, and systems described herein suggest regions around 895 nm and 860 nm as giving the best separation of AD from control. A computing device, e.g, the computing device 201 (
Statistical Analysis
[0128] As is described herein, the fraction-product determination software 206 (
[0129] In view of the aspects described herein, example methods that may be implemented in accordance with this disclosure can be better appreciated with reference, for example, to the flowcharts in
[0130]
[0131] In some embodiments, the computing device that implements the example method 2100 may be embodied in, or can constitute, the computing device 201 (
[0132] At block 2102, the computing device (e.g., a data analysis device, a smart device, a device configured with artificial intelligence, etc.) can receive two arrays/matrices of numerical data. For example, the computing device can execute, or can continue executing, the fraction-product determination software 206 (
[0133] At block 2104, at each index element, the computing device can determine (via the fraction-product determination software 206 (
[0134] At block 2106, the computing device can determine (via the fraction-product determination software 206 (
[0135] At block 2108, for the array with the greater median, the computing device can determine (via the fraction-product determination software 206 (
[0136] At block 2110, for the array with the lesser median, the computing device can determine (via the fraction-product determination software 206 (
[0137] At block 2112, for each index element, the computing device can determine (via the fraction-product determination software 206 (
[0138] At block 2114, the computing device can determine or otherwise generate (via the fraction-product determination software 206 (
[0139] At block 2116, the computing device can determine (via the fraction-product determination software 206 (
[0140] At block 2118, the computing device can select (via the fraction-product determination software 206 (
[0141] At block 2120, the computing device can assess (via the fraction-product determination software 206 (
[0142] In other cases, the computing device can determine that the separation of the two groups is not adequate. Responsive to that negative determination, the computing device can select (via the fraction-product determination software 206 (
[0143] At block 2124, the computing device can select (via the fraction-product determination software 206 (
[0144] At block 2126, the computing device can assess (via the fraction-product determination software 206 (
[0145]
[0146] In some embodiments, the computing device may be embodied in, or can constitute, the computing device 201 (
[0147] At block 2202, the computing device (e.g., a data analysis device, a smart device, a device configured with artificial intelligence, etc.) can receive a first array of numbers and a second array of numbers. For example, the computing device can execute, or can continue executing, the fraction-product determination software 206 (
[0148] At block 2204, the computing device can determine (via the fraction-product determination software 206 (
[0149] At block 2206, the computing device can determine (via the fraction-product determination software 206 (
[0150] At block 2208, the computing device can determine (via the fraction-product determination software 206 (
[0151] At block 2210, the computing device can determine (via the fraction-product determination software 206 (
[0152] Determining the one or more optimal discriminants may include: (1) selecting the largest fraction-product value of the fraction-product values associated with the third array; (2) selecting index elements of the third array associated with the largest fraction-product value; (3) assessing, based on the index elements of the third array selected at step (2), a separation of the first group and the second group; (4) if the separation is inadequate or otherwise unsatisfactory, determining a fraction-product value less than the largest fraction-product value; (5) selecting index elements of the third array associated with the fraction-product value less than the largest fraction-product value; (6) assessing, based on the index elements of the third array selected at step (5), a separation of the first group and the second group; and (7) if the separation is inadequate, repeating steps (4)-(7).
[0153] The example method 2200 may further include determining that the fraction-product value exceeds a value for one or more contiguous index elements of the common index set, and selecting, based on the fraction-product value exceeding the value for the one or more contiguous index elements of the common index set, a feature of optical spectra.
[0154]
[0155] In some embodiments, the computing device may be embodied in, or can constitute, the computing device 201 (
[0156] At block 2302, the computing device (e.g., a data analysis device, a smart device, a device configured with artificial intelligence, etc.) can receive first optical spectra and second optical spectra. For example, the computing device can execute, or can continue executing, the fraction-product determination software 206 (
[0157] At block 2304, the computing device can determine (via the fraction-product determination software 206 (
[0158] At block 2306, the computing device can determine (via the fraction-product determination software 206 (
[0159] At block 2308, the computing device can determine (via the fraction-product determination software 206 (
[0160] At block 2310, the computing device can select (via the fraction-product determination software 206 (
[0161] At block 2312, the computing device can identify or otherwise determine (via the fraction-product determination software 206 (
[0162] At block 2314, the computing device can determine (via the fraction-product determination software 206 (
[0163] At block 2316, the computing device can receive an optical spectrum corresponding to a subject (e.g., subject 240 (
[0164] At block 2318, the computing device can designate (via the fraction-product determination software 206 (
[0165] The method 2300 may also include determining one or more wavelengths present in each of the first optical spectra and one or more wavelengths present in each of the second optical spectra, and determining, based on the one or more wavelengths present within each of the first optical spectra and the second optical spectra, the median value of optical spectral intensity for each of the first optical spectra and the second optical spectra.
[0166] Blocks 2302 to 2310 can collectively embody an example discovery process, and blocks 2312 to 2318 can collectively embody an example diagnosis process. The disclosure is not limited to diagnosis processes that use PCA and/or LDA in accordance with aspects described herein. Indeed, other diagnosis processes also can be implemented by applying machine-learning techniques to spectroscopic data and one or more candidate optical discriminants.
[0167] As described herein, finding optical features (or optical discriminants) that are useful for classification of a subject as pertaining to a normal group or a non-normal group can permit implementing a practical diagnostic technique. Further, in connection with neurological disorders, for example, such a technique can be readily implemented in vivo and, thus, can be more efficient and superior to existing diagnostic techniques. For example, in the study of Alzheimer's disease against controls, two candidate classifiers completely separated the two groups (
[0168] As is used in this specification and annexed drawings, the terms “module,” “component,” “system,” “platform,” and the like, can refer to and/or can include a computer-related entity or an entity related to an operational machine with one or more specific functionalities. Such entities can be either hardware, a combination of hardware and software, software (program code or executable program code, for example), or software in execution. In one example, a component can be a process running on a processor, a processor, an object, an executable (e.g., binary software), a thread of execution, a computer program, and/or a computing device. Simply as an illustration, a software application running on a server device can be a component and the server device also can be a component. One or more modules can reside within a process and/or thread of execution. One or more components also can reside within a process and/or thread of execution. Each one of a module and a component can be localized on one computing device and/or distributed between two or more computing devices. In another example, respective components (or modules) can execute from various computer-readable storage media having various data structures stored thereon. The components (or modules) can communicate via local and/or remote processes such as in accordance with a signal having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and/or across a network such as the Internet with other systems via the signal). As another illustrations, in some cases, a component can emulate an electronic component via a virtual machine, e.g., within a cloud computing system. The terms “module” and “component” (and their plural instances) may be used interchangeably where clear from context, in some cases.
[0169] As is used in this specification and annexed drawings, the term “processor” can refer to substantially any computing processing unit or computing device, including single-core processors; single-processors with software multithread execution capability; multi-core processors; multi-core processors with software multithread execution capability; multi-core processors with hardware multithread technology; parallel platforms; and parallel platforms with distributed shared memory. Additionally, a processor can refer to electronic circuitry designed in assembled to execute code instructions and/or operate on data and signaling. Such electronic circuitry can be assembled in a chipset, for example. Accordingly, in some cases, a processor can be embodied, or can include, an application specific integrated circuit (ASIC), a digital signal processor (DSP), a field programmable gate array (FPGA), a complex programmable logic device (CPLD), a discrete gate or transistor logic, discrete hardware components, or any combination thereof designed and assembled to perform the functionality described herein. Further, in some cases, processors can exploit nano-scale architectures, such as molecular and quantum-dot based transistors, switches and gates, in order to optimize space usage or enhance performance of computing devices. A processor can also be implemented as a combination of computing processing units.
[0170] Further, in this specification and annexed drawings, terms such as “storage,” “data storage,” “repository,” and substantially any other information storage component relevant to operation and functionality of a system, subsystem, module, and component are utilized to refer to “memory components,” entities embodied in a “memory,” or components including a memory. As is described herein, memory and/or memory components of this disclosure can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. Simply as an illustration, nonvolatile memory can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM), flash memory, or nonvolatile random access memory (RAM) (e.g., ferroelectric RAM (FeRAM). Volatile memory can include RAM, which can act as external cache memory, for example. By way of illustration and not limitation, RAM is available in many forms such as synchronous RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), direct Rambus RAM (DRRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM). Embodiments of this disclosure are not limited to these types of memory, and other types of memory devices can be contemplated.
[0171] While specific configurations have been described, it is not intended that the scope be limited to the particular configurations set forth, as the configurations herein are intended in all respects to be possible configurations rather than restrictive.
[0172] Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic concerning an arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; the number or type of configurations described in the specification.
[0173] It will be apparent to those skilled in the art that various modifications and variations may be made without departing from the scope or spirit. Other configurations will be apparent to those skilled in the art from consideration of the specification and practice described herein. It is intended that the specification and described configurations be considered as exemplary only, with a true scope and spirit being indicated by the following claims.