Global analysis peak fitting for chemical spectroscopy data
10551247 ยท 2020-02-04
Assignee
Inventors
Cpc classification
G01N21/31
PHYSICS
G01N23/085
PHYSICS
G01J3/42
PHYSICS
G06F17/18
PHYSICS
International classification
G01J3/42
PHYSICS
Abstract
The present invention relates to methods for analyzing a chemical sample. For instance, the methods herein allow for global analysis of spectroscopy data in order to extract useful chemical properties from complicated multidimensional data. Such analysis can optionally employ data compression to further expedite computer-implemented computation. In particular, the methods herein provide global analysis of data matrices explained by both linear and non-linear terms.
Claims
1. A method for analyzing a chemical test sample, the method comprising: (i) applying a thermal or photo-induced aging process to a chemical test sample, thereby producing an aged chemical test sample, wherein the aged chemical test sample is the polymer comprising nitrile or polytetrafluoroethylene; (ii) irradiating the aged chemical test sample; (iii) obtaining an image array of the aged chemical test sample after irradiation; (iv) employing a computer-implemented method to analyze the image array, wherein the computer-implemented method comprises: (a) receiving, with a processor, the image array of the aged chemical test sample after irradiation, wherein the array comprises d.sub.n number of emission images for each n.sup.th energy level, and wherein each image comprises dimensions of a by b, and wherein each of d.sub.n, n, a, and b is, independently, an integer of 1 or more; (b) generating, with a processor, a data matrix D based on the image array, wherein D has a dimension of m by n, and wherein m is a times b; (c) performing, with a processor, global analysis of D, thereby generating a matrix A comprising linear terms and a matrix S comprising non-linear terms, wherein D=AS.sup.T, A has a dimension of m by q, S has a dimension of n by q, and q is an integer of 1 or more that represents a number of factors, wherein generating the matrix A and the matrix S comprises initially non-linearly solving the non-linear terms in S and then linearly solving separately for the linear terms in A, and wherein each of q factors is characterized by a mathematical function that describes a spectral feature of a representation of the matrix D, and wherein q is less than n, and wherein the one or more factors are selected from the group consisting of a Gaussian function, a Lorentzian function, a pseudo-Voigt function, an asymmetric peak function, a shaped step function, and an offset function; and (d) converting the matrix A and the matrix S into one or more chemical spectra comprising q number of spectral features that characterizes the aged chemical test sample, thereby identifying one or more chemical bonds, atoms, or charges present in the aged chemical test sample; and (v) analyzing the one or more factors present in the aged chemical test sample, as compared to a native chemical sample, thereby characterizing an age of the aged chemical test sample.
2. The method of claim 1, wherein step (c) comprises a non-negativity constraint.
3. The method of claim 1, wherein step (c) comprises: (c4) solving for an estimated matrix by non-linearly solving D=AST, wherein the estimated matrix is an estimate of S; (c5) solving for an estimated matrix using the estimated matrix as S in the following equation D=AS.sup.T, wherein the estimated matrix is an estimate of A; (c6) computing a convergence metric; and (c7) iterating steps (c4) to (c6) until the convergence metric meets a convergence criterion.
4. The method of claim 3, further comprising: (c4) constructing an initial matrix .sub.initial composed of expected non-linear terms, wherein (c4) is conducted prior to step (c4).
5. The method of claim 4, wherein step (c4) comprises solving for an estimated matrix by non-linearly solving D=AS.sup.T by beginning from the initial matrix .sub.initial.
6. The method of claim 1, wherein step (c) comprises: (c8) constructing an initial matrix .sub.initial from expected non-linear terms; (c9) solving for an estimated matrix by non-linearly solving D=AS.sup.T by beginning from the initial matrix .sub.initial, wherein the estimated matrix is an estimate of S; (c10) solving for an estimated matrix using the estimated matrix as S in the following equation D=AS.sup.T, wherein the estimated matrix is an estimate of A; (c11) computing a convergence metric; and (c12) iterating steps (c9) to (c11) until the convergence metric meets a convergence criterion.
7. The method of claim 1, wherein step (c) comprises: (c13) constructing an initial matrix .sub.initial from expected non-linear terms; (c14) solving for an estimated matrix using the initial matrix .sub.initial as S in the following equation D=AS.sup.T, wherein the estimated matrix is an estimate of A; (c15) computing a convergence metric; (c16) if not converged, solving for an estimated matrix by non-linearly solving D=AS.sup.T, wherein the estimated matrix is an estimate of S, (c17) iterating steps (c14) to (c16) until the convergence metric meets a convergence criterion.
8. The method of claim 1, wherein step (c) comprises applying a least squares minimization criterion to generate the matrix A and the matrix S.
9. The method of claim 1, further comprising: (c) compressing the data matrix D to provide a score matrix T and a loading matrix P, wherein D=TP.sup.T and (c) is conducted prior to (c).
10. The method of claim 9, wherein step (c) comprises performing, with a processor, global analysis of compressed D, thereby generating a matrix A comprising linear terms and a matrix S comprising non-linear terms, wherein D=TP.sup.T=AS.sup.T.
11. The method of claim 1, wherein one or more factors comprise a plurality of pseudo-Voigt functions.
12. The method of claim 1, wherein the image array comprises one or more absorption images or spectra of the aged chemical test sample.
13. The method of claim 12, wherein the absorption images or spectra comprise one or more X-ray absorption spectra, near edge X-ray absorption fine structure spectra, fluorescence spectra, infrared spectra, photon emission spectra, or photon absorption spectra.
14. A method for analyzing a chemical test sample, the method comprising: (i) applying a thermal or photo-induced aging process to a chemical test sample, thereby producing an aged chemical test sample, wherein the aged chemical test sample is the polymer comprising nitrile or polytetrafluoroethylene; (ii) irradiating the aged chemical test sample; (iii) obtaining an image array of the aged chemical test sample after irradiation; (iv) employing a computer-implemented method to analyze the image array, wherein the computer-implemented method comprises: (a) receiving, with a processor, the image array of the aged chemical test sample after irradiation, wherein the array comprises d.sub.n number of emission images for each n.sup.th energy level, and wherein each image comprises dimensions of a by b, and wherein each of d.sub.n, n, a, and b is, independently, an integer of 1 or more; (b) generating, with a processor, a data matrix D based on the image array, wherein D has a dimension of m by n, and wherein m is a times b; (c) compressing the data matrix D to provide a score matrix T and a loading matrix P, wherein D=TP.sup.T; (d) performing, with a processor, global analysis of D, thereby generating a matrix A comprising linear terms and a matrix S comprising non-linear terms, wherein D=TP.sup.T=AS.sup.T, A has a dimension of m by q, S has a dimension of n by q, and q is an integer of 1 or more that represents a number of factors, wherein generating the matrix A and the matrix S comprises initially non-linearly solving the non-linear terms in S and then linearly solving separately for the linear terms in A, and wherein each of q factors is characterized by a mathematical function that describes a spectral feature of a representation of the matrix D, and wherein q is less than n, and wherein the one or more factors are selected from the group consisting of a Gaussian function, a Lorentzian function, a pseudo-Voigt function, an asymmetric peak function, a shaped step function, and an offset function; and (e) converting the matrix A and the matrix S into one or more chemical spectra comprising q number of spectral features that characterizes the aged chemical test sample, thereby identifying one or more chemical bonds, atoms, or charges present in the aged chemical test sample; and (v) analyzing the one or more factors present in the aged chemical test sample, as compared to a native chemical sample, thereby characterizing an age of the aged chemical test sample.
15. The method of claim 14, wherein step (c) comprises an eigenanalysis of D=TP.sup.T.
16. The method of claim 14, wherein step (c) comprises solving for the loading matrix P by analysis of D=TP.sup.T.
17. The method of claim 14, wherein step (c) comprises: (c1) solving for the loading matrix P by analysis of D=TP.sup.T; and (c2) solving for the score matrix T by using the loading matrix P.
18. The method of claim 17, wherein step (d) comprises: (d1) performing, with a processor, global analysis of D, thereby generating a first matrix A.sub.1 comprising linear terms and a first matrix S.sub.1 comprising non-linear terms; (d2) computing a first residual matrix R.sub.1 that characterizes error between D and generated matrix A.sub.1 and matrix S.sub.1; (d3) iterating steps (d1) and (d2) until convergence to provide matrix A and matrix S for step (e).
19. The method of claim 17, wherein step (d) comprises: (d4) solving for an estimated matrix by non-linearly solving D=AS.sup.T, wherein the estimated matrix is an estimate of S; (d5) solving for an estimated matrix using the estimated matrix as S, the matrix P, and the matrix T in the following equation AS.sup.T=TP.sup.T, wherein the estimated matrix is an estimate of A; (d6) computing a convergence metric; and (d7) iterating steps (d4) to (d6) until the convergence metric meets a convergence criterion.
20. The method of claim 19, wherein step (d) comprises: (d12) computing the matrix A by using the estimator matrix in the following equation A=T, wherein T=DP and (d12) is conducted after (d11).
21. The method of claim 17, wherein step (d) comprises: (d8) solving for an estimated matrix by non-linearly solving D=AS.sup.T, wherein the estimated matrix is an estimate of S; (d9) solving for an estimator matrix using the estimated matrix as S and the matrix P in the following equation P.sup.T=S.sup.T, wherein the estimator matrix is an estimator of A such that A=T; (d10) computing a convergence metric; and (d11) iterating steps (d8) to (d10) until the convergence metric meets a convergence criterion.
22. The method of claim 17, wherein step (d) comprises applying a least squares minimization criterion to generate the matrix A and the matrix S.
23. The method of claim 22, wherein the absorption images or spectra comprise one or more X-ray absorption spectra, near edge X-ray absorption fine structure spectra, fluorescence spectra, infrared spectra, photon emission spectra, or photon absorption spectra.
24. The method of claim 14, wherein D is an mn matrix, T is an mp matrix, and P.sup.T is an pn matrix; each of m, n, and p is an integer of 1 or more; and p is less than both m and n.
25. The method of claim 14, wherein step (d) comprises a non-negativity constraint.
26. The method of claim 14, wherein the image array comprises one or more absorption images or spectra of the aged chemical test sample.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
DETAILED DESCRIPTION OF THE INVENTION
(18) The present invention relates to global analysis of multidimensional data. In general, global analysis includes simultaneous analysis of all measurements provided in a data set (e.g., a data matrix D). Global analysis is different from analytical techniques in which an averaged image or spectrum is analyzed or in which an individual image or spectrum is analyzed one at a time (i.e., sequential analysis). Taking the example for NEXAFS analysis, conventional analytical techniques generally do not employ an entire, multidimensional data set. Instead, an average spectrum profile is generated by summing the intensity observed in an image for a particular photon energy level, thereby generating a profile having two dimensions: intensity and photon energy. Using this two-dimensional profile, the various peaks can be fit using minimal computation time and without employing global analysis.
(19) In contrast, global analysis requires simultaneous analysis of the entire data set in all dimensions, e.g., multiple (more than two) dimensions providing spatial information and spectral information. Due to the large amount of data, global analysis can be challenging. In addition, many chemical phenomena are more appropriately characterized by non-linear functions, but many curve analysis techniques require or impose linear constraints to facilitate analysis by matrix decomposition methodologies (e.g., principal component analysis). Thus, the present invention encompasses a method of analyzing multidimensional data for chemical test samples by fitting both non-linear and linear terms. In particular embodiments, the method fits emission bands (from chemical bonds or orbitals) to peaks described by non-linear functions using non-linear and linear optimization techniques. By maintaining the high dimensionality of the image space, the methods herein yield peaks with potentially greater reliability than single spectrum fitting. The methods herein can optionally employ data compression with principal component analysis to rapidly complete the analysis. Additional details follow, which describe how data matrices are generated and then analyzed to provide meaningful chemical information.
(20) Data Matrix and Image Arrays
(21) The present invention employs a data matrix to represent information from a multidimensional image array, which in turn is obtained by detecting radiation emitted by a test sample (e.g., a chemical sample). Many spectroscopy techniques apply energy to a test sample and then detect any resultant radiation emitted by the test sample. The type and intensity of the radiation can be detected and recorded in any useful manner. In particular, such detection can include spatial information, such as the type and intensity of radiation emitted from a particular surface region or volume region from the test sample. Furthermore, a range of energy levels can be applied. Thus, detection can proceed in an energy level-dependent manner, in which one or more radiation measurements are recorded at each energy level. The result is typically an array of spectral data (e.g., images, spectra, etc.) including information on multiple dimensions, such as spectral (e.g., a chemical parameter, such as concentration and identity of each chemical species), experimental (e.g., an energy level or an incidence angle of the radiation source as compared to the molecular axis of a chemical structure in the sample), temporal (e.g., time-dependent changes in a chemical attribute or parameter), and/or spatial (e.g., x-, y-, and/or z-dimensional image of a chemical sample) dimensions.
(22) In one non-limiting example, the image array is obtained from a near edge absorption X-ray fine structure (NEXAFS) spectroscopy experiment. NEXAFS employs an X-ray radiation source to irradiate a test sample at discrete energy levels and a detector to detect radiation emitted from the sample. In one instance, the raw (i.e., measured) data output by NEXAFS (in the form of electrical signals representing X-ray counts or counts per second/minute) is digitized and sorted into bins, where each bin generally contains four parameters: (1) energy channel number n (i.e., X-ray or photon energy level), (2) X-coordinate of the pixel, (3) Y-coordinate of the pixel, and (4) total number of counts (or intensity). NEXAFS can organize and store the measured data in digital memory in a variety of 2-D or 3-D formats, for example, by channel number, by pixels row-wise, by pixels column-wise, or by other types of channel interleaving.
(23)
(24) Image arrays can include any useful d number of images obtained at any useful n.sup.th energy level. For instance, a chemical test sample can be irradiated with energy at an n.sup.th level, and an image is obtained at each n.sup.th level. The number n can be any useful integer (e.g., one or more), and the absolute energy level can be determined by the type of chemical information desired by a user. In some embodiments, the energy levels correspond to X-ray energy levels (e.g., an energy level less than about 2,000 eV). In other embodiments, images and arrays are obtained for chemical test samples treated with X-ray absorption energy levels that result in electronic signatures about 50 eV to about 100 eV above the absorption edge (e.g., above the K-shell edge or above the deepest core shell ionization energy of electrons, such as the ionization energy of 1s electrons).
(25) Information from the image array can be used to generate a numerical format that facilitates computation. For instance, such information can be arranged into a data matrix that can be stored, manipulated, inputted, and outputted by a computer processor. Information from a NEXAFS image array can be unfolded into a two-dimensional mn matrix D, where m is the number of image pixels and n is the number of energy levels. If an entire image is to be analyzed, then m is the number of pixels in the entire image. For instance, for a two-dimensional image having dimensions a by b pixels, m is defined as a times b. As seen in
(26) The image can be of any useful dimension (e.g., a two-dimensional image having dimensions a by b, typically measured in pixels, such as 600 pixels by 800 pixels; or a three-dimensional voxel having dimensions a by b by c). In certain embodiments, the image can be a region of interest (ROI) of a larger image, and the ROI can also be of any useful dimension (e.g., a by b, typically measured in pixels, such as 25 pixels by 50 pixels). The dimensions a, b, and c can be any useful number (e.g., an integer that is one or more).
(27) Image arrays can be transformed (or unfolded) into a data matrix D in any useful manner. For instance, take an image array including n images, where each image has a dimension of a by b pixels (in the x-y plane). Each image can be rastered in the x-y plane to identify an intensity data point associated with each pixel, and then these data points can be strung out as a single vector having a length of a times b. This vector can be provided as a column in a matrix, where the column has a length of m=a times b. Repeating this process for each n.sup.th image then provides n number of columns, each column of length m. Mathematically, these data can be represented as an mn data matrix D having m rows and n columns, in which each element d.sub.m,n in the matrix represents a data point from a single pixel located at an (x,y) position at a particular n.sup.th energy level.
(28) Unfolding techniques can also be applied to arrays of higher dimensional information, such as a three-dimensional voxel including three spatial coordinates (i.e., the x-, y-, and z-coordinates) or a three-dimensional datum including two spatial and one temporal coordinates. For instance, take an image array including n images, where each image has a dimension of a by b by c pixels (in the x-y-z space). Then, the data matrix D can be represented as an mn matrix, in which m=a times b times c, in which each image is rastered in the x-y plane for each c.sup.th step along the z-axis to produce a vector having a length of m. After repeating for each n.sup.th energy level, these data can be represented as an mn data matrix D having m rows and n columns, in which each element d.sub.m,n in the matrix represents a data point from a single pixel located at an (x,y,z) position at a particular n.sup.th energy level. Additional details on unfolding higher dimensional data are described in Kotula P G et al., Tomographic spectral imaging with multivariate statistical analysis: comprehensive 3D microanalysis, Microsc. Microanal. 2006; 12:36-48, which is incorporated herein by reference in its entirety.
(29) In general, a chemical test sample can be analyzed with any useful spectroscopic method (e.g., any described herein) in order to obtain a set of images or spectra that captures emission signals of that sample when excited by a particular energy level. In brief, for NEXAFS, the chemical sample is irradiated with an X-ray source, thereby allowing for photoabsorption of the X-ray by the sample. This photoabsorption process results in a photoelectron that is excited to its next permitted energy level, as well as a core hole that is filled by radiatively emitting a fluorescence photon or by non-radiatively emitting an Auger electron. Emission signals can detected in any useful manner, such as by employing a channeltron, an energy analyzer, a scanning electron microscope, a photodetector (e.g., a charge-coupled device), or a fluorescence detector in order to produce one or more spectra or images. Such spectra or images, in turn, can be analyzed by any method herein to determine the type of electron transitions (e.g., is 1s.fwdarw.* or * transitions) associated with an emission signal. Different chemical bonds exhibit different emission signals, which can be characterized by the energy level providing that signal or a line shape of the signal, as well as the E.sub.0 associated with that signal. In this way, the chemical characteristics of the sample can be determined.
(30) The present invention can be employed for any useful chemical test sample, such as a polymer, a thin film, a liquid, a gas, an adsorbate, an absorbate, a molecular complex, etc. In particular embodiments, the test sample can include low atomic number molecules (e.g., an atomic number less than 10 or 18, such as hydrogen, carbon, nitrogen, oxygen, and/or fluorine).
(31) The image and the resultant image array can include any useful spectrum of a chemical test sample. Exemplary spectra include one or more one or more X-ray absorption spectra, near edge X-ray absorption fine structure spectra (NEXAFS), X-ray photoelectron spectra, energy dispersive X-ray spectra (EDS), Auger electron spectra, X-ray fluorescence spectra, proton-induced X-ray emission spectra, fluorescence spectra, infrared spectra, raman spectra, ultraviolet photoelectron spectra, photon emission spectra, mass spectroscopy spectra (e.g., time of flight-secondary ion mass spectroscopy (TOF-SIMS) or matrix-assisted laser desorption ionization secondary ion mass spectroscopy (MALDI-SIMS)), and/or photon absorption spectra. Additional chemical test samples, as well as spectra, are described in Hhner G, Near edge X-ray absorption fine structure spectroscopy as a tool to probe electronic and structural properties of thin organic films and liquids, Chem. Soc. Rev. 2006; 35:1244-55; Outka D A et al., Curve fitting analysis of near-edge core excitation spectra of free, adsorbed, and polymeric molecules, J. Chem. Phys. 1988; 88:3539-54; and Penner-Hahn J E, X-ray absorption spectroscopy, in Comprehensive Coordination Chemistry, Vol. II (eds.-in-chief J. A. McCleverty and T. J. Meyer; vol. ed. A.B.P. Lever), Elsevier Ltd., Amsterdam, Netherlands, 2003, pp. 159-86, chp. 2.13, each of which is incorporated herein by reference in its entirety.
(32) Analytical Model for Chemical Data
(33) We developed an analytical model to mathematically represent the complex phenomenon captured by chemical data. This analytical model accepts the use of non-linear and linear terms to describe spectral peaks or emissions observed in the data matrix. The line shape of many spectral emissions can be described by a mathematical expression, such as a Gaussian function, a step function, an offset function, etc. Two or more of these expressions, in turn, can be combined to describe an observed spectrum. Here, we propose an analytical model of data matrix D (
D=AS.sup.T,(Eq. 1)
where D is an mn data matrix, A is an mq matrix including linear terms, and S is a nq matrix including non-linear terms. As described herein, m is the number of pixels for an image (or region of an image) under analysis, and n is the number of energy levels. The term q represents the number of factors that adequately characterize (e.g., as determined by one or more convergence metrics or criteria described herein) the data matrix D. Also, as used herein, superscript T represent matrix transpose.
(34) Any useful factor can be employed to characterize the spectral emission. For instance, many spectra can be characterized by a Gaussian function (I.sub.G), a Lorentzian function (I.sub.L), a pseudo-Voigt function (I.sub.V), an asymmetric peak function (any factor (e.g., I.sub.G, I.sub.L, I.sub.V, or I.sub.S) modified to have w=mE+b, where energy level E is the variable and m and b are constants that are common to all shifted peaks), a shaped step function (I.sub.S), and/or an offset function (I.sub.O). Each of these functions can be employed as a factor in the methods herein. In most situations, two or more of these factors (i.e., q is more than or equal to 2) are employed together to describe spectral information in the data matrix. Exemplary factors are as follows:
(35) Gaussian:
(36)
(37) Lorentzian:
(38)
(39) Pseudo-Voigt:
(40)
Asymmetric peak: Set w=mE+b;(Eq. 2d) Asymm. Gaussian:
(41)
(42)
(43) Shaped step:
(44)
(45) Offset:
(46)
where for each of these factors, energy level E is the variable and term A (a linear term) is an amplitude of a peak for each factor. Additional terms include an energy value E.sub.0 at a position in the center of a peak or step for each factor, a constant c (e.g., where c=2 log 4 or 2 ln 4 when w is a full width at half maximum or FWHM), a full width w at half maximum of a peak for each factor, a constant (where 0<<1), a constant m that scales w, a constant b that shifts w, and a decay coefficient d (e.g., an exponential decay coefficient, such as d=2 log 2). For the asymmetric peak function, term w can be replaced with mE+b for any symmetric peak function (e.g., the Gaussian, Lorentzian, pseudo-Voigt, or shaped step factors). As can be seen, most factors (except the offset factor in Eq. 2f) include a linear term (e.g., term A in the Gaussian factor I.sub.G of Eq. 2a) and a non-linear term (e.g., term exp[((EE.sub.0)c/w).sup.2] for the Gaussian factor I.sub.G of Eq. 2a).
(47) Each spectral feature (e.g., peak, emission, step, etc.) can be modeled by a factor. For instance,
(48) Using the analytical model of Eq. 1, we can identify estimates or solutions for matrices A and S that accurately describe the real data matrix D in a global analysis manner. First, we select a set of factors (e.g., Eqs. 2a-2f) that can be used to represent a spectral feature provided in the data matrix D. Then, we construct matrix A to include all linear terms for each of the q factors selected to represent the data, as well as construct matrix S to include all non-linear terms for each of the q factors. Next, as S includes non-linear terms, a non-linear solver is employed to identify an estimate or solution for S. In S, each non-linear term for each factor is computed using estimated values for each term (e.g., E.sub.0, w, and c terms for Gaussian factor I.sub.G in Eq. 2a) and the given energy axis n (e.g., as the variable E in Eq. 2a). The offset, which lacks non-linear terms, is entered as a column of ones. Provided the estimated matrix for S, a linear solver (e.g., using least squares) is employed to identify an estimate or solution for A. Optionally, constraints (e.g., non-negativity or equality constraints) can be imposed for each solution of S and/or A. Estimates for A, S, and q can be refined by using any useful method, such as by applying the least squares criterion (e.g., as described herein) and by iterating such refinement steps until convergence. Additional details for applying global analysis are described herein.
(49) This analytical model provides numerous benefits. For instance, many conventional non-linear least squares methods can be applied in a global analysis manner but with an immense computational burden. In contrast, the analytical model herein separates the solving step for the non-linear terms in S from the solving step for linear terms in A. This distinctive feature of the analytical model results in an efficient method to analyze complex, multidimensional data. Take an example of a data matrix D.sub.G that is characterized four Gaussian-type peaks, where each peak is represented by one Gaussian factor and each Gaussian factor has one linear term (i.e. A in Eq. 2a) and two non-linear terms (i.e., E.sub.0 and w in Eq. 2a). In addition, the data matrix D.sub.G includes an image array having n number of images, in which each image has 100 pixels (i.e., m=100). Using conventional non-linear solving methods, the non-linear solver would have to solve for 1,200 different terms for each iteration, which is determined as follows:
(50)
If additional iterations are needed (i.e., i more than 1), then the conventional method would require a solution of 1,200 terms for each iteration or 1,200i number of terms. In contrast, using the analytical method herein, the number of terms requiring a solution is drastically decreased. Taking the same data matrix D.sub.G above, the analytical method herein requires a solution of only 408 terms:
Non-linear solutions:
(51)
Linear solutions:
(52)
Total solution: 8 terms in S+400 terms in A=408 total terms.(Eq. 4c)
(53) Thus, the number of terms in a conventional analysis scales linearly as the number of spectral features (or factors representing such features), the number of terms for each factor representing such features, and the size of the data matrix. In contrast, the present methods do not exhibit such linear scaling because the solution for non-linear terms does not depend on the number of image pixels. This reduction in the number of terms can have drastic implications on computation efficiency for bigger data matrices, as well as methods requiring numerous iterations.
(54) Global Analysis
(55) Upon obtaining data matrix D (e.g., as described herein), global analysis can be performed to obtain matrices A and S that describe the chemical signatures of the underlying chemical test sample.
(56) The method employs a non-linear solver to solve 301 non-linear terms within the selected factors. As seen in
(57) Any useful methodology can be employed to solve for S. Many non-linear solving methods are iterative procedures, in which initial estimates (e.g., a provided estimate or a random estimate) are further refined by minimization (e.g., by least squares, by damped least squares, etc.). Exemplary methods include the Levenberg-Marquardt method, the Gauss-Newton method, the gradient descent method, the steepest descent method, other non-linear least squares minimization methods, as well as computer-implementations of such methods, including lsqnonlin, a MATLAB function that uses the Levenberg-Marquardt algorithm, as well as those described in Beechem J M, Global analysis of biochemical and biophysical data, Methods Enzymol. 1992; 210:37-54, which is incorporated herein by reference in its entirety. Solutions for S or refinement of S can be obtained by using least squares, as follows:
(58)
where subscript F represents the Frobenius norm, D is the data matrix, superscript .sup.T is matrix transpose, A is any useful matrix of linear terms, and S is any useful matrix of non-linear terms.
(59) Given (e.g., S.sub.i, .sub.initial, or ) the linear terms of D are solved 303 and employed to construct estimated matrix , which is dimensioned as m number of image pixels by q number of factors. In one instance, estimated matrix can be determined as follows:
=DS(.sup.T).sup.1,(Eq. 6)
where is an estimated matrix for S. In other instances, A can be obtained using least squares, which can be formally described as follows:
(60)
In another instance, matrix A can be described as follows:
A=X,(Eq. 8)
where X is a matrix having mutually orthogonal columns (e.g., such as score matrix T, as further described herein) and is an estimator matrix for matrix A.
(61) Any useful methodology can be used to solve for matrix A. In one instance, eigenanalysis of the data matrix D, the covariance matrix D.sup.TD, or the association matrix DD.sup.T is employed. In another instance, principal component analysis is employed, e.g., as described herein. In yet another instance, orthogonal projection methods are employed, in which vector projection is used to generate approximations for A. Although the exemplary method 300 describes an estimate of S to precede the estimate for A, these steps can be reversed. For instance, the first step can include an estimate for A, which can be used in turn to provide an estimate of S.
(62) To determine the accuracy of the estimated matrices, a residual matrix R can be computed 304. Matrix R can be computed in any useful manner. In one instance, R is determined as follows:
R=D.sup.T,(Eq. 9a)
where estimated matrices and are compared against the real data matrix D. In another instance, R is determined as follows:
R=D(.sup.T).sup.1,(Eq. 9b)
which can be mathematically rearranged to be equivalent to Eq. 9a. However, Eq. 9b requires manipulation of mq matrices (for instance, R is an mq matrix), and Eq. 9a require matrix addition of the entire data set D that is an mn matrix. In most instances, q will be less than n. The residual matrix R, in turn, can be used to determine convergence as compared to an appropriate convergence criterion (e.g., a sum squared residual SSR), as described herein.
(63) To ensure that the estimated matrices S and A appropriately model the real data, steps 301, 302, 303, and 304 (performed in any useful order) can be iterated 305 as needed until convergence. Iterations can include estimating matrix and then matrix , or vice versa. As used herein, a carat ({circumflex over ()}) indicates an estimated matrix, and a subscript i indicates a matrix produced at an i.sup.th iteration. In some instances, i is the number of iterations until convergence. For instance, a first matrix of S is indicated as S.sub.1; an i.sup.th matrix of S is indicated as S.sub.i, where i is the number of iterations (e.g., until convergence); an estimated matrix of S is indicated as S; and an initial matrix of S, which is a matrix having estimated non-linear terms, is indicated as .sub.initial.
(64) Upon obtaining an estimate for A (e.g., A.sub.i, , .sub.initial, or ) or S (e.g., S.sub.i, .sub.initial, or ), further processes can be employed to refine this estimate, such as by using iterative constrained least squares methods, as described herein. Assuming an initial estimate or solution for S and/or A, the methods herein can be iterated until convergence. In one instance, a first matrix S.sub.1 can be used to solve for A (e.g., using Eqs. 1, 6, and/or 7), thereby providing a first matrix A.sub.1. Using this computed matrix A.sub.1, the first matrix S.sub.1 can be refined to provide a second matrix S.sub.2. In this iterative fashion, matrices S and A can be solved (e.g., by least squares, such as according to Eqs. 5 or 7) until the convergence criterion has been met. Thus, the methods herein can include i number of iterations to provide an estimated matrix S.sub.i and A.sub.i at each i.sup.th iteration step until convergence.
(65) Although examples are provided with an initial matrix S to solve for A, the converse can be true. For instance, a first matrix A.sub.1 can be used to solve for S, thereby providing a first matrix S.sub.1. Then, using this computed matrix S.sub.1, the first matrix A.sub.1 can be refined to provide a second matrix A.sub.2. These steps can be conducted in an iterative fashion until convergence. Additional details for these analytic methodologies are described in Keenan M R et al., Algorithms for constrained linear unmixing with application to the hyperspectral analysis of fluorophore mixtures, Proc. SPIE 2002; 4186:193-202; Keenan M R, Multivariate analysis of spectral images composed of count data, in Techniques and Applications of Hyperspectral Image Analysis (eds. H. F. Grahn and P. Geladi), John Wiley & Sons, Hoboken, N.J., 2007, pp. 89-126, chp. 5; and Andrew J J et al., Rapid analysis of Raman image data using two-way multivariate curve resolution, Appl. Spectrosc. 1998; 52(6):797-807, each of which is incorporated herein by reference in its entirety.
(66) Finally, upon reaching convergence, the obtained matrices S and A can be converted into an image array, a spectral array, or a spectrum that can be interpreted by a user. Conversion can be implemented in any useful manner, as described herein.
(67) Additional steps can be performed during global analysis. Optionally, non-negativity constraints on each element of S.sup.T or A can be applied as follows:
S.sup.T0 and/or(Eq. 10a)
A0.(Eq. 10b)
(68) Vector rotation may optionally be employed. For instance, Varimax rotation is one such vector rotation technique, in which factors are rotated to maximize the total variance of the squared vectors, while maintaining orthogonality between all vectors. Additional methods for orthogonal and non-orthogonal factor rotation include Quartimax, Quartimin, Equamax, Orthomax, Parsimax, Oblimax, Oblimin, and Promax, as well as any described in Harman H H, Modern factor analysis, (3.sup.rd ed. rev.), The University of Chicago Press, Chicago, Ill., 1976, pp. 278-99; and Keenan M R, Exploiting spatial-domain simplicity in spectral image analysis, Surf Interface Anal. 2009; 41:79-87, each of which is incorporated herein by reference in its entirety.
(69) The data matrix can be processed in any useful manner prior to global analysis, such as by compressing, weighting, scaling, centering, etc., where one or more of these processes can be applied. In some instances, the data matrix can be centered by subtracting the mean of all the spectra from each individual spectrum prior to global analysis, data compression, and/or matrix factorization. In other instances, the data matrix can be weighted in order to reduce noise (e.g., noise exhibiting Poisson characteristics), such as by defining an alternate space in which uncertainty of the data in that alternate space is more uniform. An exemplary weighted method is described in Keenan M R et al., Accounting for Poisson noise in multivariate analysis of ToF-SIMS spectrum images, Surf Interface Anal. 2004; 36:203-12, which is incorporated herein by reference in its entirety. In yet other instances, the data matrix can be adjusted for spectral background, such as by using the methods in U.S. Pat. Nos. 6,584,413 and 6,675,106, which is incorporated herein by reference in its entirety.
(70) Computer-Implemented Methods for Global Analysis
(71) The methods herein can be implemented in any useful manner. For instance, a spectrometer can be used to obtain spectral data, such as an array of images. In one example, the spectrometer is directly linked to a spectrum analyzer (e.g., via electrical or optical cable), or by wireless means, for real-time spectral analysis. Alternatively, the measured spectral data can be stored on computer readable memory means (e.g., CD-ROM, DVD, magnetic tape, etc.) and provided to a spectrum analyzer at a later time for off-line post-processing.
(72) A spectrum analyzer can include any useful components, such as a computer processor; an operator interface (such as a keyboard or computer mouse); a memory for storing data and executable instructions; an interface for data input and output; an executable software instruction embodied in an executable computer program code; and a display for displaying the results of a global spectral analysis. The computer processor can be a standard laboratory Personal Computer (PC), for example, using an Intel fourth generation microprocessor, such as made by IBM, Compaq, Dell, etc.; or a workstation, such as made by SUN, HP, Silicon Graphics, etc., running UNIX or Linux operating systems. Software instructions can be expressed in any of a variety of well-known computer languages, including MATLAB, FORTRAN, C, C++, and Java for implementing any useful program, subroutines, subprograms, and input/output routines.
(73) In another instance, the methods and steps herein are implemented using a computer. In particular, receiving steps can be implemented by a processor configured to receive input (e.g., data input) from a spectrometer (e.g., a detector of the spectrometer). Other steps (e.g., generating of one or more matrices, spectra, or images; performing global analysis; compressing data; computing matrices, eigenvalues, or eigenvectors; iterating; and/or converting matrices into one or more other matrices, spectra, or images) can be implemented by a processor optionally having memory to store data and/or executable instructions, software instructions embodying executable instructions (e.g., for generating, performing, converting, iterating, solving, computing, compressing, etc., according to the methods described herein), and an output (e.g., a display) to show any useful result to the user.
(74)
(75) Methods herein can be implemented on a computer, such as by a processor of the computer that is configured to perform executable instructions, by a memory to store data and such executable instructions, and by an executable computer program code embodying such instructions. One exemplary computer-implemented method 500 includes a routine of steps capable of being programmed in any useful computer code language (
(76) Other useful routines can be implemented to include any other useful step. For instance, the method can include the step of computing a residual matrix R as a convergence metric to characterize convergence.
(77) At times, an initial estimate for S can be provided, such as when the chemical sample is composed of a known polymer with known spectral peaks. To expedite computation, an initial estimate for such features can be provided as an initial matrix .sub.initial composed of expected non-linear terms.
(78) Exemplary method 5100 (
(79) Another exemplary method 5200 (
(80) Data Compression
(81) The data matrix D can be compressed in any useful manner. As described herein, matrix D is represented by a multidimensional data space. Depending on the size of each image (ab, abc, or m) and the n channels of images, the size of the data matrix D can be substantial, i.e., a mn matrix. In addition, matrix manipulation of such large matrices, in order to arrive at desired solutions for matrices S and A, can be time-consuming. Thus, reducing the size of matrix D can be helpful for computational purposes, but such compression techniques should not arbitrarily remove potentially important data.
(82) In one instance, principal component analysis (PCA) is employed to compress matrix D. Using PCA, matrix D can be characterized by a single direction (or a first principal component PC.sub.1) that maximally accounts for variance in the data. To more fully describe the data, additional principal components can be identified (e.g., PC.sub.2, PC.sub.3, . . . , PC.sub.p, where p is an integer). Each principal component is orthogonal to any other principal component. In addition, each successive principal component possesses a variance of successively lower quantity (i.e., variance of PC.sub.1>variance of PC.sub.2>variance of PC.sub.3, etc.). Typically, the first few principal components contain the chemical information of interest. The remaining principal components generally contain noise or other artifacts. Thus, if the p number of principal components adequately describes the underlying data, then the data matrix D can represented as follows:
D=TP.sup.T,(Eq. 11)
where D is an mn data matrix, T is an mp matrix having mutually orthogonal columns, and P is an np matrix having mutually orthonormal columns; superscript .sup.T indicates a matrix transpose; and each of m, n, and p is an integer of 1 or more. Matrix T is conventionally called the score matrix, and matrix P is conventionally called the loading matrix. This PCA model, as well as least square approximations of this model, can be shown graphically (
(83) The expression in Eq. 11 represents the data matrix D as a combination of the score matrix T and the loading matrix P. If dimension p is significantly less than both m and n (e.g., pmin(m, n)), then the data matrix can be compressed by being represented by matrices T and P having this dimension p. As shown in
(84) Any useful methodology can be employed to solve the PCA problem in Eq. 11. In one instance, eigenanalysis of the data matrix D, the covariance matrix D.sup.TD, or the association matrix DD.sup.T is employed. For example, eigenanalysis of the covariance matrix in conjunction with the compressed expression for D (e.g., Eq. 11) can be expressed as follows:
D.sup.TD=(TP.sup.T).sup.TTP.sup.T=PT.sup.TTP.sup.T=P(T.sup.TT)P.sup.T(Eq. 12)
where T.sup.TT is a diagonal pp matrix that can be optionally solved first in Eq. 12. Performing the calculation of this pp diagonal matrix, prior to performing matrix multiplication with np matrix P and pn matrix P.sup.T, can be desirable, especially asp is generally less than n.
(85) By solving the eigenproblem of Eq. 12, solutions for matrix P can be obtained. After obtaining the loading matrix P, the score matrix T can be obtained as follows:
T=DP,(Eq. 13)
which can be obtained by re-arranging Eq. 11.
(86) Singular value decomposition (SVD) can be used in conjunction with eigenanalysis to solve Eq. 11. For instance, SVD decomposes D into three matrices:
D=UN.sup.T,(Eq. 14)
where U is an mm orthogonal matrix containing left singular vectors, V is an nn orthogonal matrix containing right singular vectors, and A is an mn diagonal matrix with singular values placed along the diagonal. Singular values and corresponding vectors are ordered within these matrices according to their significance.
(87) By comparing Eqs. 11 and 14, PCA analysis can be performed with the following expressions:
P=V.sub.p, and(Eq. 15a)
T=(U).sub.p=DP(Eq. 15b)
where subscript p indicates that only the first p columns of the respective matrices are retained. Furthermore, solutions for V can be obtained by applying eigenanalysis of D as follows:
D.sup.TD=(UV.sup.T).sup.T(UV.sup.T)=V.sup.2V.sup.T,(Eq. 15c)
where this eigenvalue problem shows that the right singular vectors V can be obtained as eigenvectors of D.sup.TD and that the squares of the singular values .sup.2 can be obtained as eigenvalues of D.sup.TD. Thus, solving Eq. 15c provides V, where the first p components of V can be employed as V.sub.p, which in turn provides P according to Eq. 15a. Finally, T can be determined according to Eq. 15b.
(88) Additional methods for principal component analysis (e.g., SVD, eigenvalue decomposition (EVD), an arbitrary f-component factor model, etc.) are described in U.S. Pat. Nos. 6,584,413, 6,675,106, 7,283,684 and 7,400,772, which is incorporated herein by reference in its entirety. Algorithms for implementing SVD and EVD, as well as other PCA algorithms, are described in Wu W et al., The kernel PCA algorithms for wide data. Part I: theory and algorithms, Chemom. Intell. Lab. Sys. 1997; 36:165-72, which is incorporated herein by reference in its entirety.
(89) Furthermore, global analysis can be performed by analyzing the following expression:
D=TP.sup.T=AS.sup.T,(Eq. 16)
which is also depicted graphically in
A=TP.sup.TS(S.sup.TS).sup.1=T(P.sup.TS)(S.sup.TS).sup.1(Eq. 17)
In particular embodiments, the expression P.sup.TS is optionally solved first in Eq. 17. Performing this calculation provides a pq matrix, where p and q, independently, are generally less than m and n.
(90) The PCA model can be used jointly with the analytical model for data in any useful global analysis method (e.g., any global analysis method described herein). One exemplary method 700 combines a global analysis method with a PCA model. As seen in
(91) A further step 704 includes performing global analysis on the compressed data matrix, as described herein, of D=TP.sup.T=AS.sup.T (Eq. 16). This step 704 can be performed in order to (i) solve for the loading matrix P (e.g., using Eqs. 11, 12, 15a, or 15c), (ii) solve for the score matrix T (e.g. using Eqs. 13 or 15b), (iii) non-linearly solve for matrix S, and (iv) linearly solve for matrix A (e.g., using Eq. 17), where (i), (ii), (iii), and (iv) can be performed in any order or at the same time. For instance, steps (i) and (iii) can be performed in any order or at the same time. Step (ii), which benefits from a solution obtained in step (i), is generally but not always performed after step (i). However, step (ii) can be performed before or after step (iii). Finally, step (iv), which benefits from a solution obtained in steps (i), (ii), and (iii), is generally but not always performed after these steps.
(92) The final step 705 includes determining one or more properties of test sample (e.g., chemical properties, bonding properties, etc.) by inspecting converged matrices A and S (e.g., including converted versions of matrices A and S). Optionally, the global analysis step 704 includes applying a least squares criterion to A and/or S. Each of these steps can be performed using a computer, a processor, a memory, a program code, etc.
(93) The PCA model and analytical model can be performed in any useful manner, such as by using a computer (e.g., as described herein). Any of the global analysis methods herein can be modified to include PCA compression. For instance, an exemplary computer-implemented method 800 (
(94) The method 800 includes a start prompt 801 to launch the global analysis method; a step 802 of solving for the loading matrix P by analysis of D=TP.sup.T; a step 803 of solving for the score matrix T using provided loading matrix P (e.g., by using Eq. 13); a step 804 of solving for an estimated matrix by non-linearly solving D=AS.sup.T; a step 805 of solving for an estimated matrix using the provided loading matrix P, the provided score matrix T, and the estimated matrix as S in the following equation AS.sup.T=TP.sup.T or any other useful equation described herein (e.g., Eq. 17); a step 806 of computing a convergence metric; and a decision step 807 of determining whether the convergence metric meets the convergence criterion. If converged, then a stopping step 808 ends the routine. If not converged, then the routine is iterated until convergence. Optionally, a step 809 includes applying a least square minimization criterion to either solving steps 804,805 for matrices and/or . As described herein, step
(95) In another instance, the loading matrix P can be defined by an estimator matrix for matrix A. As seen in
P.sup.T=S.sup.T and(Eq. 18)
A=T,(Eq. 19)
where estimator matrix is a pq matrix.
(96) Using these expressions, the loading matrix P can be used in a global analysis in order to determine solutions for the estimator matrix , where solutions for this minimized matrix can be obtained in any useful manner (e.g., by least squares). In one instance, can be obtained using least squares, which can be formally described as follows:
.sub..sup.minP.sup.TS.sup.T.sub.F.sup.2.(Eq. 20)
The non-linear terms can be solved to obtain estimated matrix , where this step can be performed before or after obtaining the estimator matrix .
(97) Estimates for matrix S and the estimator matrix can be obtained in an iterative manner until convergence, which can be determined by any useful residual matrix R. In instance, the residual matrix is determined as follows:
R={tilde over ()}D(.sup.T).sup.1,(Eq. 21)
where {tilde over ()} is an estimated estimator matrix and is any useful estimated matrix for S. To ensure that the estimated matrices and {tilde over ()} appropriately model the real data, the solving steps for matrices S and (performed in any useful order) can be iterated as needed until convergence. Iterations can include estimating estimator matrix {tilde over ()} and then matrix , or vice versa. After convergence, estimator matrix can be converted to matrix A in any useful manner (e.g., by using Eq. 13 to obtain matrix T and Eq. 19 to get matrix A).
(98)
(99) Another exemplary method 9000 (
(100) Convergence and Residual Matrices Solutions for matrices S or A generally include an iterative process, in which i number of steps is repeated until convergence is reached. Convergence can be determined in any useful manner. For instance, the extent of convergence can be determined by calculating a residual matrix R, which represents any data in the data matrix D that is not accounted by estimated matrices for A, S, T, or P:
D=AS.sup.T+R or(Eq. 22a)
D=TS.sup.T+R or(Eq. 22a)
D=TP.sup.T+R.(Eq. 22b)
In essence, residual matrix R accounts for any error between the real data and the model:
R=D{circumflex over (D)},(Eq. 23)
where D is the real data matrix and {circumflex over (D)} is a reconstructed data matrix obtained from any matrices A, S, T, and P obtained by global analysis (e.g., estimated matrices of any of these).
(101) The measure of acceptable error is the convergence criterion, which can be determined in any useful manner. In terms of the residual matrix R, a convergence criterion can include a sum of the squares of the residual matrix or a sum of the square of the two-norm of the difference vector. In one instance, the convergence criterion is a sum of the squares of the residual matrix SSR determined as follows:
SSR=minR.sub.F.sup.2,(Eq. 22)
where subscript F represents the Frobenius norm and this squared Frobenius norm represents the sum of the squared difference over the entire dataset.
(102) In another example, residuals can be determined according to a reconstructed data matrix {circumflex over (D)}, as follows:
{circumflex over (D)}=AS.sup.T or(Eq. 23a)
{circumflex over (D)}=TP.sup.T, and(Eq. 23b)
SSR=minD{circumflex over (D)}.sub.F.sup.2,(Eq. 23c)
in which A, S, T, and P can be any matrices obtained as described herein.
(103) In yet another example, the residual is determined as follows:
R=DAS.sup.T, and(Eq. 24a)
SSR=minDAS.sup.T.sub.F.sup.2,(Eq. 24b)
where D is the real data matrix, A and S can be any matrices obtained as described herein, and r.sub.j is a vector of the residual matrix R, where the trace of a matrix is the sum of its diagonal components.
(104) In other examples, residuals can be determined by individual matrices that represent the entire data matrix D, such by individual matrices A or S. Thus, residuals can be determined as follows:
R=S.sub.i+1S.sub.i or(Eq. 25a)
R=A.sub.i+1A.sub.i, and(Eq. 25b)
SSR=(r.sub.j.sup.Tr.sub.j), where j=1, . . . , p or j=1, . . . , q.(Eq. 25c)
In general, asp and q are less than m and n, calculation of the SSR in Eq. 25c can be less intensive computationally, than calculating of the SSR in Eq. 24b.
(105) Convergence can be quantified by the residual matrix R or SSR, as described herein. The convergence criteria can be assessed numerically, such as by setting a convergence criteria to be an SSR value of 10.sup.3, 10.sup.4, 10.sup.5, 10.sup.6, 10.sup.7, 10.sup.8, 10.sup.9, 10.sup.10, or any other useful value.
(106) Conversion of Matrices into Spectra or Images As described herein, matrices A and S are obtained after analysis of the data matrix D. In addition, the accuracy of A and S in describing the data is measured to the extent that obtained matrices for A and S converge with D according to Eq. 1. Of course, if compression is employed, then matrices T and P will be obtained.
(107) After convergence, obtained matrices (e.g., A, S, T, and/or P) can be converted into spectra or images that are more readily interpretable. The obtained matrices can be used to provide a reconstructed data matrix {circumflex over (D)}, as described herein (e.g., Eqs. 23a and 23b).
(108) Conversion can be implemented in any useful manner. In one instance, each of the q factors can be displayed to show its contribution to an image, such as in
EXAMPLES
Example 1: Global Analysis of NEXAFS Images of Nitrile Samples
(109) We investigated the use of NEXAFS in elucidating how a material ages upon exposure to environmental test conditions. In particular, NEXAFS is an interesting spectroscopy technique that provides information about chemical bonds in various types of materials, including organic materials. Spectral data can be interpreted by one or more peak fitting techniques, in which peaks are identified by a particular line shape and intensity that, in turn, can be correlated with known peaks for known chemical bonds. Thus, peak fitting can help elucidate the nature of bonding in polymers. In conventional analysis, peak fitting is performed on a single spectrum. Here, we propose analyzing NEXAFS image arrays in a global analysis manner. By fitting NEXAFS images and multiple spectra simultaneously, we can provide information about the areal extent of chemical bonding in the material, as well as the presence of mixed species.
(110) The test sample included nitrile rubber o-ring gaskets filled with unknown fillers. O-rings were diced into 1 cm length pieces and then artificially aged under different conditions in air ovens, including 10 days in 125 C. and 21 days in 125 C. Native samples were not aged and employed as controls. After aging, 1 mm slices were extracted away from the end of each piece and placed onto a sample platen for NEXAFS analysis with double-sided copper tape. Samples were analyzed on the LARIAT imaging NEXAFS end station located at the National Synchrotron Light Source at the Brookhaven National Laboratory. Scans were obtained for energy levels from 270 eV to 348 eV with 0.2 eV per step (391 spectral channels) and acquired at a rate of 1 second per frame, 2 frames per step, at a 50V grid bias.
(111) Each individual image was processed to remove outliers, normalized to a baseline intensity, analyzed to determine variance from duplicate frames, and cropped to include only the material or interest. Then, data files were concatenated and processed using the global analysis routine described herein. In particular, 12 factors were employed in this analysis: five symmetrical Voigt peaks, five asymmetrical Voigt peaks, one step function, and one horizontal offset functions. Constraints for symmetrical Voigt peaks required that all have the same peak width and must be at a lower energy than the step function. Constraints for asymmetrical Voigt peaks required that the factors must follow the model in reference and must be at a higher energy than the step function. The step location was based on similar samples noted in the literature (e.g., Dhez O et al., Calibrated NEXAFS spectra of some common polymers, J. Electron Spectrosc. Relat. Phenom. 2003; 128(1):85-96; and Zhou P H et al., Electronic structure of photo-degraded polypropylene ultrathin films, Chem. Phys. Lett. 2008; 465(4-6):241-4), and image-mode factor intensities were scaled to a common total intensity for all samples.
(112) Using the analysis routine described herein, a mean spectrum was analyzed. This analysis provided a reconstructed spectrum (
(113) We further wished to understand how each factor was affected by aging. Polymer aging is an area of active research, and understanding how and why particular bonds or chemical attributes change over use in the field can be helpful in developing new polymers and in assessing the lifetime of a polymeric article.
(114) Although the present application focused on analysis of single images, this method can be applied to multiple images and spectra by providing a data matrix encompassing such multidimensional data. In particular, the methods herein show promise for identifying variation in the spatial domain to aid in data interpretation, where such image inhomogeneities can be the result of interesting dynamic chemical phenomena.
Example 2: Global Analysis of NEXAFS Images of PTFE Samples
(115) Any useful sample can be analyzed with the processes herein. In addition to the nitrile sample, polytetrafluoroethylene (PTFE) samples were analyzed using NEXAFS and global analysis of the NEXAFS images. Provided is a spectrally summed PTFE image (
Other Embodiments
(116) All publications, patents, and patent applications mentioned in this specification are incorporated herein by reference to the same extent as if each independent publication or patent application was specifically and individually indicated to be incorporated by reference.
(117) While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure that come within known or customary practice within the art to which the invention pertains and may be applied to the essential features hereinbefore set forth, and follows in the scope of the claims.
(118) Other embodiments are within the claims.