METABOLOMICS RELATIVE QUANTITATIVE ANALYSIS METHOD BASED ON UPLC/HMRS
20230101558 · 2023-03-30
Inventors
- Ying Xu (Suzhou, Jiangsu, CN)
- Jianhong Ren (Suzhou, Jiangsu, CN)
- Guigang Xie (Suzhou, Jiangsu, CN)
- Zhenli Dai (Suzhou, Jiangsu, CN)
- Mengnan Gu (Suzhou, Jiangsu, CN)
- Congshun Gu (Suzhou, Jiangsu, CN)
Cpc classification
G01N30/8686
PHYSICS
International classification
Abstract
The present invention belongs to the technical field of metabolomics, and relates to a metabolomics relative quantitative analysis method based on UPLC/HRMS. Specifically, the present method mainly comprises the steps of formulating an isotope internal standard mixed solution, a standard curve correction solution, and a metabolomics sample solution; acquiring raw mass spectrum data of the standard curve correction solution and the metabolomics sample solution; transposing and deconvolving the raw data; identifying and selecting the optimal isotope internal standard; fitting a linear equation and calculating the relative quantitative results of metabolites in the metabolomics sample solution; and completing the structural identification of metabolites and differential metabolites in the metabolomics sample solution. The method can meet both qualitative and quantitative requirements using only a high-resolution mass spectrometry platform; the quantitative results are accurate, and the accuracy of the qualitative results are higher, having low costs, simple operation, and wide applicability.
Claims
1. A metabolomics relative quantitative analysis method based on ultra-high performance liquid chromatography/high-resolution mass spectrometry (UPLC/HRMS), comprising: a) formulating an isotope internal standard mixed solution based on multiple isotope internal standards; b) based on a metabolomics sample, determining a relative quantitative correction sample matching with the metabolomics sample, and formulating relative quantitative standard curve correction solutions in a series of concentration gradients by using the relative quantitative correction sample and the isotope internal standard mixed solution in a); c) formulating a metabolomics sample solution by using the metabolomics sample in b) and the isotope internal standard mixed solution in a); d) collecting the raw mass spectrometry data of the metabolomics sample solution in c) and the relative quantitative standard curve correction solutions in a series of concentration gradients in b) by using the UPLC/HRMS; e) acquiring primary mass spectrum transposed data and secondary mass spectrum transposed data based on the raw mass spectrometry data in d), and acquiring deconvolution results comprising a variety of primary variable information based on the primary mass spectrum transposed data; f) combining the deconvolution results of the primary mass spectrum transposed data and the secondary mass spectrum transposed data in e), and referring to the primary variable information and secondary variable information of single isotope internal standard to identify the isotope internal standard, and selecting the optimal isotope internal standard for linear fitting; g) performing linear fitting by using concentrations of the relative quantitative standard curve correction solutions in a series of concentration gradients in b) and the primary variable information acquired in e) to acquire a linear equation; h) based on the linear equation in g), acquiring the relative quantitative results of the metabolomics sample solution in c), and completing principal component analysis and differential metabolites analysis of the primary variable information; and i) completing identification of metabolites and identification of differential metabolites by combining the deconvolution results of the primary mass spectrum transposed data with the secondary mass spectrum transposed data in e).
2. The metabolomics relative quantitative analysis method according to claim 1, wherein the isotope internal standard mixed solution in a) is formulated by the following method: firstly taking an appropriate amount of multiple isotope internal standards, and adding a solvent thereto, respectively, to formulate a given concentration of a mother solution of single isotope internal standard; and then taking an appropriate amount of each of the mother solutions of single isotope internal standard, respectively, mixing same, and adding a solvent thereto to acquire an isotope internal standard mixed solution.
3. The metabolomics relative quantitative analysis method according to claim 1, wherein the metabolomics sample in b) is a serum or plasma sample, and the relative quantitative correction sample is NIST serum.
4. The metabolomics relative quantitative analysis method according to claim 1, wherein the relative quantitative standard curve correction solutions in a series of concentration gradients in b) are formulated by the following method: taking a series of volumes of relative quantitative correction samples, adding an isotope internal standard and a protein precipitation reagent thereto, respectively, and mixing and centrifuging the mixture; taking the supernatant, and concentrating same; adding a reconstituted solvent to the residue, and mixing same; and taking the supernatant to acquire the relative quantitative standard curve correction solutions in a series of concentration gradients.
5. The metabolomics relative quantitative analysis method according to claim 1, wherein the metabolomics sample solution in c) is formulated by the following method: taking a metabolomics sample, adding an isotope internal standard and a protein precipitation reagent thereto, respectively, and mixing and centrifuging the mixture; taking the supernatant, and concentrating same; adding a reconstituted solvent to the residue, and mixing same; and taking the supernatant to acquire the metabolomics sample solution.
6. The metabolomics relative quantitative analysis method according to claim 1, wherein the primary mass spectrum transposed data and the secondary mass spectrum transposed data in e) are acquired by means of data transposition, and the data transposition is completed by a software or an algorithm platform.
7. The metabolomics relative quantitative analysis method according to claim 1, wherein the deconvolution result in e) is acquired by means of deconvolution, and the deconvolution is completed by a software or an algorithm platform.
8. The metabolomics relative quantitative analysis method according to claim 1, wherein the identification of the isotope internal standard and the selection of the optimal isotope internal standard in f) are completed by a self-developed algorithm.
9. The metabolomics relative quantitative analysis method according to claim 1, wherein the principal component analysis in h) is completed by a self-adaptive conversion; the differential metabolites analysis in h) is completed by a multivariate statistical analysis.
10. The metabolomics relative quantitative analysis method according to claim 1, wherein the identification of metabolites and the identification of differential metabolites in i) are completed by a software or an algorithm platform.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
DETAILED DESCRIPTION OF EMBODIMENTS
[Definition of Terms]
[0042] Unless otherwise specified, the term “internal standard” as used herein refers to an appropriate pure product added to a test sample during quantitative analysis for calculating the content of the components to be tested.
[0043] Unless otherwise specified, the term “isotope” as used herein refers to the mutual designation between different nuclides which belong to the same element, and have the same number of protons and different numbers of neutrons. For example, hydrogen element comprises three isotopes: protium (H or .sup.1H), deuterium (D or .sup.2H) and tritium (T or .sup.3H); carbon element comprises three isotopes: carbon 12 (.sup.12C), carbon 13 (.sup.13C) and carbon 14 (.sup.14C); nitrogen element comprises two isotopes: nitrogen 14 (.sup.14N) and nitrogen 15 (.sup.15N); and oxygen element comprises three isotopes: oxygen 16 (.sup.16O), oxygen 17 (.sup.17O) and oxygen 18 (.sup.18O).
[0044] Unless otherwise specified, the term “isotope internal standard” as used herein refers to a stable internal standard substance formed after replacing at least isotope with a higher abundance contained in the structure with an isotope with a lower abundance. For example, in the molecular structure of choline chloride as the internal standard, if all of the nine hydrogen atoms (or protium atoms) contained in the three methyl groups are replaced by deuterium atoms, the corresponding isotope internal standard—choline chloride-trimethyl-d9 can be acquired. For another example, in the molecular structure of L-valine as the internal standard, if the carbon 12 atom contained in the carboxyl group is replaced by the carbon 13 atom, the corresponding isotope internal standard—L-valine-1-13C can be acquired. According to different metabolic functions, the isotope internal standards in the present invention can be divided into carbohydrates (such as D-fructose-1-13C), energy (such as 5′-adenosine triphosphate-15N5), nucleotides (such as uridine-2-13C), amino acids (such as L-valine-1-13C), cofactors and vitamins (such as vitamin B3-d4) and many other categories.
[0045] Unless otherwise specified, the term “NIST serum” as used herein refers to Frozen Human Serum (abbreviated as FHS) provided or sold by the National Institute of Standards and Technology (abbreviated as NIST).
[0046] Unless otherwise specified, the term “protein precipitation reagent” as used herein refers to a reagent added to a biological sample to precipitate proteins in the sample via denaturation. Common protein precipitation reagents include but are not limited to salts (such as ammonium sulfate, sodium sulfate, and sodium chloride), organic solvents (such as methanol, acetonitrile, and acetone), etc.
[0047] Unless otherwise specified, the term “metabolomics sample” as used herein refers to a biological sample used for metabolomics research. Common metabolomics samples include but are not limited to animal samples (such as blood, urine, feces, saliva, hair, and cells of humans or animals), plant samples (such as roots, stems, leaves, flowers, fruits, and seeds of plants), microbial samples (such as cells, spores, fermentation broth, and culture solutions of microorganisms), subcellular structure samples (such as mitochondria, exosomes, and vesicles of organelles), etc.
[0048] Unless otherwise specified, the term “quality control sample” (or “QC sample”) as used herein refers to a mixed sample acquired by mixing a certain amount of experimental sample for the purpose of monitoring the stability of the data collected by the instrument during the experiment, substantially checking chromatographic peak profiles of experimental samples and calibrating non-serum or plasma samples in the relative quantitative detection.
[0049] Unless otherwise specified, the term “primary mass spectrometry full scanning and data-dependent secondary mass spectrometry scanning mode” (or “full MS-ddMS2”) as used herein refers to a mode in which the ions are fragmented by the instrument while primary mass spectrum data is collected, so as to generate secondary mass spectrum fragments. The primary and secondary mass spectrum data can be acquired simultaneously. Preferably the top 3 ions in signal intensity within the specified time are subjected to secondary fragmentation, the selected top 3 ions in signal intensity are excluded firstly when cycling to the next time period, and additional top 3 ions in signal intensity are screened for secondary fragmentation again, and so on until the end of time periods.
[0050] Unless otherwise specified, the term “transposed data” as used herein refers to the conversion of raw mass spectrometry data generated by different mass spectrometers into normalized and text-processable mass spectrum data; the term “primary mass spectrum transposed data” refers to the mass spectrum transposed data comprising the primary molecular information of the detection target; the term “secondary mass spectrum transposed data” refers to the mass spectrum transposed data comprising the secondary molecular information of the detection target; and correspondingly, the term “numerical transposition” refers to the process of converting raw mass spectrometry data into normalized and text-processable data.
[0051] Unless otherwise specified, the term “deconvolution” as used herein refers to the separation and construction of information of each metabolite parent ion, including mass-to-charge ratios, retention times, and corresponding chromatographic peak information, etc., from primary mass spectrum transposed data.
[0052] Unless otherwise specified, the term “primary variable” as used herein refers to the mass-to-charge ratio and corresponding chromatographic peak information of all primary mass spectrum data (parent ions) collected by the instrument. In the field of metabolomics, common primary variables include but are not limited to primary variable number, parent ion mass-to-charge ratio, retention time, peak area value, etc.
[0053] Unless otherwise specified, the term “structure identification” (or “metabolite identification”) as used herein refers to the derivation process of comparing of secondary mass spectrum fragments (parent ion fragments) of a chromatographic peak to secondary mass spectrum fragments in a public database (e.g., KEGG) and obtaining the substance structure corresponding primary mass spectrum data (parent ions). Substances after structure identification will match the numbers in the corresponding public databases.
[0054] Unless otherwise specified, the term “relative quantitative concentration” (or “relative quantitative result”) as used herein refers to the relative concentration based on the NIST serum sample substance concentration acquired by the following steps: the concentrations of the metabolites in the NIST serum sample are linearly fitted with the ratios of the peak areas of the metabolites to the peak areas of the isotope internal standards to acquire a linear equation, and then the peak area ratio (or primary variable) of the metabolites in the metabolomics samples and the isotope internal standards is substituted into the linear equation to calculate the relative concentration based on the NIST serum sample substance concentration.
[0055] Unless otherwise specified, the term “primary analysis” as used herein refers to the process of statistical analysis based on the data of primary variables; the term “principal component analysis” refers to the following process: generating new characteristic variables by linear combination of metabolite variables according to a certain weight, and classifying each group of data through the main new variables (principal components); the term “self-adaptive conversion” refers to the data normalization process that performs weight conversion on the primary variable data; the term “differential metabolites analysis” (or “differential analysis of metabolite primary variables”) refers to the analysis of differences between groups on the information of primary variables of metabolites by using statistical difference analysis methods; and the term “multivariate statistical analysis” refers to the analysis of statistical patterns of multiple objects and multiple indicators when they are related to each other.
[0056] Unless otherwise specified, the term “secondary analysis” as used herein refers to the process of performing statistical analysis in combination with secondary mass spectrum data on the basis of primary analysis; and the term “differential metabolite identification” refers to the structure identification based on the differential metabolites of the primary analysis.
[0057] Specifically, the present invention discloses a metabolomics relative quantitative analysis method based on UPLC/HRMS. The method comprises the following steps: [0058] 1) formulating an isotope internal standard mixed solution based on multiple isotope internal standards; [0059] 2) based on a metabolomics sample, determining a relative quantitative correction sample matching with the metabolomics sample, and formulating relative quantitative standard curve correction solutions in a series of concentration gradients by using the relative quantitative correction sample and the isotope internal standard mixed solution in step 1); [0060] 3) formulating a metabolomics sample solution by using the metabolomics sample in step 2) and the isotope internal standard mixed solution in step 1); [0061] 4) collecting the raw mass spectrometry data of the metabolomics sample solution in step 3) and the relative quantitative standard curve correction solutions in a series of concentration gradients in step 2) by using the UPLC/HRMS platform; [0062] 5) acquiring primary mass spectrum transposed data and secondary mass spectrum transposed data based on the raw mass spectrometry data in step 4), and acquiring deconvolution results comprising a variety of primary variable information based on the primary mass spectrum transposed data; [0063] 6) combining the deconvolution results of the primary mass spectrum transposed data and the secondary mass spectrum transposed data in step 5), and referring to the primary variable information and secondary variable information of single isotope internal standard to identify the isotope internal standard, and selecting the optimal isotope internal standard for linear fitting; 0
[0064] 7) performing linear fitting by using concentrations of the relative quantitative standard curve correction solutions in a series of concentration gradients in step 2) and the primary variable information acquired in step 5) to acquire a linear equation; [0065] 8) based on the linear equation in step 7), acquiring the relative quantitative results of the metabolomics sample solution in step 3), and completing principal component analysis and differential metabolites analysis of the primary variable information; and [0066] 9) completing identification of metabolites and identification of differential metabolites by combining the deconvolution results of the primary mass spectrum transposed data with the secondary mass spectrum transposed data in step 5).
[0067] In some preferred embodiments, the isotope internal standard mixed solution in step 1) is formulated by the following method: firstly taking an appropriate amount of several (e.g., nine) isotope internal standards, and adding a solvent (e.g., water or methanol) thereto, respectively, to formulate a certain concentration (e.g., 1-15 mg/mL) of a mother solution of single isotope internal standard; and then taking an appropriate amount of each of the mother solutions of single isotope internal standard, respectively, mixing same, and adding a solvent (e.g., water or methanol) thereto to acquire an isotope internal standard mixed solution (the quantity and type of the isotope internal standard and the concentration of the mother solution of single isotope internal standard can be appropriately adjusted according to the specific type of metabolites in the metabolomics sample and the actual concentration response).
[0068] In some preferred embodiments, the metabolomics sample in step 2) is a serum or plasma sample, and in this case, NIST serum is used as a relative quantitative correction sample. In other embodiments, the metabolomics samples in step 2) are other samples (such as cells, tissues, organs or metabolites of animals or plants), and in this case, the quality control sample (which is formulated by mixing appropriate amounts of multiple metabolomics samples) is used as the relative quantitative correction sample.
[0069] In some preferred embodiments, the relative quantitative standard curve correction solutions in a series of concentration gradients in step 2) are formulated by the following method: taking a series of (e.g., seven) volumes of relative quantitative correction samples (which are determined based on the specific type of metabolomics samples, and the volume thereof can be appropriately adjusted according to the actual needs), adding an isotope internal standard and a protein precipitation reagent (e.g., methanol) thereto, respectively, and mixing and centrifuging the mixture; taking the supernatant, and concentrating same; adding a reconstituted solvent (e.g., 80% methanol in water (v/v)) to the residue, and mixing same; and taking the supernatant to acquire relative quantitative standard curve correction solutions in a series of concentration gradients.
[0070] In some preferred embodiments, the metabolomics sample solution in step 3) is formulated by the following method: taking a metabolomics sample (the number and the type of samples and the sampling amount of a single sample can be appropriately adjusted according to actual needs; in addition, according to the differences in the properties or types of the samples, the samples can be measured with a pipette or weighed with a balance), adding an isotope internal standard and a protein precipitation reagent (e.g., methanol) thereto, respectively, and mixing and centrifuging the mixture; taking the supernatant, and concentrating same; adding a reconstituted solvent (e.g., 80% methanol in water (v/v)) to the residue, and mixing same; and taking the supernatant to acquire a metabolomics sample solution.
[0071] In some preferred embodiments, the raw mass spectrometry data in step 4) needs to be collected by means of an ultra-high performance liquid chromatograph coupled with a high-resolution mass spectrometer (abbreviated as a “UPLC/HRMS platform”) and under certain chromatographic and mass spectrometry conditions. For example, in the chromatographic conditions, the chromatographic column used may be an octadecylsilane-bonded silica gel column (e.g., a Waters ACQUITY UPLC BEH C18 chromatographic column), and the mobile phase used may be a binary mobile phase system (e.g., a formic acid solution in water/a formic acid solution in acetonitrile in positive ion mode, or ammonium formate solution in water/acetonitrile in negative ion mode), the elution mode used may be gradient elution, and conditions such as flow rate, injection volume and column temperature can be appropriately adjusted according to the actual chromatographic behavior of the substance to be tested; In the mass spectrometry conditions, full MS-ddMS2 may be used for data collection, an ion source for soft ionization (e.g., an electrospray ion source/ESI) may be used as an ion source, conditions such as spray voltage, sheath gas/auxiliary gas pressure, capillary temperature, collision voltage and scanning resolution and range can be appropriately adjusted according to the actual chromatographic behavior of the substance to be tested.
[0072] In some preferred embodiments, the primary mass spectrum transposed data and the secondary mass spectrum transposed data in step 5) are acquired by means of data transposition. The data transposition process usually needs to be completed by software or an algorithm platform, such as the open source ProteoWizard software.
[0073] In some preferred embodiments, the deconvolution result in step 5) is acquired by means of deconvolution. The deconvolution process usually needs to be completed by software or an algorithm platform, such as a BioDeep metabolomics cloud analysis platform (abbreviated as a BioDeep cloud platform) or open source R software (http://www.bioconductor.org/).
[0074] In some preferred embodiments, the identification of the isotope internal standard in step 6) is completed by a self-developed algorithm. In the self-developed algorithm, the deconvolution results of the primary mass spectrum transposed data are compared with the secondary mass spectrum transposed data (for example, firstly acquiring the rt (retention time), mz (mass-to-charge ratio of the parent ion) and ms2 (secondary fragments) of a single isotope internal standard, and then referring to the rt, mz and ms2 of the single isotope internal standard in the metabolomics sample solution, the quality control sample solution and the relative quantitative standard curve correction solution, to ensure that the error range of the corresponding mz is within 10 ppm, the difference value of the corresponding rt is minimum, and ms2 can be exactly matched) to complete the identification of the isotope internal standard.
[0075] In some preferred embodiments, the optimal isotope internal standard in step 6) is selected by the following method: the peak area of the primary variable in the NIST serum sample (or the relative quantitative correction sample) is compared with the peak area of the primary variable of each internal standard in the isotope internal standard mixed solution to form a ratio, and then linearly fitted with the concentration of the NIST serum sample (or relative quantitative correction sample) to acquire the correlation coefficient r, preferred is the isotope internal standard which has the largest r value and the concentration of the primary variable in the NIST serum sample (or relative quantitative correction sample) ranging from 0.05 P to 5 P.
[0076] In some preferred embodiments, the principal component analysis in step 8) is completed by self-adaptive conversion. In the self-adaptive conversion, the value is acquired by subtracting the mean of each group of data from the original observation value and dividing by the standard deviation.
[0077] In some preferred embodiments, the differential metabolites analysis in step 8) is completed by a multivariate statistical analysis. The multivariate statistical analysis uses multivariate statistical analysis methods such as principal component analysis, t-test and VIP (spatial projection importance or difference weight contribution value) calculation to screen differential metabolites, preferred are the differential metabolites which have a t-test result of P value≤0.05 and meet the requirement of VIP≥1.
[0078] In some preferred embodiments, the identification of metabolites and the identification of differential metabolites in step 9) are completed by software or an algorithm platform, such as a BioDeep metabolomics cloud analysis platform (abbreviated as a BioDeep cloud platform).
[0079] The relative quantitative metabolomics analysis method of the present invention will be further illustrated below by specific examples. Unless otherwise specified, the materials, reagents, instruments, software, etc., used in the examples can be acquired by conventional commercial means.
EXAMPLE
[0080] Establishment of a metabolomics relative quantitative analysis method for healthy person and cerebral infarction patient serum based on UPLC/HRMS.
[0081] The process of establishing a metabolomics semi-quantitative analysis method for healthy person and cerebral infarction patient serum based on UPLC/HRMS is as shown in
1. Preparation of Internal Standards
1.1 Preparation of an Isotope Internal Standard Mixed Solution:
[0082] an appropriate amount of 9 isotope internal standards such as D-fructose-1-13C (F-13C), L-valine-1-13C (Val-13C), choline chloride-trimethyl-d9 (Choline-d9), 5′-adenosine triphosphate-15N5 (ATP-15N5), L-phenylalanine-1-13C (Phe-13C), uridine-2-13C (U-2-13C), vitamin B3-d4 (VB3-d4), L-carnitine hydrochloride-methyl-d3 (L-carnitine-d3) and betaine hydrochloride-trimethyl-d9 (betaine-d9) were taken, and water or methanol was added thereto, respectively, to prepare a mother solution of single isotope internal standard with a concentration of 1-15 mg/mL; then an appropriate amount of each of the mother solutions of 9 single isotope internal standard was taken, respectively, and mixed; and an appropriate amount of water or methanol were added thereto to prepare an isotope internal standard mixed solution of 9 isotope internal standards having a concentration ranging from 1-50 μg/mL.
1.2 Preparation of an NIST Serum Standard Curve Correction Solution:
[0083] NIST serums (Lot. No.: SRM 909c) (i.e., the NIST serum sample in
2. Preparation of Samples:
[0084] preparations of serum metabolomics sample solutions from healthy person and cerebral infarction patient as well as quality control sample solutions:
[0085] 10 serum samples from healthy person and 10 serum samples from cerebral infarction patient were selected as metabolomics samples (20 in total); 20 μL of each metabolomics sample was taken (400 μL in total) and mixed thoroughly, and then used as quality control samples (i.e., the QC samples in
3. Selection of Instruments and Condition Parameters Thereof:
3.1 Selected Instruments:
[0086] Thermo Scientific Ultimate 3000 liquid chromatograph coupled with Thermo Scientific Q-Exactive mass spectrometer.
3.2 Selected Liquid Chromatographic Conditions:
[0087] chromatographic column: Waters ACQUITY UPLC BEH C18 chromatographic column (100 mm×2.1 mm, 1.7 μm); mobile phase: in positive ion mode, 0.1% formic acid solution in water (1 mL of formic acid per 1 L of solution) was used as mobile phase A, and 0.1% formic acid solution in acetonitrile (1 mL of formic acid per 1 L of solution) was used as mobile phase B; in negative ion mode, 5 mM ammonium formate solution in water (5 mM ammonium formate per 1 L of solution) was used as mobile phase A and acetonitrile was used as mobile phase B;
[0088] elution mode: gradient elution was used, and the specific procedure was as follows: 0-1 min, 2% of mobile phase A; 1-9 min, 2%-50% of mobile phase A; 9-12 min, 50%-98% of mobile phase A; 12-13.5 min, 98% of mobile phase A; 13.5-14 min, 98%-2% of mobile phase A; 14-20 min, 2% of mobile phase A;
[0089] flow rate: 0.25 mL/min;
[0090] injection volume: 2 μL;
[0091] and column temperature: 40° C.
3.3 Selected Mass Spectrometry Conditions:
[0092] data collection method: primary mass spectrometry full scanning and data-dependent secondary mass spectrometry scanning (full MS-ddMS2) mode was used for determination to acquire primary and secondary mass spectrum data simultaneously;
[0093] ion source: electrospray ion source (ESI);
[0094] spray voltage: 3.5 kV in positive ion mode, and 2.5 kV in negative ion mode;
[0095] sheath gas pressure: 30 arb;
[0096] auxiliary gas pressure: 10 arb;
[0097] capillary temperature: 325° C.;
[0098] the full scanning resolution is 70000 FWHM, and the mass spectrum scanning range is m/z 81-1000;
[0099] secondary fragmentation was performed using high energy collision induced dissociation (HCD) with a collision voltage of 30 eV, and unnecessary secondary mass spectrum information was dynamically excluded.
4. Data Processing:
4.1 Transposition of Raw Mass Spectrometry Data:
[0100] in positive and negative ion modes, *.raw files comprising raw mass spectrometry data of 20 serum metabolomics sample solutions from healthy person and cerebral infarction patient and 4 QC sample solutions prepared in section 2 and 7 NIST serum standard curve correction solutions prepared in section 1.2 (31 sample solutions in total) were acquired, respectively. The original data files were subjected to data transposition by ProteoWizard software (open source software, available through http://proteowizard.sourceforge.net/) to form *.mzXML files comprising primary mass spectrum transposed data and secondary mass spectrum transposed data in positive and negative ion modes, respectively.
4.2 Data Deconvolution After Transposition:
[0101] the *.mzXML files which comprise primary mass spectrum transposed data in positive and negative ion modes and formed in section 4.1 were deconvoluted by the biodeep cloud platform (the corresponding open source software and scripts also can be downloaded through http://www.bioconductor.org/) to acquire 13128 primary variables in positive ion mode, and 17272 primary variables in negative ion mode, and corresponding files peaktable_pos.xlsx and peaktable_neg.xlsx were output as the results of primary deconvolution, wherein both files comprise primary variable information such as xc_ms id (primary variable number), mz (parent ion mass-to-charge ratio), rt (retention time), and intensity (peak area).
4.3 Identification and Preferred Selection of the Isotope Internal Standard:
[0102] the file peaktable_pos.xlsx or peaktable_neg.xlsx output in section 4.2 was combined with the secondary mass spectrum transposed data formed in section 4.1, respectively. Using the self-developed algorithm and referring to the rt (retention time), mz (parent ion mass-to-charge ratio) and ms2 (secondary fragmentation) of single isotope internal standard, it is ensured that the error range of mz in human serum metabolomics samples, QC samples and NIST serum samples was within 10 ppm, the difference value between the retention time of these samples and the retention time of the reference isotope internal standard was minimum, and secondary fragments can be exactly matched. 5 isotope internal standards, including internal standard substances Val-13C, Choline-d9, Phe-13C, L-carnitine-d3 and betaine-d9 were identified in positive ion mode, and 5 isotope internal standards, including F-13C, Val-13C, Phe-13C, U-2-13C and VB3-d4 were identified in negative ion mode. Correspondingly, an IS.xlsx file was output, which comprises primary variable information such as IS xc_ms id (isotope internal standard primary variable number), IS_name (isotope internal standard name), IS mz (isotope internal standard parent ion mass-to-charge ratio), rt (isotope internal standard retention time), and intensity (isotope internal standard peak area contained in NIST serum samples).
[0103] A suitable isotope internal standard is preferably selected by using a self-developed algorithm. The specific procedure is as follows: the ratios of the primary variable intensity (peak area) of the NIST serum samples in the file peaktable_pos.xlsx or peaktable_neg.xlsx output in section 4.2 to the peak areas of the 5 isotope internal standards, including Val-13C, Choline-d9, Phe-13C, L-carnitine-d3 and betaine-d9, identified in positive ion mode (5 isotope internal standards, including F-13C, Val-13C, Phe-13C, U 13C and VB3-d4 identified in negative ion mode) were linearly fitted with the 7 NIST serum standard curve correction concentrations, respectively. Each primary variable would acquire 5 linear equations and correlation coefficient r, preferred was the isotope internal standard having the largest correlation coefficient r value, and secondly, having primary variable concentrations ranging from 0.05 P to 5 P.
4.4 Linear Fitting of Primary Variables in NIST Serum Samples:
[0104] the peak areas of the metabolites in the NIST serum standard curve correction solution acquired in section 4.2 were ranked in ascending order, the concentration of 100 μL of NIST serum was P, the corresponding concentrations of 5, 10, 50, 100, 200, 300, and 500 μL of NIST serum were 0.05 P, 0.1 P, 0.5 P, P, 2 P, 3 P, and 5 P, respectively, and the concentration order was used as the concentration points for linear fitting. Then, the ratios of the peak areas of the metabolites in the NIST serum standard curve correction solutions to the peak area of each preferred isotope internal standard were linearly fitted with the above-mentioned 7 concentration points, and the linear equation and the correlation coefficient r were calculated, preferred was the isotope internal standard substance having the largest correlation coefficient r, and secondly, having primary variable concentrations ranging from 0.05 P to 5 P. In positive ion mode, there are 8558 primary variables with the correlation coefficient r greater than 0.80; and in negative ion mode, there are 5151 primary variables with the correlation coefficient r greater than 0.80.
4.5 Calculation of Quantitative Concentrations of Primary Variables in Metabolomics Samples:
[0105] the ratios of the peak areas of the metabolites in the metabolomics sample solutions acquired in section 4.2 to the peak areas of the preferred isotope internal standards in the NIST serum standard curve correction solutions were substituted into the linear equation acquired in section 4.4 to acquire a file biodeepProfiler_pos.xlsx or biodeepProfiler_neg.xlsx comprising the relative quantitative concentrations of metabolites in metabolomics samples (in P as concentration unit).
4.6 Analysis of Primary Mass Spectrum Data:
[0106] data adaptive (UV) conversion processing was selected for the file biodeepProfiler_pos.xlsx or biodeepProfiler_neg.xlsx acquired in section 4.5, respectively to acquire the base peak ion (BPI) chromatograms (as shown in
4.7 Analysis of Secondary Mass Spectrum Data-Identification of Metabolites:
[0107] the deconvoluted primary variables acquired in section 4.2 were combined with the secondary mass spectrum transposed data formed in section 4.1 to perform metabolite identification, thereby acquiring a biodeepMSMS_union.xlsx file comprising the results of metabolite identification, and positive ion mode and negative ion mode metabolites are pooled to obtain a total of 304 metabolites.
4.8 Analysis of Secondary Mass Spectrum Data-Identification of Differential Metabolites:
[0108] the metabolites screened in section 4.7 were integrated with the analysis results of primary variable differential metabolites acquired in section 4.6, and a total of 33 differential metabolites with determined substance structure were acquired.
5. Comparison of Metabolomics Relative Quantitative Analysis and Targeted Absolute Quantitative Analysis:
[0109] 4 choline analogs, including choline, betaine, L-carnitine and creatinine were used as metabolites for the comparison of metabolomics relative quantitative analysis and targeted absolute quantitative analysis results.
5.1 Targeted Absolute Quantitative Analysis:
5.1.1 Preparation of Samples for Targeted Absolute Quantitative Analysis:
[0110] the samples for targeted absolute quantitative analysis were consistent with the samples for metabolomics relative quantitative analysis. In the preparation of metabolomics samples, one more sample was prepared in parallel, respectively. The samples included serum samples of 10 healthy person and 10 cerebral infarction patient, 7 NIST serum samples and 4 QC samples, and the sample preparation process was the same as section 1 and section 2.
5.1.2 Targeted Absolute Quantitative Analysis Method:
5.1.2.1 Selected Instruments:
[0111] Waters ACQUITY UPLC liquid chromatograph coupled with AB 4000 mass spectrometer.
5.1.2.2 Selected Liquid Chromatographic Conditions:
[0112] chromatographic column: ACQUITY UPLC® BEH HILIC chromatographic column (100 mm×2.1 mm, 1.7 μm);
[0113] mobile phase: acetonitrile was used as mobile phase A, and 0.1% formic acid solution in water and 10 mM ammonium formate solution in water (1 mL formic acid and 10 mM ammonium formate per 1 L solution, respectively) were used as mobile phase B;
[0114] elution mode: gradient elution was used, and the specific procedure was as follows: 0-1 min, 80% of mobile phase A; 1-2 min, 80%-70% of mobile phase A; 2-2.5 min, 70% of mobile phase A; 2.5-3 min, 70%-50% of mobile phase A; 3-3.5 min, 50% of mobile phase A; 3.5-4 min, 50%-80% of mobile phase A; 4-6 min, 80% of mobile phase A;
[0115] flow rate: 0.40 mL/min;
[0116] injection volume: 5μL;
[0117] and column temperature: 40° C.
5.1.2.3 Selected Mass Spectrometry Conditions:
[0118] data collection method: multiple reaction monitoring (MRM) was used for scanning and data collection;
[0119] ion source: electrospray ion source (ESI);
[0120] spray voltage: 5.0 kV in positive ion mode;
[0121] ion source temperature: 500° C.;
[0122] collision gas: 6 psi;
[0123] curtain gas: 30 psi;
[0124] atomizing gas and auxiliary gas: 50 psi;
[0125] and the ion pairs used for quantitative analysis were choline (104.075≥44.9), betaine (118.080≥41.9), L-carnitine (162.023≥59.9) and creatinine (114.025≥86.1), respectively, and the ion pairs for isotope internal standard collection were choline-d9 (113.030≥69.0), betaine-d9 (127.064≥68.0) and L-carnitine-d3 (165.105≥103.1), respectively, wherein the internal standard corresponding to creatinine was L-carnitine-d3.
5.1.3 Targeted Absolute Quantitative Data Processing:
[0126] peak area values of choline, betaine, L-carnitine, creatinine, choline-d9, betaine-d9 and L-carnitine-d3 in serum samples of 10 healthy person and 10 cerebral infarction patient, 7 NIST serum samples and 4 QC samples were derived by using a self-developed algorithm. NIST serum sample concentrations 0.05 P, 0.1 P, 0.5 P, P, 2 P, 3 P and 5 P were used as independent variable gradient concentration points, the ratios of the peak areas of choline, betaine, L-carnitine, creatinine in NIST serum samples to the peak areas of the corresponding isotope internal standards were used as dependent variables, and the independent variables and the dependent variables were linearly fitted to acquire the linear equation and the correlation coefficient r. The ratios of the peak areas of choline, betaine, L-carnitine and creatinine in the metabolomics samples to the peak areas of the corresponding isotope internal standards were substituted into the linear equation to calculate the absolute quantitative concentrations of metabolites in the metabolomics samples.
5.2 Comparison of the Results of Metabolomics Relative Quantitative Analysis and targeted Absolute Quantitative Analysis:
[0127] 5.2.1 The metabolomics relative quantitative analysis took all metabolites as
[0128] research objects. In order to acquire the relative quantitative analysis results of the 4 metabolites choline, betaine, L-carnitine, and creatinine, the results acquired in section 4.5 and section 4.7 were subjected to integration analysis to acquire the relative quantitative concentrations of these 4 metabolites.
[0129] 5.2.2 Correlation analysis for the absolute quantitative concentrations of the 4 metabolites in serum samples of the 10 healthy person and 10 cerebral infarction patient acquired in section 5.1.3 and the relative quantitative concentrations acquired in section 5.2.1 was performed to acquire the scatter plots as shown in
[0130] 5.2.3 Furthermore, the relative standard deviations (RSD, %) of the absolute quantitative concentrations of the 4 metabolites in the serum samples of 10 healthy person and 10 cerebral infarction patient acquired in section 5.1.3 and the relative quantitative concentrations acquired in section 5.2.1 and two-sided paired test results are shown in Table 1. As shown in Table 1, there is no significant difference between the RSD values of the concentrations of the two detection results for the healthy person and cerebral infarction patient groups (P value of greater than 0.05), indicating that the two detection results are similar, and the metabolomics relative quantitative analysis method of the present invention has higher accuracy. Moreover, the metabolomics relative quantitative analysis method of the present invention can simultaneously realize the qualitative analysis of metabolites based on the same analysis and testing platform.
TABLE-US-00001 TABLE 1 Comparison results of RSD values of quantitative concentrations of 4 metabolites acquired by two detection methods RSD (%) Cerebral Healthy person (10) infarction patient (10) Absolute Relative Absolute Relative Metabolites quantification quantification quantification quantification Choline 18.11 16.99 24.02 22.07 Betaine 21.93 23.55 15.34 19.90 L-Carnitine 20.48 20.78 11.96 12.09 Creatinine 10.78 11.04 31.39 30.86 P value 0.6700 0.7197