Device for genotypic analysis and method for genotypic analysis
09605300 ยท 2017-03-28
Assignee
Inventors
Cpc classification
G01N21/6428
PHYSICS
G16B20/20
PHYSICS
G16B20/00
PHYSICS
International classification
G01J3/30
PHYSICS
Abstract
A technique for performing spectral calibration simultaneously with electrophoresis of an actual sample to be analyzed, without performing electrophoresis using a special matrix standard which is time-consuming and costly, is provided. The device for genotypic analysis is characterized by obtaining reference fluorescence spectra using a size standard and an allelic ladder, which provide information concerning known DNA fragments used for electrophoresis of an actual sample, and is characterized by performing spectral calibration for a capillary in which the allelic ladder is not used by detecting a shift amount of the fluorescence spectra of the size standard and shifting the reference fluorescence spectra using the shift amount to determine fluorescence spectra.
Claims
1. A method for genotypic analysis characterized by introducing a DNA sample containing DNA fragments labeled with two or more fluorescent dyes to a flow path array containing two or more flow paths, irradiating the DNA fragments with excitation light while moving the DNA fragments through the flow path array using electrophoresis means, calculating fluorescence intensity waveforms of the DNA fragments based on spectra obtained by separating light emitted from the flow path array by the irradiation, and determining fluorescence spectra of the fluorescent dyes based on the calculated fluorescence intensity waveforms.
2. The method for genotypic analysis described in claim 1, characterized by preparing an actual sample which is a subject of the genotype analysis as the DNA sample, extracting a spectrum of light emitted from one fluorescent dye from the spectra of the actual sample obtained by the separation and determining the fluorescence spectra based on the extracted spectrum.
3. The method for genotypic analysis described in claim 2, characterized by extracting the spectrum of light emitted from one fluorescent dye by searching a false peak of the fluorescence intensity waveforms.
4. The method for genotypic analysis described in claim 3, characterized by repeating the false peak search and the calculation of the fluorescence intensity waveforms until the condition that the false peak is not detected or the condition that a repetition number reaches a set upper limit is met.
5. The method for genotypic analysis described in claim 2, characterized in that the fluorescence spectra are fluorescence spectra which are determined previously.
6. The method for genotypic analysis described in claim 5, characterized in that a detection condition of the false peak or a detection result of the false peak is displayed during the false peak search and a user can modify the detection condition or the detection result referring to the display.
7. A device for genotypic analysis having a light source applying excitation light, an electrophoresis apparatus having a flow path array containing two or more flow paths, a voltage source applying voltage to both ends of the flow path array and a light detector detecting light emitted from the flow path array, and a data analyzer sending information to and receiving information from the electrophoresis apparatus, characterized by introducing a DNA sample containing DNA fragments labeled with two or more fluorescent dyes to the flow path array, irradiating the flow path array with the excitation light from the light source while moving the DNA fragments through the flow path array using the voltage source, separating light emitted from the flow path array by the irradiation and imaging spectra on the light detector, and calculating fluorescence intensity waveforms of the DNA fragments based on the imaged spectra with the data analyzer and determining fluorescence spectra of the fluorescent dyes based on the calculated fluorescence intensity waveforms.
8. The device for genotypic analysis described in claim 7, characterized in that an actual sample which is a subject of the genotype analysis is used as the DNA sample, and a spectrum of light emitted from one fluorescent dye is extracted from the spectra of the actual sample obtained by the separation and the fluorescence spectra are determined based on the extracted spectrum with the data analyzer.
9. The device for genotypic analysis described in claim 8, characterized by extracting the spectrum of light emitted from one fluorescent dye by searching a false peak of the fluorescence intensity waveforms with the data analyzer.
10. The device for genotypic analysis described in claim 9, characterized by repeating the false peak search and the calculation of the fluorescence intensity waveforms until the condition that the false peak is not detected or the condition that a repetition number reaches a set upper limit is met with the data analyzer.
11. The device for genotypic analysis described in claim 8, characterized in that the fluorescence spectra are fluorescence spectra which are determined previously.
12. The device for genotypic analysis described in claim 11, characterized in that a detection condition of the false peak or a detection result of the false peak is displayed during the false peak search with the data analyzer and a user can modify the detection condition or the detection result referring to the display.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1) [
(2) [
(3) [
(4) [
(5) [
(6) [
(7) [
(8) [
(9) [
(10) [
(11) [
(12) [
(13) [
(14) [
(15) [
(16) [
(17) [
(18) [
(19) [
(20) [
(21) [
(22) [
(23) [
(24) [
(25) [
(26) [
(27) [
(28) [
(29) [
(30) [
(31) [
(32) [
(33) [
(34) [
(35) [
(36) [
DESCRIPTION OF EMBODIMENTS
(37) The invention provides a technique for performing spectral calibration simultaneously with electrophoresis of an actual sample.
(38) Examples are explained below, referring to the attached drawings. However, attention should be paid to the points that the Examples are merely examples for achieving the invention and do not limit the technical scope of the invention. Moreover, a same component in the figures is given a same reference number.
Example 1
(39) A constitution of the genotype analyzer (or device for genotypic analysis) of Example 1 of the invention is shown in
(40) The central control unit 102 is composed of a sample information setting unit 106, an electrophoresis apparatus control unit 108, a fluorescence intensity calculation unit 110, a peak detection unit 107, a matrix calculation unit 109 and an STR analysis unit 111. The respective functions will be described below.
(41)
(42) The electrophoresis apparatus 105 is composed of a detection unit 516 for optically detecting a sample, a thermostatic bath 518 for keeping a capillary at a constant temperature, a carrier 525 for carrying various vessels to a capillary cathode end, a high voltage power supply 504 for applying high voltage to the capillary, a first ammeter 505 for detecting a current generated from the high voltage power supply, a second ammeter 512 for detecting a current flowing to the anode-side electrode, a capillary array 517 composed of one or more capillaries 502 and a pumping mechanism 503 for injecting a polymer to a capillary.
(43) The capillary array 517 is an exchangeable member containing two or more (for example, eight) capillaries and contains a load header 529, the detection unit 516 and a capillary head. The capillary array 517 is replaced with a new capillary array when a capillary is damaged or the quality deteriorates.
(44) Each capillary is made of a glass tube having an internal diameter of several dozen to several hundred microns and an external diameter of several hundred microns, and its surface is coated with a polyimide to improve the strength. In this regard, however, the polyimide coating is removed from a light irradiation part which is irradiated with laser light, so that light emitted from inside easily leaks to the outside. The inside of the capillary 502 is filled with a separation medium for causing a difference in the migration speed during electrophoresis. There are fluid and non-fluid separation media, but a fluid polymer is used in this Example.
(45) The detection unit 516 is a member which obtains information depending on the sample. When excitation light is applied from a light source 514 to the detection unit 516, information light (fluorescence having a wavelength depending on the sample) is generated from the sample and emitted. The information light is separated in the wavelength direction with a diffraction grating 532, and the separated information light is detected with an optical detector 515 to analyze the sample.
(46) Each capillary cathode end 527 is fixed through a hollow electrode 526 made of metal, and the tip of the capillary of about 0.5 mm stands out from the hollow electrode 526. All of the hollow electrodes provided for the capillaries are combined and attached to the load header 529. Moreover, all the hollow electrodes 526 are electrically connected to the high voltage power supply 504 installed in the apparatus and works as cathodes during electrophoresis, sample injection and the like where voltage should be applied.
(47) The capillary ends which are opposite to the capillary cathode ends 527 (the other ends) are combined with the capillary head. The capillary head can be connected to a block 507 in a pressure resistant, airtight manner. A syringe 506 fills the capillaries with a fresh polymer from the other ends. The polymer in the capillaries is changed for every measurement to improve the measurement property.
(48) The pumping mechanism 503 is composed of the syringe 506 and a mechanism for applying pressure to the syringe. The block 507 is a connection unit for connecting the syringe 506, the capillary array 517, an anode buffer vessel 510 and a polymer vessel 509.
(49) An optical detection unit is composed of the light source 514 for irradiating the detection unit 516, the optical detector 515 for detecting light emitted in the detection unit 516 and the diffraction grating 532. When a sample in a capillary separated by electrophoresis is detected, the detection unit 516 of the capillary is irradiated with the light source 514, and light emitted from the detection unit 516 is separated with the diffraction grating 532 and detected with the optical detector 515.
(50) The thermostatic bath 518 is covered with a heat insulating material to keep the internal temperature of the thermostatic bath constant, and the temperature is controlled with a heating/cooling mechanism 520. A fan 519 circulates and agitates the air in the thermostatic bath, and the temperature of the capillary array 517 is kept uniform and constant at every position.
(51) The carrier 525 has three electric motors and linear actuators and is movable along three axes of top to bottom, left to right and depth directions. At least one or more vessels can be placed on a mobile stage 530 of the carrier 525. The mobile stage 530 has an electric grip 531, which can grip or release the vessels. Thus, a buffer vessel 521, a washing vessel 522, a waste fluid vessel 523 and a sample vessel 524 can be carried to the capillary cathode ends 527 according to the need. In this regard, an unnecessary vessel is stored in a certain storage place in the apparatus.
(52) The electrophoresis apparatus 105 is connected to the data analyzer 112 through a signal cable and used in this state. The operator can control the functions of the apparatus through the data analyzer 112 and give and receive data detected with the detector in the apparatus.
(53) A summary of the process flow of the genotype analyzer is explained by this Example, referring to
(54) First, an electrophoresis process is conducted for an actual sample to be analyzed in a step 601.
(55) Next, in a step 602, fluorescence intensities of respective fluorescent dyes are calculated from spectral waveform data obtained by electrophoresis.
(56) Then, in a step 603, peaks are detected from the waveforms of the fluorescence intensities.
(57) Next, in a step 604, the relation between the time and the DNA fragment length is determined by mapping the obtained peak times and information concerning known DNA fragment lengths of a size standard. This process is called size calling.
(58) After this, fluorescence spectra of a reference capillary are obtained in a step 605. Here, the reference capillary refers to the capillary in which electrophoresis of an allelic ladder, which will be described below, is performed.
(59) Next, in a step 606, fluorescence spectra of another capillary are calculated.
(60) Then, in a step 607, a matrix is calculated using the obtained fluorescence spectra.
(61) In a step 608, the fluorescence intensities of the respective fluorescent dyes are calculated using the obtained matrix.
(62) Then, in a step 609, STR analysis is conducted using the obtained fluorescence intensity data.
(63) In this regard, the step 610 in the flowchart shown in
(64) Details of the processes in the steps 601 to 609 are described below, referring to the drawings.
(65) <Explanation of the Step 601>
(66)
(67) Basic procedures of electrophoresis can be roughly classified into sample preparation (a step 701), the start of analysis (a step 702), filling of a migration medium (a step 703), preliminary electrophoresis (a step 704), sample injection (a step 705) and electrophoresis analysis (a step 706).
(68) The processes in the steps are explained in detail below.
(69) Step 701: The operator of the apparatus sets samples and regents to the apparatus as the sample preparation (the step 701) before the start of the analysis. More specifically, the buffer vessel 521 and the anode buffer vessel 510 shown in
(70) Here, the samples which are set in the sample plate 524 are a positive control, a negative control and an allelic ladder as well as the actual sample of DNA to be analyzed, and electrophoresis of the samples is performed in different capillaries. An example of the positive control is PCR-amplified fragments of known DNA, and the positive control is a sample for the control experiment to confirm that DNA fragments have been amplified properly by PCR. The negative control is a PCR product which does not contain DNA and is a sample for the control experiment to confirm that PCR-amplified fragments are not contaminated with DNA of the operator, dust or the like.
(71) The allelic ladder is an artificial sample containing many alleles which may be generally contained in DNA markers and is usually provided by reagent manufacturers as a reagent kit for DNA typing. The allelic ladder is used for the purpose of fine adjustment of the relation between the DNA fragment length and the allele of each DNA marker. The allelic ladder will be described below.
(72) The actual sample, the positive control, the negative control and the allelic ladder are all mixed with known DNA fragments labeled with a fluorescent dye, which are called a size standard. The kind of fluorescent dye assigned to the size standard varies with the reagent kit used. For example, in the size standard reagent shown in
(73) The operator specifies the kind of allelic ladder, the kind of size standard, the kinds of fluorescent reagent, the kinds of sample set in the respective capillaries (in the wells of the sample plate 524 corresponding to the respective capillaries) and the like. In this Example, the kind of one of the actual sample, the positive control, the negative control and the allelic ladder is specified as a kind of sample. The information is set in the sample information setting unit 106 on the data analyzer 112 through the user interface unit 103.
(74) Step 702: Then, after the completion of the sample preparation (the step 701), the operator operates the user interface unit 103 on the data analyzer 112 shown in
(75) Step 703: Next, filling of the migration medium (the step 703) is started in the electrophoresis apparatus 105 shown in
(76) In the process of filling the migration medium (the step 703) in Example 1, the waste fluid vessel 523 is first moved to right under the load header 529 with the carrier 525 shown in
(77) Step 704: Next, preliminary electrophoresis (the step 704) is performed. This step may be conducted automatically or conducted serially by sending a control signal from the electrophoresis apparatus control unit 108. The preliminary electrophoresis is a procedure of applying a certain level of voltage to the migration medium to make the migration medium suitable for electrophoresis. In the preliminary electrophoresis process (the step 704) in this Example, the capillary cathode ends 527 are first dipped into the buffer solution in the buffer vessel 521 with the carrier 525 to form current paths. Then, voltage of about several to several dozen kilo volts is applied to the migration medium with the high voltage power supply 504 for several to several dozen minutes to make the migration medium suitable for electrophoresis. Finally, the capillary cathode ends 527 are dipped into the washing solution in the washing vessel 522 to wash the capillary cathode ends 527 polluted by the buffer solution.
(78) Step 705: Next, sample injection (the step 705) is performed. This step may be conducted automatically or conducted serially by sending a control signal from the electrophoresis apparatus control unit 108. In the sample injection process (the step 705), the sample components are injected into the migration paths. In the sample injection process (the step 705) in this Example, the capillary cathode ends 527 are first dipped into the samples contained in the wells of the sample plate 524 with the carrier 525. This forms current paths, which enable the injection of the sample components to the migration paths. Then, pulse voltage is applied to the current paths with the high voltage power supply 504 to inject the sample components into the migration paths. Finally, the capillary cathode ends 527 are dipped into the washing solution in the washing vessel 522 to wash the capillary cathode ends 527 polluted by the samples.
(79) Step 706: Next, electrophoresis analysis (the step 706) is performed. This step may be conducted automatically or conducted serially by sending a control signal from the electrophoresis apparatus control unit 108. In the electrophoresis analysis process (the step 706), the sample components contained in the samples are separated by electrophoresis and analyzed. In the electrophoresis analysis process (the step 706) in this Example, the capillary cathode ends 527 are first dipped in the buffer solution in the buffer vessel 521 with the carrier 525 to form current paths. Next, high voltage of around 15 kV is applied to the current paths with the high voltage power supply 504 to generate an electric field through the migration paths. Due to the generated electric field, the sample components in the migration paths travel towards the detection unit 516 at speeds depending on the properties of the respective sample components. That is, the sample components are separated by the differences in the migration speed. The sample components are detected in the order of arrival at the detection unit 516. For example, when a sample contains a lot of DNA fragments with different nucleotide lengths, differences in the migration speed are caused by the nucleotide lengths, and the DNA fragments reach the detection unit 516 in the order of the nucleotide length. A fluorescent dye depending on the nucleotide sequence at the end is attached to each DNA fragment. When the detection unit 516 is irradiated with excitation light from the light source 514, information light (fluorescence having a wavelength depending on the sample) is generated from a sample and emitted. The information light is detected with the optical detector 515. An example of images detected with the optical detector 515 is shown in
(80) In this Example, it is supposed that, from the image data, only the brightness value data at 20 wavelengths ((0) to (19)) of each capillary are sent to the data analyzer 112, as described in the explanation of
(81) Step 707: Finally, the application of the voltage is stopped after obtaining the planned image data to complete the electrophoresis analysis (a step 707).
(82) The above processes are example processes of the electrophoresis process (the step 601) of
(83) <Explanation of the Step 602>
(84) Next, an example of the process of calculating of the fluorescent dye intensities in
(85) From the image data obtained in the electrophoresis process (the step 601), the intensities of the respective fluorescent dyes are calculated (the step 602). The fluorescence intensity calculation process is conducted in the fluorescence intensity calculation unit 110 shown in
(86) Here, the vector C is a fluorescence intensity vector, and the elements C.sub.F, C.sub.V, C.sub.N, C.sub.P and C.sub.L are the fluorescence intensities of 6FAM, VIC, NED, PET and LIZ, respectively.
(87) The vector f is a measured spectrum vector, and the elements f.sub.0 to f.sub.19 are the signal intensities (brightness values) at the wavelengths (0) to (19), respectively. Alternatively, the elements f.sub.0 to f.sub.19 may be the averages of the signal intensities near the wavelengths (0) to (19), respectively. In this regard, the measured signal at each of the wavelengths (0) to (19) detected with the optical detector 515 contains Raman-scattered light from the polymer filling the capillary as a baseline signal, in addition to the signal from a fluorescent dye. Thus, when the vector f is calculated, the baseline signal should be removed in advance (see
(88) In an example method for removing the baseline signal, the spectrum of the Raman-scattered light is measured in advance before shipping the apparatus and is stored in the storage unit 104 as the baseline signal. Then, by subtracting the baseline signal from the measured signal at a time, the signal of a fluorescent dye may be determined, and this signal may be used as the measured spectrum vector f. As shown in
(89) The matrix M is a matrix converting the measured spectrum f into the fluorescence intensity vector, and the elements correspond to the intensity ratios of the respective fluorescent dyes at the respective wavelengths. For example, the element W.sub.F0 in the matrix M in Formula 1 is the ratio of the fluorescence intensity of the fluorescent dye 6FAM at the wavelength (0). A higher value means that the contribution of the fluorescent dye to the intensity at the wavelength is higher.
(90)
(91) Originally, the matrix M is determined exclusively by the kinds of fluorescent dye and the conditions of the migration path, but in practice, the matrix M may change depending on the positional relation between the capillary and the detector and must be thus calculated when the capillary is changed or the like. The series of processes for determining the matrix M is spectral calibration.
(92) In this Example, it is supposed that the initial value of the matrix is obtained in advance through measurement. It is also supposed that the fluorescence spectra for obtaining the initial matrix, which will be described below, are similarly obtained in advance. Moreover, in this Example, it is supposed that the initial matrix determined in advance through measurement and the fluorescence spectra (described below) for obtaining the initial matrix are stored in the storage unit 104 for each kind or product of fluorescent reagent for the genotype analyzer. Alternatively, the matrix obtained in the previous spectral calibration and the fluorescence spectra thereof may be stored in the storage unit 104 as the initial values. However, because the matrix can be derived from the fluorescence spectra and the fluorescence spectra can be derived from the matrix as described below, only information about either of them may be stored.
(93) When the kinds of fluorescent reagent are set in the sample information setting unit 106 during the sample preparation (the step 701) of the electrophoresis process in
(94) When the matrix obtained in the previous spectral calibration is stored in the storage unit 104 and the kinds of fluorescent reagent are same as the kinds of fluorescent reagent used this time, the previous matrix may be used as the initial value.
(95) Using the initial value of the matrix M, the fluorescence intensities of the respective fluorescent dyes are calculated from measured spectra according to (Formula 1). By processing the spectra of a capillary at the respective times in the manner, time series data of the fluorescence intensities of the capillary can be obtained. The time series data of the fluorescence intensities are referred to as fluorescence intensity waveforms below.
(96) Examples of the change in the fluorescence intensity with time obtained in the step 602 after electrophoresis (the step 601) are shown in
(97) <Explanation of the Step 603>
(98) Next, an example of the peak detection process in
(99) Peaks are detected (the step 603) from the fluorescence intensity waveforms obtained in the fluorescence intensity calculation process (the step 602) in
(100) The concept of Gaussian fitting is shown in
(101) The peak parameters are determined for all of the fluorescence intensity waveforms of the fluorescent dyes in this manner. Here, when a peak width or a peak height does not meet the threshold condition which is determined in advance, the peak may be excluded.
(102) <Explanation of the Step 604>
(103) Next, an example of the size calling process in
(104) Size calling is a process of matching the period required for detecting a DNA fragment by electrophoresis to the DNA fragment length, and in this Example, size calling is performed by the STR analysis unit 111 in the data analyzer 112. Specifically, as described above, a reagent containing DNA fragments which have known lengths and are labeled with a specific fluorescent dye (here, the reagent is called a size standard) is subjected to electrophoresis. For example, with respect to the size standard reagent shown in
(105)
(106) <Explanation of the Step 605>
(107) Next, an example of the process of obtaining the reference fluorescence spectra in
(108) The reference fluorescence spectra are fluorescence spectra serving as standards to determine the fluorescence spectra of another capillary by shifting the reference fluorescence spectra. In this specification, the capillary in which electrophoresis of the allelic ladder is carried out is called the reference capillary, and the fluorescence spectra of the reference capillary are called the reference fluorescence spectra. In this Example, this process is conducted by the matrix calculation unit 109 of the data analyzer 112 in
(109) When the acquisition fails in the reference fluorescence spectrum process, the initial fluorescence spectra of the reference capillary (on the supposition that the initial fluorescence spectra are stored in the storage unit 104) may be used as the reference fluorescence spectra. Alternatively, STR analysis may be conducted using the fluorescence intensity waveforms to which the initial matrix is applied through the path 610.
(110) The acquisition process of the reference fluorescence spectra (the step 605) is explained using
(111)
(112) As shown in the figure, in this Example, a time at which a peak of only one fluorescent dye is observed (referred to as a single-color peak time below) is extracted from the five fluorescence intensity waveforms of the reference capillary (a step 1401). The spectrum at the single-color peak time is obtained (a step 1402) and used as the fluorescence spectrum of the color.
(113) The detection process of the single-color peak times in the step 1401 is explained below.
(114) In
(115) The single-color peak times can be determined depending on the kinds of allelic ladder and size standard set in the electrophoresis apparatus. In other words, information about the DNA fragment lengths contained in the reagent is always disclosed for the allelic ladder and the size standard, and the information is used for STR analysis, which will be described below. The known DNA fragment lengths of the size standard are used for obtaining the relational expression of the electrophoresis time and the DNA fragment length in the size calling process, as already described. The fluorescence intensity waveforms of the allelic ladder are as shown in
(116) In the figure, names of loci (Locus) labeled with fluorescent dyes (Dye), names of alleles contained in the loci (Allele), the DNA fragment lengths corresponding to the alleles (Length) and acceptable widths of the nucleotide length from the centers of the alleles (Min/Max) are shown as the information about the allelic ladder. For example, in the figure, DNA marker (locus) D10S1248 is labeled with 6FAM and contains alleles 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 and 18, and their DNA fragment lengths (unit is bp) are 77, 81, 85, 89, 93, 97, 101, 109, 113 and 117, respectively. It is shown that all of the alleles have an acceptable width of +0.5 bp to 0.5 bp. Each of the alleles corresponds to one peak in the fluorescence intensity waveforms. Although only an example of the information about DNA marker D10S1248 is shown in
(117) An example for extracting the single-color peak times using the information concerning the DNA fragment lengths attached to the allelic ladder is shown in
(118) When the single-color peak times are not obtained, STR analysis may be conducted using the fluorescence intensity waveforms to which the initial matrix is applied through the path 610.
(119) Next, the acquisition process of the reference fluorescence spectra (the step 1402) in
(120) As already described in the explanation of
(121) <Explanation of the Step 606>
(122) Next, an example of the fluorescence spectrum calculation process in
(123) This process is a process of determining the fluorescence spectra of a capillary other than the reference capillary based on the reference fluorescence spectra obtained in the step 605. In this Example, this process is conducted using the fluorescence spectra of the size standard, which is subjected to electrophoresis in all the capillaries.
(124) The flow of the fluorescence spectrum calculation process (the step 606) is shown in
(125) Next, the fluorescence spectrum of LIZ at the single-color peak time of LIZ is obtained (a step 1702). As in the acquisition process of the reference fluorescence spectra (the step 1402) in
(126) Next, a shift amount of the fluorescence spectra of a capillary other than the reference capillary (referred to as the other capillary below) from the reference fluorescence spectra is calculated (a step 1703). The fluorescence spectra of the fluorescent dye labeling the size standard (LIZ in this Example) are used for calculating the shift amount. The calculation of the shift amount is explained below using
(127)
(128) The shift amount between a fluorescence spectrum 1801 of LIZ among the reference fluorescence spectra and a fluorescence spectrum 1901 in the other capillary is calculated (see
(129) Next, all the reference fluorescence spectra of the fluorescent dyes are shifted by to determine the fluorescence spectra of the other capillary (the step 1704 in
(130) The fluorescence spectra of the other capillary are obtained in the above manner. Such a process is conducted for all of the capillaries other than the reference capillary to determine the fluorescence spectra of the capillaries.
(131) <Explanation of the Step 607>
(132) Next, an example of the process for calculating the matrix in
(133) In the flow shown in
(134) Although an Example of the matrix calculation process for one capillary is described below, a similar process can be applied to the other capillaries.
(135) With respect to a capillary, the fluorescence spectra of the five kinds of fluorescent dye, namely 6FAM, VIC, NED, PET and LIZ, obtained in the step 606 are represented by vectors S.sub.F, S.sub.V, S.sub.N, S.sub.P and S.sub.L, respectively. Here, the number of dimensions of the vectors S.sub.F, S.sub.V, S.sub.N, S.sub.P and S.sub.L is 20. In other words, as described above, the elements of each vector are the signal intensities of the respective fluorescence spectra at the wavelengths (0) to (19). Moreover, a spectrum measured at a certain time is represented by a vector f, and the fluorescence intensity vector thereof is represented by C. The vector f and the vector C are similar to those in (Formula 1). That is, the elements C.sub.F, C.sub.V, C.sub.N, C.sub.P and C.sub.L, of the vector C indicate the fluorescence intensities of 6FAM, VIC, NED, PET and LIZ, respectively. The elements f.sub.0 to f.sub.19 of the vector f indicate the signal intensities (brightness values) at the wavelengths (0) to (19), respectively.
(136) Here, when a matrix S containing the fluorescence spectrum vectors S.sub.F, S.sub.V, S.sub.N, S.sub.P and S.sub.L as columns is referred to as a fluorescence spectrum matrix, the relational expression of (Formula 2) is derived. By comparing (Formula 1) and (Formula 2), it is seen that the matrix M corresponds to the inverse of the fluorescence spectrum matrix S. However, since the fluorescence spectrum matrix S is a non-square matrix (205 in (Formula 2)), the inverse thereof cannot be derived in this form. This corresponds to a situation that the conditions are excessive because the number of conditional equations is 20 while the number of unknown quantities is 5 (C.sub.F, C.sub.V, C.sub.N, C.sub.P and C.sub.L) and thus the unknown quantities satisfying all the conditional equations cannot be obtained.
(137) Thus, it is known that the matrix M can be derived from a matrix (S.sup.tS).sup.1S.sup.t shown in (Formula 3), not from the inverse of the fluorescence spectrum matrix S. The matrix (S.sup.tS).sup.1S.sup.t is called a pseudoinverse matrix. Because the point that (Formula 3) is equivalent to the calculation of the five unknown quantities (C.sub.F, C.sub.V, C.sub.N, C.sub.P and C.sub.L) by the least square method is derived, the pseudoinverse matrix (S.sup.tS).sup.1S.sup.t is also called a least-square-type generalized inverse matrix.
(138)
(139) Accordingly, in the matrix calculation process of the step 607, the matrix M can be obtained by calculating the pseudoinverse matrix of (Formula 3) of the fluorescence spectrum matrix S obtained from the fluorescence spectra obtained in the step 606. Such matrices are determined for all the capillaries and stored.
(140) In this regard, because (Formula 4) is derived from the relation of (Formula 1) and (Formula 2) like (Formula 3), the information concerning the fluorescence spectra can easily be calculated from the matrix information.
(141) In the matrix calculation process in the step 607, the reliability of the matrix M is evaluated, and when the reliability is determined to be low, STR analysis may be conducted using the fluorescence intensity waveforms to which the initial matrix is applied through the path 610 in
[Math. 5]
ConditionNumber=(S.sup.tS).Math.(S.sup.tS).sup.1(Formula 5)
(142) With respect to the definition of the norm, there are L1 norm (the sum of the absolute values of the matrix elements), L2 norm (the sum of the squares of the matrix elements) and the like. The condition number represented by (Formula 5) indicates whether the conditions for deriving the inverse through numerical analysis are good or bad, and a larger condition number indicates that the accuracy of the derived inverse is lower. In other words, since the accuracy of the inverse of (S.sup.tS) used for deriving the matrix M by (Formula 3) is low, the reliability of the matrix M derived from the inverse is thus low. That the condition number derived by (Formula 5) is large means that the degree of the overlap of the fluorescence spectra in the fluorescence spectrum matrix S is considerable for some reason such as noise of the detection system or the deterioration of the migration medium and the reliability of the measured fluorescence spectrum matrix S is low.
(143) <Explanation of the Step 608>
(144) Next, an example of the fluorescence intensity calculation process in
(145) In the flow shown in
(146) <Explanation of the Step 609>
(147) Next, an example of the STR analysis process in
(148) In the flow shown in
(149)
(150) Next, the process of allele calling (the step 2003) is explained. Allele calling is a process of fine adjustment of the relation of an allele (corresponding to the number of repeating units of STR, as described above) and the DNA fragment length. In this Example, by selecting the kind of reagent used as the allelic ladder, the information about all the loci contained in the allelic ladder (see
(151) In summary, in the STR analysis process (the step 609), peaks are detected from the fluorescence intensity waveforms of an actual sample (the step 2001), the relational expression of the peak time and the DNA fragment length is derived from the peak times of the size standard among the peaks and the information about the known DNA fragment lengths (the step 2002), the relations between the allele and the DNA fragment length are finely adjusted by determining the DNA fragment lengths of the known alleles in the allelic ladder, and the alleles are determined from the peaks of the fluorescence intensities of the actual sample based on the relations (the step 2003).
(152) The alleles for all the loci are determined in this manner, and the combination pattern of the alleles is the information concerning the genotypes to identify an individual.
(153) As described above, in Example 1, the reference fluorescence spectra are obtained using the size standard and the allelic ladder, the shift amount from the reference fluorescence spectra is determined referring to the fluorescence spectra of the size standard, the fluorescence spectra of all the capillaries are calculated by shifting the reference fluorescence spectra by the shift amount, and the matrices M are calculated from the fluorescence spectra. The fluorescence intensity waveforms of the respective fluorescent dyes can be obtained from the measured spectra using the matrices. Since by such a method, it is possible to conduct spectral calibration simultaneously with electrophoresis of the actual sample without conducting electrophoresis using a special matrix standard and obtain the fluorescence intensity waveforms, the time and costs required for spectral calibration can be saved.
Example 2
(154) The genotype analyzer according to Example 2 of the invention is explained.
(155) In the genotype analyzer according to Example 1, using the capillary in which electrophoresis of the allelic ladder was carried out as the reference capillary, the single-color peak times of the allelic ladder and the size standard of the reference capillary were extracted, and the spectra at the times were used as the reference fluorescence spectra. Then, the reference fluorescence spectra were shifted, and thus the fluorescence spectra of all the capillaries other than the reference capillary were determined to calculate the matrices M for all the capillaries.
(156) As described above, however, ideally, the fluorescence spectrum of a fluorescent dye can be determined uniquely from the kind of fluorescent dye, the wavelength of the excitation light for the fluorescent dye and the properties of the migration medium of the capillary, irrespective of the apparatus.
(157) Thus, Example 2 of the invention is characterized in that the reference fluorescence spectra are determined by measuring in advance the fluorescence spectra, which are determined by the kinds of fluorescent dye, the wavelengths of the excitation light, the properties of the migration medium and the like as described above, comparing with the fluorescence spectra and correcting, instead of determining the reference fluorescence spectra only from the single-color peak times of the allelic ladder. The unique fluorescence spectra, which are measured in advance, are referred to as ideal fluorescence spectra below. The ideal fluorescence spectrum waveforms are either measured by the manufacturer of the apparatus in advance before shipping the genotype analyzer or measured by a service engineer or the like when the apparatus is installed and stored in the apparatus. Details of Example 2 of the invention are explained below, referring to the drawings.
(158) The constitution of the genotype analyzer according to Example 2 is similar to the constitution of Example 1 shown in
(159)
(160) The processes of single-color peak time detection (a step 2101) and acquisition of fluorescence spectra (a step 2102) in the figure are similar to the single-color peak time detection (the step 1401) and the acquisition process of the reference fluorescence spectra (the step 1402) in the acquisition process of the reference fluorescence spectra (the step 605) shown in
(161) Next, in the acquisition process of the ideal fluorescence spectra in
(162) Next, in the correction process in
(163) Referring to
(164) After the alignment in the wavelength direction, a difference spectrum is derived by subtracting the ideal fluorescence spectrum 2107 from the reference fluorescence spectrum 2106 (2202). The figure shows the process of clipping the value of the difference spectrum (2202) with a maximum difference (DMAX) and a minimum difference (DMIN) which are set in advance. In the figure, DMAX and DMIN are the maximum and minimum of the acceptable difference based on the offset value in the figure, respectively, where DMAX is the maximum of the plus difference and DMIN is the minimum of the minus difference. In the figure, the offset is a value indicating the general difference in the signal intensity of the reference fluorescence spectrum 2106 relative to the ideal fluorescence spectrum 2107, and the average of the difference spectrum may be used as the offset, for example. Based on the offset, the area of the difference spectrum is set as [Offset+DMIN, Offset+DMAX]. Data among the difference spectrum data which are outside the area are clipped by the maximum or the minimum of the area.
(165) Thus, in the correction process of the reference fluorescence spectra (the step 2104) of Example 2, by determining an acceptable width of the difference from the ideal fluorescence spectrum in advance and conducting the clipping process, it is possible to reduce the noise of the spectra caused during the measurement. In this regard, the correction process above is an example, and the correction process is not limited to the example. As long as the reference fluorescence spectra are corrected based on the comparison with the ideal fluorescence spectra, various processes can be employed.
(166) Although the above example of the correction process of the reference fluorescence spectra (the step 2104) is an example in which the difference from the ideal fluorescence spectrum is clipped, when the reliability of a measured reference fluorescence spectrum is determined to be low, the reference fluorescence spectrum may be replaced with the ideal fluorescence spectrum in the correction process (the step 2104).
(167) An example of such a process is explained using
(168) Accordingly, when the Q value is larger than the threshold as a result of the comparison in the step 2301, it is determined that the reliability of the reference fluorescence spectrum is low, and thus the reference fluorescence spectrum is replaced with the ideal fluorescence spectrum (a step 2303) without using the measured reference fluorescence spectrum. However, according to the need, the ideal fluorescence spectrum may be aligned in the wavelength direction with the reference fluorescence spectrum, as shown in
(169) On the other hand, when the Q value is smaller than the threshold as a result of the comparison in the step 2301, it is determined that the reliability of the reference fluorescence spectrum is high, and the correction process of the reference fluorescence spectrum described in the explanation of
(170) In this regard, the replacement of a reference fluorescence spectrum with the ideal fluorescence spectrum (a step 2302) described above can also be applied to genotype analysis using no allelic ladder. In other words, as long as the application of the genotype analysis does not require high nucleotide resolution in STR analysis (for example, a test of foods), the alleles can be identified without using the allelic ladder. In such a case, the reference fluorescence spectra cannot be obtained using the information of the allelic ladder, unlike in Example 1. In this case, using the ideal fluorescence spectra as the reference fluorescence spectra in the step 2302 to determine the shift amount for each capillary, the fluorescence spectra can be obtained.
(171) As described above, the genotype analyzer according to Example 2 of the invention is characterized in that the reference fluorescence spectra are determined by measuring in advance the ideal fluorescence spectra, which are determined by the kinds of fluorescent dye, the wavelengths of the excitation light, the properties of the migration medium and the like, comparing with the ideal fluorescence spectra and correcting, instead of determining the reference fluorescence spectra only from the single-color peak times of the allelic ladder, and thus spectral calibration is conducted. By correcting the reference fluorescence spectra through the comparison with the ideal fluorescence spectra, it is possible to reduce the influence of the noise and the like caused during the measurement during the spectral calibration.
Example 3
(172) The genotype analyzer according to Example 3 of the invention is explained.
(173) In the genotype analyzers according to Example 1 and Example 2, the capillary in which electrophoresis of the allelic ladder was performed was used as the reference capillary, and the reference fluorescence spectra of the reference capillary were determined. Then, the shift amounts between the reference fluorescence spectra and the spectra of the capillaries other than the reference capillary were determined, and the reference fluorescence spectra were shifted by the respective amounts to determine the fluorescence spectra of all of the capillaries other than the reference capillary, thereby calculating the matrices M of all the capillaries.
(174) However, in Example 1 and Example 2, since the fluorescence spectra of a capillary are determined using the shift from the reference fluorescence spectrum of the reference capillary as described above, it is not supposed that the shapes of the fluorescence spectra vary among the capillaries. In practice, due to some factors of the optical system or the measurement system, the shapes of the fluorescence spectra may vary slightly.
(175) Therefore, Example 3 of the invention is characterized in that the fluorescence spectra are determined using the spectra obtained by electrophoresis of an actual sample for a capillary, instead of the shift from the reference fluorescence spectrum.
(176) Details of the genotype analyzer according to Example 3 are explained below, referring to the drawings. The constitution of the genotype analyzer according to Example 3 is similar to the constitution shown in
(177)
(178) In Example 1, as shown in
(179) Similarly, in Example 3, the single-color peak times are detected from the waveforms of the fluorescence intensities obtained by electrophoresis of an actual sample to be measured in a capillary, and the spectra at the single-color peak times are used as the fluorescence spectra of the respective fluorescent dyes.
(180) However, at such a single-color peak time, in addition to a main peak of a fluorescent color, a false peak may be observed at the same peak position as that of the main peak, when the initial matrix is not appropriate (see
(181) Peaks of different fluorescent colors may appear at the same time in STR analysis. In such a case, it is not possible to determine that a smaller peak is the false peak of a larger peak. This is because, since the small peak is the true peak, a matrix which excludes the true peak may be calculated.
(182) Accordingly, it is desirable to carefully determine whether a peak is a false peak or a true peak. Example conditions (C1) to (C4) for determining that a peak is a false peak are shown below, referring to
(183) (C1): The difference between the peak time of the main peak (T1) and the peak time of the false peak (T2) is within a time difference which is determined in advance (namely, T1T2) (
(184) (C2): The peak height of the main peak (H1), the peak height of the false peak (H2) and the ratio thereof (H1/H2) are within ranges which are determined in advance (
(185) (C3): The number of patterns of the main peak and the false peak which satisfy the above conditions is a number which is determined in advance or larger (
(186) (C4): There is no individual main peak (
(187) And the like.
(188) The conditions (C3) and (C4) are believed to be useful for preventing the detection of a true peak as a false peak by mistake, because the probability that all the true peaks of different fluorescent colors are observed at the same times.
(189) In Example 3, patterns of false peaks satisfying the above conditions are searched from the time series fluorescence intensity waveforms, and the times of the false peaks are used as the single-color peak times to determine the fluorescence spectra.
(190) In this regard, among the two or more false peak times found above, it is desirable to select a time at which the fluorescent dyes other than the fluorescent colors of the main peak and the false peak do not emit light, to use as the single-color peak time (
(191) Although an example in which there is one false peak for one main peak is shown in
(192) With respect to electrophoresis of an actual sample, however, there is no guarantee that single-color peak times suitable for determining the fluorescence spectra can always be obtained, due to the characteristics of the genotypes contained in the sample, the excess or shortage of the DNA concentration of the sample and the like. In other words, for the reason that peak positions of different fluorescent dyes overlap or the width of a peak is large, a phenomenon that there is no false peak derived from only one fluorescent dye or a phenomenon that the peak intensities are too small or too large may arise. Moreover, when the initial matrix is appropriate, such false peaks are not observed.
(193) Therefore, Example 3 is characterized by determining whether or not desirable single-color peak times could be detected from an actual sample and then determining the fluorescence spectra using the spectra of the actual sample only when there are single-color peaks.
(194) The process flow of the genotype analyzer according to Example 3 is shown in
(195) Next, in a step 2502, the fluorescence intensities are calculated based on the spectrum data of the obtained electrophoresis results. This process is similar to the fluorescence intensity calculation process in Example 1 (the step 602 in
(196) Next, in a step 2503, a peak detection process is conducted with respect to the fluorescence intensity waveforms obtained in the step 2502. This process is similar to the peak detection process in Example 1 (the step 603 in
(197) Next, in a step 2504, a size calling process is conducted. This process is similar to the size calling process in Example 1 (the step 604 in
(198) Next, in a step 2505, the reference fluorescence spectra are obtained. This process is similar to the acquisition process of the reference fluorescence spectra in Example 1 (the step 605 in
(199) Next, in the step 2506a, the single-color peak times of a capillary containing an actual sample (a capillary other than the reference capillary) are detected, and it is determined whether or not the peaks are single-color peaks desirable for obtaining the fluorescence spectra, in other words, the presence or absence of a false peak is determined. Examples of the criteria for determining whether or not a single-color peak is desirable for obtaining a fluorescence spectrum are (1) the degree of overlap with another fluorescent dye, (2) the peak height and the like.
(200) The criterion for determination based on the degree of overlap with another fluorescent dye of (1) should be based on whether the fluorescence intensity of the other fluorescent dye at the peak time is sufficiently small. Moreover, it is desirable that the peak time is as far as possible from the peak time of the other fluorescent dye.
(201) Concerning the criterion for determination based on the peak height of (2), it is desirable that the intensity of the single-color peak is large enough. In addition, a peak which is too high and exceeds the dynamic range of the optical detection system should be excluded. Accordingly, the upper limit and the lower limit of an acceptable range of the intensity of the single-color peak should be determined in advance, and a time of a peak within the acceptable range should be used.
(202) When there is a single-color peak time which satisfies both of the criteria for determination of (1) and (2) in the determination process of the step 2506a (when there is a false peak), the spectrum at the peak time should be obtained and used as a fluorescence spectrum (a step 2507).
(203) When there is no single-color peak time which satisfies both of the criteria for determination of (1) and (2) in the determination process of the step 2506a (when there is no false peak), a step 2506b is conducted, and when the reference fluorescence spectra have been obtained in the step 2505 (when there are reference fluorescence spectra), a step 2508 is conducted, to conduct a process similar to the fluorescence spectrum calculation process described in the step 606 in
(204) When there is no single-color peak time which satisfies both of the criteria for determination of (1) and (2) in the determination process of the step 2506 (when there is no false peak), and when the reference fluorescence spectra have not been obtained in the step 2505 (when there is no reference fluorescence spectrum) or when the presence or absence of the reference fluorescence spectra does not matter, a path 2513 is taken, and STR analysis may be conducted with respect to the fluorescence intensity waveforms to which the initial matrix is applied without newly calculating the matrix M.
(205) Then, a matrix calculation process is conducted in a step 2510. This process is similar to the matrix calculation process in Example 1 (the step 607 in
(206) Then, a fluorescence intensity calculation process is carried out in a step 2511. This process is similar to the fluorescence intensity calculation process in Example 1 (the step 608 in
(207) Then, STR analysis is conducted in a step 2512. This process is similar to the STR analysis process in Example 1 (the step 609 in
(208) As described above, the genotype analyzer according to Example 3 of the invention is characterized in that spectral calibration is performed by detecting the single-color peaks using the spectra obtained by electrophoresis of an actual sample for each capillary and determining the fluorescence spectra based on the single-color peaks, instead of the shift from the reference fluorescence spectra. According to Example 3, by using excellent single-color peaks of an actual sample, it becomes possible to reflect the slight difference in the fluorescence spectrum waveforms among the capillaries and to perform spectral calibration with higher accuracy, as compared to Example 1 and Example 2.
(209) Moreover, with the genotype analyzer according to Example 3 of the invention, since the reference fluorescence spectra are not used, spectral calibration can be performed also for genotype analysis which does not use the allelic ladder, using the electrophoresis results of an actual sample.
Example 4
(210) The genotype analyzer according to Example 4 of the invention is explained.
(211) In Example 1 and Example 2, the reference fluorescence spectra were obtained using the reference capillary, and with respect to the other capillaries, the fluorescence spectra of the capillaries were calculated by shifting the reference fluorescence spectra. In Example 3, the single-color peak times were detected using the spectra of electrophoresis of an actual sample in each capillary to obtain the fluorescence spectra. However, also in Example 3, when desirable single-color peak times could not be detected, the fluorescence spectra were calculated by shifting the reference fluorescence spectra as in Example 1 or Example 2.
(212) In the Examples, the shift amount of the reference fluorescence spectra was calculated using the spectra of the size standard. That is, as shown in
(213) As a condition for calculating the shift amount using the size standard as described above, it was supposed that the single-color peak times could be detected from the fluorescence intensity waveforms of the size standard without any problem. In practice, however, the peak intensities may be outside the acceptable range due to the insufficiency of the concentration of the size standard in the reference capillary or the other capillaries or the like, or noise of another fluorescent dye may overlap the single-color peak times. In such cases, it is believed to be inappropriate to calculate the shift amount using the size standard.
(214) Thus, Example 4 is characterized by calculating the shift amount between capillaries using the peak patterns of Raman spectra of capillaries filled only with the migration medium before electrophoresis, in order to calculate the shift amount between the reference capillaries.
(215) The process of the genotype analyzer according to Example 4 is explained below, referring to the drawings. The constitution of the genotype analyzer according to Example 4 is similar to the constitution shown in
(216)
(217) The processes of steps 2601 to 2611 shown in
(218) Step 2601: In the genotype analyzer according to Example 4, Raman spectra are obtained before electrophoresis (the step 2601). When a Raman spectrum is obtained, the Raman-scattered light caused by irradiating a capillary filled with the electrophoresis medium with the light source 514 used for the actual electrophoresis process is detected with the optical detector 515 to obtain a picture image. The picture image is similar to the images shown in
(219) An example of the Raman spectrum of a capillary is shown in
(220) Step 2602: Therefore, the shift amount in the wavelength direction between the shapes of the Raman spectra of capillaries is calculated and is used as the shift amount of the reference fluorescence spectra.
(221) As the method for calculating the shift amount, means similar to the calculation process of the shift amount from the reference capillary in Example 1 (the step 1703 in
(222) Gaussian fitting described above may be applied to the Raman spectra. That is, because the wavelength of the light source and the migration medium are both known for each Raman spectrum, the number of peaks of the Raman spectrum, the peak wavelengths, the ratios of the signal intensities and the like can be determined in advance through measurement. Thus, by detecting the rough positions of the peaks though threshold processing and the like and by applying Gaussian fitting described above (see
(223) Step 2603: Next, electrophoresis of an actual sample is performed. This process is similar to the electrophoresis process in Example 1 (
(224) Step 2604: Next, the fluorescence intensities are calculated based on the spectrum data of the obtained electrophoresis results. This process is similar to the fluorescence intensity calculation process in Example 1 (the step 602 in
(225) Step 2605: Next, a peak detection process is conducted with respect to the fluorescence intensity waveforms obtained in the step 2604. This process is similar to the peak detection process in Example 1 (the step 603 in
(226) Step 2606: Next, a size calling process is conducted. This process is similar to the size calling process in Example 1 (the step 604 in
(227) Step 2607: Next, the reference fluorescence spectra are obtained. This process is similar to the acquisition process of the reference fluorescence spectra in Example 1 (the step 605 in
(228) Step 2608: Next, the fluorescence spectra are calculated. For this process, a process similar to the fluorescence spectrum calculation process in Example 1 (the step 606 in
(229) Step 2609: Then, a matrix calculation process is conducted. This process is similar to the matrix calculation process in Example 1 (the step 607 in
(230) Step 2610: Then, a fluorescence intensity calculation process is carried out. This process is similar to the fluorescence intensity calculation process in Example 1 (the step 608 in
(231) Step 2611: Then, STR analysis is conducted. This process is similar to the STR analysis process in Example 1 (the step 609 in
(232) As described above, the genotype analyzer according to Example 4 of the invention is characterized by using the Raman spectra measured before electrophoresis, instead of the spectra of the size standard, for determining the shift amount from the reference fluorescence spectra. With this characteristic, even when the single-color peaks cannot be obtained for example because the peak values of the fluorescence intensities of the size standard are small or overlapped with a signal of another fluorescent dye, the fluorescence spectra of each capillary can be calculated by shifting from the reference fluorescence spectra using the shift amount determined from the Raman spectra.
(233) In this regard, when the Raman spectra are used instead of the spectra of the size standard in Example 4, the conditions for the spectra of the size standard serving as the conditions should be determined carefully. In other words, when the quality of the fluorescence intensity data of the size standard of a capillary is extremely low, size calling may not be proper, resulting in the deterioration of the accuracy of the STR analysis results themselves. It is believed that the STR analysis itself should be cancelled for a capillary in such a state. For this reason, when Example 4 is applied, conditions under which the Raman spectra should be used should be determined without deteriorating the quality of size calling. Alternatively, the shift amount may be calculated always by Example 4, independently of the quality of the size standard.
Example 5
(234) The genotype analyzer according to Example 5 of the invention is explained.
(235) In the genetic analyzer according to Example 3, the fluorescence spectra were determined using the spectra obtained by electrophoresis of an actual sample for each capillary. Here, times at which a false peak of a fluorescent color could be observed were searched from the fluorescence intensity waveforms obtained from the actual sample, and the fluorescence spectra were determined using the times as the single-color peak times.
(236) In Example 3, when the single-color peak times were not observed, the capillary in which electrophoresis of the allelic ladder was performed was used as the reference capillary, and the reference fluorescence spectra of the reference capillary were determined, as in the genotype analyzers according to Example 1 and Example 2. Then, the shift amounts between the reference fluorescence spectra and the spectra of the capillaries other than the reference capillary were determined, and the fluorescence spectra of all the capillaries other than the reference capillary were determined by shifting the reference fluorescence spectra by the respective amounts to calculate the matrices M of all the capillaries.
(237) However, as described in Example 3, because the fluorescence spectra of a capillary are determined using the shift from the reference fluorescence spectra of the reference capillary, it is not taken into account that the shapes of the fluorescence spectra vary among the capillaries. In practice, the shapes of the fluorescence spectra may differ slightly due to some factors of the optical system or the measurement system.
(238) Accordingly, when no false peak can be observed, an appropriate matrix may not be obtained from the fluorescence spectra derived by shifting the reference fluorescence spectra. First of all, the initial matrix may be appropriate when no false peak can be observed, and the method in Example 3 may result in a matrix which is less appropriate than the initial matrix.
(239) Thus, the genetic analyzer according to Example 5 is characterized by determining the fluorescence spectra only from the fluorescence intensity waveforms of each capillary without determining the reference fluorescence spectra of the allelic ladder, as a modification of Example 3.
(240) The process of the genotype analyzer according to Example 5 is explained below, referring to the drawings. The constitution of the genotype analyzer according to Example 5 is similar to the constitution shown in
(241)
(242) The processes of steps 3101 to 3107 shown in
(243) In the step 3101, electrophoresis of an actual sample is performed. This process is similar to the electrophoresis process in Example 1 (
(244) Next, in the step 3102, the fluorescence intensities are calculated. The means for calculating the fluorescence intensities is similar to that of Example 1. In this Example, the fluorescence intensities are calculated using the initial matrix as in Example 1, when the step 3102 is the first fluorescence intensity calculation. In other words, on the supposition that the initial matrix and the fluorescence spectra for obtaining the initial matrix have been obtained in advance through measurement, the fluorescence intensity waveform of each fluorescent color is calculated by (Formula 1) using the initial matrix.
(245) When the step 3102 is the second fluorescence intensity calculation or after that, that is, when the matrix calculation according to this Example has already been conducted, the fluorescence intensity waveform of each fluorescent color may be calculated by (Formula 1) using the matrix calculated previously (a step 3106, which will be explained below).
(246) Next, peaks are detected from the fluorescence intensity waveforms of the fluorescent colors in the step 3103. This process is similar to that of Example 1 (the step 603 in
(247) Next, the presence or absence of a false peak is determined in the step 3104. As shown in
(248) When no false peak is detected for a main peak of a fluorescent color, the fluorescence spectrum corresponding to the matrix used in the step 3102 is employed. That is, this process corresponds to the replacement of a column vector of the fluorescence spectrum matrix S of (Formula 2) corresponding to the initial matrix with the spectrum at the false peak time.
(249) When there is no false peak for a fluorescent color, it is determined that the fluorescence spectrum is appropriate, and the spectrum of the fluorescent color is not modified. When not false peak is detected for any of the fluorescent colors, the matrix used in the step 3102 is used as the final matrix, and STR analysis in the step 3107 is conducted. The contents of the STR analysis process in the step 3107 are similar to those of Example 1.
(250) When a false peak is detected for one or more fluorescent colors in the step 3104, the step 3105 is conducted to obtain the fluorescence spectra.
(251) The process of the step 3105 is similar to that of Example 3, and a time at which there is a false peak is used as the single-color peak time, and the measured spectrum at this time is used as the fluorescence spectrum of the fluorescent color in the step 3105.
(252) Then, a matrix is calculated by (Formula 3) from the obtained fluorescence spectra in the step 3106. This process is similar to that of Example 1. After this, the step 3104 is conducted again using the obtained matrix, followed by the fluorescence intensity calculation (the step 3102) and the peak detection (the step 3103), and the false peak determination (the step 3104) is conducted again. Such a set of processes is repeated until no false peak is detected anymore to update the matrix.
(253) When the false peaks are searched properly in the step 3104, a matrix with which no false peak is detected can be generally obtained after repeating the set of processes for certain times. However, when the false peaks are not searched properly, a false peak may still be detected even when the set of processes is repeated.
(254) For such a case, it is desirable to set an upper limit for the repetition number in practice, and when the repetition number reaches the upper limit, it is desirable to conduct a process of employing a matrix which gives the lowest level of false peaks, employing the initial matrix or the like.
(255) Moreover, when the false peaks are searched in the step 3104, the user interface unit 103 may provide means for changing the conditions for determining the false peak and calculating the matrix again. Examples of the items of the conditions for determining the false peak are the ratio of the height of the false peak to the height of the main peak described in Example 3, the maximums and the minimums of the heights of the main peak and the false peak, the difference between the main peak time and the false peak time and the repetition number of the matrix.
(256) An example of the display form is shown in
(257) Moreover, in
(258) As described above, the genotype analyzer according to Example 5 of the invention is characterized by performing spectral calibration by detecting the single-color peaks using the spectra obtained by electrophoresis of an actual sample for each capillary and by determining the fluorescence spectra based on the single-color peaks, instead of using the shift from the reference fluorescence spectra. Here, by searching the false peaks of the actual sample and repeatedly calculating a matrix with which no false peak is detected, a more appropriate matrix can be obtained.
(259) Best embodiments for carrying out the invention have been explained above. However, the invention is not limited to the Examples, and modifications are acceptable within the scope of gist of the invention. For example, an electrophoresis apparatus of a microchip type having sample flow paths formed inside thereof may be used. In this case, the flow paths correspond to the capillaries in this specification. Moreover, the invention can be applied also to an electrophoresis apparatus using a slab gel.
(260) In addition, the invention can be achieved by a program code of software achieving the functions of the Examples. In this case, a storage medium storing the program code is installed in a system or an apparatus, and the computer of the system or the apparatus (or CPU or MPU) reads the program code stored in the storage medium. In this case, the program code read from the storage medium itself achieves the functions of the Examples, and the program code itself and the storage medium storing the program code constitute the invention. As the storage medium for providing the program code, a flexible disk, a CD-ROM, a DVD-ROM, a hard disk, an optical disk, a magneto-optical disk, a CD-R, a magnetic tape, a nonvolatile memory card and ROM are used for example.
(261) Based on the instructions of the program code, the OS (operating system) working on the computer or the like may conduct a part or all of the actual processes, and the functions of the embodiments may be achieved by the processes. Moreover, the program code read from the storage medium may be transferred to a memory on the computer. Then the CPU of the computer or the like may conduct a part or all of the actual processes based on the instructions of the program code, and the functions of the embodiments may be achieved by the processes.
(262) Moreover, by distributing the program code of software achieving the functions of the embodiments via a network, the program code may be stored in storage means such as the hard disk or memory of a system or an apparatus or a storage medium such as a CD-RW and a CD-R, and the computer of the system or the apparatus (or CPU or MPU) may read the program code stored in the storage means or the storage medium for the use and carry out the program.
(263) Constituent features of the invention comprehended from the embodiments are as follows.
(264) (1) A method for genotypic analysis for calculating fluorescence intensities of DNA samples labeled with two or more fluorescent dyes based on spectra obtained by electrophoresis means which irradiates a flow path array containing two or more flow paths with excitation light, separates light emitted from the flow path array and images spectra on a light detector; characterized by determining reference fluorescence spectra of the fluorescent dyes based on the spectra obtained by the electrophoresis of the DNA samples and shifting the reference fluorescence spectra in the wavelength direction to determine the fluorescence spectra of the flow paths.
(265) (2) The method for genotypic analysis of (1) characterized by determining the reference fluorescence spectra by extracting single spectra of light emitted from the fluorescent dyes from the spectra of a sample containing DNA fragments with known lengths.
(266) (3) The method for genotypic analysis of (1) characterized in that ideal fluorescence spectra which are measured in advance are used as the reference fluorescence spectra.
(267) (4) The method for genotypic analysis of (2) characterized in that the reference fluorescence spectra are corrected referring to ideal fluorescence spectra which are measured in advance.
(268) (5) The method for genotypic analysis of (1) characterized by determining the reference fluorescence spectra by extracting single spectra of light emitted from the fluorescent dyes from spectra of an actual sample which is a subject of the genotype analysis.
(269) (6) The method for genotypic analysis of any one of (1) to (5) characterized by determining a shift amount of fluorescence spectra of flow paths in the wavelength direction by extracting single fluorescence spectra of the fluorescent dye labeling a size standard for the flow paths and shifting the reference fluorescence spectra by the shift amount.
(270) (7) The method for genotypic analysis of any one of (1) to (5) characterized by determining a shift amount of fluorescence spectra of flow paths in the wavelength direction by measuring Raman spectra of the flow paths and shifting the reference fluorescence spectra by the shift amount.
(271) (8) The method for genotypic analysis of (1) characterized by determining whether or not a single spectrum of light emitted from one fluorescent dye can be extracted from spectra of an actual sample which is a subject of the genotype analysis and determining the fluorescence spectra by shifting the reference fluorescence spectra when the determination is negative.
(272) (9) A device for genotypic analysis having: a light source applying excitation light; an electrophoresis apparatus having a flow path array containing two or more flow paths, a voltage source applying voltage to both ends of the flow path array and a light detector detecting light emitted from the flow path array; and a data analyzer sending information to and receiving information from the electrophoresis apparatus:
(273) characterized by introducing DNA samples containing DNA fragments labeled with two or more fluorescent dyes to the flow path array, irradiating the flow path array with the excitation light from the light source while moving the DNA fragments through the flow path array using the voltage source, separating light emitted from the flow path array by the irradiation and imaging spectra on the light detector, determining reference fluorescence spectra of the fluorescent dyes based on the imaged spectra in the data analyzer, and determining fluorescence spectra of the flow paths by shifting the reference fluorescence spectra in the wavelength direction.
(274) (10) The device for genotypic analysis of (9) characterized by determining the reference fluorescence spectra and determining the fluorescence spectra of the flow paths by shifting the reference fluorescence spectra in the wavelength direction.
(275) (11) The device for genotypic analysis of (9) characterized by determining the reference fluorescence spectra by extracting single spectra of light emitted from the fluorescent dyes from spectra of a sample containing DNA fragments with known lengths.
(276) (12) The device for genotypic analysis of (9) characterized in that ideal fluorescence spectra which are measured in advance are used as the reference fluorescence spectra.
(277) (13) The device for genotypic analysis of (10) characterized in that the reference fluorescence spectra are corrected referring to ideal fluorescence spectra which are measured in advance.
(278) (14) The device for genotypic analysis of (9) characterized by determining the reference fluorescence spectra by extracting single spectra of light emitted from the fluorescent dyes from spectra of an actual sample which is a subject of the genotype analysis.
(279) (15) The device for genotypic analysis of any one of (9) to (14) characterized by determining a shift amount of fluorescence spectra of flow paths in the wavelength direction by extracting single fluorescence spectra of the fluorescent dye labeling a size standard for the flow paths and shifting the reference fluorescence spectra by the shift amount.
(280) (16) The device for genotypic analysis of any one of (9) to (14) characterized by determining a shift amount of fluorescence spectra of flow paths in the wavelength direction by measuring Raman spectra of the flow paths and shifting the reference fluorescence spectra by the shift amount.
REFERENCE SIGNS LIST
(281) 101: genotype analyzer (or device for genotypic analysis) 102: Central control unit 103: User interface unit 104: Storage unit 105: Electrophoresis apparatus 106: Sample information setting unit 107: Peak detection unit 108: Electrophoresis apparatus control unit 109: Matrix calculation unit 110: Fluorescence intensity calculation unit 111: STR analysis unit 112: Data analyzer