METHOD FOR PROVIDING A DNA-ENCODED LIBRARY, DNA-ENCODED LIBRARY AND METHOD OF DECODING A DNA-ENCODED LIBRARY
20200040326 ยท 2020-02-06
Assignee
Inventors
- Yixin ZHANG (Dresden, DE)
- Francesco Reddavide (Dresden, DE)
- Meiying CUI (Dresden, DE)
- Helena ANDRADE (Dresden, DE)
- Stephan HEIDEN (Dresden, DE)
Cpc classification
C12N15/1065
CHEMISTRY; METALLURGY
C12N15/1093
CHEMISTRY; METALLURGY
C40B70/00
CHEMISTRY; METALLURGY
C12N15/1093
CHEMISTRY; METALLURGY
C12N15/1089
CHEMISTRY; METALLURGY
International classification
C12N15/10
CHEMISTRY; METALLURGY
C40B70/00
CHEMISTRY; METALLURGY
Abstract
Disclosed are a method for providing a DNA-encoding library, the DNA-encoding library and a method of decoding a DNA-encoded library. Many different DNA molecules are synthesized which differ from each other in DNA barcode sequences. Each DNA molecule is bonded to a specific substance forming different DNA-substance conjugates. The DNA-encoded library has the advantage that, for example after an enrichment experiment performed with the library, the library may be decoded in a faster and less expensive manner than known DNA-encoded libraries.
Claims
1-15. (canceled)
16. A method for providing a DNA-encoded library, comprising a) synthesizing many different DNA molecules which differ from each other by comprising different DNA barcode sequences, wherein each DNA barcode sequence comprises at least a first coding region DNA sequence comprising at least a first part, a second part and a third part, wherein the second part is located between the first and third part and the second part differs between all the DNA molecules by at least two nucleotides; and b) bonding each of the many different DNA molecules to at least a specific substance forming different DNA-substance conjugates, wherein the DNA-substance conjugates differ from each other by the specific substance and by their DNA molecules; wherein the first part and the third part encode information regarding the second part of the first coding region, wherein a certain first part and/or a certain third part uniquely codes for a certain group of DNA-substance conjugates which is smaller than the group of all DNA-substance conjugates in the DNA-encoded library.
17. The method according to claim 16, wherein i) the first coding region DNA sequence comprises at least a fourth part, wherein the second part is located between the fourth and third part and wherein both the combination of the first part and the fourth part and the combination of the first part and the third part of the first coding region encode information about the second part of the first coding region; and ii) each barcode sequence comprises at least a second coding region DNA sequence comprising at least a first part, a second part, a third part, and a fourth part, wherein the second part is located between the fourth and third part and the second part differs between all the DNA molecules by at least two nucleotides, wherein both the combination of the first part and the fourth part and the combination of the first part and the third part of the second coding region encode information about the second part of the second coding region; wherein a certain combination of a first part and fourth part in a certain coding region uniquely codes for a certain group of DNA-substance conjugates which is smaller than the group of all DNA-substance conjugates which is encoded by the first part alone.
18. The method according to claim 16, wherein i) each barcode sequence comprises at least a second coding region DNA sequence comprising at least a first part, a second part, a third part, and a fourth part, wherein the second part is located between the fourth and third part and the second part differs between all the DNA molecules by at least two nucleotides, wherein both the combination of the first part and the fourth part and the combination of the first part and the third part of the second coding region encode information about the second part of the second coding region; and ii) each barcode sequence comprises at least a third coding region DNA sequence comprising at least a first part, a second part, a third part, and a fourth part, wherein the second part is located between the fourth and third part and the second part differs between all the DNA molecules by at least two nucleotides, wherein both the combination of the first part and the fourth part and the combination of the first part and third part and the of the third coding region encode information about the second part of the third coding region; wherein a certain combination of a first part and fourth part in a certain coding region uniquely codes for a certain group of DNA-substance conjugates which is smaller than the group of DNA-substance conjugates which is encoded by the first part.
19. The method according to claim 16, wherein at least one coding region DNA sequence comprises at least a first part, a second part, a third part, a fourth part and a fifth part, wherein the second part is located between the fourth and fifth part and the second part differs between all the DNA molecules by at least two nucleotides, wherein the combination of the first part and the fourth part and the combination of the fifth part and the third part of the coding region encode information about the second part of the coding region, preferably of all coding regions, wherein a certain combination of a first part and fourth part uniquely codes for a certain group of DNA-substance conjugates which is smaller than the group of DNA-substance conjugates which is encoded by the first part alone, and wherein a certain combination of a fifth part and third part uniquely codes for a certain group of DNA-substance conjugates which is smaller than the group of DNA-substance conjugates which is encoded by the third part alone.
20. A DNA-encoded library, comprising many different DNA-ligand conjugates, wherein the DNA-ligand conjugates differ from each other by their ligand and by their DNA molecules, wherein the DNA molecules of the DNA-ligand conjugates differ from each other by comprising different DNA barcode sequences, wherein each DNA barcode sequence comprises at least a first coding region DNA sequence comprising at least a first part, a second part and a third part, wherein the second part is located between the first and third part and the second part differs between all the DNA molecules by at least two nucleotides; wherein the first part and the third part encode information regarding the second part of the first coding region, wherein a certain first part and/or a certain third part uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of all DNA-ligand conjugates in the DNA-encoded library.
21. The DNA-encoded library according to claim 20, wherein i) the first coding region DNA sequence comprises at least a fourth part, wherein the second part is located between the fourth and third part and wherein both the combination of the first part and the fourth part and the combination of the first part and the third part of the first coding region encode information about the second part of the first coding region; and ii) each barcode sequence comprises at least a second coding region DNA sequence comprising at least a first part, a second part, a third part, and a fourth part, wherein the second part is located between the fourth and third part and the second part differs between all the DNA molecules by at least two nucleotides, wherein both the combination of the first part and the fourth part and the combination of the first part and the third part of the second coding region encode information about the second part of the second coding region; wherein a certain combination of a first part and fourth part in a certain coding region uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of all DNA-ligand conjugates which is encoded by the first part alone.
22. The DNA-encoded library according to claim 21, wherein each barcode sequence comprises at least a third coding region DNA sequence, which is on the same DNA strand as the second coding region, comprising at least a first part, a second part, a third part, and a fourth part, wherein the second part is located between the fourth and third part and the second part differs between all the DNA molecules by at least two nucleotides, wherein both the combination of the first part and the fourth part and the combination of the first part and the third part and the of the third coding region encode information about the second part of the third coding region, wherein a certain combination of a first part and fourth part in the second coding region and in the third coding region uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of DNA-ligand conjugates which is encoded by the first part alone.
23. The DNA-encoded library according to claim 20, wherein at least one coding region DNA sequence comprises at least a first part, a second part, a third part, a fourth part and a fifth part, wherein the second part is located between the fourth and fifth part and the second part differs between all the DNA molecules by at least two nucleotides, wherein the combination of the first part and the fourth part and the combination of the fifth part and the third part of the coding region encode information about the second part of the coding region, and wherein a certain combination of a first part and fourth part uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of DNA-ligand conjugates which is encoded by the first part alone and wherein a certain combination of a fifth part and third part uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of DNA-ligand conjugates which is encoded by the third part alone.
24. A method of decoding a DNA-encoded library according to claim 20, comprising a) performing a qPCR with the DNA-encoded library, wherein the following primers are utilized: a primer A and a primer B for amplifying the first coding region of every DNA-ligand conjugate; and many different primers A-xN which anneal to the different first parts of the first coding region and many different primers B-yN which anneal to the different third parts of the first coding region, wherein primer A-xN has an identical length like the coding region primer A by shortening x nucleotides at its 5-end, primer B-yN has an identical length like the coding region primer B by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 2 to 6; b) calculating a mathematical product of the signal value of each primer A-xN and each primer B-xN by following equation:
Value (A-xN).sub.i=signal value [(A-xN).sub.i+B].Math.signal value [(A-xN).sub.i+(B-xN).sub.i];
and
Value (B-yN).sub.i=signal value [(B-yN).sub.i+A].Math.signal value [(B-yN).sub.i+(A-xn).sub.i], wherein i is an integer and defines a specific primer, and the +-sign indicates a combination of two primers; wherein signal value is the percentage of abundance related to the whole set of qPCR quantification using different primers annealed to the same region; and c) comparing the obtained mathematical products for each of the primers (A-xN).sub.i and (B-yN).sub.i, wherein those primers with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
25. The method according to claim 24, wherein the method comprises i) calculating a mathematical product of the value obtained for each primer A-xN and each primer B-yN by following equation Value (AB).sub.i=Value (A-xN).sub.i.Math.Value (B-yN).sub.i; ii) comparing the obtained mathematical products for each of the combination of primers (A-xN).sub.i and (B-yN).sub.i, wherein those primer combinations with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
26. The method according to claim 24, wherein the qPCR is performed with a DNA-encoded library as a template, wherein the DNA-encoded library comprises many different DNA-ligand conjugates, wherein the DNA-ligand conjugates differ from each other by their ligand and by their DNA molecules, wherein the DNA molecules of the DNA-ligand conjugates differ from each other by comprising different DNA barcode sequences, wherein each DNA barcode sequence comprises at least a first coding region DNA sequence comprising at least a first part, a second part and a third part, wherein the second part is located between the first and third part and the second part differs between all the DNA molecules by at least two nucleotides; wherein the first part and the third part encode information regarding the second part of the first coding region, wherein a certain first part and/or a certain third part uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of all DNA-ligand conjugates in the DNA-encoded library; the method comprising: i) performing a qPCR with the following primers: a first coding region primer A and a first coding region primer primer B for amplifying the first coding region of every DNA-ligand conjugate; and many different primers A-xN which anneal to the different first parts, or first and fourth parts of the first coding region and many different primers B-yN which anneal to the different third parts of the first coding region, wherein A-xN has an identical length like the coding region primer A by shortening x nucleotides at its 5-end, B-yN has an identical length like the coding region primer B by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 6 to 10, and y is an integer from 2 to 6; and a second coding region primer C and a second coding region primer D for amplifying the second coding region of every DNA-ligand conjugate; and many different primers D-yN which anneal to the different first parts, or first and fourth parts of the second coding region and many different primers C-xN which anneal to the different third parts of the second coding region, wherein primer C-xN has an identical length like the coding region primer C by shortening x nucleotides at its 5-end, primer D-yN has an identical length like the coding region primer D by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 6 to 10, and y is an integer from 2 to 6; ii) calculating a mathematical product of the signal value of each primer A-xN, each primer B-yN, each primer C-xN and each primer D-yN by following equation:
Value (A-xN).sub.i=signal value [(A-xN).sub.i+B].Math.signal value [(A-xN).sub.i+(B-xN).sub.i];
Value (B-yN).sub.i=signal value [(B-yN).sub.i+A].Math.signal value [(B-yN).sub.i+(A-xN).sub.i],
Value (C-xN).sub.i=signal value [(C-xN).sub.i+D].Math.signal value [(C-xN).sub.i+(D-xn).sub.i],
Value (D-yN).sub.i=signal value [(D-yN).sub.i+C].Math.signal value [(D-yN).sub.i+(C-xn).sub.i], wherein i is an integer and defines a specific primer, and the +-sign indicates a combination of two primers; wherein signal value is the percentage of abundance related to the whole set of qPCR quantification using different primers annealed to the same region; and iii) comparing the obtained mathematical products for each of the primers (A-xN).sub.i, (B-yN).sub.i, (C-xN).sub.i and (D-yN).sub.i, wherein those primers with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
27. The method according to claim 26, wherein the method comprises i) calculating a mathematical product of the value obtained for each primer A-xN and each primer B-yN, for each primer A-xN and each primer D-xN and for each primer C-yN and D-xN by following equation
Value (AB).sub.i=Value (A-xN).sub.i.Math.Value (B-yN).sub.i;
Value (AD).sub.i=Value (A-xN).sub.i.Math.Value (D-yN).sub.i;
Value (CD).sub.i=Value (C-xN).sub.i.Math.Value (D-yN).sub.i; ii) calculating the mathematical product of the Value (AB).sub.i, (AD).sub.i and (CD).sub.i for each primer i by the following equation Value.sup.i=value (AB).sub.i.Math.value (AD).sub.i.Math.value (CD).sub.i iii) comparing the obtained mathematical products Value.sup.i, wherein those primer combinations i with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
28. The method according to claim 24, wherein the qPCR is performed with a DNA-encoded library, wherein the DNA-encoded library comprises many different DNA-ligand conjugates, wherein the DNA-ligand conjugates differ from each other by their ligand and by their DNA molecules, wherein the DNA molecules of the DNA-ligand conjugates differ from each other by comprising different DNA barcode sequences, wherein each DNA barcode sequence comprises at least a first coding region DNA sequence comprising at least a first part, a second part and a third part, wherein the second part is located between the first and third part and the second part differs between all the DNA molecules by at least two nucleotides; wherein the first part and the third part encode information regarding the second part of the first coding region, wherein a certain first part and/or a certain third part uniquely codes for a certain group of DNA-ligand conjugates which is smaller than the group of all DNA-ligand conjugates in the DNA-encoded library; the method comprising: i) performing a qPCR with the following primers: a first coding region primer A and a first coding region primer B for amplifying the first coding region of every DNA-ligand conjugate; and many different primers A-xN which anneal to the different first parts, or first and fourth parts of the first coding region, and many different primers B-yN which anneal to the different third parts of the first coding region, wherein A-xN has an identical length like the coding region primer A by shortening x nucleotides at its 5-end, B has an identical length like the coding region primer B-yN by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 6 to 10, preferably 8, and y is an integer from 2 to 6, preferably 4; and a second coding region primer C and a second coding region primer D for amplifying the second coding region of every DNA-ligand conjugate; and many different primers D-yN which anneal to the different first parts, or first and fourth parts of the second coding region and many different primers C-xN which anneal to the different third parts of the second coding region, wherein primer C-xN has an identical length like the coding region primer C by shortening x nucleotides at its 5-end, primer D-yN has an identical length like the coding region primer D by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 6 to 10, and y is an integer from 2 to 6; a third coding region primer E and a third coding region primer F for amplifying the third coding region of every DNA-ligand conjugate; and many different primers E-xN which anneal to the different first parts of the third coding region and many different primers F-yN which anneal to the different third parts of the third coding region, wherein primer E-xN has an identical length like the coding region primer E by shortening x nucleotides at its 5-end, primer F-yN has an identical length like the coding region primer F by shortening y nucleotides at its 5-end, N represents a A, T, G or C and x and y represent the total number of any one of A, T, G or C at the 3-end of the primers, wherein x is an integer from 6 to 10, and y is an integer from 2 to 6; ii) calculating a mathematical product of the signal value of each primer A-xN, each primer B-yN, each primer C-xN, each primer D-yN, each primer E-xN and each primer F-yN by following equation:
Value (A-xN).sub.i=signal value [(A-xN).sub.i+B].Math.signal value [(A-xN).sub.i+(B-xN).sub.i];
Value (B-yN).sub.i=signal value [(B-yN).sub.i+A].Math.signal value [(B-yN).sub.i+(A-xN).sub.i],
Value (C-xN).sub.i=signal value [(C-xN).sub.i+D].Math.signal value [(C-xN).sub.i+(D-xN).sub.i],
Value (D-yN).sub.i=signal value [(D-yN).sub.i+C].Math.signal value [(D-yN).sub.i+(C-xN).sub.i],
Value (E-xN).sub.i=signal value [(E-xN).sub.i+F].Math.signal value [(E-xN).sub.i+(F-xn).sub.i],
Value (F-yN).sub.i=signal value [(F-yN).sub.i+E].Math.signal value [(F-yN).sub.i+(E-xN).sub.i], wherein i is an integer and defines a specific primer, and the +-sign indicates a combination of two primers; wherein signal value is the percentage of abundance related to the whole set of qPCR quantification using different primers annealed to the same region; and iii) comparing the obtained mathematical products for each of the primers (A-xN).sub.i, (B-yN).sub.i, (C-xN).sub.i, (D-yN).sub.i, (E-xN).sub.i and (N-yN).sub.i, wherein those primers with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
29. The method according to claim 28, wherein the method comprises i) calculating a mathematical product of the value obtained for each primer A-xN and each primer B-yN, for each primer A-xN and each primer D-xN, for each primer C-yN and D-xN, for each primer A-xN and N-yN, for each primer M-xN and D-yN and for each primer M-xN and N-yN by following equation
Value (AB).sub.i=Value (A-xN).sub.i.Math.Value (B-yN).sub.i;
Value (AD).sub.i=Value (A-xN).sub.i.Math.Value (D-yN).sub.i;
Value (CD).sub.i=Value (C-xN).sub.i.Math.Value (D-yN).sub.i;
Value (AF).sub.i=Value (A-xN).sub.i.Math.Value (F-yN).sub.i;
Value (ED).sub.i=Value (E-xN).sub.i.Math.Value (D-yN).sub.i;
Value (EF).sub.i=Value (E-xN).sub.i.Math.Value (F-yN).sub.i; ii) calculating the mathematical product of the values (AB).sub.i, (AD).sub.i, (CD).sub.i, (A-F).sub.i, (E-D).sub.i and (E-F).sub.i for each primer combinations i by the following equation
Value.sup.i=value (AB).sub.i.Math.value (AD).sub.i.Math.value (CD).sub.i.Math.value (AF).sub.i.Math.value (ED).sub.i.Math.value (EF).sub.i; iii) comparing the obtained mathematical products Value.sup.i, wherein those primer combinations i with high values code for DNA-ligand conjugates which are present at a high concentration in the DNA-encoded library.
30. The method according to claim 29, wherein the method further comprises calculating a Value.sup.i by the following calculation:
Value.sup.i=log.sub.10[value (AB).sub.i value (AD).sub.i value (CD).sub.i value (AF).sub.i.Math.value (ED).sub.i.Math.value (EF).sub.i].
Description
[0087] With reference to the following Figures and Examples, the subject according to the invention is intended to be explained in more detail without wishing to restrict said subject to the special embodiments shown here.
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
EXAMPLE 1DEL COMPRISING DNA BARCODES WITH ONE SINGLE CODING REGION
[0100] For DNA codes containing only one single coding region, each code has 3 parts, #1 (first part), #2 (second part) and #3 (third part). Each #2 sequence is a unique code, while each combination of #1 and #3 can also represent a unique code (see e.g.
[0101] For each part, there is a minimal difference number n between any pair of sequences (e.g. between two different #1 sequences), while n should be 2.
EXAMPLE 2DEL COMPRISING DNA BARCODES WITH TWO CODING REGIONS
[0102] For DNA codes containing two coding regions, each sub-code has 4 parts, for example the first coding region #1 (first part), #2 (second part), #3 (third part) and #4 (fourth part) and the second coding region #1 (first part), #2 (second part), #3 (third part) and #4 (fourth part) (see e.g.
[0103] For each part, there is a minimal difference number n between any pair of sequences (e.g. between two different #1 sequences), while n should be 2.
EXAMPLE 3DEL COMPRISING DNA BARCODES WITH MORE THAN TWO CODING REGIONS
[0104] For DNA codes containing more than two coding regions (see e.g.
EXAMPLE 4DEL COMPRISING DNA BARCODES WITH 5 SUB-CODES
[0105] The DNA barcodes of this DEL have 5 parts, #1 (first part), #2 (second part), #3 (third part), #4 (fourth part) and #5 (fifth part).
[0106] Each #2 sequence (second part) is a unique sub-code, while each combination of #1 and #3 can also represent a unique sub-code. Therefore, a sequence of #2 is corresponding to a combination of #1 and #3. Each combination of #1 and #4 can also represent a unique sub-code. Therefore, a sequence of #2 is corresponding to a combination of #1 and #4. Each combination of #1 and #5 can also represent a unique sub-code. Therefore, a sequence of #2 is corresponding to a combination of #1 and #5.
[0107] For each part, there is a minimal difference number n between any pair of sequences (e.g. between two different #1 sequences), while n should be 2 in all designs.
EXAMPLE 5DESCRIPTION OF DECODING PROCESS: DECODING ONE-(SUB)-CODE
[0108] A primary qPCR matrix is built for the first coding region I using primer A with u different primers B-xb, and primer B with v different primers A-xa. Therefore, the size of resulting matrix is u.Math.v (see e.g.
[0109] A secondary qPCR matrix is built for the first coding region I using pairs of B-xb and A-xa, while B-xb and A-xa are chosen according to the signal intensity in the primary matrix. Same secondary matrices can be built for the second coding region II and the third coding region III. The ranking for each building block can thus be concluded.
[0110] For sequence containing two sub-codes, an additional secondary qPCR matrix can be built using A-xa and D-xd, while A-xa and D-xd are chosen according to the signal intensity in the primary matrices.
[0111] In combination with the two sub-code matrices (A-xa+B-xb and C-xc+D-xd), the ranking of the combinations can be concluded based on certain algorithm, for example:
Value.sup.i=Value.sup.i.sub.matrix-A+D.Math.Value.sup.i.sub.matrix-A+B.Math.Value.sup.i.sub.matrix-C+D,
[0112] wherein the Value.sup.i is a value relating and being proportional to the amount of a certain DNA barcode in the DEL. In other words, said Value.sup.i relates to an individual DNA sequence (barcode structure) which resulted from the combinatorial synthesis through joining two building blocks and two sub-codes.
[0113] To further validate the Value.sup.i ranking, an additional tertiary qPCR matrix can be built using A-xa-ya and D-xd-yd, while A-xa-ya and D-xd-yd are chosen according to the signal intensity in the primary and secondary matrices and the resulting Value.sup.i ranking.
[0114] A full matrix can also be built using A, D and all A-xa-ya and D-xd-yd, though it will be significantly more expensive than the method described before.
[0115] The method cannot provide a fully quantitative decoding solution for DEL containing more than two sub-codes. However, combining various primary, secondary, and tertiary rtPCR matrices can provide a Value.sup.i for certain compounds i, which is corresponding to a DNA code containing several sub-codes. All forward and backward primers can be combined to build a matrix.
[0116] For example, any primer A, A-xa, A-xa-ya can be combined with any primer B, B-xb, N, N-yn, N-xn-yn, D, D-xd, D-xd-yd to build QPCR matrices. A value for a particular compound can be calculated according to certain algorithm, for example:
Value.sup.i=log.sub.10(Value.sup.i.sub.matrix-A+D.Math.Value.sup.i.sub.matrix-A+N.Math.Value.sup.i.sub.matrix-M+D.Math.Value.sup.i.sub.matrix-A+B.Math.Value.sup.i.sub.matrix-C+D.Math.Value.sup.i.sub.matrix-M+N)
[0117] in which the Value.sup.i.sub.matrix-A+D, Value.sup.i.sub.matrix-A+N and Value.sup.i.sub.matrix-M+D can be either from the secondary, or tertiary matrices, or as a combination of them, and in which the Value.sup.i.sub.matrix-A+B.Math.Value.sup.i.sub.matrix-C+D.Math.Value.sup.i.sub.matrix-M+N are from the secondary matrices.
LIST OF REFERENCE SIGNS
[0118] DBC: DNA barcode sequence; [0119] S: substance; [0120] I: first coding region DNA sequence; [0121] II: second coding region DNA sequence; [0122] II: third coding region DNA sequence; [0123] #1: first part of a coding region DNA sequence; [0124] #2: second part of a coding region DNA sequence; [0125] #3: third part of a coding region DNA sequence; [0126] #4: fourth part of a coding region DNA sequence; [0127] #5: fifth part of a coding region DNA sequence; [0128] A, B, C, D, E, F, M, N: primary primer; [0129] A.sub.xa, B.sub.xb, C.sub.xc, D.sub.xd, E.sub.xe, F.sub.xf, M.sub.xm, N.sub.yn: secondary primer; [0130] A.sub.xaya, D.sub.xdyd, M.sub.xmym, N.sub.xnyn: tertiary primer; [0131] 1a, 1b: primary primer binding to all DBS; [0132] 2a, 2b: secondary primer binding to DBS of CBS only; [0133] 3a, 3b: secondary primer binding to Theo only; [0134] P2: primer annealing to first part #1 of coding region I; [0135] P2Y: primer annealing to third part #3 of coding region I; [0136] P2Y: primer annealing to first part #1 of coding region II; [0137] P1Y: primer annealing to third part #3 of coding region II; [0138] P4: primer annealing to first part #1 of coding region III; [0139] P5: primer annealing to third part #3 of coding region III.