Compositions and methods related to non-templated enzymatic nucleic acid synthesis
11505815 · 2022-11-22
Assignee
Inventors
Cpc classification
C07H1/00
CHEMISTRY; METALLURGY
C07H19/04
CHEMISTRY; METALLURGY
C07H19/10
CHEMISTRY; METALLURGY
C12P19/34
CHEMISTRY; METALLURGY
C07H19/20
CHEMISTRY; METALLURGY
International classification
C12P19/34
CHEMISTRY; METALLURGY
C07H19/10
CHEMISTRY; METALLURGY
C07H19/20
CHEMISTRY; METALLURGY
C07H19/04
CHEMISTRY; METALLURGY
Abstract
The invention relates to the use of an amine masked moiety in a method of enzymatic nucleic acid synthesis. The invention also relates to said amine masked moieties per se and a process for preparing nucleotide triphosphates comprising said amine masked moieties.
Claims
1. A method of non-templated enzymatic nucleic acid synthesis comprising: (i) providing a compound of formula (I): ##STR00056## wherein: R.sup.1 represents —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN or —O-methoxyethyl; R.sup.2 represents —H; X represents a triphosphate group; R.sup.3 represents an amine masking group selected from an azide, benzoylamine, isobutyrylamine, dimethylformamidylamine, 9-fluorenylmethyl carbamate, t-butyl carbamate, benzyl carbamate, acetamide, trifluoroacetamide, phthalimide, benzylamine, triphenylmethylamine, benzylideneamine, tosylamide, isothiocyanate, N-allyl or N-anisoyl; and B represents a nitrogenous heterocycle selected from a purine or pyrimidine; (ii) incorporating the compound of formula (I) into a nucleic acid molecule; (iii) repeating steps (i) and (ii) until a desired nucleic acid sequence having the amine masking group is synthesized; and (iv) unmasking the R.sup.3 amine masking group to reveal an amino (—NH.sub.2) group.
2. The method of claim 1, wherein the compound of formula (I) is selected from: ##STR00057## where R.sup.2 represents —H.
3. The method of claim 1, wherein R.sup.1 represents —ONH.sub.2, —ONC(CH.sub.3).sub.2, or —OCH.sub.2N.sub.3.
4. A compound of formula (I).sup.a: ##STR00058## wherein: R.sup.1 represents —ONH.sub.2, —ONC(CH.sub.3).sub.2 or —OCH.sub.2N.sub.3; R.sup.2 represents —H; X represents a triphosphate group; R.sup.3 represents azide, benzoylamine, isobutyrylamine, dimethylformamidylamine, 9-fluorenylmethyl carbamate, t-butyl carbamate, benzyl carbamate, acetamide, trifluoroacetamide, phthalimide, benzylamine, triphenylmethylamine, benzylideneamine, tosylamide, isothiocyanate, N-allyl or N-anisoyl; and B represents a nitrogenous heterocycle selected from a purine or pyrimidine.
5. The method of claim 1, wherein the unmasking comprises unmasking with a reducing agent.
6. The method of claim 5, wherein the reducing agent is selected from beta-mercaptoethanol, dithiothreitol or a phosphine-based reducing agent.
Description
BRIEF DESCRIPTION OF THE FIGURES
DETAILED DESCRIPTION OF THE INVENTION
(13) According to a first aspect of the invention, there is provided the use of an amine masked derivative of a nitrogenous heterocycle, such as adenine, guanine, cytosine, isoguanine, isocytosine and 2,6-diaminopurine in a method of enzymatic nucleic acid synthesis.
(14) According to a further aspect of the invention which may be mentioned, there is provided the use of an amine masked derivative of a nitrogenous heterocycle, such as adenosine, guanosine, and cytidine, in a method of enzymatic nucleic acid synthesis.
(15) References herein to a derivative of adenosine, guanosine and cytidine refer to deoxy derivatives thereof (i.e. deoxyadenosine, deoxyguanosine and deoxycytidine) and the phosphated derivatives thereof (i.e. adenosine monophosphate, adenosine diphosphate, adenosine triphosphate, guanosine monophosphate, guanosine diphosphate, guanosine triphosphate, cytidine monophosphate, cytidine diphosphate, cytidine triphosphate and all the deoxyribose versions thereof).
(16) According to a further aspect of the invention, there is provided the use of a compound of formula (I):
(17) ##STR00008##
wherein:
R.sup.1 represents a moiety capable of being unmasked to reveal a hydroxyl group, including —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy;
R.sup.2 represents —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy or any other molecular moiety;
X represents an —OH group or one or more phosphate, phosphorothioate, boranophosphate or imidophosphate groups, or any combination thereof, wherein said group is capable of endowing competence for enzymatic addition;
R.sup.3 represents an amine masking group, wherein said amino group would be involved in hydrogen bond base-pairing with a complementary base and deamination of said amino group could result in altered hydrogen bonding with a complementary base; and
B represents a nitrogenous heterocycle;
in a method of enzymatic nucleic acid synthesis.
(18) According to a further aspect of the invention which may be mentioned, there is provided the use of a compound of formula (I):
(19) ##STR00009##
wherein:
R.sup.1 and R.sup.2 independently represent —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —O-methoxyethyl, —O-alkyl, —O-alkoxy, cyanoethyl, a thiol or a suitable hydroxy protecting group;
X represents an —OH group or one or more phosphate, phosphorothioate, boranophosphate or imidophosphate groups, or any combination thereof;
R.sup.3 represents an amine masking group, wherein said amino group is involved in hydrogen bond base-pairing with a complementary base; and
B represents a nitrogenous heterocycle;
in a method of enzymatic nucleic acid synthesis.
(20) Enzymatic nucleic acid synthesis is defined as any process in which a nucleotide is added to a nucleic acid strand through enzymatic catalysis in the presence or absence of a template.
(21) For example, a method of enzymatic nucleic acid synthesis could include non-templated de novo nucleic acid synthesis utilizing a PolX family polymerase, such as terminal deoxynucleotidyl transferase, and reversibly terminated 2′-deoxynucleoside 5′-triphosphates or ribonucleoside 5′-triphosphates. Another method of enzymatic nucleic acid synthesis could include templated nucleic acid synthesis, including sequencing-by-synthesis. Reversibly terminated enzymatic nucleic acid synthesis is defined as any process in which a reversibly terminated nucleotide is added to a nucleic acid strand through enzymatic catalysis in the presence or absence of a template. A reversibly terminated nucleotide is a nucleotide containing a chemical moiety that blocks the addition of a subsequent nucleotide. The deprotection or removal of the reversibly terminating chemical moiety on the nucleotide by chemical treatment, electromagnetic radiation, electric current and/or heat allows the addition of a subsequent nucleotide via enzymatic catalysis. Thus, in one embodiment, the method of enzymatic nucleic acid synthesis is selected from a method of reversibly terminated enzymatic nucleic acid synthesis and a method of templated and non-templated de novo enzymatic nucleic acid synthesis.
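The reversible-termination logic described above can be sketched as a small state model. This is an illustrative sketch only, not the method of the invention: the `Strand` class, its method names and the `synthesize` helper are hypothetical, and the enzyme, the blocking moiety and the deprotection chemistry are each reduced to a boolean flag.

```python
from dataclasses import dataclass, field


@dataclass
class Strand:
    """Toy model of a growing nucleic acid strand (hypothetical, illustrative only)."""
    bases: list = field(default_factory=list)
    terminated: bool = False  # True while the 3'-end carries a blocking moiety

    def add_reversibly_terminated(self, base: str) -> None:
        # Enzymatic addition is only possible at an unblocked 3'-end.
        if self.terminated:
            raise RuntimeError("3'-end is blocked; deprotect before adding")
        self.bases.append(base)
        self.terminated = True  # the newly added nucleotide carries the block

    def deprotect(self) -> None:
        # Stands in for removal of the blocking moiety by chemical,
        # electromagnetic, electric-current or thermal means.
        self.terminated = False


def synthesize(sequence: str) -> Strand:
    """Add one base per cycle, deprotecting after every addition."""
    strand = Strand()
    for base in sequence:
        strand.add_reversibly_terminated(base)
        strand.deprotect()
    return strand
```

In this model, calling `add_reversibly_terminated` twice without an intervening `deprotect` raises an error, which mirrors the single-addition-per-cycle behaviour that reversible termination is intended to enforce.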
(22) The compound of formula (I) contains three synergistic components which may be summarized as follows: (i)—The R.sup.3 group. R.sup.3 is typically a chemical moiety on the nitrogenous heterocycle that can be unmasked to reveal an amino (—NH.sub.2) group; (ii)—The R.sup.1 group. R.sup.1 is typically a chemical moiety at the 3′-position on the sugar that can be unmasked to reveal a hydroxyl (—OH) group; and (iii)—The X group. X is typically a chemical moiety endowing competence for enzymatic addition (e.g., 5′-triphosphate group).
(23) Without being bound by theory, it is believed that the combination of R.sup.1, R.sup.3 and X results in nucleotide analogs that protect the amino group in component (i) from mutation during the method of enzymatic nucleic acid synthesis described herein. Specifically, a method of enzymatic nucleic acid synthesis would involve nucleotide analogs that have characteristic R.sup.3, X, and R.sup.1, where R.sup.1 is fixed as an —ONH.sub.2 group.
(24) In one embodiment, R.sup.1 and R.sup.2 independently represent —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy or a suitable hydroxyl protecting group.
(25) In one embodiment, the compound of formula (I) is selected from:
(26) ##STR00010##
where R.sup.2 is as defined herein, such as —OH or —H. In one embodiment, R.sup.2 is H.
(27) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.b:
(28) ##STR00011##
(29) The compound of formula (I).sup.b is known chemically as N6-azido 2′-deoxyadenosine. Upon exposure of the compound of formula (I).sup.b to sodium nitrite, no conversion to 2′-deoxyinosine was observed, as shown in
(30) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.c:
(31) ##STR00012##
(32) The compound of formula (I).sup.c is known chemically as N4-azido 2′-deoxycytidine. Upon exposure of the compound of formula (I).sup.c to sodium nitrite, no conversion to 2′-deoxyuridine was observed, as shown in
(33) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.d:
(34) ##STR00013##
(35) The compound of formula (I).sup.d is known chemically as N6-acetyl 2′-deoxyadenosine. Upon exposure of the compound of formula (I).sup.d to sodium nitrite, no conversion to 2′-deoxyinosine was observed, as shown in
(36) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.e:
(37) ##STR00014##
(38) The compound of formula (I).sup.e is known chemically as N4-acetyl 2′-deoxycytidine. Upon exposure of the compound of formula (I).sup.e to sodium nitrite, no conversion to 2′-deoxyuridine was observed, as shown in
(39) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.f:
(40) ##STR00015##
(41) The compound of formula (I).sup.f is known chemically as N6-benzyl 2′-deoxyadenosine. Upon exposure of the compound of formula (I).sup.f to sodium nitrite, no conversion to 2′-deoxyinosine was observed, as shown in
(42) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.g:
(43) ##STR00016##
(44) The compound of formula (I).sup.g is known chemically as N4-anisoyl 2′-deoxycytidine. Upon exposure of the compound of formula (I).sup.g to sodium nitrite, no conversion to 2′-deoxyuridine was observed, as shown in
(45) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.h:
(46) ##STR00017##
(47) The compound of formula (I).sup.h is known chemically as N4-methyl 2′-deoxycytidine. As shown here, secondary amines are protected from oxidative deamination induced by nitrite solutions. Thus N-methyl would be an appropriate protecting group. Exocyclic N-methyl can be conveniently removed by treatment with demethylating enzymes such as AlkB (D. Li, et al., Chem. Res. Toxicol. 26 (2013) 1182-1187).
(48) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.i:
(49) ##STR00018##
(50) The compound of formula (I).sup.i is known chemically as N6-methyl 2′-deoxyadenosine. As shown here, secondary amines are protected from oxidative deamination induced by nitrite solutions. Thus N-methyl would be an appropriate protecting group. Exocyclic N-methyl can be conveniently removed by treatment with demethylating enzymes such as AlkB (D. Li, et al., Chem. Res. Toxicol. 26 (2013) 1182-1187). The triphosphate form of species (I).sup.i can act as a substrate for terminal transferase enzymes in a DNA synthesis process as shown in
(51) In one embodiment, the compound of formula (I) is a compound of formula (I).sup.j:
(52) ##STR00019##
(53) The compound of formula (I).sup.j is known chemically as 3′-azido N4-benzoyl 2′-deoxycytidine. Upon exposure of the compound of formula (I).sup.j to sodium nitrite, no conversion to 3′-azido 2′-deoxyuridine was observed, as shown in
(54) In one embodiment, X represents an —OH group. In an alternative embodiment, X represents a triphosphate group. The triphosphate group of this embodiment has the advantage of being most commonly utilized with nucleotidyl transferases (e.g., polymerases) or any chemical moieties allowing addition to a nucleic acid molecule through enzymatic or chemical catalysis.
(55) References herein to “amine” refer to a —NH.sub.2 group.
(56) References herein to an “amine masking group” refer to any chemical group which is capable of generating or “unmasking” an amine group which is involved in hydrogen bond base-pairing with a complementary base. Most typically the unmasking will follow a chemical reaction, most suitably a simple, single step chemical reaction. In one embodiment, the hydrogen bond base-pairing is selected from: Watson-Crick, Hoogsteen, or alternative/expanded genetic code base pairing.
(57) Examples of suitable amine masking groups for R.sup.3 include azide (—N.sub.3), benzoylamine (N-benzoyl or —NHCOPh), N-methyl (—NHMe), isobutyrylamine, dimethylformamidylamine, 9-fluorenylmethyl carbamate, t-butyl carbamate, benzyl carbamate, acetamide (N-acetyl or —NHCOMe), trifluoroacetamide, phthalimide, benzylamine (N-benzyl or —NH—CH.sub.2-phenyl), triphenylmethylamine, benzylideneamine, tosylamide, isothiocyanate, N-allyl (such as N-dimethylallyl (—NHCH.sub.2—CH═CH.sub.2)) and N-anisoyl (—NHCOPh-OMe), in particular azide (—N.sub.3), N-acetyl (—NHCOMe), N-benzyl (—NH—CH.sub.2-phenyl), N-anisoyl (—NHCOPh-OMe), N-methyl (—NHMe), N-benzoyl (—NHCOPh) or N-dimethylallyl (—NHCH.sub.2—CH═CH.sub.2).
(58) In one embodiment, B represents a nitrogenous heterocycle selected from a purine or pyrimidine, or derivative thereof. In a further embodiment, B and R.sup.3 can be combined into the following molecular structures, where the nitrogenous heterocycle is connected to the (deoxy)ribose 1′ position of the compound of formula (I):
(59) ##STR00020##
(60) In a further embodiment, R.sup.3 represents an azide (—N.sub.3) group and B is selected from:
(61) ##STR00021##
(62) The term ‘azide’ or ‘azido’ used herein refers to an —N.sub.3, or more specifically, an —N═N.sup.+═N.sup.− group. It will also be appreciated that azide extends to the presence of a tetrazolyl moiety. The “azide-tetrazole” equilibrium is well known to the skilled person from Lakshman et al (2010) J. Org. Chem. 75, 2461-2473. Thus, references herein to azide extend equally to tetrazole as illustrated below when applied to the R.sup.3 groups defined herein:
(63) ##STR00022##
(64) This embodiment has the advantage of reversibly masking the —NH.sub.2 group. While blocked in the —N.sub.3 state, the base (B) is impervious to deamination (e.g., deamination in the presence of sodium nitrite). The canonical cytosine, adenine, guanine can be respectively recovered from 4-azido cytosine, 6-azido adenine and 2-azido guanine by exposure to a reducing agent (e.g., TCEP). Thus, the —N.sub.3 group serves as an effective protecting group against deamination, especially in the presence of sodium nitrite.
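The masking and unmasking behaviour described above can be expressed as two lookup tables. The following is a minimal sketch under stated assumptions: the azide-to-canonical pairs are those named in the text, whereas the deamination products (uracil, hypoxanthine, xanthine) are general nucleobase chemistry supplied here for illustration, and the function names are hypothetical. The model treats nitrite exposure as a worst case in which every unmasked amine is deaminated.

```python
# Masked base -> canonical base recovered on reduction (pairs taken from the text).
AZIDE_TO_CANONICAL = {
    "4-azido cytosine": "cytosine",
    "6-azido adenine": "adenine",
    "2-azido guanine": "guanine",
}

# Product of deaminating an unmasked exocyclic amine. Assumption: general
# nucleobase chemistry, added for illustration and not listed in the text.
DEAMINATION_PRODUCT = {
    "cytosine": "uracil",
    "adenine": "hypoxanthine",
    "guanine": "xanthine",
}


def expose_to_nitrite(base: str) -> str:
    """Worst case: unmasked amines deaminate; azide-masked bases pass through unchanged."""
    return DEAMINATION_PRODUCT.get(base, base)


def reduce_azide(base: str) -> str:
    """Unmask an azide with a reducing agent (e.g. TCEP); other bases are unchanged."""
    return AZIDE_TO_CANONICAL.get(base, base)
```

Under these assumptions, `reduce_azide(expose_to_nitrite("4-azido cytosine"))` yields `"cytosine"`, whereas `expose_to_nitrite("cytosine")` yields `"uracil"`, which is the contrast the masking group is intended to achieve.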
(65) It will be appreciated that the compounds of the invention may be readily applied to methods of enzymatic nucleic acid synthesis which are well known to the person skilled in the art.
(66) Non-limiting methods of nucleic acid synthesis may be found in WO 2016/128731, WO 2016/139477, WO 2017/009663, GB 1613185.6 and GB 1714827.1, the contents of each of which are herein incorporated by reference.
(67) According to a further aspect of the invention, there is provided a compound of formula (I).sup.a:
(68) ##STR00023##
wherein:
R.sup.1 represents a moiety capable of being unmasked to reveal a hydroxyl group, including —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy;
R.sup.2 represents —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy or any other molecular moiety;
X represents one or more phosphate, phosphorothioate, boranophosphate or imidophosphate groups, or any combination thereof, wherein said group is capable of endowing competence for enzymatic addition;
R.sup.3 represents an amine masking group, wherein said amino group would be involved in hydrogen bond base-pairing with a complementary base and deamination of said amino group could result in altered hydrogen bonding with a complementary base; and
B represents a nitrogenous heterocycle.
(69) According to a further aspect of the invention which may be mentioned, there is provided a compound of formula (I).sup.a:
(70) ##STR00024##
wherein:
R.sup.1 and R.sup.2 independently represent —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —O-methoxyethyl, —O-alkyl, —O-alkoxy, cyanoethyl, a thiol or a suitable hydroxy protecting group;
X represents one or more phosphate, phosphorothioate, boranophosphate or imidophosphate groups, or any combination thereof;
R.sup.3 represents an amine masking group, wherein said amino group is involved in hydrogen bond base-pairing with a complementary base; and
B represents a nitrogenous heterocycle.
(71) In one embodiment, X represents a triphosphate group.
(72) In one embodiment, R.sup.1 and R.sup.2 independently represent —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy.
(73) In an alternative embodiment, R.sup.1 and R.sup.2 independently represent —H, —OH, —ONH.sub.2, —N.sub.3, —OCH.sub.2N.sub.3, —ONC(CH.sub.3).sub.2, —OCH.sub.2CHCH.sub.2, —O-methoxyethyl, —O-alkyl, —O-alkoxy, cyanoethyl, a thiol or a suitable hydroxy protecting group.
(74) Examples of suitable amine masking groups for R.sup.3 include azide (—N.sub.3), benzoylamine (N-benzoyl or —NHCOPh), N-methyl (—NHMe), isobutyrylamine, dimethylformamidylamine, 9-fluorenylmethyl carbamate, t-butyl carbamate, benzyl carbamate, acetamide (N-acetyl or —NHCOMe), trifluoroacetamide, phthalimide, benzylamine (N-benzyl or —NH—CH.sub.2-phenyl), triphenylmethylamine, benzylideneamine, tosylamide, isothiocyanate, N-allyl (such as N-dimethylallyl (—NHCH.sub.2—CH═CH.sub.2)) and N-anisoyl (—NHCOPh-OMe), in particular azide (—N.sub.3), N-acetyl (—NHCOMe), N-benzyl (—NH—CH.sub.2-phenyl), N-anisoyl (—NHCOPh-OMe), N-methyl (—NHMe), N-benzoyl (—NHCOPh) or N-dimethylallyl (—NHCH.sub.2—CH═CH.sub.2).
(75) In one embodiment, B represents a nitrogenous heterocycle selected from a purine or pyrimidine. In a further embodiment, B and R.sup.3 can be combined into the following molecular structures, where the nitrogenous heterocycle is connected to the (deoxy)ribose 1′ position of the compound of formula (I):
(76) ##STR00025##
(77) In one embodiment, R.sup.3 represents an azide (—N.sub.3) group and B is selected from:
(78) ##STR00026##
(79) One particular compound of formula (I).sup.a which may be mentioned (1) is one wherein R.sup.1 represents —ONH.sub.2, R.sup.2 represents H, X represents a triphosphate group, B represents:
(80) ##STR00027##
and R.sup.3 represents N.sub.3, thus a compound of formula (1):
(81) ##STR00028##
(82) The compound of formula (1) may be prepared in accordance with the following synthetic scheme:
(83) ##STR00029## ##STR00030##
(84) One further particular compound of formula (I).sup.a which may be mentioned (2) is one wherein R.sup.1 represents —ONH.sub.2, R.sup.2 represents H, X represents a triphosphate group, B represents:
(85) ##STR00031##
and R.sup.3 represents N.sub.3, thus a compound of formula (2):
(86) ##STR00032##
(87) The compound of formula (2) may be prepared in accordance with the following synthetic scheme:
(88) ##STR00033## ##STR00034##
(89) One further particular compound of formula (I).sup.a which may be mentioned (3) is one wherein R.sup.1 represents —ONH.sub.2, R.sup.2 represents H, X represents a triphosphate group, B represents:
(90) ##STR00035##
and R.sup.3 represents N.sub.3, thus a compound of formula (3):
(91) ##STR00036##
(92) The compound of formula (3) may be prepared in accordance with the following synthetic scheme:
(93) ##STR00037## ##STR00038##
(94) In another embodiment, R.sup.3 represents an acetyl (—Ac) group and B is selected from:
(95) ##STR00039##
(96) In another embodiment, R.sup.3 represents an anisoyl group and B is selected from:
(97) ##STR00040##
(98) In another embodiment, R.sup.3 represents a benzyl group and B is selected from:
(99) ##STR00041##
(100) In another embodiment, R.sup.3 represents a benzyl group and B is selected from:
(101) ##STR00042##
(102) In another embodiment, R.sup.3 represents a methyl group and B is selected from:
(103) ##STR00043##
(104) In another embodiment, R.sup.3 represents an allyl group and B is selected from:
(105) ##STR00044##
(106) Particular compounds of formula (I).sup.a which may be mentioned (4-27) are those wherein R.sup.1 represents —ONH.sub.2, R.sup.2 represents H, X represents a triphosphate group and B represents the bases described above, resulting in compounds:
(107) ##STR00045## ##STR00046## ##STR00047## ##STR00048## ##STR00049##
(108) According to a further aspect of the invention, there is provided a process of preparing a compound of formula (V):
(109) ##STR00050##
wherein X, R.sup.1, R.sup.2 and B are as defined herein, which comprises reacting a compound of formula (I):
(110) ##STR00051##
wherein X, R.sup.1, R.sup.2, R.sup.3 and B are as defined herein, with a chemical, with electromagnetic radiation, with heat and/or with an electric current.
(111) According to a further aspect of the invention, there is provided a process of preparing a compound of formula (II), (III) or (IV):
(112) ##STR00052##
wherein X, R.sup.1 and R.sup.2 are as defined herein, which comprises reacting a compound of formula (II).sup.a, (III).sup.a or (IV).sup.a, respectively:
(113) ##STR00053##
wherein X, R.sup.1 and R.sup.2 are as defined herein, with a chemical, with electromagnetic radiation and/or with an electric current.
(114) According to a further aspect of the invention, there is provided a process of preparing a compound of formula (II), (III) or (IV) as defined herein, which comprises reacting a compound of formula (VI):
(115) ##STR00054##
wherein X, R.sup.1, R.sup.2 and B are as defined herein, with a reducing agent.
(116) In one embodiment, the reducing agent is selected from beta-mercaptoethanol, dithiothreitol or a phosphine-based reducing agent such as tris(hydroxymethyl)phosphine (THP), tris(hydroxypropyl)phosphine (THPP) or tris(2-carboxyethyl)phosphine (TCEP).
(117) According to a further aspect of the invention, there is provided a compound of formula (VII):
(118) ##STR00055##
wherein R.sup.2 represents —H, —OH, —ONH.sub.2, —ONC(CH.sub.3).sub.2, —OCH.sub.2N.sub.3, —OCH.sub.2CHCH.sub.2, —OPO.sub.3.sup.2−, —OCH.sub.2SSCH.sub.2CH.sub.3, —OCOCH.sub.3, —OCH.sub.2CH.sub.2CN, —O-methoxyethyl, —O-alkyl, or —O-alkoxy or a suitable hydroxy protecting group;
X represents one or more phosphate, phosphorothioate, boranophosphate or imidophosphate groups, or any combination thereof; and
R.sup.4 represents C.sub.2-6 alkyl, —F, —Cl, —Br, —I, alkoxy, biotin, alkylamine or azide.
(119) According to a further aspect of the invention, there is provided the use of a compound of formula (VII) in a method of enzymatic nucleic acid synthesis.
(120) In one embodiment, the method of enzymatic nucleic acid synthesis is selected from a method of reversibly terminated enzymatic nucleic acid synthesis and a method of templated and non-templated de novo enzymatic nucleic acid synthesis.
(121) The following studies illustrate the invention:
Example 1: Enzymatic DNA Synthesis Using Azide-Masked Nitrogenous Heterocycles
(122) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(123) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with an azido group to prevent oxidative deamination (
(124) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-azidocytosine, N6-azidoadenine, N2-azidoguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are unmasked to reveal an amino group through exposure to a reducing agent (e.g., TCEP). The DNA polymer is now composed of nitrogenous heterocycles with unmasked amino groups (e.g., N4-azidocytosine is unmasked to cytosine, N6-azidoadenine is unmasked to adenine and N2-azidoguanine is unmasked to guanine). The DNA polymer can now be used for downstream molecular biology applications.
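The reason repeated deprotection is a concern for unmasked amines can be illustrated with simple arithmetic. The sketch below is not data from this disclosure: it assumes one base is added per cycle, that every base carries an exocyclic amine, that a nitrite treatment follows every addition (including the last), and that each treatment deaminates each unmasked amine independently with a hypothetical probability `p_deam`. The function names are likewise hypothetical.

```python
def exposures(cycle: int, total_cycles: int) -> int:
    """Number of deprotection treatments seen by the base added in `cycle` (1-indexed)."""
    return total_cycles - cycle + 1


def intact_probability(total_cycles: int, p_deam: float) -> float:
    """Probability that no base in the strand has been deaminated.

    `p_deam` is a hypothetical per-treatment, per-base deamination
    probability for an unmasked amine; it is not a measured value.
    A fully masked base is modelled by p_deam = 0.0.
    """
    total_exposures = sum(
        exposures(k, total_cycles) for k in range(1, total_cycles + 1)
    )
    return (1.0 - p_deam) ** total_exposures
```

In this model the first base of an n-base strand is treated n times and the last base once, so the total number of base exposures grows as n(n+1)/2; for a 3-base strand this is 3 + 2 + 1 = 6. Masking corresponds to setting `p_deam` to zero, for which the strand remains intact regardless of length.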
Example 2: Enzymatic DNA Synthesis Using N-Acetyl-Masked Nitrogenous Heterocycles
(125) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(126) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with an acetyl group to protect from oxidative deamination (
(127) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-acetylcytosine, N6-acetyladenine, N2-acetylguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are deacetylated and thus unmasked to reveal an amino group through exposure to a base (e.g., potassium carbonate) as shown in
Example 3: Enzymatic DNA Synthesis Using N-Benzoyl- and N-Anisoyl-Masked Nitrogenous Heterocycles
(128) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(129) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with a benzoyl group to protect from oxidative deamination (
(130) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-benzoylcytosine, N6-benzoyladenine, N2-benzoylguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are debenzoylated and thus unmasked to reveal an amino group through exposure to a base (e.g., methylamine) as shown in
Example 4: Enzymatic DNA Synthesis Using N-Benzyl-Masked Nitrogenous Heterocycles
(131) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(132) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with a benzyl group to protect from oxidative deamination (
(133) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-benzylcytosine, N6-benzyladenine, N2-benzylguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are debenzylated and thus unmasked to reveal an amino group through hydrogenolysis (e.g., Pd-C) or tert-butoxide and O.sub.2 in DMSO. The DNA polymer is now composed of nitrogenous heterocycles with unmasked amino groups (e.g., N4-benzylcytosine is unmasked to cytosine, N6-benzyladenine is unmasked to adenine and N2-benzylguanine is unmasked to guanine). The DNA polymer can now be used for downstream molecular biology applications.
Example 5: Enzymatic DNA Synthesis Using N-Methyl-Masked Nitrogenous Heterocycles
(134) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(135) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with a methyl group. For example, one or a combination of 2′-deoxy-3′-O-aminoxy-N4-methylcytidine 5′-triphosphate, 2′-deoxy-3′-O-aminoxy-N6-methyladenosine 5′-triphosphate, 2′-deoxy-3′-O-aminoxy-N2-methylguanosine 5′-triphosphate and 2′-deoxy-3′-O-aminoxy-5-ethyluridine 5′-triphosphate is used as the nucleotide building block during each addition cycle in the presence of engineered TdT and required buffer components.
(136) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-methylcytosine, N6-methyladenine, N2-methylguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are demethylated and thus unmasked to reveal an amino group through exposure to demethylases. For example, the amine-masked DNA polymer can be exposed to a cocktail of known demethylases or to a single demethylase such as the DNA repair enzyme AlkB. The DNA polymer is now composed of nitrogenous heterocycles with unmasked amino groups (e.g., N4-methylcytosine is unmasked to cytosine, N6-methyladenine is unmasked to adenine and N2-methylguanine is unmasked to guanine). The DNA polymer can now be used for downstream molecular biology applications.
Example 6: Enzymatic DNA Synthesis Using N-Allyl-Masked Nitrogenous Heterocycles
(137) In the following method of DNA synthesis, engineered terminal deoxynucleotidyl transferase is used to add 3′-O-aminoxy reversibly terminated 2′-deoxynucleoside 5′-triphosphates to the 3′-end of DNA strands. This addition process is repeated until a desired sequence is synthesized. The 3′-O-aminoxy moiety must be deaminated (e.g., with acidic sodium nitrite) after each addition cycle to effect reversible termination. The process of deamination after each addition cycle also results in the mutagenic deamination of nitrogenous heterocycles containing amines (e.g., adenine, cytosine and guanine).
(138) Thus, in this example, amino moieties on the nitrogenous heterocycles are masked with an allyl group to protect from oxidative deamination (
(139) A DNA polymer with amine-masked nitrogenous heterocycles (e.g., N4-allylcytosine, N6-allyladenine, N2-allylguanine) is thus synthesized. All amine-masked nitrogenous heterocycles are deallylated and thus unmasked to reveal an amino group through exposure to tetrakis(triphenylphosphine) palladium. The DNA polymer is now composed of nitrogenous heterocycles with unmasked amino groups (e.g., N4-allylcytosine is unmasked to cytosine, N6-allyladenine is unmasked to adenine and N2-allylguanine is unmasked to guanine). The DNA polymer can now be used for downstream molecular biology applications.
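The unmasking conditions named in Examples 1 to 6 can be collected into a single lookup for reference. This is a convenience sketch only: the dictionary and the `unmasking_condition` helper are hypothetical names, the entries simply restate the example conditions given above, and no condition is listed for N-anisoyl because Example 3 names a reagent only for debenzoylation.

```python
# Masking group -> unmasking condition, restating Examples 1 to 6.
UNMASKING_CONDITIONS = {
    "azide": "reducing agent (e.g. TCEP)",
    "acetyl": "base (e.g. potassium carbonate)",
    "benzoyl": "base (e.g. methylamine)",
    "benzyl": "hydrogenolysis (e.g. Pd-C) or tert-butoxide and O2 in DMSO",
    "methyl": "demethylase (e.g. AlkB)",
    "allyl": "tetrakis(triphenylphosphine)palladium",
}


def unmasking_condition(mask: str) -> str:
    """Return the example unmasking condition for a masking group name.

    Accepts names with or without an "N-" prefix, in any letter case.
    Raises KeyError for groups with no condition listed in the examples.
    """
    key = mask.strip().lower()
    if key.startswith("n-"):
        key = key[2:]
    return UNMASKING_CONDITIONS[key]
```

For instance, `unmasking_condition("N-Allyl")` returns the palladium reagent of Example 6, while a group such as tosyl, for which the examples give no unmasking condition, raises `KeyError`.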