Information storage with rectified dna

11708389 · 2023-07-25

    Inventors

    Cpc classification

    International classification

    Abstract

    Disclosed here are compositions of matter that comprise DNA analogues that have been rectified by replacing the four standard components with alternative components that mitigate various problems frequently encountered during synthesis of these compounds, during the functioning of these compounds, and complex mixtures for information data storage.

    Claims

    1. A polydeoxyribonucleotide comprising at least 10 nucleotide units, wherein two or more of said nucleotide units are replaced by a 2′-deoxyribonucleotide analog having a D-configuration selected from the group consisting of ##STR00001## wherein D is the point of attachment of said analog to nucleotides in the polydeoxyribonucleotide, X and L are side chains that are independently selected to be alkyl, and Q is either N or CH.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    (1) FIG. 1. Acid-catalyzed cleavage of the glycosidic bond in an exemplary nucleotide building block of DNA, 2′-deoxyadenosine. While not wishing to be bound by theory, a mechanism is proposed that involves protonation by the acid on the N-7 nitrogen of the nucleobase. Analogous cleavage processes are available to 2′-deoxyguanosine. Lost of the nucleobase can be followed by fragmentation of the DNA chain, as shown.

    (2) FIG. 2. Compositions of the instant invention that mitigate the undesired acid-catalyzed reactivity of standard 2′-deoxyribonucleotides. These are deoxyribonucleotides that comprise at least 10 nucleotides, wherein two or more of said nucleotides are replaced by a 2′-deoxyribonucleotide analog having a D-configuration selected from structures shown, wherein D is the point of attachment of said analog to the preceding and following nucleotides in the deoxyribonucleotide, X and L are side chains that are independently selected to be methyl, lower alkyl, functionalized alkyl, such as carboxymethyl, and Q is either N or CH.

    (3) FIG. 3. Synthesis of one of the compositions of the instant invention that mitigate the undesired acid-catalyzed reactivity of standard 2′-deoxyribonucleotides.

    DETAILED DESCRIPTION OF THE INVENTION

    (4) The vintage of some of these references and the very few examples where DNA has been used for non-biological applications suggest that obstacles must obstruct these concepts in practice. This disclosure teaches problematic limitations of natural DNA for these and other concept applications. Then, the instant inventions that are disclosed here provide compositions of matter assembled from a plurality of nucleotide analogs that mitigate those problematic limitations.

    (5) One set of problematic limitations relates to the chemical synthesis of DNA for genetic and non-genetic applications. Here, “chemical synthesis” refers to phosphoramidite, phosphotriester, or other processes that make DNA without involving enzymes in the nucleotide coupling steps. Chemical synthesis of DNA creates oligonucleotides contaminated with defective molecules. These defects, known in the art, are illustrated here for DNA, recognizing that similar issues apply to RNA and various of its sugar modified backbones, including (without limitation) species such as locked DNA, bicyclic DNA, RNA, and 2′-fluoro-2′-deoxy DNA, among others. In this application, “DNA” is understood to include these and other analogs.

    (6) The instant invention teaches that these particular problematic limitations can be usefully mitigated or resolved by changing the structure of the nucleotides, but in a way that allows them to continue to interface with standard enzymes, proteins, and nucleic acids in their natural or mutated forms. Speaking generally, the instant invention discloses replacements for adenine, guanine, cytosine, and thymine, and their analogs, that do not suffer from, or suffer less from, these problematic molecular features intrinsic to natural nucleoside building blocks, but still allow them to form nucleobase pairs with their complements (respectively) thymine and uracil, cytosine, guanine, and adenine. These are referred to as “conversion attribute.”

    (7) At a minimum, to have the attribute, oligonucleotides containing the replacements for adenine, guanine, cytosine, thymine, and their analogs must, immediately or eventually, pair with complementary oligonucleotides in simple strand-strand hybridization procedures. Further, that pairing must have the same specificity as the pairing with fully natural oligonucleotides (recognizing that even totally natural nucleobase pairing is not perfectly specific). This is one example of a “conversion attribute”

    (8) Further value is gained if the replacements for adenine, guanine, cytosine, and thymine, and their analog nucleobases interact with their complements with the same specificity as their natural components in the active sites of DNA polymerases, RNA polymerases, reverse transcriptases, or other enzymes charged with replicating oligonucleotides. This is another example of a “conversion attribute”.

    (9) Still further value is gained if the replacements for adenine, guanine, cytosine, and thymine, and their analogs, assembled by chemical synthesis into an oligonucleotide of the instant invention, lead to the production of another oligonucleotide (or oligonucleotides) that is (are) either fully natural, or sufficiently natural to be used in living systems, including living cells. As an example without limit, to have fully conversion attributes, oligonucleotides of the instant invention must be able to template a polymerase chain reaction (PCR) that generates multiple copies of a natural DNA that substantially preserves the information in the oligonucleotide(s) of the instant invention. This is another example of a “conversion attribute”.

    (10) The problematic limitations of standard DNA (and, by analogy, RNA and other analogs) are now summarized individually, together with the instant inventions taught by this specification to mitigate them.

    (11) One set of limitations relates to the chemical synthesis of DNA, where “chemical synthesis” reflects phosphoramidite, phosphotriester, or other systems that do not involve enzymes. Chemical synthesis of DNA always creates oligonucleotides contaminated with defective molecules, with only the amounts of those defective oligonucleotides depending on the precise process by which those oligonucleotides were prepare. These defects, known in the art, are illustrated here for DNA, recognizing that similar issues apply to RNA and various of its sugar modified backbones, including (without limitation) species such as locked DNA, bicyclic DNA, RNA, and 2′-fluoro-2′-deoxy DNA. We teach some of them individually.

    (12) The glycosyl bond attaching the nucleobase to the 2′-deoxy ribose unit in the backbone is easily cleaved under acidic conditions. This is especially problematic in standard phosphoramidite-based chemical synthesis, where acidic conditions are repeatedly applied to remove trityl protecting groups in growing DNA strands. This leads to product oligocleotides that are defective in that they have lost a nucleobase at one or more sites. This loss generally leads to the fragmentation of the oligonucleotide.

    (13) The standard nucleobases of standard DNA can react under the alkaline conditions used to protect the nucleobases during chemical synthesis. For example, cytosine is readily deaminated to give uracil, changing the information content of the synthetic DNA oligonucleotide. Adenine may be deaminated to inosine, which pairs best with cytosine in a duplex, also changing the information content of the synthetic DNA.

    (14) C-glycosides manage these by assembling an oligonucleotide of the instant invention from building blocks that replace some, most, or all of their adenine, guanine, cytosine, and thymine-containing building blocks. In natural DNA, each of these building blocks has its heterocycle joined to the oligonucleotide backbone by a carbon-nitrogen bond. In the claimed oligonucleotides of the instant invention, building blocks are analogs that have their heterocycles joined to the oligonucleotide backbone by a carbon-carbon bond. These are called “C glycosides”. Representative structures are shown in FIG. 2.

    (15) This specification further teaches specific attachments for the heterocycles in these building blocks, the X and L groups in the Figures. These can either be more hydrophobic (for example, without limitation, methyl or ethyl) or more hydrophilic (for example, without limitation, carboxamidomethyl or hydroxyethyl), depending on the needs to mitigate other problematic limitations of DNA, as disclosed further below. However

    EXAMPLES

    Example 1. Synthesis of the 7-Me-A/G Analogs (FIG. 3)

    Compound 2

    (16) Compound 1 (1.55 g, 5 mmol) is dissolved in acetone (50 mL). Potassium carbonate (690 mg, 5 mmol) is added, followed by iodomethane (1.5 mL, 25 mmol). The resulting suspension is heated under reflux for 1 h. The volatiles are removed in vacuo and the residue is partitioned between ethyl acetate (EtOAc) and saturated aqueous NaHCO.sub.3 solution. The organic phase is dried (Na.sub.2SO.sub.4) and concentrated. The residue is purified by flash liquid chromatography on silica (FLC) to give compound 2.

    Compound 4

    (17) A suspension of palladium acetate (34 mg, 0.15 mmol) and triphenyl arsine (92 mg, 0.3 mmol) in N,N-dimethylformamide (DMF) (6 mL) is stirred at room temperature for 30 minutes. To this is added a solution of compound 2 (328 mg, 1 mmol), compound 3 (430 mg, 1.2 mmol) and tributylamine (0.475 mL, 2 mmol) in DMF (8 mL). The resulting suspension is stirred at 60° C. for 20 h. The solids are centrifuged off (10,000 rpm, 5 min) and the supernatant is partitioned between EtOAC and saturated aqueous NaHCO.sub.3 solution. The organic phase is dried (Na.sub.2SO.sub.4) and concentrated. The residue is purified by FLC to give compound 4.

    Compound 5

    (18) A solution of compound 4 (555 mg, 1 mmol) and acetic acid (glac., 0.30 mL, 5 mmol) in THF (10 mL) is cooled to 0° C. A solution of tetrabutylammonium fluoride in THF (1 M, 1.5 mL, 1.5 mmol) is added. After 1 h at 0° C. the reaction is concentrated in vacuo. The residue is dissolved in acetonitrile (8 mL) and acetic acid (glac., 4 mL). The solution is cooled to 0° C., and sodium triacetoxyborohydride (254 mg, 1.2 mmol) is added. After stirring at 0° C. for 1 h the reaction is quenched by the addition of acetone (0.5 mL). The solution is concentrated and the residue purified by FLC to give compound 5.

    Compound 6

    (19) Compound 5 (318 mg, 1 mmol) is dissolved in methanolic ammonia (7 N, 10 mL, 70 mmol). The solution is heated in a pressure vessel at 100° C. for 20 h. The volatiles are removed in vacuo and the residue is purified by FLC to give compound 6.

    Compound 7

    (20) To a solution of compound 6 (149 mg, 0.5 mmol) in ethanol (20 mL) is added palladium hydroxide on carbon (20 wt% Pd, 50 mg, 0.1 mmol). The mixture is saturated with hydrogen and stirred under a hydrogen atmosphere (1 bar) at room temperature. After 20 h, the mixture is filtered over Celite, the volatiles are removed in vacuo, and the residue is purified by FLC to give compound 7.

    Compound 8

    (21) Compound 8 is prepared from compound 7 following standard procedures for the preparation of nucleoside phosphoramidites:

    (22) A solution of compound 7 (264 mg, 1 mmol), 4,4′-dimethoxytrityl chloride (407 mg, 1.2 mmol) and 4-(dimethylamino)pyridine (12 mg, 0.1 mmol) in pyridine (10 mL) is stirred at room temperature for 4 h. The reaction is quenched by the addition of methanol (0.5 mL). The volatiles are removed in vacuo and the residue is purified by FLC.

    (23) This intermediate (454 mg, 0.8 mmol) is dissolved in dichloromethane (8 mL), and N,N-diisopropylethylamine (0.28 mL, 1.6 mmol) is added. The solution is cooled to 0° C. and 2-cyanoethyl N,N-diisopropylchlorophosphoramidite (0.22 mL, 1 mmol) is added. After 30 min at 0° C. the reaction is quenched by the addition of methanol (0.5 mL). The volatiles are removed in vacuo and the residue is purified by FLC to give compound 8.

    Compound 9

    (24) A solution of compound 5 (318 mg, 1 mmol) in dioxane (10 mL) and aqueous sodium hydroxide solution (1 N, 2 mL, 2 mmol) is heated under reflux for 2 h. After concentration in vacuo, the residue is partitioned between EtOAc and saturated aqueous NaHCO.sub.3 solution. The organic phase is dried (Na.sub.2SO.sub.4) and concentrated. The residue is purified by FLC to give compound 9.

    Compound 10

    (25) Compound 9 (300 mg, 1 mmol) is dissolved in methanolic ammonia (7 N, 10 mL, 70 mmol). The solution is heated in a pressure vessel at 170° C. for 20 h. The volatiles are removed in vacuo and the residue is purified by FLC to give compound 10.

    Compound 11

    (26) Compound 11 is prepared from compound 10 following standard procedures for the preparation of nucleoside phosphoramidites:

    (27) A solution of compound 10 (420 mg, 1.5 mmol) and N,N-dimethylformamide dimethyl acetal (1.33 mL, 10 mmol) in methanol (10 mL) is heated under reflux for 2 h. The volatiles are removed in vacuo, and the residue is purified by FLC.

    (28) A solution of this intermediate (335 mg, 1 mmol), 4,4′-dimethoxytrityl chloride (407 mg, 1.2 mmol) and 4-(dimethylamino)pyridine (12 mg, 0.1 mmol) in pyridine (10 mL) is stirred at room temperature for 4 h. The reaction is quenched by the addition of methanol (0.5 mL). The volatiles are removed in vacuo and the residue is purified by FLC.

    (29) This intermediate (510 mg, 0.8 mmol) is dissolved in dichloromethane (8 mL), and N,N-diisopropylethylamine (0.28 mL, 1.6 mmol) is added. The solution is cooled to 0° C. and 2-cyanoethyl N,N-diisopropylchlorophosphoramidite (0.22 mL, 1 mmol) is added. After 30 min at 0° C. the reaction is quenched by the addition of methanol (0.5 mL). The volatiles are removed in vacuo and the residue is purified by FLC to give compound 11.