COMPOSITIONS AND METHODS RELATED TO TETHERED KETHOXAL DERIVATIVES

20220143198 · 2022-05-12

Assignee

Inventors

Cpc classification

International classification

Abstract

Embodiments are directed to therapeutic, diagnostic, or functional complexes comprising a kethoxal derivative.

Claims

1. A kethoxal complex comprising an agent coupled to a kethoxal derivative having a general formula of Formula I: ##STR00094## wherein E is a reactive functional group selected from alkynes, azides, strained alkynes, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, and alkenes; D is optionally a linker or a direct bond; R is a connecting group; A one or two substituents selected from H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, CH.sub.3, alkyl group, or combinations thereof, or A is a second E moiety selected independent of the first E moiety; and G is H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, CH.sub.3, or an alkyl group.

2. The kethoxal complex of claim 1, wherein E is selected from a substituted alkyl, heteroalkyl, substituted heteroalkyl, heteroaryl, or substituted heteroalkyl. In some aspects, E can be a substituted or unsubstituted phenol, substituted or unsubstituted thiophenol, substituted or unsubstituted aniline, substituted or unsubstituted tetrazole, substituted or unsubstituted tetrazine, substituted or unsubstituted SPh, substituted or unsubstituted diazirine, substituted or unsubstituted benzophenone, substituted or unsubstituted nitrone, substituted or unsubstituted nitrile oxide, substituted or unsubstituted norbornene, substituted or unsubstituted nitrile, substituted or unsubstituted isocyanide, substituted or unsubstituted quadricyclane, substituted or unsubstituted alkyne, substituted or unsubstituted azide, substituted or unsubstituted strained alkyne, substituted or unsubstituted diene, substituted or unsubstituted dienophile, substituted or unsubstituted alkoxyamine, substituted or unsubstituted carbonyl, substituted or unsubstituted phosphine, substituted or unsubstituted hydrazide, substituted or unsubstituted thiol, or substituted or unsubstituted alkene.

3. The kethoxal complex of claim 1 or 2, wherein D is a linker selected from one or more of an ester, amide, tetrazine, tetrazole, triazine, triazole, aryl groups, heterocycle, sulfonamide, a substituted or unsubstituted —(CH.sub.2).sub.n— where n is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —O(CH.sub.2).sub.m— where m is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —NR.sup.5— where R.sup.5 is H or alkyl such as methyl; —NR.sup.6CO(CH.sub.2).sub.j— where j is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.6 is H or alkyl such as methyl; or —O(CH.sub.2).sub.kR.sup.6— where k is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.11 is alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, heteroalkyl, substituted heteroalkyl, aryl, substituted aryl, heteroaryl, or substituted heteroaryl. D can be —N(CH.sub.3)—, —OCH.sub.2—, —N(CH.sub.3)COCH.sub.2—, or ##STR00095##

4. The kethoxal complex of claim 3, wherein the linker is a concatamer of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more of the linkers.

5. The kethoxal complex of any one of claims 1 to 3, wherein R is selected from a substituted or unsubstituted carbon, nitrogen, aryl, alkylaryl, or heterocycle.

6. The kethoxal complex of any one of claims 1 to 5, wherein G is H; R is C; A is CH.sub.3; D is —OCH.sub.2CH.sub.2-triazole-pyridine-aryl-amide-CH.sub.2CH.sub.2, and E is N.sub.3 (azide); (ii) G is H; R is C, A is F, D is —OCH.sub.2CH.sub.2-triazole-amide-benzoimidazole-phenyl-NHCO—CH.sub.2CH.sub.2, and E is alkyne; (iii) G is H, R is C, A is a di-fluoro substituent of R, D is —OCH.sub.2CH.sub.2-triazole-CH.sub.2-pyridine-benzoimidazole-NHCO—CH.sub.2CH.sub.2CH.sub.2—, and E is N.sub.3 (azide); (iv) G is H, R is C, A is methyl, D is —OCH.sub.2CH.sub.2-triazole-, and E is phenol or diphenol.

7. The kethoxal complex of claim 1, wherein the kethoxal complex is selected from 3-azido-2-oxopropanal, 3-azido-2-oxobutanal, 3-azido-3-fluoro-2-oxopropanal, 2-oxo-6-(2-oxohexahydro-1H-thieno[3,4-d]imidazol-4-yl)hexanal, 2-((1S,4S)-bicyclo[2.2.1]hept-5-en-2-yl)-2-oxoacetaldehyde, 2-oxo-2-phenylacetaldehyde, 2-(3,5-dimethoxyphenyl)-2-oxoacetaldehyde, 2-(4-nitrophenyl)-2-oxoacetaldehyde, N-(2,3-dioxopropyl)-N-methyl-5-(2-oxohexahydro-1H-thieno[3,4-d]imidazol-4-yl)pentanamide, N-((1-(2-((3,4-dioxobutan-2-yl)oxy)ethyl)-1H-1,2,3-triazol-4-yl)methyl)-5-(2-oxohexahydro-1H-thieno[3,4-d]imidazol-4-yl)pentanamide, 2-oxo-3-(prop-2-yn-1-yloxy)butanal, (E)-3-(2-(cyclooct-4-en-1-ylamino)ethoxy)-2-oxobutanal, 3-(2-azidoethoxy)-2-oxopropanal, 3,4-dioxobutan-2-yl 2-azidoacetate, 3-(2-azidoethoxy)-3-methyl-2-oxobutanal, 5-azido-2-oxopentanal, 2-azido-N-(3,4-dioxobutan-2-yl)-N-methylacetamide, 3-(2-azidoethoxy)-2-oxobutanal, 3-(2-azidoethoxy)-3-fluoro-2-oxopropanal, 3-(2-azidoethoxy)-3,3-difluoro-2-oxopropanal, 4-(2-azidoethoxy)-2-oxobutanal, or 3-(((1S,4S)-bicyclo[2.2.1]hept-5-en-2-yl)methoxy)-2-oxobutanal.

8. A kethoxal complex comprising an agent coupled to a kethoxal derivative having a general formula of Formula III: ##STR00096## wherein E is a click chemistry moiety selected from alkynes, azides, strained alkynes, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, and alkenes; and A and G are independently selected from H, CF.sub.3, CF.sub.2H, CFH.sub.2, or CH.sub.3.

9. A kethoxal complex comprising an agent coupled to a kethoxal derivative having a general formula of Formula IV: ##STR00097## wherein A is a substituent selected from H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, or CH.sub.3 or is a linker.

10. A kethoxal complex comprising an agent coupled to a kethoxal derivative having the formula: ##STR00098## wherein E is a click chemistry moiety selected from alkynes, azides, strained alkynes, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, and alkenes; and A is independently selected from H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, or CH.sub.3.

11. A kethoxal complex comprising an agent coupled to a kethoxal derivative having the formula: ##STR00099## wherein A is hydrogen or methyl; D is a linker; and E is reactive functional group.

12. The kethoxal complex of claim 11, wherein D is a substituted or unsubstituted —(CH.sub.2).sub.n— where n is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —O(CH.sub.2).sub.m— where m is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —NR.sup.5— where R.sup.5 is H or alkyl such as methyl; —NR.sup.6CO(CH.sub.2).sub.j— where j is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.6 is H or alkyl such as methyl; or —O(CH.sub.2).sub.kR.sup.6— where k is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.6 is alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, heteroalkyl, substituted heteroalkyl, aryl, substituted aryl, heteroaryl, or substituted heteroarylaryl.

13. The kethoxal complex of claim 11, wherein D is substituted with a reactive group.

14. The kethoxal complex of claim 13, wherein the reactive group is a click chemistry moiety.

15. The kethoxal complex of claim 11, wherein D is —N(CH.sub.3)—, —OCH.sub.2—, —N(CH.sub.3)COCH.sub.2—, or a group having the chemical formula of Formula VII, ##STR00100##

16. The kethoxal complex of any one of claims 1 to 15, wherein the agent binds directly or indirectly to a nucleic acid in vivo, ex vivo and/or in vitro.

17. The kethoxal complex of any one of claims 1 to 16, wherein the agent is a therapeutic, diagnostic, or functional agent.

18. The kethoxal complex of claim 17, wherein the therapeutic agent is a small molecule.

19. The kethoxal complex of claim 18, wherein the small molecule binds to a protein or a nucleic acid.

20. The kethoxal complex of any one of claims 1 to 17, wherein the agent is a therapeutic nucleic acid.

21. The kethoxal complex of claim 20, wherein the therapeutic nucleic acid is an inhibitory nucleic acid.

22. The kethoxal complex of claim 20, wherein the inhibitory nucleic acid is an siRNA.

23. The kethoxal complex of claim 1, wherein the kethoxal derivative is N.sub.3-kethoxal.

24. A method for localizing an agent to a nucleic acid comprising contacting a cell or an extracellular nucleic acid with a kethoxal complex of any one of claims 1 to 23.

25. The method of claim 24, wherein the agent is a therapeutic agent.

26. A method for localizing a therapeutic agent in a cell comprising: (i) contacting a target cell with a kethoxal complex of any one of claims 1 to 16 to form a treated cell; and (ii) coupling the therapeutic agent to a nucleic acid through a kethoxal derivative-coupled guanine base(s).

27. A kethoxal derivative of Formula VI ##STR00101## wherein A is H or methyl, D is a linker or a direct bond; and wherein E is a substituted or unsubstituted phenol, substituted or unsubstituted thiophenol, substituted or unsubstituted aniline, substituted or unsubstituted tetrazole, substituted or unsubstituted tetrazine, substituted or unsubstituted SPh, substituted or unsubstituted diazirine, substituted or unsubstituted benzophenone, substituted or unsubstituted nitrone, substituted or unsubstituted nitrile oxide, substituted or unsubstituted norbornene, substituted or unsubstituted nitrile, substituted or unsubstituted isocyanide, substituted or unsubstituted quadricyclane, substituted or unsubstituted alkyne, substituted or unsubstituted azide, substituted or unsubstituted strained alkyne, substituted or unsubstituted diene, substituted or unsubstituted dienophile, substituted or unsubstituted alkoxyamine, substituted or unsubstituted carbonyl, substituted or unsubstituted phosphine, substituted or unsubstituted hydrazide, substituted or unsubstituted thiol, or substituted or unsubstituted alkene.

28. The kethoxal derivative of claim 27, wherein D is —(CR.sup.5H).sub.n— where n is 1-10 and R.sup.5 is H or alkyl such as methyl; —O(CR.sup.6H).sub.m— where m is 1-10 and R.sup.6 is H or alkyl such as methyl; —NR.sup.7— where R.sup.7 is H or alkyl such as methyl; —NR.sup.8CO(CR.sup.9H).sub.j— where j is 1-10 and R.sup.8 and R.sup.9 are independently H or alkyl such as methyl; or —O(CR.sup.10H).sub.kR.sup.11— where k is 1-10 and R.sup.10 is H or alkyl such as methyl and R.sup.11 is alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, heteroalkyl, substituted heteroalkyl, aryl, substituted aryl, heteroaryl, or substituted heteroarylaryl.

29. The kethoxal derivative of claim 27, wherein E further comprises a detectable label.

30. The kethoxal derivative of claim 29, wherein the detectable label is a drug, a toxin, a peptide, a polypeptide, an epitope tag, a member of a specific binding pair, a fluorophore, a solid support, a nucleic acid (DNA/RNA), a lipid, or a carbohydrate.

31. The kethoxal derivative of claim 27, wherein E further comprises an affinity group.

32. The kethoxal derivative of claim 31, wherein the affinity group is biotin.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0065] The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of the specification embodiments presented herein.

[0066] FIG. 1A-F: N.sub.3-kethoxal and experimental evaluation of its selectivity, cell permeability and reversibility. (a) The structure of N.sub.3-kethoxal and the reaction with guanine. (b) Denaturing gel electrophoresis demonstrating N.sub.3-kethoxal only react with single-strand RNA (ssRNA). (c) Mass spectrum analysis of RNA oligos react with N.sub.3-kethoxal. In RNA 1 with four guanines, all guanines and only guanine were labelled by N.sub.3-kethoxal. In RNA 2 without guanine, no N.sub.3-kethoxal labelling was observed. (d) Upper: Denaturing gel electrophoresis analysis of the labelling reaction of kethoxal and N.sub.3-kethoxal with FAM-RNA oligo (5′-FAM-GAGCAGCUUUAGUUUAGAUCGAGUGUA (SEQ ID NO:3, lane 1-3) and biotinylation with biotin-DBCO (lane 5, 6). Only N.sub.3-kethoxal labelled RNA can be biotinylated (lane 6). Bottom: Dot blot of RNA after labelling and Biotinylation reactions. Methylene blue dot results are listed as control. (e) Dot blot of isolated total RNA from mES cells which were treated by N.sub.3-kethoxal with different periods, 1, 5, 10, 15, 20 mins. (f) Dot blot analysis of reversibility of N.sub.3-kethoxal labelled mRNA in present of 50 mM GTP at 95° C. The N.sub.3-kethoxal modification in mRNA was removed thoroughly after 10 mins incubation.

[0067] FIG. 2A-B. Examples of groups having chemical formula of Formula VIII (A) and kethoxal derivatives having chemical formula of Formula VI (B) are illustrated. R in FIG. 2 represent an agent coupled to the kethoxal derivative.

[0068] FIG. 3. Labeling activity of phenol-kethoxal and diphenol-kethoxal, the two compounds were incubated with a 12-mer synthetic RNA oligo containing four guanine bases, respectively. After 10 min, the reactions were cleaned-up and analyzed by MALDI-TOF.

[0069] FIG. 4. The cell permeability of phenol-kethoxal and diphenol-kethoxal was tested. Cells were treated with phenol-kethoxal and diphenol-kethoxal for 10 min, respectively, and RNA isolated from treated cells. An in vitro biotinylation reaction was performed by mixing these kethoxal derivative-labeled RNAs with biotin-phenol, horseradish peroxidase (HRP), and H.sub.2O.sub.2.

[0070] FIG. 5. Examples of conjugates are illustrated.

[0071] FIG. 6. Illustrates the general description of parent compound in Formula I.

[0072] FIG. 7. Illustrates non-limiting examples of Formula I.

[0073] FIG. 8A-8F. Tables illustrating various non-limiting examples of Formula I.

[0074] FIG. 9A-B. Example of LCMS results to follow relative amount of free guanosine.

DETAILED DESCRIPTION OF THE INVENTION

[0075] Chemical labeling of nucleic acids is extremely useful for a range of applications such as probing nucleic acid structure, nucleic acid location, nucleic acid proximity information, transcription and translation. Typical labeling strategies include metabolic labeling. Coupling or tethering moieties to nucleic acids is contemplated as an anchor or tether for therapeutic or diagnostic agents to a location to which the moieties bind or associates. Certain embodiments are directed to the development of kethoxal derivatives (e.g., N.sub.3-kethoxal) as a tethering agent.

[0076] Current methods do not specifically localize inhibitors and/or covalently lock the inhibitor in place. Embodiments described herein include an entity that localizes to a binding site and can be covalently linked at that site, e.g., tethering an inhibitory RNA to its target. Methods and compositions localize an agent to the proximity of specific target via a kethoxal derivative.

[0077] An appropriate localization signal in the form of a kethoxal derivative can be tethered to the therapeutic agent to cause it to be precisely located or fixed to or in the vicinity of its target or binding partner. Such localization anchors identify a target uniquely, or distinguish the target from a majority of incorrect targets. For example, RNA-based inhibitors of viral replication can be tethered to the target RNA. In addition, an inhibitor of a transcription complex can be locked in place altering the on/off kinetics of the inhibitor and blocking the transcription site.

[0078] Aspects include methods for enhancing the effect of a therapeutic agent in vivo. The method includes the step of causing the agent to be localized in vivo with or in the vicinity of its target.

[0079] By “enhancing” the effect of a therapeutic agent in vivo is meant that a localization anchor targets an agent to a specific site within a cell and thereby causes that agent to act more efficiently. Thus, a lower concentration of agent administered to a cell in vivo can have an equal effect to a larger concentration of non-localized agent. Such increased efficiency of the targeted or localized agent can be measured by any standard procedure well-known to those of ordinary skill in the art. In general, the effect of the agent is enhanced by placing and/or maintaining the agent in a closer proximity with the target, so that it may have its desired effect on that target.

[0080] In other aspects, the invention features methods for enhancing the effect of nucleic acid-based therapeutic agents in vivo by colocalizing or anchoring them with their target using an appropriate localization anchor.

A. Kethoxal Derivative Anchor

[0081] Kethoxal derivative anchors enable the covalent attachment of an agent to its binding target or another entity in the vicinity. The “click” chemistry can be controlled by light, so as to achieve site-specific modification in live cells.

[0082] As described herein, N.sub.3-kethoxal (representative of kethoxal derivatives) is shown to react selectively with guanines at single-stranded DNA and RNA. These reactions are highly efficient under mild normal cell culture conditions, and could be directly applied to tissues. Any chemical moiety can be installed on a kethoxal derivative using the methods described herein. Of particular use according to some aspects of this invention are click chemistry handles. Click chemistry handles are chemical moieties that provide a reactive group that can partake in a click chemistry reaction. Click chemistry reactions and suitable chemical groups for click chemistry reactions are well known to those of skill in the art, and include, but are not limited to terminal alkynes, azides, strained alkynes, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, and alkenes. For example, in some embodiments, an azide and an alkyne are used in a click chemistry reaction. In certain aspects, the “click-chemistry compatible” compounds or click chemistry handles include a terminal azide functional group (e.g., Formula I).

##STR00010##

[0083] In certain aspects, compounds have a general formula of Formula I and Formula II where E is selected from a reactive group, click chemistry moiety, binding group, or therapeutic agent; D is optionally a linker or a direct bond; R is a connecting element or group; A is a substituent or a second E moiety selected independent of the first E moiety; and G is a dicarbonyl-defining group.

[0084] In certain aspects, R can be selected from substituted or unsubstituted carbon, nitrogen, aryl, alkylaryl, or heterocyclic group.

[0085] In certain aspects, A can be substituted with one or more (mono-substituted, di-substituted, etc.) of H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, CH.sub.3, alkyl group, or combinations thereof. In certain aspects, A can be mono- or di-substituted with a linker. In certain aspects, A can be mono- or di-substituted with a reactive group, e.g., a click chemistry moiety, therapeutic agent, or binding moiety.

[0086] In certain aspects, D is a linker selected from an ester, amide, tetrazine, tetrazole, triazine, triazole, aryl groups, heterocycle, sulfonamide, a substituted or unsubstituted (CH.sub.2).sub.n— where n is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —O(CH.sub.2).sub.m— where m is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —NR.sup.5— where R.sup.5 is H or alkyl such as methyl; —NR.sup.6CO(CH.sub.2).sub.j— where j is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.6 is H or alkyl such as methyl; or —O(CH.sub.2).sub.kR.sup.6— where k is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.11 is alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, heteroalkyl, substituted heteroalkyl, aryl, substituted aryl, heteroaryl, or substituted heteroaryl. D can be —N(CH.sub.3)—, —OCH.sub.2—, —N(CH.sub.3)COCH.sub.2—, or a group having the chemical formula of Formula VII. In certain instances, the linker can be a concatamer (comprising 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more linker(s)) of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more of the linkers described above.

##STR00011##

In some aspects, D can be substituted with a reactive group, e.g., a click chemistry moiety. In some aspects, D can be a direct bond between E and the carbon atom binding A. In certain aspects, D can be a substituent that modulates the stability of the product formed, including alkoxy groups, ethers, carbonyls, aryl groups, electron withdrawing or electron donating groups, electrophilic of nucleophilic centers, or H-bond acceptors.

[0087] In certain aspects, G can be independently selected from H, CF.sub.3, CF.sub.2H, CFH.sub.2, CH.sub.3, or alkyl group.

[0088] In certain aspects, E can be selected from alkynes, azides, strained alkynes, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, alkenes, diazirines. In some aspects, E can be a substituted alkyl, heteroalkyl, substituted heteroalkyl, heteroaryl, or substituted heteroalkyl. In some aspects, E can be a substituted or unsubstituted phenol, substituted or unsubstituted thiophenol, substituted or unsubstituted aniline, substituted or unsubstituted tetrazole, substituted or unsubstituted tetrazine, substituted or unsubstituted SPh, substituted or unsubstituted diazirine, substituted or unsubstituted benzophenone, substituted or unsubstituted nitrone, substituted or unsubstituted nitrile oxide, substituted or unsubstituted norbornene, substituted or unsubstituted nitrile, substituted or unsubstituted isocyanide, substituted or unsubstituted quadricyclane, substituted or unsubstituted alkyne, substituted or unsubstituted azide, substituted or unsubstituted strained alkyne, substituted or unsubstituted diene, substituted or unsubstituted dienophile, substituted or unsubstituted alkoxyamine, substituted or unsubstituted carbonyl, substituted or unsubstituted phosphine, substituted or unsubstituted hydrazide, substituted or unsubstituted thiol, or substituted or unsubstituted alkene. In certain aspects, E is a click chemistry compatible reactive group selected from protected thiol, alkene (including trans-cyclooctene [TCO]) and tetrazine inverse-demand Diels-Alder, tetrazole photoclick reaction, vinyl thioether alkynes, azides, strained alkynes, diazrines, dienes, dieneophiles, alkoxyamines, carbonyls, phosphines, hydrazides, thiols, and alkenes. In certain aspects, E can be further coupled to an agent or binding moiety. In certain aspects the agent or binding moiety binds directly or indirectly to a target (protein or nucleic acid) in vivo, ex vivo or in vitro. In certain aspects the agent or binding moiety binds directly or indirectly to a target (protein or nucleic acid) in vivo.

[0089] In certain embodiments, kethoxal derivatives can be coupled to a variety of nucleic acids and/or small molecules (forming a kethoxal complex) that either binds and inhibits specific RNA, or to DNA or RNA reagents that bind or target RNA or DNA (such as antisense or guide RNA of CRISPR). The kethoxal component can serve to covalently lock the nucleic acid or small molecule complex. The same approach can be applied to target protein-RNA or protein-ssDNA interaction. A peptide or small molecule could bind a protein, RNA-binding protein or bind to the interface of RNA-protein interaction and the kethoxal derivative can covalently lock the inhibition.

##STR00012##

[0090] In certain aspects, N.sub.3-kethoxal or kethoxal derivatives of Formula III or Formula IV or Formula V can be incorporated into an agent (e.g., small molecules) developed to target RNA or protein-RNA interface to enable a covalent inhibition. The kethoxal component of Formula III can react with guanines in single stranded nucleic acids to form a covalent linkage. In certain aspects the G and/or A substitution on Formula III can be independently varied to tune various properties of the kethoxal component. In certain aspects, A or G can be independently selected from H, F, CF.sub.3, CF.sub.2H, CFH.sub.2, or alkyl group. For instance fluoride substitutions can be used to modulate reactivity. In certain aspects, A is a substituent or a second E moiety selected independent of the first E moiety. The modified kethoxal component could be less reactive and more specific. It could also be reversible. In certain aspects, A in Formula I, Formula III, Formula IV, Formula V, can be a substituent that modulates the stability of the product formed, selected from alkoxy groups, ethers, carbonyls, aryl groups, electron withdrawing or electron donating groups, or H-bond acceptors. The A and/or E substitutions of Formula III, Formula IV, or Formula V can be a linker that can be connected with RNA-targeting molecules. In certain aspects, the linker can be a substituent that modulates the stability of the product formed, selected from alkoxy groups, ethers, carbonyls, aryl groups, electron withdrawing or electron donating groups, or H-bond acceptors. Kethoxal derivatives can serve as a warhead to covalently lock the inhibition of the RNA-targeting molecule. “Warhead moiety” or “warhead” refers to a moiety of an inhibitor which participates, either reversibly or irreversibly, with the reaction of a donor, e.g., a protein, with a substrate. Warheads may, for example, form covalent bonds with the donor, or may create stable transition states, or be a reversible or an irreversible alkylating agent. For example, the warhead moiety can be a functional group on an inhibitor that can participate in a bond-forming reaction, wherein a new covalent bond is formed between a portion of the warhead and a donor, for example an amino acid residue of a protein. In embodiments, the warhead is an electrophile and the “donor” is a nucleophile such as the side chain of a cysteine residue. When A or E is a linker it can be connected or covalently coupled to a small molecule that binds an RNA-binding protein or binds to the interface of protein-RNA interaction. Compounds of Formula III or Formula IV or Formula V serve to covalently attached to a target (e.g., an RNA or protein) and lock the inhibition of a RNA, or a protein or protein/RNA complex. A and E can be connected to other DNA, RNA or molecules that sequence-specifically recognize RNA or ssDNA, an example is CRISPR guide RNA or any antisense developed to target RNA.

##STR00013##

[0091] Formula IV is an example for molecules included in Formula III. The presence of N.sub.3 makes Formula IV a candidate to be linked to fragment libraries that carry an alkyne. Formula IV can covalently target ssRNA and the N.sub.3-alkyne click chemistry can be used to connect RNA- or protein-targeting small molecules with Formula IV. Click chemistry can be any chemical functional groups. Linker can be any and the length can be varied or adjusted. Kethoxal can be incorporated into small molecules developed to target ssDNA or protein-ssDNA interface to enable a covalent inhibition. In certain aspects, A is a substituent or a second E moiety selected independent of the first E moiety.

##STR00014##

[0092] Formula V is an example for kethoxal derivative that can be rendered more electron rich and less reactive by substituting a CH.sub.2 group with —SO.sub.2—, in order to reduce reactivity and be potentially reversible. In certain aspects, A is a substituent or a second E moiety selected independent of the first E moiety.

##STR00015##

[0093] In certain aspects, a kethoxal derivative can have the general formula of Formula VI, wherein A can be hydrogen or methyl; D is optionally a linker or a direct bond; and E can be a be a reactive functional group. In certain aspects, A is a substituent or a second E moiety selected independent of the first E moiety. In some aspects, D can be a substituted or unsubstituted —(CH.sub.2).sub.n— where n is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —O(CH.sub.2).sub.m— where m is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions; —NR.sup.5— where R.sup.5 is H or alkyl such as methyl; —NR.sup.6CO(CH.sub.2).sub.j— where j is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.6 is H or alkyl such as methyl; or —O(CH.sub.2).sub.kR.sup.6— where k is 1-10 with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 methyl substitutions and R.sup.11 is alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, heteroalkyl, substituted heteroalkyl, aryl, substituted aryl, heteroaryl, or substituted heteroaryl. In some aspects, D can be substituted with a reactive group, e.g., a click chemistry moiety. In some aspects, D can be —N(CH.sub.3)—, —OCH.sub.2—, —N(CH.sub.3)COCH.sub.2—, or a group having the chemical formula of Formula VII. In certain instances, the linker can be a concatamer (comprising 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more linker(s)) of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more of the linkers described above.

##STR00016##

[0094] In some aspects, D can be a direct bond between E and the carbon atom binding A. In some aspects, E can be substituted alkyl, heteroalkyl, substituted heteroalkyl, heteroaryl, or substituted heteroalkyl. In some aspects E can be a click chemistry moiety. In some aspects, E can be substituted or unsubstituted phenol, substituted or unsubstituted thiophenol, substituted or unsubstituted aniline, substituted or unsubstituted tetrazole, substituted or unsubstituted tetrazine, substituted or unsubstituted SPh, substituted or unsubstituted diazirine, substituted or unsubstituted benzophenone, substituted or unsubstituted nitrone, substituted or unsubstituted nitrile oxide, substituted or unsubstituted norbornene, substituted or unsubstituted nitrile, substituted or unsubstituted isocyanide, substituted or unsubstituted quadricyclane, substituted or unsubstituted alkyne, substituted or unsubstituted azide, substituted or unsubstituted strained alkyne, substituted or unsubstituted diene, substituted or unsubstituted dienophile, substituted or unsubstituted alkoxyamine, substituted or unsubstituted carbonyl, substituted or unsubstituted phosphine, substituted or unsubstituted hydrazide, substituted or unsubstituted thiol, or substituted or unsubstituted alkene.

[0095] In certain instances kethoxal derivatives are hydrated in aqueous solutions.

##STR00017##

[0096] All derivatives described above may also be in hydrated forms.

[0097] In certain instances of Formulas I-VII, D, A, or A and D can be stabilization-modulating substituents. Most specifically, a H-Bond acceptor group can be added to D or A to allow it to hydrogen bond to amine-hydrogens on guanine when the kethoxal derivative reacts with guanine. With respect to A, fluoro and like groups can be used to affect reversibility.

[0098] Kethoxal derivatives fused with or further coupled with therapeutic ligands, e.g kethoxal conjugates are represented in Formula IX.

##STR00018##

[0099] Wherein A, D and E are as defined above. In certain aspects, Z is a therapeutic agent. In some aspects, E or Z can also be any therapeutic macromolecule such as peptides, proteins, antibodies, or a ligand recognized by a therapeutic biomolecule, etc.; or a delivery vehicle such as nanoparticles, receptors, hydrogels, etc. Examples of kethoxal conjugates are illustrated in FIG. 5.

[0100] Definitions of specific functional groups and chemical terms are described in more detail below. For purposes of this invention, the chemical elements are identified in accordance with the Periodic Table of the Elements, CAS version, Handbook of Chemistry and Physics, 75th Ed., inside cover, and specific functional groups are generally defined as described therein. Additionally, general principles of organic chemistry, as well as specific functional moieties and reactivity, are described in Organic Chemistry, Thomas Sorrell, University Science Books, Sausalito, 1999; Smith and March March's Advanced Organic Chemistry, 5th Edition, John Wiley & Sons, Inc., New York, 2001; Larock, Comprehensive Organic Transformations, VCH Publishers, Inc., New York, 1989; Carruthers, Some Modern Methods of Organic Synthesis, 3rd Edition, Cambridge University Press, Cambridge, 1987.

[0101] The term “aliphatic,” as used herein, includes both saturated and unsaturated, nonaromatic, straight chain (i.e., unbranched), branched, acyclic, and cyclic (i.e., carbocyclic) hydrocarbons, which are optionally substituted with one or more functional groups. As will be appreciated by one of ordinary skill in the art, “aliphatic” is intended herein to include, but is not limited to, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, and cycloalkynyl moieties. Thus, as used herein, the term “alkyl” includes straight, branched and cyclic alkyl groups. An analogous convention applies to other generic terms such as “alkenyl,” “alkynyl,” and the like. Furthermore, as used herein, the terms “alkyl,” “alkenyl,” “alkynyl,” and the like encompass both substituted and unsubstituted groups. In certain embodiments, as used herein, “aliphatic” is used to indicate those aliphatic groups (cyclic, acyclic, substituted, unsubstituted, branched or unbranched) having 1-20 carbon atoms (C1-20 aliphatic). In certain embodiments, the aliphatic group has 1-10 carbon atoms (C1-10 aliphatic). In certain embodiments, the aliphatic group has 1-6 carbon atoms (C1-6 aliphatic). In certain embodiments, the aliphatic group has 1-5 carbon atoms (C1-5 aliphatic). In certain embodiments, the aliphatic group has 1-4 carbon atoms (C1-4 aliphatic). In certain embodiments, the aliphatic group has 1-3 carbon atoms (C1-3 aliphatic). In certain embodiments, the aliphatic group has 1-2 carbon atoms (C1-2 aliphatic). Aliphatic group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0102] The term “alkyl,” as used herein, refers to saturated, straight- or branched-chain hydrocarbon radicals derived from a hydrocarbon moiety containing between one and twenty carbon atoms by removal of a single hydrogen atom. In some embodiments, the alkyl group employed in the invention contains 1-20 carbon atoms (C1-20alkyl). In another embodiment, the alkyl group employed contains 1-15 carbon atoms (C1-15alkyl). In another embodiment, the alkyl group employed contains 1-10 carbon atoms (C1-10alkyl). In another embodiment, the alkyl group employed contains 1-8 carbon atoms (C1-8alkyl). In another embodiment, the alkyl group employed contains 1-6 carbon atoms (C1-6alkyl). In another embodiment, the alkyl group employed contains 1-5 carbon atoms (C1-5alkyl). In another embodiment, the alkyl group employed contains 1-4 carbon atoms (C1-4alkyl). In another embodiment, the alkyl group employed contains 1-3 carbon atoms (C1-3alkyl). In another embodiment, the alkyl group employed contains 1-2 carbon atoms (C1-2alkyl). Examples of alkyl radicals include, but are not limited to, methyl, ethyl, n-propyl, isopropyl, n-butyl, iso-butyl, sec-butyl, sec-pentyl, iso-pentyl, tert-butyl, n-pentyl, neopentyl, n-hexyl, sec-hexyl, n-heptyl, n-octyl, n-decyl, n-undecyl, dodecyl, and the like, which may bear one or more substituents. Alkyl group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0103] The term “alkylaryl” refers to a radical containing both aliphatic and aromatic structures, an aryl group bonded directly to an alkyl group.

[0104] The term “alkylene,” as used herein, refers to a biradical derived from an alkyl group, as defined herein, by removal of two hydrogen atoms. Alkylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted. Alkylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0105] The term “alkenyl,” as used herein, denotes a monovalent group derived from a straight- or branched-chain hydrocarbon moiety having at least one carbon-carbon double bond by the removal of a single hydrogen atom. In certain embodiments, the alkenyl group employed in the invention contains 2-20 carbon atoms (C2-20alkenyl). In some embodiments, the alkenyl group employed in the invention contains 2-15 carbon atoms (C2-15alkenyl). In another embodiment, the alkenyl group employed contains 2-10 carbon atoms (C2-10alkenyl). In still other embodiments, the alkenyl group contains 2-8 carbon atoms (C2-8alkenyl). In yet other embodiments, the alkenyl group contains 2-6 carbons (C2-6alkenyl). In yet other embodiments, the alkenyl group contains 2-5 carbons (C2-5alkenyl). In yet other embodiments, the alkenyl group contains 2-4 carbons (C2-4alkenyl). In yet other embodiments, the alkenyl group contains 2-3 carbons (C2-3alkenyl). In yet other embodiments, the alkenyl group contains 2 carbons (C2alkenyl). Alkenyl groups include, for example, ethenyl, propenyl, butenyl, 1-methyl-2-buten-1-yl, and the like, which may bear one or more substituents. Alkenyl group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety. The term “alkenylene,” as used herein, refers to a biradical derived from an alkenyl group, as defined herein, by removal of two hydrogen atoms. Alkenylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted. Alkenylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0106] The term “alkynyl,” as used herein, refers to a monovalent group derived from a straight- or branched-chain hydrocarbon having at least one carbon-carbon triple bond by the removal of a single hydrogen atom. In certain embodiments, the alkynyl group employed in the invention contains 2-20 carbon atoms (C2-20alkynyl). In some embodiments, the alkynyl group employed in the invention contains 2-15 carbon atoms (C2-15alkynyl). In another embodiment, the alkynyl group employed contains 2-10 carbon atoms (C2-10alkynyl). In still other embodiments, the alkynyl group contains 2-8 carbon atoms (C2-8alkynyl). In still other embodiments, the alkynyl group contains 2-6 carbon atoms (C2-6alkynyl). In still other embodiments, the alkynyl group contains 2-5 carbon atoms (C2-5alkynyl). In still other embodiments, the alkynyl group contains 2-4 carbon atoms (C2-4alkynyl). In still other embodiments, the alkynyl group contains 2-3 carbon atoms (C2-3alkynyl). In still other embodiments, the alkynyl group contains 2 carbon atoms (C2alkynyl). Representative alkynyl groups include, but are not limited to, ethynyl, 2-propynyl (propargyl), 1-propynyl, and the like, which may bear one or more substituents. Alkynyl group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety. The term “alkynylene,” as used herein, refers to a biradical derived from an alkynylene group, as defined herein, by removal of two hydrogen atoms. Alkynylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted. Alkynylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0107] The term “carbocyclic” or “carbocyclyl” as used herein, refers to an as used herein, refers to a cyclic aliphatic group containing 3-10 carbon ring atoms (C3-10carbocyclic). Carbocyclic group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0108] The term “heteroaliphatic,” as used herein, refers to an aliphatic moiety, as defined herein, which includes both saturated and unsaturated, nonaromatic, straight chain (i.e., unbranched), branched, acyclic, cyclic (i.e., heterocyclic), or polycyclic hydrocarbons, which are optionally substituted with one or more functional groups, and that further contains one or more heteroatoms (e.g., oxygen, sulfur, nitrogen, phosphorus, or silicon atoms) between carbon atoms. In certain embodiments, heteroaliphatic moieties are substituted by independent replacement of one or more of the hydrogen atoms thereon with one or more substituents. As will be appreciated by one of ordinary skill in the art, “heteroaliphatic” is intended herein to include, but is not limited to, heteroalkyl, heteroalkenyl, heteroalkynyl, heterocycloalkyl, heterocycloalkenyl, and heterocycloalkynyl moieties. Thus, the term “heteroaliphatic” includes the terms “heteroalkyl,” “heteroalkenyl,” “heteroalkynyl,” and the like. Furthermore, as used herein, the terms “heteroalkyl,” “heteroalkenyl,” “heteroalkynyl,” and the like encompass both substituted and unsubstituted groups. In certain embodiments, as used herein, “heteroaliphatic” is used to indicate those heteroaliphatic groups (cyclic, acyclic, substituted, unsubstituted, branched or unbranched) having 1-20 carbon atoms and 1-6 heteroatoms (C1-20heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1-10 carbon atoms and 1-4 heteroatoms (C1-10heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1-6 carbon atoms and 1-3 heteroatoms (C1-6heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1-5 carbon atoms and 1-3 heteroatoms (C1-5heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1˜4 carbon atoms and 1-2 heteroatoms (C1-4heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1-3 carbon atoms and 1 heteroatom (C1-3heteroaliphatic). In certain embodiments, the heteroaliphatic group contains 1-2 carbon atoms and 1 heteroatom (C1-2heteroaliphatic). Heteroaliphatic group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0109] The term “heteroalkyl,” as used herein, refers to an alkyl moiety, as defined herein, which contain one or more heteroatoms (e.g., oxygen, sulfur, nitrogen, phosphorus, or silicon atoms) in between carbon atoms. In certain embodiments, the heteroalkyl group contains 1-20 carbon atoms and 1-6 heteroatoms (C1-20 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-10 carbon atoms and 1-4 heteroatoms (C1-10 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-6 carbon atoms and 1-3 heteroatoms (C1-6 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-5 carbon atoms and 1-3 heteroatoms (C1-5 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-4 carbon atoms and 1-2 heteroatoms (C1-4 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-3 carbon atoms and 1 heteroatom (C1-3 heteroalkyl). In certain embodiments, the heteroalkyl group contains 1-2 carbon atoms and 1 heteroatom (C1-2 heteroalkyl). The term “heteroalkylene,” as used herein, refers to a biradical derived from an heteroalkyl group, as defined herein, by removal of two hydrogen atoms. Heteroalkylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted. Heteroalkylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0110] The term “heteroalkenyl,” as used herein, refers to an alkenyl moiety, as defined herein, which further contains one or more heteroatoms (e.g., oxygen, sulfur, nitrogen, phosphorus, or silicon atoms) in between carbon atoms. In certain embodiments, the heteroalkenyl group contains 2-20 carbon atoms and 1-6 heteroatoms (C2-20 heteroalkenyl). In certain embodiments, the heteroalkenyl group contains 2-10 carbon atoms and 1-4 heteroatoms (C2-10 heteroalkenyl). In certain embodiments, the heteroalkenyl group contains 2-6 carbon atoms and 1-3 heteroatoms (C2-6 heteroalkenyl). In certain embodiments, the heteroalkenyl group contains 2-5 carbon atoms and 1-3 heteroatoms (C2-5 heteroalkenyl). In certain embodiments, the heteroalkenyl group contains 2-4 carbon atoms and 1-2 heteroatoms (C2-4 heteroalkenyl). In certain embodiments, the heteroalkenyl group contains 2-3 carbon atoms and 1 heteroatom (C2-3 heteroalkenyl). The term “heteroalkenylene,” as used herein, refers to a biradical derived from an heteroalkenyl group, as defined herein, by removal of two hydrogen atoms. Heteroalkenylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted.

[0111] The term “heteroalkynyl,” as used herein, refers to an alkynyl moiety, as defined herein, which further contains one or more heteroatoms (e.g., oxygen, sulfur, nitrogen, phosphorus, or silicon atoms) in between carbon atoms. In certain embodiments, the heteroalkynyl group contains 2-20 carbon atoms and 1-6 heteroatoms (C2-20 heteroalkynyl). In certain embodiments, the heteroalkynyl group contains 2-10 carbon atoms and 1-4 heteroatoms (C2-10 heteroalkynyl). In certain embodiments, the heteroalkynyl group contains 2-6 carbon atoms and 1-3 heteroatoms (C2-6 heteroalkynyl). In certain embodiments, the heteroalkynyl group contains 2-5 carbon atoms and 1-3 heteroatoms (C2-5 heteroalkynyl). In certain embodiments, the heteroalkynyl group contains 2-4 carbon atoms and 1-2 heteroatoms (C2-4 heteroalkynyl). In certain embodiments, the heteroalkynyl group contains 2-3 carbon atoms and 1 heteroatom (C2-3 heteroalkynyl). The term “heteroalkynylene,” as used herein, refers to a biradical derived from an heteroalkynyl group, as defined herein, by removal of two hydrogen atoms. Heteroalkynylene groups may be cyclic or acyclic, branched or unbranched, substituted or unsubstituted.

[0112] The term “heterocyclic,” “heterocycles,” or “heterocyclyl,” as used herein, refers to a cyclic heteroaliphatic group. A heterocyclic group refers to a non-aromatic, partially unsaturated or fully saturated, 3- to 10-membered ring system, which includes single rings of 3 to 8 atoms in size, and bi- and tri-cyclic ring systems which may include aromatic five- or six-membered aryl or heteroaryl groups fused to a non-aromatic ring. These heterocyclic rings include those having from one to three heteroatoms independently selected from oxygen, sulfur, and nitrogen, in which the nitrogen and sulfur heteroatoms may optionally be oxidized and the nitrogen heteroatom may optionally be quaternized. In certain embodiments, the term heterocyclic refers to a non-aromatic 5-, 6-, or 7-membered ring or polycyclic group wherein at least one ring atom is a heteroatom selected from O, S, and N (wherein the nitrogen and sulfur heteroatoms may be optionally oxidized), and the remaining ring atoms are carbon, the radical being joined to the rest of the molecule via any of the ring atoms. Heterocycyl groups include, but are not limited to, a bi- or tri-cyclic group, comprising fused five, six, or seven-membered rings having between one and three heteroatoms independently selected from the oxygen, sulfur, and nitrogen, wherein (i) each 5-membered ring has 0 to 2 double bonds, each 6-membered ring has 0 to 2 double bonds, and each 7-membered ring has 0 to 3 double bonds, (ii) the nitrogen and sulfur heteroatoms may be optionally oxidized, (iii) the nitrogen heteroatom may optionally be quaternized, and (iv) any of the above heterocyclic rings may be fused to an aryl or heteroaryl ring. Exemplary heterocycles include azacyclopropanyl, azacyclobutanyl, 1,3-diazatidinyl, piperidinyl, piperazinyl, azocanyl, thiaranyl, thietanyl, tetrahydrothiophenyl, dithiolanyl, thiacyclohexanyl, oxiranyl, oxetanyl, tetrahydrofuranyl, tetrahydropuranyl, dioxanyl, oxathiolanyl, morpholinyl, thioxanyl, tetrahydronaphthyl, and the like, which may bear one or more substituents. Substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0113] The term “aryl,” as used herein, refers to an aromatic mono- or polycyclic ring system having 3-20 ring atoms, of which all the ring atoms are carbon, and which may be substituted or unsubstituted. In certain embodiments of the present invention, “aryl” refers to a mono, bi, or tricyclic C4-C20 aromatic ring system having one, two, or three aromatic rings which include, but are not limited to, phenyl, biphenyl, naphthyl, and the like, which may bear one or more substituents. Aryl substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety. The term “arylene,” as used herein refers to an aryl biradical derived from an aryl group, as defined herein, by removal of two hydrogen atoms. Arylene groups may be substituted or unsubstituted. Arylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety. Additionally, arylene groups may be incorporated as a linker group into an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, or heteroalkynylene group, as defined herein.

[0114] The term “heteroaryl,” as used herein, refers to an aromatic mono- or polycyclic ring system having 3-20 ring atoms, of which one ring atom is selected from S, O, and N; zero, one, or two ring atoms are additional heteroatoms independently selected from S, O, and N; and the remaining ring atoms are carbon, the radical being joined to the rest of the molecule via any of the ring atoms. Examples of heteroaryls include, but are not limited to pyrrolyl, pyrazolyl, imidazolyl, pyridinyl, pyrimidinyl, pyrazinyl, pyridazinyl, triazinyl, tetrazinyl, pyyrolizinyl, indolyl, quinolinyl, isoquinolinyl, benzoimidazolyl, indazolyl, quinolinyl, isoquinolinyl, quinolizinyl, cinnolinyl, quinazolynyl, phthalazinyl, naphthridinyl, quinoxalinyl, thiophenyl, thianaphthenyl, furanyl, benzofuranyl, benzothiazolyl, thiazolynyl, isothiazolyl, thiadiazolynyl, oxazolyl, isoxazolyl, oxadiaziolyl, oxadiaziolyl, and the like, which may bear one or more substituents. Heteroaryl substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety. The term “heteroarylene,” as used herein, refers to a biradical derived from an heteroaryl group, as defined herein, by removal of two hydrogen atoms. Heteroarylene groups may be substituted or unsubstituted.

[0115] Additionally, heteroarylene groups may be incorporated as a linker group into an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, or heteroalkynylene group, as defined herein. Heteroarylene group substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0116] The term “acyl,” as used herein, is a subset of a substituted alkyl group, and refers to a group having the general formula —C(═O)RA, —C(═O)ORA, —C(═O)—O—C(═O)RA, —C(═O)SRA, —C(═O)N(RA).sub.2, —C(═S)RA, —C(═S)N(RA).sub.2, and —C(═S)S(RA), —C(═NRA)RA, —C(═NRA)ORA, —C(═NRA)SRA, and —C(═NRA)N(RA).sub.2, wherein RA is hydrogen; halogen; substituted or unsubstituted hydroxyl; substituted or unsubstituted thiol; substituted or unsubstituted amino; acyl; optionally substituted aliphatic; optionally substituted heteroaliphatic; optionally substituted alkyl; optionally substituted alkenyl; optionally substituted alkynyl; optionally substituted aryl, optionally substituted heteroaryl, aliphaticoxy, heteroaliphaticoxy, alkyloxy, heteroalkyloxy, aryloxy, heteroaryloxy, aliphaticthioxy, heteroaliphaticthioxy, alkylthioxy, heteroalkylthioxy, arylthioxy, heteroarylthioxy, mono- or di-aliphaticamino, mono- or di-heteroaliphaticamino, mono- or di-alkylamino, mono- or di-heteroalkylamino, mono- or di-arylamino, or mono- or di heteroarylamino; or two RA groups taken together form a 5- to 6-membered heterocyclic ring. Exemplary acyl groups include aldehydes (—CHO), carboxylic acids (—CO.sub.2H), ketones, acyl halides, esters, amides, imines, carbonates, carbamates, and ureas. Acyl substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0117] The term “acylene,” as used herein, is a subset of a substituted alkylene, substituted alkenylene, substituted alkynylene, substituted heteroalkylene, substituted heteroalkenylene, or substituted heteroalkynylene group, and refers to an acyl group having the general formulae: R.sub.0—(C═X.sub.1)—R.sub.0—, —R—X.sub.2(C═X.sub.1)—R.sub.0—, or —R.sub.0—X.sub.2(C═X.sub.1)X.sub.3—R.sub.0—, where X.sub.1, X.sub.2, and X.sub.3 is, independently, oxygen, sulfur, or NRr, wherein Rr is hydrogen or optionally substituted aliphatic, and R.sub.0 is an optionally substituted alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, or heteroalkynylene group, as defined herein. Exemplary acylene groups wherein R.sub.0 is alkylene includes —(CH.sub.2)T-O(C═O)—(CH.sub.2)T-; (CH.sub.2)T-NRr(C═O)—(CH.sub.2)T-; —(CH.sub.2)T-O(C=NRr)-(CH.sub.2)T-; —(CH.sub.2)T-NRr(C=NRr)-(CH.sub.2)T-; —(CH.sub.2)T-(C═O)—(CH.sub.2)T-; —(CH.sub.2)T-(C=NRr)-(CH.sub.2)T-; —(CH.sub.2)T-S(C═S)—(CH.sub.2)T-; —(CH.sub.2)T-NRr(C═S)—(CH.sub.2)—; —(CH.sub.2)T-S(C=NRr)-(CH.sub.2)T-; —(CH.sub.2)T-O(C═S)—(CH.sub.2)T-; —(CH.sub.2)T-(C═S)—(CH.sub.2)T-; or —(CH.sub.2)T-S(C═O)—(CH.sub.2)T-, and the like, which may bear one or more substituents; and wherein each instance of T is, independently, an integer between 0 to 20. Acylene substituents include, but are not limited to, any of the substituents described herein, that result in the formation of a stable moiety.

[0118] The term “amino,” as used herein, refers to a group of the formula (—NH.sub.2). A “substituted amino” refers either to a mono-substituted amine (—NHRh) of a disubstituted amine (—NRh.sub.2), wherein the Rh substituent is any substituent as described herein that results in the formation of a stable moiety (e.g., an amino protecting group; aliphatic, alkyl, alkenyl, alkynyl, heteroaliphatic, heterocyclic, aryl, heteroaryl, acyl, amino, nitro, hydroxyl, thiol, halo, aliphaticamino, heteroaliphaticamino, alkylamino, heteroalkylamino, arylamino, heteroarylamino, alkylaryl, arylalkyl, aliphaticoxy, heteroaliphaticoxy, alkyloxy, heteroalkyloxy, aryloxy, heteroaryloxy, aliphaticthioxy, heteroaliphaticthioxy, alkylthioxy, heteroalkylthioxy, arylthioxy, heteroarylthioxy, acyloxy, and the like, each of which may or may not be further substituted). In certain embodiments, the Rh substituents of the di-substituted amino group (—NRh.sub.2) form a 5- to 6-membered heterocyclic ring.

[0119] The term “hydroxy” or “hydroxyl,” as used herein, refers to a group of the formula (—OH). A “substituted hydroxyl” refers to a group of the formula (—ORO, wherein Ri can be any substituent which results in a stable moiety (e.g., a hydroxyl protecting group; aliphatic, alkyl, alkenyl, alkynyl, heteroaliphatic, heterocyclic, aryl, heteroaryl, acyl, nitro, alkylaryl, arylalkyl, and the like, each of which may or may not be further substituted).

[0120] The term “thio” or “thiol,” as used herein, refers to a group of the formula (—SH). A “substituted thiol” refers to a group of the formula (—SRr), wherein Rr can be any substituent that results in the formation of a stable moiety (e.g., a thiol protecting group; aliphatic, alkyl, alkenyl, alkynyl, heteroaliphatic, heterocyclic, aryl, heteroaryl, acyl, sulfinyl, sulfonyl, cyano, nitro, alkylaryl, arylalkyl, and the like, each of which may or may not be further substituted).

[0121] The term “imino,” as used herein, refers to a group of the formula (=NRr), wherein Rr corresponds to hydrogen or any substituent as described herein, that results in the formation of a stable moiety (for example, an amino protecting group; aliphatic, alkyl, alkenyl, alkynyl, heteroaliphatic, heterocyclic, aryl, heteroaryl, acyl, amino, hydroxyl, alkylaryl, arylalkyl, and the like, each of which may or may not be further substituted).

[0122] The term “azide” or “azido,” as used herein, refers to a group of the formula (—N.sub.3).

[0123] The terms “halo” and “halogen,” as used herein, refer to an atom selected from fluorine (fluoro, —F), chlorine (chloro, —Cl), bromine (bromo, —Br), and iodine (iodo, —I).

[0124] B. Synthesis of Kethoxal Derivatives.

[0125] Kethoxal and its analogs were first reported to react with and inactivate the RNA virus since the 1950s (Staehelin, Biochimca Biophysica Acta 31:448-54, 1959). The 1,2-dicarbonyl group of kethoxal showed high specificity to guanine, which make it very useful in the probing of RNA secondary structure. In addition, other kethoxal derivatives, such as kethoxal bis(thiosemicarbazone)(KTS)(Booth and Sartorelli, Nature 210:104-5, 1966) displayed promising anticancer activity, bikethoxal (Brewer et al., Biochemistry 22:4303-9, 1983) demonstrated the ability to cross-link RNA and proteins within intact ribosomal 30S and 505 subunits. However, it is surprising that the synthesis of kethoxal and its derivatives are rarely reported. A review of the literature indicates that kethoxal preparation was mostly based on oxidation by selenium dioxide following purification by vacuum distillation (Brewer et al., Biochemistry 22:4303-9, 1983; Tiffany et al., Journal of the American Chemical Society 79:1682-87, 1957; Lo et al., Journal of Labelled Compounds and Radiopharmaceuticals 44:S654-S656, 2001). This method has several limitations. First, metal oxidation reaction always results in byproducts. Second, the excess selenium was hard to remove. Third, synthesis of kethoxal derivatives with other functional groups is difficult because the reagents with functional groups may not survive with selenium dioxide under reflux conditions. For example, studies indicate that azide- and thiol-modified kethoxal cannot be prepared by selenium dioxide oxidation. Lastly, vacuum distillation purification is not suitable for kethoxal derivatives with high-molecular weight.

[0126] Glyoxal and its analogs are sensitive to air and therefore cannot be purified by chromatography (Jiang et al., Organic Letters 3:4011-13, 2001). The mild oxidation of diazoketone by freshly prepared dimethyl-dioxirane (DMD) can produce a glyoxal functional group in quantitative yield (Jiang et al., Organic Letters 3:4011-13, 2001). In this study, azide-kethoxal was prepared through a novel synthetic strategy following a three-step synthesis (Scheme S1). The advantage of the synthetic process is its easy-to-operate and is high yield. What's more, this strategy is also convenient for the preparation of other kethoxal derivatives with various functional groups.

##STR00019##

[0127] N.sub.3-kethoxal reacts with guanines in single-stranded DNA and RNA. Kethoxal (1,1-dihydroxy-3-ethoxy-2-butanone), is known to react with guanines specifically at N.sub.1 and N.sub.2 position at the Watson-Crick interface (Shapiro et al., Biochemistry 8:238-45, 1969). Due to challenges in synthesis, kethoxal has not been further functionalized and widely applied to nucleic acid labeling previously. Described herein is the development of N.sub.3-kethoxal (FIG. 1a), which not only inherits the reactivity towards guanines from its parent molecule, but also contains an azido group, which serves as a bio-orthogonal handle to be further functionalized through ‘click’ chemistry. With MALDI-TOF analysis, it was shown that N.sub.3-kethoxal efficiently labels guanines on RNA, while no reactivity was observed on other bases. It was further demonstrated the selectivity of N.sub.3-kethoxal on single-stranded DNA/RNA by using gel electrophoresis. After incubation with N.sub.3-kethoxal, a shift was observed on single-stranded RNA on the gel, indicating the formation of the RNA-kethoxal complex, while no such shift was detected with double-stranded RNA. It was also shown that N.sub.3-kethoxal is highly cell-permeable and can label DNA and RNA in living cells within 5 min, which makes it suitable for further applications.

[0128] C. Single-Stranded DNA Mapping (ssDNA-seq)

[0129] Kethoxal derivatives of the present invention enables genome-wide single-stranded DNA mapping (ssDNA-seq). Taking advantage of the sensitivity and the selectivity of kethoxal derivatives towards single-stranded nucleic acids, kethoxal derivatives were first applied to map single-stranded regions of the genome, which has not been previously achieved. One procedure for ssDNA mapping can comprise one or more of the following steps. First step can be preparing a labeling medium by adding a kethoxal derivative to a cell culture medium. Incubating cells in the labeling medium for a desired time, at a desired temperature, under desired conditions. Transcription inhibition studies can be performed by treating cells under DRB or triptolide or equivalent reagent prior to incubating in kethoxal derivative-containing medium. After incubation, harvesting the cells, and isolating total DNA from the cells. DNA can be suspended in FhO and in the presence of DBCO-PEG4-biotin (DMSO solution) and incubated at an appropriate temperature for an appropriate time, e.g., 37° C. for 2 h. RNase A can be added to the reaction mixture and the mixture incubated for an appropriate time at an appropriate temperature, e.g., 37° C. for 15 min. 7. DNA can be recovered from the reaction mixture and used to construct libraries. Libraries can be constructed using various commercial library construction kits, for example Accel-NGS Methyl-seq DNA library kit (Swift) or Kapa Hyper Plus kit (Kapa Biosystems). The next step can include sequencing libraries, for example on a Nextseq SR80 mode and perform downstream analysis.

[0130] D. Kethoxal-Assisted RNA-RNA Interaction Mapping (KARRI)

[0131] Considering the reactivity of kethoxal derivatives towards RNA, kethoxal-assisted RNA-RNA interaction mapping (KARRI) was developed based on kethoxal derivative labeling and dendrimer crosslinking of interacting RNA-RNA. To demonstrate KARRI mapping, formaldehyde-fixed mouse embryonic stem cells (mESC) were treated with kethoxal derivative and then incubated with PAMAM dendrimers (Esfand and Tomalia, (2001) Drug Discov. Today 6:427-36) decorated with two dibenzocyclooctyne (DBCO) molecules and one biotin molecule at the surface. Each PAMAM dendrimer chemically crosslinks two proximal kethoxal derivative labeled guanines through the “click” reaction, and provides a handle for enrichment through the biotin moiety on it. After crosslinking, RNAs were isolated, fragmented and subjected to immunoprecipitation by streptavidin beads. Proximity ligation was then performed on beads and the product RNA was used for library construction. Sequencing reads were aligned with only chimeric reads used for RNA-RNA interaction analysis.

[0132] Procedure for kethoxal-Assisted RNA-RNA interaction (KARRI). The KARRI methods can include one or more of the following steps. Cells can be suspended in a fixative, e.g., formaldehyde solution, and incubated at room temperature with gentle rotate. The reaction can be quenched, e.g., by adding glycine. For translation inhibitor treatment, cells are treated with cycloheximide or harringtonine. Cells are collected and aliquoted. Kethoxal derivative can be diluted 1:5 using an appropriate solvent, e.g., DMSO, and incorporated into a labeling buffer (kethoxal derivative, lysis buffer (10 mM Tris-HCl pH 8.0, 10 mM NaCl, 0.2 IGEPAL CA630) and proteinase inhibitor cocktail). Cells can be suspended in labeling buffer and cells collected after incubation. Collected cells can be washed in ice-cold lysis buffer 1, 2,3 or more times. The cell pellet can be suspended in MeOH containing cross-linkers and the cells collected. RNA can be extracted and purified. RNA pellets can be suspended in H2O, with DNase I buffer (100 mM Tris-HCl pH 7.4, 25 mM MgCl.sub.2, 1 mM CaCl.sub.2), DNase I, RNase inhibitor, and incubated with gentle shaking. The mixture is then exposed to proteinase K. RNA is extracted with phenol-chloroform and purified RNA by EtOH precipitation. RNA pellets are suspended in H.sub.2O and fragmentation buffer with RNase inhibitor and incubated. Fragmentation is stopped by additional of fragmentation stop buffer and the sample is put on ice to quench the reaction. Crosslinked RNA is enriched by using pre-washed Streptavidin beads. Beads are mixed with DNA and the mixture was incubated at room temperature with gentle rotate. After incubation, beads were washed. Washed beads are suspended in H.sub.2O with PNK buffer and T4 PNK, RNase inhibitor and shaken for a first incubation period, then another aliquot of T4 PNK and ATP are added and shaken for a second incubation period. Beads are washed and suspended in a ligase solution. After incubation in ligase solution the beads are washed. RNA is eluted by heating and the RNA recovered. Half of the recovered RNA is used for library construction. Libraries are sequenced and downstream analysis performed.

EXAMPLES

[0133] The following examples as well as the figures are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples or figures represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention.

Example 1

Synthesis of Kethoxal Derivatives

[0134] The synthesis route of N.sub.3-kethoxal.

##STR00020##

[0135] 2-(2-azidoethoxy)propanoic acid 2: Sodium hydride (60% dispersion in mineral oil, 6 g, 0.15 mol) was added to a 250 mL two-necked flask, then anhydrous THF 50 mL was added under N.sub.2 condition. The suspension was vigorously stirred and cooled to 0° C. 2-Azidoenthanol (8.7 g, 0.1 mol) in 20 mL anhydrous THF was added dropwise over 20 minutes. The solution was stirred at an ambient temperature for 15 mins, then cooled to 0° C. again. Ethyl 2-bromopropionate (27.15 g, 0.15 mol) in 10 mL THF was added dropwise. The reaction mixture was warmed to room temperature and stirred overnight under N.sub.2 atmosphere. 100 mL Water was used to quench the reaction and the resulted mixture was washed by diethyl ether three times (3×100 mL). The combined organic layers were dried over anhydrous Na.sub.2SO.sub.4. The crude product was dissolved in 50 ml THF and was added to LiOH aqueous solution (40 ml, 1 M). The mixture was stirred for 16 h at room temperature. THF was removed and HCl (2 M) was added to pH 2. Then, the THF was extracted by diethyl ether three times (3×100 ml). The combined organic layers were dried over anhydrous NaSO.sub.4. After concentration and silica gel chromatography (ethyl acetate:petroleum ether=1:7), the product 2 was collected as colorless oil (6.67 g, 26%). .sup.1H NMR (400 MHz, CDCl.sub.3): δ=4.09 (q, J=6.9 Hz, 1H), 3.85 (ddd, J=9.8, 5.9, 3.4 Hz, 1H), 3.66-3.58 (m, 1H), 3.55-3.46 (m, 1H), 3.42-3.33 (m, 1H), 1.49 (t, J=9.4 Hz, 3H). .sup.13C NMR (101 MHz, CDCl.sub.3): δ=178.48, 74.98, 69.13, 50.65, 18.47. HRMS C.sub.3H.sub.9N.sub.3O.sub.3.sup.+ [M+H].sup.+ calculated 160.07167, found 160.07091.

##STR00021##

[0136] 3-(2-azidoethoxy)-1-diazopentane-2-one 3: Under N.sub.2 condition, 2 (1.59 g, 10 mmol) was dissolved in 15 mL anhydrous CH.sub.2C12 and one drop of DMF. Oxalyl chloride (926 μL, 15 mmol) was added to the solution and stirred at room temperature for 2 h. After that, the solvent and excess oxalyl chloride was removed. The residue was dissolved in anhydrous CH.sub.3CN 50 mL, cooled to 0° C., and (Trimethylsilyl)diazomethane solution 2 M in diethyl ether (4 mL, 10 mmol) was added dropwise. The reaction mixture was stirred at 0° C. overnight. The solvent was evaporated and silica gel chromatography (ethyl acetate:petroleum ether=1:7) was performed in order to afford product 3 as yellow oil (620 mg, 33.8%). .sup.1H NMR (400 MHz, CDCl.sub.3): δ=5.82 (s, 1H), 4.00-3.85 (m, 1H), 3.72-3.60 (m, 2H), 3.48-3.35 (m, 2H), 1.38 (d, J=6.8 Hz, 3H). .sup.13C NMR (101 MHz, CDCl.sub.3): δ=196.94, 80.89, 68.73, 52.30, 50.88, 18.58. HRMS C.sub.6H.sub.9N.sub.5O.sub.2.sup.+ [M+H].sup.+ calculated 184.0829, found 184.0822.

##STR00022##

[0137] Azido-kethoxal 1 (N.sub.3-kethoxal), or 3-(2-azidoethoxy)-1,1-dihydroxybutan-2-one (4):

[0138] According to Adam's procedure, the Dimethyldioxirane (DMD) in an acetone solution was prepared. To the compound 3 (183 mg, 1 mmol), 11 mL DMD-acetone was added in several portions. Obvious gas evolution was observed. The reaction mixture was stirred at room temperature until the reaction was complete under TLC monitor to Azido-kethoxal 1 and its hydyate 4 as a yellow oil. .sup.1H NMR (400 MHz, CDCl.sub.3): δ=[9.5 (m)+5.5 (m), 1H], 4.55-4.40 (m, 1H), 3.75 (m, 2H), 3.50-3.25 (m, 2H), 1.50-1.20 (m, 3H). HRMS C.sub.6H.sub.9N.sub.3O.sub.3.sup.+ [M+Na].sup.+ calculated 194.0536, found 194.0555.

[0139] General chemical and biological materials. All chemical reagents for N.sub.3-kethoxal synthesis were purchased from commercial sources. RNA oligoes were purchased from Integrated DNA Technologies, Inc. (IDT) and Takara Biomedical Technology Co., Ltd. Buffer salts and chemical reagents for N.sub.3-kethoxal synthesis were purchased from commercial sources. Superscript III, Dynabeads® MyOne™ Streptavidin C1 was purchased from Life technologies. T4 PNK, T4 RNL2tr K227Q, 5′-Deadenylase, RecJ.sub.f were purchased from New England Biolabs. CircLigaseII was purchase from epicenter company. DBCO-Biotin was purchase from Click Chemistry Tools LLC (A116-10). All RNase-free solutions were prepared from DEPC-treated MilliQ-water.

Synthesis Scheme of Carbon-Kethoxal (5-azido-2-oxopentanal)

[0140] ##STR00023##

[0141] Synthetic Route for carbon-kethoxal (5-azido-2-oxopentanal). Ethyl 4-azidobutyrate: A solution of ethyl 4-bromobutyrate (7.802 g, 40 mmol), NaN.sub.3 (3.900 g, 60 mmol, 15 equiv.) and 6 ml of water in 18 ml of acetone was refluxed for 5 h. After the reaction finished, the acetone was removed by vacuum and residue was partitioned between Et.sub.2O (200 ml) and water (100 ml). The organic layer was separated, and the water layer was extracted with 200 mL Et.sub.2O, twice. The combined organic layer was washed with water followed by drying over anhydrous Na.sub.2SO.sub.4. After filtration and evaporation of the solvent, silica gel chromatography was performed (ethyl acetate:petroleum ether=1:50) and ethyl 4-azidobutyrate (6.21 g, quant.) was obtained as a colorless oil. .sup.1H NMR (400 MHz, CDCl.sub.3) δ 4.05 (q, J=7.2 Hz, 2H), 3.39 (t, J=6.5 Hz, 2H), 2.40 (t, J=7.2 Hz, 2H), 2.08 (p, J=6.7 Hz, 2H), 1.18 (t, J=7.2 Hz, 3H).

[0142] 4-azidobutanoic acid: The above product ethyl 4-azidobutyrate (2.583 g, 20 mmol) was suspended in a mixture of LiOH.H.sub.2O (2.520 g, 60 mmol, 3.0 eq) in water (30 mL) and THF (10 mL). The mixture was stirred at 50° C. for 12 h. THF was removed and HCl (2 M) was added to adjust pH to 2. Then, the THF was extracted by diethyl ether three times (3×100 ml). The combined organic layers were dried over anhydrous NaSO.sub.4. After concentration and silica gel chromatography (acetone:petroleum ether=1:10 to 1:2), the product 4-azidobutanoic acid was collected as colorless oil (2.011 g, 78%). .sup.1H NMR (400 MHz, CDCl.sub.3) δ 10.19 (s, 1H), 3.36 (t, J=6.7 Hz, 2H), 2.46 (t, J=7.2 Hz, 2H), 1.90 (p, J=6.9 Hz, 2H).

[0143] 5-azido-1-diazopentan-2-one: Under inert conditions (N.sub.2), the above product 4-azidobutanoic acid (646 mg, 5 mmol) was dissolved in 15 mL anhydrous CH.sub.2C12 and chilled at 0° C. DMF and oxalyl chloride (650 μL, 7.5 mmol) were added to the solution dropwise. After warming the reaction mixture to room temperature, it was stirred for 2 h. After that, the solvent and excess oxalyl chloride were removed. The residue was dissolved in anhydrous CH.sub.2Cl.sub.2 25 mL, cooled to 0° C., and CaO (308 mg, 5.5 mmol, 1.1 equiv.) was added. To this, 2M TMSCHN.sub.2 solution in diethyl ether (2.5 mL, 5 mmol) was added dropwise. The reaction mixture was stirred at 0° C. overnight. The solvent was evaporated and silica gel chromatography (ethyl acetate:petroleum ether=1:5) was performed in order to afford product 5-azido-1-diazopentan-2-one as yellow oil (680 mg, 89%). .sup.1H NMR (400 MHz, CDCl.sub.3) δ 5.30 (s, 1H), 3.35 (t, J=6.6 Hz, 2H), 2.42 (s, 2H), 1.92 (p, J=6.9 Hz, 2H).

[0144] Carbon kethoxal (5-azido-2-oxopentanal): According to Adam's procedure, the dimethyldioxirane (DMD) in an acetone solution was prepared. To 5-azido-1-diazopentan-2-one (39 mg, 0.28 mmol), 5 mL DMD-acetone was added and gas evolution was observed. The reaction mixture was stirred at room temperature until the reaction was completed (under TLC monitoring) to form carbon kethoxal and its hydrate as a yellow oil (quant.). .sup.1H NMR (400 MHz, CDCl.sub.3): δ=[9.23 (m)+5.24 (m), 1H], 3.41-3.31 (m, 2H), 3.01-2.46 (m, 2H), 1.96-1.80 (m, 2H).

Synthetic Scheme for Mono-Fluoride Kethoxal (3-(2-azidoethoxy)-3-fluoro-2-oxopropanal)

[0145] ##STR00024##

[0146] Synthetic Route for mono-fluoride kethoxal (3-(2-azidoethoxy)-3-fluoro-2-oxopropanal): ethyl 2-(2-azidoethoxy)-2-fluoroacetate:Sodium hydride (4.4 g) was added to anhydrous THF. The suspension was vigorously stirred and cooled to 0° C. 2-azidoenthanol (6.416 g) in 20 mL anhydrous THF was added dropwise. The solution was stirred at RT for 15 min, then cooled to 0° C. again. Ethyl 2-bromopropionate (14.868 g) in 10 mL THF was added dropwise. The reaction mixture was warmed to room temperature and stirred overnight. Water was used to quench the reaction, followed by extraction with diethyl ether. The combined organic layers were dried over anhydrous Na.sub.2SO.sub.4. After filtration and evaporation of solvent, silica gel chromatography was performed (ethyl acetate:petroleum ether=1:50 to 1:30), and ethyl 2-(2-azidoethoxy)-2-fluoroacetate (8.832 g, 64%) was obtained as a colorless oil.

[0147] 2-(2-azidoethoxy)-2-fluoroacetic acid: The above product ethyl 2-(2-azidoethoxy)-2-fluoroacetate (7.5 g) was suspended in a mixture of LiOH.H.sub.2O (4.93 g) in water and THF. The mixture was stirred at 50° C. for 3 h. THF was removed and HCl (2 M) was added to adjust the mixture to pH 2. The THF was next extracted by diethyl ether. The combined organic layers were dried over anhydrous NaSO.sub.4. After concentration and silica gel chromatography (acetone:petroleum ether=1:10 to 1:5), the product 2-(2-azidoethoxy)-2-fluoroacetic acid was collected as colorless oil (3.80 g, 60%).

[0148] 1-(2-azidoethoxy)-3-diazo-1-fluoropropan-2-one: Under inert conditions (N.sub.2), the above product 2-(2-azidoethoxy)-2-fluoroacetic acid (200 mg) was dissolved in anhydrous CH.sub.2C12 and chilled to 0° C. DMF and oxalyl chloride (158 μL) was added to the solution dropwise. After warming the reaction mixture to room temperature, it was stirred for 2 h. The solvent and excess oxalyl chloride were removed. The residue was dissolved in anhydrous CH.sub.2C12, cooled to 0° C., and CaO (76 mg) was added. A 2M TMSCHN.sub.2 solution in diethyl ether (0.31 mL) was added dropwise to the mixture and was stirred at 0° C. overnight. The solvent was evaporated and silica gel chromatography (ethyl acetate:petroleum ether=1:20 to 1:5) was performed in order to afford the product 1-(2-azidoethoxy)-3-diazo-1-fluoropropan-2-one as yellow oil (180 mg, 79%).

[0149] Mono-fluoride kethoxal (3-(2-azidoethoxy)-3-fluoro-2-oxopropanal): According to Adam's procedure, the dimethyldioxirane (DMD) in an acetone solution was prepared. To 1-(2-azidoethoxy)-3-diazo-1-fluoropropan-2-one (47 mg), DMD-acetone was added, and obvious gas evolution was observed. The reaction mixture was stirred at room temperature until the reaction was complete (under TLC monitoring) to mono-fluoride kethoxal and its hydrate as a yellow oil (quant.).

Synthetic Scheme for Phenyl-Kethoxal (3,5-dimethoxyphenylglyoxal)

[0150] ##STR00025##

[0151] Synthetic route for the phenyl-kethoxal (3,5-dimethoxyphenylglyoxal): 2-diazo-1-(3,5-dimethoxy-phenyl)-ethanone: A mixture of 3,5-dimethoxybenzoic acid (182 mg) and SOCl.sub.2 (1.0 mL) was heated under reflux at 100° C. for 1.5 h. The excess SOCl.sub.2 was removed by vacuum to afford the crude product. The residue was dissolved in anhydrous CH.sub.2C12, cooled to 0° C., and CaO (61 mg) was added. Then, a 2M solution of TMSCHN.sub.2 in diethyl ether (0.5 mL) was added dropwise. The reaction mixture was stirred at 0° C. overnight. The solvent was evaporated and silica gel chromatography (ethyl acetate:petroleum ether=1:10 to 1:3) was performed in order to afford product 2-diazo-1-(3,5-dimethoxy-phenyl)-ethanone as yellow solid (102 mg, 50%).

[0152] Phenyl kethoxal or 3,5-dimethoxyphenylglyoxal: According to Adam's procedure, the dimethyldioxirane (DMD) in an acetone solution was prepared. To 2-diazo-1-(3,5-dimethoxy-phenyl)-ethanone (12 mg), DMD-acetone was added, and gas evolution was observed. The reaction mixture was stirred at room temperature until the reaction was complete (under TLC monitoring) to phenyl kethoxal and its hydyate as a yellow oil (quant.).

Example 2

Verification of N.SUB.3.-Kethoxal Reaction with Guanine

[0153] The N.sub.3-kethoxal and guanine reaction was verified. Guanine (100 μM, 2 μL), N.sub.3-kethoxal (1 M in DMSO, 1 μL), sodium cacodylate buffer (0.1 M, pH=7.0, 1 μL) and 6 μL ddH.sub.2O were added together into 1.5 mL microcentrifuge tube at 37° C. for 10 min. HRMS C.sub.11H.sub.14N.sub.8O.sub.4.sup.+ [M+H].sup.+ calculated 323.1216, found 323.1203.

##STR00026##

Example 3

The Reaction of N.SUB.3.-Kethoxal and RNA

[0154] The reaction of N.sub.3-kethoxal and RNA was generally performed with the following protocol: 100 pmol RNA oligo and 1 μmol N.sub.3-kethoxal was incubated in total 10 μL solution in PBS buffer at 37° C. for 10 mins. The modified RNA was purified by Micro Bio-Spin™ P-6 Gel Columns (Biorad, 7326222) to remove residual chemicals. The purified labelled RNA can be used for further studies such as mass spectrometry, gel electrophoresis and copper-free click reaction with biotin-DBCO.

[0155] Removal N.sub.3-kethoxal modification from N.sub.3-kethoxal labelled RNA. The detailed protocol of N.sub.3-kethoxal modification erasing is described below “N.sub.3-kethoxal-remove sample preparation” in the keth-seq protocol. Generally, the purified N.sub.3-kethoxal modified RNA was incubated with high concentration of GTP (1/2 volume of the reaction solution, final concentration 50 mM) at 37° C. for 6 hours or at 95° C. for 10 mins. Higher temperature benefits the removal the N.sub.3-kethoxal modification.

[0156] Fixation of N.sub.3-kethoxal modification in RNA. The labile N.sub.3-kethoxal modification in RNA can be fixed in the presence of borate buffer. The solution of N.sub.3-kethoxal labelled RNA was mixed with 1/10 volume of stock borate buffer (final concentration: 50 mM; stock borate buffer: 500 mM potassium borate, pH 7.0, pH was monitored while adding potassium hydroxide pellets to 500 mM boric acid). The borate buffer fixation was used in various steps of keth-seq protocol, see below.

[0157] MALDI-TOF-MS analysis of N.sub.3-kethoxal labelled RNA oligo. The N.sub.3-kethoxal labelled RNA was purified by Micro Bio-Spin™ P-6 Gel Columns. Meanwhile the buffer exchange occurred from PBS buffer to tris buffer that can be directly used in MALDI-TOF-MS experiment without extra desalt step. One microliter of product solution was mixed with one microliter matrix which include 8:1 volume ratio of 2′4′6′-trihydroxyacetophenone (THAP, 10 mg/mL in 50% CH.sub.3CN/H.sub.2O):ammonium citrate (50 mg/mL in H2O). Then the mixture was spotted on the MALDI sample plate, dried and analyzed by Bruker Ultraflextreme MALDI-TOF-TOF Mass Spectrometers.

Example 4

Phenol-Kethoxal and Diphenol-Kethoxal

[0158] To test the labeling activity of phenol-kethoxal and diphenol-kethoxal, the two compounds were incubated with a 12-mer synthetic RNA oligo containing four guanine bases, respectively. After 10 min, the reactions were cleaned-up and analyzed by MALDI-TOF. Both phenol-kethoxal and diphenol-kethoxal label the oligo efficiently, with all four guanines on all oligo molecules modified, see FIG. 3.

##STR00027##

[0159] A second set of test were performed to test cell permeability of phenol-kethoxal and diphenol-kethoxal and if the labeling enhances radical-mediated biotinylation. Cells were treated with phenol-kethoxal and diphenol-kethoxal for 10 min, respectively, and RNA isolated from treated cells. An in vitro biotinylation reaction was performed by mixing these kethoxal derivative-labeled RNAs with biotin-phenol, horseradish peroxidase (HRP), and H.sub.2O.sub.2, see FIG. 4. HRP is an enzyme that mimics APEX with higher radical generation activity in vitro. The biotinylated RNAs were purified and subjected to dot blot analysis. Both phenol-kethoxal-modified and diphenol-kethoxal-modified RNAs show stronger biotin signals compared with the control sample, suggesting (di)phenol-kethoxal could enhance radical-mediated biotinylation and show potentials for high-efficiency APEX-mediated proximity labeling in live cells.

Example 5

Experiment Procedure for Single-Stranded DNA (SSDNA) Mapping

[0160] ssDNA is performed by: (1) Prepare labeling medium by adding 5 μL pure a kethoxal derivative (e.g., N.sub.3-kethoxal) to 5 mL pre-warmed cell culture medium for each 10 cm dish. (2) Incubate cells in the labeling medium for 10 min at 37° C., 5% CO.sub.2. (3) For transcription inhibition experiments, cells were treated for 2 h under 100 μM DRB or 1 μM triptolide before incubated in kethoxal-derivative containing medium. (4) Harvest cells after the 10 min incubation, isolate total DNA from cells by PureLink genomic DNA mini kit according to the manufacturer's protocol. (5) Suspend 5 μg total DNA in 85 μL H2O, then add 10 μL 10×PBS and 5 μL 20 mM DBCO-PEG4-biotin (DMSO solution), incubate the mixture at 37° C. for 2 h. (6) Add 5 μL RNase A to the reaction mixture, incubate the mixture at 37° C. for another 15 min. (7) Recover DNA from the reaction mixture by DNA Clean & Concentrator kit according to the manufacturer's protocol.

[0161] Libraries were constructed by different commercial library construction kits with similar results obtained. Two examples include:

[0162] (8a) The use of Accel-NGS Methyl-seq DNA library kit (Swift): (i) Fragment 2 μg of recovered DNA from step 7 by sonication under 30 s-on/30 s-off setting for 30 cycles (ii) Save 5% of the fragmented DNA for input, use the rest 95% to enrich biotin-tagged DNA by 10 μL pre-washed Streptavidin Cl beads according to the manufacturer's protocol with minor changes. Beads were washed 3 times in 1× binding and wash buffer with 0.05% tween-20 before re-suspended in 95 μL 2× binding and wash buffer with 0.1% tween-20. Beads were mixed with DNA and the mixture was incubated at room temperature for 15 min with gentle rotation. After incubation, beads were washed 5 times with 1× binding and wash buffer with 0.05% tween-20 (iii) Elute the enriched DNA by heating the beads in 30 μL H.sub.2O at 95° C. for 10 min. Treat the saved input at 95° C. for 10 min at the same time. The put both input and IP samples on ice immediately (iv) Proceed to library construction according the protocol from the Accel-NGS Methyl-seq DNA library kit.

[0163] (8b) The use of Kapa Hyper Plus kit (Kapa Biosystems): (i) Suspend 1 μg total DNA in 35 μL H.sub.2O, add 5 μL Kapa fragmentation buffer and 10 μL Kapa fragmentation enzyme. Incubate the mixture at 37° C. for 30 min. (ii) Recovery fragmented DNA by DNA Clean & Concentrator kit according to the manufacturer's protocol (iii) Perform A-tailing and adapter ligation according the protocol from Kapa Hyper Plus kit. (iv) Save 5% of the DNA for input, use the rest 95% to enrich biotin-tagged DNA by 10 μL pre-washed Streptavidin Cl beads according to the manufacturer's protocol with minor changes. Beads were washed 3 times in 1× binding and wash buffer with 0.05% tween-20, before re-suspended in 95 μL 2× binding and wash buffer with 0.1% tween-20. Beads were mixed with DNA and the mixture was incubated at room temperature for 15 min with gentle rotate. After incubation, beads were washed 5 times with 1× binding and wash buffer with 0.05% tween-20 (v) Elute the enriched DNA by heating the beads in 25 μL H.sub.2O at 95° C. for 10 min. (vi) PCR amplify the libraries for both input and IP samples according to the protocol from Kapa Hyper Plus kit. (9) Sequence libraries on Nextseq SR80 mode and perform downstream analysis.

Example 6

Experiment Procedure for Kethoxal-Assisted RNA-RNA Interaction (KARRI)

[0164] KRRI is performed by: (1) Suspend live cells in 1% formaldehyde solution at 1×10.sup.6/mL and incubate at room temperature for 10 min with gentle rotate. Then quench this reaction by adding glycine to a final concentration of 125 mM and rotate the mixture at room temperature for 5 min. For translation inhibitor treatment, cells were treated with 100 μg/mL cycloheximide or 3 μg/mL harringtonine at 37° C. for 10 min. (2) Collect and take 2×10.sup.6 cells. Dilute Kethoxal derivative (e.g., N.sub.3-kethoxal) by 1:5 using DMSO. Make a labeling buffer by adding 10 μL Kethoxal derivative into 290 μL lysis buffer (10 mM Tris-HCl pH 8.0, 10 mM NaCl, 0.2 IGEPAL CA630) with 3 μL 100× proteinase inhibitor cocktail. (3) Suspend cells in labeling buffer and rotate at room temperature for 30 min, then centrifuge at 2500 g for 5 min at 4° C. to collect cells. (4) Wash cell pellets with 500 μL ice-cold lysis buffer for 3 times. (5) Suspend the pellet in 500 μL MeOH containing 10 mM dendrimers, rotate for 1 h at 37° C. Then centrifuge at 2500 g for 5 min at 4° C. to collect cells. (6) Wash cell pellet twice with 500 μL ice-cold lysis buffer. (7) Resuspend cells in 385 μL lysis buffer, add 50 μL 10% SDS, 30 μL proteinase K, 10 μL RNase inhibitor, 25 μL 500 mM K3B03, shake at 65° C. for 2 h. (8) Add 500 μL phenol-chloroform to extract RNA and purify RNA by EtOH precipitation. (9) Suspend RNA pellets in 104 μL H2O, add 12 μL 10×DNase I buffer (100 mM Tris-HCl pH 7.4, 25 mM MgCl.sub.2, 1 mM CaCl.sub.2), 2 μL DNase I (Thermo), 2 μL RNase inhibitor, and incubate at 37° C. for 30 min with gentle shaking. (10) Add 130 μL 2× proteinase K buffer (100 mM Tris-HCl pH 7.5, 200 mM NaCl, 2 mM EDTA, 1% SDS), 10 μL proteinase K to the reaction, incubate at 65° C. for 30 min with shaking (11) Extract RNA with 300 μL phenol-chloroform and purify RNA by EtOH precipitation. (12) Suspend RNA pellets in 61 μL H2O, add 7 μL 10× fragmentation buffer (Thermo), 2 μL RNase inhibitor, incubate at 70° C. for 15 min, then add 8 μL fragmentation stop buffer (Thermo) and put the sample on ice immediately to quench the reaction. (13) Enrich crosslinked RNA by using 30 μL pre-washed Streptavidin Cl beads according to the manufacturer's protocol with minor changes. Beads were washed 3 times in 1× binding and wash buffer with 0.05% tween-20, before re-suspended in 80 μL 2× binding and wash buffer with 0.1% tween-20. Beads were mixed with DNA and the mixture was incubated at room temperature for 30 min with gentle rotate. After incubation, beads were washed 3 times with 1× binding and wash buffer with 0.05% tween-20 and once with 1×PNK buffer (NEB). (14) Suspend beads in 41 μL H2O, 5 μL 10×PNK buffer (NEB), 3 μL T4 PNK (NEB), 1 μL RNase inhibitor and shake at 37° C. for 30 min, then add another 3 μL T4 PNK and 6 μL 10 mM ATP, shake at 37° C. for another 30 min. (15) Wash beads twice with 1× binding and wash buffer with 0.05% tween-20, once with 1× ligation buffer (NEB). (16) Suspend beads in 668 μL H2O, 100 μL 10× ligase buffer (NEB), 10 μL RNase inhibitor, 2 μL 10 mM ATP, 20 μL T4 RNA ligase 2 (high concentration) (NEB), 200 μL 50% PEG 8000, rotate at 16° C. for 16 h. (17) Wash beads twice with 1× binding and wash buffer with 0.05% tween-20, once with H2O. Then elute RNA by heating the beads in 30 μL H.sub.2O and shaking beads at 95° C. for 10 min. (18) Take half of the recovered RNA for library construction using the SMARTer Stranded Total RNA-seq Kit v2-Pico Input (Takara) by following the protocol from the manufacturer. (19) Sequence libraries on Novaseq PE150 mode and perform downstream analysis.

Example 7

Activity of Representative Kethoxal Derivatives

[0165] Reactivity and reversibility modulation of kethoxal derivatives. The reactivity and the reversibility of kethoxal derivatives can be tuned by adding a series of functional groups onto the glyoxal moiety. Here we studied the effect of reaction pH, electron donating/withdrawing groups, and steric on the reactivity and reversibility of kethoxal derivatives. We observed that the reactivity and reversibility are pH-dependent. Hydrogen bond acceptors at the α-position of the ketone largely enhance the reactivity by stabilizing the formed adduct through H-bonding with the guanosine amine proton. While most tested kethoxal derivatives show reversibility with GTP as competitor, less reactive molecules are generally more reversible. These studies deeper our understanding about the chemical properties of these molecules and therefore, provide theoretical structure-activity guidance and validates the feasibility of applying these molecules to both genomic studies (such as ssDNA and RNA labelling applications) and kethoxal-based therapeutic purposes.

##STR00028##

[0166] 1. Kethoxal derivatives are more reactive with guanosine at basic conditions. Conversion rates of guanosine at different pH conditions are shown in Table 1. Shown below is an example with a phenyl-substituted kethoxal derivative. In the image of the reaction below, guanosine is depicted as S1 and the kethoxal derivative is depicted as S2.

##STR00029##

TABLE-US-00001 TABLE 1 The effect of pH on reactivity. S1:S2 = 1:1 S1:S2 = 1:2 S1:S2 = 1:3 S1:S2 = 1:5 pH = 7.0 18.8% 37.6% 51.0% 67.0% pH = 7.8 32.2% 51.2% 66.2% 80.1%

[0167] 2. Electronic and steric effects can modulate the reactivity of kethoxal derivatives. Conversion rates of guanosine with different kethoxal derivatives at pH 7.8 are shown in Tables 2A and 2B. In the image of the reaction below, guanosine is depicted as S1 and the kethoxal derivatives are depicted as S2.

##STR00030##

TABLE-US-00002 TABLE 2A Reactivity of different kethoxal derivatives at pH = 7.8. S1:S2 = 2:1 S1:S2 = 1:1 S1:S2 = 1:2 S1:S2 = 1:3 S1:S2 = 1:5 S1:S2 = 1:10 [00031]embedded image 51.6% 86.9% 97.4% [00032]embedded image 51.3% 81.6% 97.4% [00033]embedded image 51.3% 78.6% 95.4% [00034]embedded image 43.6% 77.5% 92.1% [00035]embedded image 38.0% 71.2% 90.3% 96.5% [00036]embedded image 35.8% 67.2% 89.9% 92.2% [00037]embedded image 33.4% 60.4% 79.4% 85.4%

TABLE-US-00003 TABLE 2B Reactivity of different kethoxal derivatives at pH = 7.8 (continued) S1:S2 = 2:1 S1:S2 = 1:1 S1:S2 = 1:2 S1:S2 = 1:3 S1:S2 = 1:5 S1:S2 = 1:10 [00038]embedded image 32.1% 49.8% 67.3% 89.7% 98.3% 98.3% [00039]embedded image 23.8% 48.0% 70.5% 88.9% 89.2% [00040]embedded image 40.2% 66.7% 74.9% 83.2% [00041]embedded image 25.2% 41.1% 60.0% 66.4% 73.6% [00042]embedded image 32.2% 51.2% 66.2% 80.1% [00043]embedded image 30.9% 49.6% 69.5% 76.7% 81.6% [00044]embedded image  8.5% 14.7% 28.9% 38.7% 63.1%

[0168] 3. Reaction pH has different effects on kethoxal reactivity depending on substituents on the kethoxal derivatives. Conversion rates of guanosine with different kethoxal derivatives at pH 7.0 are shown in Tables 3A and 3B.

##STR00045##

TABLE-US-00004 TABLE 3A Reactivity of different kethoxal derivatives at pH = 7.0. S1:S2 = 2:1 S1:S2 = 1:1 S1:S2 = 1:2 S1:S2 = 1:4 S1:S2 = 1:10 [00046]embedded image 39.6% 70.6% 93.7% [00047]embedded image 23.6% 46.7% 76.6% [00048]embedded image 30.3% 52.2% 79.2% [00049]embedded image 29.5% 50.1% 81.3% [00050]embedded image 22.0% 46.7% 79.2% 95.3% [00051]embedded image 22.4% 40.4% 81.2% [00052]embedded image 16.8% 33.7% 55.4% 76.3%

TABLE-US-00005 TABLE 3B Reactivity of different kethoxal derivatives at pH = 7.0 (continued) S1:S2 = 2:1 S1:S2 = 1:1 S1:S2 = 1:2 S1:S2 = 1:4 S1:S2 = 1:0 [00053]embedded image  7.5% 17.0% 30.0% 59.7% [00054]embedded image 19.8% 40.8% 63.2% 87.2% [00055]embedded image 20.4% 46.2% 64.2% 84.7% [00056]embedded image 16.8% 38.3% 49.6% [00057]embedded image  9.9% 22.5% 33.2% 51.5% [00058]embedded image  3.2%  6.4%  8.2% 16.5% 24.7% [00059]embedded image  0  1.4%  1.9%  3.8%  9.8% [00060]embedded image  9.0% 22.5% 30.4%

[0169] 4. Improving product stability with hydrogen bonding. When guanosine reacts with kethoxal derivatives, a proton on the guanosine amine is capable of engaging in hydrogen bond formation. Therefore, kethoxal derivatives with H-bond-accepting substituents stabilize the product formed and facilitate the reaction. Conversely, derivatives without H-bonding substituents may be relatively less reactive. Shown in the image is N.sub.3-kethoxal, which has a ether-containing D linker (based on Formula I); this H-bond accepting moiety stabilizes the product.

##STR00061##

[0170] 5. Testing the reversibility of kethoxal derivatives by adjusting pH. As the reactivity of most kethoxal derivatives is higher under basic conditions, we first applied a high pH (pH=10.1) to transform kethoxal derivatives into the kethoxal-guanosine adduct. We then adjusted the pH to 5.8 and measured extent of product dissociation. Kethoxal derivatives and guanosine were mixed at 1:1 ratio. Results are shown in Table 4 (the numbers show the conversion of guanosine).

TABLE-US-00006 TABLE 4 The reversibility of kethoxal derivatives pH = pH = pH = pH = 10.1, 5.8, 5.8, 5.8, 10 min 10 min 4 h 24 h [00062]embedded image 79.8% 79.8% 80.2% 81.8% [00063]embedded image 77.0% 77.6% 80.3% [00064]embedded image 74.6% 75.0% 76.1% [00065]embedded image 75.5% 77.3% 77.2% [00066]embedded image 65.9% 65.6% 65.2% 58.7% [00067]embedded image 62.7% 64.3% 62.9% [00068]embedded image 24.5% 23.8% 21.6% 20.8% [00069]embedded image 84.7% 85.2% 84.4% 84.5% [00070]embedded image 30.2% 19.0% 14.7% [00071]embedded image 35.6% 31.9% 26.5% [00072]embedded image 19.7% 16.6% [00073]embedded image 28.3% 12.2% 10.7% 12.7% [00074]embedded image 46.2% 50.1% 57.1% 58.2% [00075]embedded image 41.5% 49.2% 55.1% 54.7%

[0171] 6. Testing the reversibility of kethoxal derivatives by using GTP for competition. We first mixed kethoxal derivatives and guanosine to form guanosine-kethoxal adducts. Kethoxal derivatives and guanosine were mixed at a 1:1 ratio. After 10 min, we added excess guanosine 5′-triphosphate (GTP), to as a competitor. Excess GTP is expected to competitively react with the kethoxal derivative, resulting in increased free guanosine. This free guanosine is detected by LCMS and used to determine relative reversibility afforded by the substituents on the kethoxal derivative (see reaction image and LCMS images).

[0172] Results are shown in Table 5 (the numbers show the conversion of guanosine) and an example LCMS image is shown below.

[0173] The kethoxal derivative reacts with guanosine to form the kethoxal-guanosine adduct.

##STR00076##

TABLE-US-00007 TABLE 5 The reversibility of kethoxal derivatives under competition condition pH = 7.0, pH = 7.0, pH = 7.0, 10 min 2 h 24 h [00077]embedded image 71.4% 60.8% 28.9% [00078]embedded image 51.6% 55.9% 33.6% [00079]embedded image 47.4% 29.7% 27.4% [00080]embedded image 54.4% 44.6% 37.5% [00081]embedded image 56.5% 40.9% [00082]embedded image 46.2% 38.2% 18.6% [00083]embedded image 34.6% 24.8% 12.4% [00084]embedded image 52.1% (pH = 7.8) 64.3% (pH = 7.8) 30.7% (pH = 7.8) [00085]embedded image 46.2% 21.0% 22.1% [00086]embedded image 41.8% 26.1% 23.4% [00087]embedded image 41.3% 12.6% 11.2% [00088]embedded image 25.7% 12.3%  4.4% [00089]embedded image  6.4% 18.6% 22.8% [00090]embedded image 51.2% (pH = 10.1) 42.4% (pH = 10.1) 22.2% (pH = 10.1) [00091]embedded image 21.8%  9.6%  8.4% [00092]embedded image 66.9% 66.1% 36.3% [00093]embedded image 48.0% 13.5%