OLIGONUCLEOTIDE MOLECULE AND APPLICATION THEREOF IN TUMOR THERAPY
20220096516 · 2022-03-31
Inventors
- Tao KANG (Nantong City, Jiangsu, CN)
- Longcheng LI (Nantong City, Jiangsu, CN)
- Moorim KANG (Nantong City, Jiangsu, CN)
Cpc classification
A61K31/7088
HUMAN NECESSITIES
A61K31/44
HUMAN NECESSITIES
A61K31/44
HUMAN NECESSITIES
A61K31/713
HUMAN NECESSITIES
C12N2320/11
CHEMISTRY; METALLURGY
A61K2300/00
HUMAN NECESSITIES
A61K2300/00
HUMAN NECESSITIES
A61K31/713
HUMAN NECESSITIES
International classification
Abstract
The present invention relates to oligomeric nucleic acids and uses thereof for the treatment of tumors. The oligomeric nucleic acid for tumor treatment provided by the present application can be small activating nucleic acid molecules. A small activating nucleic acid molecule of the present invention can be a double-stranded or single-stranded RNA molecule targeting the promoter region of an LHPP gene comprising a first nucleic acid strand and a second nucleic acid strand. The double-stranded RNA molecule targeting the promoter region of the LHPP gene comprises two nucleic acid strands of 16 to 35 nucleotides in length, wherein one of the nucleic acid strands has at least 75% homology or complementarity to a target selected from the promoter region of the LHPP gene. The present invention also relates to pharmaceutical compositions comprising the small activating nucleic acids and optional pharmaceutically acceptable carriers, and methods for upregulating the expression of the LHPP gene in a cell and methods for treating diseases or conditions related to insufficient or decreased expression of LHPP gene by using the small activating nucleic acid molecules or the pharmaceutical compositions.
Claims
1. A small activating RNA (saRNA) comprising a sense nucleic acid fragment and an antisense nucleic acid fragment, the sense nucleic acid fragment having at least 90% homology or complementarity to a continuous sequence of 16 to 35 nucleotides in length of any one of SEQ ID NOs: 500.
2. The saRNA of claim 1, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment are located on two different nucleic acid strands or on an identical nucleic acid strand, preferably forming a hairpin single-stranded nucleic acid molecule, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment comprise complementary regions, wherein the complementary regions form a double-stranded nucleic acid structure between the two fragments that can activate the expression of LHPP in a cell.
3. The saRNA of claim 1, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment are located on an identical nucleic acid strand, forming a hairpin single-stranded nucleic acid molecule.
4. The saRNA of claim 2, wherein at least one nucleic acid fragment has 3′ overhang of 0 to 6 nucleotides in length.
5. The saRNA of claim 4, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment has a 3′ overhang of 2 or 3 nucleotides in length.
6. The saRNA of claim 5, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment independently are 16 to 35 nucleotides.
7. The saRNA of claim 1, wherein one fragment of the saRNA having at least 90% homology or complementarity to nucleotide sequence selected from the group consisting of SEQ ID NOs:329-492.
8. The saRNA of claim 7, wherein the sense nucleic acid fragment of the saRNA having at least 90% homology to nucleotide sequence selected from the group consisting of SEQ ID NOs:1-164, and an antisense nucleic acid fragment having at least 90% homology to nucleotide sequence selected from the group consisting of SEQ ID NOs:165-328.
9. The saRNA of claim 8, wherein the sense nucleic acid fragment of the saRNA comprises nucleotide sequence selected from the group consisting of SEQ ID NOs:1-164, and the antisense nucleic acid fragment comprises any nucleotide sequence chosen from SEQ ID NOs: 165-328.
10. The saRNA of claim 1, wherein the saRNA comprises: i. at least one chemically modified nucleotide, or ii. one or more modifications selected from the group consisting of: a. modification of a phosphodiester bond connecting nucleotides in the nucleotide sequence of the saRNA; b. modification of 2′-OH of a ribose in the nucleotide sequence of the saRNA; c. modification of a base in the nucleotide sequence of the saRNA; d. at least one nucleotide in the nucleotide sequence of the saRNA being a locked nucleic acid.
11. The saRNA of claim 1, wherein the nucleotides in the saRNA are not chemically modified nucleotides.
12. (canceled)
13. The saRNA of claim 1, wherein the saRNA activating nucleic acid molecule activates or upregulates the expression of LHPP by at least 10%.
14. A nucleic acid encoding the saRNA of claim 1, wherein the nucleic acid is a DNA or RNA molecule.
15. (canceled)
16. (canceled)
17. A composition comprising the saRNA of claim 1, and, a pharmaceutically acceptable carrier, wherein the pharmaceutically acceptable carrier is an aqueous carrier, a liposome, a high molecular polymer, or a polypeptide.
18. (canceled)
19. The composition of claim 17, wherein the composition comprises 1-150 nM of the saRNA.
20. (canceled)
21. (canceled)
22. (canceled)
23. (canceled)
24. (canceled)
25. (canceled)
26. (canceled)
27. (canceled)
28. (canceled)
29. (canceled)
30. (canceled)
31. A method for activating or upregulating the expression of LHPP in a cell, comprising administering a composition of claim 1 to the cell.
32. (canceled)
33. (canceled)
34. (canceled)
35. (canceled)
36. (canceled)
37. (canceled)
38. (canceled)
39. A method for treating a disease or condition related to insufficient or decreased expression of LHPP protein in a human patient in need thereof, comprising administering a composition of claim 1.
40. The method of claim 39, wherein the disease or condition related to insufficient or decreased expression of LHPP protein comprises solid tumors, wherein the solid tumor is selected from the group consisting of liver cancer, lung cancer, bladder cancer, prostatic cancer, and glioma.
41. (canceled)
42. (canceled)
43. (canceled)
44. (canceled)
45. (canceled)
46. (canceled)
47. (canceled)
48. (canceled)
49. (canceled)
50. (canceled)
51. (canceled)
52. (canceled)
53. The saRNA of claim 2, wherein the sense nucleic acid fragment and the antisense nucleic acid fragment has a 3′ overhang of 2 or 3 nucleotides in length, and wherein the nucleotide of the overhang is dT.
54. The saRNA of claim 1, containing a sense nucleic acid fragment and an antisense nucleic acid fragment combination selected from the group consisting of: SEQ ID NO:1 and SEQ ID NO:165; SEQ ID NO:2 and SEQ ID NO:166; SEQ ID NO:3 and SEQ ID NO:167; SEQ ID NO:4 and SEQ ID NO:168; SEQ ID NO:5 and SEQ ID NO:169; SEQ ID NO:6 and SEQ ID NO:170; SEQ ID NO:7 and SEQ ID NO:171; SEQ ID NO:8 and SEQ ID NO:172; SEQ ID NO:9 and SEQ ID NO:173; SEQ ID NO:10 and SEQ ID NO:174; SEQ ID NO:11 and SEQ ID NO:175; SEQ ID NO:12 and SEQ ID NO:176; SEQ ID NO:13 and SEQ ID NO:177; SEQ ID NO:14 and SEQ ID NO:178; SEQ ID NO:15 and SEQ ID NO:179; SEQ ID NO:16 and SEQ ID NO:180; SEQ ID NO:17 and SEQ ID NO:181; SEQ ID NO:18 and SEQ ID NO:182; SEQ ID NO:19 and SEQ ID NO:183; SEQ ID NO:20 and SEQ ID NO:184; SEQ ID NO:21 and SEQ ID NO:185; SEQ ID NO:22 and SEQ ID NO:186; SEQ ID NO:23 and SEQ ID NO:187; SEQ ID NO:24 and SEQ ID NO:188; SEQ ID NO:25 and SEQ ID NO:189; SEQ ID NO:26 and SEQ ID NO:190; SEQ ID NO:27 and SEQ ID NO:191; SEQ ID NO:28 and SEQ ID NO:192; SEQ ID NO:29 and SEQ ID NO:193; SEQ ID NO:30 and SEQ ID NO:194; SEQ ID NO:31 and SEQ ID NO:195; SEQ ID NO:32 and SEQ ID NO:196; SEQ ID NO:33 and SEQ ID NO:197; SEQ ID NO:34 and SEQ ID NO:198; SEQ ID NO:35 and SEQ ID NO:199; SEQ ID NO:36 and SEQ ID NO:200; SEQ ID NO:37 and SEQ ID NO:201; SEQ ID NO:38 and SEQ ID NO:202; SEQ ID NO:39 and SEQ ID NO:203; SEQ ID NO:40 and SEQ ID NO:204; SEQ ID NO:41 and SEQ ID NO:205; SEQ ID NO:42 and SEQ ID NO:206; SEQ ID NO:43 and SEQ ID NO:207; SEQ ID NO:44 and SEQ ID NO:208; SEQ ID NO:45 and SEQ ID NO:209; SEQ ID NO:46 and SEQ ID NO:210; SEQ ID NO:47 and SEQ ID NO:211; SEQ ID NO:48 and SEQ ID NO:212; SEQ ID NO:49 and SEQ ID NO:213; SEQ ID NO:50 and SEQ ID NO:214; SEQ ID NO:51 and SEQ ID NO:215; SEQ ID NO:52 and SEQ ID NO:216; SEQ ID NO:53 and SEQ ID NO:217; SEQ ID NO:54 and SEQ ID NO:218; SEQ ID NO:55 and SEQ ID NO:219; SEQ ID NO:56 and SEQ ID NO:220; SEQ ID NO:57 and SEQ ID NO:221; SEQ ID NO:58 and SEQ ID NO:222; SEQ ID NO:59 and SEQ ID NO:223; SEQ ID NO:60 and SEQ ID NO:224; SEQ ID NO:61 and SEQ ID NO:225; SEQ ID NO:62 and SEQ ID NO:226; SEQ ID NO:63 and SEQ ID NO:227; SEQ ID NO:64 and SEQ ID NO:228; SEQ ID NO:65 and SEQ ID NO:229; SEQ ID NO:66 and SEQ ID NO:230; SEQ ID NO:67 and SEQ ID NO:231; SEQ ID NO:68 and SEQ ID NO:232; SEQ ID NO:69 and SEQ ID NO:233; SEQ ID NO:70 and SEQ ID NO:234; SEQ ID NO:71 and SEQ ID NO:235; SEQ ID NO:72 and SEQ ID NO:236; SEQ ID NO:73 and SEQ ID NO:237; SEQ ID NO:74 and SEQ ID NO:238; SEQ ID NO:75 and SEQ ID NO:239; SEQ ID NO:76 and SEQ ID NO:240; SEQ ID NO:77 and SEQ ID NO:241; SEQ ID NO:78 and SEQ ID NO:242; SEQ ID NO:79 and SEQ ID NO:243; SEQ ID NO:80 and SEQ ID NO:244; SEQ ID NO:81 and SEQ ID NO:245; SEQ ID NO:82 and SEQ ID NO:246; SEQ ID NO:83 and SEQ ID NO:247; SEQ ID NO:84 and SEQ ID NO:248; SEQ ID NO:85 and SEQ ID NO:249; SEQ ID NO:86 and SEQ ID NO:250; SEQ ID NO:87 and SEQ ID NO:251; SEQ ID NO:88 and SEQ ID NO:252; SEQ ID NO:89 and SEQ ID NO:253; SEQ ID NO:90 and SEQ ID NO:254; SEQ ID NO:91 and SEQ ID NO:255; SEQ ID NO:92 and SEQ ID NO:256; SEQ ID NO:93 and SEQ ID NO:257; SEQ ID NO:94 and SEQ ID NO:258; SEQ ID NO:95 and SEQ ID NO:259; SEQ ID NO:96 and SEQ ID NO:260; SEQ ID NO:97 and SEQ ID NO:261; SEQ ID NO:98 and SEQ ID NO:262; SEQ ID NO:99 and SEQ ID NO:263; SEQ ID NO:100 and SEQ ID NO:264; SEQ ID NO:101 and SEQ ID NO:265; SEQ ID NO:102 and SEQ ID NO:266; SEQ ID NO:103 and SEQ ID NO:267; SEQ ID NO:104 and SEQ ID NO:268; SEQ ID NO:105 and SEQ ID NO:269; SEQ ID NO:106 and SEQ ID NO:270; SEQ ID NO:107 and SEQ ID NO:271; SEQ ID NO:108 and SEQ ID NO:272; SEQ ID NO:109 and SEQ ID NO:273; SEQ ID NO:110 and SEQ ID NO:274; SEQ ID NO:111 and SEQ ID NO:275; SEQ ID NO:112 and SEQ ID NO:276; SEQ ID NO:113 and SEQ ID NO:277; SEQ ID NO:114 and SEQ ID NO:278; SEQ ID NO:115 and SEQ ID NO:279; SEQ ID NO:116 and SEQ ID NO:280; SEQ ID NO:117 and SEQ ID NO:281; SEQ ID NO:118 and SEQ ID NO:282; SEQ ID NO:119 and SEQ ID NO:283; SEQ ID NO:120 and SEQ ID NO:284; SEQ ID NO:121 and SEQ ID NO:285; SEQ ID NO:122 and SEQ ID NO:286; SEQ ID NO:123 and SEQ ID NO:287; SEQ ID NO:124 and SEQ ID NO:288; SEQ ID NO:125 and SEQ ID NO:289; SEQ ID NO:126 and SEQ ID NO:290; SEQ ID NO:127 and SEQ ID NO:291; SEQ ID NO:128 and SEQ ID NO:292; SEQ ID NO:129 and SEQ ID NO:293; SEQ ID NO:130 and SEQ ID NO:294; SEQ ID NO:131 and SEQ ID NO:295; SEQ ID NO:132 and SEQ ID NO:296; SEQ ID NO:133 and SEQ ID NO:297; SEQ ID NO:134 and SEQ ID NO:298; SEQ ID NO:135 and SEQ ID NO:299; SEQ ID NO:136 and SEQ ID NO:300; SEQ ID NO:137 and SEQ ID NO:301; SEQ ID NO:138 and SEQ ID NO:302; SEQ ID NO:139 and SEQ ID NO:303; SEQ ID NO:140 and SEQ ID NO:304; SEQ ID NO:141 and SEQ ID NO:305; SEQ ID NO:142 and SEQ ID NO:306; SEQ ID NO:143 and SEQ ID NO:307; SEQ ID NO:144 and SEQ ID NO:308; SEQ ID NO:145 and SEQ ID NO:309; SEQ ID NO:146 and SEQ ID NO:310; SEQ ID NO:147 and SEQ ID NO:311; SEQ ID NO:148 and SEQ ID NO:312; SEQ ID NO:149 and SEQ ID NO:313; SEQ ID NO:150 and SEQ ID NO:314; SEQ ID NO:151 and SEQ ID NO:315; SEQ ID NO:152 and SEQ ID NO:316; SEQ ID NO:153 and SEQ ID NO:317; SEQ ID NO:154 and SEQ ID NO:318; SEQ ID NO:155 and SEQ ID NO:319; SEQ ID NO:156 and SEQ ID NO:320; SEQ ID NO:157 and SEQ ID NO:321; SEQ ID NO:158 and SEQ ID NO:322; SEQ ID NO:159 and SEQ ID NO:323; SEQ ID NO:160 and SEQ ID NO:324; SEQ ID NO:161 and SEQ ID NO:325; SEQ ID NO:162 and SEQ ID NO:326; SEQ ID NO:163 and SEQ ID NO:327; and SEQ ID NO:164 and SEQ ID NO:328.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
DETAILED DESCRIPTION
[0055] In the present invention, the related terms are defined as follows:
[0056] The term “complementarity” as used herein refers to the capability of forming base pairs between two oligonucleotide strands. The base pairs are generally formed through hydrogen bonds between nucleotides in the antiparallel oligonucleotide strands. The bases of the complementary oligonucleotide strands can be paired in the Watson-Crick manner (such as A to T, A to U, and C to G) or in any other manner allowing the formation of a duplex (such as Hoogsteen or reverse Hoogsteen base pairing).
[0057] Complementarity includes complete complementarity and incomplete complementarity. “Complete complementarity” or “100% complementarity” means that each nucleotide from the first oligonucleotide strand can form a hydrogen bond with a nucleotide at a corresponding position in the second oligonucleotide strand in the double-stranded region of the double-stranded oligonucleotide molecule without “mispairing”. “Incomplete complementarity” means that not all the nucleotide units of the two strands are bonded with each other by hydrogen bonds. For example, for two oligonucleotide strands each of 20 nucleotides in length in the double-stranded region, if only two base pairs in this double-stranded region can be formed through hydrogen bonds, the oligonucleotide strands have a complementarity of 10%. In the same example, if 18 base pairs in this double-stranded region can be formed through hydrogen bonds, the oligonucleotide strands have a complementarity of 90%. Substantial complementarity refers to at least about 75%, about 79%, about 80%, about 85%, about 90%, about 95%, about 99% or about 100% complementarity.
[0058] The term “oligonucleotide” as used herein refers to polymers of nucleotides, and includes, but is not limited to, single-stranded or double-stranded molecules of DNA, RNA, or DNA/RNA hybrid, oligonucleotide strands containing regularly and irregularly alternating deoxyribosyl portions and ribosyl portions, as well as modified and naturally or unnaturally existing frameworks for such oligonucleotides. The oligonucleotide for activating target gene transcription described herein is a small activating nucleic acid molecule.
[0059] The terms “oligonucleotide strand” and “oligonucleotide sequence” as used herein can be used interchangeably, referring to a generic term for short nucleotide sequences having less than 35 bases (including nucleotides in deoxyribonucleic acid (DNA) or ribonucleic acid (RNA)). In the present invention, an oligonucleotide strand can have any of 16 to 35 nucleotides in length.
[0060] As used herein, the term “first nucleic acid strand” can be a sense strand or an antisense strand. The sense strand of a small activating RNA refers to a nucleic acid strand contained in a small activating RNA duplex which has identity to the coding strand of the promoter DNA sequence of a target gene, and the antisense strand refers to a nucleic acid strand in the small activating RNA duplex which is complementary with the sense strand.
[0061] As used herein, the term “second nucleic acid strand” can also be a sense strand or an antisense strand. If the first oligonucleotide strand is a sense strand, the second oligonucleotide strand is an antisense strand; and if the first oligonucleotide strand is an antisense strand, the second oligonucleotide strand is a sense strand.
[0062] The term “gene” as used herein refers to all nucleotide sequences required to encode a polypeptide chain or to transcribe a functional RNA. “Gene” can be an endogenous or fully or partially recombinant gene for a host cell (for example, because an exogenous oligonucleotide and a coding sequence for encoding a promoter are introduced into a host cell, or a heterogeneous promoter adjacent to an endogenous coding sequence is introduced into a host cell). For example, the term “gene” comprises a nucleic acid sequence consisting of exons and introns. Protein-coding sequences are, for example, sequences contained within exons in an open reading frame between an initiation codon and a termination codon, and as used herein, “gene” can comprise such as a gene regulatory sequence, such as a promoter, an enhancer, and all other sequences known in the art for controlling the transcription, expression or activity of another gene, no matter whether the gene comprises a coding sequence or a non-coding sequence. In one case, for example, “gene” can be used to describe a functional nucleic acid comprising a regulatory sequence such as a promoter or an enhancer. The expression of a recombinant gene can be controlled by one or more types of heterogeneous regulatory sequences.
[0063] The term “target gene” as used herein can refer to nucleic acid sequences naturally present in organisms, transgenes, viral or bacterial sequences, can be chromosomes or extrachromosomal genes, and/or can be transiently or stably transfected or incorporated into cells and/or chromatins thereof. The target gene can be a protein-coding gene or a non-protein-coding gene (such as a microRNA gene and a long non-coding RNA gene). The target gene generally contains a promoter sequence, and the positive regulation for the target gene can be achieved by designing a small activating nucleic acid molecule having sequence identity (also called homology) to the promoter sequence, characterized as the up-regulation of expression of the target gene. “Sequence of a target gene promoter” refers to a non-coding sequence of the target gene, and the reference of the sequence of a target gene promoter in the phrase “complementary with the sequence of a target gene promoter” of the present invention refers to a coding strand of the sequence, also known as a non-template strand, i.e., a nucleic acid sequence having the same sequence as the coding sequence of the gene. “Target” or “target sequence” refers to a sequence fragment in the sequence of a target gene promoter which is homologous or complementary with a sense oligonucleotide strand or an antisense oligonucleotide strand of a small activating nucleic acid molecule.
[0064] As used herein, the terms “sense strand” and “sense nucleic acid strand” can be used interchangeably, and the sense oligonucleotide strand of a small activating nucleic acid molecule refers to the first nucleic acid strand having sequence identity to the coding strand of the sequence of a target gene promoter in the small activating nucleic acid molecule duplex.
[0065] As used herein, the terms “antisense strand” and “antisense nucleic acid strand” can be used interchangeably, and the antisense oligonucleotide strand of a small activating nucleic acid molecule refers to the second nucleic acid strand which is complementary with the sense oligonucleotide strand in the small activating nucleic acid molecule duplex.
[0066] The term “coding strand” as used herein refers to a DNA strand in the target gene which cannot be used for transcription, and the nucleotide sequence of this strand is the same as that of a RNA produced from transcription (in the RNA, T in DNA is replaced by U). The coding strand of the double-stranded DNA sequence of the target gene promoter described herein refers to a promoter sequence on the same DNA strand as the DNA coding strand of the target gene.
[0067] The term “template strand” as used herein refers to the other strand complementary with the coding strand in the double-stranded DNA of the target gene, i.e., the strand that, as a template, can be transcribed into RNA, and this strand is complementary with the transcribed RNA (A to U and G to C). In the process of transcription, RNA polymerase binds to the template strand, moves along the 3′.fwdarw.5′ direction of the template strand, and catalyzes the synthesis of the RNA along the 5′.fwdarw.3′ direction. The template strand of the double-stranded DNA sequence of the target gene promoter described herein refers to a promoter sequence on the same DNA strand as the DNA template strand of the target gene.
[0068] The term “promoter” as used herein refers to a sequence which plays a regulatory role for the transcription of a protein-coding or RNA-coding nucleic acid sequence by spacially associating with the coding sequence. Generally, a eukaryotic gene promoter contains 100 to 5000 base pairs, although this length range is not intended to limit the term “promoter” as used herein. Although the promoter sequence is generally located at the 5′ terminus of a protein-coding or RNA-coding sequence, it can also exist in exon and intron sequences.
[0069] The term “transcription start site” as used herein refers to a nucleotide marking the transcription start on the template strand of a gene. The transcription start site can appear on the template strand of the promoter region. A gene can have more than one transcription start site.
[0070] The term “identity” or “homology” as used herein means that one oligonucleotide strand (sense or antisense strand) of an small activating RNA has sequence similarity with a coding strand or a template strand in a region of the promoter sequence of a target gene. As used herein, the “identity” or “homology” can be at least about 75%, about 79%, about 80%, about 85%, about 90%, about 95%, about 99% or about 100%.
[0071] The term “overhang” as used herein refers to non-base-paired nucleotides at the terminus (5′ or 3′) of an oligonucleotide strand, which is formed by one strand extending out of the other strand in a double-stranded oligonucleotide. A single-stranded region extending out of the 3′ terminus and/or 5′ terminus of a duplex is referred to as an overhang.
[0072] As used herein, the terms “gene activation”, “activating gene expression”, “gene up-regulation” and “up-regulating gene expression” can be used interchangeably, and mean an increase in transcription, translation, expression or activity of a certain nucleic acid as determined by measuring the transcriptional level, mRNA level, protein level, enzymatic activity, methylation state, chromatin state or configuration, translation level or the activity or state in a cell or biological system of a gene. These activities or states can be determined directly or indirectly. In addition, “gene activation”, “activating gene expression”, “gene up-regulation” or “up-regulating gene expression” refers to an increase in activity associated with a nucleic acid sequence, regardless of the mechanism of such activation. For example, the nucleic acid sequence plays a regulatory role as a regulatory sequence, the nucleic acid sequence is transcribed into RNA and the RNA is translated into a protein, thereby increasing the expression of the protein.
[0073] As used herein, the terms “small activating RNA”, “saRNA”, and “small activating nucleic acid molecule” can be used interchangeably, and refer to a nucleic acid molecule that can upregulate target gene expression and can be composed of the first nucleic acid fragment (antisense nucleic acid strand, also referred to as antisense oligonucleotide strand) containing a nucleotide sequence having sequence identity or homology with the non-coding nucleic acid sequence (e.g., a promoter and an enhancer) of a target gene and a second nucleic acid fragment (sense nucleic acid strand, also referred to as sense oligonucleotide strand) containing a nucleotide sequence complementary with the first nucleic acid fragment, wherein the first nucleic acid fragment and the second nucleic acid fragment form a duplex. The small activating nucleic acid molecule can also be composed of a synthesized or vector-expressed single-stranded RNA molecule that can form a hairpin structure by two complementary regions within the molecule, wherein the first region comprises a nucleotide sequence having sequence identity to the target sequence of a promoter of a gene, and the second region comprises a nucleotide sequence which is complementary with the first region. The length of the duplex region of the small activating nucleic acid molecule is typically about 10 to about 50, about 12 to about 48, about 14 to about 46, about 16 to about 44, about 18 to about 42, about 20 to about 40, about 22 to about 38, about 24 to about 36, about 26 to about 34, and about 28 to about 32 base pairs, and typically about 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, or about 50 base pairs. In addition, the terms “saRNA”, “small activating RNA”, and “small activating nucleic acid molecule” also comprise nucleic acids other than the ribonucleotide, including, but not limited to, modified nucleotides or analogues.
[0074] As used herein, the term “hot spot” refers to a promoter region of at least 30 bp in length of a target gene, wherein targets of functional small activating nucleic acid molecules are enriched, i.e., at least 30% of the small activating nucleic acid molecules designed to target this region can induce a 1.2-fold or more change in the mRNA expression of the target gene.
[0075] As used herein, the term “synthesis” refers to a method for synthesis of an oligonucleotide, including any method allowing RNA synthesis, such as chemical synthesis, in vitro transcription, and/or vector-based expression.
[0076] According to the present invention, the expression of LHPP gene is up-regulated by RNA activation, and a related disease (particularly hepatocellular carcinoma) is treated by increasing the expression of full-length LHPP protein. The LHPP gene in the present invention is sometimes also called a target gene.
[0077] The method for preparing the small activating nucleic acid molecule provided by the present invention comprises sequence design and synthesis.
[0078] Small activating nucleic acid molecules of the present invention can be chemically synthesized or can be obtained from a biotechnology company specialized in nucleic acid synthesis.
[0079] Generally speaking, chemical synthesis of nucleic acids comprises the following four steps: (1) synthesis of oligomeric ribonucleotides; (2) deprotection; (3) purification and isolation; and (4) desalination and annealing.
[0080] For example, the specific steps for chemically synthesizing saRNAs described herein are as follows:
[0081] (1) Synthesis of Oligomeric Ribonucleotides
[0082] Synthesis of 1 μM RNA was set in an automatic DNA/RNA synthesizer (e.g., Applied Biosystems EXPEDITE8909), and the coupling time of each cycle was set as 10 to 15 min. With a solid phase-bonded 5′-O-p-dimethoxytriphenylmethyl-thymidine substrate as an initiator, one base was bonded to the solid phase substrate in the first cycle, and then, in the n.sup.th(19≥n≥2) cycle, one base was bonded to the base bonded in the n−1.sup.th cycle. This process was repeated until the synthesis of the whole nucleic acid sequence was completed.
[0083] (2) Deprotection
[0084] The solid phase substrate bonded with the saRNA was put into a test tube, and 1 mL of a mixed solution of ethanol and ammonium hydroxide (volume ratio: 1:3) was added to the test tube. The test tube was then sealed and placed in an incubator, and the mixture was incubated at 25-70° C. for 2 to 30 h. The solution containing the solid phase substrate bonded with the saRNA was filtered, and the filtrate was collected. The solid phase substrate was rinsed with double distilled water twice (1 mL each time), and the filtrate was collected. The collected eluent was combined and dried under vacuum for 1 to 12 h. Then the solution was added with 1 mL of a solution of tetrabutylammonium fluoride in tetrahydrofuran (1 M), let stand at room temperature for 4 to 12 h, followed by addition of 2 mL of n-butanol. Precipitate was collected to give a single-stranded crude product of saRNA by high-speed centrifugation.
[0085] (3) Purification and Isolation
[0086] The resulting crude product of saRNA was dissolved in 2 mL of triethylamine acetate solution with a concentration of 1 mol/L, and the solution was separated by a reversed-phase C18 column of high pressure liquid chromatography to give a purified single-stranded product of saRNA.
[0087] (4) Desalination and Annealing
[0088] Salts were removed by gel filtration (size exclusion chromatography). A single sense oligomeric ribonucleic acid strand and a single antisense oligomeric ribonucleic acid strand were mixed in a 1 to 2 mL of buffer (10 mM Tris, pH 7.5-8.0, 50 mM NaCl) at a molar ratio of 1:1. The solution was heated to 95° C., and was then slowly cooled to room temperature to give a solution containing saRNA.
[0089] It was discovered in this study that after being introduced into a cell, the aforementioned saRNA could effectively increase the mRNA and protein expression of full-length LHPP.
[0090] The present invention will be further illustrated with reference to specific examples and drawings below. It should be understood that these examples are merely intended to illustrate the present invention rather than limit the scope of the present invention. In the following examples, study methods without specific conditions were generally in accordance with conventional conditions, such as conditions described in Sambrook, et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or conditions recommended by the manufacturer.
EXAMPLES
Example 1
Design and Synthesis of Small Activating Nucleic Acid Molecule Targeting LHPP Promoter
[0091] Using a 1 kb sequence (SEQ ID No: 493) encompassing −1 kb to −1 bp, but excluding an Alu repeat sequence from −647 bp to −198 bp, in the LHPP promoter region, as a target sequence (
[0092] Each of the sense strand and antisense strand in the double-stranded small activating RNA (saRNA) used in the study had 21 nucleotides in length. The 19 nucleotides in the 5′ region of the first nucleic acid strand (sense strand) of the double-stranded saRNA had 100% sequence identity to the target sequence of the promoter, and the 3′ terminus of the first nucleic acid strand contained a TT sequence. The 19 nucleotides in the 5′ region of the second nucleic acid strand were fully complementary with the first ribonucleic acid strand sequence, and the 3′ terminus of the second nucleic acid strand contained a TT sequence. The aforementioned two strands of the double-stranded saRNA were mixed at a molar ratio of 1:1, and after annealed to obtain a duplex saRNA.
[0093] The sequence of the LHPP promoter is shown as follows, which corresponds to position 1 to position 1000 from 5′ to 3′ of SEQ ID NO:493:
TABLE-US-00001 -1000 ttgaacccca taacatttca acgaattcct catcctttct gtgaatcaag -950 agcctgaaaa gaaatggtga aataatatga tcctctcttc tttgaaagct -900 caaagctatg ttggaccaga agtaaagtgt tctcgtttct atttaataac -850 ttgaaaggtt ccgaggggcc attgaggaaa ctcctccctt ttaatatcaa -800 tgtgtattta ttgcaaaaat aatgtagcat cgagtggtat tttatagctt -750 atccaaaaac ctcctgggtt taacgcattg tgatagtccc gttttcttct -700 cagcccaggt cctatgcatc ctcatctatg cagggctgtt atctgcatat -650 aatttttttt ttttttaaga caaagtcttg ctctgtcgcc ccggctggag -600 tgcagtggtg caatctcggc tcactgcaac ctccgcctcc caggttcaag -550 cggttcttcc gcctcagcct accgagtagc tgggactaca ggcatgcgcc -500 accacaccta ggtgattttt gtatttttag tagagacagg ggtttcacca -450 tgttgaccag gctggtctcg aactcctgat ctcaagcgat ccacccgcct -400 cagcctccca aagtgctggg attacaggca taagccacta cgcccggcct -350 caattttgta ttgtactttt tctttctttc tttaatagag acagggtctc -300 actatgttga ctaggttggt ctagaactcc tgggcacaag ctgtccgccc -250 gcttctgcct cccaaagtgc tgggattgca ggcgtgaacc accgcccctg -200 gctacaggtg ccttcttgtc tcaatttgcc tttgaccttt cttagggact -150 tgttttctgc ttttcctgct ctttgtccgc tgatctcctg ggaagaaagc -100 ttccgaaaag gacaccgttt caggggcgag tgacgccggg gtgcccaggc -50 cgcgccccag ttccgggttt gcacccggtc ttcttgccct gccccgcccg
Example 2
High-throughput Screening of saRNAs Targeting LHPP Promoter Region
[0094] 1. Cell Culture and Transfection
[0095] Human liver cancer cell line Huh7 was cultured in DMEM medium (Gibco), containing 10% of calf serum (Sigma-Aldrich) and 1% of penicillin/streptomycin (Gibco). The cells were cultured at 5% CO.sub.2 and 37° C. According to the instructions provided by the manufacturer, RNAiMax (Invitrogen, Carlsbad, Calif.) was used to transfect small activating RNAs at a concentration of 10 nM (unless otherwise specified).
[0096] 2. One-Step RT-qPCR
[0097] At the end of transfection, the media were discarded, and each well was washed with 150 μL of PBS once. After discarding the PBS, 50 μL of cell lysis buffer (Takara) was added to each well and incubated at room temperature for 5 min. 1 μL of the resulted cell lysis was taken from each well and analyzed by qPCR on an ABI 7500 fast real-time PCR system (Applied Biosystems) using a one-step TB Green™ PrimeScrip™ RT-PCR kit II (Takara, RR086A) Each transfection sample was repeatedly amplified in 3 replicate wells. PCR reaction conditions are shown in Table 1 below.
TABLE-US-00002 TABLE 1 PCR reaction preparation Reagent Volume/Reaction 2 × One-step TB Green RT-PCR buffer 4 2.5 μL PrimeScript 1 step enzyme mixture 2 0.2 μL Mixture of forward and reverse primers (5 μM) 0.4 μL No RNase dH.sub.2O 1.4 μL Crude lysate (RNA) 0.5 μL Sum 5 μL
[0098] Reaction conditions were as follows: reverse transcription reaction (stage 1): 5 min at 42° C., 10 s at 95° C.; PCR reaction (stage 2): 5 s at 95° C., 20 s at 60° C., 45 cycles of amplification. HPRT1 and TBP were used as internal reference genes. PCR primers used for amplifying LHPP, HPRT1 and TBP genes are shown in Table 2, wherein LHPP was amplified using the LHPP F1/R1 primer pair.
TABLE-US-00003 TABLE 2 Primer sequences for RT-qPCR analysis Primer Sequence No. Sequence (5′-3′) LHPP F1 SEQ ID NO: 494 AAGGCGCTTGAGTATGCCTG LHPP R1 SEQ ID NO: 495 GTGGGCTTCCACTCCTATCG HPRT1 F SEQ ID NO: 496 ATGGACAGGACTGAACGTCTT HPRT1 R SEQ ID NO: 497 TCCAGCAGGTCAGCAAAGAA TBP F SEQ ID NO: 498 ATAATCCCAAGCGGTTTGCT TBP R SEQ ID NO: 499 CTGCCAGTCTGGACTGTTCT
[0099] To calculate the relative expression level (E.sub.rel) of LHPP (target gene) in an saRNA-transfected sample relative to control treatment (Mock), the Ct values of the target gene and the two internal reference genes were substituted into formula 1
E.sub.rel=2.sup.(CtT.sup.
wherein CtT.sub.m was the Ct value of the target gene from the control (Mock) sample; CtT.sub.s was the Ct value of the target gene from the saRNA-treated sample; CtR1m was the Ct value of the internal reference gene 1 from the control (Mock) sample; CtR1s was the Ct value of the internal reference gene 1 from the saRNA-treated sample; CtR2m was the Ct value of the internal reference gene 2 from the control (Mock) sample; and CtR2s was the Ct value of the internal reference gene 2 from the saRNA-treated sample.
[0100] 3. Screening of Functional saRNAs
[0101] In order to obtain saRNAs capable of activating LHPP transcription, Huh7 cells were transfected with each of the aforementioned 290 saRNAs with a transfection concentration of 10 nM, and 72 hours later, according to the same method as described above, the cells were lysed and analyzed by one-step RT-qPCR to obtain the relative (compared with the control (Mock)) expression level of LHPP gene for each saRNA-treated sample. As shown in Table 3, 164 (56.6%) and 37 (12.8%) saRNAs exhibited activating and inhibiting activities, respectively, and 89 (30.7%) saRNAs had no effect on the expression of LHPP. The observed maximum activation and maximum inhibition is 3.46 fold and 0.49 fold, respectively, saRNAs with activating activity are referred to as activating saRNAs, and the saRNAs with inhibiting activity are referred to as inhibiting saRNAs.
TABLE-US-00004 TABLE 3 High-throughput screening results of LHPP Number log.sub.2 value of change in LHPP of Per- saRNA activity (fold) saRNAs centage High activation ≥0.49 (1.50)~≤1.79(3.46) 30 10.3 Moderate activation ≥0.26 (1.20)~<0.49 (1.50) 81 27.9 Mild activation ≥0.13 (1.10)~<0.26 (1.20) 53 18.3 No effect .sup. <0.13 (1.10)~>−0.13 (0.91) 89 30.7 Mild inhibition ≤−0.13 (0.91)~>−0.26 (0.84) 18 6.2 Moderate inhibition ≤−0.26 (0.84)~>−0.49 (0.71) 14 4.8 High inhibition ≤−0.49 (0.71)~≥−0.73 (0.49) 5 1.7 Total 290 100
[0102]
TABLE-US-00005 TABLE 4 Functional saRNA sequences, functional target sequences thereof and changes in LHPP mRNA expression level Fold of Fold of changes changes in in relative relative LHPP LHPP mRNA mRNA expression Active target Sense sequence Antisense sequence expression level saRNA sequence (5′-3′) (5′-3′) (5′-3′) level (log2) RAG7-133 GCTCTTTGTCCGCTGATCT GCUCUUUGUCCGCUGAUCUTT AGAUCAGCGGACAAAGAGCTT 2.54 1.35 (SEQ ID NO: 329) (SEQ ID NO: 1) (SEQ ID NO: 165) RAG7-892 TGTTGGACCAGAAGTAAAG UGUUGGACCAGAAGUAAAGTT CUUUACUUCUGGUCCAACATT 2.05 1.03 (SEQ ID NO: 330) (SEQ ID NO: 2) (SEQ ID NO: 166) RAG7-694 AGGTCCTATGCATCCTCAT AGGUCCUAUGCAUCCUCAUTT AUGAGGAUGCAUAGGACCUTT 1.88 0.91 (SEQ ID NO: 331) (SEQ ID NO: 3) (SEQ ID NO: 167) RAG7-132 CTCTTTGTCCGCTGATCTC CUCUUUGUCCGCUGAUCUCTT GAGAUCAGCGGACAAAGAGTT 1.86 0.90 (SEQ ID NO: 332) (SEQ ID NO: 4) (SEQ ID NO: 168) RAG7-178 AATTTGCCTTTGACCTTTC AAUUUGCCUUUGACCUUUCTT GAAAGGUCAAAGGCAAAUUTT 1.76 0.82 (SEQ ID NO: 333) (SEQ ID NO: 5) (SEQ ID NO: 169) RAG7-177 ATTTGCCTTTGACCTTTCT AUUUGCCUUUGACCUUUCUTT AGAAAGGUCAAAGGCAAAUTT 1.74 0.80 (SEQ ID NO: 334) (SEQ ID NO: 6) (SEQ ID NO: 170) RAG7-139 TTTCCTGCTCTTTGTCCGC UUUCCUGCUCUUUGUCCGCTT GCGGACAAAGAGCAGGAAATT 1.73 0.79 (SEQ ID NO: 335) (SEQ ID NO: 7) (SEQ ID NO: 171) RAG7-707 TTCTTCTCAGCCCAGGTCC UUCUUCUCAGCCCAGGUCCTT GGACCUGGGCUGAGAAGAATT 1.72 0.78 (SEQ ID NO: 336) (SEQ ID NO: 8) (SEQ ID NO: 172) RAG7-145 TCTGCTTTTCCTGCTCTTT UCUGCUUUUCCUGCUCUUUTT AAAGAGCAGGAAAAGCAGATT 1.71 0.77 (SEQ ID NO: 337) (SEQ ID NO: 9) (SEQ ID NO: 173) RAG7-146 TTCTGCTTTTCCTGCTCTT UUCUGCUUUUCCUGCUCUUTT AAGAGCAGGAAAAGCAGAATT 1.69 0.76 (SEQ ID NO: 338) (SEQ ID NO: 10) (SEQ ID NO: 174) RAG7-846 AAGGTTCCGAGGGGCCATT AAGGUUCCGAGGGGCCAUUTT AAUGGCCCCUCGGAACCUUTT 1.68 0.75 (SEQ ID NO: 339) (SEQ ID NO: 11) (SEQ ID NO: 175) RAG7-95 AAAAGGACACCGTTTCAGG AAAAGGACACCGUUUCAGGTT CCUGAAACGGUGUCCUUUUTT 1.67 0.74 (SEQ ID NO: 340) (SEQ ID NO: 12) (SEQ ID NO: 176) RAG7-94 AAAGGACACCGTTTCAGGG AAAGGACACCGUUUCAGGGTT CCCUGAAACGGUGUCCUUUTT 1.64 0.72 (SEQ ID NO: 341) (SEQ ID NO: 13) (SEQ ID NO: 177) RAG7-893 ATGTTGGACCAGAAGTAAA AUGUUGGACCAGAAGUAAATT UUUACUUCUGGUCCAACAUTT 1.63 0.70 (SEQ ID NO: 342) (SEQ ID NO: 14) (SEQ ID NO: 178) RAG7-706 TCTTCTCAGCCCAGGTCCT UCUUCUCAGCCCAGGUCCUTT AGGACCUGGGCUGAGAAGATT 1.63 0.70 (SEQ ID NO: 343) (SEQ ID NO: 15) (SEQ ID NO: 179) RAG7-184 TGTCTCAATTTGCCTTTGA UGUCUCAAUUUGCCUUUGATT UCAAAGGCAAAUUGAGACATT 1.59 0.67 (SEQ ID NO: 344) (SEQ ID NO: 16) (SEQ ID NO: 180) RAG7-696 CCAGGTCCTATGCATCCTC CCAGGUCCUAUGCAUCCUCTT GAGGAUGCAUAGGACCUGGTT 1.58 0.66 (SEQ ID NO: 345) (SEQ ID NO: 17) (SEQ ID NO: 181) RAG7-144 CTGCTTTTCCTGCTCTTTG CUGCUUUUCCUGCUCUUUGTT CAAAGAGCAGGAAAAGCAGTT 1.56 0.64 (SEQ ID NO: 346) (SEQ ID NO: 18) (SEQ ID NO: 182) RAG7-162 TTCTTAGGGACTTGTTTTC UUCUUAGGGACUUGUUUUCTT GAAAACAAGUCCCUAAGAATT 1.56 0.64 (SEQ ID NO: 347) (SEQ ID NO: 19) (SEQ ID NO: 183) RAG7-677 ATCTATGCAGGGCTGTTAT AUCUAUGCAGGGCUGUUAUTT AUAACAGCCCUGCAUAGAUTT 1.56 0.64 (SEQ ID NO: 348) (SEQ ID NO: 20) (SEQ ID NO: 184) RAG7-188 TTCTTGTCTCAATTTGCCT UUCUUGUCUCAAUUUGCCUTT AGGCAAAUUGAGACAAGAATT 1.55 0.64 (SEQ ID NO: 349) (SEQ ID NO: 21) (SEQ ID NO: 185) RAG7-907 GAAAGCTCAAAGCTATGTT GAAAGCUCAAAGCUAUGUUTT AACAUAGCUUUGAGCUUUCTT 1.55 0.63 (SEQ ID NO: 350) (SEQ ID NO: 22) (SEQ ID NO: 186) RAG7-909 TTGAAAGCTCAAAGCTATG UUGAAAGCUCAAAGCUAUGTT CAUAGCUUUGAGCUUUCAATT 1.53 0.61 (SEQ ID NO: 351) (SEQ ID NO: 23) (SEQ ID NO: 187) RAG7-35 GGTTTGCACCCGGTCTTCT GGUUUGCACCCGGUCUUCUTT AGAAGACCGGGUGCAAACCTT 1.53 0.61 (SEQ ID NO: 352) (SEQ ID NO: 24) (SEQ ID NO: 188) RAG7-886 ACCAGAAGTAAAGTGTTCT ACCAGAAGUAAAGUGUUCUTT AGAACACUUUACUUCUGGUTT 1.52 0.60 (SEQ ID NO: 353) (SEQ ID NO: 25) (SEQ ID NO: 189) RAG7-34 GTTTGCACCCGGTCTTCTT GUUUGCACCCGGUCUUCUUTT AAGAAGACCGGGUGCAAACTT 1.51 0.59 (SEQ ID NO: 354) (SEQ ID NO: 26) (SEQ ID NO: 190) RAG7-183 GTCTCAATTTGCCTTTGAC GUCUCAAUUUGCCUUUGACTT GUCAAAGGCAAAUUGAGACTT 1.51 0.59 (SEQ ID NO: 355) (SEQ ID NO: 27) (SEQ ID NO: 191) RAG7-195 AGGTGCCTTCTTGTCTCAA AGGUGCCUUCUUGUCUCAATT UUGAGACAAGAAGGCACCUTT 1.51 0.59 (SEQ ID NO: 356) (SEQ ID NO: 28) (SEQ ID NO: 192) RAG7-829 TTGAGGAAACTCCTCCCTT UUGAGGAAACUCCUCCCUUTT AAGGGAGGAGUUUCCUCAATT 1.50 0.59 (SEQ ID NO: 357) (SEQ ID NO: 29) (SEQ ID NO: 193) RAG7-691 TCCTATGCATCCTCATCTA UCCUAUGCAUCCUCAUCUATT UAGAUGAGGAUGCAUAGGATT 1.50 0.58 (SEQ ID NO: 358) (SEQ ID NO: 30) (SEQ ID NO: 194) RAG7-908 TGAAAGCTCAAAGCTATGT UGAAAGCUCAAAGCUAUGUTT ACAUAGCUUUGAGCUUUCATT 1.49 0.57 (SEQ ID NO: 359) (SEQ ID NO: 31) (SEQ ID NO: 195) RAG7-150 TGTTTTCTGCTTTTCCTGC UGUUUUCUGCUUUUCCUGCTT GCAGGAAAAGCAGAAAACATT 1.48 0.56 (SEQ ID NO: 360) (SEQ ID NO: 32) (SEQ ID NO: 196) RAG7-916 CTCTTCTTTGAAAGCTCAA CUCUUCUUUGAAAGCUCAATT UUGAGCUUUCAAAGAAGAGTT 1.48 0.56 (SEQ ID NO: 361) (SEQ ID NO: 33) (SEQ ID NO: 197) RAG7-847 AAAGGTTCCGAGGGGCCAT AAAGGUUCCGAGGGGCCAUTT AUGGCCCCUCGGAACCUUUTT 1.48 0.56 (SEQ ID NO: 362) (SEQ ID NO: 34) (SEQ ID NO: 198) RAG7-189 CTTCTTGTCTCAATTTGCC CUUCUUGUCUCAAUUUGCCTT GGCAAAUUGAGACAAGAAGTT 1.47 0.56 (SEQ ID NO: 363) (SEQ ID NO: 35) (SEQ ID NO: 199) RAG7-830 ATTGAGGAAACTCCTCCCT AUUGAGGAAACUCCUCCCUTT AGGGAGGAGUUUCCUCAAUTT 1.45 0.54 (SEQ ID NO: 364) (SEQ ID NO: 36) (SEQ ID NO: 200) RAG7-894 TATGTTGGACCAGAAGTAA UAUGUUGGACCAGAAGUAATT UUACUUCUGGUCCAACAUATT 1.45 0.54 (SEQ ID NO: 365) (SEQ ID NO: 37) (SEQ ID NO: 201) RAG7-196 CAGGTGCCTTCTTGTCTCA CAGGUGCCUUCUUGUCUCATT UGAGACAAGAAGGCACCUGTT 1.45 0.54 (SEQ ID NO: 366) (SEQ ID NO: 38) (SEQ ID NO: 202) RAG7-179 CAATTTGCCTTTGACCTTT CAAUUUGCCUUUGACCUUUTT AAAGGUCAAAGGCAAAUUGTT 1.45 0.53 (SEQ ID NO: 367) (SEQ ID NO: 39) (SEQ ID NO: 203) RAG7-879 GTAAAGTGTTCTCGTTTCT GUAAAGUGUUCUCGUUUCUTT AGAAACGAGAACACUUUACTT 1.44 0.53 (SEQ ID NO: 368) (SEQ ID NO: 40) (SEQ ID NO: 204) RAG7-697 CCCAGGTCCTATGCATCCT CCCAGGUCCUAUGCAUCCUTT AGGAUGCAUAGGACCUGGGTT 1.43 0.51 (SEQ ID NO: 369) (SEQ ID NO: 41) (SEQ ID NO: 205) RAG7-690 CCTATGCATCCTCATCTAT CCUAUGCAUCCUCAUCUAUTT AUAGAUGAGGAUGCAUAGGTT 1.42 0.51 (SEQ ID NO: 370) (SEQ ID NO: 42) (SEQ ID NO: 206) RAG7-104 AAGCTTCCGAAAAGGACAC AAGCUUCCGAAAAGGACACTT GUGUCCUUUUCGGAAGCUUTT 1.41 0.50 (SEQ ID NO: 371) (SEQ ID NO: 43) (SEQ ID NO: 207) RAG7-32 TTGCACCCGGTCTTCTTGC UUGCACCCGGUCUUCUUGCTT GCAAGAAGACCGGGUGCAATT 1.40 0.49 (SEQ ID NO: 372) (SEQ ID NO: 44) (SEQ ID NO: 208) RAG7-126 GTCCGCTGATCTCCTGGGA GUCCGCUGAUCUCCUGGGATT UCCCAGGAGAUCAGCGGACTT 1.39 0.48 (SEQ ID NO: 373) (SEQ ID NO: 45) (SEQ ID NO: 209) RAG7-850 TTGAAAGGTTCCGAGGGGC UUGAAAGGUUCCGAGGGGCTT GCCCCUCGGAACCUUUCAATT 1.39 0.47 (SEQ ID NO: 374) (SEQ ID NO: 46) (SEQ ID NO: 210) RAG7-684 CATCCTCATCTATGCAGGG CAUCCUCAUCUAUGCAGGGTT CCCUGCAUAGAUGAGGAUGTT 1.39 0.47 (SEQ ID NO: 375) (SEQ ID NO: 47) (SEQ ID NO: 211) RAG7-194 GGTGCCTTCTTGTCTCAAT GGUGCCUUCUUGUCUCAAUTT AUUGAGACAAGAAGGCACCTT 1.38 0.47 (SEQ ID NO: 376) (SEQ ID NO: 48) (SEQ ID NO: 212) RAG7-174 TGCCTTTGACCTTTCTTAG UGCCUUUGACCUUUCUUAGTT CUAAGAAAGGUCAAAGGCATT 1.38 0.47 (SEQ ID NO: 377) (SEQ ID NO: 49) (SEQ ID NO: 213) RAG7-902 CTCAAAGCTATGTTGGACC CUCAAAGCUAUGUUGGACCTT GGUCCAACAUAGCUUUGAGTT 1.38 0.46 (SEQ ID NO: 378) (SEQ ID NO: 50) (SEQ ID NO: 214) RAG7-887 GACCAGAAGTAAAGTGTTC GACCAGAAGUAAAGUGUUCTT GAACACUUUACUUCUGGUCTT 1.37 0.46 (SEQ ID NO: 379) (SEQ ID NO: 51) (SEQ ID NO: 215) RAG7-121 CTGATCTCCTGGGAAGAAA CUGAUCUCCUGGGAAGAAATT UUUCUUCCCAGGAGAUCAGTT 1.36 0.45 (SEQ ID NO: 380) (SEQ ID NO: 52) (SEQ ID NO: 216) RAG7-138 TTCCTGCTCTTTGTCCGCT UUCCUGCUCUUUGUCCGCUTT AGCGGACAAAGAGCAGGAATT 1.36 0.45 (SEQ ID NO: 381) (SEQ ID NO: 53) (SEQ ID NO: 217) RAG7-695 CAGGTCCTATGCATCCTCA CAGGUCCUAUGCAUCCUCATT UGAGGAUGCAUAGGACCUGTT 1.36 0.44 (SEQ ID NO: 382) (SEQ ID NO: 54) (SEQ ID NO: 218) RAG7-125 TCCGCTGATCTCCTGGGAA UCCGCUGAUCUCCUGGGAATT UUCCCAGGAGAUCAGCGGATT 1.36 0.44 (SEQ ID NO: 383) (SEQ ID NO: 55) (SEQ ID NO: 219) RAG7-776 TAGCATCGAGTGGTATTTT UAGCAUCGAGUGGUAUUUUTT AAAAUACCACUCGAUGCUATT 1.35 0.44 (SEQ ID NO: 384) (SEQ ID NO: 56) (SEQ ID NO: 220) RAG7-119 GATCTCCTGGGAAGAAAGC GAUCUCCUGGGAAGAAAGCTT GCUUUCUUCCCAGGAGAUCTT 1.35 0.43 (SEQ ID NO: 385) (SEQ ID NO: 57) (SEQ ID NO: 221) RAG7-180 TCAATTTGCCTTTGACCTT UCAAUUUGCCUUUGACCUUTT AAGGUCAAAGGCAAAUUGATT 1.35 0.43 (SEQ ID NO: 386) (SEQ ID NO: 58) (SEQ ID NO: 222) RAG7-898 AAGCTATGTTGGACCAGAA AAGCUAUGUUGGACCAGAATT UUCUGGUCCAACAUAGCUUTT 1.34 0.43 (SEQ ID NO: 387) (SEQ ID NO: 59) (SEQ ID NO: 223) RAG7-175 TTGCCTTTGACCTTTCTTA UUGCCUUUGACCUUUCUUATT UAAGAAAGGUCAAAGGCAATT 1.34 0.42 (SEQ ID NO: 388) (SEQ ID NO: 60) (SEQ ID NO: 224) RAG7-169 TTGACCTTTCTTAGGGACT UUGACCUUUCUUAGGGACUTT AGUCCCUAAGAAAGGUCAATT 1.34 0.42 (SEQ ID NO: 389) (SEQ ID NO: 61) (SEQ ID NO: 225) RAG7-720 TGATAGTCCCGTTTTCTTC UGAUAGUCCCGUUUUCUUCTT GAAGAAAACGGGACUAUCATT 1.33 0.41 (SEQ ID NO: 390) (SEQ ID NO: 62) (SEQ ID NO: 226) RAG7-678 CATCTATGCAGGGCTGTTA CAUCUAUGCAGGGCUGUUATT UAACAGCCCUGCAUAGAUGTT 1.33 0.41 (SEQ ID NO: 391) (SEQ ID NO: 63) (SEQ ID NO: 227) RAG7-917 TCTCTTCTTTGAAAGCTCA UCUCUUCUUUGAAAGCUCATT UGAGCUUUCAAAGAAGAGATT 1.32 0.40 (SEQ ID NO: 392) (SEQ ID NO: 64) (SEQ ID NO: 228) RAG7-897 AGCTATGTTGGACCAGAAG AGCUAUGUUGGACCAGAAGTT CUUCUGGUCCAACAUAGCUTT 1.31 0.39 (SEQ ID NO: 393) (SEQ ID NO: 65) (SEQ ID NO: 229) RAG7-147 TTTCTGCTTTTCCTGCTCT UUUCUGCUUUUCCUGCUCUTT AGAGCAGGAAAAGCAGAAATT 1.31 0.39 (SEQ ID NO: 394) (SEQ ID NO: 66) (SEQ ID NO: 230) RAG7-148 TTTTCTGCTTTTCCTGCTC UUUUCUGCUUUUCCUGCUCTT GAGCAGGAAAAGCAGAAAATT 1.30 0.38 (SEQ ID NO: 395) (SEQ ID NO: 67) (SEQ ID NO: 231) RAG7-123 CGCTGATCTCCTGGGAAGA CGCUGAUCUCCUGGGAAGATT UCUUCCCAGGAGAUCAGCGTT 1.30 0.38 (SEQ ID NO: 396) (SEQ ID NO: 68) (SEQ ID NO: 232) RAG7-896 GCTATGTTGGACCAGAAGT GCUAUGUUGGACCAGAAGUTT ACUUCUGGUCCAACAUAGCTT 1.30 0.38 (SEQ ID NO: 397) (SEQ ID NO: 69) (SEQ ID NO: 233) RAG7-778 TGTAGCATCGAGTGGTATT UGUAGCAUCGAGUGGUAUUTT AAUACCACUCGAUGCUACATT 1.29 0.37 (SEQ ID NO: 398) (SEQ ID NO: 70) (SEQ ID NO: 234) RAG7-97 CGAAAAGGACACCGTTTCA CGAAAAGGACACCGUUUCATT UGAAACGGUGUCCUUUUCGTT 1.29 0.37 (SEQ ID NO: 399) (SEQ ID NO: 71) (SEQ ID NO: 235) RAG7-103 AGCTTCCGAAAAGGACACC AGCUUCCGAAAAGGACACCTT GGUGUCCUUUUCGGAAGCUTT 1.29 0.37 (SEQ ID NO: 400) (SEQ ID NO: 72) (SEQ ID NO: 236) RAG7-114 CCTGGGAAGAAAGCTTCCG CCUGGGAAGAAAGCUUCCGTT CGGAAGCUUUCUUCCCAGGTT 1.28 0.36 (SEQ ID NO: 401) (SEQ ID NO: 73) (SEQ ID NO: 237) RAG7-140 TTTTCCTGCTCTTTGTCCG UUUUCCUGCUCUUUGUCCGTT CGGACAAAGAGCAGGAAAATT 1.28 0.36 (SEQ ID NO: 402) (SEQ ID NO: 74) (SEQ ID NO: 238) RAG7-134 TGCTCTTTGTCCGCTGATC UGCUCUUUGUCCGCUGAUCTT GAUCAGCGGACAAAGAGCATT 1.28 0.36 (SEQ ID NO: 403) (SEQ ID NO: 75) (SEQ ID NO: 239) RAG7-890 TTGGACCAGAAGTAAAGTG UUGGACCAGAAGUAAAGUGTT CACUUUACUUCUGGUCCAATT 1.28 0.36 (SEQ ID NO: 404) (SEQ ID NO: 76) (SEQ ID NO: 240) RAG7-130 CTTTGTCCGCTGATCTCCT CUUUGUCCGCUGAUCUCCUTT AGGAGAUCAGCGGACAAAGTT 1.28 0.36 (SEQ ID NO: 405) (SEQ ID NO: 77) (SEQ ID NO: 241) RAG7-186 CTTGTCTCAATTTGCCTTT CUUGUCUCAAUUUGCCUUUTT AAAGGCAAAUUGAGACAAGTT 1.28 0.35 (SEQ ID NO: 406) (SEQ ID NO: 78) (SEQ ID NO: 242) RAG7-29 CACCCGGTCTTCTTGCCCT CACCCGGUCUUCUUGCCCUTT AGGGCAAGAAGACCGGGUGTT 1.28 0.35 (SEQ ID NO: 407) (SEQ ID NO: 79) (SEQ ID NO: 243) RAG7-171 CTTTGACCTTTCTTAGGGA CUUUGACCUUUCUUAGGGATT UCCCUAAGAAAGGUCAAAGTT 1.27 0.34 (SEQ ID NO: 408) (SEQ ID NO: 80) (SEQ ID NO: 244) RAG7-172 CCTTTGACCTTTCTTAGGG CCUUUGACCUUUCUUAGGGTT CCCUAAGAAAGGUCAAAGGTT 1.27 0.34 (SEQ ID NO: 409) (SEQ ID NO: 81) (SEQ ID NO: 245) RAG7-112 TGGGAAGAAAGCTTCCGAA UGGGAAGAAAGCUUCCGAATT UUCGGAAGCUUUCUUCCCATT 1.26 0.33 (SEQ ID NO: 410) (SEQ ID NO: 82) (SEQ ID NO: 246) RAG7-676 TCTATGCAGGGCTGTTATC UCUAUGCAGGGCUGUUAUCTT GAUAACAGCCCUGCAUAGATT 1.26 0.33 (SEQ ID NO: 411) (SEQ ID NO: 83) (SEQ ID NO: 247) RAG7-899 AAAGCTATGTTGGACCAGA AAAGCUAUGUUGGACCAGATT UCUGGUCCAACAUAGCUUUTT 1.26 0.33 (SEQ ID NO: 412) (SEQ ID NO: 84) (SEQ ID NO: 248) RAG7-182 TCTCAATTTGCCTTTGACC UCUCAAUUUGCCUUUGACCTT GGUCAAAGGCAAAUUGAGATT 1.26 0.33 (SEQ ID NO: 413) (SEQ ID NO: 85) (SEQ ID NO: 249) RAG7-686 TGCATCCTCATCTATGCAG UGCAUCCUCAUCUAUGCAGTT CUGCAUAGAUGAGGAUGCATT 1.25 0.32 (SEQ ID NO: 414) (SEQ ID NO: 86) (SEQ ID NO: 250) RAG7-848 GAAAGGTTCCGAGGGGCCA GAAAGGUUCCGAGGGGCCATT UGGCCCCUCGGAACCUUUCTT 1.25 0.32 (SEQ ID NO: 415) (SEQ ID NO: 87) (SEQ ID NO: 251) RAG7-191 GCCTTCTTGTCTCAATTTG GCCUUCUUGUCUCAAUUUGTT CAAAUUGAGACAAGAAGGCTT 1.25 0.32 (SEQ ID NO: 416) (SEQ ID NO: 88) (SEQ ID NO: 252) RAG7-821 ACTCCTCCCTTTTAATATC ACUCCUCCCUUUUAAUAUCTT GAUAUUAAAAGGGAGGAGUTT 1.25 0.32 (SEQ ID NO: 417) (SEQ ID NO: 89) (SEQ ID NO: 253) RAG7-109 GAAGAAAGCTTCCGAAAAG GAAGAAAGCUUCCGAAAAGTT CUUUUCGGAAGCUUUCUUCTT 1.24 0.31 (SEQ ID NO: 418) (SEQ ID NO: 90) (SEQ ID NO: 254) RAG7-168 TGACCTTTCTTAGGGACTT UGACCUUUCUUAGGGACUUTT AAGUCCCUAAGAAAGGUCATT 1.24 0.31 (SEQ ID NO: 419) (SEQ ID NO: 91) (SEQ ID NO: 255) RAG7-905 AAGCTCAAAGCTATGTTGG AAGCUCAAAGCUAUGUUGGTT CCAACAUAGCUUUGAGCUUTT 1.24 0.31 (SEQ ID NO: 420) (SEQ ID NO: 92) (SEQ ID NO: 256) RAG7-43 CAGTTCCGGGTTTGCACCC CAGUUCCGGGUUUGCACCCTT GGGUGCAAACCCGGAACUGTT 1.23 0.30 (SEQ ID NO: 421) (SEQ ID NO: 93) (SEQ ID NO: 257) RAG7-31 TGCACCCGGTCTTCTTGCC UGCACCCGGUCUUCUUGCCTT GGCAAGAAGACCGGGUGCATT 1.23 0.30 (SEQ ID NO: 422) (SEQ ID NO: 94) (SEQ ID NO: 258) RAG7-741 CCTCCTGGGTTTAACGCAT CCUCCUGGGUUUAACGCAUTT AUGCGUUAAACCCAGGAGGTT 1.23 0.30 (SEQ ID NO: 423) (SEQ ID NO: 95) (SEQ ID NO: 259) RAG7-102 GCTTCCGAAAAGGACACCG GCUUCCGAAAAGGACACCGTT CGGUGUCCUUUUCGGAAGCTT 1.23 0.30 (SEQ ID NO: 424) (SEQ ID NO: 96) (SEQ ID NO: 260) RAG7-181 CTCAATTTGCCTTTGACCT CUCAAUUUGCCUUUGACCUTT AGGUCAAAGGCAAAUUGAGTT 1.22 0.29 (SEQ ID NO: 425) (SEQ ID NO: 97) (SEQ ID NO: 261) RAG7-693 GGTCCTATGCATCCTCATC GGUCCUAUGCAUCCUCAUCTT GAUGAGGAUGCAUAGGACCTT 1.22 0.29 (SEQ ID NO: 426) (SEQ ID NO: 98) (SEQ ID NO: 262) RAG7-149 GTTTTCTGCTTTTCCTGCT GUUUUCUGCUUUUCCUGCUTT AGCAGGAAAAGCAGAAAACTT 1.22 0.29 (SEQ ID NO: 427) (SEQ ID NO: 99) (SEQ ID NO: 263) RAG7-151 TTGTTTTCTGCTTTTCCTG UUGUUUUCUGCUUUUCCUGTT CAGGAAAAGCAGAAAACAATT 1.22 0.28 (SEQ ID NO: 428) (SEQ ID NO: 100) (SEQ ID NO: 264) RAG7-884 CAGAAGTAAAGTGTTCTCG CAGAAGUAAAGUGUUCUCGTT CGAGAACACUUUACUUCUGTT 1.22 0.28 (SEQ ID NO: 429) (SEQ ID NO: 101) (SEQ ID NO: 265) RAG7-143 TGCTTTTCCTGCTCTTTGT UGCUUUUCCUGCUCUUUGUTT ACAAAGAGCAGGAAAAGCATT 1.21 0.28 (SEQ ID NO: 430) (SEQ ID NO: 102) (SEQ ID NO: 266) RAG7-122 GCTGATCTCCTGGGAAGAA GCUGAUCUCCUGGGAAGAATT UUCUUCCCAGGAGAUCAGCTT 1.21 0.28 (SEQ ID NO: 431) (SEQ ID NO: 103) (SEQ ID NO: 267) RAG7-89 ACACCGTTTCAGGGGCGAG ACACCGUUUCAGGGGCGAGTT CUCGCCCCUGAAACGGUGUTT 1.21 0.28 (SEQ ID NO: 432) (SEQ ID NO: 104) (SEQ ID NO: 268) RAG7-96 GAAAAGGACACCGTTTCAG GAAAAGGACACCGUUUCAGTT CUGAAACGGUGUCCUUUUCTT 1.21 0.27 (SEQ ID NO: 433) (SEQ ID NO: 105) (SEQ ID NO: 269) RAG7-708 TTTCTTCTCAGCCCAGGTC UUUCUUCUCAGCCCAGGUCTT GACCUGGGCUGAGAAGAAATT 1.21 0.27 (SEQ ID NO: 434) (SEQ ID NO: 106) (SEQ ID NO: 270) RAG7-679 TCATCTATGCAGGGCTGTT UCAUCUAUGCAGGGCUGUUTT AACAGCCCUGCAUAGAUGATT 1.20 0.27 (SEQ ID NO: 435) (SEQ ID NO: 107) (SEQ ID NO: 271) RAG7-828 TGAGGAAACTCCTCCCTTT UGAGGAAACUCCUCCCUUUTT AAAGGGAGGAGUUUCCUCATT 1.20 0.26 (SEQ ID NO: 436) (SEQ ID NO: 108) (SEQ ID NO: 272) RAG7-837 AGGGGCCATTGAGGAAACT AGGGGCCAUUGAGGAAACUTT AGUUUCCUCAAUGGCCCCUTT 1.20 0.26 (SEQ ID NO: 437) (SEQ ID NO: 109) (SEQ ID NO: 273) RAG7-176 TTTGCCTTTGACCTTTCTT UUUGCCUUUGACCUUUCUUTT AAGAAAGGUCAAAGGCAAATT 1.20 0.26 (SEQ ID NO: 438) (SEQ ID NO: 110) (SEQ ID NO: 274) RAG7-128 TTGTCCGCTGATCTCCTGG UUGUCCGCUGAUCUCCUGGTT CCAGGAGAUCAGCGGACAATT 1.20 0.26 (SEQ ID NO: 439) (SEQ ID NO: 111) (SEQ ID NO: 275) RAG7-704 TTCTCAGCCCAGGTCCTAT UUCUCAGCCCAGGUCCUAUTT AUAGGACCUGGGCUGAGAATT 1.19 0.26 (SEQ ID NO: 440) (SEQ ID NO: 112) (SEQ ID NO: 276) RAG7-193 GTGCCTTCTTGTCTCAATT GUGCCUUCUUGUCUCAAUUTT AAUUGAGACAAGAAGGCACTT 1.19 0.26 (SEQ ID NO: 441) (SEQ ID NO: 113) (SEQ ID NO: 277) RAG7-735 GGGTTTAACGCATTGTGAT GGGUUUAACGCAUUGUGAUTT AUCACAAUGCGUUAAACCCTT 1.19 0.25 (SEQ ID NO: 442) (SEQ ID NO: 114) (SEQ ID NO: 278) RAG7-889 TGGACCAGAAGTAAAGTGT UGGACCAGAAGUAAAGUGUTT ACACUUUACUUCUGGUCCATT 1.19 0.25 (SEQ ID NO: 443) (SEQ ID NO: 115) (SEQ ID NO: 279) RAG7-185 TTGTCTCAATTTGCCTTTG UUGUCUCAAUUUGCCUUUGTT CAAAGGCAAAUUGAGACAATT 1.19 0.25 (SEQ ID NO: 444) (SEQ ID NO: 116) (SEQ ID NO: 280) RAG7-111 GGGAAGAAAGCTTCCGAAA GGGAAGAAAGCUUCCGAAATT UUUCGGAAGCUUUCUUCCCTT 1.18 0.24 (SEQ ID NO: 445) (SEQ ID NO: 117) (SEQ ID NO: 281) RAG7-698 GCCCAGGTCCTATGCATCC GCCCAGGUCCUAUGCAUCCTT GGAUGCAUAGGACCUGGGCTT 1.18 0.24 (SEQ ID NO: 446) (SEQ ID NO: 118) (SEQ ID NO: 282) RAG7-33 TTTGCACCCGGTCTTCTTG UUUGCACCCGGUCUUCUUGTT CAAGAAGACCGGGUGCAAATT 1.18 0.24 (SEQ ID NO: 447) (SEQ ID NO: 119) (SEQ ID NO: 283) RAG7-113 CTGGGAAGAAAGCTTCCGA CUGGGAAGAAAGCUUCCGATT UCGGAAGCUUUCUUCCCAGTT 1.18 0.24 (SEQ ID NO: 448) (SEQ ID NO: 120) (SEQ ID NO: 284) RAG7-44 CCAGTTCCGGGTTTGCACC CCAGUUCCGGGUUUGCACCTT GGUGCAAACCCGGAACUGGTT 1.18 0.24 (SEQ ID NO: 449) (SEQ ID NO: 121) (SEQ ID NO: 285) RAG7-710 GTTTTCTTCTCAGCCCAGG GUUUUCUUCUCAGCCCAGGTT CCUGGGCUGAGAAGAAAACTT 1.18 0.24 (SEQ ID NO: 450) (SEQ ID NO: 122) (SEQ ID NO: 286) RAG7-187 TCTTGTCTCAATTTGCCTT UCUUGUCUCAAUUUGCCUUTT AAGGCAAAUUGAGACAAGATT 1.17 0.22 (SEQ ID NO: 451) (SEQ ID NO: 123) (SEQ ID NO: 287) RAG7-692 GTCCTATGCATCCTCATCT GUCCUAUGCAUCCUCAUCUTT AGAUGAGGAUGCAUAGGACTT 1.17 0.22 (SEQ ID NO: 452) (SEQ ID NO: 124) (SEQ ID NO: 288) RAG7-100 TTCCGAAAAGGACACCGTT UUCCGAAAAGGACACCGUUTT AACGGUGUCCUUUUCGGAATT 1.17 0.22 (SEQ ID NO: 453) (SEQ ID NO: 125) (SEQ ID NO: 289) RAG7-709 TTTTCTTCTCAGCCCAGGT UUUUCUUCUCAGCCCAGGUTT ACCUGGGCUGAGAAGAAAATT 1.17 0.22 (SEQ ID NO: 454) (SEQ ID NO: 126) (SEQ ID NO: 290) RAG7-726 GCATTGTGATAGTCCCGTT GCAUUGUGAUAGUCCCGUUTT AACGGGACUAUCACAAUGCTT 1.17 0.22 (SEQ ID NO: 455) (SEQ ID NO: 127) (SEQ ID NO: 291) RAG7-852 ACTTGAAAGGTTCCGAGGG ACUUGAAAGGUUCCGAGGGTT CCCUCGGAACCUUUCAAGUTT 1.17 0.22 (SEQ ID NO: 456) (SEQ ID NO: 128) (SEQ ID NO: 292) RAG7-844 GGTTCCGAGGGGCCATTGA GGUUCCGAGGGGCCAUUGATT UCAAUGGCCCCUCGGAACCTT 1.17 0.22 (SEQ ID NO: 457) (SEQ ID NO: 129) (SEQ ID NO: 293) RAG7-190 CCTTCTTGTCTCAATTTGC CCUUCUUGUCUCAAUUUGCTT GCAAAUUGAGACAAGAAGGTT 1.16 0.22 (SEQ ID NO: 458) (SEQ ID NO: 130) (SEQ ID NO: 294) RAG7-736 TGGGTTTAACGCATTGTGA UGGGUUUAACGCAUUGUGATT UCACAAUGCGUUAAACCCATT 1.16 0.22 (SEQ ID NO: 459) (SEQ ID NO: 131) (SEQ ID NO: 295) RAG7-170 TTTGACCTTTCTTAGGGAC UUUGACCUUUCUUAGGGACTT GUCCCUAAGAAAGGUCAAATT 1.16 0.21 (SEQ ID NO: 460) (SEQ ID NO: 132) (SEQ ID NO: 296) RAG7-721 GTGATAGTCCCGTTTTCTT GUGAUAGUCCCGUUUUCUUTT AAGAAAACGGGACUAUCACTT 1.16 0.21 (SEQ ID NO: 461) (SEQ ID NO: 133) (SEQ ID NO: 297) RAG7-198 TACAGGTGCCTTCTTGTCT UACAGGUGCCUUCUUGUCUTT AGACAAGAAGGCACCUGUATT 1.16 0.21 (SEQ ID NO: 462) (SEQ ID NO: 134) (SEQ ID NO: 298) RAG7-722 TGTGATAGTCCCGTTTTCT UGUGAUAGUCCCGUUUUCUTT AGAAAACGGGACUAUCACATT 1.15 0.21 (SEQ ID NO: 463) (SEQ ID NO: 135) (SEQ ID NO: 299) RAG7-670 CAGGGCTGTTATCTGCATA CAGGGCUGUUAUCUGCAUATT UAUGCAGAUAACAGCCCUGTT 1.15 0.20 (SEQ ID NO: 464) (SEQ ID NO: 136) (SEQ ID NO: 300) RAG7-86 CCGTTTCAGGGGCGAGTGA CCGUUUCAGGGGCGAGUGATT UCACUCGCCCCUGAAACGGTT 1.15 0.20 (SEQ ID NO: 465) (SEQ ID NO: 137) (SEQ ID NO: 301) RAG7-833 GCCATTGAGGAAACTCCTC GCCAUUGAGGAAACUCCUCTT GAGGAGUUUCCUCAAUGGCTT 1.14 0.20 (SEQ ID NO: 466) (SEQ ID NO: 138) (SEQ ID NO: 302) RAG7-832 CCATTGAGGAAACTCCTCC CCAUUGAGGAAACUCCUCCTT GGAGGAGUUUCCUCAAUGGTT 1.14 0.19 (SEQ ID NO: 467) (SEQ ID NO: 139) (SEQ ID NO: 303) RAG7-702 CTCAGCCCAGGTCCTATGC CUCAGCCCAGGUCCUAUGCTT GCAUAGGACCUGGGCUGAGTT 1.14 0.19 (SEQ ID NO: 468) (SEQ ID NO: 140) (SEQ ID NO: 304) RAG7-120 TGATCTCCTGGGAAGAAAG UGAUCUCCUGGGAAGAAAGTT CUUUCUUCCCAGGAGAUCATT 1.14 0.19 (SEQ ID NO: 469) (SEQ ID NO: 141) (SEQ ID NO: 305) RAG7-780 AATGTAGCATCGAGTGGTA AAUGUAGCAUCGAGUGGUATT UACCACUCGAUGCUACAUUTT 1.14 0.19 (SEQ ID NO: 470) (SEQ ID NO: 142) (SEQ ID NO: 306) RAG7-914 CTTCTTTGAAAGCTCAAAG CUUCUUUGAAAGCUCAAAGTT CUUUGAGCUUUCAAAGAAGTT 1.14 0.19 (SEQ ID NO: 471) (SEQ ID NO: 143) (SEQ ID NO: 307) RAG7-93 AAGGACACCGTTTCAGGGG AAGGACACCGUUUCAGGGGTT CCCCUGAAACGGUGUCCUUTT 1.14 0.19 (SEQ ID NO: 472) (SEQ ID NO: 144) (SEQ ID NO: 308) RAG7-98 CCGAAAAGGACACCGTTTC CCGAAAAGGACACCGUUUCTT GAAACGGUGUCCUUUUCGGTT 1.14 0.19 (SEQ ID NO: 473) (SEQ ID NO: 145) (SEQ ID NO: 309) RAG7-853 AACTTGAAAGGTTCCGAGG AACUUGAAAGGUUCCGAGGTT CCUCGGAACCUUUCAAGUUTT 1.14 0.18 (SEQ ID NO: 474) (SEQ ID NO: 146) (SEQ ID NO: 310) RAG7-885 CCAGAAGTAAAGTGTTCTC CCAGAAGUAAAGUGUUCUCTT GAGAACACUUUACUUCUGGTT 1.14 0.18 (SEQ ID NO: 475) (SEQ ID NO: 147) (SEQ ID NO: 311) RAG7-715 GTCCCGTTTTCTTCTCAGC GUCCCGUUUUCUUCUCAGCTT GCUGAGAAGAAAACGGGACTT 1.13 0.18 (SEQ ID NO: 476) (SEQ ID NO: 148) (SEQ ID NO: 312) RAG7-681 CCTCATCTATGCAGGGCTG CCUCAUCUAUGCAGGGCUGTT CAGCCCUGCAUAGAUGAGGTT 1.13 0.17 (SEQ ID NO: 477) (SEQ ID NO: 149) (SEQ ID NO: 313) RAG7-106 GAAAGCTTCCGAAAAGGAC GAAAGCUUCCGAAAAGGACTT GUCCUUUUCGGAAGCUUUCTT 1.12 0.17 (SEQ ID NO: 478) (SEQ ID NO: 150) (SEQ ID NO: 314) RAG7-137 TCCTGCTCTTTGTCCGCTG UCCUGCUCUUUGUCCGCUGTT CAGCGGACAAAGAGCAGGATT 1.12 0.16 (SEQ ID NO: 479) (SEQ ID NO: 151) (SEQ ID NO: 315) RAG7-165 CCTTTCTTAGGGACTTGTT CCUUUCUUAGGGACUUGUUTT AACAAGUCCCUAAGAAAGGTT 1.11 0.16 (SEQ ID NO: 480) (SEQ ID NO: 152) (SEQ ID NO: 316) RAG7-160 CTTAGGGACTTGTTTTCTG CUUAGGGACUUGUUUUCUGTT CAGAAAACAAGUCCCUAAGTT 1.11 0.15 (SEQ ID NO: 481) (SEQ ID NO: 153) (SEQ ID NO: 317) RAG7-745 AAAACCTCCTGGGTTTAAC AAAACCUCCUGGGUUUAACTT GUUAAACCCAGGAGGUUUUTT 1.11 0.15 (SEQ ID NO: 482) (SEQ ID NO: 154) (SEQ ID NO: 318) RAG7-92 AGGACACCGTTTCAGGGGC AGGACACCGUUUCAGGGGCTT GCCCCUGAAACGGUGUCCUTT 1.11 0.15 (SEQ ID NO: 483) (SEQ ID NO: 155) (SEQ ID NO: 319) RAG7-107 AGAAAGCTTCCGAAAAGGA AGAAAGCUUCCGAAAAGGATT UCCUUUUCGGAAGCUUUCUTT 1.11 0.15 (SEQ ID NO: 484) (SEQ ID NO: 156) (SEQ ID NO: 320) RAG7-826 AGGAAACTCCTCCCTTTTA AGGAAACUCCUCCCUUUUATT UAAAAGGGAGGAGUUUCCUTT 1.11 0.15 (SEQ ID NO: 485) (SEQ ID NO: 157) (SEQ ID NO: 321) RAG7-839 CGAGGGGCCATTGAGGAAA CGAGGGGCCAUUGAGGAAATT UUUCCUCAAUGGCCCCUCGTT 1.11 0.15 (SEQ ID NO: 486) (SEQ ID NO: 158) (SEQ ID NO: 322) RAG7-738 CCTGGGTTTAACGCATTGT CCUGGGUUUAACGCAUUGUTT ACAAUGCGUUAAACCCAGGTT 1.11 0.15 (SEQ ID NO: 487) (SEQ ID NO: 159) (SEQ ID NO: 323) RAG7-911 CTTTGAAAGCTCAAAGCTA CUUUGAAAGCUCAAAGCUATT UAGCUUUGAGCUUUCAAAGTT 1.11 0.15 (SEQ ID NO: 488) (SEQ ID NO: 160) (SEQ ID NO: 324) RAG7-883 AGAAGTAAAGTGTTCTCGT AGAAGUAAAGUGUUCUCGUTT ACGAGAACACUUUACUUCUTT 1.11 0.14 (SEQ ID NO: 489) (SEQ ID NO: 161) (SEQ ID NO: 325) RAG7-687 ATGCATCCTCATCTATGCA AUGCAUCCUCAUCUAUGCATT UGCAUAGAUGAGGAUGCAUTT 1.10 0.14 (SEQ ID NO: 490) (SEQ ID NO: 162) (SEQ ID NO: 326) RAG7-851 CTTGAAAGGTTCCGAGGGG CUUGAAAGGUUCCGAGGGGTT CCCCUCGGAACCUUUCAAGTT 1.10 0.14 (SEQ ID NO: 491) (SEQ ID NO: 163) (SEQ ID NO: 327) RAG7-142 GCTTTTCCTGCTCTTTGTC GCUUUUCCUGCUCUUUGUCTT GACAAAGAGCAGGAAAAGCTT 1.10 0.14 (SEQ ID NO: 492) (SEQ ID NO: 164) (SEQ ID NO: 328)
[0103] When the 290 saRNAs were sorted by their targeting positions on the LHPP promoter, it can be clearly seen that the functional saRNAs were distributed across the promoter region in a cluster fashion, i.e., at certain promoter regions, there were “hot spots” where functional sRNAs were enriched (
[0104] The sequence of the hot spot H1 (5′ to 3′: −917 to −844) corresponds to position 1 to position 74 from 5′ to 3′ of SEQ ID NO: 500:
TABLE-US-00006 tctcttc tttgaaagct caaagctatg ttggaccaga agtaaagtgt tctcgtttct atttaataac ttgaaag
[0105] The sequence of the hot spot H2 (5′ to 3′: −710 to −675) corresponds to position 1 to position 36 from 5′ to 3′ of SEQ ID NO: 501:
TABLE-US-00007 gttttcttct cagcccaggt cctatgcatc ctcatc
[0106] The sequence of the hot spot H3 (5′ to 3′: −198 to −168) corresponds to position 1 to position 31 from 5′ to 3′ of SEQ ID NO: 502:
TABLE-US-00008 tacaggtg ccttcttgtc tcaatttgcc ttt
[0107] The sequence of the hot spot H4 (5′ to 3′: −151 to −28) corresponds to position 1 to position 124 from 5′ to 3′ of SEQ ID NO: 503:
TABLE-US-00009 t tgttttctgc ttttcctgct ctttgtccgc tgatctcctg ggaagaaagc ttccgaaaag gacaccgttt caggggcgag tgacgccggg gtgcccaggc cgcgccccag ttccgggttt gca
[0108] The sequence of the hot spot HC (5′ to 3′: −845 to −711) corresponds to position 1 to position 135 from 5′ to 3′ of SEQ ID NO: 504:
TABLE-US-00010 aggtt ccgaggggcc attgaggaaa ctcctccctt ttaatatcaa tgtgtattta ttgcaaaaat aatgtagcat cgagtggtat tttatagctt atccaaaaac ctcctgggtt taacgcattg tgatagtccc.
Example 3
saRNAs Promoted LHPP mRNA Expression and Inhibited Tumor Cell Proliferation
[0109] The 290 saRNAs targeting the LHPP promoter were individually transfected into Huh7 cells, and 72 hour later, one-step RT-qPCR was employed to analyze the expression levels of LHPP mRNA and cell viability was detected by the CCK-8 method. As shown in
[0110] The cell viability was detected by the following CCK-8 method: the cCells were plated into a 96-well plate at 3-5×10.sup.3 cells/well, cultured overnight, and transfected with the oligonucleotide duplexes. After 72 hour of transfection, 10 uL of CCK-8 solution (Dojindo Molecular Technologies) was added into each well. After 1 hour of incubation at 37° C., a microplate reader was used to measure absorbances at 450 nm.
Example 4
saRNAs Promoted LHPP Protein Expression
[0111] Cells were plated into a 96-well plate at 3-5×10.sup.3 cells/well, cultured overnight, and transfected with 10 randomly selected oligonucleotide duplexes. After 72 h of transfection, the cells were collected and lysed using cell lysis buffer (1×RIPA buffer, CST) containing protease inhibitor. Protein quantification was performed by using the BCA method (Thermo). After polyacrylamide gel electrophoresis separation, then the protein was transferred to a 0.45 μm PVDF membrane. The primary antibody used for the blot assay was a mouse monoclonal anti-LHPP antibody (Invitrogen), a rabbit polyclonal anti-AKT antibody (Cell Signaling Technology), a rabbit polyclonal anti-pAKT antibody (Cell Signaling Technology), or a rabbit polyclonal anti-α/β-tubulin antibody (Cell Signaling Technology); and the secondary antibody used was an anti-mouse IgG HRP-linked antibody (Cell Signaling Technology) or an anti-rabbit IgG HRP-linked antibody (Cell Signaling Technology). Image Lab (BIO-RAD, Chemistry Doctm MP imaging system) was used to scan detecting signals.
TABLE-US-00011 TABLE 5 Double-stranded RNA sequences as study controls Double-stranded RNA Sequence No. Sequence (5′-3′) dsCon2-sense SEQ ID NO: 505 ACUACUGAGUGACAGUAGATT strand SEQ ID NO: 506 UCUACUGUCACUCAGUAGUTT dsCon2- antisense strand siLHPP1-sense SEQ ID NO: 507 GAAGUUCAGAGCCGCUCAATT strand SEQ ID NO: 508 UUGAGCGGCUCUGAACUUCTT siLHPP1- antisense strand
[0112] As shown in
Example 5
saRNAs Inhibited Proliferation of A Variety of Tumor Cells
[0113] In order to further evaluate the effect of LHPP saRNAs in inducing the mRNA expression of the LHPP gene and inhibiting the proliferation of cancer cells, eight screened saRNAs (RAG7-132, RAG7-133, RAG7-139, RAG7-177, RAG7-178, RAG7-694, RAG7-707 and RAG7-892) each were transfected into the liver cancer cell lines Huh7 (Medical Cell Resource Center, Tohoku University, Japan), HepG2 (ATCC), Hep3B (ATCC), Li-7 (Medical Cell Resource Center, Tohoku University, Japan) and SK-HEP-1 (ATCC), a lung cancer cell line A549 (ATCC), a bladder cancer cell line T24 (ATCC), a prostatic cancer cell line PC3 (ATCC), and a glioma cell line U87MG (ATCC). The mRNA expression and cell viability of the transfected cells were measured. As shown in
Example 6
saRNAs in Combination with Chemotherapies Inhibited Cell Proliferation
[0114] The compounds used in the study included: Sorafenib (Sora) (SELLECK, S1040), Lenvatinib (Lenv) (SELLECK, S1164), Regorafenib (Rego) (SELLECK, S1178), and Cabozantinib (Cabo) (SELLECK, S1119). Cells were transfected with each candidate saRNAs at varying concentration gradients for 24 hours. Thereafter, the aforementioned compounds were added to the transfected cells at a concentration of 5 μM and the cells were incubated with the compounds for 48 hours. Cell viability was measured using the CCK-8 method. Compusyn© version 1.0 software (ComboSyn, Inc. Paramus, N.J., USA) was used to analyze the combination index (CI) of drugs, wherein CI<1 represented a synergistic effect, CI=1 represented an additive effect, and CI>1 represented an antagonistic effect.
[0115] As shown for HepG2 cells in
Example 7
Drug Combination Inhibited Tumor Growth In Vivo in Mice Xenograped with Human HepG2
[0116] To prepare the saRNA formulation, the in vivo-jetPEI (201-10G, Polyplus-transfection, France) was adopted as an saRNA delivery system. The preparation process is briefly described as follows. An saRNA was first diluted in 10% glucose solution to obtain a solution A. According to the instructions of the manufacturer, a required amount of in vivo-jetPEI was diluted in 10% glucose solution to obtain a solution B. Equal volumes of the solution A and the solution B were mixed (nitrogen-to-phosphorus ratio: 8; final concentration of glucose: 5%). After mixing, the mixture was let to stand at room temperature for 15 minutes for later use.
[0117] HepG2 cells in the logarithmic growth phase were obtained and counted, and then the cell suspension was regulated to 5×10.sup.7 cells/mL and subcutaneously inoculated into the right armpit of BALB/c nude mice at a volume of 0.1 mL per mouse. When tumors in nude mice grew to about 100 mm.sup.3, the nude mice were randomly divided into four groups each with six mice: (vehicle control (Vehicle) group, saRNA group, regorafenib group, and saRNA and regorafenib combination group (saRNA+regorafenib)). For the saRNA group and the saRNA+regorafenib group, intratumor injection of saRNA was performed at 1 mg.Math.kg.sup.−1 on days 1, 4, 7 and 10. For the regorafenib group and the saRNA+regorafenib group, intragastric administration of regorafenib was performed at 3 mg kg.sup.−1 everyday from day 1 through day 12. Starting from the initial administration, the long diameter and the short diameter of each tumor were measured with a vernier caliper every two days. The tumor volume was calculated according to the formula V=(l×w.sup.2)/2, wherein 1 represents the longest diameter of the tumor and w represents the diameter parallel to the surface of the tumor and perpendicular to the long diameter. A tumor growth curve and size and morphology of the tumor after anatomy were recorded during the administration. As shown in
Example 8
Drug Combination Inhibited Tumor Growth In Vivo in Mice Xenograped with Human U87MG
[0118] To prepare the saRNA formulation, in vivo-jetPEI (201-10G, Polyplus-transfection, France) was adopted as an saRNA delivery system. The preparation process is briefly described as follows: an saRNA was first diluted in a 10% glucose solution to give a solution A; a required amount of in vivo-jetPEI was diluted in 10% glucose solution to give a solution B; then, equal volumes of the solution A and the solution B were mixed (nitrogen-to-phosphorus ratio: 8; final concentration of glucose: 5%). After mixing the mixture was let to stand under room temperature for 15 minutes for later use.
[0119] The glioma cell line U87MG were grown to the logarithmic phase, counted, then subcutaneously inoculated at a concentration of 9×10.sup.7 cells/mL into the right armpit of BALB/c nude mice at 0.1 mL per mouse. Tumor-bearing nude mice were randomly divided into four groups after tumors grew to about 100 mm.sup.3 (vehicle control group, saRNA group, regorafenib group, and saRNA and regorafenib in combination group (RAG7-133+regorafenib group)) with seven mice in each group. For the saRNA group and the saRNA+regorafenib group, intratumor injection of saRNA at 1 mg.Math.kg.sup.−1 was performed on days 1, 4, 7 and 10. For the regorafenib group and the saRNA+regorafenib group, intragastric administeration of regorafenib at 3 mg.Math.kg.sup.−1 was performed every day on day 1 through day 12. After the initial administration, the long diameter and short diameter of each tumor were measured with a vernier caliper every two days. The tumor volume was calculated according to the formula V=(l×w.sup.2)/2, wherein 1 represents the longest diameter of the tumor and w represents the diameter parallel to the surface of the tumor and perpendicular to the long diameter. A tumor growth curve and size and morphology of the tumor anatomy were recorded during the administration. As shown in
[0120] Based on the results above, a plurality of saRNAs capable of remarkably activating the expression of LHPP gene were identified through high-throughput screening of saRNAs targeting LHPP gene promoter. These saRNAs inhibit the proliferation of tumor cells in vitro or in vivo by up-regulating the expression of LHPP gene and protein and downregulating the phosphorylation of AKT. These results clearly suggest that saRNAs targeting the LHPP gene promoter can be a promising strategy for tumor treatment.
REFERENCES
[0121] 1. Yokoi F, Hiraishi H, Izuhara K. 2003. Molecular cloning of a cDNA for the human phospholysine phosphohistidine inorganic pyrophosphate phosphatase. J Biochem 133:607-14. [0122] 2. Neff C D, Abkevich V, Packer J C, Chen Y, Potter J, et al. 2009. Evidence for HTR1A and LHPP as interacting genetic risk factors in major depression. Mol Psychiatry 14:621-30. [0123] 3. Gohla A. 2019. Do metabolic HAD phosphatases moonlight as protein phosphatases? Biochim Biophys Acta Mol Cell Res 1866:153-66. [0124] 4. CONVERGE consortium. 2015. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature 523:588-91. [0125] 5. Knowles E E, Kent J W, Jr., McKay D R, Sprooten E, Mathias S R, et al. 2016. Genome-wide linkage on chromosome 10q26 for a dimensional scale of major depression. J Affect Disord 191:123-31. [0126] 6. Polimanti R, Wang Q, Meda S A, Patel K T, Pearlson G D, et al. 2017. The Interplay Between Risky Sexual Behaviors and Alcohol Dependence: Genome-Wide Association and Neuroimaging Support for LHPP as a Risk Gene. Neuropsychopharmacology 42:598-605. [0127] 7. Cui L, Gong X, Tang Y, Kong L, Chang M, et al. 2016. Relationship between the LHPP Gene Polymorphism and Resting-State Brain Activity in Major Depressive Disorder. Neural Plast 2016:9162590. [0128] 8. Lesseur C, Diergaarde B, Olshan A F, Wunsch-Filho V, Ness A R, et al. 2016. Genome-wide association analyses identify new susceptibility loci for oral cavity and pharyngeal cancer. Nat Genet 48:1544-50. [0129] 9. Gutierrez-Camino A, Martin-Guerrero I, Garcia-Orad A. 2017. Genetic susceptibility in childhood acute lymphoblastic leukemia. Med Oncol 34:179. [0130] 10. Vijayakrishnan J, Kumar R, Henrion M Y, Moorman A V, Rachakonda P S, et al. 2017. A genome-wide association study identifies risk loci for childhood acute lymphoblastic leukemia at 10q26.13 and 12q23.1. Leukemia 31:573-9. [0131] 11. Hindupur S K, Colombi M, Fuhs S R, Matter M S, Guri Y, et al. 2018. The protein histidine phosphatase LHPP is a tumour suppressor. Nature 555:678-82. [0132] 12. Bray F, Ferlay J, Soerjomataram I, Siegel R L, Torre L A, Jemal A. 2018. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68:394-424.