COMPOSITIONS AND METHODS FOR TREATING SENSORINEURAL HEARING LOSS USING OTOFERLIN DUAL VECTOR SYSTEMS
20210395781 · 2021-12-23
Inventors
- Joseph BURNS (Newton, MA, US)
- Kathryn Ellis (Arlington, MA, US)
- Adam PALERMO (Somerville, MA, US)
- Martin SCHWANDER (Auburndale, MA, US)
- Jonathon Whitton (Cambridge, MA, US)
- Leah Sabin (Goldens Bridge, NY, US)
- Christos Kyratsous (Irvington, NY, US)
- Meghan Drummond Samuelson (Katonah, NY, US)
Cpc classification
C12Y207/10001
CHEMISTRY; METALLURGY
C07K14/705
CHEMISTRY; METALLURGY
C12N2750/14143
CHEMISTRY; METALLURGY
C12N2800/40
CHEMISTRY; METALLURGY
C12N2830/48
CHEMISTRY; METALLURGY
A61K48/005
HUMAN NECESSITIES
C12Y402/01011
CHEMISTRY; METALLURGY
C12N2840/44
CHEMISTRY; METALLURGY
C12N2830/008
CHEMISTRY; METALLURGY
A61K48/0075
HUMAN NECESSITIES
C12N15/86
CHEMISTRY; METALLURGY
International classification
C12N15/86
CHEMISTRY; METALLURGY
A61K48/00
HUMAN NECESSITIES
Abstract
The disclosure features compositions and methods for the treatment of sensorineural hearing loss and auditory neuropathy, particularly forms of the disease that are associated with mutations in otoferlin (OTOF), by way of OTOF gene therapy. The disclosure provides a variety of compositions that include a first nucleic acid vector that contains a polynucleotide encoding an N-terminal portion of an OTOF protein and a second nucleic acid vector that contains a polynucleotide encoding a C-terminal portion of an OTOF protein. These vectors can be used to increase the expression of OTOF in a subject, such as a human subject suffering from sensorineural hearing loss.
Claims
1. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a myosin 15 (Myo15) promoter, a vesicular glutamate transporter 3 (VGLUT3) promoter, and a fibroblast growth factor 8 (FGF8) promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an otoferlin (OTOF) protein; and a second nucleic acid vector comprising a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein and a polyadenylation (poly(A)) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide partially overlaps with the second coding polynucleotide, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and, when introduced into a mammalian cell, the two first and second nucleic acid vectors undergo homologous recombination to form a recombined nucleic acid that encodes a full-length OTOF protein.
2. The composition of claim 1, wherein each of the first and second coding polynucleotides encode about half of the OTOF protein sequence.
3. The composition of claim 1 or 2, wherein the first coding polynucleotide overlaps with the second coding polynucleotide by at least 1 kilobase (kb).
4. The composition of any one of claims 1-3, wherein the region of overlap between the first and second coding polynucleotides is centered at an OTOF exon boundary.
5. The composition of claim 4, wherein the first coding polynucleotide encodes an N-terminal portion of the OTOF protein comprising the OTOF N-terminus to 500 kb 3′ of the exon boundary at the center of the overlap region; and the second coding polynucleotide encodes a C-terminal portion of the OTOF protein comprising 500 kb 5′ of the exon boundary at the center of the overlap region to the OTOF C-terminus.
6. The composition of claim 4 or 5, wherein the exon boundary at the center of the overlap region is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain.
7. The composition of any one of claims 1-6, wherein the promoter is a Myo15 promoter.
8. The composition of claim 7, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2C domain and the second coding polynucleotide encodes the entire C2D domain.
9. The composition of claim 7 or 8, wherein the first coding polynucleotide comprises exons 1-21 of a polynucleotide encoding the OTOF protein and 500 kb 3′ of the exon 21/22 boundary; and the second coding polynucleotide comprises 500 kb 5′ of the exon 21/22 boundary and exons 22-48 of a polynucleotide encoding the OTOF protein.
10. The composition of any one of claims 7-9, wherein the first nucleic acid vector and the second nucleic acid vector do not comprise OTOF untranslated regions (UTRs).
11. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a Myo15 promoter, a VGLUT3 promoter, and an FGF8 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an OTOF protein, and a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide; and a second nucleic acid vector comprising a splice acceptor signal sequence, a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, and wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein.
12. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a Myo15 promoter, a VGLUT3 promoter, and an FGF8 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an OTOF protein, a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide, and a recombinogenic region positioned 3′ of the splice donor signal sequence; and a second nucleic acid vector comprising a second recombinogenic region, a splice acceptor signal sequence positioned 3′ of the second recombinogenic region, a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, and wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein.
13. The composition of claim 12, wherein the first and second recombinogenic regions are the same.
14. The composition of claim 12 or 13, wherein the recombinogenic region is an AP gene fragment or an F1 phage AK gene.
15. The composition of any one of claims 12-14, wherein the first nucleic acid vector further comprises a degradation signal sequence positioned 3′ of the recombinogenic region; and wherein the second nucleic acid vector further comprises a degradation signal sequence positioned between the recombinogenic region and the splice acceptor signal sequence.
16. The composition of any one of claims 11-15, wherein each of the first and second coding polynucleotides encode about half of the OTOF protein sequence.
17. The composition of any one of claims 11-16, wherein the division between the first and second coding polynucleotides is at an OTOF exon boundary.
18. The composition of any one of claims 11-17, wherein the promoter is a Myo15 promoter.
19. The composition of claim 18, wherein the exon boundary is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain.
20. The composition of claim 18 or 19, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2C domain and the second coding polynucleotide encodes the entire C2D domain.
21. The composition of any one of claims 18-20, wherein the first coding polynucleotide comprises exons 1-19 of a polynucleotide encoding the OTOF protein and the second coding polynucleotide comprises exons 20-48 of a polynucleotide encoding the OTOF protein.
22. The composition of any one of claims 18-21, wherein the first nucleic acid vector and the second nucleic acid vector do not comprise OTOF untranslated regions (UTRs).
23. The composition of claim 18, wherein the exon boundary is within a portion of the first coding polynucleotide and the second coding polynucleotide that encodes the C2D domain.
24. The composition of claim 23, wherein the first coding polynucleotide comprises exons 1-25 of a polynucleotide encoding the OTOF protein and the second coding polynucleotide comprises exons 26-48 of a polynucleotide encoding the OTOF protein.
25. The composition of claim 23 or 24, wherein the second nucleic acid vector comprises a full-length OTOF 3′ UTR.
26. The composition of any one of claims 1-25, wherein the first and second nucleic acid vectors are adeno-associated virus (AAV) vectors.
27. The composition of claim 26, wherein the serotype of the AAV vectors is selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
28. The composition of claim 27, wherein the serotype of the AAV vectors is AAV1.
29. The composition of claim 27, wherein the serotype of the AAV vectors is AAV9.
30. The composition of any one of claims 1-29, wherein the first and second coding polynucleotides that encode the OTOF protein do not comprise introns
31. The composition of any one of claims 1-30, wherein the OTOF protein is a mammalian OTOF protein.
32. The composition of claim 31, wherein the OTOF protein is a human OTOF protein.
33. The composition of claim 32, wherein the OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5.
34. The composition of claim 33, wherein the OTOF protein has the sequence of SEQ ID NO: 1.
35. The composition of claim 33, wherein the OTOF protein has the sequence of SEQ ID NO: 5.
36. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a cytomegalovirus (CMV) promoter, a truncated CMV-chicken β-actin promoter (smCBA), a Myo15 promoter, a Myosin 7A (Myo7A) promoter, a Myosin 6 (Myo6) promoter, a POU Class 4 Homeobox 3 (POU4F3) promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein; and a second nucleic acid vector comprising a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein and a polyadenylation (poly(A)) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide partially overlaps with the second coding polynucleotide, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and, when introduced into a mammalian cell, the two first and second nucleic acid vectors undergo homologous recombination to form a recombined nucleic acid that encodes a full-length OTOF protein.
37. The composition of claim 36, wherein the first and second coding polynucleotides that encode the human OTOF protein do not comprise introns.
38. The composition of claim 36 or 37, wherein the human OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 5.
39. The composition of claim 38, wherein the human OTOF protein has the sequence of SEQ ID NO: 1.
40. The composition of claim 38, wherein the human OTOF protein has the sequence of SEQ ID NO: 5.
41. The composition of any one of claims 36-40, wherein the first and second nucleic acid vectors are AAV vectors.
42. The composition of claim 41, wherein the serotype of the AAV vectors is selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
43. The composition of claim 42, wherein the serotype of the AAV vectors is AAV1.
44. The composition of claim 42, wherein the serotype of the AAV vectors is AAV9.
45. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a CMV promoter, a smCBA promoter, a Myo15 promoter, a Myo7A promoter, a Myo6 promoter, a POU4F3 promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an OTOF protein; and a second nucleic acid vector comprising a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein and a polyadenylation (poly(A)) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide partially overlaps with the second coding polynucleotide, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and, when introduced into a mammalian cell, the two first and second nucleic acid vectors undergo homologous recombination to form a recombined nucleic acid that encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
46. The composition of claim 45, wherein the first and second coding polynucleotides that encode the OTOF protein do not comprise introns.
47. The composition of claim 45 or 46, wherein the OTOF protein is a mammalian OTOF protein.
48. The composition of claim 47, wherein the OTOF protein is a human OTOF protein.
49. The composition of claim 48, wherein the OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5.
50. The composition of claim 49, wherein the OTOF protein has the sequence of SEQ ID NO: 1.
51. The composition of claim 49, wherein the OTOF protein has the sequence of SEQ ID NO: 5.
52. The composition of any one of claims 45-51, wherein the serotype of the AAV vectors is AAV1.
53. The composition of any one of claims 45-51, wherein the serotype of the AAV vectors is AAV9.
54. A composition comprising: a first nucleic acid vector comprising a promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein; and a second nucleic acid vector comprising a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein and a polyadenylation (poly(A)) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide partially overlaps with the second coding polynucleotide, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and, when introduced into a mammalian cell, the two first and second nucleic acid vectors undergo homologous recombination to form a recombined nucleic acid that encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
55. The composition of claim 54, wherein the serotype of the AAV vectors is AAV1.
56. The composition of claim 54, wherein the serotype of the AAV vectors is AAV9.
57. The composition of any one of claims 36-56, wherein each of the first and second coding polynucleotides encode about half of the OTOF protein sequence.
58. The composition of any one of claims 36-57, wherein the first coding polynucleotide overlaps with the second coding polynucleotide by at least 1 kilobase (kb).
59. The composition of any one of claims 36-58, wherein the region of overlap between the first and second coding polynucleotides is centered at an OTOF exon boundary.
60. The composition of claim 59, wherein the first coding polynucleotide encodes an N-terminal portion of the OTOF protein comprising the OTOF N-terminus to 500 kb 3′ of the exon boundary at the center of the overlap region; and the second coding polynucleotide encodes a C-terminal portion of the OTOF protein comprising 500 kb 5′ of the exon boundary at the center of the overlap region to the OTOF C-terminus.
61. The composition of claim 59 or 60, wherein the exon boundary at the center of the overlap region is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain.
62. The composition of any one of claims 36-61, wherein the promoter is a Myo15 promoter.
63. The composition of claim 62, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2C domain and the second coding polynucleotide encodes the entire C2D domain.
64. The composition of claim 62 or 63, wherein the first coding polynucleotide comprises exons 1-21 of a polynucleotide encoding the OTOF protein and 500 kb 3′ of the exon 21/22 boundary; and the second coding polynucleotide comprises 500 kb 5′ of the exon 21/22 boundary and exons 22-48 of a polynucleotide encoding the OTOF protein.
65. The composition of any one of claims 62-64, wherein the first nucleic acid vector and the second nucleic acid vector do not comprise OTOF untranslated regions (UTRs).
66. The composition of any one of claims 36-60, wherein the promoter is a CAG promoter.
67. The composition of claim 66, wherein the exon boundary is within a portion of the first coding polynucleotide and the second coding polynucleotide that encodes the C2D domain.
68. The composition of claim 66 or 67, wherein the first coding polynucleotide comprises exons 1-24 of a polynucleotide encoding the OTOF protein and 500 kb 3′ of the exon 24/25 boundary; and the second coding polynucleotide comprises 500 kb 5′ of the exon 24/25 boundary and exons 25-48 of a polynucleotide encoding the OTOF protein.
69. The composition of claim 67 or 68, wherein the first and second nucleic acid vectors comprise OTOF UTRs.
70. The composition of claim 66, wherein the exon boundary at the center of the overlap region is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain.
71. The composition of claim 70, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2D domain and the second coding polynucleotide encodes the entire C2E domain.
72. The composition of claim 70 or 71, wherein the first coding polynucleotide comprises exons 1-28 of a polynucleotide encoding the OTOF protein and 500 kb 3′ of the exon 28/29 boundary; and the second coding polynucleotide comprises 500 kb 5′ of the exon 28/29 boundary and exons 29-48 of a polynucleotide encoding the OTOF protein.
73. The composition of any one of claims 70-72, wherein the second nucleic acid vector comprises a full-length OTOF 3′ UTR.
74. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a CMV promoter, a smCBA promoter, a Myo15 promoter, a Myo7A promoter, a Myo6 promoter, a POU4F3 promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein, and a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide; and a second nucleic acid vector comprising a splice acceptor signal sequence, a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, and wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein.
75. The composition of claim 74, wherein the first and second coding polynucleotides that encode the human OTOF protein do not comprise introns.
76. The composition of claim 74 or 75, wherein the human OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 5.
77. The composition of claim 76, wherein the human OTOF protein has the sequence of SEQ ID NO: 1.
78. The composition of claim 76, wherein the human OTOF protein has the sequence of SEQ ID NO: 5.
79. The composition of any one of claims 74-78, wherein the first and second nucleic acid vectors are AAV vectors.
80. The composition of claim 79, wherein the serotype of the AAV vectors is selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
81. The composition of claim 80, wherein the serotype of the AAV vectors is AAV1.
82. The composition of claim 80, wherein the serotype of the AAV vectors is AAV9.
83. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a CMV promoter, a smCBA promoter, a Myo15 promoter, a Myo7A promoter, a Myo6 promoter, a POU4F3 promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an OTOF protein, and a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide; and a second nucleic acid vector comprising a splice acceptor signal sequence, a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
84. The composition of claim 83, wherein the first and second coding polynucleotides that encode the human OTOF protein do not comprise introns.
85. The composition of claim 83 or 84, wherein the OTOF protein is a mammalian OTOF protein.
86. The composition of claim 85, wherein the OTOF protein is a human OTOF protein.
87. The composition of claim 86, wherein the OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5.
88. The composition of claim 87, wherein the OTOF protein has the sequence of SEQ ID NO: 1.
89. The composition of claim 87, wherein the OTOF protein has the sequence of SEQ ID NO: 5.
90. The composition of any one of claims 83-89, wherein the serotype of the AAV vectors is AAV1.
91. The composition of any one of claims 83-89, wherein the serotype of the AAV vectors is AAV9.
92. A composition comprising: a first nucleic acid vector comprising a promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein, and a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide; and a second nucleic acid vector comprising a splice acceptor signal sequence, a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
93. The composition of claim 92, wherein the serotype of the AAV vectors is AAV1.
94. The composition of claim 92, wherein the serotype of the AAV vectors is AAV9.
95. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a CMV promoter, a smCBA promoter, a Myo15 promoter, a Myo7A promoter, a Myo6 promoter, a POU4F3 promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein, a splice donor signal sequence positioned at the 3 end of the first coding polynucleotide, and a recombinogenic region positioned 3′ of the splice donor signal sequence; and a second nucleic acid vector comprising a second recombinogenic region, a splice acceptor signal sequence positioned 3′ of the second recombinogenic region, a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, and wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein.
96. The composition of claim 95, wherein the first and second coding polynucleotides that encode the human OTOF protein do not comprise introns.
97. The composition of claim 95 or 96, wherein the human OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or SEQ ID NO: 5.
98. The composition of claim 97, wherein the human OTOF protein has the sequence of SEQ ID NO: 1.
99. The composition of claim 97, wherein the human OTOF protein has the sequence of SEQ ID NO: 5.
100. The composition of any one of claims 95-99, wherein the first and second nucleic acid vectors are AAV vectors.
101. The composition of claim 100, wherein the serotype of the AAV vectors is selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
102. The composition of claim 101, wherein the serotype of the AAV vectors is AAV1.
103. The composition of claim 101, wherein the serotype of the AAV vectors is AAV9.
104. A composition comprising: a first nucleic acid vector comprising a promoter selected from the group consisting of a CAG promoter, a CMV promoter, a smCBA promoter, a Myo15 promoter, a Myo7A promoter, a Myo6 promoter, a POU4F3 promoter, an OTOF promoter, an FGF8 promoter, and a VGLUT3 promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of an OTOF protein, a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide, and a recombinogenic region positioned 3′ of the splice donor signal sequence; and a second nucleic acid vector comprising a second recombinogenic region, a splice acceptor signal sequence positioned 3′ of the second recombinogenic region, a second coding polynucleotide that encodes a C-terminal portion of an OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
105. The composition of claim 104, wherein the first and second coding polynucleotides that encode the OTOF protein do not comprise introns.
106. The composition of claim 104 or 105, wherein the OTOF protein is a mammalian OTOF protein.
107. The composition of claim 106, wherein the OTOF protein is a human OTOF protein.
108. The composition of claim 107, wherein the OTOF protein has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5.
109. The composition of claim 108, wherein the OTOF protein has the sequence of SEQ ID NO: 1.
110. The composition of claim 108, wherein the OTOF protein has the sequence of SEQ ID NO: 5.
111. The composition of any one of claims 104-110, wherein the serotype of the AAV vectors is AAV1.
112. The composition of any one of claims 104-110, wherein the serotype of the AAV vectors is AAV9.
113. A composition comprising: a first nucleic acid vector comprising a promoter operably linked to a first coding polynucleotide that encodes an N-terminal portion of a human OTOF protein, a splice donor signal sequence positioned at the 3′ end of the first coding polynucleotide, and a recombinogenic region positioned 3′ of the splice donor signal sequence; and a second nucleic acid vector comprising a second recombinogenic region, a splice acceptor signal sequence positioned 3′ of the second recombinogenic region, a second coding polynucleotide that encodes a C-terminal portion of a human OTOF protein positioned at the 3′ end of the splice acceptor signal sequence, and a poly(A) sequence positioned at the 3′ end of the second coding polynucleotide; wherein the first coding polynucleotide and the second coding polynucleotide do not overlap, wherein neither the first nor second nucleic acid vector encodes a full-length OTOF protein, and wherein the first and second nucleic acid vectors are AAV vectors having a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
114. The composition of claim 113, wherein the serotype of the AAV vectors is AAV1.
115. The composition of claim 113, wherein the serotype of the AAV vectors is AAV9.
116. The composition of any one of claims 95-115, wherein the first and second recombinogenic regions are the same.
117. The composition of any one of claims 95-116, wherein the recombinogenic region is an AP gene fragment or an F1 phage AK gene.
118. The composition of claim 14 or 117, wherein the F1 phage AK gene comprises or consists of the sequence of SEQ ID NO: 19.
119. The composition of claim 14 or 117, wherein the AP gene fragment comprises or consists of the sequence of any one of SEQ ID NOs: 62-67.
120. The composition of claim 119, wherein the AP gene fragment comprises or consists of the sequence of SEQ ID NO: 65.
121. The composition of any one of claims 95-120, wherein the first nucleic acid vector further comprises a degradation signal sequence positioned 3′ of the recombinogenic region; and wherein the second nucleic acid vector further comprises a degradation signal sequence positioned between the recombinogenic region and the splice acceptor sequence.
122. The composition of claim 15 or 121, wherein the degradation signal sequence comprises or consists of the sequence of SEQ ID NO: 22.
123. The composition of any one of claims 74-122, wherein each of the first and second coding polynucleotides encode about half of the OTOF protein sequence.
124. The composition of any one of claims 74-123, wherein the division between the first and second coding polynucleotides is at an OTOF exon boundary.
125. The composition of any one of claims 74-124, wherein the OTOF exon boundary is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain.
126. The composition of any one of claims 74-125, wherein the promoter is a CAG promoter.
127. The composition of claim 126, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2D domain and the second coding polynucleotide encodes the entire C2E domain.
128. The composition of claim 126 or 127, wherein the first coding polynucleotide encodes exons 1-26 of a polynucleotide encoding the OTOF protein and the second coding polynucleotide encodes exons 27-48 of a polynucleotide encoding the OTOF protein.
129. The composition of any one of claims 126-128, wherein the first and second nucleic acid vectors comprise OTOF UTRs.
130. The composition of any one of claims 74-125, wherein the promoter is a Myo15 promoter.
131. The composition of claim 130, wherein the OTOF exon boundary is not within a portion of the first coding polynucleotide or second coding polynucleotide that encodes a C2 domain
132. The composition of claim 130 or 131, wherein the exon boundary is selected such that the first coding polynucleotide encodes the entire C2C domain and the second coding polynucleotide encodes the entire C2D domain.
133. The composition of any one of claims 130-132, wherein the first coding polynucleotide encodes exons 1-19 of a polynucleotide encoding the OTOF protein and the second coding polynucleotide encodes exons 20-48 of a polynucleotide encoding the OTOF protein.
134. The composition of any one of claims 130-133, wherein the first nucleic acid vector and the second nucleic acid vector do not comprise OTOF UTRs.
135. The composition of claim 130, wherein the exon boundary is within a portion of the first coding polynucleotide and the second coding polynucleotide that encodes the C2D domain.
136. The composition of claim 135, wherein the first coding polynucleotide encodes exons 1-25 of a polynucleotide encoding the OTOF protein and the second coding polynucleotide encodes exons 26-48 of a polynucleotide encoding the OTOF protein.
137. The composition of claim 135 or 136, wherein the second nucleic acid vector comprises a full-length OTOF 3′ UTR.
138. The composition of any one of claims 11-35 and 74-137, wherein the splice donor sequence comprises or consists of the sequence of SEQ ID NO: 20 or SEQ ID NO: 68.
139. The composition of any one of claims 11-35 and 74-138, wherein the splice acceptor sequence comprises or consists of the sequence of SEQ ID NO: 21 or SEQ ID NO: 69.
140. The composition of any one of claims 1-139, wherein the first and second nucleic acid vectors comprise an inverted terminal repeat (ITR) at each end of the nucleic acid sequence.
141. The composition of claim 140, wherein the ITR is an AAV2 ITR.
142. The composition of any one of claims 1-141, wherein the poly(A) sequence is a bovine growth hormone (bGH) poly(A) signal sequence.
143. The composition of any one of claims 1-142, wherein the second nucleic acid vector comprises a Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE).
144. The composition of claim 143, wherein the WPRE comprises or consists of the sequence of SEQ ID NO: 23 or SEQ ID NO: 61.
145. The composition of any one of claims 1-144, further comprising a pharmaceutically acceptable excipient.
146. A kit comprising the composition of any one of claims 1-145.
147. A method of increasing OTOF expression in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of the composition of any one of claims 1-145.
148. A method of treating a subject having or at risk of developing sensorineural hearing loss, the method comprising administering to the subject a therapeutically effective amount of the composition of any one of claims 1-145.
149. A method of treating a subject having or at risk of developing auditory neuropathy, the method comprising administering to the subject a therapeutically effective amount of the composition of any one of claims 1-145.
150. The method of any one of claims 147-149, wherein the subject has a mutation in OTOF.
151. The method of any one of claims 147-149, wherein the subject has been identified as having a mutation in OTOF.
152. The method of any one of claims 147-149, wherein the method further comprises identifying the subject as having a mutation in OTOF prior to administering the composition.
153. The method of any one of claims 147-152, wherein the method further comprises evaluating the hearing of the subject prior to administering the composition.
154. The method of any one of claims 147-153, wherein the composition is administered locally to the ear.
155. The method of any one of claims 147-154, wherein the method increases OTOF expression in a cochlear hair cell.
156. The method of claim 155, wherein the cochlear hair cell is an inner hair cell.
157. The method of any one of claims 147-156, wherein the subject is a mammal.
158. The method of claim 157, wherein the subject is a human.
159. The method of any one of claims 147-158, wherein the wherein the method further comprises evaluating the hearing of the subject after administering the composition.
160. The method of any one of claims 147-159, wherein the composition prevents or reduces hearing loss, delays the development of hearing loss, slows the progression of hearing loss, improves hearing, improves speech discrimination, or improves hair cell function.
161. The method of any one of claims 147-160, wherein the composition is administered in an amount sufficient to increase OTOF expression in a cochlear hair cell, prevent or reduce hearing loss, delay the development of hearing loss, slow the progression of hearing loss, improve hearing, improve speech discrimination, or improve hair cell function.
162. A method of increasing OTOF expression in a cell, the method comprising introducing the composition of any one of claims 1-145 into the cell.
163. The method of claim 162, wherein the cell is a cochlear hair cell.
164. The method of claim 163, wherein the cell is an inner hair cell.
165. The method of any one of claims 162-164, wherein the cell is a mammalian cell.
166. The method of claim 165, wherein the cell is a human cell.
167. A method of increasing OTOF expression in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a pair of nucleic acid vectors listed in Table 4.
168. A method of treating a subject having or at risk of developing sensorineural hearing loss, the method comprising administering to the subject a therapeutically effective amount of a pair of nucleic acid vectors listed in Table 4.
169. A method of treating a subject having or at risk of developing auditory neuropathy, the method comprising administering to the subject a therapeutically effective amount of a pair of nucleic acid vectors listed in Table 4.
170. The method of any one of claims 167-169, wherein the subject has a mutation in OTOF.
171. The method of any one of claims 167-169, wherein the subject has been identified as having a mutation in OTOF.
172. The method of any one of claims 167-169, wherein the method further comprises identifying the subject as having a mutation in OTOF prior to administering the pair of nucleic acid vectors.
173. The method of any one of claims 167-169, wherein the method further comprises evaluating the hearing of the subject prior to administering the pair of nucleic acid vectors.
174. The method of any one of claims 167-173, wherein the vectors are administered locally to the ear.
175. The method of claim 174, wherein the vectors are administered concurrently.
176. The method of claim 174, wherein the vectors are administered sequentially.
177. The method of any one of claims 167-176, wherein the method increases OTOF expression in a cochlear hair cell.
178. The method of claim 177, wherein the cochlear hair cell is an inner hair cell.
179. The method of any one of claims 167-178, wherein the subject is a mammal.
180. The method of claim 179, wherein the subject is a human.
181. The method of any one of claims 167-180, wherein the method further comprises evaluating the hearing of the subject after administering the pair of nucleic acid vectors.
182. The method of any one of claims 167-181, wherein the vectors are administered in an amount sufficient to increase OTOF expression in a cochlear hair cell, prevent or reduce hearing loss, delay the development of hearing loss, slow the progression of hearing loss, improve hearing, improve speech discrimination, or improve hair cell function.
183. A method of increasing OTOF expression in a cell, the method comprising introducing a pair of nucleic acid vectors listed in Table 4 into the cell.
184. The method of claim 183, wherein the cell is a cochlear hair cell.
185. The method of claim 184, wherein the cell is an inner hair cell.
186. The method of any one of claims 183-185, wherein the cell is a mammalian cell.
187. The method of claim 186, wherein the cell is a human cell.
188. The method of any one of claims 167-187, wherein the pair of nucleic acid vectors encode an OTOF protein that has at least 85% identity to the sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5.
189. The method of claim 188, wherein the OTOF protein has the sequence of SEQ ID NO: 1.
190. The method of claim 188, wherein the OTOF protein has the sequence of SEQ ID NO: 5.
191. The method of any one of claims 167-190, wherein the nucleic acid vectors are AAV vectors.
192. The method of claim 191, wherein the AAV vectors have a serotype selected from the group consisting of AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S.
193. The method of claim 192, wherein the serotype of the AAV vectors is AAV1.
194. The method of claim 192, wherein the serotype of the AAV vectors is AAV9.
195. The method of any one of claims 167-194, wherein the vectors contain AAV2 ITRs.
196. The method of any one of claims 165-195, wherein the second nucleic acid vector in the pair of nucleic acid vectors comprises a WPRE.
197. The method of claim 196, wherein the WPRE comprises or consists of the sequence of SEQ ID NO: 23 or SEQ ID NO: 61.
198. The method of any one of claims 167-197, wherein the nucleic acid vectors are overlapping dual vectors.
199. The method of any one of claims 167-197, wherein the nucleic acid vectors are trans-splicing dual vectors.
200. The method of any one of claims 167-197, wherein the nucleic acid vectors are dual hybrid vectors.
201. The method of claim 200, wherein the recombinogenic region in the dual hybrid vectors is an AP gene fragment or an F1 phage AK gene.
202. The method of claim 200, wherein the F1 phage AK gene comprises or consists of the sequence of SEQ ID NO: 19.
203. The method of claim 200, wherein the AP gene fragment comprises or consists of the sequence of any one of SEQ ID NOs: 62-67.
204. The method of claim 203, wherein the AP gene fragment comprises or consists of the sequence of SEQ ID NO: 65.
205. The method of any one of claims 200-204, wherein the first nucleic acid vector in the pair of nucleic acid vectors further comprises a degradation signal sequence positioned 3′ of the recombinogenic region; and wherein the second nucleic acid vector in the pair of nucleic acid vectors further comprises a degradation signal sequence positioned between the recombinogenic region and the splice acceptor sequence.
206. The method of claim 205, wherein the degradation signal sequence comprises or consists of the sequence of SEQ ID NO: 22.
207. The method of any one of claims 199-206, wherein the splice donor sequence in the first nucleic acid vector comprises or consists of the sequence of SEQ ID NO: 20 or SEQ ID NO: 68.
208. The method of any one of claims 199-207, wherein the splice acceptor sequence in the second nucleic acid vector comprises or consists of the sequence of SEQ ID NO: 21 or SEQ ID NO: 69.
209. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a first region having at least 85% sequence identity to SEQ ID NO: 24 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 26 and/or SEQ ID NO: 27 operably linked to a second region having at least 85% sequence identity to SEQ ID NO: 25 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 31 and/or SEQ ID NO: 32, optionally comprising a linker comprising one to one hundred nucleotides between the first region and the second region.
210. The composition of claim 209, wherein the first region comprises or consists of the sequence of SEQ ID NO: 24.
211. The composition of claim 209, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26.
212. The composition of claim 209, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 27.
213. The composition of claim 209, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26 and the sequence of SEQ ID NO: 27.
214. The composition of claim 213, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 28.
215. The composition of claim 213, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 29.
216. The composition of claim 213, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 30.
217. The composition of claim 213, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 50.
218. The composition of any one of claims 209-217, wherein the second region comprises or consists of the sequence of SEQ ID NO: 25.
219. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31.
220. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 32.
221. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31 and the sequence of SEQ ID NO: 32.
222. The composition of claim 221, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 33.
223. The composition of claim 221, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 34.
224. The composition of claim 221, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 35.
225. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51.
226. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 52.
227. The composition of any one of claims 209-217, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51 and the sequence of SEQ ID NO: 52.
228. The composition of claim 227, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 55.
229. The composition of claim 209, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 36.
230. The composition of claim 209, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 38.
231. The composition of claim 209, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 39.
232. The composition of claim 209, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 59.
233. The composition of claim 209, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 60.
234. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a first region having at least 85% sequence identity to SEQ ID NO: 25 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 31 and/or SEQ ID NO: 32 operably linked to a second region having at least 85% sequence identity to SEQ ID NO: 24 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 26 and/or SEQ ID NO: 27, optionally comprising a linker comprising one to one hundred nucleotides between the first region and the second region.
235. The composition of claim 234, wherein the first region comprises or consists of the sequence of SEQ ID NO: 25.
236. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31.
237. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 32.
238. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31 and the sequence of SEQ ID NO: 32.
239. The composition of claim 238, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 33.
240. The composition of claim 238, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 34.
241. The composition of claim 238, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 35.
242. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51.
243. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 52.
244. The composition of claim 234, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51 and the sequence of SEQ ID NO: 52.
245. The composition of claim 244, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 55.
246. The composition of any one of claims 234-245, wherein the second region comprises or consists of the sequence of SEQ ID NO: 24.
247. The composition of any one of claims 234-245, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26.
248. The composition of any one of claims 234-245, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 27.
249. The composition of any one of claims 234-245, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26 and the sequence of SEQ ID NO: 27.
250. The composition of claim 249, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 28.
251. The composition of claim 249, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 29.
252. The composition of claim 249, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 30.
253. The composition of claim 249, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 50.
254. The composition of claim 234, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 37.
255. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a region having at least 85% sequence identity to SEQ ID NO: 24 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 26 and/or SEQ ID NO: 27.
256. The composition of claim 255, wherein the first region comprises or consists of the sequence of SEQ ID NO: 24.
257. The composition of claim 255, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26.
258. The composition of claim 255, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 27.
259. The composition of claim 255, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 26 and the sequence of SEQ ID NO: 27.
260. The composition of claim 259, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 28.
261. The composition of claim 259, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 29.
262. The composition of claim 259, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 30.
263. The composition of claim 259, wherein the functional portion of SEQ ID NO: 24 comprises the sequence of SEQ ID NO: 50.
264. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a region having at least 85% sequence identity to SEQ ID NO: 25 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 31 and/or SEQ ID NO: 32.
265. The composition of claim 264, wherein the first region comprises or consists of the sequence of SEQ ID NO: 25.
266. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31.
267. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 32.
268. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 31 and the sequence of SEQ ID NO: 32.
269. The composition of claim 268, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 33.
270. The composition of claim 268, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 34.
271. The composition of claim 268, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 35.
272. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51.
273. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 52.
274. The composition of claim 264, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 51 and the sequence of SEQ ID NO: 52.
275. The composition of claim 274, wherein the functional portion of SEQ ID NO: 25 comprises the sequence of SEQ ID NO: 55.
276. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a first region having at least 85% sequence identity to SEQ ID NO: 40 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 42 operably linked to a second region having at least 85% sequence identity to SEQ ID NO: 41 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 43 and/or SEQ ID NO: 44, optionally comprising a linker comprising one to four hundred nucleotides between the first region and the second region.
277. The composition of claim 276, wherein the first region comprises or consists of the sequence of SEQ ID NO: 40.
278. The composition of claim 276, wherein the functional portion of SEQ ID NO: 40 comprises the sequence of SEQ ID NO: 42.
279. The composition of any one of claims 276-278, wherein the second region comprises or consists of the sequence of SEQ ID NO: 41.
280. The composition of any one of claims 276-278, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43.
281. The composition of any one of claims 276-278, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 44.
282. The composition of any one of claims 276-278, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43 and the sequence of SEQ ID NO: 44.
283. The composition of claim 282, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 45.
284. The composition of claim 282, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 46.
285. The composition of claim 282, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 47.
286. The composition of claim 276, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 48.
287. The composition of claim 276, wherein the polynucleotide comprises or consists of the sequence of SEQ ID NO: 49.
288. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a first region having at least 85% sequence identity to SEQ ID NO: 41 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 43 and/or SEQ ID NO: 44 operably linked to a second region having at least 85% sequence identity to SEQ ID NO: 40 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 42, optionally comprising a linker comprising one to four hundred nucleotides between the first region and the second region.
289. The composition of claim 288, wherein the first region comprises or consists of the sequence of SEQ ID NO: 41.
290. The composition of claim 288, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43.
291. The composition of claim 288, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 44.
292. The composition of claim 288, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43 and the sequence of SEQ ID NO: 44.
293. The composition of claim 292, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 45.
294. The composition of claim 292, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 46.
295. The composition of claim 292, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 47.
296. The composition of any one of claims 288-295, wherein the second region comprises or consists of the sequence of SEQ ID NO: 40.
297. The composition of any one of claims 288-295, wherein the functional portion of SEQ ID NO: 40 comprises the sequence of SEQ ID NO: 42.
298. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a region having at least 85% sequence identity to SEQ ID NO: 40 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 42.
299. The composition of claim 298, wherein the region comprises or consists of the sequence of SEQ ID NO: 40.
300. The composition of claim 298, wherein the functional portion of SEQ ID NO: 40 comprises the sequence of SEQ ID NO: 42.
301. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises a region having at least 85% sequence identity to SEQ ID NO: 41 or a functional portion or derivative thereof comprising the sequence of SEQ ID NO: 43 and/or SEQ ID NO: 44.
302. The composition of claim 301, wherein the region comprises or consists of the sequence of SEQ ID NO: 41.
303. The composition of claim 301, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43.
304. The composition of claim 301, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 44.
305. The composition of claim 301, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 43 and the sequence of SEQ ID NO: 44.
306. The composition of claim 305, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 45.
307. The composition of claim 305, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 46.
308. The composition of claim 305, wherein the functional portion of SEQ ID NO: 41 comprises the sequence of SEQ ID NO: 47.
309. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises or consists of a polynucleotide having at least 85% sequence identity to SEQ ID NO: 51.
310. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises or consists of a polynucleotide having at least 85% sequence identity to SEQ ID NO: 55.
311. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises or consists of a polynucleotide having at least 85% sequence identity SEQ ID NO: 56.
312. The composition of any one of claims 1-65, 74-125, and 130-145, wherein the Myo15 promoter comprises or consists of a polynucleotide having at least 85% sequence identity to SEQ ID NO: 57.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0155]
[0156]
[0157]
[0158]
[0159]
[0160]
[0161]
[0162]
[0163]
[0164]
[0165]
[0166]
[0167]
[0168]
[0169]
[0170]
[0171]
[0172]
[0173]
[0174]
[0175]
[0176]
[0177] In another set of experiments, Homozygous OTOF-Q828X mice were treated with an AAV1-truncated chimeric CMV-chicken β-actin (smCBA)-hOTOF (isoform 5, SEQ ID NO: 5) dual hybrid vector system by injection through the round window membrane and ABR thresholds were used to assess hearing function as described above (
[0178] In yet another set of experiments, homozygous OTOF-Q828X mice were treated with an AAV1-smCBA-hOTOF (isoform 5, SEQ ID NO: 5) dual hybrid vector system by injection through the round window membrane at low, medium (mid), and high doses and ABR thresholds were used to assess hearing function at four weeks and eight weeks post-treatment (
DETAILED DESCRIPTION
[0179] Described herein are compositions and methods for the treatment of sensorineural hearing loss or auditory neuropathy in a subject (such as a mammalian subject, for instance, a human) by administering a first nucleic acid vector containing a promoter and a polynucleotide encoding an N-terminal portion of an otoferlin (OTOF) protein (e.g., a wild-type (WT) OTOF protein) and a second nucleic acid vector containing a polynucleotide encoding a C-terminal portion of an OTOF protein and a polyadenylation (poly(A)) sequence. When introduced into a mammalian cell, such as a cochlear hair cell, the polynucleotides encoded by the two nucleic acid vectors can combine to form a nucleic acid molecule that encodes the full-length OTOF protein. The compositions and methods described herein can, therefore, be used to induce or increase expression of WT OTOF in cochlear hair cells of a subject who has an OTOF deficiency (e.g., low OTOF expression or an OTOF mutation that impairs OTOF expression or function).
Otofelin
[0180] OTOF is a 230 kDa membrane protein that contains at least six C2 domains implicated in calcium, phospholipid, and protein binding. It is encoded by a gene that contains 48 exons, and the full-length protein is made up of 1,997 amino acids. OTOF is located at ribbon synapses in inner hair cells, where it is believed to function as a calcium sensor in synaptic vesicle fusion, triggering the fusion of neurotransmitter-containing vesicles with the plasma membrane. It has also been implicated in vesicle replenishment and clathrin-mediated endocytosis, and has been shown to interact with Myosin VI, Rab8b, SNARE proteins, calcium channel Cav1.3, Ergic2, and AP-2. The mechanism by which OTOF mediates exocytosis and the physiological significance of its interactions with its binding partners remain to be determined.
Otoferlin-Associated Hearing Loss
[0181] OTOF was first identified by a study investigating the genetics of a non-syndromic form of deafness, autosomal recessive deafness-9 (DFNB9). Mutations in OTOF have since been found to cause sensorineural hearing loss in patients throughout the world, with many patients carrying OTOF mutations having auditory neuropathy, a disorder in which the inner ear detects sound, but is unable to properly transmit sound from the ear to the brain. These patients have an abnormal auditory brainstem response (ABR) and impaired speech discrimination with initially normal otoacoustic emissions. Patients carrying homozygous or compound heterozygous mutations often develop hearing loss in early childhood, and the severity of hearing impairment has been found to vary with the location and type of mutation in OTOF.
[0182] The compositions and methods described herein can be used to treat sensorineural hearing loss or auditory neuropathy by administering a first nucleic acid vector containing a polynucleotide encoding an N-terminal portion of an OTOF protein and a second nucleic acid vector containing a polynucleotide encoding a C-terminal portion of an OTOF protein. The full-length OTOF coding sequence is too large to include in the type of vector that is commonly used for gene therapy (e.g., an adeno-associated virus (AAV) vector, which is thought to have a packaging limit of 5 kb). The compositions and methods described herein overcome this problem by dividing the OTOF coding sequence between two different nucleic acid vectors that can recombine in a cell to reconstitute the full-length OTOF sequence. These compositions and methods can be used to treat subjects having one or more mutations in the OTOF gene, e.g., an OTOF mutation that reduces OTOF expression, reduces OTOF function, or is associated with hearing loss. When the first and second nucleic acid vectors are administered in a composition, the polynucleotides encoding the N-terminal and C-terminal portions of OTOF can combine within a cell (e.g., a human cell, e.g., a cochlear hair cell) to form a single nucleic acid molecule that contains the full-length OTOF coding sequence (e.g., through homologous recombination and/or splicing).
[0183] The nucleic acid vectors used in the compositions and methods described herein include nucleic acid sequences that encode wild-type OTOF, or a variant thereof, such as a nucleic acid sequences that, when combined, encode a protein having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97, 98%, 99, or more, sequence identity) to the amino acid sequence ofwild-type human or mouse OTOF. The polynucleotides used in the nucleic acid vectors described herein encode an N-terminal portion and a C-terminal portion of an OTOF amino acid sequence in Table 2 below (e.g., two portions that, when combined, encode a full-length OTOF amino acid sequence listed in Table 2, e.g., any one of SEQ ID NOs: 1-5).
[0184] According to the methods described herein, a subject can be administered a composition containing a first nucleic acid vector and a second nucleic acid vector that encode an N-terminal and C-terminal portion, respectively, of a nucleic acid sequence encoding the amino acid sequence of any one of SEQ ID NOs: 1-5, or a nucleic acid sequence encoding an amino acid sequence having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the amino acid sequence of any one of SEQ ID NOs: 1-5, or a nucleic acid sequence encoding an amino acid sequence that contains one or more conservative amino acid substitutions relative to any one of SEQ ID NOs: 1-5 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more conservative amino acid substitutions), provided that the OTOF analog encoded retains the therapeutic function of wild-type OTOF (e.g., the ability to regulate exocytosis at ribbon synapses).
TABLE-US-00002 TABLE 2 OTOF Sequences SEQ ID NO. Sequence Name Sequence 1 OTOF-201 protein MALLIHLKTVSELRGRGDRIAKVTFRGQSFYSRVLENCEDVADFDE (NP_919224.1), TFRWPVASSIDRNEMLEIQVFNYSKVFSNKLIGTFRMVLQKVVEES human otoferlin HVEVTDTLIDDNNAIIKTSLCVEVRYQATDGTVGSWDDGDFLGDES isoform a, 1997 aa LQEEEKDSQETDGLLPGSRPSSRPPGEKSFRRAGRSVFSAMKLGK NRSHKEEPQRPDEPAVLEMEDLDHLAIRLGDGLDPDSVSLASVTAL TTNVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVV CVEVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVI HSKNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISS GLKGYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPE RQWARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQV FFAGQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSD KVNDVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLL DEHQDLNEGLGEGVSFRARLLLGLAVEIVDTSNPELTSSTEVQVEQ ATPISESCAGKMEEFFLFGAFLEASMIDRRNGDKPITFEVTIGNYGN EVDGLSRPQRPRPRKEPGDEEEVDLIQNASDDEAGDAGDLASVSS TPPMRPQVTDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMD HIADKLEEGLNDIQEMIKTEKSYPERRLRGVLEELSCGCCRFLSLAD KDQGHSSRTRLDRERLKSCMRELENMGQQARMLRAQVKRHTVRD KLRLCQNFLQKLRFLADEPQHSIPDIFIWMMSNNKRVAYARVPSKD LLFSIVEEETGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKVELYLW LGLSKQRKEFLCGLPCGFQEVKAAQGLGLHAFPPVSLVYTKKQAF QLRAHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPT WDQMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTF AKPLVKMADEAYCPPRFPPQLEYYQIYRGNATAGDLLAAFELLQIG PAGKADLPPINGPVDVDRGPIMPVPMGIRPVLSKYRVEVLFWGLRD LKRVNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEV DLPENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRS APSWNTTVRLLRRCRVLCNGGSSSHSTGEVVVTMEPEVPIKKLET MVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPEEEEPDESMLD WWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNTEGLKGSMKGK EKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDELKVYPKELESE FDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPL PEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVVRATDLHPADIN GKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLT VAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSTHGYNI WRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVANRVFTGPSEIE DENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEHVETRPLLNPD KPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIIWNT DEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGE GNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKIPARLTLQIW DADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLV SIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNPV GLARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTYRWLLLKLL LLLLLLLLLALFLYSVPGYLVKKILGA 2 OTOF-202 protein MIKTEKSYPERRLRGVLEELSCGCCRFLSLADKDQGHSSRTRLDRE (NP_004793.2), RLKSCMRELENMGQQARMLRAQVKRHTVRDKLRLCQNFLQKLRF human otoferlin LADEPQHSIPDIFIWMMSNNKRVAYARVPSKDLLFSIVEEETGKDCA isoform b, 1230 aa KVKTLFLKLPGKRGFGSAGWTVQAKVELYLWLGLSKQRKEFLCGL PCGFQEVKAAQGLGLHAFPPVSLVYTKKQAFQLRAHMYQARSLFA ADSSGLSDPFARVFFINQSQCTEVLNETLCPTWDQMLVFDNLELYG EAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKPLVKMADEAYCPP RFPPQLEYYQIYRGNATAGDLLAAFELLQIGPAGKADLPPINGPVDV DRGPIMPVPMGIRPVLSKYRVEVLFWGLRDLKRVNLAQVDRPRVDI ECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLPENELLHPPLNIRV VDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAPSWNTTGEVVVTM EPEVPIKKLETMVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPE EEEPDESMLDWWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNT EGLKGSMKGKEKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDE LKVYPKELESEFDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRF KGSLCVYKVPLPEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVV RATDLHPADINGKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDI EASFPMESMLTVAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCG IAQTYSTHGYNIWRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVA NRVFTGPSEIEDENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEH VETRPLLNPDKPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKK YELRVIIWNTDEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTD VHYHSLTGEGNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKI PARLTLQIWDADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMAT GEVDVPLVSIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTA EEAEKNPVGLARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTY RWLLLKLLLLLLLLLLLALFLYSVPGYLVKKILGA 3 OTOF-203 protein MIKTEKSYPERRLRGVLEELSCGCCRFLSLADKDQGHSSRTRLDRE (NP_919304.1), RLKSCMRELENMGQQARMLRAQVKRHTVRDKLRLCQNFLQKLRF human otoferlin LADEPQHSIPDIFIWMMSNNKRVAYARVPSKDLLFSIVEEETGKDCA isoform d, 1230 aa KVKTLFLKLPGKRGFGSAGWTVQAKVELYLWLGLSKQRKEFLCGL PCGFQEVKAAQGLGLHAFPPVSLVYTKKQAFQLRAHMYQARSLFA ADSSGLSDPFARVFFINQSQCTEVLNETLCPTWDQMLVFDNLELYG EAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKPLVKMADEAYCPP RFPPQLEYYQIYRGNATAGDLLAAFELLQIGPAGKADLPPINGPVDV DRGPIMPVPMGIRPVLSKYRVEVLFWGLRDLKRVNLAQVDRPRVDI ECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLPENELLHPPLNIRV VDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAPSWNTTGEVVVTM EPEVPIKKLETMVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPE EEEPDESMLDWWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNT EGLKGSMKGKEKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDE LKVYPKELESEFDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRF KGSLCVYKVPLPEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVV RATDLHPADINGKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDI EASFPMESMLTVAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCG IAQTYSTHGYNIWRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVA NRVFTGPSEIEDENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEH VETRPLLNPDKPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKK YELRVIIWNTDEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTD VHYHSLTGEGNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKI PARLTLQIWDADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMAT GEVDVPLVSIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTA EEAEKNPVGLARNEPDPLEKPNRPDTAFVWFLNPLKSIKYLICTRYK WLIIKIVLALLGLLMLGLFLYSLPGYMVKKLLGA 4 OTOF-208 protein MMTDTQDGPSESSQIMRSLTPLINREEAFGEAGEAGLWPSITHTPD (NP_919303.1), SQEEGLNDIQEMIKTEKSYPERRLRGVLEELSCGCCRFLSLADKDQ human otoferlin GHSSRTRLDRERLKSCMRELENMGQQARMLRAQVKRHTVRDKLR isoform c, 1307 aa LCQNFLQKLRFLADEPQHSIPDIFIWMMSNNKRVAYARVPSKDLLFS IVEEETGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKVELYLWLGLS KQRKEFLCGLPCGFQEVKAAQGLGLHAFPPVSLVYTKKQAFQLRA HMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWDQ MLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKPL VKMADEAYCPPRFPPQLEYYQIYRGNATAGDLLAAFELLQIGPAGK ADLPPINGPVDVDRGPIMPVPMGIRPVLSKYRVEVLFWGLRDLKRV NLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLPE NELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAPS WNTTVRLLRRCRVLCNGGSSSHSTGEVVVTMEPEVPIKKLETMVK LDATSEAVVKVDVAEEEKEKKKKKKGTAEEPEEEEPDESMLDWWS KYFASIDTMKEQLRQQEPSGIDLEEKEEVDNTEGLKGSMKGKEKAR AAKEEKKKKTQSSGSGQGSEAPEKKKPKIDELKVYPKELESEFDNF EDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPLPED VSREAGYDSTYGMFQGIPSNDPINVLVRVYVVRATDLHPADINGKA DPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLTVAV YDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSTHGYNIWR DPMKPSQILTRLCKDGKVDGPHFGPPGRVKVANRVFTGPSEIEDE NGQRKPTDEHVALLALRHWEDIPRAGCRLVPEHVETRPLLNPDKP GIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIIWNTDE VVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGEGN FNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKIPARLTLQIWDA DHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLVSIF KQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNPVGL ARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTYRWLLLKLLLLL LLLLLLALFLYSVPGYLVKKILGA 5 OTOF-205 protein MALLIHLKTVSELRGRGDRIAKVTFRGQSFYSRVLENCEDVADFDE (NP_001274418.1), TFRWPVASSIDRNEMLEIQVFNYSKVFSNKLIGTFRMVLQKVVEES human otoferlin HVEVTDTLIDDNNAIIKTSLCVEVRYQATDGTVGSWDDGDFLGDES isoform e, 1997 aa LQEEEKDSQETDGLLPGSRPSSRPPGEKSFRRAGRSVFSAMKLGK NRSHKEEPQRPDEPAVLEMEDLDHLAIRLGDGLDPDSVSLASVTAL TTNVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVV CVEVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVI HSKNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISS GLKGYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPE RQWARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQV FFAGQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSD KVNDVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLL DEHQDLNEGLGEGVSFRARLLLGLAVEIVDTSNPELTSSTEVQVEQ ATPISESCAGKMEEFFLFGAFLEASMIDRRNGDKPITFEVTIGNYGN EVDGLSRPQRPRPRKEPGDEEEVDLIQNASDDEAGDAGDLASVSS TPPMRPQVTDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMD HIADKLEEGLNDIQEMIKTEKSYPERRLRGVLEELSCGCCRFLSLAD KDQGHSSRTRLDRERLKSCMRELENMGQQARMLRAQVKRHTVRD KLRLCQNFLQKLRFLADEPQHSIPDIFIWMMSNNKRVAYARVPSKD LLFSIVEEETGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKVELYLW LGLSKQRKEFLCGLPCGFQEVKAAQGLGLHAFPPVSLVYTKKQAF QLRAHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPT WDQMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTF AKPLVKMADEAYCPPRFPPQLEYYQIYRGNATAGDLLAAFELLQIG PAGKADLPPINGPVDVDRGPIMPVPMGIRPVLSKYRVEVLFWGLRD LKRVNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEV DLPENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRS APSWNTTVRLLRRCRVLCNGGSSSHSTGEVVVTMEPEVPIKKLET MVKLDATSEAVVKVDVAEEEKEKKKKKKGTAEEPEEEEPDESMLD WWSKYFASIDTMKEQLRQQEPSGIDLEEKEEVDNTEGLKGSMKGK EKARAAKEEKKKKTQSSGSGQGSEAPEKKKPKIDELKVYPKELESE FDNFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPL PEDVSREAGYDSTYGMFQGIPSNDPINVLVRVYVVRATDLHPADIN GKADPYIAIRLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLT VAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSTHGYNI WRDPMKPSQILTRLCKDGKVDGPHFGPPGRVKVANRVFTGPSEIE DENGQRKPTDEHVALLALRHWEDIPRAGCRLVPEHVETRPLLNPD KPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIIWNT DEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGE GNFNWRYLFPFDYLAAEEKIVISKKESMFSWDETEYKIPARLTLQIW DADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLV SIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNPV GLARNEPDPLEKPNRPDTAFVWFLNPLKSIKYLICTRYKWLIIKIVLAL LGLLMLGLFLYSLPGYMVKKLLGA 6 mOTOF-201_1 MALIVHLKTVSELRGKGDRIAKVTFRGQSFYSRVLENCEGVADFDE protein TFRWPVASSIDRNEVLEIQIFNYSKVFSNKLIGTFCMVLQKVVEENR (NP_114081.2), VEVTDTLMDDSNAIIKTSLSMEVRYQATDGTVGPWDDGDFLGDESL mouse otoferlin QEEKDSQETDGLLPGSRPSTRISGEKSFRRAGRSVFSAMKLGKTR isoform 2, 1997 aa SHKEEPQRQDEPAVLEMEDLDHLAIQLGDGLDPDSVSLASVTALTS NVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVVCV EVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVIHS KNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISAGLK GYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPERQ WARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQVFFA GQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSDKVN DVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLLDEH QDLNEGLGEGVSFRARLMLGLAVEILDTSNPELTSSTEVQVEQATP VSESCTGRMEEFFLFGAFLEASMIDRKNGDKPITFEVTIGNYGNEV DGMSRPLRPRPRKEPGDEEEVDLIQNSSDDEGDEAGDLASVSSTP PMRPQITDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMDHIA DKLEEGLNDVQEMIKTEKSYPERRLRGVLEELSCGCHRFLSLSDKD QGRSSRTRLDRERLKSCMRELESMGQQAKSLRAQVKRHTVRDKL RSCQNFLQKLRFLADEPQHSIPDVFIWMMSNNKRIAYARVPSKDLL FSIVEEELGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKLELYLWLG LSKQRKDFLCGLPCGFEEVKAAQGLGLHSFPPISLVYTKKQAFQLR AHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWD QMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKP LVKMADEAYCPPRFPPQLEYYQIYRGSATAGDLLAAFELLQIGPSG KADLPPINGPVDMDRGPIMPVPVGIRPVLSKYRVEVLFWGLRDLKR VNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLP ENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAP NWNTTVRLLRGCHRLRNGGPSSRPTGEVVVSMEPEEPVKKLETM VKLDATSDAVVKVDVAEDEKERKKKKKKGPSEEPEEEEPDESMLD WWSKYFASIDTMKEQLRQHETSGTDLEEKEEMESAEGLKGPMKS KEKSRAAKEEKKKKNQSPGPGQGSEAPEKKKAKIDELKVYPKELES EFDSFEDWLHTFNLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKV PLPEDVSREAGYDPTYGMFQGIPSNDPINVLVRIYVVRATDLHPADI NGKADPYIAIKLGKTDIRDKENYISKQLNPVFGKSFDIEASFPMESML TVAVYDWDLVGTDDLIGETKIDLENRFYSKHRATCGIAQTYSIHGYN IWRDPMKPSQILTRLCKEGKVDGPHFGPHGRVRVANRVFTGPSEIE DENGQRKPTDEHVALSALRHWEDIPRVGCRLVPEHVETRPLLNPD KPGIEQGRLELWVDMFPMDMPAPGTPLDISPRKPKKYELRVIVWNT DEVVLEDDDFFTGEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGE GNFNWRYLFPFDYLAAEEKIVMSKKESMFSWDETEYKIPARLTLQI WDADHFSADDFLGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPL VSIFKQKRVKGWWPLLARNENDEFELTGKVEAELHLLTAEEAEKNP VGLARNEPDPLEKPNRPDTSFIWFLNPLKSARYFLWHTYRWLLLKF LLLFLLLLLFALFLYSLPGYLAKKILGA 7 mOTOF-201_2 MALIVHLKTVSELRGKGDRIAKVTFRGQSFYSRVLENCEGVADFDE protein TFRWPVASSIDRNEVLEIQIFNYSKVFSNKLIGTFCMVLQKVVEENR (NP_001273350.1), VEVTDTLMDDSNAIIKTSLSMEVRYQATDGTVGPWDDGDFLGDESL mouse otoferlin QEEKDSQETDGLLPGSRPSTRISGEKSFRRAGRSVFSAMKLGKTR isoform 3, 1977 aa SHKEEPQRQDEPAVLEMEDLDHLAIQLGDGLDPDSVSLASVTALTS NVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVVCV EVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVIHS KNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISAGLK GYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPERQ WARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQVFFA GQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSDKVN DVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLLDEH QDLNEGLGEGVSFRARLMLGLAVEILDTSNPELTSSTEVQVEQATP VSESCTGRMEEFFLFGAFLEASMIDRKNGDKPITFEVTIGNYGNEV DGMSRPLRPRPRKEPGDEEEVDLIQNSSDDEGDEAGDLASVSSTP PMRPQITDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMDHIA DKLEEGLNDVQEMIKTEKSYPERRLRGVLEELSCGCHRFLSLSDKD QGRSSRTRLDRERLKSCMRELESMGQQAKSLRAQVKRHTVRDKL RSCQNFLQKLRFLADEPQHSIPDVFIWMMSNNKRIAYARVPSKDLL FSIVEEELGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKLELYLWLG LSKQRKDFLCGLPCGFEEVKAAQGLGLHSFPPISLVYTKKQAFQLR AHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWD QMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKP LVKMADEAYCPPRFPPQLEYYQIYRGSATAGDLLAAFELLQIGPSG KADLPPINGPVDMDRGPIMPVPVGIRPVLSKYRVEVLFWGLRDLKR VNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLP ENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAP NWNTTGEVVVSMEPEEPVKKLETMVKLDATSDAVVKVDVAEDEKE RKKKKKKGPSEEPEEEEPDESMLDWWSKYFASIDTMKEQLRQHET SGTDLEEKEEMESAEGLKGPMKSKEKSRAAKEEKKKKNQSPGPG QGSEAPEKKKAKIDELKVYPKELESEFDSFEDWLHTFNLLRGKTGD DEDGSTEEERIVGRFKGSLCVYKVPLPEDVSREAGYDPTYGMFQGI PSNDPINVLVRIYVVRATDLHPADINGKADPYIAIKLGKTDIRDKENYI SKQLNPVFGKSFDIEASFPMESMLTVAVYDWDLVGTDDLIGETKIDL ENRFYSKHRATCGIAQTYSIHGYNIWRDPMKPSQILTRLCKEGKVD GPHFGPHGRVRVANRVFTGPSEIEDENGQRKPTDEHVALSALRHW EDIPRVGCRLVPEHVETRPLLNPDKPGIEQGRLELWVDMFPMDMP APGTPLDISPRKPKKYELRVIVWNTDEVVLEDDDFFTGEKSSDIFVR GWLKGQQEDKQDTDVHYHSLTGEGNFNWRYLFPFDYLAAEEKIV MSKKESMFSWDETEYKIPARLTLQIWDADHFSADDFLGAIELDLNR FPRGAKTAKQCTMEMATGEVDVPLVSIFKQKRVKGWWPLLARNEN DEFELTGKVEAELHLLTAEEAEKNPVGLARNEPDPLEKPNRPDTSFI WFLNPLKSARYFLWHTYRWLLLKFLLLFLLLLLFALFLYSLPGYLAKK ILGA 8 mOTOF-202_1 MALIVHLKTVSELRGKGDRIAKVTFRGQSFYSRVLENCEGVADFDE protein TFRWPVASSIDRNEVLEIQIFNYSKVFSNKLIGTFCMVLQKVVEENR (NP_001093865.1), VEVTDTLMDDSNAIIKTSLSMEVRYQATDGTVGPWDDGDFLGDESL mouse otoferlin QEEKDSQETDGLLPGSRPSTRISGEKSFRSKGREKTKGGRDGEHK isoform 1, 1992 aa AGRSVFSAMKLGKTRSHKEEPQRQDEPAVLEMEDLDHLAIQLGDG LDPDSVSLASVTALTSNVSNKRSKPDIKMEPSAGRPMDYQVSITVIE ARQLVGLNMDPVVCVEVGDDKKYTSMKESTNCPYYNEYFVFDFHV SPDVMFDKIIKISVIHSKNLLRSGTLVGSFKMDVGTVYSQPEHQFHH KWAILSDPDDISAGLKGYVKCDVAVVGKGDNIKTPHKANETDEDDIE GNLLLPEGVPPERQWARFYVKIYRAEGLPRMNTSLMANVKKAFIGE NKDLVDPYVQVFFAGQKGKTSVQKSSYEPLWNEQVVFTDLFPPLC KRMKVQIRDSDKVNDVAIGTHFIDLRKISNDGDKGFLPTLGPAWVN MYGSTRNYTLLDEHQDLNEGLGEGVSFRARLMLGLAVEILDTSNPE LTSSTEVQVEQATPVSESCTGRMEEFFLFGAFLEASMIDRKNGDKP ITFEVTIGNYGNEVDGMSRPLRPRPRKEPGDEEEVDLIQNSSDDEG DEAGDLASVSSTPPMRPQITDRNYFHLPYLERKPCIYIKSWWPDQR RRLYNANIMDHIADKLEEGLNDVQEMIKTEKSYPERRLRGVLEELSC GCHRFLSLSDKDQGRSSRTRLDRERLKSCMRELESMGQQAKSLR AQVKRHTVRDKLRSCQNFLQKLRFLADEPQHSIPDVFIWMMSNNK RIAYARVPSKDLLFSIVEEELGKDCAKVKTLFLKLPGKRGFGSAGWT VQAKLELYLWLGLSKQRKDFLCGLPCGFEEVKAAQGLGLHSFPPIS LVYTKKQAFQLRAHMYQARSLFAADSSGLSDPFARVFFINQSQCTE VLNETLCPTWDQMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGK ADFMGRTFAKPLVKMADEAYCPPRFPPQLEYYQIYRGSATAGDLLA AFELLQIGPSGKADLPPINGPVDMDRGPIMPVPVGIRPVLSKYRVEV LFWGLRDLKRVNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNT LVKWFEVDLPENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFI YRPPDRSAPNWNTTGEVVVSMEPEEPVKKLETMVKLDATSDAVVK VDVAEDEKERKKKKKKGPSEEPEEEEPDESMLDWWSKYFASIDTM KEQLRQHETSGTDLEEKEEMESAEGLKGPMKSKEKSRAAKEEKKK KNQSPGPGQGSEAPEKKKAKIDELKVYPKELESEFDSFEDWLHTF NLLRGKTGDDEDGSTEEERIVGRFKGSLCVYKVPLPEDVSREAGY DPTYGMFQGIPSNDPINVLVRIYVVRATDLHPADINGKADPYIAIKLG KTDIRDKENYISKQLNPVFGKSFDIEASFPMESMLTVAVYDWDLVG TDDLIGETKIDLENRFYSKHRATCGIAQTYSIHGYNIWRDPMKPSQIL TRLCKEGKVDGPHFGPHGRVRVANRVFTGPSEIEDENGQRKPTDE HVALSALRHWEDIPRVGCRLVPEHVETRPLLNPDKPGIEQGRLELW VDMFPMDMPAPGTPLDISPRKPKKYELRVIVWNTDEVVLEDDDFFT GEKSSDIFVRGWLKGQQEDKQDTDVHYHSLTGEGNFNWRYLFPF DYLAAEEKIVMSKKESMFSWDETEYKIPARLTLQIWDADHFSADDF LGAIELDLNRFPRGAKTAKQCTMEMATGEVDVPLVSIFKQKRVKGW WPLLARNENDEFELTGKVEAELHLLTAEEAEKNPVGLARNEPDPLE KPNRPDTAFVWFLNPLKSIKYLICTRYKWLIIKIVLALLGLLMLALFLY SLPGYMVKKLLGA 9 mOTOF-202_2 MALIVHLKTVSELRGKGDRIAKVTFRGQSFYSRVLENCEGVADFDE protein TFRWPVASSIDRNEVLEIQIFNYSKVFSNKLIGTFCMVLQKVVEENR (NP_001300696.1), VEVTDTLMDDSNAIIKTSLSMEVRYQATDGTVGPWDDGDFLGDESL mouse otoferlin QEEKDSQETDGLLPGSRPSTRISGEKSFRRAGRSVFSAMKLGKTR isoform 4, 1977 aa SHKEEPQRQDEPAVLEMEDLDHLAIQLGDGLDPDSVSLASVTALTS NVSNKRSKPDIKMEPSAGRPMDYQVSITVIEARQLVGLNMDPVVCV EVGDDKKYTSMKESTNCPYYNEYFVFDFHVSPDVMFDKIIKISVIHS KNLLRSGTLVGSFKMDVGTVYSQPEHQFHHKWAILSDPDDISAGLK GYVKCDVAVVGKGDNIKTPHKANETDEDDIEGNLLLPEGVPPERQ WARFYVKIYRAEGLPRMNTSLMANVKKAFIGENKDLVDPYVQVFFA GQKGKTSVQKSSYEPLWNEQVVFTDLFPPLCKRMKVQIRDSDKVN DVAIGTHFIDLRKISNDGDKGFLPTLGPAWVNMYGSTRNYTLLDEH QDLNEGLGEGVSFRARLMLGLAVEILDTSNPELTSSTEVQVEQATP VSESCTGRMEEFFLFGAFLEASMIDRKNGDKPITFEVTIGNYGNEV DGMSRPLRPRPRKEPGDEEEVDLIQNSSDDEGDEAGDLASVSSTP PMRPQITDRNYFHLPYLERKPCIYIKSWWPDQRRRLYNANIMDHIA DKLEEGLNDVQEMIKTEKSYPERRLRGVLEELSCGCHRFLSLSDKD QGRSSRTRLDRERLKSCMRELESMGQQAKSLRAQVKRHTVRDKL RSCQNFLQKLRFLADEPQHSIPDVFIWMMSNNKRIAYARVPSKDLL FSIVEEELGKDCAKVKTLFLKLPGKRGFGSAGWTVQAKLELYLWLG LSKQRKDFLCGLPCGFEEVKAAQGLGLHSFPPISLVYTKKQAFQLR AHMYQARSLFAADSSGLSDPFARVFFINQSQCTEVLNETLCPTWD QMLVFDNLELYGEAHELRDDPPIIVIEIYDQDSMGKADFMGRTFAKP LVKMADEAYCPPRFPPQLEYYQIYRGSATAGDLLAAFELLQIGPSG KADLPPINGPVDMDRGPIMPVPVGIRPVLSKYRVEVLFWGLRDLKR VNLAQVDRPRVDIECAGKGVQSSLIHNYKKNPNFNTLVKWFEVDLP ENELLHPPLNIRVVDCRAFGRYTLVGSHAVSSLRRFIYRPPDRSAP NWNTTGEVVVSMEPEEPVKKLETMVKLDATSDAVVKVDVAEDEKE RKKKKKKGPSEEPEEEEPDESMLDWWSKYFASIDTMKEQLRQHET SGTDLEEKEEMESAEGLKGPMKSKEKSRAAKEEKKKKNQSPGPG QGSEAPEKKKAKIDELKVYPKELESEFDSFEDWLHTFNLLRGKTGD DEDGSTEEERIVGRFKGSLCVYKVPLPEDVSREAGYDPTYGMFQGI PSNDPINVLVRIYVVRATDLHPADINGKADPYIAIKLGKTDIRDKENYI SKQLNPVFGKSFDIEASFPMESMLTVAVYDWDLVGTDDLIGETKIDL ENRFYSKHRATCGIAQTYSIHGYNIWRDPMKPSQILTRLCKEGKVD GPHFGPHGRVRVANRVFTGPSEIEDENGQRKPTDEHVALSALRHW EDIPRVGCRLVPEHVETRPLLNPDKPGIEQGRLELWVDMFPMDMP APGTPLDISPRKPKKYELRVIVWNTDEVVLEDDDFFTGEKSSDIFVR GWLKGQQEDKQDTDVHYHSLTGEGNFNWRYLFPFDYLAAEEKIV MSKKESMFSWDETEYKIPARLTLQIWDADHFSADDFLGAIELDLNR FPRGAKTAKQCTMEMATGEVDVPLVSIFKQKRVKGWWPLLARNEN DEFELTGKVEAELHLLTAEEAEKNPVGLARNEPDPLEKPNRPDTAF VWFLNPLKSIKYLICTRYKWLIIKIVLALLGLLMLALFLYSLPGYMVKK LLGA 10 OTOF-201 transcript ATCGGAGGGGGGTCGGGAGGAGGAGGAGGAGGCAGCGGCAG (NM_194248.1), AGAAGAGAGAGGCGTGTGAGCCGTGCTCCACCGGCTAGCTCCT human otoferlin TCCCGCTGCTCCTGCCTGGCAGTGCCAGGCAGCCCACACCAGC transcript variant 1, ATGGCCTTGCTCATCCACCTCAAGACAGTCTCGGAGCTGCGGG 7156 bp, encodes GCAGGGGCGACCGGATCGCCAAAGTGACTTTCCGAGGGCAATC the protein of SEQ CTTCTACTCTCGGGTCCTGGAGAACTGTGAGGATGTGGCTGACT ID NO: 1 TTGATGAGACATTTCGGTGGCCGGTGGCCAGCAGCATCGACAG AAATGAGATGCTGGAGATTCAGGTTTTCAACTACAGCAAAGTCTT CAGCAACAAGCTCATCGGGACCTTCCGCATGGTGCTGCAGAAG GTGGTAGAGGAGAGCCATGTGGAGGTGACTGACACGCTGATTG ATGACAACAATGCTATCATCAAGACCAGCCTGTGCGTGGAGGTC CGGTATCAGGCCACTGACGGCACAGTGGGCTCCTGGGACGATG GGGACTTCCTGGGAGATGAGTCTCTTCAAGAGGAAGAGAAGGA CAGCCAAGAGACGGATGGACTGCTCCCAGGCTCCCGGCCCAGC TCCCGGCCCCCAGGAGAGAAGAGCTTCCGGAGAGCCGGGAGG AGCGTGTTCTCCGCCATGAAGCTCGGCAAAAACCGGTCTCACAA GGAGGAGCCCCAAAGACCAGATGAACCGGCGGTGCTGGAGAT GGAAGACCTTGACCATCTGGCCATTCGGCTAGGAGATGGACTG GATCCCGACTCGGTGTCTCTAGCCTCAGTCACAGCTCTCACCAC TAATGTCTCCAACAAGCGATCTAAGCCAGACATTAAGATGGAGC CAAGTGCTGGGCGGCCCATGGATTACCAGGTCAGCATCACGGT GATCGAGGCCCGGCAGCTGGTGGGCTTGAACATGGACCCTGTG GTGTGCGTGGAGGTGGGTGACGACAAGAAGTACACATCCATGA AGGAGTCCACTAACTGCCCCTATTACAACGAGTACTTCGTCTTC GACTTCCATGTCTCTCCGGATGTCATGTTTGACAAGATCATCAAG ATTTCGGTGATTCACTCCAAGAACCTGCTGCGCAGTGGCACCCT GGTGGGCTCCTTCAAAATGGACGTGGGAACCGTGTACTCGCAG CCAGAGCACCAGTTCCATCACAAGTGGGCCATCCTGTCTGACCC CGATGACATCTCCTCGGGGCTGAAGGGCTACGTGAAGTGTGAC GTTGCCGTGGTGGGCAAAGGGGACAACATCAAGACGCCCCACA AGGCCAATGAGACCGACGAAGATGACATTGAGGGGAACTTGCT GCTCCCCGAGGGGGTGCCCCCCGAACGCCAGTGGGCCCGGTT CTATGTGAAAATTTACCGAGCAGAGGGGCTGCCCCGTATGAACA CAAGCCTCATGGCCAATGTAAAGAAGGCTTTCATCGGTGAAAAC AAGGACCTCGTGGACCCCTACGTGCAAGTCTTCTTTGCTGGCCA GAAGGGCAAGACTTCAGTGCAGAAGAGCAGCTATGAGCCCCTG TGGAATGAGCAGGTCGTCTTTACAGACCTCTTCCCCCCACTCTG CAAACGCATGAAGGTGCAGATCCGAGACTCGGACAAGGTCAAC GACGTGGCCATCGGCACCCACTTCATTGACCTGCGCAAGATTTC TAATGACGGAGACAAAGGCTTCCTGCCCACACTGGGCCCAGCC TGGGTGAACATGTACGGCTCCACACGTAACTACACGCTGCTGGA TGAGCATCAGGACCTGAACGAGGGCCTGGGGGAGGGTGTGTC CTTCCGGGCCCGGCTCCTGCTGGGCCTGGCTGTGGAGATCGTA GACACCTCCAACCCTGAGCTCACCAGCTCCACAGAGGTGCAGG TGGAGCAGGCCACGCCCATCTCGGAGAGCTGTGCAGGTAAAAT GGAAGAATTCTTTCTCTTTGGAGCCTTCCTGGAGGCCTCAATGA TCGACCGGAGAAACGGAGACAAGCCCATCACCTTTGAGGTCAC CATAGGCAACTATGGGAACGAAGTTGATGGCCTGTCCCGGCCC CAGCGGCCTCGGCCCCGGAAGGAGCCGGGGGATGAGGAAGAA GTAGACCTGATTCAGAACGCAAGTGATGACGAGGCCGGTGATG CCGGGGACCTGGCCTCAGTCTCCTCCACTCCACCAATGCGGCC CCAGGTCACCGACAGGAACTACTTCCATCTGCCCTACCTGGAGC GAAAGCCCTGCATCTACATCAAGAGCTGGTGGCCGGACCAGCG CCGCCGCCTCTACAATGCCAACATCATGGACCACATTGCCGACA AGCTGGAAGAAGGCCTGAACGACATACAGGAGATGATCAAAAC GGAGAAGTCCTACCCTGAGCGTCGCCTGCGGGGCGTCCTGGA GGAGCTGAGCTGTGGCTGCTGCCGCTTCCTCTCCCTCGCTGAC AAGGACCAGGGCCACTCATCCCGCACCAGGCTTGACCGGGAGC GCCTCAAGTCCTGCATGAGGGAGCTGGAAAACATGGGGCAGCA GGCCAGGATGCTGCGGGCCCAGGTGAAGCGGCACACGGTGCG GGACAAGCTGAGGCTGTGCCAGAACTTCCTGCAGAAGCTGCGC TTCCTGGCGGACGAGCCCCAGCACAGCATTCCCGACATCTTCAT CTGGATGATGAGCAACAACAAGCGTGTCGCCTATGCCCGTGTG CCCTCCAAGGACCTGCTCTTCTCCATCGTGGAGGAGGAGACTG GCAAGGACTGCGCCAAGGTCAAGACGCTCTTCCTTAAGCTGCC AGGGAAGCGGGGCTTCGGCTCGGCAGGCTGGACAGTGCAGGC CAAGGTGGAGCTGTACCTGTGGCTGGGCCTCAGCAAACAGCGC AAGGAGTTCCTGTGCGGCCTGCCCTGTGGCTTCCAGGAGGTCA AGGCAGCCCAGGGCCTGGGCCTGCATGCCTTCCCACCCGTCAG CCTGGTCTACACCAAGAAGCAGGCGTTCCAGCTCCGAGCGCAC ATGTACCAGGCCCGCAGCCTCTTTGCCGCCGACAGCAGCGGAC TCTCAGACCCCTTTGCCCGCGTCTTCTTCATCAATCAGAGTCAG TGCACAGAGGTGCTGAATGAGACCCTGTGTCCCACCTGGGACC AGATGCTGGTGTTCGACAACCTGGAGCTCTATGGTGAAGCTCAT GAGCTGAGGGACGATCCGCCCATCATTGTCATTGAAATCTATGA CCAGGATTCCATGGGCAAAGCTGACTTCATGGGCCGGACCTTC GCCAAACCCCTGGTGAAGATGGCAGACGAGGCGTACTGCCCAC CCCGCTTCCCACCTCAGCTCGAGTACTACCAGATCTACCGTGGC AACGCCACAGCTGGAGACCTGCTGGCGGCCTTCGAGCTGCTGC AGATTGGACCAGCAGGGAAGGCTGACCTGCCCCCCATCAATGG CCCGGTGGACGTGGACCGAGGTCCCATCATGCCCGTGCCCATG GGCATCCGGCCCGTGCTCAGCAAGTACCGAGTGGAGGTGCTGT TCTGGGGCCTACGGGACCTAAAGCGGGTGAACCTGGCCCAGGT GGACCGGCCACGGGTGGACATCGAGTGTGCAGGGAAGGGGGT GCAGTCGTCCCTGATCCACAATTATAAGAAGAACCCCAACTTCA ACACCCTCGTCAAGTGGTTTGAAGTGGACCTCCCAGAGAACGA GCTGCTGCACCCGCCCTTGAACATCCGTGTGGTGGACTGCCGG GCCTTCGGTCGCTACACACTGGTGGGCTCCCATGCCGTCAGCT CCCTGCGACGCTTCATCTACCGGCCCCCAGACCGCTCGGCCCC CAGCTGGAACACCACGGTCAGGCTTCTCCGGCGCTGCCGTGTG CTGTGCAATGGGGGCTCCTCCTCTCACTCCACAGGGGAGGTTG TGGTGACTATGGAGCCAGAGGTACCCATCAAGAAACTGGAGAC CATGGTGAAGCTGGACGCGACTTCTGAAGCTGTTGTCAAGGTG GATGTGGCTGAGGAGGAGAAGGAGAAGAAGAAGAAGAAGAAGG GCACTGCGGAGGAGCCAGAGGAGGAGGAGCCAGACGAGAGCA TGCTGGACTGGTGGTCCAAGTACTTTGCCTCCATTGACACCATG AAGGAGCAACTTCGACAACAAGAGCCCTCTGGAATTGACTTGGA GGAGAAGGAGGAAGTGGACAATACCGAGGGCCTGAAGGGGTC AATGAAGGGCAAGGAGAAGGCAAGGGCTGCCAAAGAGGAGAAG AAGAAGAAAACTCAGAGCTCTGGCTCTGGCCAGGGGTCCGAGG CCCCCGAGAAGAAGAAACCCAAGATTGATGAGCTTAAGGTATAC CCCAAAGAGCTGGAGTCCGAGTTTGATAACTTTGAGGACTGGCT GCACACTTTCAACTTGCTTCGGGGCAAGACCGGGGATGATGAG GATGGCTCCACCGAGGAGGAGCGCATTGTGGGACGCTTCAAGG GCTCCCTCTGCGTGTACAAAGTGCCACTCCCAGAGGACGTGTC CCGGGAAGCCGGCTACGACTCCACCTACGGCATGTTCCAGGGC ATCCCGAGCAATGACCCCATCAATGTGCTGGTCCGAGTCTATGT GGTCCGGGCCACGGACCTGCACCCTGCTGACATCAACGGCAAA GCTGACCCCTACATCGCCATCCGGCTAGGCAAGACTGACATCC GCGACAAGGAGAACTACATCTCCAAGCAGCTCAACCCTGTCTTT GGGAAGTCCTTTGACATCGAGGCCTCCTTCCCCATGGAATCCAT GCTGACGGTGGCTGTGTATGACTGGGACCTGGTGGGCACTGAT GACCTCATTGGGGAAACCAAGATCGACCTGGAGAACCGCTTCTA CAGCAAGCACCGCGCCACCTGCGGCATCGCCCAGACCTACTCC ACACATGGCTACAATATCTGGCGGGACCCCATGAAGCCCAGCC AGATCCTGACCCGCCTCTGCAAAGACGGCAAAGTGGACGGCCC CCACTTTGGGCCCCCTGGGAGAGTGAAGGTGGCCAACCGCGTC TTCACTGGGCCCTCTGAGATTGAGGACGAGAACGGTCAGAGGA AGCCCACAGACGAGCATGTGGCGCTGTTGGCCCTGAGGCACTG GGAGGACATCCCCCGCGCAGGCTGCCGCCTGGTGCCAGAGCA TGTGGAGACGAGGCCGCTGCTCAACCCCGACAAGCCGGGCATC GAGCAGGGCCGCCTGGAGCTGTGGGTGGACATGTTCCCCATGG ACATGCCAGCCCCTGGGACGCCTCTGGACATCTCACCTCGGAA GCCCAAGAAGTACGAGCTGCGGGTCATCATCTGGAACACAGAT GAGGTGGTCTTGGAGGACGACGACTTCTTCACAGGGGAGAAGT CCAGTGACATCTTCGTGAGGGGGTGGCTGAAGGGCCAGCAGGA GGACAAGCAGGACACAGACGTCCACTACCACTCCCTCACTGGC GAGGGCAACTTCAACTGGCGCTACCTGTTCCCCTTCGACTACCT GGCGGCGGAGGAGAAGATCGTCATCTCCAAGAAGGAGTCCATG TTCTCCTGGGACGAGACCGAGTACAAGATCCCCGCGCGGCTCA CCCTGCAGATCTGGGATGCGGACCACTTCTCCGCTGACGACTT CCTGGGGGCCATCGAGCTGGACCTGAACCGGTTCCCGCGGGG CGCAAAGACAGCCAAGCAGTGCACCATGGAGATGGCCACCGGG GAGGTGGACGTGCCCCTCGTGTCCATCTTCAAGCAAAAGCGCG TCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGAACGATGA GTTTGAGCTCACGGGCAAGGTGGAGGCTGAGCTGCATTTACTG ACAGCAGAGGAGGCAGAGAAGAACCCAGTGGGCCTGGCCCGC AATGAACCTGACCCCCTAGAGAAACCCAACCGGCCCGACACGA GCTTCATCTGGTTCCTGAACCCTCTCAAGTCGGCTCGCTACTTC TTGTGGCACACGTATCGCTGGCTGCTCCTCAAACTGTTGCTGCT CCTGCTGCTGCTCCTCCTCCTCGCCCTGTTCCTCTACTCTGTGC CTGGCTACCTGGTCAAGAAAATCCTCGGGGCCTGAGCCCAGTG GCCTCCTGGCCGGCCCGACACGGCCTTCGTCTGGTTCCTCAAC CCTCTCAAGTCCATCAAGTACCTCATCTGCACCCGGTACAAGTG GCTCATCATCAAGATCGTGCTGGCGCTGTTGGGGCTGCTCATGT TGGGGCTCTTCCTCTACAGCCTCCCTGGCTACATGGTCAAAAAG CTCCTTGGGGCATGAAGGCCGCCAGCTCCCGCCAGCCGCTCCC CAGCCCTGCCGCATTTCCTTTCAGTGGCTTGGACTCTTTCCCAT CTCCCCTGGGGAGCCTGAGGAGCCCAGCGTCCACTCTTCATGC CTTGGGCCGAGCCTGCCTCCTGCTTGCGGGGGCCGCCTGTCCT CACTGCCCCAGGCTGCGGCTTGCCCAGTCCCGCCCCTCTGACC CCTGCCTGTGGGCTGGGGAGCCTTGGATGGGGTGGGGACCTG GAATGGGTCTCTCTTGCCCCACCTGGCTGAGGCGCCACCCTTC TTCAGGCCCAGGCTCCAGAGGAAGACTCCTGAAACCCTCCCCA GGTCTTCCAAGTACAGGATTGAAGCTTTAGTGAAATTAACCAAG GACCATGGGTCAGTGCCCAGGGCTTTAAAAAGAATGAACGAGC AAAAGGTATCCCCGCCGTGACCCCTGCAGATAGCACCGGTCTTT GATCCGCAGCAGGGGCCAGACCCTGCCCACAAGTCCCAGCGC GGCTGCTTCTGCCACTGCTGGGCTCCACTTGGCTCCTCTCACTT CCCAGGGGGTCGCCTGTCCTGCCTGTGGGTTTCCATGGCTTCC CAGAGCTCCCTCTGCCCCAGCCAGCGCCTCCAGGCCCAGCTGA GGAGCTGTGAGAAGCAGCAGAGGGGACTCCCCATCCCGGGCA CACCCTGTCCTCCCACCCCTGCCCCCTTGCCCTTCCAGCCCTTT CAGCTGCAGCTGGGAGCTGGCCCGTCAAGTGCTGCCCCTGCCT GTGTCTGGGTTTCTGTTGGCTGTTTTTCTTTTCTTGAGTGGTGAT TTTTCTCTAAATAAAAGAAGTCAAGCACTGAAAAAAAAAAAAAAA A 11 OTOF-202 transcript CCGTGAGTTCTGCCCAGGCCCTGTGAGCTCACCAGAGCCACAG (NM_004802.3), ACTCACAGCCCAGAGGTGGCTTCTTCCTTCAGGAACTGAAGAAC human otoferlin CCCCATGAACACCAACATCTCCAGGTTCTGAGAACAGAACCTGG transcript variant 2, GAAATTGATGACTTCCTCATGATGACCGATACTCAGGATGGCCC 4954 bp, encodes TAGCGAGAGCTCCCAGATCATGAGGAAGAAGGCCTGAACGACA the protein of SEQ TACAGGAGATGATCAAAACGGAGAAGTCCTACCCTGAGCGTCG ID NO: 2 CCTGCGGGGCGTCCTGGAGGAGCTGAGCTGTGGCTGCTGCCG CTTCCTCTCCCTCGCTGACAAGGACCAGGGCCACTCATCCCGC ACCAGGCTTGACCGGGAGCGCCTCAAGTCCTGCATGAGGGAGC TGGAAAACATGGGGCAGCAGGCCAGGATGCTGCGGGCCCAGG TGAAGCGGCACACGGTGCGGGACAAGCTGAGGCTGTGCCAGAA CTTCCTGCAGAAGCTGCGCTTCCTGGCGGACGAGCCCCAGCAC AGCATTCCCGACATCTTCATCTGGATGATGAGCAACAACAAGCG TGTCGCCTATGCCCGTGTGCCCTCCAAGGACCTGCTCTTCTCCA TCGTGGAGGAGGAGACTGGCAAGGACTGCGCCAAGGTCAAGAC GCTCTTCCTTAAGCTGCCAGGGAAGCGGGGCTTCGGCTCGGCA GGCTGGACAGTGCAGGCCAAGGTGGAGCTGTACCTGTGGCTGG GCCTCAGCAAACAGCGCAAGGAGTTCCTGTGCGGCCTGCCCTG TGGCTTCCAGGAGGTCAAGGCAGCCCAGGGCCTGGGCCTGCAT GCCTTCCCACCCGTCAGCCTGGTCTACACCAAGAAGCAGGCGT TCCAGCTCCGAGCGCACATGTACCAGGCCCGCAGCCTCTTTGC CGCCGACAGCAGCGGACTCTCAGACCCCTTTGCCCGCGTCTTC TTCATCAATCAGAGTCAGTGCACAGAGGTGCTGAATGAGACCCT GTGTCCCACCTGGGACCAGATGCTGGTGTTCGACAACCTGGAG CTCTATGGTGAAGCTCATGAGCTGAGGGACGATCCGCCCATCAT TGTCATTGAAATCTATGACCAGGATTCCATGGGCAAAGCTGACT TCATGGGCCGGACCTTCGCCAAACCCCTGGTGAAGATGGCAGA CGAGGCGTACTGCCCACCCCGCTTCCCACCTCAGCTCGAGTAC TACCAGATCTACCGTGGCAACGCCACAGCTGGAGACCTGCTGG CGGCCTTCGAGCTGCTGCAGATTGGACCAGCAGGGAAGGCTGA CCTGCCCCCCATCAATGGCCCGGTGGACGTGGACCGAGGTCCC ATCATGCCCGTGCCCATGGGCATCCGGCCCGTGCTCAGCAAGT ACCGAGTGGAGGTGCTGTTCTGGGGCCTACGGGACCTAAAGCG GGTGAACCTGGCCCAGGTGGACCGGCCACGGGTGGACATCGA GTGTGCAGGGAAGGGGGTGCAGTCGTCCCTGATCCACAATTAT AAGAAGAACCCCAACTTCAACACCCTCGTCAAGTGGTTTGAAGT GGACCTCCCAGAGAACGAGCTGCTGCACCCGCCCTTGAACATC CGTGTGGTGGACTGCCGGGCCTTCGGTCGCTACACACTGGTGG GCTCCCATGCCGTCAGCTCCCTGCGACGCTTCATCTACCGGCC CCCAGACCGCTCGGCCCCCAGCTGGAACACCACGGGGGAGGT TGTGGTGACTATGGAGCCAGAGGTACCCATCAAGAAACTGGAG ACCATGGTGAAGCTGGACGCGACTTCTGAAGCTGTTGTCAAGGT GGATGTGGCTGAGGAGGAGAAGGAGAAGAAGAAGAAGAAGAAG GGCACTGCGGAGGAGCCAGAGGAGGAGGAGCCAGACGAGAGC ATGCTGGACTGGTGGTCCAAGTACTTTGCCTCCATTGACACCAT GAAGGAGCAACTTCGACAACAAGAGCCCTCTGGAATTGACTTGG AGGAGAAGGAGGAAGTGGACAATACCGAGGGCCTGAAGGGGT CAATGAAGGGCAAGGAGAAGGCAAGGGCTGCCAAAGAGGAGAA GAAGAAGAAAACTCAGAGCTCTGGCTCTGGCCAGGGGTCCGAG GCCCCCGAGAAGAAGAAACCCAAGATTGATGAGCTTAAGGTATA CCCCAAAGAGCTGGAGTCCGAGTTTGATAACTTTGAGGACTGGC TGCACACTTTCAACTTGCTTCGGGGCAAGACCGGGGATGATGA GGATGGCTCCACCGAGGAGGAGCGCATTGTGGGACGCTTCAAG GGCTCCCTCTGCGTGTACAAAGTGCCACTCCCAGAGGACGTGT CCCGGGAAGCCGGCTACGACTCCACCTACGGCATGTTCCAGGG CATCCCGAGCAATGACCCCATCAATGTGCTGGTCCGAGTCTATG TGGTCCGGGCCACGGACCTGCACCCTGCTGACATCAACGGCAA AGCTGACCCCTACATCGCCATCCGGCTAGGCAAGACTGACATC CGCGACAAGGAGAACTACATCTCCAAGCAGCTCAACCCTGTCTT TGGGAAGTCCTTTGACATCGAGGCCTCCTTCCCCATGGAATCCA TGCTGACGGTGGCTGTGTATGACTGGGACCTGGTGGGCACTGA TGACCTCATTGGGGAAACCAAGATCGACCTGGAGAACCGCTTCT ACAGCAAGCACCGCGCCACCTGCGGCATCGCCCAGACCTACTC CACACATGGCTACAATATCTGGCGGGACCCCATGAAGCCCAGC CAGATCCTGACCCGCCTCTGCAAAGACGGCAAAGTGGACGGCC CCCACTTTGGGCCCCCTGGGAGAGTGAAGGTGGCCAACCGCGT CTTCACTGGGCCCTCTGAGATTGAGGACGAGAACGGTCAGAGG AAGCCCACAGACGAGCATGTGGCGCTGTTGGCCCTGAGGCACT GGGAGGACATCCCCCGCGCAGGCTGCCGCCTGGTGCCAGAGC ATGTGGAGACGAGGCCGCTGCTCAACCCCGACAAGCCGGGCAT CGAGCAGGGCCGCCTGGAGCTGTGGGTGGACATGTTCCCCATG GACATGCCAGCCCCTGGGACGCCTCTGGACATCTCACCTCGGA AGCCCAAGAAGTACGAGCTGCGGGTCATCATCTGGAACACAGA TGAGGTGGTCTTGGAGGACGACGACTTCTTCACAGGGGAGAAG TCCAGTGACATCTTCGTGAGGGGGTGGCTGAAGGGCCAGCAGG AGGACAAGCAGGACACAGACGTCCACTACCACTCCCTCACTGG CGAGGGCAACTTCAACTGGCGCTACCTGTTCCCCTTCGACTACC TGGCGGCGGAGGAGAAGATCGTCATCTCCAAGAAGGAGTCCAT GTTCTCCTGGGACGAGACCGAGTACAAGATCCCCGCGCGGCTC ACCCTGCAGATCTGGGATGCGGACCACTTCTCCGCTGACGACTT CCTGGGGGCCATCGAGCTGGACCTGAACCGGTTCCCGCGGGG CGCAAAGACAGCCAAGCAGTGCACCATGGAGATGGCCACCGGG GAGGTGGACGTGCCCCTCGTGTCCATCTTCAAGCAAAAGCGCG TCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGAACGATGA GTTTGAGCTCACGGGCAAGGTGGAGGCTGAGCTGCATTTACTG ACAGCAGAGGAGGCAGAGAAGAACCCAGTGGGCCTGGCCCGC AATGAACCTGACCCCCTAGAGAAACCCAACCGGCCCGACACGA GCTTCATCTGGTTCCTGAACCCTCTCAAGTCGGCTCGCTACTTC TTGTGGCACACGTATCGCTGGCTGCTCCTCAAACTGTTGCTGCT CCTGCTGCTGCTCCTCCTCCTCGCCCTGTTCCTCTACTCTGTGC CTGGCTACCTGGTCAAGAAAATCCTCGGGGCCTGAGCCCAGTG GCCTCCTGGCCGGCCCGACACGGCCTTCGTCTGGTTCCTCAAC CCTCTCAAGTCCATCAAGTACCTCATCTGCACCCGGTACAAGTG GCTCATCATCAAGATCGTGCTGGCGCTGTTGGGGCTGCTCATGT TGGGGCTCTTCCTCTACAGCCTCCCTGGCTACATGGTCAAAAAG CTCCTTGGGGCATGAAGGCCGCCAGCTCCCGCCAGCCGCTCCC CAGCCCTGCCGCATTTCCTTTCAGTGGCTTGGACTCTTTCCCAT CTCCCCTGGGGAGCCTGAGGAGCCCAGCGTCCACTCTTCATGC CTTGGGCCGAGCCTGCCTCCTGCTTGCGGGGGCCGCCTGTCCT CACTGCCCCAGGCTGCGGCTTGCCCAGTCCCGCCCCTCTGACC CCTGCCTGTGGGCTGGGGAGCCTTGGATGGGGTGGGGACCTG GAATGGGTCTCTCTTGCCCCACCTGGCTGAGGCGCCACCCTTC TTCAGGCCCAGGCTCCAGAGGAAGACTCCTGAAACCCTCCCCA GGTCTTCCAAGTACAGGATTGAAGCTTTAGTGAAATTAACCAAG GACCATGGGTCAGTGCCCAGGGCTTTAAAAAGAATGAACGAGC AAAAGGTATCCCCGCCGTGACCCCTGCAGATAGCACCGGTCTTT GATCCGCAGCAGGGGCCAGACCCTGCCCACAAGTCCCAGCGC GGCTGCTTCTGCCACTGCTGGGCTCCACTTGGCTCCTCTCACTT CCCAGGGGGTCGCCTGTCCTGCCTGTGGGTTTCCATGGCTTCC CAGAGCTCCCTCTGCCCCAGCCAGCGCCTCCAGGCCCAGCTGA GGAGCTGTGAGAAGCAGCAGAGGGGACTCCCCATCCCGGGCA CACCCTGTCCTCCCACCCCTGCCCCCTTGCCCTTCCAGCCCTTT CAGCTGCAGCTGGGAGCTGGCCCGTCAAGTGCTGCCCCTGCCT GTGTCTGGGTTTCTGTTGGCTGTTTTTCTTTTCTTGAGTGGTGAT TTTTCTCTAAATAAAAGAAGTCAAGCACTGAAAAAAAAAAAAAAA A 12 OTOF-203 transcript CCGTGAGTTCTGCCCAGGCCCTGTGAGCTCACCAGAGCCACAG (NM_194323.2), ACTCACAGCCCAGAGGTGGCTTCTTCCTTCAGGAACTGAAGAAC human otoferlin CCCCATGAACACCAACATCTCCAGGTTCTGAGAACAGAACCTGG transcript variant 4, GAAATTGATGACTTCCTCATGATGACCGATACTCAGGATGGCCC 4756 bp, encodes TAGCGAGAGCTCCCAGATCATGAGGAAGAAGGCCTGAACGACA the protein of SEQ TACAGGAGATGATCAAAACGGAGAAGTCCTACCCTGAGCGTCG ID NO: 3 CCTGCGGGGCGTCCTGGAGGAGCTGAGCTGTGGCTGCTGCCG CTTCCTCTCCCTCGCTGACAAGGACCAGGGCCACTCATCCCGC ACCAGGCTTGACCGGGAGCGCCTCAAGTCCTGCATGAGGGAGC TGGAAAACATGGGGCAGCAGGCCAGGATGCTGCGGGCCCAGG TGAAGCGGCACACGGTGCGGGACAAGCTGAGGCTGTGCCAGAA CTTCCTGCAGAAGCTGCGCTTCCTGGCGGACGAGCCCCAGCAC AGCATTCCCGACATCTTCATCTGGATGATGAGCAACAACAAGCG TGTCGCCTATGCCCGTGTGCCCTCCAAGGACCTGCTCTTCTCCA TCGTGGAGGAGGAGACTGGCAAGGACTGCGCCAAGGTCAAGAC GCTCTTCCTTAAGCTGCCAGGGAAGCGGGGCTTCGGCTCGGCA GGCTGGACAGTGCAGGCCAAGGTGGAGCTGTACCTGTGGCTGG GCCTCAGCAAACAGCGCAAGGAGTTCCTGTGCGGCCTGCCCTG TGGCTTCCAGGAGGTCAAGGCAGCCCAGGGCCTGGGCCTGCAT GCCTTCCCACCCGTCAGCCTGGTCTACACCAAGAAGCAGGCGT TCCAGCTCCGAGCGCACATGTACCAGGCCCGCAGCCTCTTTGC CGCCGACAGCAGCGGACTCTCAGACCCCTTTGCCCGCGTCTTC TTCATCAATCAGAGTCAGTGCACAGAGGTGCTGAATGAGACCCT GTGTCCCACCTGGGACCAGATGCTGGTGTTCGACAACCTGGAG CTCTATGGTGAAGCTCATGAGCTGAGGGACGATCCGCCCATCAT TGTCATTGAAATCTATGACCAGGATTCCATGGGCAAAGCTGACT TCATGGGCCGGACCTTCGCCAAACCCCTGGTGAAGATGGCAGA CGAGGCGTACTGCCCACCCCGCTTCCCACCTCAGCTCGAGTAC TACCAGATCTACCGTGGCAACGCCACAGCTGGAGACCTGCTGG CGGCCTTCGAGCTGCTGCAGATTGGACCAGCAGGGAAGGCTGA CCTGCCCCCCATCAATGGCCCGGTGGACGTGGACCGAGGTCCC ATCATGCCCGTGCCCATGGGCATCCGGCCCGTGCTCAGCAAGT ACCGAGTGGAGGTGCTGTTCTGGGGCCTACGGGACCTAAAGCG GGTGAACCTGGCCCAGGTGGACCGGCCACGGGTGGACATCGA GTGTGCAGGGAAGGGGGTGCAGTCGTCCCTGATCCACAATTAT AAGAAGAACCCCAACTTCAACACCCTCGTCAAGTGGTTTGAAGT GGACCTCCCAGAGAACGAGCTGCTGCACCCGCCCTTGAACATC CGTGTGGTGGACTGCCGGGCCTTCGGTCGCTACACACTGGTGG GCTCCCATGCCGTCAGCTCCCTGCGACGCTTCATCTACCGGCC CCCAGACCGCTCGGCCCCCAGCTGGAACACCACGGGGGAGGT TGTGGTGACTATGGAGCCAGAGGTACCCATCAAGAAACTGGAG ACCATGGTGAAGCTGGACGCGACTTCTGAAGCTGTTGTCAAGGT GGATGTGGCTGAGGAGGAGAAGGAGAAGAAGAAGAAGAAGAAG GGCACTGCGGAGGAGCCAGAGGAGGAGGAGCCAGACGAGAGC ATGCTGGACTGGTGGTCCAAGTACTTTGCCTCCATTGACACCAT GAAGGAGCAACTTCGACAACAAGAGCCCTCTGGAATTGACTTGG AGGAGAAGGAGGAAGTGGACAATACCGAGGGCCTGAAGGGGT CAATGAAGGGCAAGGAGAAGGCAAGGGCTGCCAAAGAGGAGAA GAAGAAGAAAACTCAGAGCTCTGGCTCTGGCCAGGGGTCCGAG GCCCCCGAGAAGAAGAAACCCAAGATTGATGAGCTTAAGGTATA CCCCAAAGAGCTGGAGTCCGAGTTTGATAACTTTGAGGACTGGC TGCACACTTTCAACTTGCTTCGGGGCAAGACCGGGGATGATGA GGATGGCTCCACCGAGGAGGAGCGCATTGTGGGACGCTTCAAG GGCTCCCTCTGCGTGTACAAAGTGCCACTCCCAGAGGACGTGT CCCGGGAAGCCGGCTACGACTCCACCTACGGCATGTTCCAGGG CATCCCGAGCAATGACCCCATCAATGTGCTGGTCCGAGTCTATG TGGTCCGGGCCACGGACCTGCACCCTGCTGACATCAACGGCAA AGCTGACCCCTACATCGCCATCCGGCTAGGCAAGACTGACATC CGCGACAAGGAGAACTACATCTCCAAGCAGCTCAACCCTGTCTT TGGGAAGTCCTTTGACATCGAGGCCTCCTTCCCCATGGAATCCA TGCTGACGGTGGCTGTGTATGACTGGGACCTGGTGGGCACTGA TGACCTCATTGGGGAAACCAAGATCGACCTGGAGAACCGCTTCT ACAGCAAGCACCGCGCCACCTGCGGCATCGCCCAGACCTACTC CACACATGGCTACAATATCTGGCGGGACCCCATGAAGCCCAGC CAGATCCTGACCCGCCTCTGCAAAGACGGCAAAGTGGACGGCC CCCACTTTGGGCCCCCTGGGAGAGTGAAGGTGGCCAACCGCGT CTTCACTGGGCCCTCTGAGATTGAGGACGAGAACGGTCAGAGG AAGCCCACAGACGAGCATGTGGCGCTGTTGGCCCTGAGGCACT GGGAGGACATCCCCCGCGCAGGCTGCCGCCTGGTGCCAGAGC ATGTGGAGACGAGGCCGCTGCTCAACCCCGACAAGCCGGGCAT CGAGCAGGGCCGCCTGGAGCTGTGGGTGGACATGTTCCCCATG GACATGCCAGCCCCTGGGACGCCTCTGGACATCTCACCTCGGA AGCCCAAGAAGTACGAGCTGCGGGTCATCATCTGGAACACAGA TGAGGTGGTCTTGGAGGACGACGACTTCTTCACAGGGGAGAAG TCCAGTGACATCTTCGTGAGGGGGTGGCTGAAGGGCCAGCAGG AGGACAAGCAGGACACAGACGTCCACTACCACTCCCTCACTGG CGAGGGCAACTTCAACTGGCGCTACCTGTTCCCCTTCGACTACC TGGCGGCGGAGGAGAAGATCGTCATCTCCAAGAAGGAGTCCAT GTTCTCCTGGGACGAGACCGAGTACAAGATCCCCGCGCGGCTC ACCCTGCAGATCTGGGATGCGGACCACTTCTCCGCTGACGACTT CCTGGGGGCCATCGAGCTGGACCTGAACCGGTTCCCGCGGGG CGCAAAGACAGCCAAGCAGTGCACCATGGAGATGGCCACCGGG GAGGTGGACGTGCCCCTCGTGTCCATCTTCAAGCAAAAGCGCG TCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGAACGATGA GTTTGAGCTCACGGGCAAGGTGGAGGCTGAGCTGCATTTACTG ACAGCAGAGGAGGCAGAGAAGAACCCAGTGGGCCTGGCCCGC AATGAACCTGACCCCCTAGAGAAACCCAACCGGCCCGACACGG CCTTCGTCTGGTTCCTCAACCCTCTCAAGTCCATCAAGTACCTCA TCTGCACCCGGTACAAGTGGCTCATCATCAAGATCGTGCTGGCG CTGTTGGGGCTGCTCATGTTGGGGCTCTTCCTCTACAGCCTCCC TGGCTACATGGTCAAAAAGCTCCTTGGGGCATGAAGGCCGCCA GCTCCCGCCAGCCGCTCCCCAGCCCTGCCGCATTTCCTTTCAG TGGCTTGGACTCTTTCCCATCTCCCCTGGGGAGCCTGAGGAGC CCAGCGTCCACTCTTCATGCCTTGGGCCGAGCCTGCCTCCTGC TTGCGGGGGCCGCCTGTCCTCACTGCCCCAGGCTGCGGCTTGC CCAGTCCCGCCCCTCTGACCCCTGCCTGTGGGCTGGGGAGCCT TGGATGGGGTGGGGACCTGGAATGGGTCTCTCTTGCCCCACCT GGCTGAGGCGCCACCCTTCTTCAGGCCCAGGCTCCAGAGGAAG ACTCCTGAAACCCTCCCCAGGTCTTCCAAGTACAGGATTGAAGC TTTAGTGAAATTAACCAAGGACCATGGGTCAGTGCCCAGGGCTT TAAAAAGAATGAACGAGCAAAAGGTATCCCCGCCGTGACCCCTG CAGATAGCACCGGTCTTTGATCCGCAGCAGGGGCCAGACCCTG CCCACAAGTCCCAGCGCGGCTGCTTCTGCCACTGCTGGGCTCC ACTTGGCTCCTCTCACTTCCCAGGGGGTCGCCTGTCCTGCCTGT GGGTTTCCATGGCTTCCCAGAGCTCCCTCTGCCCCAGCCAGCG CCTCCAGGCCCAGCTGAGGAGCTGTGAGAAGCAGCAGAGGGG ACTCCCCATCCCGGGCACACCCTGTCCTCCCACCCCTGCCCCC TTGCCCTTCCAGCCCTTTCAGCTGCAGCTGGGAGCTGGCCCGT CAAGTGCTGCCCCTGCCTGTGTCTGGGTTTCTGTTGGCTGTTTT TCTTTTCTTGAGTGGTGATTTTTCTCTAAATAAAAGAAGTCAAGC ACTGAAAAAAAAAAAAAAAA 13 OTOF-208 transcript CCGTGAGTTCTGCCCAGGCCCTGTGAGCTCACCAGAGCCACAG (NM_194322.2), ACTCACAGCCCAGAGGTGGCTTCTTCCTTCAGGAACTGAAGAAC human otoferlin CCCCATGAACACCAACATCTCCAGGTTCTGAGAACAGAACCTGG transcript variant 3, GAAATTGATGACTTCCTCATGATGACCGATACTCAGGATGGCCC 3924 bp, encodes TAGCGAGAGCTCCCAGATCATGAGGTCCCTCACTCCCCTGATCA the protein of SEQ ACAGGGAGGAGGCATTTGGGGAGGCTGGGGAGGCGGGGCTGT ID NO: 4 GGCCCAGCATCACCCACACTCCTGATTCACAGGAAGAAGGCCT GAACGACATACAGGAGATGATCAAAACGGAGAAGTCCTACCCTG AGCGTCGCCTGCGGGGCGTCCTGGAGGAGCTGAGCTGTGGCT GCTGCCGCTTCCTCTCCCTCGCTGACAAGGACCAGGGCCACTC ATCCCGCACCAGGCTTGACCGGGAGCGCCTCAAGTCCTGCATG AGGGAGCTGGAAAACATGGGGCAGCAGGCCAGGATGCTGCGG GCCCAGGTGAAGCGGCACACGGTGCGGGACAAGCTGAGGCTG TGCCAGAACTTCCTGCAGAAGCTGCGCTTCCTGGCGGACGAGC CCCAGCACAGCATTCCCGACATCTTCATCTGGATGATGAGCAAC AACAAGCGTGTCGCCTATGCCCGTGTGCCCTCCAAGGACCTGC TCTTCTCCATCGTGGAGGAGGAGACTGGCAAGGACTGCGCCAA GGTCAAGACGCTCTTCCTTAAGCTGCCAGGGAAGCGGGGCTTC GGCTCGGCAGGCTGGACAGTGCAGGCCAAGGTGGAGCTGTAC CTGTGGCTGGGCCTCAGCAAACAGCGCAAGGAGTTCCTGTGCG GCCTGCCCTGTGGCTTCCAGGAGGTCAAGGCAGCCCAGGGCCT GGGCCTGCATGCCTTCCCACCCGTCAGCCTGGTCTACACCAAG AAGCAGGCGTTCCAGCTCCGAGCGCACATGTACCAGGCCCGCA GCCTCTTTGCCGCCGACAGCAGCGGACTCTCAGACCCCTTTGC CCGCGTCTTCTTCATCAATCAGAGTCAGTGCACAGAGGTGCTGA ATGAGACCCTGTGTCCCACCTGGGACCAGATGCTGGTGTTCGA CAACCTGGAGCTCTATGGTGAAGCTCATGAGCTGAGGGACGAT CCGCCCATCATTGTCATTGAAATCTATGACCAGGATTCCATGGG CAAAGCTGACTTCATGGGCCGGACCTTCGCCAAACCCCTGGTG AAGATGGCAGACGAGGCGTACTGCCCACCCCGCTTCCCACCTC AGCTCGAGTACTACCAGATCTACCGTGGCAACGCCACAGCTGG AGACCTGCTGGCGGCCTTCGAGCTGCTGCAGATTGGACCAGCA GGGAAGGCTGACCTGCCCCCCATCAATGGCCCGGTGGACGTG GACCGAGGTCCCATCATGCCCGTGCCCATGGGCATCCGGCCCG TGCTCAGCAAGTACCGAGTGGAGGTGCTGTTCTGGGGCCTACG GGACCTAAAGCGGGTGAACCTGGCCCAGGTGGACCGGCCACG GGTGGACATCGAGTGTGCAGGGAAGGGGGTGCAGTCGTCCCT GATCCACAATTATAAGAAGAACCCCAACTTCAACACCCTCGTCAA GTGGTTTGAAGTGGACCTCCCAGAGAACGAGCTGCTGCACCCG CCCTTGAACATCCGTGTGGTGGACTGCCGGGCCTTCGGTCGCT ACACACTGGTGGGCTCCCATGCCGTCAGCTCCCTGCGACGCTT CATCTACCGGCCCCCAGACCGCTCGGCCCCCAGCTGGAACACC ACGGTCAGGCTTCTCCGGCGCTGCCGTGTGCTGTGCAATGGGG GCTCCTCCTCTCACTCCACAGGGGAGGTTGTGGTGACTATGGA GCCAGAGGTACCCATCAAGAAACTGGAGACCATGGTGAAGCTG GACGCGACTTCTGAAGCTGTTGTCAAGGTGGATGTGGCTGAGG AGGAGAAGGAGAAGAAGAAGAAGAAGAAGGGCACTGCGGAGG AGCCAGAGGAGGAGGAGCCAGACGAGAGCATGCTGGACTGGT GGTCCAAGTACTTTGCCTCCATTGACACCATGAAGGAGCAACTT CGACAACAAGAGCCCTCTGGAATTGACTTGGAGGAGAAGGAGG AAGTGGACAATACCGAGGGCCTGAAGGGGTCAATGAAGGGCAA GGAGAAGGCAAGGGCTGCCAAAGAGGAGAAGAAGAAGAAAACT CAGAGCTCTGGCTCTGGCCAGGGGTCCGAGGCCCCCGAGAAG AAGAAACCCAAGATTGATGAGCTTAAGGTATACCCCAAAGAGCT GGAGTCCGAGTTTGATAACTTTGAGGACTGGCTGCACACTTTCA ACTTGCTTCGGGGCAAGACCGGGGATGATGAGGATGGCTCCAC CGAGGAGGAGCGCATTGTGGGACGCTTCAAGGGCTCCCTCTGC GTGTACAAAGTGCCACTCCCAGAGGACGTGTCCCGGGAAGCCG GCTACGACTCCACCTACGGCATGTTCCAGGGCATCCCGAGCAA TGACCCCATCAATGTGCTGGTCCGAGTCTATGTGGTCCGGGCC ACGGACCTGCACCCTGCTGACATCAACGGCAAAGCTGACCCCT ACATCGCCATCCGGCTAGGCAAGACTGACATCCGCGACAAGGA GAACTACATCTCCAAGCAGCTCAACCCTGTCTTTGGGAAGTCCT TTGACATCGAGGCCTCCTTCCCCATGGAATCCATGCTGACGGTG GCTGTGTATGACTGGGACCTGGTGGGCACTGATGACCTCATTG GGGAAACCAAGATCGACCTGGAGAACCGCTTCTACAGCAAGCA CCGCGCCACCTGCGGCATCGCCCAGACCTACTCCACACATGGC TACAATATCTGGCGGGACCCCATGAAGCCCAGCCAGATCCTGA CCCGCCTCTGCAAAGACGGCAAAGTGGACGGCCCCCACTTTGG GCCCCCTGGGAGAGTGAAGGTGGCCAACCGCGTCTTCACTGGG CCCTCTGAGATTGAGGACGAGAACGGTCAGAGGAAGCCCACAG ACGAGCATGTGGCGCTGTTGGCCCTGAGGCACTGGGAGGACAT CCCCCGCGCAGGCTGCCGCCTGGTGCCAGAGCATGTGGAGAC GAGGCCGCTGCTCAACCCCGACAAGCCGGGCATCGAGCAGGG CCGCCTGGAGCTGTGGGTGGACATGTTCCCCATGGACATGCCA GCCCCTGGGACGCCTCTGGACATCTCACCTCGGAAGCCCAAGA AGTACGAGCTGCGGGTCATCATCTGGAACACAGATGAGGTGGT CTTGGAGGACGACGACTTCTTCACAGGGGAGAAGTCCAGTGAC ATCTTCGTGAGGGGGTGGCTGAAGGGCCAGCAGGAGGACAAG CAGGACACAGACGTCCACTACCACTCCCTCACTGGCGAGGGCA ACTTCAACTGGCGCTACCTGTTCCCCTTCGACTACCTGGCGGCG GAGGAGAAGATCGTCATCTCCAAGAAGGAGTCCATGTTCTCCTG GGACGAGACCGAGTACAAGATCCCCGCGCGGCTCACCCTGCAG ATCTGGGATGCGGACCACTTCTCCGCTGACGACTTCCTGGGGG CCATCGAGCTGGACCTGAACCGGTTCCCGCGGGGCGCAAAGAC AGCCAAGCAGTGCACCATGGAGATGGCCACCGGGGAGGTGGA CGTGCCCCTCGTGTCCATCTTCAAGCAAAAGCGCGTCAAAGGCT GGTGGCCCCTCCTGGCCCGCAATGAGAACGATGAGTTTGAGCT CACGGGCAAGGTGGAGGCTGAGCTGCATTTACTGACAGCAGAG GAGGCAGAGAAGAACCCAGTGGGCCTGGCCCGCAATGAACCTG ACCCCCTAGAGAAACCCAACCGGCCCGACACGAGCTTCATCTG GTTCCTGAACCCTCTCAAGTCGGCTCGCTACTTCTTGTGGCACA CGTATCGCTGGCTGCTCCTCAAACTGTTGCTGCTCCTGCTGCTG CTCCTCCTCCTCGCCCTGTTCCTCTACTCTGTGCCTGGCTACCT GGTCAAGAAAATCCTCGGGGCCTGAGCCCAGTGGCCTCCTGGC CGGCCCGACACGGCCTTCGTCTGGTTCCTCAACCCTCTCAAGTC CATCAAGTACCTCATCTGCACCCGGTACAAGTGGCTCATCATCA AGATCGTGCTGGCGCTGTTGGGGCTGCTCATGTTGGGGCTCTT CCTCTACAGCCTCCCTGGCTACATGGTCAAAAAGCTCCTTGGGG CATGAAGGCCGCCAGCTCCCGCCAGCCGCTCCCCAGCCCTGCC GCATTTCCTTTCAGTGGCTTGGACTCTTTCCCATCTCCCCTGGG GAGCCTGAGGAGCCCAGCGTCCACTCTTCATGCCTTGGGCCGA GCCTGCCTCCTGCTTGCGGGGGCCGCCTGTCCTCACTGCCCCA GGCTGCGGCTTGCCCAGTCCCGCCCCTCTGACCCCTGCCTGTG GGCTGGGGAGCCTTGGATGGGGTGGGGACCTGGAATGGGTCT CTCTTGCCCCACCTGGCTGAGGCGCCACCCTTCTTCAGGCCCA GGCTCCAGAGGAAGACTCCTGAAACCCTCCCCAGGTCTTCCAA GTACAGGATTGAAGCTTTAGTGAAATTAACCAAGGACCATGGGT CAGTGCCCAGGGCTTTAAAAAGAATGAACGAGCAAAAGGTATCC CCGCCGTGACCCCTGCAGATAGCACCGGTCTTTGATCCGCAGC AGGGGCCAGACCCTGCCCACAAGTCCCAGCGCGGCTGCTTCTG CCACTGCTGGGCTCCACTTGGCTCCTCTCACTTCCCAGGGGGT CGCCTGTCCTGCCTGTGGGTTTCCATGGCTTCCCAGAGCTCCCT CTGCCCCAGCCAGCGCCTCCAGGCCCAGCTGAGGAGCTGTGA GAAGCAGCAGAGGGGACTCCCCATCCCGGGCACACCCTGTCCT CCCACCCCTGCCCCCTTGCCCTTCCAGCCCTTTCAGCTGCAGCT GGGAGCTGGCCCGTCAAGTGCTGCCCCTGCCTGTGTCTGGGTT TCTGTTGGCTGTTTTTCTTTTCTTGAGTGGTGATTTTTCTCTAAAT AAAAGAAGTCAAGCACTGAAAAAAAAAAAAAAAA 14 OTOF-205 transcript ATCGGAGGGGGGTCGGGAGGAGGAGGAGGAGGCAGCGGCAG (NM_001287489.1), AGAAGAGAGAGGCGTGTGAGCCGTGCTCCACCGGCTAGCTCCT human otoferlin TCCCGCTGCTCCTGCCTGGCAGTGCCAGGCAGCCCACACCAGC transcript variant 5, ATGGCCTTGCTCATCCACCTCAAGACAGTCTCGGAGCTGCGGG 6937 bp, encodes GCAGGGGCGACCGGATCGCCAAAGTGACTTTCCGAGGGCAATC the protein of SEQ CTTCTACTCTCGGGTCCTGGAGAACTGTGAGGATGTGGCTGACT ID NO: 5 TTGATGAGACATTTCGGTGGCCGGTGGCCAGCAGCATCGACAG AAATGAGATGCTGGAGATTCAGGTTTTCAACTACAGCAAAGTCTT CAGCAACAAGCTCATCGGGACCTTCCGCATGGTGCTGCAGAAG GTGGTAGAGGAGAGCCATGTGGAGGTGACTGACACGCTGATTG ATGACAACAATGCTATCATCAAGACCAGCCTGTGCGTGGAGGTC CGGTATCAGGCCACTGACGGCACAGTGGGCTCCTGGGACGATG GGGACTTCCTGGGAGATGAGTCTCTTCAAGAGGAAGAGAAGGA CAGCCAAGAGACGGATGGACTGCTCCCAGGCTCCCGGCCCAGC TCCCGGCCCCCAGGAGAGAAGAGCTTCCGGAGAGCCGGGAGG AGCGTGTTCTCCGCCATGAAGCTCGGCAAAAACCGGTCTCACAA GGAGGAGCCCCAAAGACCAGATGAACCGGCGGTGCTGGAGAT GGAAGACCTTGACCATCTGGCCATTCGGCTAGGAGATGGACTG GATCCCGACTCGGTGTCTCTAGCCTCAGTCACAGCTCTCACCAC TAATGTCTCCAACAAGCGATCTAAGCCAGACATTAAGATGGAGC CAAGTGCTGGGCGGCCCATGGATTACCAGGTCAGCATCACGGT GATCGAGGCCCGGCAGCTGGTGGGCTTGAACATGGACCCTGTG GTGTGCGTGGAGGTGGGTGACGACAAGAAGTACACATCCATGA AGGAGTCCACTAACTGCCCCTATTACAACGAGTACTTCGTCTTC GACTTCCATGTCTCTCCGGATGTCATGTTTGACAAGATCATCAAG ATTTCGGTGATTCACTCCAAGAACCTGCTGCGCAGTGGCACCCT GGTGGGCTCCTTCAAAATGGACGTGGGAACCGTGTACTCGCAG CCAGAGCACCAGTTCCATCACAAGTGGGCCATCCTGTCTGACCC CGATGACATCTCCTCGGGGCTGAAGGGCTACGTGAAGTGTGAC GTTGCCGTGGTGGGCAAAGGGGACAACATCAAGACGCCCCACA AGGCCAATGAGACCGACGAAGATGACATTGAGGGGAACTTGCT GCTCCCCGAGGGGGTGCCCCCCGAACGCCAGTGGGCCCGGTT CTATGTGAAAATTTACCGAGCAGAGGGGCTGCCCCGTATGAACA CAAGCCTCATGGCCAATGTAAAGAAGGCTTTCATCGGTGAAAAC AAGGACCTCGTGGACCCCTACGTGCAAGTCTTCTTTGCTGGCCA GAAGGGCAAGACTTCAGTGCAGAAGAGCAGCTATGAGCCCCTG TGGAATGAGCAGGTCGTCTTTACAGACCTCTTCCCCCCACTCTG CAAACGCATGAAGGTGCAGATCCGAGACTCGGACAAGGTCAAC GACGTGGCCATCGGCACCCACTTCATTGACCTGCGCAAGATTTC TAATGACGGAGACAAAGGCTTCCTGCCCACACTGGGCCCAGCC TGGGTGAACATGTACGGCTCCACACGTAACTACACGCTGCTGGA TGAGCATCAGGACCTGAACGAGGGCCTGGGGGAGGGTGTGTC CTTCCGGGCCCGGCTCCTGCTGGGCCTGGCTGTGGAGATCGTA GACACCTCCAACCCTGAGCTCACCAGCTCCACAGAGGTGCAGG TGGAGCAGGCCACGCCCATCTCGGAGAGCTGTGCAGGTAAAAT GGAAGAATTCTTTCTCTTTGGAGCCTTCCTGGAGGCCTCAATGA TCGACCGGAGAAACGGAGACAAGCCCATCACCTTTGAGGTCAC CATAGGCAACTATGGGAACGAAGTTGATGGCCTGTCCCGGCCC CAGCGGCCTCGGCCCCGGAAGGAGCCGGGGGATGAGGAAGAA GTAGACCTGATTCAGAACGCAAGTGATGACGAGGCCGGTGATG CCGGGGACCTGGCCTCAGTCTCCTCCACTCCACCAATGCGGCC CCAGGTCACCGACAGGAACTACTTCCATCTGCCCTACCTGGAGC GAAAGCCCTGCATCTACATCAAGAGCTGGTGGCCGGACCAGCG CCGCCGCCTCTACAATGCCAACATCATGGACCACATTGCCGACA AGCTGGAAGAAGGCCTGAACGACATACAGGAGATGATCAAAAC GGAGAAGTCCTACCCTGAGCGTCGCCTGCGGGGCGTCCTGGA GGAGCTGAGCTGTGGCTGCTGCCGCTTCCTCTCCCTCGCTGAC AAGGACCAGGGCCACTCATCCCGCACCAGGCTTGACCGGGAGC GCCTCAAGTCCTGCATGAGGGAGCTGGAAAACATGGGGCAGCA GGCCAGGATGCTGCGGGCCCAGGTGAAGCGGCACACGGTGCG GGACAAGCTGAGGCTGTGCCAGAACTTCCTGCAGAAGCTGCGC TTCCTGGCGGACGAGCCCCAGCACAGCATTCCCGACATCTTCAT CTGGATGATGAGCAACAACAAGCGTGTCGCCTATGCCCGTGTG CCCTCCAAGGACCTGCTCTTCTCCATCGTGGAGGAGGAGACTG GCAAGGACTGCGCCAAGGTCAAGACGCTCTTCCTTAAGCTGCC AGGGAAGCGGGGCTTCGGCTCGGCAGGCTGGACAGTGCAGGC CAAGGTGGAGCTGTACCTGTGGCTGGGCCTCAGCAAACAGCGC AAGGAGTTCCTGTGCGGCCTGCCCTGTGGCTTCCAGGAGGTCA AGGCAGCCCAGGGCCTGGGCCTGCATGCCTTCCCACCCGTCAG CCTGGTCTACACCAAGAAGCAGGCGTTCCAGCTCCGAGCGCAC ATGTACCAGGCCCGCAGCCTCTTTGCCGCCGACAGCAGCGGAC TCTCAGACCCCTTTGCCCGCGTCTTCTTCATCAATCAGAGTCAG TGCACAGAGGTGCTGAATGAGACCCTGTGTCCCACCTGGGACC AGATGCTGGTGTTCGACAACCTGGAGCTCTATGGTGAAGCTCAT GAGCTGAGGGACGATCCGCCCATCATTGTCATTGAAATCTATGA CCAGGATTCCATGGGCAAAGCTGACTTCATGGGCCGGACCTTC GCCAAACCCCTGGTGAAGATGGCAGACGAGGCGTACTGCCCAC CCCGCTTCCCACCTCAGCTCGAGTACTACCAGATCTACCGTGGC AACGCCACAGCTGGAGACCTGCTGGCGGCCTTCGAGCTGCTGC AGATTGGACCAGCAGGGAAGGCTGACCTGCCCCCCATCAATGG CCCGGTGGACGTGGACCGAGGTCCCATCATGCCCGTGCCCATG GGCATCCGGCCCGTGCTCAGCAAGTACCGAGTGGAGGTGCTGT TCTGGGGCCTACGGGACCTAAAGCGGGTGAACCTGGCCCAGGT GGACCGGCCACGGGTGGACATCGAGTGTGCAGGGAAGGGGGT GCAGTCGTCCCTGATCCACAATTATAAGAAGAACCCCAACTTCA ACACCCTCGTCAAGTGGTTTGAAGTGGACCTCCCAGAGAACGA GCTGCTGCACCCGCCCTTGAACATCCGTGTGGTGGACTGCCGG GCCTTCGGTCGCTACACACTGGTGGGCTCCCATGCCGTCAGCT CCCTGCGACGCTTCATCTACCGGCCCCCAGACCGCTCGGCCCC CAGCTGGAACACCACGGTCAGGCTTCTCCGGCGCTGCCGTGTG CTGTGCAATGGGGGCTCCTCCTCTCACTCCACAGGGGAGGTTG TGGTGACTATGGAGCCAGAGGTACCCATCAAGAAACTGGAGAC CATGGTGAAGCTGGACGCGACTTCTGAAGCTGTTGTCAAGGTG GATGTGGCTGAGGAGGAGAAGGAGAAGAAGAAGAAGAAGAAGG GCACTGCGGAGGAGCCAGAGGAGGAGGAGCCAGACGAGAGCA TGCTGGACTGGTGGTCCAAGTACTTTGCCTCCATTGACACCATG AAGGAGCAACTTCGACAACAAGAGCCCTCTGGAATTGACTTGGA GGAGAAGGAGGAAGTGGACAATACCGAGGGCCTGAAGGGGTC AATGAAGGGCAAGGAGAAGGCAAGGGCTGCCAAAGAGGAGAAG AAGAAGAAAACTCAGAGCTCTGGCTCTGGCCAGGGGTCCGAGG CCCCCGAGAAGAAGAAACCCAAGATTGATGAGCTTAAGGTATAC CCCAAAGAGCTGGAGTCCGAGTTTGATAACTTTGAGGACTGGCT GCACACTTTCAACTTGCTTCGGGGCAAGACCGGGGATGATGAG GATGGCTCCACCGAGGAGGAGCGCATTGTGGGACGCTTCAAGG GCTCCCTCTGCGTGTACAAAGTGCCACTCCCAGAGGACGTGTC CCGGGAAGCCGGCTACGACTCCACCTACGGCATGTTCCAGGGC ATCCCGAGCAATGACCCCATCAATGTGCTGGTCCGAGTCTATGT GGTCCGGGCCACGGACCTGCACCCTGCTGACATCAACGGCAAA GCTGACCCCTACATCGCCATCCGGCTAGGCAAGACTGACATCC GCGACAAGGAGAACTACATCTCCAAGCAGCTCAACCCTGTCTTT GGGAAGTCCTTTGACATCGAGGCCTCCTTCCCCATGGAATCCAT GCTGACGGTGGCTGTGTATGACTGGGACCTGGTGGGCACTGAT GACCTCATTGGGGAAACCAAGATCGACCTGGAGAACCGCTTCTA CAGCAAGCACCGCGCCACCTGCGGCATCGCCCAGACCTACTCC ACACATGGCTACAATATCTGGCGGGACCCCATGAAGCCCAGCC AGATCCTGACCCGCCTCTGCAAAGACGGCAAAGTGGACGGCCC CCACTTTGGGCCCCCTGGGAGAGTGAAGGTGGCCAACCGCGTC TTCACTGGGCCCTCTGAGATTGAGGACGAGAACGGTCAGAGGA AGCCCACAGACGAGCATGTGGCGCTGTTGGCCCTGAGGCACTG GGAGGACATCCCCCGCGCAGGCTGCCGCCTGGTGCCAGAGCA TGTGGAGACGAGGCCGCTGCTCAACCCCGACAAGCCGGGCATC GAGCAGGGCCGCCTGGAGCTGTGGGTGGACATGTTCCCCATGG ACATGCCAGCCCCTGGGACGCCTCTGGACATCTCACCTCGGAA GCCCAAGAAGTACGAGCTGCGGGTCATCATCTGGAACACAGAT GAGGTGGTCTTGGAGGACGACGACTTCTTCACAGGGGAGAAGT CCAGTGACATCTTCGTGAGGGGGTGGCTGAAGGGCCAGCAGGA GGACAAGCAGGACACAGACGTCCACTACCACTCCCTCACTGGC GAGGGCAACTTCAACTGGCGCTACCTGTTCCCCTTCGACTACCT GGCGGCGGAGGAGAAGATCGTCATCTCCAAGAAGGAGTCCATG TTCTCCTGGGACGAGACCGAGTACAAGATCCCCGCGCGGCTCA CCCTGCAGATCTGGGATGCGGACCACTTCTCCGCTGACGACTT CCTGGGGGCCATCGAGCTGGACCTGAACCGGTTCCCGCGGGG CGCAAAGACAGCCAAGCAGTGCACCATGGAGATGGCCACCGGG GAGGTGGACGTGCCCCTCGTGTCCATCTTCAAGCAAAAGCGCG TCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGAACGATGA GTTTGAGCTCACGGGCAAGGTGGAGGCTGAGCTGCATTTACTG ACAGCAGAGGAGGCAGAGAAGAACCCAGTGGGCCTGGCCCGC AATGAACCTGACCCCCTAGAGAAACCCAACCGGCCCGACACGG CCTTCGTCTGGTTCCTCAACCCTCTCAAGTCCATCAAGTACCTCA TCTGCACCCGGTACAAGTGGCTCATCATCAAGATCGTGCTGGCG CTGTTGGGGCTGCTCATGTTGGGGCTCTTCCTCTACAGCCTCCC TGGCTACATGGTCAAAAAGCTCCTTGGGGCATGAAGGCCGCCA GCTCCCGCCAGCCGCTCCCCAGCCCTGCCGCATTTCCTTTCAG TGGCTTGGACTCTTTCCCATCTCCCCTGGGGAGCCTGAGGAGC CCAGCGTCCACTCTTCATGCCTTGGGCCGAGCCTGCCTCCTGC TTGCGGGGGCCGCCTGTCCTCACTGCCCCAGGCTGCGGCTTGC CCAGTCCCGCCCCTCTGACCCCTGCCTGTGGGCTGGGGAGCCT TGGATGGGGTGGGGACCTGGAATGGGTCTCTCTTGCCCCACCT GGCTGAGGCGCCACCCTTCTTCAGGCCCAGGCTCCAGAGGAAG ACTCCTGAAACCCTCCCCAGGTCTTCCAAGTACAGGATTGAAGC TTTAGTGAAATTAACCAAGGACCATGGGTCAGTGCCCAGGGCTT TAAAAAGAATGAACGAGCAAAAGGTATCCCCGCCGTGACCCCTG CAGATAGCACCGGTCTTTGATCCGCAGCAGGGGCCAGACCCTG CCCACAAGTCCCAGCGCGGCTGCTTCTGCCACTGCTGGGCTCC ACTTGGCTCCTCTCACTTCCCAGGGGGTCGCCTGTCCTGCCTGT GGGTTTCCATGGCTTCCCAGAGCTCCCTCTGCCCCAGCCAGCG CCTCCAGGCCCAGCTGAGGAGCTGTGAGAAGCAGCAGAGGGG ACTCCCCATCCCGGGCACACCCTGTCCTCCCACCCCTGCCCCC TTGCCCTTCCAGCCCTTTCAGCTGCAGCTGGGAGCTGGCCCGT CAAGTGCTGCCCCTGCCTGTGTCTGGGTTTCTGTTGGCTGTTTT TCTTTTCTTGAGTGGTGATTTTTCTCTAAATAAAAGAAGTCAAGC ACTGAAAAAAAAAAAAAAAA 15 mOTOF-201_1 TTGGTTGCCTTGGTCTCTGTGGGCAGCAGCAGGAGGAGGCGGC transcript AGCAGCCAGAGAAGAGGGAGGCGTGTGAGCCACACTCCACCAG (NM_031875.2), CGAGCTTCTTCCCGCTGCTCTGGAACTGCCCAGGCTCTCCCCA mouse otoferlin CCAGCATGGCCCTGATTGTTCACCTCAAGACTGTCTCAGAGCTC transcript variant 2, CGAGGCAAAGGTGACCGGATTGCCAAAGTCACTTTCCGAGGGC 7129 bp, encodes AGTCTTTCTACTCCCGGGTCCTGGAGAACTGCGAGGGTGTGGC the protein of SEQ TGACTTTGATGAGACGTTCCGGTGGCCAGTGGCCAGCAGCATC ID NO: 6 GACCGGAATGAAGTGTTGGAGATTCAGATTTTCAACTACAGCAA AGTCTTCAGCAACAAGCTGATAGGGACCTTCTGCATGGTGCTGC AGAAAGTGGTGGAGGAGAATCGGGTAGAGGTGACCGACACGCT GATGGATGACAGCAATGCTATCATCAAGACCAGCCTGAGCATGG AGGTCCGGTATCAGGCCACAGATGGCACTGTGGGCCCCTGGGA TGATGGAGACTTCCTGGGAGATGAATCCCTCCAGGAGGAGAAG GACAGCCAGGAGACAGATGGGCTGCTACCTGGTTCCCGACCCA GCACCCGGATATCTGGCGAGAAGAGCTTTCGCAGAGCGGGAAG GAGTGTGTTCTCGGCCATGAAACTCGGCAAAACTCGGTCCCACA AAGAGGAGCCCCAAAGACAAGATGAGCCAGCAGTGCTGGAGAT GGAGGACCTGGACCACCTAGCCATTCAGCTGGGGGATGGGCTG GATCCTGACTCCGTGTCTCTAGCCTCGGTCACCGCTCTCACCAG CAATGTCTCCAACAAACGGTCTAAGCCAGATATTAAGATGGAGC CCAGTGCTGGAAGGCCCATGGATTACCAGGTCAGCATCACAGT GATTGAGGCTCGGCAGCTGGTGGGCTTGAACATGGACCCTGTG GTGTGTGTGGAGGTGGGTGATGACAAGAAATACACGTCAATGAA GGAGTCCACAAACTGCCCTTACTACAACGAGTACTTTGTCTTCG ACTTCCATGTCTCTCCTGATGTCATGTTTGACAAGATCATCAAGA TCTCGGTTATCCATTCTAAGAACCTGCTTCGGAGCGGCACCCTG GTGGGTTCCTTCAAAATGGATGTGGGGACTGTGTATTCCCAGCC TGAACACCAGTTCCATCACAAATGGGCCATCCTGTCAGACCCCG ATGACATCTCTGCTGGGTTGAAGGGTTATGTAAAGTGTGATGTC GCTGTGGTGGGCAAGGGAGACAACATCAAGACACCCCACAAGG CCAACGAGACGGATGAGGACGACATTGAAGGGAACTTGCTGCT CCCCGAGGGCGTGCCCCCCGAACGGCAGTGGGCACGGTTCTA TGTGAAAATTTACCGAGCAGAGGGACTGCCCCGGATGAACACAA GCCTCATGGCCAACGTGAAGAAGGCGTTCATCGGTGAGAACAA GGACCTCGTCGACCCCTATGTGCAAGTCTTCTTTGCTGGACAAA AGGGCAAAACATCAGTGCAGAAGAGCAGCTATGAGCCGCTATG GAATGAGCAGGTCGTCTTCACAGACTTGTTCCCCCCACTCTGCA AACGCATGAAGGTGCAGATCCGGGACTCTGACAAGGTCAATGAT GTGGCCATCGGCACCCACTTCATCGACCTGCGCAAGATTTCCAA CGATGGAGACAAAGGCTTCCTGCCTACCCTCGGTCCAGCCTGG GTGAACATGTACGGCTCCACGCGCAACTACACACTGCTGGACG AGCACCAGGACTTGAATGAAGGCCTGGGGGAGGGTGTGTCCTT CCGGGCCCGCCTCATGTTGGGACTAGCTGTGGAGATCCTGGAC ACCTCCAACCCAGAGCTCACCAGCTCCACGGAGGTGCAGGTGG AGCAGGCCACGCCTGTCTCGGAGAGCTGCACAGGGAGAATGGA AGAATTTTTTCTATTTGGAGCCTTCTTGGAAGCCTCAATGATTGA CCGGAAAAATGGGGACAAGCCAATTACCTTTGAGGTGACCATAG GAAACTACGGCAATGAAGTCGATGGTATGTCCCGGCCCCTGAG GCCTCGGCCCCGGAAAGAGCCTGGGGATGAAGAAGAGGTAGA CCTGATTCAGAACTCCAGTGACGATGAAGGTGACGAAGCCGGG GACCTGGCCTCGGTGTCCTCCACCCCACCTATGCGGCCCCAGA TCACGGACAGGAACTATTTCCACCTGCCCTACCTGGAGCGCAAG CCCTGCATCTATATCAAGAGCTGGTGGCCTGACCAGAGGCGGC GCCTCTACAATGCCAACATCATGGATCACATTGCTGACAAGCTG GAAGAAGGCCTGAATGATGTACAGGAGATGATCAAAACGGAGAA GTCCTACCCGGAGCGCCGCCTGCGGGGTGTGCTAGAGGAACTC AGCTGTGGCTGCCACCGCTTCCTCTCCCTCTCGGACAAGGACC AGGGCCGCTCGTCCCGCACCAGGCTGGATCGAGAGCGTCTTAA GTCCTGTATGAGGGAGTTGGAGAGCATGGGACAGCAGGCCAAG AGCCTGAGGGCTCAGGTGAAGCGGCACACTGTTCGGGACAAGC TGAGGTCATGCCAGAACTTTCTGCAGAAGCTACGCTTCCTGGCG GATGAGCCCCAGCACAGCATTCCTGATGTGTTCATTTGGATGAT GAGCAACAACAAACGTATCGCCTATGCCCGCGTGCCTTCCAAAG ACCTGCTCTTCTCCATCGTGGAGGAGGAACTGGGCAAGGACTG CGCCAAAGTCAAGACCCTCTTCCTGAAGCTGCCAGGGAAGAGG GGCTTCGGCTCGGCAGGCTGGACAGTACAGGCCAAGCTGGAG CTCTACCTGTGGCTGGGCCTCAGCAAGCAGCGAAAGGACTTCC TGTGTGGTCTGCCCTGTGGCTTCGAGGAGGTCAAGGCAGCCCA AGGCCTGGGCCTGCATTCCTTTCCGCCCATCAGCCTAGTCTACA CCAAGAAGCAAGCCTTCCAGCTCCGAGCACACATGTATCAGGC CCGAAGCCTCTTTGCTGCTGACAGCAGTGGGCTCTCTGATCCCT TTGCCCGTGTCTTCTTCATCAACCAGAGCCAATGCACTGAGGTT CTAAACGAGACACTGTGTCCCACCTGGGACCAGATGCTGGTATT TGACAACCTGGAGCTGTACGGTGAAGCTCACGAGTTACGAGAT GATCCCCCCATCATTGTCATTGAAATCTACGACCAGGACAGCAT GGGCAAAGCCGACTTCATGGGCCGGACCTTCGCCAAGCCCCTG GTGAAGATGGCAGATGAAGCATACTGCCCACCTCGCTTCCCGC CGCAGCTTGAGTACTACCAGATCTACCGAGGCAGTGCCACTGC CGGAGACCTACTGGCTGCCTTCGAGCTGCTGCAGATTGGGCCA TCAGGGAAGGCTGACCTGCCACCCATCAATGGCCCAGTGGACA TGGACAGAGGGCCCATCATGCCTGTGCCCGTGGGAATCCGGCC AGTGCTCAGCAAGTACCGAGTGGAGGTGCTGTTCTGGGGCCTG AGGGACCTAAAGAGGGTGAACCTGGCCCAGGTGGACCGACCAC GGGTGGACATCGAGTGTGCAGGAAAGGGGGTACAATCCTCCCT GATTCACAATTATAAGAAGAACCCCAACTTCAACACGCTGGTCAA GTGGTTTGAAGTGGACCTCCCGGAGAATGAGCTCCTGCACCCA CCCTTGAACATCCGAGTGGTAGATTGCCGGGCCTTTGGACGATA CACCCTGGTGGGTTCCCACGCAGTCAGCTCACTGAGGCGCTTC ATCTACCGACCTCCAGACCGCTCAGCCCCCAACTGGAACACCA CAGTCAGGCTGCTCCGGGGCTGCCACAGGCTGCGCAATGGGG GCCCCTCTTCTCGCCCCACAGGGGAGGTTGTAGTAAGCATGGA GCCTGAGGAGCCAGTTAAGAAGCTGGAGACCATGGTGAAACTG GATGCGACTTCTGATGCTGTGGTCAAGGTGGATGTGGCTGAAG ATGAGAAGGAAAGGAAGAAGAAGAAAAAGAAAGGCCCGTCAGA GGAGCCAGAGGAGGAAGAGCCCGATGAGAGCATGCTGGATTG GTGGTCCAAGTACTTCGCCTCCATCGACACAATGAAGGAGCAAC TTCGACAACATGAGACCTCTGGAACTGACTTGGAAGAGAAG GAA GAGATGGAAAGCGCTGAGGGCCTGAAGGGACCAATGAAGAGCA AGGAGAAGTCCAGAGCTGCAAAGGAGGAGAAAAAGAAGAAAAA CCAGAGCCCTGGCCCTGGCCAGGGATCGGAGGCTCCTGAGAA GAAGAAAGCCAAGATCGATGAGCTTAAGGTGTACCCCAAGGAG CTGGAATCGGAGTTTGACAGCTTTGAGGACTGGCTGCACACCTT CAACCTGTTGAGGGGCAAGACGGGAGATGATGAGGATGGCTCC ACAGAGGAGGAGCGCATAGTAGGCCGATTCAAGGGCTCCCTCT GTGTGTACAAAGTGCCACTCCCAGAAGATGTATCTCGAGAAGCT GGCTATGATCCCACCTATGGAATGTTCCAGGGCATCCCAAGCAA TGACCCCATCAATGTGCTGGTCCGAATCTATGTGGTCCGGGCCA CAGACCTGCACCCGGCCGACATCAATGGCAAAGCTGACCCCTA TATTGCCATCAAGTTAGGCAAGACCGACATCCGAGACAAGGAGA ACTACATCTCCAAGCAGCTCAACCCTGTGTTTGGGAAGTCCTTT GACATTGAGGCCTCCTTCCCCATGGAGTCCATGTTGACAGTGGC CGTGTACGACTGGGATCTGGTGGGCACTGATGACCTCATCGGA GAAACCAAGATTGACCTGGAAAACCGCTTCTACAGCAAGCATCG CGCCACCTGCGGCATCGCACAGACCTATTCCATACATGGCTACA ATATCTGGAGGGACCCCATGAAGCCCAGCCAGATCCTGACACG CCTCTGTAAAGAGGGCAAAGTGGACGGCCCCCACTTTGGTCCC CATGGGAGAGTGAGGGTTGCCAACCGTGTCTTCACGGGGCCTT CAGAAATAGAGGATGAGAATGGTCAGAGGAAGCCCACAGATGA GCACGTGGCACTGTCTGCTCTGAGACACTGGGAGGACATCCCC CGGGTGGGCTGCCGCCTTGTGCCGGAACACGTGGAGACCAGG CCGCTGCTCAACCCTGACAAGCCAGGCATTGAGCAGGGCCGCC TGGAGCTGTGGGTGGACATGTTCCCCATGGACATGCCAGCCCC TGGGACACCTCTGGATATATCCCCCAGGAAACCCAAGAAGTACG AGCTGCGGGTCATCGTGTGGAACACAGACGAGGTGGTCCTGGA AGACGATGATTTCTTCACGGGAGAGAAGTCCAGTGACATTTTTG TGAGGGGGTGGCTGAAGGGCCAGCAGGAGGACAAACAGGACA CAGATGTCCACTATCACTCCCTCACGGGGGAGGGCAACTTCAAC TGGAGATACCTCTTCCCCTTCGACTACCTAGCGGCCGAAGAGAA GATCGTTATGTCCAAAAAGGAGTCTATGTTCTCCTGGGATGAGA CGGAGTACAAGATCCCTGCGCGGCTCACCCTGCAGATCTGGGA CGCTGACCACTTCTCGGCTGACGACTTCCTGGGGGCTATCGAG CTGGACCTGAACCGGTTCCCGAGGGGCGCTAAGACAGCCAAGC AGTGCACCATGGAGATGGCCACCGGGGAGGTGGACGTACCCCT GGTTTCCATCTTTAAACAGAAACGTGTCAAAGGCTGGTGGCCCC TCCTGGCCCGCAATGAGAATGATGAGTTTGAGCTCACAGGCAAA GTGGAGGCGGAGCTACACCTACTCACGGCAGAGGAGGCAGAG AAGAACCCTGTGGGCCTGGCTCGCAATGAACCTGATCCCCTAG AAAAACCCAATCGGCCGGACACAAGCTTCATCTGGTTCTTGAAC CCTCTCAAGTCTGCCCGCTACTTCCTGTGGCATACCTACCGCTG GCTACTCCTCAAATTCCTGCTGCTCTTCCTCCTGCTGCTGCTCTT CGCCCTGTTTCTCTACTCTCTGCCTGGCTACCTGGCCAAGAAGA TCCTTGGGGCCTGAGCCCTGCAGTCGCCTAGGCCTGCCGGCCT GACACGGCATTCGTCTGGTTCCTGAACCCACTCAAATCTATCAA GTACCTCATCTGCACCCGGTACAAGTGGCTGATCATCAAGATCG TGCTGGCGCTGCTGGGGCTGCTCATGCTGGCCCTCTTCCTTTAC AGCCTCCCAGGCTACATGGTCAAGAAGCTCCTAGGGGCCTGAA GTGTGCCCCACCCCAGCCCGCTCCAGCATCCCTCCAGGGGCTG CTGCGTATTTTGCCTTCCCTCACCTGGACTCTCTCCCAACTCCCT GAGGAGCCCTCCCACGCCTGCCAGCCTTGAGCAAGACACCTGC TTGCTGGACTTCATCCCCACCCCACACCCAAACTGTTGCTTGCC TGATCTTGTCCCAGGCCTGCCTGGGGTTTGGGGCACAGTTGGC CTCCAAAACCAGATACCCTCTTGTCTAAAGTACCAGGTTCCTCTG CCCAACCCCAAGAGTGGTAGTGGCCCAACCCTCCCTGTGCTTTC CAAATCTTGTCTTAAGGCACCAGTGAAATTAACCAAGAAACGCG GAGCAATGCCCAAGGCTCTGATGAGTAGGAACACGTGGAAAGC ACCAGGAATGCCAGCAGAGGCGAGGCGGCACACCTCTCTGCAG AGCATCCAGGCCGAGCGGCGGGCAGCGGCCAGCTGCTTCTGC GCATGCTCTCCTCTTGGCTCTGCTTCTTTCTCACAGTCACAGTCA CTTCACAGCTTAGCCTTGGGCTTCCCATCACTTCCAGGGGTGCC TCTGCCTTGGCCAGTGTGTGTCAGCTAGTACACAAGCTCCAAGT GTGAATCAGGTGTACTGGCCGTCCTGAAGACTGACTGCCCTGTC CTTCCTGCCGACAGCCACACCCGAGTGTACACTTAAAGCGGTG CCCTTCTGCCTCTGTGGGCCTGCTGGCTGCTGTTCCTTTCTTGA GTGTGATTTTTTTTTTCTCTCCCTCAATAAAATAAATCAAACTCTG AGAC 16 mOTOF-201 2 TTGGTTGCCTTGGTCTCTGTGGGCAGCAGCAGGAGGAGGCGGC transcript AGCAGCCAGAGAAGAGGGAGGCGTGTGAGCCACACTCCACCAG (NM_001286421.1), CGAGCTTCTTCCCGCTGCTCTGGAACTGCCCAGGCTCTCCCCA mouse otoferlin CCAGCATGGCCCTGATTGTTCACCTCAAGACTGTCTCAGAGCTC transcript variant 3, CGAGGCAAAGGTGACCGGATTGCCAAAGTCACTTTCCGAGGGC 7129 bp, encodes AGTCTTTCTACTCCCGGGTCCTGGAGAACTGCGAGGGTGTGGC the protein of SEQ TGACTTTGATGAGACGTTCCGGTGGCCAGTGGCCAGCAGCATC ID NO: 7 GACCGGAATGAAGTGTTGGAGATTCAGATTTTCAACTACAGCAA AGTCTTCAGCAACAAGCTGATAGGGACCTTCTGCATGGTGCTGC AGAAAGTGGTGGAGGAGAATCGGGTAGAGGTGACCGACACGCT GATGGATGACAGCAATGCTATCATCAAGACCAGCCTGAGCATGG AGGTCCGGTATCAGGCCACAGATGGCACTGTGGGCCCCTGGGA TGATGGAGACTTCCTGGGAGATGAATCCCTCCAGGAGGAGAAG GACAGCCAGGAGACAGATGGGCTGCTACCTGGTTCCCGACCCA GCACCCGGATATCTGGCGAGAAGAGCTTTCGCAGAGCGGGAAG GAGTGTGTTCTCGGCCATGAAACTCGGCAAAACTCGGTCCCACA AAGAGGAGCCCCAAAGACAAGATGAGCCAGCAGTGCTGGAGAT GGAGGACCTGGACCACCTAGCCATTCAGCTGGGGGATGGGCTG GATCCTGACTCCGTGTCTCTAGCCTCGGTCACCGCTCTCACCAG CAATGTCTCCAACAAACGGTCTAAGCCAGATATTAAGATGGAGC CCAGTGCTGGAAGGCCCATGGATTACCAGGTCAGCATCACAGT GATTGAGGCTCGGCAGCTGGTGGGCTTGAACATGGACCCTGTG GTGTGTGTGGAGGTGGGTGATGACAAGAAATACACGTCAATGAA GGAGTCCACAAACTGCCCTTACTACAACGAGTACTTTGTCTTCG ACTTCCATGTCTCTCCTGATGTCATGTTTGACAAGATCATCAAGA TCTCGGTTATCCATTCTAAGAACCTGCTTCGGAGCGGCACCCTG GTGGGTTCCTTCAAAATGGATGTGGGGACTGTGTATTCCCAGCC TGAACACCAGTTCCATCACAAATGGGCCATCCTGTCAGACCCCG ATGACATCTCTGCTGGGTTGAAGGGTTATGTAAAGTGTGATGTC GCTGTGGTGGGCAAGGGAGACAACATCAAGACACCCCACAAGG CCAACGAGACGGATGAGGACGACATTGAAGGGAACTTGCTGCT CCCCGAGGGCGTGCCCCCCGAACGGCAGTGGGCACGGTTCTA TGTGAAAATTTACCGAGCAGAGGGACTGCCCCGGATGAACACAA GCCTCATGGCCAACGTGAAGAAGGCGTTCATCGGTGAGAACAA GGACCTCGTCGACCCCTATGTGCAAGTCTTCTTTGCTGGACAAA AGGGCAAAACATCAGTGCAGAAGAGCAGCTATGAGCCGCTATG GAATGAGCAGGTCGTCTTCACAGACTTGTTCCCCCCACTCTGCA AACGCATGAAGGTGCAGATCCGGGACTCTGACAAGGTCAATGAT GTGGCCATCGGCACCCACTTCATCGACCTGCGCAAGATTTCCAA CGATGGAGACAAAGGCTTCCTGCCTACCCTCGGTCCAGCCTGG GTGAACATGTACGGCTCCACGCGCAACTACACACTGCTGGACG AGCACCAGGACTTGAATGAAGGCCTGGGGGAGGGTGTGTCCTT CCGGGCCCGCCTCATGTTGGGACTAGCTGTGGAGATCCTGGAC ACCTCCAACCCAGAGCTCACCAGCTCCACGGAGGTGCAGGTGG AGCAGGCCACGCCTGTCTCGGAGAGCTGCACAGGGAGAATGGA AGAATTTTTTCTATTTGGAGCCTTCTTGGAAGCCTCAATGATTGA CCGGAAAAATGGGGACAAGCCAATTACCTTTGAGGTGACCATAG GAAACTACGGCAATGAAGTCGATGGTATGTCCCGGCCCCTGAG GCCTCGGCCCCGGAAAGAGCCTGGGGATGAAGAAGAGGTAGA CCTGATTCAGAACTCCAGTGACGATGAAGGTGACGAAGCCGGG GACCTGGCCTCGGTGTCCTCCACCCCACCTATGCGGCCCCAGA TCACGGACAGGAACTATTTCCACCTGCCCTACCTGGAGCGCAAG CCCTGCATCTATATCAAGAGCTGGTGGCCTGACCAGAGGCGGC GCCTCTACAATGCCAACATCATGGATCACATTGCTGACAAGCTG GAAGAAGGCCTGAATGATGTACAGGAGATGATCAAAACGGAGAA GTCCTACCCGGAGCGCCGCCTGCGGGGTGTGCTAGAGGAACTC AGCTGTGGCTGCCACCGCTTCCTCTCCCTCTCGGACAAGGACC AGGGCCGCTCGTCCCGCACCAGGCTGGATCGAGAGCGTCTTAA GTCCTGTATGAGGGAGTTGGAGAGCATGGGACAGCAGGCCAAG AGCCTGAGGGCTCAGGTGAAGCGGCACACTGTTCGGGACAAGC TGAGGTCATGCCAGAACTTTCTGCAGAAGCTACGCTTCCTGGCG GATGAGCCCCAGCACAGCATTCCTGATGTGTTCATTTGGATGAT GAGCAACAACAAACGTATCGCCTATGCCCGCGTGCCTTCCAAAG ACCTGCTCTTCTCCATCGTGGAGGAGGAACTGGGCAAGGACTG CGCCAAAGTCAAGACCCTCTTCCTGAAGCTGCCAGGGAAGAGG GGCTTCGGCTCGGCAGGCTGGACAGTACAGGCCAAGCTGGAG CTCTACCTGTGGCTGGGCCTCAGCAAGCAGCGAAAGGACTTCC TGTGTGGTCTGCCCTGTGGCTTCGAGGAGGTCAAGGCAGCCCA AGGCCTGGGCCTGCATTCCTTTCCGCCCATCAGCCTAGTCTACA CCAAGAAGCAAGCCTTCCAGCTCCGAGCACACATGTATCAGGC CCGAAGCCTCTTTGCTGCTGACAGCAGTGGGCTCTCTGATCCCT TTGCCCGTGTCTTCTTCATCAACCAGAGCCAATGCACTGAGGTT CTAAACGAGACACTGTGTCCCACCTGGGACCAGATGCTGGTATT TGACAACCTGGAGCTGTACGGTGAAGCTCACGAGTTACGAGAT GATCCCCCCATCATTGTCATTGAAATCTACGACCAGGACAGCAT GGGCAAAGCCGACTTCATGGGCCGGACCTTCGCCAAGCCCCTG GTGAAGATGGCAGATGAAGCATACTGCCCACCTCGCTTCCCGC CGCAGCTTGAGTACTACCAGATCTACCGAGGCAGTGCCACTGC CGGAGACCTACTGGCTGCCTTCGAGCTGCTGCAGATTGGGCCA TCAGGGAAGGCTGACCTGCCACCCATCAATGGCCCAGTGGACA TGGACAGAGGGCCCATCATGCCTGTGCCCGTGGGAATCCGGCC AGTGCTCAGCAAGTACCGAGTGGAGGTGCTGTTCTGGGGCCTG AGGGACCTAAAGAGGGTGAACCTGGCCCAGGTGGACCGACCAC GGGTGGACATCGAGTGTGCAGGAAAGGGGGTACAATCCTCCCT GATTCACAATTATAAGAAGAACCCCAACTTCAACACGCTGGTCAA GTGGTTTGAAGTGGACCTCCCGGAGAATGAGCTCCTGCACCCA CCCTTGAACATCCGAGTGGTAGATTGCCGGGCCTTTGGACGATA CACCCTGGTGGGTTCCCACGCAGTCAGCTCACTGAGGCGCTTC ATCTACCGACCTCCAGACCGCTCAGCCCCCAACTGGAACACCA CAGGGGAGGTTGTAGTAAGCATGGAGCCTGAGGAGCCAGTTAA GAAGCTGGAGACCATGGTGAAACTGGATGCGACTTCTGATGCT GTGGTCAAGGTGGATGTGGCTGAAGATGAGAAGGAAAGGAAGA AGAAGAAAAAGAAAGGCCCGTCAGAGGAGCCAGAGGAGGAAGA GCCCGATGAGAGCATGCTGGATTGGTGGTCCAAGTACTTCGCC TCCATCGACACAATGAAGGAGCAACTTCGACAACATGAGACCTC TGGAACTGACTTGGAAGAGAAGGAAGAGATGGAAAGCGCTGAG GGCCTGAAGGGACCAATGAAGAGCAAGGAGAAGTCCAGAGCTG CAAAGGAGGAGAAAAAGAAGAAAAACCAGAGCCCTGGCCCTGG CCAGGGATCGGAGGCTCCTGAGAAGAAGAAAGCCAAGATCGAT GAGCTTAAGGTGTACCCCAAGGAGCTGGAATCGGAGTTTGACA GCTTTGAGGACTGGCTGCACACCTTCAACCTGTTGAGGGGCAA GACGGGAGATGATGAGGATGGCTCCACAGAGGAGGAGCGCATA GTAGGCCGATTCAAGGGCTCCCTCTGTGTGTACAAAGTGCCACT CCCAGAAGATGTATCTCGAGAAGCTGGCTATGATCCCACCTATG GAATGTTCCAGGGCATCCCAAGCAATGACCCCATCAATGTGCTG GTCCGAATCTATGTGGTCCGGGCCACAGACCTGCACCCGGCCG ACATCAATGGCAAAGCTGACCCCTATATTGCCATCAAGTTAGGC AAGACCGACATCCGAGACAAGGAGAACTACATCTCCAAGCAGCT CAACCCTGTGTTTGGGAAGTCCTTTGACATTGAGGCCTCCTTCC CCATGGAGTCCATGTTGACAGTGGCCGTGTACGACTGGGATCT GGTGGGCACTGATGACCTCATCGGAGAAACCAAGATTGACCTG GAAAACCGCTTCTACAGCAAGCATCGCGCCACCTGCGGCATCG CACAGACCTATTCCATACATGGCTACAATATCTGGAGGGACCCC ATGAAGCCCAGCCAGATCCTGACACGCCTCTGTAAAGAGGGCA AAGTGGACGGCCCCCACTTTGGTCCCCATGGGAGAGTGAGGGT TGCCAACCGTGTCTTCACGGGGCCTTCAGAAATAGAGGATGAGA ATGGTCAGAGGAAGCCCACAGATGAGCACGTGGCACTGTCTGC TCTGAGACACTGGGAGGACATCCCCCGGGTGGGCTGCCGCCTT GTGCCGGAACACGTGGAGACCAGGCCGCTGCTCAACCCTGACA AGCCAGGCATTGAGCAGGGCCGCCTGGAGCTGTGGGTGGACAT GTTCCCCATGGACATGCCAGCCCCTGGGACACCTCTGGATATAT CCCCCAGGAAACCCAAGAAGTACGAGCTGCGGGTCATCGTGTG GAACACAGACGAGGTGGTCCTGGAAGACGATGATTTCTTCACG GGAGAGAAGTCCAGTGACATTTTTGTGAGGGGGTGGCTGAAGG GCCAGCAGGAGGACAAACAGGACACAGATGTCCACTATCACTC CCTCACGGGGGAGGGCAACTTCAACTGGAGATACCTCTTCCCC TTCGACTACCTAGCGGCCGAAGAGAAGATCGTTATGTCCAAAAA GGAGTCTATGTTCTCCTGGGATGAGACGGAGTACAAGATCCCTG CGCGGCTCACCCTGCAGATCTGGGACGCTGACCACTTCTCGGC TGACGACTTCCTGGGGGCTATCGAGCTGGACCTGAACCGGTTC CCGAGGGGCGCTAAGACAGCCAAGCAGTGCACCATGGAGATGG CCACCGGGGAGGTGGACGTACCCCTGGTTTCCATCTTTAAACAG AAACGTGTCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGA ATGATGAGTTTGAGCTCACAGGCAAAGTGGAGGCGGAGCTACA CCTACTCACGGCAGAGGAGGCAGAGAAGAACCCTGTGGGCCTG GCTCGCAATGAACCTGATCCCCTAGAAAAACCCAATCGGCCGGA CACAAGCTTCATCTGGTTCTTGAACCCTCTCAAGTCTGCCCGCT ACTTCCTGTGGCATACCTACCGCTGGCTACTCCTCAAATTCCTG CTGCTCTTCCTCCTGCTGCTGCTCTTCGCCCTGTTTCTCTACTCT CTGCCTGGCTACCTGGCCAAGAAGATCCTTGGGGCCTGAGCCC TGCAGTCGCCTAGGCCTGCCGGCCTGACACGGCATTCGTCTGG TTCCTGAACCCACTCAAATCTATCAAGTACCTCATCTGCACCCG GTACAAGTGGCTGATCATCAAGATCGTGCTGGCGCTGCTGGGG CTGCTCATGCTGGCCCTCTTCCTTTACAGCCTCCCAGGCTACAT GGTCAAGAAGCTCCTAGGGGCCTGAAGTGTGCCCCACCCCAGC CCGCTCCAGCATCCCTCCAGGGGCTGCTGCGTATTTTGCCTTCC CTCACCTGGACTCTCTCCCAACTCCCTGAGGAGCCCTCCCACG CCTGCCAGCCTTGAGCAAGACACCTGCTTGCTGGACTTCATCCC CACCCCACACCCAAACTGTTGCTTGCCTGATCTTGTCCCAGGCC TGCCTGGGGTTTGGGGCACAGTTGGCCTCCAAAACCAGATACC CTCTTGTCTAAAGTACCAGGTTCCTCTGCCCAACCCCAAGAGTG GTAGTGGCCCAACCCTCCCTGTGCTTTCCAAATCTTGTCTTAAG GCACCAGTGAAATTAACCAAGAAACGCGGAGCAATGCCCAAGG CTCTGATGAGTAGGAACACGTGGAAAGCACCAGGAATGCCAGC AGAGGCGAGGCGGCACACCTCTCTGCAGAGCATCCAGGCCGA GCGGCGGGCAGCGGCCAGCTGCTTCTGCGCATGCTCTCCTCTT GGCTCTGCTTCTTTCTCACAGTCACAGTCACTTCACAGCTTAGC CTTGGGCTTCCCATCACTTCCAGGGGTGCCTCTGCCTTGGCCA GTGTGTGTCAGCTAGTACACAAGCTCCAAGTGTGAATCAGGTGT ACTGGCCGTCCTGAAGACTGACTGCCCTGTCCTTCCTGCCGACA GCCACACCCGAGTGTACACTTAAAGCGGTGCCCTTCTGCCTCTG TGGGCCTGCTGGCTGCTGTTCCTTTCTTGAGTGTGATTTTTTTTT TCTCTCCCTCAATAAAATAAATCAAACTCTGAGAC 17 mOTOF-202_1 TTGGTTGCCTTGGTCTCTGTGGGCAGCAGCAGGAGGAGGCGGC transcript AGCAGCCAGAGAAGAGGGAGGCGTGTGAGCCACACTCCACCAG (NM_001100395.1), CGAGCTTCTTCCCGCTGCTCTGGAACTGCCCAGGCTCTCCCCA mouse otoferlin CCAGCATGGCCCTGATTGTTCACCTCAAGACTGTCTCAGAGCTC transcript variant 1, CGAGGCAAAGGTGACCGGATTGCCAAAGTCACTTTCCGAGGGC 6881 bp, encodes AGTCTTTCTACTCCCGGGTCCTGGAGAACTGCGAGGGTGTGGC the protein of SEQ TGACTTTGATGAGACGTTCCGGTGGCCAGTGGCCAGCAGCATC ID NO: 8 GACCGGAATGAAGTGTTGGAGATTCAGATTTTCAACTACAGCAA AGTCTTCAGCAACAAGCTGATAGGGACCTTCTGCATGGTGCTGC AGAAAGTGGTGGAGGAGAATCGGGTAGAGGTGACCGACACGCT GATGGATGACAGCAATGCTATCATCAAGACCAGCCTGAGCATGG AGGTCCGGTATCAGGCCACAGATGGCACTGTGGGCCCCTGGGA TGATGGAGACTTCCTGGGAGATGAATCCCTCCAGGAGGAGAAG GACAGCCAGGAGACAGATGGGCTGCTACCTGGTTCCCGACCCA GCACCCGGATATCTGGCGAGAAGAGCTTTCGCAGCAAAGGCAG AGAGAAGACCAAGGGAGGCAGAGATGGCGAGCACAAAGCGGG AAGGAGTGTGTTCTCGGCCATGAAACTCGGCAAAACTCGGTCCC ACAAAGAGGAGCCCCAAAGACAAGATGAGCCAGCAGTGCTGGA GATGGAGGACCTGGACCACCTAGCCATTCAGCTGGGGGATGGG CTGGATCCTGACTCCGTGTCTCTAGCCTCGGTCACCGCTCTCAC CAGCAATGTCTCCAACAAACGGTCTAAGCCAGATATTAAGATGG AGCCCAGTGCTGGAAGGCCCATGGATTACCAGGTCAGCATCAC AGTGATTGAGGCTCGGCAGCTGGTGGGCTTGAACATGGACCCT GTGGTGTGTGTGGAGGTGGGTGATGACAAGAAATACACGTCAAT GAAGGAGTCCACAAACTGCCCTTACTACAACGAGTACTTTGTCT TCGACTTCCATGTCTCTCCTGATGTCATGTTTGACAAGATCATCA AGATCTCGGTTATCCATTCTAAGAACCTGCTTCGGAGCGGCACC CTGGTGGGTTCCTTCAAAATGGATGTGGGGACTGTGTATTCCCA GCCTGAACACCAGTTCCATCACAAATGGGCCATCCTGTCAGACC CCGATGACATCTCTGCTGGGTTGAAGGGTTATGTAAAGTGTGAT GTCGCTGTGGTGGGCAAGGGAGACAACATCAAGACACCCCACA AGGCCAACGAGACGGATGAGGACGACATTGAAGGGAACTTGCT GCTCCCCGAGGGCGTGCCCCCCGAACGGCAGTGGGCACGGTT CTATGTGAAAATTTACCGAGCAGAGGGACTGCCCCGGATGAACA CAAGCCTCATGGCCAACGTGAAGAAGGCGTTCATCGGTGAGAA CAAGGACCTCGTCGACCCCTATGTGCAAGTCTTCTTTGCTGGAC AAAAGGGCAAAACATCAGTGCAGAAGAGCAGCTATGAGCCGCT ATGGAATGAGCAGGTCGTCTTCACAGACTTGTTCCCCCCACTCT GCAAACGCATGAAGGTGCAGATCCGGGACTCTGACAAGGTCAA TGATGTGGCCATCGGCACCCACTTCATCGACCTGCGCAAGATTT CCAACGATGGAGACAAAGGCTTCCTGCCTACCCTCGGTCCAGC CTGGGTGAACATGTACGGCTCCACGCGCAACTACACACTGCTG GACGAGCACCAGGACTTGAATGAAGGCCTGGGGGAGGGTGTGT CCTTCCGGGCCCGCCTCATGTTGGGACTAGCTGTGGAGATCCT GGACACCTCCAACCCAGAGCTCACCAGCTCCACGGAGGTGCAG GTGGAGCAGGCCACGCCTGTCTCGGAGAGCTGCACAGGGAGA ATGGAAGAATTTTTTCTATTTGGAGCCTTCTTGGAAGCCTCAATG ATTGACCGGAAAAATGGGGACAAGCCAATTACCTTTGAGGTGAC CATAGGAAACTACGGCAATGAAGTCGATGGTATGTCCCGGCCC CTGAGGCCTCGGCCCCGGAAAGAGCCTGGGGATGAAGAAGAG GTAGACCTGATTCAGAACTCCAGTGACGATGAAGGTGACGAAGC CGGGGACCTGGCCTCGGTGTCCTCCACCCCACCTATGCGGCCC CAGATCACGGACAGGAACTATTTCCACCTGCCCTACCTGGAGCG CAAGCCCTGCATCTATATCAAGAGCTGGTGGCCTGACCAGAGG CGGCGCCTCTACAATGCCAACATCATGGATCACATTGCTGACAA GCTGGAAGAAGGCCTGAATGATGTACAGGAGATGATCAAAACG GAGAAGTCCTACCCGGAGCGCCGCCTGCGGGGTGTGCTAGAG GAACTCAGCTGTGGCTGCCACCGCTTCCTCTCCCTCTCGGACAA GGACCAGGGCCGCTCGTCCCGCACCAGGCTGGATCGAGAGCG TCTTAAGTCCTGTATGAGGGAGTTGGAGAGCATGGGACAGCAG GCCAAGAGCCTGAGGGCTCAGGTGAAGCGGCACACTGTTCGGG ACAAGCTGAGGTCATGCCAGAACTTTCTGCAGAAGCTACGCTTC CTGGCGGATGAGCCCCAGCACAGCATTCCTGATGTGTTCATTTG GATGATGAGCAACAACAAACGTATCGCCTATGCCCGCGTGCCTT CCAAAGACCTGCTCTTCTCCATCGTGGAGGAGGAACTGGGCAA GGACTGCGCCAAAGTCAAGACCCTCTTCCTGAAGCTGCCAGGG AAGAGGGGCTTCGGCTCGGCAGGCTGGACAGTACAGGCCAAG CTGGAGCTCTACCTGTGGCTGGGCCTCAGCAAGCAGCGAAAGG ACTTCCTGTGTGGTCTGCCCTGTGGCTTCGAGGAGGTCAAGGC AGCCCAAGGCCTGGGCCTGCATTCCTTTCCGCCCATCAGCCTA GTCTACACCAAGAAGCAAGCCTTCCAGCTCCGAGCACACATGTA TCAGGCCCGAAGCCTCTTTGCTGCTGACAGCAGTGGGCTCTCT GATCCCTTTGCCCGTGTCTTCTTCATCAACCAGAGCCAATGCAC TGAGGTTCTAAACGAGACACTGTGTCCCACCTGGGACCAGATGC TGGTATTTGACAACCTGGAGCTGTACGGTGAAGCTCACGAGTTA CGAGATGATCCCCCCATCATTGTCATTGAAATCTACGACCAGGA CAGCATGGGCAAAGCCGACTTCATGGGCCGGACCTTCGCCAAG CCCCTGGTGAAGATGGCAGATGAAGCATACTGCCCACCTCGCTT CCCGCCGCAGCTTGAGTACTACCAGATCTACCGAGGCAGTGCC ACTGCCGGAGACCTACTGGCTGCCTTCGAGCTGCTGCAGATTG GGCCATCAGGGAAGGCTGACCTGCCACCCATCAATGGCCCAGT GGACATGGACAGAGGGCCCATCATGCCTGTGCCCGTGGGAATC CGGCCAGTGCTCAGCAAGTACCGAGTGGAGGTGCTGTTCTGGG GCCTGAGGGACCTAAAGAGGGTGAACCTGGCCCAGGTGGACC GACCACGGGTGGACATCGAGTGTGCAGGAAAGGGGGTACAATC CTCCCTGATTCACAATTATAAGAAGAACCCCAACTTCAACACGCT GGTCAAGTGGTTTGAAGTGGACCTCCCGGAGAATGAGCTCCTG CACCCACCCTTGAACATCCGAGTGGTAGATTGCCGGGCCTTTG GACGATACACCCTGGTGGGTTCCCACGCAGTCAGCTCACTGAG GCGCTTCATCTACCGACCTCCAGACCGCTCAGCCCCCAACTGG AACACCACAGGGGAGGTTGTAGTAAGCATGGAGCCTGAGGAGC CAGTTAAGAAGCTGGAGACCATGGTGAAACTGGATGCGACTTCT GATGCTGTGGTCAAGGTGGATGTGGCTGAAGATGAGAAGGAAA GGAAGAAGAAGAAAAAGAAAGGCCCGTCAGAGGAGCCAGAGGA GGAAGAGCCCGATGAGAGCATGCTGGATTGGTGGTCCAAGTAC TTCGCCTCCATCGACACAATGAAGGAGCAACTTCGACAACATGA GACCTCTGGAACTGACTTGGAAGAGAAGGAAGAGATGGAAAGC GCTGAGGGCCTGAAGGGACCAATGAAGAGCAAGGAGAAGTCCA GAGCTGCAAAGGAGGAGAAAAAGAAGAAAAACCAGAGCCCTGG CCCTGGCCAGGGATCGGAGGCTCCTGAGAAGAAGAAAGCCAAG ATCGATGAGCTTAAGGTGTACCCCAAGGAGCTGGAATCGGAGTT TGACAGCTTTGAGGACTGGCTGCACACCTTCAACCTGTTGAGGG GCAAGACGGGAGATGATGAGGATGGCTCCACAGAGGAGGAGC GCATAGTAGGCCGATTCAAGGGCTCCCTCTGTGTGTACAAAGTG CCACTCCCAGAAGATGTATCTCGAGAAGCTGGCTATGATCCCAC CTATGGAATGTTCCAGGGCATCCCAAGCAATGACCCCATCAATG TGCTGGTCCGAATCTATGTGGTCCGGGCCACAGACCTGCACCC GGCCGACATCAATGGCAAAGCTGACCCCTATATTGCCATCAAGT TAGGCAAGACCGACATCCGAGACAAGGAGAACTACATCTCCAAG CAGCTCAACCCTGTGTTTGGGAAGTCCTTTGACATTGAGGCCTC CTTCCCCATGGAGTCCATGTTGACAGTGGCCGTGTACGACTGG GATCTGGTGGGCACTGATGACCTCATCGGAGAAACCAAGATTGA CCTGGAAAACCGCTTCTACAGCAAGCATCGCGCCACCTGCGGC ATCGCACAGACCTATTCCATACATGGCTACAATATCTGGAGGGA CCCCATGAAGCCCAGCCAGATCCTGACACGCCTCTGTAAAGAG GGCAAAGTGGACGGCCCCCACTTTGGTCCCCATGGGAGAGTGA GGGTTGCCAACCGTGTCTTCACGGGGCCTTCAGAAATAGAGGA TGAGAATGGTCAGAGGAAGCCCACAGATGAGCACGTGGCACTG TCTGCTCTGAGACACTGGGAGGACATCCCCCGGGTGGGCTGCC GCCTTGTGCCGGAACACGTGGAGACCAGGCCGCTGCTCAACCC TGACAAGCCAGGCATTGAGCAGGGCCGCCTGGAGCTGTGGGTG GACATGTTCCCCATGGACATGCCAGCCCCTGGGACACCTCTGG ATATATCCCCCAGGAAACCCAAGAAGTACGAGCTGCGGGTCATC GTGTGGAACACAGACGAGGTGGTCCTGGAAGACGATGATTTCTT CACGGGAGAGAAGTCCAGTGACATTTTTGTGAGGGGGTGGCTG AAGGGCCAGCAGGAGGACAAACAGGACACAGATGTCCACTATC ACTCCCTCACGGGGGAGGGCAACTTCAACTGGAGATACCTCTTC CCCTTCGACTACCTAGCGGCCGAAGAGAAGATCGTTATGTCCAA AAAGGAGTCTATGTTCTCCTGGGATGAGACGGAGTACAAGATCC CTGCGCGGCTCACCCTGCAGATCTGGGACGCTGACCACTTCTC GGCTGACGACTTCCTGGGGGCTATCGAGCTGGACCTGAACCGG TTCCCGAGGGGCGCTAAGACAGCCAAGCAGTGCACCATGGAGA TGGCCACCGGGGAGGTGGACGTACCCCTGGTTTCCATCTTTAAA CAGAAACGTGTCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATG AGAATGATGAGTTTGAGCTCACAGGCAAAGTGGAGGCGGAGCT ACACCTACTCACGGCAGAGGAGGCAGAGAAGAACCCTGTGGGC CTGGCTCGCAATGAACCTGATCCCCTAGAAAAACCCAACCGGCC TGACACGGCATTCGTCTGGTTCCTGAACCCACTCAAATCTATCA AGTACCTCATCTGCACCCGGTACAAGTGGCTGATCATCAAGATC GTGCTGGCGCTGCTGGGGCTGCTCATGCTGGCCCTCTTCCTTT ACAGCCTCCCAGGCTACATGGTCAAGAAGCTCCTAGGGGCCTG AAGTGTGCCCCACCCCAGCCCGCTCCAGCATCCCTCCAGGGGC TGCTGCGTATTTTGCCTTCCCTCACCTGGACTCTCTCCCAACTC CCTGAGGAGCCCTCCCACGCCTGCCAGCCTTGAGCAAGACACC TGCTTGCTGGACTTCATCCCCACCCCACACCCAAACTGTTGCTT GCCTGATCTTGTCCCAGGCCTGCCTGGGGTTTGGGGCACAGTT GGCCTCCAAAACCAGATACCCTCTTGTCTAAAGTACCAGGTTCC TCTGCCCAACCCCAAGAGTGGTAGTGGCCCAACCCTCCCTGTG CTTTCCAAATCTTGTCTTAAGGCACCAGTGAAATTAACCAAGAAA CGCGGAGCAATGCCCAAGGCTCTGATGAGTAGGAACACGTGGA AAGCACCAGGAATGCCAGCAGAGGCGAGGCGGCACACCTCTCT GCAGAGCATCCAGGCCGAGCGGCGGGCAGCGGCCAGCTGCTT CTGCGCATGCTCTCCTCTTGGCTCTGCTTCTTTCTCACAGTCACA GTCACTTCACAGCTTAGCCTTGGGCTTCCCATCACTTCCAGGGG TGCCTCTGCCTTGGCCAGTGTGTGTCAGCTAGTACACAAGCT CCAAGTGTGAATCAGGTGTACTGGCCGTCCTGAAGACTGACTGC CCTGTCCTTCCTGCCGACAGCCACACCCGAGTGTACACTTAAAG CGGTGCCCTTCTGCCTCTGTGGGCCTGCTGGCTGCTGTTCCTTT CTTGAGTGTGATTTTTTTTTTCTCTCCCTCAATAAAATAAATCAAA CTCTGAGAC 18 mOTOF-202_2 TTGGTTGCCTTGGTCTCTGTGGGCAGCAGCAGGAGGAGGCGGC transcript AGCAGCCAGAGAAGAGGGAGGCGTGTGAGCCACACTCCACCAG (NM_001313767.1), CGAGCTTCTTCCCGCTGCTCTGGAACTGCCCAGGCTCTCCCCA mouse otoferlin CCAGCATGGCCCTGATTGTTCACCTCAAGACTGTCTCAGAGCTC transcript variant 4, CGAGGCAAAGGTGACCGGATTGCCAAAGTCACTTTCCGAGGGC 6881 bp, encodes AGTCTTTCTACTCCCGGGTCCTGGAGAACTGCGAGGGTGTGGC the protein of SEQ TGACTTTGATGAGACGTTCCGGTGGCCAGTGGCCAGCAGCATC ID NO: 9 GACCGGAATGAAGTGTTGGAGATTCAGATTTTCAACTACAGCAA AGTCTTCAGCAACAAGCTGATAGGGACCTTCTGCATGGTGCTGC AGAAAGTGGTGGAGGAGAATCGGGTAGAGGTGACCGACACGCT GATGGATGACAGCAATGCTATCATCAAGACCAGCCTGAGCATGG AGGTCCGGTATCAGGCCACAGATGGCACTGTGGGCCCCTGGGA TGATGGAGACTTCCTGGGAGATGAATCCCTCCAGGAGGAGAAG GACAGCCAGGAGACAGATGGGCTGCTACCTGGTTCCCGACCCA GCACCCGGATATCTGGCGAGAAGAGCTTTCGCAGAGCGGGAAG GAGTGTGTTCTCGGCCATGAAACTCGGCAAAACTCGGTCCCACA AAGAGGAGCCCCAAAGACAAGATGAGCCAGCAGTGCTGGAGAT GGAGGACCTGGACCACCTAGCCATTCAGCTGGGGGATGGGCTG GATCCTGACTCCGTGTCTCTAGCCTCGGTCACCGCTCTCACCAG CAATGTCTCCAACAAACGGTCTAAGCCAGATATTAAGATGGAGC CCAGTGCTGGAAGGCCCATGGATTACCAGGTCAGCATCACAGT GATTGAGGCTCGGCAGCTGGTGGGCTTGAACATGGACCCTGTG GTGTGTGTGGAGGTGGGTGATGACAAGAAATACACGTCAATGAA GGAGTCCACAAACTGCCCTTACTACAACGAGTACTTTGTCTTCG ACTTCCATGTCTCTCCTGATGTCATGTTTGACAAGATCATCAAGA TCTCGGTTATCCATTCTAAGAACCTGCTTCGGAGCGGCACCCTG GTGGGTTCCTTCAAAATGGATGTGGGGACTGTGTATTCCCAGCC TGAACACCAGTTCCATCACAAATGGGCCATCCTGTCAGACCCCG ATGACATCTCTGCTGGGTTGAAGGGTTATGTAAAGTGTGATGTC GCTGTGGTGGGCAAGGGAGACAACATCAAGACACCCCACAAGG CCAACGAGACGGATGAGGACGACATTGAAGGGAACTTGCTGCT CCCCGAGGGCGTGCCCCCCGAACGGCAGTGGGCACGGTTCTA TGTGAAAATTTACCGAGCAGAGGGACTGCCCCGGATGAACACAA GCCTCATGGCCAACGTGAAGAAGGCGTTCATCGGTGAGAACAA GGACCTCGTCGACCCCTATGTGCAAGTCTTCTTTGCTGGACAAA AGGGCAAAACATCAGTGCAGAAGAGCAGCTATGAGCCGCTATG GAATGAGCAGGTCGTCTTCACAGACTTGTTCCCCCCACTCTGCA AACGCATGAAGGTGCAGATCCGGGACTCTGACAAGGTCAATGAT GTGGCCATCGGCACCCACTTCATCGACCTGCGCAAGATTTCCAA CGATGGAGACAAAGGCTTCCTGCCTACCCTCGGTCCAGCCTGG GTGAACATGTACGGCTCCACGCGCAACTACACACTGCTGGACG AGCACCAGGACTTGAATGAAGGCCTGGGGGAGGGTGTGTCCTT CCGGGCCCGCCTCATGTTGGGACTAGCTGTGGAGATCCTGGAC ACCTCCAACCCAGAGCTCACCAGCTCCACGGAGGTGCAGGTGG AGCAGGCCACGCCTGTCTCGGAGAGCTGCACAGGGAGAATGGA AGAATTTTTTCTATTTGGAGCCTTCTTGGAAGCCTCAATGATTGA CCGGAAAAATGGGGACAAGCCAATTACCTTTGAGGTGACCATAG GAAACTACGGCAATGAAGTCGATGGTATGTCCCGGCCCCTGAG GCCTCGGCCCCGGAAAGAGCCTGGGGATGAAGAAGAGGTAGA CCTGATTCAGAACTCCAGTGACGATGAAGGTGACGAAGCCGGG GACCTGGCCTCGGTGTCCTCCACCCCACCTATGCGGCCCCAGA TCACGGACAGGAACTATTTCCACCTGCCCTACCTGGAGCGCAAG CCCTGCATCTATATCAAGAGCTGGTGGCCTGACCAGAGGCGGC GCCTCTACAATGCCAACATCATGGATCACATTGCTGACAAGCTG GAAGAAGGCCTGAATGATGTACAGGAGATGATCAAAACGGAGAA GTCCTACCCGGAGCGCCGCCTGCGGGGTGTGCTAGAGGAACTC AGCTGTGGCTGCCACCGCTTCCTCTCCCTCTCGGACAAGGACC AGGGCCGCTCGTCCCGCACCAGGCTGGATCGAGAGCGTCTTAA GTCCTGTATGAGGGAGTTGGAGAGCATGGGACAGCAGGCCAAG AGCCTGAGGGCTCAGGTGAAGCGGCACACTGTTCGGGACAAGC TGAGGTCATGCCAGAACTTTCTGCAGAAGCTACGCTTCCTGGCG GATGAGCCCCAGCACAGCATTCCTGATGTGTTCATTTGGATGAT GAGCAACAACAAACGTATCGCCTATGCCCGCGTGCCTTCCAAAG ACCTGCTCTTCTCCATCGTGGAGGAGGAACTGGGCAAGGACTG CGCCAAAGTCAAGACCCTCTTCCTGAAGCTGCCAGGGAAGAGG GGCTTCGGCTCGGCAGGCTGGACAGTACAGGCCAAGCTGGAG CTCTACCTGTGGCTGGGCCTCAGCAAGCAGCGAAAGGACTTCC TGTGTGGTCTGCCCTGTGGCTTCGAGGAGGTCAAGGCAGCCCA AGGCCTGGGCCTGCATTCCTTTCCGCCCATCAGCCTAGTCTACA CCAAGAAGCAAGCCTTCCAGCTCCGAGCACACATGTATCAGGC CCGAAGCCTCTTTGCTGCTGACAGCAGTGGGCTCTCTGATCCCT TTGCCCGTGTCTTCTTCATCAACCAGAGCCAATGCACTGAGGTT CTAAACGAGACACTGTGTCCCACCTGGGACCAGATGCTGGTATT TGACAACCTGGAGCTGTACGGTGAAGCTCACGAGTTACGAGAT GATCCCCCCATCATTGTCATTGAAATCTACGACCAGGACAGCAT GGGCAAAGCCGACTTCATGGGCCGGACCTTCGCCAAGCCCCTG GTGAAGATGGCAGATGAAGCATACTGCCCACCTCGCTTCCCGC CGCAGCTTGAGTACTACCAGATCTACCGAGGCAGTGCCACTGC CGGAGACCTACTGGCTGCCTTCGAGCTGCTGCAGATTGGGCCA TCAGGGAAGGCTGACCTGCCACCCATCAATGGCCCAGTGGACA TGGACAGAGGGCCCATCATGCCTGTGCCCGTGGGAATCCGGCC AGTGCTCAGCAAGTACCGAGTGGAGGTGCTGTTCTGGGGCCTG AGGGACCTAAAGAGGGTGAACCTGGCCCAGGTGGACCGACCAC GGGTGGACATCGAGTGTGCAGGAAAGGGGGTACAATCCTCCCT GATTCACAATTATAAGAAGAACCCCAACTTCAACACGCTGGTCAA GTGGTTTGAAGTGGACCTCCCGGAGAATGAGCTCCTGCACCCA CCCTTGAACATCCGAGTGGTAGATTGCCGGGCCTTTGGACGATA CACCCTGGTGGGTTCCCACGCAGTCAGCTCACTGAGGCGCTTC ATCTACCGACCTCCAGACCGCTCAGCCCCCAACTGGAACACCA CAGGGGAGGTTGTAGTAAGCATGGAGCCTGAGGAGCCAGTTAA GAAGCTGGAGACCATGGTGAAACTGGATGCGACTTCTGATGCT GTGGTCAAGGTGGATGTGGCTGAAGATGAGAAGGAAAGGAAGA AGAAGAAAAAGAAAGGCCCGTCAGAGGAGCCAGAGGAGGAAGA GCCCGATGAGAGCATGCTGGATTGGTGGTCCAAGTACTTCGCC TCCATCGACACAATGAAGGAGCAACTTCGACAACATGAGACCTC TGGAACTGACTTGGAAGAGAAGGAAGAGATGGAAAGCGCTGAG GGCCTGAAGGGACCAATGAAGAGCAAGGAGAAGTCCAGAGCTG CAAAGGAGGAGAAAAAGAAGAAAAACCAGAGCCCTGGCCCTGG CCAGGGATCGGAGGCTCCTGAGAAGAAGAAAGCCAAGATCGAT GAGCTTAAGGTGTACCCCAAGGAGCTGGAATCGGAGTTTGACA GCTTTGAGGACTGGCTGCACACCTTCAACCTGTTGAGGGGCAA GACGGGAGATGATGAGGATGGCTCCACAGAGGAGGAGCGCATA GTAGGCCGATTCAAGGGCTCCCTCTGTGTGTACAAAGTGCCACT CCCAGAAGATGTATCTCGAGAAGCTGGCTATGATCCCACCTATG GAATGTTCCAGGGCATCCCAAGCAATGACCCCATCAATGTGCTG GTCCGAATCTATGTGGTCCGGGCCACAGACCTGCACCCGGCCG ACATCAATGGCAAAGCTGACCCCTATATTGCCATCAAGTTAGGC AAGACCGACATCCGAGACAAGGAGAACTACATCTCCAAGCAGCT CAACCCTGTGTTTGGGAAGTCCTTTGACATTGAGGCCTCCTTCC CCATGGAGTCCATGTTGACAGTGGCCGTGTACGACTGGGATCT GGTGGGCACTGATGACCTCATCGGAGAAACCAAGATTGACCTG GAAAACCGCTTCTACAGCAAGCATCGCGCCACCTGCGGCATCG CACAGACCTATTCCATACATGGCTACAATATCTGGAGGGACCCC ATGAAGCCCAGCCAGATCCTGACACGCCTCTGTAAAGAGGGCA AAGTGGACGGCCCCCACTTTGGTCCCCATGGGAGAGTGAGGGT TGCCAACCGTGTCTTCACGGGGCCTTCAGAAATAGAGGATGAGA ATGGTCAGAGGAAGCCCACAGATGAGCACGTGGCACTGTCTGC TCTGAGACACTGGGAGGACATCCCCCGGGTGGGCTGCCGCCTT GTGCCGGAACACGTGGAGACCAGGCCGCTGCTCAACCCTGACA AGCCAGGCATTGAGCAGGGCCGCCTGGAGCTGTGGGTGGACAT GTTCCCCATGGACATGCCAGCCCCTGGGACACCTCTGGATATAT CCCCCAGGAAACCCAAGAAGTACGAGCTGCGGGTCATCGTGTG GAACACAGACGAGGTGGTCCTGGAAGACGATGATTTCTTCACG GGAGAGAAGTCCAGTGACATTTTTGTGAGGGGGTGGCTGAAGG GCCAGCAGGAGGACAAACAGGACACAGATGTCCACTATCACTC CCTCACGGGGGAGGGCAACTTCAACTGGAGATACCTCTTCCCC TTCGACTACCTAGCGGCCGAAGAGAAGATCGTTATGTCCAAAAA GGAGTCTATGTTCTCCTGGGATGAGACGGAGTACAAGATCCCTG CGCGGCTCACCCTGCAGATCTGGGACGCTGACCACTTCTCGGC TGACGACTTCCTGGGGGCTATCGAGCTGGACCTGAACCGGTTC CCGAGGGGCGCTAAGACAGCCAAGCAGTGCACCATGGAGATGG CCACCGGGGAGGTGGACGTACCCCTGGTTTCCATCTTTAAACAG AAACGTGTCAAAGGCTGGTGGCCCCTCCTGGCCCGCAATGAGA ATGATGAGTTTGAGCTCACAGGCAAAGTGGAGGCGGAGCTACA CCTACTCACGGCAGAGGAGGCAGAGAAGAACCCTGTGGGCCTG GCTCGCAATGAACCTGATCCCCTAGAAAAACCCAACCGGCCTGA CACGGCATTCGTCTGGTTCCTGAACCCACTCAAATCTATCAAGT ACCTCATCTGCACCCGGTACAAGTGGCTGATCATCAAGATCGTG CTGGCGCTGCTGGGGCTGCTCATGCTGGCCCTCTTCCTTTACA GCCTCCCAGGCTACATGGTCAAGAAGCTCCTAGGGGCCTGAAG TGTGCCCCACCCCAGCCCGCTCCAGCATCCCTCCAGGGGCTGC TGCGTATTTTGCCTTCCCTCACCTGGACTCTCTCCCAACTCCCT GAGGAGCCCTCCCACGCCTGCCAGCCTTGAGCAAGACACCTGC TTGCTGGACTTCATCCCCACCCCACACCCAAACTGTTGCTTGCC TGATCTTGTCCCAGGCCTGCCTGGGGTTTGGGGCACAGTTGGC CTCCAAAACCAGATACCCTCTTGTCTAAAGTACCAGGTTCCTCTG CCCAACCCCAAGAGTGGTAGTGGCCCAACCCTCCCTGTGCTTTC CAAATCTTGTCTTAAGGCACCAGTGAAATTAACCAAGAAACGCG GAGCAATGCCCAAGGCTCTGATGAGTAGGAACACGTGGAAAGC ACCAGGAATGCCAGCAGAGGCGAGGCGGCACACCTCTCTGCAG AGCATCCAGGCCGAGCGGCGGGCAGCGGCCAGCTGCTTCTGC GCATGCTCTCCTCTTGGCTCTGCTTCTTTCTCACAGTCACAGTCA CTTCACAGCTTAGCCTTGGGCTTCCCATCACTTCCAGGGGTGCC TCTGCCTTGGCCAGTGTGTGTCAGCTAGTACACAAGCTCCAAGT GTGAATCAGGTGTACTGGCCGTCCTGAAGACTGACTGCCCTGTC CTTCCTGCCGACAGCCACACCCGAGTGTACACTTAAAGCGGTG CCCTTCTGCCTCTGTGGGCCTGCTGGCTGCTGTTCCTTTCTTGA GTGTGATTTTTTTTTTCTCTCCCTCAATAAAATAAATCAAACTCTG AGAC
Expression of OTOF in Mammalian Cells
[0185] Mutations in OTOF have been linked to sensorineural hearing loss and auditory neuropathy. The compositions and methods described herein increase the expression of WT OTOF protein by administering a first nucleic acid vector that contains a polynucleotide encoding an N-terminal portion of an OTOF protein and a second nucleic acid vector that contains a polynucleotide encoding a C-terminal portion of an OTOF protein. In order to utilize nucleic acid vectors for therapeutic application in the treatment of sensorineural hearing loss and auditory neuropathy, they can be directed to the interior of the cell, and, in particular, to specific cell types. A wide array of methods has been established for the delivery of proteins to mammalian cells and for the stable expression of genes encoding proteins in mammalian cells.
Polynucleotides Encoding OTOF
[0186] One platform that can be used to achieve therapeutically effective intracellular concentrations of OTOF in mammalian cells is via the stable expression of the gene encoding OTOF (e.g., by integration into the nuclear or mitochondrial genome of a mammalian cell, or by episomal concatemer formation in the nucleus of a mammalian cell). The gene is a polynucleotide that encodes the primary amino acid sequence of the corresponding protein. In order to introduce exogenous genes into a mammalian cell, genes can be incorporated into a vector. Vectors can be introduced into a cell by a variety of methods, including transformation, transfection, transduction, direct uptake, projectile bombardment, and by encapsulation of the vector in a liposome. Examples of suitable methods of transfecting or transforming cells include calcium phosphate precipitation, electroporation, microinjection, infection, lipofection and direct uptake. Such methods are described in more detail, for example, in Green, et al., Molecular Cloning: A Laboratory Manual, Fourth Edition (Cold Spring Harbor University Press, New York 2014); and Ausubel, et al., Current Protocols in Molecular Biology (John Wiley & Sons, New York 2015), the disclosures of each of which are incorporated herein by reference.
[0187] OTOF can also be introduced into a mammalian cell by targeting vectors containing portions of a gene encoding an OTOF protein to cell membrane phospholipids. For example, vectors can be targeted to the phospholipids on the extracellular surface of the cell membrane by linking the vector molecule to a VSV-G protein, a viral protein with affinity for all cell membrane phospholipids. Such a construct can be produced using methods well known to those of skill in the field.
[0188] Recognition and binding of the polynucleotide encoding an OTOF protein by mammalian RNA polymerase is important for gene expression. As such, one may include sequence elements within the polynucleotide that exhibit a high affinity for transcription factors that recruit RNA polymerase and promote the assembly of the transcription complex at the transcription initiation site. Such sequence elements include, e.g., a mammalian promoter, the sequence of which can be recognized and bound by specific transcription initiation factors and ultimately RNA polymerase.
[0189] Polynucleotides suitable for use in the compositions and methods described herein also include those that encode an OTOF protein downstream of a mammalian promoter (e.g., a polynucleotide that encodes an N-terminal portion of an OTOF protein downstream of a mammalian promoter). Promoters that are useful for the expression of an OTOF protein in mammalian cells include constitutive promoters, cochlear hair cell-specific promoters, and inner hair cell-specific promoters. Constitutive promoters include the CAG promoter, the cytomegalovirus (CMV) promoter, the smCBA promoter, the EF1α promoter, and the PGK promoter. Cochlear hair cell-specific promoters include the Myosin 15 (Myo15) promoter, the Myosin 7A (Myo7A) promoter, the Myosin 6 (Myo6) promoter, the POU4F3 promoter, the Atonal BHLH Transcription Factor 1 (ATOH1) promoter, the LIM Homeobox 3 (LHX3) promoter, the α9 acetylcholine receptor (α9AChR) promoter, and the a10 acetylcholine receptor (α10AChR) promoter. Inner hair cell-specific promoters include the FGF8 promoter, the VGLUT3 promoter, and the OTOF promoter. Alternatively, promoters derived from viral genomes can also be used for the stable expression of these agents in mammalian cells. Examples of functional viral promoters that can be used to promote mammalian expression of these agents include adenovirus late promoter, vaccinia virus 7.5K promoter, SV40 promoter, tk promoter of HSV, mouse mammary tumor virus (MMTV) promoter, LTR promoter of HIV, promoter of moloney virus, Epstein barr virus (EBV) promoter, and the Rous sarcoma virus (RSV) promoter.
[0190] Murine Myosin 15 Promoters
[0191] In some embodiments, the Myo15 promoter for use in the compositions and methods described herein includes nucleic acid sequences from regions of the murine Myo15 locus that are capable of expressing a transgene specifically in hair cells, or variants thereof, such as a nucleic acid sequences that have at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to regions of the murine Myo15 locus that are capable of expressing a transgene specifically in hair cells. These regions include nucleic acid sequences immediately preceding the murine Myo15 translation start site and an upstream regulatory element that is located over 5 kb from the murine Myo15 translation start site. The Myo15 promoter for use in the compositions and methods described herein can optionally include a linker operably linking the regions of the murine Myo15 locus that are capable of expressing a transgene specifically in hair cells, or the regions of the murine Myo15 locus can be joined directly without an intervening linker.
[0192] In some embodiments, the Myo15 promoter for use in the compositions and methods described herein contains a first region (an upstream regulatory element) having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to a region containing the first non-coding exon of the murine Myo15 gene (nucleic acids from −6755 to −7209 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 24) or a functional portion or derivative thereof joined (e.g., operably linked) to a second region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence immediately preceding the murine Myo15 translation start site (nucleic acids from −1 to −1157 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 25) or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 24 may have the sequence of nucleic acids from −7166 to −7091 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 26) and/or the sequence of nucleic acids from −7077 to −6983 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 27). The first region may contain the nucleic acid sequence of SEQ ID NO: 26 fused to the nucleic acid sequence of SEQ ID NO: 27 with no intervening nucleic acids, as set forth in SEQ ID NO: 28, or the first region may contain the nucleic acid sequence of SEQ ID NO: 27 fused to the nucleic acid sequence of SEQ ID NO: 26 with no intervening nucleic acids, as set forth in SEQ ID NO: 29. Alternatively, the first region may contain the sequences of SEQ ID NO: 26 and SEQ ID NO: 27 joined by the endogenous intervening nucleic acid sequence (e.g., the first region may have or include the sequence of nucleic acids from −7166 to −6983 with respect to the murine Myo15 translation start site, as set forth in SEQ ID NO: 30 and SEQ ID NO: 50) or a nucleic acid linker. In a murine Myo15 promoter in which the first region contains both SEQ ID NO: 26 and SEQ ID NO: 27, the two sequences can be included in any order (e.g., SEQ ID NO: 26 may be joined to (e.g., precede) SEQ ID NO: 27, or SEQ ID NO: 27 may be joined to (e.g., precede) SEQ ID NO: 26). The functional portion of SEQ ID NO: 25 may have the sequence of nucleic acids from −590 to −509 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 31) and/or the sequence of nucleic acids from −266 to −161 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 32). In some embodiments, the sequence containing SEQ ID NO: 31 has the sequence of SEQ ID NO: 51. In some embodiments, the sequence containing SEQ ID NO: 32 has the sequence of SEQ ID NO: 52. The second region may contain the nucleic acid sequence of SEQ ID NO: 31 fused to the nucleic acid sequence of SEQ ID NO: 32 with no intervening nucleic acids, as set forth in SEQ ID NO: 33, or the second region may contain the nucleic acid sequence of SEQ ID NO: 32 fused to the nucleic acid sequence of SEQ ID NO: 31 with no intervening nucleic acids, as set forth in SEQ ID NO: 34. The second region may contain the nucleic acid sequence of SEQ ID NO: 51 fused to the nucleic acid sequence of SEQ ID NO: 52 with no intervening nucleic acids, as set forth in SEQ ID NO: 55, or the second region may contain the nucleic acid sequence of SEQ ID NO: 52 fused to the nucleic acid sequence of SEQ ID NO: 51 with no intervening nucleic acids. Alternatively, the second region may contain the sequences of SEQ ID NO: 31 and SEQ ID NO: 32 joined by the endogenous intervening nucleic acid sequence (e.g., the second region may have the sequence of nucleic acids from −590 to −161 with respect to the murine Myo15 translation start site, as set forth in SEQ ID NO: 35) or a nucleic acid linker. In a murine Myo15 promoter in which the second region contains both SEQ ID NO: 31 and SEQ ID NO: 32, the two sequences can be included in any order (e.g., SEQ ID NO: 31 may be joined to (e.g., precede) SEQ ID NO: 32, or SEQ ID NO: 32 may be joined to (e.g., precede) SEQ ID NO: 31).
[0193] The first region and the second region of the murine Myo15 promoter can be joined directly or can be joined by a nucleic acid linker. For example, the murine Myo15 promoter can contain the sequence of SEQ ID NO: 24 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 26-30 and 50, e.g., SEQ ID NOs 26 and 27) fused to the sequence of SEQ ID NO: 25 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 31-35, 51, 52, and 55, e.g., SEQ ID NOs 31 and 32) with no intervening nucleic acids. For example, the nucleic acid sequence of the murine Myo15 promoter that results from direct fusion of SEQ ID NO: 24 to SEQ ID NO: 25 is set forth in SEQ ID NO: 36. Alternatively, a linker can be used to join the sequence of SEQ ID NO: 24 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 26-30 and 50, e.g., SEQ ID NOs 26 and 27) to the sequence of SEQ ID NO: 25 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 31-35, 51, 52, and 55, e.g., SEQ ID NOs 31 and 32). Exemplary Myo15 promoters containing functional portions of both SEQ ID NO: 24 and SEQ ID NO: 25 are provided in SEQ ID NOs: 38, 39, 53, 54, 59, and 60.
[0194] The length of a nucleic acid linker for use in a murine Myo15 promoter described herein can be about 5 kb or less (e.g., about 5 kb, 4.5, kb, 4, kb, 3.5 kb, 3 kb, 2.5 kb, 2 kb, 1.5 kb, 1 kb, 900 bp, 800 bp, 700 bp, 600 bp, 500 bp, 450 bp, 400 bp, 350 bp, 300 bp, 250 bp, 200 bp, 150 bp, 100 bp, 90 bp, 80 bp, 70 bp, 60 bp, 50 bp, 40 bp, 30 bp, 25 bp, 20 bp, 15, bp, 10 bp, 5 bp, 4 bp, 3 bp, 2 bp, or less). Nucleic acid linkers that can be used in the murine Myo15 promoter described herein do not disrupt the ability of the murine Myo15 promoter of the invention to induce transgene expression in hair cells.
[0195] In some embodiments, the sequence of SEQ ID NO: 24 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 26-30 and 50, e.g., SEQ ID NOs 26 and 27) is joined (e.g., operably linked) to the sequence of SEQ ID NO: 25 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 31-35, 51, 52, and 55, e.g., SEQ ID NOs 31 and 32), and, in some embodiments, the order of the regions is reversed (e.g., the sequence of SEQ ID NO: 25 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 31-35, 51, 52, and 55, e.g., SEQ ID NOs 31 and 32) is joined (e.g., operably linked) to the sequence of SEQ ID NO: 24 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 26-30 and 50, e.g., SEQ ID NOs 26 and 27)). For example, the nucleic acid sequence of a murine Myo15 promoter that results from direct fusion of SEQ ID NO: 25 to SEQ ID NO: 24 is set forth in SEQ ID NO: 37. An example of a murine Myo15 promoter in which a functional portion or derivative of SEQ ID NO: 25 precedes a functional portion or derivative of SEQ ID NO: 24 is provided in SEQ ID NO: 58. Regardless of order, the sequence of SEQ ID NO: 24 or a functional portion or derivative thereof and the sequence of SEQ ID NO: 25 or a functional portion or derivative thereof can be joined by direct fusion or a nucleic acid linker, as described above.
[0196] In some embodiments, the murine Myo15 promoter for use in the compositions and methods described herein contains a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to a region containing the first non-coding exon of the murine Myo15 gene (nucleic acids from −6755 to −7209 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 24) or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 24 may have the sequence of nucleic acids from −7166 to −7091 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 26) and/or the sequence of nucleic acids from −7077 to −6983 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 27). The murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 26 fused to the nucleic acid sequence of SEQ ID NO: 27 with no intervening nucleic acids, as set forth in SEQ ID NO: 28, or the murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 27 fused to the nucleic acid sequence of SEQ ID NO: 26 with no intervening nucleic acids, as set forth in SEQ ID NO: 29. Alternatively, the murine Myo15 promoter may contain the sequences of SEQ ID NO: 26 and SEQ ID NO: 27 joined by the endogenous intervening nucleic acid sequence (e.g., the first region may have or include the sequence of nucleic acids from −7166 to −6983 with respect to the murine Myo15 translation start site, as set forth in SEQ ID NO: 30 and SEQ ID NO: 50) or a nucleic acid linker. In a murine Myo15 promoter that contains both SEQ ID NO: 26 and SEQ ID NO: 27, the two sequences can be included in any order (e.g., SEQ ID NO: 26 may be joined to (e.g., precede) SEQ ID NO: 27, or SEQ ID NO: 27 may be joined to (e.g., precede) SEQ ID NO: 26).
[0197] In some embodiments, the murine Myo15 promoter for use in the compositions and methods described herein contains a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence immediately upstream of the murine Myo15 translation start site (nucleic acids from −1 to −1157 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 25) or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 25 may have the sequence of nucleic acids from −590 to −509 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 31) and/or the sequence of nucleic acids from −266 to −161 with respect to the murine Myo15 translation start site (set forth in SEQ ID NO: 32). In some embodiments, the sequence containing SEQ ID NO: 31 has the sequence of SEQ ID NO: 51. In some embodiments, the sequence containing SEQ ID NO: 32 has the sequence of SEQ ID NO: 52. The murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 31 fused to the nucleic acid sequence of SEQ ID NO: 32 with no intervening nucleic acids, as set forth in SEQ ID NO: 33, or the murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 32 fused to the nucleic acid sequence of SEQ ID NO: 31 with no intervening nucleic acids, as set forth in SEQ ID NO: 34. The murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 51 fused to the nucleic acid sequence of SEQ ID NO: 52 with no intervening nucleic acids, as set forth in SEQ ID NO: 55, or the murine Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 52 fused to the nucleic acid sequence of SEQ ID NO: 51 with no intervening nucleic acids. Alternatively, the murine Myo15 promoter may contain the sequences of SEQ ID NO: 31 and SEQ ID NO: 32 joined by the endogenous intervening nucleic acid sequence (e.g., the second region may have the sequence of nucleic acids from −590 to −161 with respect to the murine Myo15 translation start site, as set forth in SEQ ID NO: 35) or a nucleic acid linker. In a murine Myo15 promoter that contains both SEQ ID NO: 31 and SEQ ID NO: 32, the two sequences can be included in any order (e.g., SEQ ID NO: 31 may be joined to (e.g., precede) SEQ ID NO: 32, or SEQ ID NO: 32 may be joined to (e.g., precede) SEQ ID NO: 31).
[0198] In some embodiments, the murine Myo15 promoter for use in the compositions and methods described herein contains a functional portion or derivative of a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to a region containing the first non-coding exon of the Myo15 gene (nucleic acids from −6755 to −7209 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 24) flanked on both sides by a functional portion or derivative of a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence immediately upstream of the murine Myo15 translation start site (nucleic acids from −1 to −1157 with respect to the murine Myo15 translation start site, the sequence of which is set forth in SEQ ID NO: 25). For example, a functional portion or derivative of SEQ ID NO: 25, such as SEQ ID NO: 31 or 51 may be directly fused or joined by a nucleic acid linker to a portion of SEQ ID NO: 24, such as any one of SEQ ID NOs: 26-30 and 50, which is directly fused or joined by a nucleic acid linker to a different functional portion of SEQ ID NO: 25, such as SEQ ID NO: 32 or 52. In other embodiments, a functional portion or derivative of SEQ ID NO: 25, such as SEQ ID NO: 32 or 52 may be directly fused or joined by a nucleic acid linker to a portion of SEQ ID NO: 24, such as any one of SEQ ID NOs: 26-30 and 50, which is directly fused or joined by a nucleic acid linker to a different functional portion of SEQ ID NO: 25, such as SEQ ID NO: 31 or 51. For example, polynucleotides having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence of SEQ ID NOs: 51, 50, and 52 can be fused to produce a polynucleotide having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence of SEQ ID NO: 56. In some embodiments, polynucleotides having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence of SEQ ID NOs: 52, 50, and 51 can be fused to produce a polynucleotide having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequence of SEQ ID NO: 57.
[0199] Human Myosin 15 Promoters
[0200] In some embodiments, the Myo15 promoter for use in the compositions and methods described herein includes nucleic acid sequences from regions of the human Myo15 locus that are capable of expressing a transgene specifically in hair cells, or variants thereof, such as a nucleic acid sequences that have at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to regions of the human Myo15 locus that are capable of expressing a transgene specifically in hair cells. The Myo15 promoter for use in the compositions and methods described herein can optionally include a linker operably linking the regions of the human Myo15 locus that are capable of expressing a transgene specifically in hair cells, or the regions of the human Myo15 locus can be joined directly without an intervening linker.
[0201] In some embodiments, the Myo15 promoter for use in the compositions and methods described herein contains a first region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the sequence set forth in SEQ ID NO: 40 or a functional portion or derivative thereof joined (e.g., operably linked) to a second region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the sequence set forth in SEQ ID NO: 41 or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 40 may have the sequence set forth in SEQ ID NO: 42. The functional portion of SEQ ID NO: 41 may have the sequence set forth in SEQ ID NO: 43 and/or the sequence set forth in SEQ ID NO: 44. The second region may contain the nucleic acid sequence of SEQ ID NO: 43 fused to the nucleic acid sequence of SEQ ID NO: 44 with no intervening nucleic acids, as set forth in SEQ ID NO: 45, or the second region may contain the nucleic acid sequence of SEQ ID NO: 44 fused to the nucleic acid sequence of SEQ ID NO: 43 with no intervening nucleic acids, as set forth in SEQ ID NO: 46. Alternatively, the second region may contain the sequences of SEQ ID NO: 43 and SEQ ID NO: 44 joined by the endogenous intervening nucleic acid sequence (as set forth in SEQ ID NO: 47) or a nucleic acid linker. In a human Myo15 promoter in which the second region contains both SEQ ID NO: 43 and SEQ ID NO: 44, the two sequences can be included in any order (e.g., SEQ ID NO: 43 may be joined to (e.g., precede) SEQ ID NO: 44, or SEQ ID NO: 44 may be joined to (e.g., precede) SEQ ID NO: 43).
[0202] The first region and the second region of the human Myo15 promoter can be joined directly or can be joined by a nucleic acid linker. For example, the human Myo15 promoter can contain the sequence of SEQ ID NO: 40 or a functional portion or derivative thereof (e.g., SEQ ID NO: 42) fused to the sequence of SEQ ID NO: 41 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 43-47, e.g., SEQ ID NOs: 43 and/or 44) with no intervening nucleic acids. Alternatively, a linker can be used to join the sequence of SEQ ID NO: 40 or a functional portion or derivative thereof (e.g., SEQ ID NO: 42) to the sequence of SEQ ID NO: 41 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 43-47, e.g., SEQ ID NOs: 43 and/or 44). Exemplary human Myo15 promoters containing functional portions of both SEQ ID NO: 40 and SEQ ID NO: 41 are provided in SEQ ID NOs: 48 and 49.
[0203] In some embodiments, the sequence of SEQ ID NO: 40 or a functional portion or derivative thereof (e.g., SEQ ID NO: 42) is joined (e.g., operably linked) to the sequence of SEQ ID NO: 41 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 43-47, e.g., SEQ ID NOs: 43 and 44), and, in some embodiments, the order of the regions is reversed (e.g., the sequence of SEQ ID NO: 41 or a functional portion or derivative thereof (e.g., any one or more of SEQ ID NOs: 43-47, e.g., SEQ ID NOs: 43 and/or 44) is joined (e.g., operably linked) to the sequence of SEQ ID NO: 40 or a functional portion or derivative thereof (e.g., SEQ ID NO: 42). Regardless of order, the sequence of SEQ ID NO: 40 or a functional portion or derivative thereof and the sequence of SEQ ID NO: 41 or a functional portion or derivative thereof can be joined by direct fusion or a nucleic acid linker, as described above.
[0204] In some embodiments, the human Myo15 promoter for use in the compositions and methods described herein contains a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to a region containing the sequence set forth in SEQ ID NO: 40 or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 40 may have the sequence of nucleic acids set forth in SEQ ID NO: 42.
[0205] In some embodiments, the human Myo15 promoter for use in the compositions and methods described herein contains a region having at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the sequence set forth in SEQ ID NO: 41 or a functional portion or derivative thereof. The functional portion of SEQ ID NO: 41 may have the sequence set forth in SEQ ID NO: 43 and/or the sequence set forth in SEQ ID NO: 44. The human Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 43 fused to the nucleic acid sequence of SEQ ID NO: 44 with no intervening nucleic acids, as set forth in SEQ ID NO: 45, or the human Myo15 promoter may contain the nucleic acid sequence of SEQ ID NO: 44 fused to the nucleic acid sequence of SEQ ID NO: 43 with no intervening nucleic acids, as set forth in SEQ ID NO: 46. Alternatively, the human Myo15 promoter may contain the sequences of SEQ ID NO: 43 and SEQ ID NO: 44 joined by the endogenous intervening nucleic acid sequence (e.g., as set forth in SEQ ID NO: 47) or a nucleic acid linker. In a human Myo15 promoter that contains both SEQ ID NO: 43 and SEQ ID NO: 44, the two sequences can be included in any order (e.g., SEQ ID NO: 43 may be joined to (e.g., precede) SEQ ID NO: 44, or SEQ ID NO: 44 may be joined to (e.g., precede) SEQ ID NO: 43).
[0206] The length of a nucleic acid linker for use in a human Myo15 promoter described herein can be about 5 kb or less (e.g., about 5 kb, 4.5, kb, 4, kb, 3.5 kb, 3 kb, 2.5 kb, 2 kb, 1.5 kb, 1 kb, 900 bp, 800 bp, 700 bp, 600 bp, 500 bp, 450 bp, 400 bp, 350 bp, 300 bp, 250 bp, 200 bp, 150 bp, 100 bp, 90 bp, 80 bp, 70 bp, 60 bp, 50 bp, 40 bp, 30 bp, 25 bp, 20 bp, 15, bp, 10 bp, 5 bp, 4 bp, 3 bp, 2 bp, or less). Nucleic acid linkers that can be used in the human Myo15 promoters described herein do not disrupt the ability of the human Myo15 promoter of the invention to induce transgene expression in hair cells.
[0207] The foregoing Myo15 promoter sequences are summarized in Table 3, below.
TABLE-US-00003 TABLE 3 Exemplary nucleotide sequences for use in the Myo15 promoter described herein SEQ ID Description of nucleic NO. acid sequence Nucleic Acid Sequence 24 Region containing non- CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT coding exon 1 of Myo15 TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA (−6755 to −7209) TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTAAGCTTCTGC CACTGGCTCCGGCATTGCAGAGAGAAGAGAAGGGGCGGCA GAGCTGAACCTTAGCCTTGCCTTCCTGGGTACCCTTCTGAGC CTCACTGTCTTCTGTGAGATGGGCAAAGTGCGGGTGTGACTC CTTGGCAACGGTGTTACACCAGGGCAGGTAAAGTTGTAGTTA TTTGTGGGGTACACCAGGACTGTTAAAGGTGTAACTAT 25 Region immediately GGTCTCACCCAGCATTTTCACTTCTAATAAGTTCAAATGTGAT preceding the ACGGCACCTTTCTAAAAATTAGTTTTCAGGGAAATAGGGTTCA translation start site AAACTGGTAGTGGTAGGGTCCATTCTCACGACCCCCAGGCCT of Myo15 (−1 to −1157) GCTAACCCTGACCAAGCTACCTATTACTTACCCTCCTCTTTCT CCTCCTCCTCTTTCTCCTTCTCCTGCTTCCCCTCTTCCTTCTC CCTCCCTTCCTCTCCCTCCTCCCCCTCCTTGGCTGTGATCAG ATCCAGAGCCTGAATGAGCCTCCTGACCCCACACCCCCACTA GCATGGGCCTGCAAGTGCCCAGAAGTCCCTCCTGCCTCCTA AACTGCCCAGCCGATCCATTAGCTCTTCCTTCTTCCCAGTGA AAGAAGCAGGCACAGCCTGTCCCTCCCGTTCTACAGAAAGG AAGCTACAGCACAGGGAGGGCCAAAGGCCTTCCTGGGACTA GACAGTTGATCAACAGCAGGACTGGAGAGCTGGGCTCCATTT TTGTTCCTTGGTGCCCTGCCCCTCCCCATGACCTGCAGAGAC ATTCAGCCTGCCAGGCTTTATGAGGTGGGAGCTGGGCTCTC CCTGATGTATTATTCAGCTCCCTGGAGTTGGCCAGCTCCTGT TACACTGGCCACAGCCCTGGGCATCCGCTTCTCACTTCTAGT TTCCCCTCCAAGGTAATGTGGTGGGTCATGATCATTCTATCCT GGCTTCAGGGACCTGACTCCACTTTGGGGCCATTCGAGGGG TCTAGGGTAGATGATGTCCCCCTGTGGGGATTAATGTCCTGC TCTGTAAAACTGAGCTAGCTGAGATCCAGGAGGGCTTGGCCA GAGACAGCAAGTTGTTGCCATGGTGACTTTAAAGCCAGGTTG CTGCCCCAGCACAGGCCTCCCAGTCTACCCTCACTAGAAAAC AACACCCAGGCACTTTCCACCACCTCTCAAAGGTGAAACCCA AGGCTGGTCTAGAGAATGAATTATGGATCCTCGCTGTCCGTG CCACCCAGCTAGTCCCAGCGGCTCAGACACTGAGGAGAGAC TGTAGGTTCAGCTACAAGCAAAAAGACCTAGCTGGTCTCCAA GCAGTGTCTCCAAGTCCCTGAACCTGTGACACCTGCCCCAG GCATCATCAGGCACAGAGGGCCACC 26 Portion of SEQ ID NO: CCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAAT 24 (−7166 to −7091) AATAGATGTCATTAAATATACATTGGGCCCCAGG 27 Portion of SEQ ID NO: AGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGACATAGGA 24 (−7077 to −6983) CCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCCACAGGA CCCAGGTAAGGG 28 Portion of SEQ ID NO: CCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAAT 24 (SEQ ID NO: 26 AATAGATGTCATTAAATATACATTGGGCCCCAGGAGCCTGAG fused to SEQ ID NO: CCTCCTTTCCATCTCTGTGGAGGCAGACATAGGACCCCCAAC 27) AAACAGCATGCAGGTTGGGAGCCAGCCACAGGACCCAGGTA AGGG 29 Portion of SEQ ID NO: AGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGACATAGGA 24 (SEQ ID NO: 27 CCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCCACAGGA fused to SEQ ID NO: CCCAGGTAAGGGCCCATGTCAGCTGCTTGTGCTTTCCAGAGA 26) CAAAACAGGAATAATAGATGTCATTAAATATACATTGGGCCCC AGG 30 Portion of SEQ ID NO: CCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAAT 24 (−7166 to −6983) AATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGG 31 Portion of SEQ ID NO: TGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCC 25 (−590 to −509) CTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCCCTG 32 Portion of SEQ ID NO: CACAGGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAG 25 (−266 to −161) GCACTTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTC TAGAGAATGAATTATGGATCCT 33 Portion of SEQ ID NO: TGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCC 25 CTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCCCTGC (SEQ ID NO: 31 fused ACAGGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGG to SEQ ID NO: 32) CACTTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCT AGAGAATGAATTATGGATCCT 34 Portion of SEQ ID NO: CACAGGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAG 25 GCACTTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTC (SEQ ID NO: 32 fused TAGAGAATGAATTATGGATCCTTGAGGTGGGAGCTGGGCTCT to SEQ ID NO: 31) CCCTGATGTATTATTCAGCTCCCTGGAGTTGGCCAGCTCCTG TTACACTGGCCACAGCCCTG 35 Portion of SEQ ID NO: TGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCC 25 CTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCCCTGG (−590 to −161) GCATCCGCTTCTCACTTCTAGTTTCCCCTCCAAGGTAATGTG GTGGGTCATGATCATTCTATCCTGGCTTCAGGGACCTGACTC CACTTTGGGGCCATTCGAGGGGTCTAGGGTAGATGATGTCC CCCTGTGGGGATTAATGTCCTGCTCTGTAAAACTGAGCTAGC TGAGATCCAGGAGGGCTTGGCCAGAGACAGCAAGTTGTTGC CATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACAGGCCT CCCAGTCTACCCTCACTAGAAAACAACACCCAGGCACTTTCC ACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGAGAATG AATTATGGATCCT 36 SEQ ID NO: 24 fused CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT to SEQ ID NO: 25 TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTAAGCTTCTGC CACTGGCTCCGGCATTGCAGAGAGAAGAGAAGGGGCGGCA GAGCTGAACCTTAGCCTTGCCTTCCTGGGTACCCTTCTGAGC CTCACTGTCTTCTGTGAGATGGGCAAAGTGCGGGTGTGACTC CTTGGCAACGGTGTTACACCAGGGCAGGTAAAGTTGTAGTTA TTTGTGGGGTACACCAGGACTGTTAAAGGTGTAACTATGGTC TCACCCAGCATTTTCACTTCTAATAAGTTCAAATGTGATACGG CACCTTTCTAAAAATTAGTTTTCAGGGAAATAGGGTTCAAAAC TGGTAGTGGTAGGGTCCATTCTCACGACCCCCAGGCCTGCT AACCCTGACCAAGCTACCTATTACTTACCCTCCTCTTTCTCCT CCTCCTCTTTCTCCTTCTCCTGCTTCCCCTCTTCCTTCTCCCT CCCTTCCTCTCCCTCCTCCCCCTCCTTGGCTGTGATCAGATC CAGAGCCTGAATGAGCCTCCTGACCCCACACCCCCACTAGC ATGGGCCTGCAAGTGCCCAGAAGTCCCTCCTGCCTCCTAAAC TGCCCAGCCGATCCATTAGCTCTTCCTTCTTCCCAGTGAAAG AAGCAGGCACAGCCTGTCCCTCCCGTTCTACAGAAAGGAAG CTACAGCACAGGGAGGGCCAAAGGCCTTCCTGGGACTAGAC AGTTGATCAACAGCAGGACTGGAGAGCTGGGCTCCATTTTTG TTCCTTGGTGCCCTGCCCCTCCCCATGACCTGCAGAGACATT CAGCCTGCCAGGCTTTATGAGGTGGGAGCTGGGCTCTCCCT GATGTATTATTCAGCTCCCTGGAGTTGGCCAGCTCCTGTTAC ACTGGCCACAGCCCTGGGCATCCGCTTCTCACTTCTAGTTTC CCCTCCAAGGTAATGTGGTGGGTCATGATCATTCTATCCTGG CTTCAGGGACCTGACTCCACTTTGGGGCCATTCGAGGGGTC TAGGGTAGATGATGTCCCCCTGTGGGGATTAATGTCCTGCTC TGTAAAACTGAGCTAGCTGAGATCCAGGAGGGCTTGGCCAG AGACAGCAAGTTGTTGCCATGGTGACTTTAAAGCCAGGTTGC TGCCCCAGCACAGGCCTCCCAGTCTACCCTCACTAGAAAACA ACACCCAGGCACTTTCCACCACCTCTCAAAGGTGAAACCCAA GGCTGGTCTAGAGAATGAATTATGGATCCTCGCTGTCCGTGC CACCCAGCTAGTCCCAGCGGCTCAGACACTGAGGAGAGACT GTAGGTTCAGCTACAAGCAAAAAGACCTAGCTGGTCTCCAAG CAGTGTCTCCAAGTCCCTGAACCTGTGACACCTGCCCCAGG CATCATCAGGCACAGAGGGCCACC 37 SEQ ID NO: 25 fused GGTCTCACCCAGCATTTTCACTTCTAATAAGTTCAAATGTGAT to SEQ ID NO: 24 ACGGCACCTTTCTAAAAATTAGTTTTCAGGGAAATAGGGTTCA AAACTGGTAGTGGTAGGGTCCATTCTCACGACCCCCAGGCCT GCTAACCCTGACCAAGCTACCTATTACTTACCCTCCTCTTTCT CCTCCTCCTCTTTCTCCTTCTCCTGCTTCCCCTCTTCCTTCTC CCTCCCTTCCTCTCCCTCCTCCCCCTCCTTGGCTGTGATCAG ATCCAGAGCCTGAATGAGCCTCCTGACCCCACACCCCCACTA GCATGGGCCTGCAAGTGCCCAGAAGTCCCTCCTGCCTCCTA AACTGCCCAGCCGATCCATTAGCTCTTCCTTCTTCCCAGTGA AAGAAGCAGGCACAGCCTGTCCCTCCCGTTCTACAGAAAGG AAGCTACAGCACAGGGAGGGCCAAAGGCCTTCCTGGGACTA GACAGTTGATCAACAGCAGGACTGGAGAGCTGGGCTCCATTT TTGTTCCTTGGTGCCCTGCCCCTCCCCATGACCTGCAGAGAC ATTCAGCCTGCCAGGCTTTATGAGGTGGGAGCTGGGCTCTC CCTGATGTATTATTCAGCTCCCTGGAGTTGGCCAGCTCCTGT TACACTGGCCACAGCCCTGGGCATCCGCTTCTCACTTCTAGT TTCCCCTCCAAGGTAATGTGGTGGGTCATGATCATTCTATCCT GGCTTCAGGGACCTGACTCCACTTTGGGGCCATTCGAGGGG TCTAGGGTAGATGATGTCCCCCTGTGGGGATTAATGTCCTGC TCTGTAAAACTGAGCTAGCTGAGATCCAGGAGGGCTTGGCCA GAGACAGCAAGTTGTTGCCATGGTGACTTTAAAGCCAGGTTG CTGCCCCAGCACAGGCCTCCCAGTCTACCCTCACTAGAAAAC AACACCCAGGCACTTTCCACCACCTCTCAAAGGTGAAACCCA AGGCTGGTCTAGAGAATGAATTATGGATCCTCGCTGTCCGTG CCACCCAGCTAGTCCCAGCGGCTCAGACACTGAGGAGAGAC TGTAGGTTCAGCTACAAGCAAAAAGACCTAGCTGGTCTCCAA GCAGTGTCTCCAAGTCCCTGAACCTGTGACACCTGCCCCAG GCATCATCAGGCACAGAGGGCCACCCTGCAGCTCAGCCTAC TACTTGCTTTCCAGGCTGTTCCTAGTTCCCATGTCAGCTGCTT GTGCTTTCCAGAGACAAAACAGGAATAATAGATGTCATTAAAT ATACATTGGGCCCCAGGCGGTCAATGTGGCAGCCTGAGCCT CCTTTCCATCTCTGTGGAGGCAGACATAGGACCCCCAACAAA CAGCATGCAGGTTGGGAGCCAGCCACAGGACCCAGGTAAGG GGCCCTGGGTCCTTAAGCTTCTGCCACTGGCTCCGGCATTG CAGAGAGAAGAGAAGGGGCGGCAGAGCTGAACCTTAGCCTT GCCTTCCTGGGTACCCTTCTGAGCCTCACTGTCTTCTGTGAG ATGGGCAAAGTGCGGGTGTGACTCCTTGGCAACGGTGTTAC ACCAGGGCAGGTAAAGTTGTAGTTATTTGTGGGGTACACCAG GACTGTTAAAGGTGTAACTAT 38 Portion of SEQ ID NO: CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT 24 that contains SEQ TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA ID NO: 26 and SEQ ID TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT NO: 27 fused to portion GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC of SEQ ID NO: 25 that ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC contains SEQ ID NO: ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTAAGCTTCTGC 31 and SEQ ID NO: 32 CACTGGCTCCGGCATTGCAGAGAGAAGAGAAGGGGCGGCA GACTGGAGAGCTGGGCTCCATTTTTGTTCCTTGGTGCCCTGC CCCTCCCCATGACCTGCAGAGACATTCAGCCTGCCAGGCTTT ATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTC CCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCCCTG GGCATCCGCTTCTCACTTCTAGTTTCCCCTCCAAGGTAATGT GGTGGGTCATGATCATTCTATCCTGGCTTCAGGGACCTGACT CCACTTTGGGGCCATTCGAGGGGTCTAGGGTAGATGATGTC CCCCTGTGGGGATTAATGTCCTGCTCTGTAAAACTGAGCTAG CTGAGATCCAGGAGGGCTTGGCCAGAGACAGCAAGTTGTTG CCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACAGGC CTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCACTTT CCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGAGAA TGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGTCCC AGCGGCTCAGACACTGAGGAGAGACTGTAGGTTCAGCTACA AGCAAAAAGACCTAGCTGGTCTCCAAGCAGTGTCTCCAAGTC CCTGAACCTGTGACACCTGCCCCAGGCATCATCAGGCACAG AGGGCCACC 39 Portion of SEQ ID NO: CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT 24 that contains SEQ TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA ID NO: 26 and SEQ ID TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT NO: 27 fused to portion GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC of SEQ ID NO: 25 that ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC contains SEQ ID NO: ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTTTTATGAGGT 31 and SEQ ID NO: 32 GGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCCCTGGAG TTGGCCAGCTCCTGTTACACTGGCCACAGCCCTGGGCATCC GCTGCCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACA GGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCAC TTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGA GAATGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGT CCCAGCGGCTCAGACACTG 40 Region 1 of the human GTATGCCTTTTGAGATGGATGCAGCAGGTTCTGTGAGGCTGC Myo15 promoter CAGGAGGGGTAGAGTTCCCGGGGGCCTCGGGCCCCGCTGG AGTGTGGAGCAGGCCCATGCTCAGCTCTCCAGGCTGTTCGT GGCTCCCCTGTCAGCTGCTCACTCCTTTCCAGAGACAAAACA GGAATAATAGACATCATTAAATATACATAGGGCCCCAGGCGG TCGGCGTGGTGGGCTGGGCCTCCCTTCC 41 Region 2 of human TGCCCTGCCTTCTGAGCCGGCAGCCTGGCTCCCCACCCCAT Myo15 promoter GTATTATTCAGCTCCTGAGAGCCAGCCAGCTCCTGTTACACT GACCGCAGCCCAGCACCTGCTCTGCCCATTCCCCTCCTCCC TTGCCTAGGACCTAGAGGGTTCAAAGTTCTCCTCCAAGATGA CTTGGTGGGCTTTGGCCATCCCACCCTAGGCCCCACTTCTG GCCCAGTGCAGGTGTGCTGGTGATTTAGGGCAGGTGGCATT CCATCTCTGTGGCTCAATGTCTTCCTCTGTGAAGCCGAAGTG ACCCAAGGGCTCCCTTCATGGGGTTGAGCCAGCTGTGGCCC AGGGAGGGCCTAACCAGGATGAGCACTGATGTTGCCATGAC GACTCCGAGGCCAGAATGTCTCCCCCAGCACAGGCCTCATA GGCAGGCTTCCCCATCCTGGTAAACAACACCCACACACTTTC TACTACTGCTCTAGGGTGAAACCCAAGGCGCTCTAGAGGAGA TGAATTATGGATCCGCCCTCCCGGAATCCTGGCTCGGCCCTC CCCACGCCACCCAGGGCCAGTCGGGTCTGCTCACAGCCCGA GGAGGCCGCGTGTCCAGCCGCGGGCAAGAGACAGAGCAGG TCCCTGTGTCTCCAAGTCCCTGAGCCCGTGACACCGGCCCC AGGCCCTGTAGAGAGCAGGCAGCCACC 42 Portion of SEQ ID NO: CCCCTGTCAGCTGCTCACTCCTTTCCAGAGACAAAACAGGAA 40 TAATAGACATCATTAAATATACATAGGGCCCCAGG 43 Portion of SEQ ID NO: TGAGCCGGCAGCCTGGCTCCCCACCCCATGTATTATTCAGCT 41 CCTGAGAGCCAGCCAGCTCCTGTTACACTGACCGCAGCCC 44 Portion of SEQ ID NO: CACAGGCCTCATAGGCAGGCTTCCCCATCCTGGTAAACAACA 41 CCCACACACTTTCTACTACTGCTCTAGGGTGAAACCCAAGGC GCTCTAGAGGAGATGAATTATGGATCC 45 Portion of SEQ ID NO: TGAGCCGGCAGCCTGGCTCCCCACCCCATGTATTATTCAGCT 41 CCTGAGAGCCAGCCAGCTCCTGTTACACTGACCGCAGCCCC (SEQ ID NO: 43 fused ACAGGCCTCATAGGCAGGCTTCCCCATCCTGGTAAACAACAC to SEQ ID NO: 44) CCACACACTTTCTACTACTGCTCTAGGGTGAAACCCAAGGCG CTCTAGAGGAGATGAATTATGGATCC 46 Portion of SEQ ID NO: CACAGGCCTCATAGGCAGGCTTCCCCATCCTGGTAAACAACA 41 CCCACACACTTTCTACTACTGCTCTAGGGTGAAACCCAAGGC (SEQ ID NO: 44 fused GCTCTAGAGGAGATGAATTATGGATCCTGAGCCGGCAGCCT to SEQ ID NO: 43) GGCTCCCCACCCCATGTATTATTCAGCTCCTGAGAGCCAGCC AGCTCCTGTTACACTGACCGCAGCCC 47 Portion of SEQ ID NO: TGAGCCGGCAGCCTGGCTCCCCACCCCATGTATTATTCAGCT 41 (contiguous CCTGAGAGCCAGCCAGCTCCTGTTACACTGACCGCAGCCCA sequence including GCACCTGCTCTGCCCATTCCCCTCCTCCCTTGCCTAGGACCT SEQ ID NO: 43 and AGAGGGTTCAAAGTTCTCCTCCAAGATGACTTGGTGGGCTTT SEQ ID NO: 44) GGCCATCCCACCCTAGGCCCCACTTCTGGCCCAGTGCAGGT GTGCTGGTGATTTAGGGCAGGTGGCATTCCATCTCTGTGGCT CAATGTCTTCCTCTGTGAAGCCGAAGTGACCCAAGGGCTCCC TTCATGGGGTTGAGCCAGCTGTGGCCCAGGGAGGGCCTAAC CAGGATGAGCACTGATGTTGCCATGACGACTCCGAGGCCAG AATGTCTCCCCCAGCACAGGCCTCATAGGCAGGCTTCCCCAT CCTGGTAAACAACACCCACACACTTTCTACTACTGCTCTAGG GTGAAACCCAAGGCGCTCTAGAGGAGATGAATTATGGATCC 48 Polynucleotide GTATGCCTTTTGAGATGGATGCAGCAGGTTCTGTGAGGCTGC containing SEQ ID NO: CAGGAGGGGTAGAGTTCCCGGGGGCCTCGGGCCCCGCTGG 40 and SEQ ID NO: 41 AGTGTGGAGCAGGCCCATGCTCAGCTCTCCAGGCTGTTCGT GGCTCCCCTGTCAGCTGCTCACTCCTTTCCAGAGACAAAACA GGAATAATAGACATCATTAAATATACATAGGGCCCCAGGCGG TCGGCGTGGTGGGCTGGGCCTCCCTTCCCCATAACACTGAG CTGCTCTGCTGGGCCAACCGTGCTCCTGGGCCAGCCAGAGG ACCCCCATGAGGCGGCATGCAGGCGGGGAGCAGGCCACAG AACGCAGGTAAGGAGACCTTAGCCTAGAGTCCTTGGGGTCT GTCACTGGCCACCCTCGCATCCCAGGCTGCAGGAAACTGAG GCCCAGAGAGGACAAGGACTTTCCTGGACCCACACAGCCAG TCAGTGACAGAGCCTAGGGTCTGAGCCAGGCCTGACCCAAC CTCCATTTCTGCCTCTCTACCCCTGCCCCCGCCCCAACACAC ACACACACACAAGTGGAGTTCCACTGAAACGCCCCTCCTTGC CCTGCCTTCTGAGCCGGCAGCCTGGCTCCCCACCCCATGTA TTATTCAGCTCCTGAGAGCCAGCCAGCTCCTGTTACACTGAC CGCAGCCCAGCACCTGCTCTGCCCATTCCCCTCCTCCCTTGC CTAGGACCTAGAGGGTTCAAAGTTCTCCTCCAAGATGACTTG GTGGGCTTTGGCCATCCCACCCTAGGCCCCACTTCTGGCCC AGTGCAGGTGTGCTGGTGATTTAGGGCAGGTGGCATTCCAT CTCTGTGGCTCAATGTCTTCCTCTGTGAAGCCGAAGTGACCC AAGGGCTCCCTTCATGGGGTTGAGCCAGCTGTGGCCCAGGG AGGGCCTAACCAGGATGAGCACTGATGTTGCCATGACGACT CCGAGGCCAGAATGTCTCCCCCAGCACAGGCCTCATAGGCA GGCTTCCCCATCCTGGTAAACAACACCCACACACTTTCTACT ACTGCTCTAGGGTGAAACCCAAGGCGCTCTAGAGGAGATGA ATTATGGATCCGCCCTCCCGGAATCCTGGCTCGGCCCTCCC CACGCCACCCAGGGCCAGTCGGGTCTGCTCACAGCCCGAG GAGGCCGCGTGTCCAGCCGCGGGCAAGAGACAGAGCAGGT CCCTGTGTCTCCAAGTCCCTGAGCCCGTGACACCGGCCCCA GGCCCTGTAGAGAGCAGGCAGCCACC 49 Polynucleotide GCAGGCCCATGCTCAGCTCTCCAGGCTGTTCGTGGCTCCCC containing SEQ ID NO: TGTCAGCTGCTCACTCCTTTCCAGAGACAAAACAGGAATAAT 42, SEQ ID NO: 43, AGACATCATTAAATATACATAGGGCCCCAGGCGGTCGGCGTG and SEQ ID NO: 44 GTGGGCTGGGCCTCCCTTCCCCATAACACTGAGCTGCTCTG CTGGGCCAACCGTGCTCCTGGGCCAGCCAGAGGACCCCCAT GAGGCGGCATGCAGGCGGGGAGCAGGCCACAGAACGCAGG TAAGGAGACCTTGCCTTCTGAGCCGGCAGCCTGGCTCCCCA CCCCATGTATTATTCAGCTCCTGAGAGCCAGCCAGCTCCTGT TACACTGACCGCAGCCCAGCACCTGCTCTGCCCATTCCCCTC CTCCCTTGCCTAGGACCTAGAGGGTTCAAAGTTCTCCTCCAA GATGACTTGGTGGGCTTTGGCCATCGGGCCTAACCAGGATG AGCACTGATGTTGCCATGACGACTCCGAGGCCAGAATGTCTC CCCCAGCACAGGCCTCATAGGCAGGCTTCCCCATCCTGGTA AACAACACCCACACACTTTCTACTACTGCTCTAGGGTGAAAC CCAAGGCGCTCTAGAGGAGATGAATTATGGATCCGCCCTCC CGGAATCCTGGCTCGGCCCTCCCCACGC 50 Portion of SEQ ID NO: CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT 24 that contains SEQ TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA ID NO: 26 and SEQ ID TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT NO: 27 GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGGGCCCTGGGTCCTT 51 Portion of SEQ ID NO: TTTATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAG 25 that contains SEQ CTCCCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCC ID NO: 31 CTGGGCATCCGC 52 Portion of SEQ ID NO: TGCCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACAG 25 that contains SEQ GCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCACT ID NO: 32 TTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGAG AATGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGTC CCAGCGGCTCAGACACTG 53 SEQ ID NO: 50 fused CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT to SEQ ID NO: 51 TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTTTTATGAGGT GGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCCCTGGAG TTGGCCAGCTCCTGTTACACTGGCCACAGCCCTGGGCATCC GC 54 SEQ ID NO: 50 fused CTGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGT to SEQ ID NO: 52 TCCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAA TAATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTTGCCATGGTG ACTTTAAAGCCAGGTTGCTGCCCCAGCACAGGCCTCCCAGTC TACCCTCACTAGAAAACAACACCCAGGCACTTTCCACCACCT CTCAAAGGTGAAACCCAAGGCTGGTCTAGAGAATGAATTATG GATCCTCGCTGTCCGTGCCACCCAGCTAGTCCCAGCGGCTC AGACACTG 55 SEQ ID NO: 51 fused TTTATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAG to SEQ ID NO: 52 CTCCCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCC CTGGGCATCCGCTGCCATGGTGACTTTAAAGCCAGGTTGCTG CCCCAGCACAGGCCTCCCAGTCTACCCTCACTAGAAAACAAC ACCCAGGCACTTTCCACCACCTCTCAAAGGTGAAACCCAAGG CTGGTCTAGAGAATGAATTATGGATCCTCGCTGTCCGTGCCA CCCAGCTAGTCCCAGCGGCTCAGACACTG 56 SEQ ID NO: 51 fused TTTATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAG to SEQ ID NO: 50, CTCCCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCC which is fused to SEQ CTGGGCATCCGCCTGCAGCTCAGCCTACTACTTGCTTTCCAG ID NO: 52 GCTGTTCCTAGTTCCCATGTCAGCTGCTTGTGCTTTCCAGAG ACAAAACAGGAATAATAGATGTCATTAAATATACATTGGGCCC CAGGCGGTCAATGTGGCAGCCTGAGCCTCCTTTCCATCTCTG TGGAGGCAGACATAGGACCCCCAACAAACAGCATGCAGGTT GGGAGCCAGCCACAGGACCCAGGTAAGGGGCCCTGGGTCC TTTGCCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACA GGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCAC TTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGA GAATGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGT CCCAGCGGCTCAGACACTG 57 SEQ ID NO: 52 fused TGCCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACAG to SEQ ID NO: 50, GCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCACT which is fused to SEQ TTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGAG ID NO: 51 AATGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGTC CCAGCGGCTCAGACACTGCTGCAGCTCAGCCTACTACTTGCT TTCCAGGCTGTTCCTAGTTCCCATGTCAGCTGCTTGTGCTTTC CAGAGACAAAACAGGAATAATAGATGTCATTAAATATACATTG GGCCCCAGGCGGTCAATGTGGCAGCCTGAGCCTCCTTTCCA TCTCTGTGGAGGCAGACATAGGACCCCCAACAAACAGCATG CAGGTTGGGAGCCAGCCACAGGACCCAGGTAAGGGGCCCT GGGTCCTTTTTATGAGGTGGGAGCTGGGCTCTCCCTGATGTA TTATTCAGCTCCCTGGAGTTGGCCAGCTCCTGTTACACTGGC CACAGCCCTGGGCATCCGC 58 SEQ ID NO: 51 fused TTTATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAG to SEQ ID NO: 52, CTCCCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCC which is fused to SEQ CTGGGCATCCGCTGCCATGGTGACTTTAAAGCCAGGTTGCTG ID NO: 50 CCCCAGCACAGGCCTCCCAGTCTACCCTCACTAGAAAACAAC ACCCAGGCACTTTCCACCACCTCTCAAAGGTGAAACCCAAGG CTGGTCTAGAGAATGAATTATGGATCCTCGCTGTCCGTGCCA CCCAGCTAGTCCCAGCGGCTCAGACACTGCTGCAGCTCAGC CTACTACTTGCTTTCCAGGCTGTTCCTAGTTCCCATGTCAGCT GCTTGTGCTTTCCAGAGACAAAACAGGAATAATAGATGTCATT AAATATACATTGGGCCCCAGGCGGTCAATGTGGCAGCCTGA GCCTCCTTTCCATCTCTGTGGAGGCAGACATAGGACCCCCAA CAAACAGCATGCAGGTTGGGAGCCAGCCACAGGACCCAGGT AAGGGGCCCTGGGTCCTT 59 Portion of SEQ ID NO: TGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGTT 24 that contains SEQ CCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAAT ID NO: 26 and SEQ ID AATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT NO: 27 fused to portion GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC of SEQ ID NO: 25 that ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC contains SEQ ID NO: ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTAAGCTTCTGC 31 and SEQ ID NO: 32 CACTGGCTCCGGCATTGCAGAGAGAAGAGAAGGGGCGGCA GACTGGAGAGCTGGGCTCCATTTTTGTTCCTTGGTGCCCTGC CCCTCCCCATGACCTGCAGAGACATTCAGCCTGCCAGGCTTT ATGAGGTGGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTC CCTGGAGTTGGCCAGCTCCTGTTACACTGGCCACAGCCCTG GGCATCCGCTTCTCACTTCTAGTTTCCCCTCCAAGGTAATGT GGTGGGTCATGATCATTCTATCCTGGCTTCAGGGACCTGACT CCACTTTGGGGCCATTCGAGGGGTCTAGGGTAGATGATGTC CCCCTGTGGGGATTAATGTCCTGCTCTGTAAAACTGAGCTAG CTGAGATCCAGGAGGGCTTGGCCAGAGACAGCAAGTTGTTG CCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACAGGC CTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCACTTT CCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGAGAA TGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGTCCC AGCGGCTCAGACACTGAGGAGAGACTGTAGGTTCAGCTACA AGCAAAAAGACCTAGCTGGTCTCCAAGCAGTGTCTCCAAGTC CCTGAACCTGTGACACCTGCCCCAGGCATCATCAGGCACAG AGGGCCACC 60 Portion of SEQ ID NO: TGCAGCTCAGCCTACTACTTGCTTTCCAGGCTGTTCCTAGTT 24 that contains SEQ CCCATGTCAGCTGCTTGTGCTTTCCAGAGACAAAACAGGAAT ID NO: 26 and SEQ ID AATAGATGTCATTAAATATACATTGGGCCCCAGGCGGTCAAT NO: 27 fused to portion GTGGCAGCCTGAGCCTCCTTTCCATCTCTGTGGAGGCAGAC of SEQ ID NO: 25 that ATAGGACCCCCAACAAACAGCATGCAGGTTGGGAGCCAGCC contains SEQ ID NO: ACAGGACCCAGGTAAGGGGCCCTGGGTCCTTTTTATGAGGT 31 and SEQ ID NO: 32 GGGAGCTGGGCTCTCCCTGATGTATTATTCAGCTCCCTGGAG TTGGCCAGCTCCTGTTACACTGGCCACAGCCCTGGGCATCC GCTGCCATGGTGACTTTAAAGCCAGGTTGCTGCCCCAGCACA GGCCTCCCAGTCTACCCTCACTAGAAAACAACACCCAGGCAC TTTCCACCACCTCTCAAAGGTGAAACCCAAGGCTGGTCTAGA GAATGAATTATGGATCCTCGCTGTCCGTGCCACCCAGCTAGT CCCAGCGGCTCAGACACTG
[0208] Additional Myo15 promoters useful in conjunction with the compositions and methods described herein include nucleic acid molecules that have at least 85% sequence identity (e.g., 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, sequence identity) to the nucleic acid sequences set forth in Table 3, as well as functional portions or derivatives of the nucleic acid sequences set forth in Table 3. The Myo15 promoters listed in Table 3 are characterized in U.S. Provisional Application Nos. 62/663,679, 62/802,874, 62/928,311, and 62/965,773 and International Application No. PCT/US2019/029336, which are incorporated herein by reference.
[0209] Once a polynucleotide encoding OTOF has been incorporated into the nuclear DNA of a mammalian cell or stabilized in an episomal monomer or concatemer, the transcription of this polynucleotide can be induced by methods known in the art. For example, expression can be induced by exposing the mammalian cell to an external chemical reagent, such as an agent that modulates the binding of a transcription factor and/or RNA polymerase to the mammalian promoter and thus regulates gene expression. The chemical reagent can serve to facilitate the binding of RNA polymerase and/or transcription factors to the mammalian promoter, e.g., by removing a repressor protein that has bound the promoter. Alternatively, the chemical reagent can serve to enhance the affinity of the mammalian promoter for RNA polymerase and/or transcription factors such that the rate of transcription of the gene located downstream of the promoter is increased in the presence of the chemical reagent. Examples of chemical reagents that potentiate polynucleotide transcription by the above mechanisms include tetracycline and doxycycline. These reagents are commercially available (Life Technologies, Carlsbad, Calif.) and can be administered to a mammalian cell in order to promote gene expression according to established protocols.
[0210] Other DNA sequence elements that may be included in the nucleic acid vectors for use in the compositions and methods described herein include enhancer sequences. Enhancers represent another class of regulatory elements that induce a conformational change in the polynucleotide containing the gene of interest such that the DNA adopts a three-dimensional orientation that is favorable for binding of transcription factors and RNA polymerase at the transcription initiation site. Thus, polynucleotides for use in the compositions and methods described herein include those that encode an OTOF protein and additionally include a mammalian enhancer sequence. Many enhancer sequences are now known from mammalian genes, and examples include enhancers from the genes that encode mammalian globin, elastase, albumin, α-fetoprotein, and insulin. Enhancers for use in the compositions and methods described herein also include those that are derived from the genetic material of a virus capable of infecting a eukaryotic cell. Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. Additional enhancer sequences that induce activation of eukaryotic gene transcription are disclosed in Yaniv, et al., Nature 297:17 (1982). An enhancer may be spliced into a vector containing a polynucleotide encoding an OTOF protein, for example, at a position 5′ or 3′ to this gene. In a preferred orientation, the enhancer is positioned at the 5′ side of the promoter, which in turn is located 5′ relative to the polynucleotide encoding an OTOF protein.
[0211] The nucleic acid vectors described herein may include a Woodchuck Posttranscriptional Regulatory Element (WPRE). The WPRE acts at the mRNA level, by promoting nuclear export of transcripts and/or by increasing the efficiency of polyadenylation of the nascent transcript, thus increasing the total amount of mRNA in the cell. The addition of the WPRE to a vector can result in a substantial improvement in the level of transgene expression from several different promoters, both in vitro and in vivo. The WPRE can be located in the second nucleic acid vector between the polynucleotide encoding a C-terminal portion of an OTOF protein and the poly(A) sequence. In the compositions and methods described herein, the WPRE can have the sequence:
TABLE-US-00004 (SEQ ID NO: 23) GATCCAATCAACCTCTGGATTACAAAATTTGTGAAAGATTGACTGGTATTC TTAACTATGTTGCTCCTTTTACGCTATGTGGATACGCTGCTTTAATGCCTT TGTATCATGCTATTGCTTCCCGTATGGCTTTCATTTTCTCCTCCTTGTATA AATCCTGGTTGCTGTCTCTTTATGAGGAGTTGTGGCCCGTTGTCAGGCAAC GTGGCGTGGTGTGCACTGTGTTTGCTGACGCAACCCCCACTGGTTGGGGCA TTGCCACCACCTGTCAGCTCCTTTCCGGGACTTTCGCTTTCCCCCTCCCTA TTGCCACGGCGGAACTCATCGCCGCCTGCCTTGCCCGCTGCTGGACAGGGG CTCGGCTGTTGGGCACTGACAATTCCGTGGTGTTGTCGGGGAAATCATCGT CCTTTCCTTGGCTGCTCGCCTGTGTTGCCACCTGGATTCTGCGCGGGACGT CCTTCTGCTACGTCCCTTCGGCCCTCAATCCAGCGGACCTTCCTTCCCGCG GCCTGCTGCCGGCTCTGCGGCCTCTTCCGCGTCTTCGA.
[0212] The WPRE can also have the sequence:
TABLE-US-00005 (SEQ ID NO: 61) AATCAACCTCTGGATTACAAAATTTGTGAAAGATTGACTGGTATTCTTAAC TATGTTGCTCCTTTTACGCTATGTGGATACGCTGCTTTAATGCCTTTGTAT CATGCTATTGCTTCCCGTATGGCTTTCATTTTCTCCTCCTTGTATAAATCC TGGTTAGTTCTTGCCACGGCGGAACTCATCGCCGCCTGCCTTGCCCGCTGC TGGACAGGGGCTCGGCTGTTGGGCACTGACAATTCCGTGGTGTTTATTTGT GAAATTTGTGATGCTATTGCTTTATTTGTAACCATCTAGCTTTATTTGTGA AATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACA AGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGAT GTGGGAGGTTTTTTAAA.
[0213] In some embodiments, the nucleic acid vectors for use in the compositions and methods described herein include a reporter sequence, which can be useful in verifying OTOF gene expression, for example, in specific cells and tissues (e.g., in cochlear hair cells). Reporter sequences that may be provided in a transgene include DNA sequences encoding β-lactamase, β-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, and others well known in the art. When associated with regulatory elements which drive their expression, the reporter sequences provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and immunohistochemistry. For example, where the marker sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays for β-galactosidase activity. Where the transgene is green fluorescent protein or luciferase, the vector carrying the signal may be measured visually by color or light production in a luminometer.
Overlapping Dual Vectors
[0214] One approach for expressing large proteins in mammalian cells involves the use of overlapping dual vectors. This approach is based on the use of two nucleic acid vectors, each of which contains a portion of a polynucleotide that encodes a protein of interest and has a defined region of sequence overlap with the other polynucleotide. Homologous recombination can occur at the region of overlap and lead to the formation of a single nucleic acid molecule that encodes the full-length protein of interest.
[0215] Overlapping dual vectors for use in the methods and compositions described herein contain at least one kilobase (kb) of overlapping sequence (e.g., 1 kb, 1.5 kb, 2 kb, 2.5 kb, 3 kb or more of overlapping sequence). The nucleic acid vectors are designed such that the overlapping region is centered at an OTOF exon boundary, with an equal amount of overlap on either side of the boundary. The boundaries are chosen based on the size of the promoter and the locations of the portions of the polynucleotide that encode OTOF C2 domains. Overlapping regions are centered on exon boundaries that occur outside of the portion of the polynucleotide that encodes the C2C domain (e.g., after the portion of the polynucleotide that encodes the C2C domain). Exon boundaries within the portion of the polynucleotide that encodes the C2D domain can be selected as the center of the overlapping region, or exon boundaries located after the portion of the polynucleotide that encodes the C2D domain and before the portion of the polynucleotide that encodes the C2E domain can serve as the center of an overlapping region. The nucleic acid vectors for use in the methods and compositions described herein are also designed such that approximately half of the OTOF gene is contained within each vector (e.g., each vector contains a polynucleotide that encodes approximately half of the OTOF protein).
[0216] One exemplary overlapping dual vector system includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-28 and the 500 kb immediately 3′ of the exon 28/29 boundary of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5, or mouse OTOF, e.g., SEQ ID NO: 6); and a second nucleic acid vector containing the 500 kb immediately 5′ of the exon 28/29 boundary and exons 29-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5, or mouse OTOF, e.g., SEQ ID NO: 6) and a poly(A) sequence (e.g., a bovine growth hormone (bGH) poly(A) signal sequence). In this overlapping dual vector system, the overlapping sequence is centered at the exon 28/29 boundary, which is after the portion of the polynucleotide that encodes the C2D domain. Another exemplary overlapping dual vector system includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-24 and the 500 kb immediately 3′ of the exon 24/25 boundary of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5, or mouse OTOF, e.g., SEQ ID NO: 6); and a second nucleic acid vector containing the 500 kb immediately 5′ of the exon 24/25 boundary and exons 25-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5, or mouse OTOF, e.g., SEQ ID NO: 6) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). In this overlapping dual vector system, the overlapping sequence is centered at the exon 24/25 boundary, which is within the portion of the polynucleotide that encodes the C2D domain. The two exon boundaries described above can be used with any promoter that is a similar size to the CAG promoter (e.g., the CMV promoter or smCBA promoter), such as promoters that are 1 kb or shorter (e.g., approximately 1 kb, 950 bp, 900 bp, 850 bp, 800 bp, 750 bp, 700 bp, 650 bp, 600 bp, 550 bp 500 bp, 450 bp, 400 bp, 350 bp, 300 bp or shorter). For example, in either of the foregoing dual vector systems, the CMV promoter or the smCBA promoter, can be used in the place of the CAG promoter. A Myo15 promoter having a sequence that is 1 kb or shorter (e.g., a Myo15 promoter described hereinabove, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in place of the CAG promoter. Alternatively, a different exon boundary can be chosen that is within or after the portion of the polynucleotide that encodes the C2D domain and before the portion of the polynucleotide that encodes the C2E domain. The nucleic acid vectors containing promoters of this size can optionally contain OTOF UTRs. For example, in the foregoing overlapping dual vector system in which the overlapping region is centered at the exon 28/29 boundary of OTOF, the second nucleic acid vector can contain the full length OTOF 3′ UTR (e.g., the 1035 bp human OTOF 3′ UTR in dual vector systems encoding human OTOF, or the 1001 bp mouse OTOF 3′ UTR in dual vector systems encoding mouse OTOF). In the foregoing overlapping dual vector system in which the overlapping region is centered at the exon 24/25 boundary of OTOF, neither the first nor the second nucleic acid vector contains an OTOF UTR.
[0217] In some embodiments, the first nucleic acid vector in the overlapping dual vector system contains a long promoter (e.g., a promoter that is longer than 1 kb, e.g., 1.1 kb, 1.25 kb, 1.5 kb, 1.75 kb, 2 kb, 2.5 kb, 3 kb or longer). In such overlapping dual vector systems, the overlapping region is centered at an exon boundary that is located after the portion of the polynucleotide that encodes the C2C domain and before the portion of the polynucleotide that encodes the C2D domain. For example, an overlapping dual vector system for use in the methods and compositions described herein includes a first nucleic acid vector containing a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked exons 1-21 and the 500 kb immediately 3′ of the exon 21/22 boundary of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5); and a second nucleic acid vector containing the 500 kb immediately 5′ of the exon 21/22 boundary and exons 22-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). In this overlapping dual vector system, neither the first nor the second nucleic acid vector includes an OTOF UTR. A short promoter (e.g., a CMV promoter, CAG promoter, smCBA promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in this dual vector system (e.g., a dual vector system in which the overlapping region is centered at the exon 21/22 boundary). If a short promoter is used, additional elements, such as a 5′ OTOF UTR, can be included in the first vector (e.g., the vector containing exons 1-21 and the 500 kb immediately 3′ of the exon 21/22 boundary of a polynucleotide encoding an OTOF protein).
Trans-Splicing Dual Vectors
[0218] A second approach for expressing large proteins in mammalian cells involves the use of trans-splicing dual vectors. In this approach, two nucleic acid vectors are used that contain distinct nucleic acid sequences, and the polynucleotide encoding the N-terminal portion of the protein of interest and the polynucleotide encoding the C-terminal portion of the protein of interest do not overlap. Instead, the first nucleic acid vector includes a splice donor sequence 3′ of the polynucleotide encoding the N-terminal portion of the protein of interest, and the second nucleic acid vector includes a splice acceptor sequence 5′ of the polynucleotide encoding the C-terminal portion of the protein of interest. When the first and second nucleic acids are present in the same cell, their ITRs can concatemerize, forming a single nucleic acid structure in which the concatemerized ITRs are positioned between the splice donor and splice acceptor. Trans-splicing then occurs during transcription, producing a nucleic acid molecule in which the polynucleotides encoding the N-terminal and C-terminal portions of the protein of interest are contiguous, thereby forming the full-length coding sequence.
[0219] Trans-splicing dual vectors for use in the methods and compositions described herein are designed such that approximately half of the OTOF gene is contained within each vector (e.g., each vector contains a polynucleotide that encodes approximately half of the OTOF protein). The determination of how to split the polynucleotide sequence between the two nucleic acid vectors is made based on the size of the promoter and the locations of the portions of the polynucleotide that encode the OTOF C2 domains. When a short promoter is used in the trans-splicing dual vector system (e.g., a promoter that is 1 kb or shorter, e.g., approximately 1 kb, 950 bp, 900 bp, 850 bp, 800 bp, 750 bp, 700 bp, 650 bp, 600 bp, 550 bp 500 bp, 450 bp, 400 bp, 350 bp, 300 bp or shorter), such as a CAG promoter, a CMV promoter, a smCBA promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter (e.g., a Myo15 promoter described hereinabove, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) the OTOF polynucleotide sequence is divided between the two nucleic acid vectors at an exon boundary that occurs after the portion of the polynucleotide that encodes the C2D domain and before the portion of the polynucleotide that encodes the C2E domain, for example, the exon 26/27 boundary. The nucleic acid vectors containing promoters of this size can optionally contain OTOF UTRs (e.g., both the 5′ and 3′ OTOF UTRs, e.g., full-length UTRs). When a long promoter is used in the trans-splicing dual vector system (e.g., a promoter that is longer than 1 kb, e.g., 1.1 kb, 1.25 kb, 1.5 kb, 1.75 kb, 2 kb, 2.5 kb, 3 kb or longer), such as a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36), the OTOF polynucleotide sequence will be divided between the two nucleic acid vectors at an exon boundary that occurs after the portion of the polynucleotide that encodes the C2C domain, and either before the portion of the polynucleotide that encodes the C2D domain, such as the exon 19/20 boundary, or within the portion of the polynucleotide that encodes the C2D domain, such as the exon 25/26 boundary. A short promoter (e.g., a CMV promoter, smCBA promoter, CAG promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the dual vector systems designed for large promoters, in which case additional elements (e.g., OTOF UTR sequences) may be included in the first vector (e.g., the vector containing the portion of the polynucleotide the encodes the C2C domain).
[0220] One exemplary trans-splicing dual vector system that uses a short promoter includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-26 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a splice donor sequence 3′ of the polynucleotide sequence; and a second nucleic acid vector containing a splice acceptor sequence 5′ of exons 27-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). An alternative trans-splicing dual vector system includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-28 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a splice donor sequence 3′ of the polynucleotide sequence; and a second nucleic acid vector containing a splice acceptor sequence 5′ of exons 29-48 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). The CMV promoter, smCBA promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter (e.g., a Myo15 promoter described hereinabove, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can be used in place of the CAG promoter either of the foregoing dual vector systems. These nucleic acid vectors can also contain full length 5′ and 3′ OTOF UTRs in the first and second nucleic acid vectors, respectively (e.g., the first nucleic acid vector can contain the 5′ human OTOF UTR (127 bp) in dual vector systems encoding human OTOF, or the 5′ mouse UTR (134 bp) in dual vector systems encoding mouse OTOF; and the second nucleic acid vector can contain the 3′ human OTOF UTR (1035 bp) in dual vector systems encoding human OTOF, or the 3′ mouse OTOF UTR (1001 bp) in dual vector systems encoding mouse OTOF).
[0221] An exemplary trans-splicing dual vector system that uses a long promoter includes a first nucleic acid vector containing a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-19 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a splice donor sequence 3′ of the polynucleotide sequence; and a second nucleic acid vector containing a splice acceptor sequence 5′ of exons 20-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). Alternatively, the trans-splicing dual vector system can include a first nucleic acid vector containing a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-20 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a splice donor sequence 3′ of the polynucleotide sequence; and a second nucleic acid vector containing a splice acceptor sequence 5′ of exons 21-48 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). Neither the first nor the second nucleic acid vector in either of the foregoing Myo15 promoter trans-splicing dual vector systems contains an OTOF UTR. A short promoter (e.g., a CMV promoter, smCBA promoter, CAG promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the foregoing dual vector systems designed for large promoters. If these dual vector systems contain a short promoter, they may also include a 5′ OTOF UTR in the first vector.
[0222] To accommodate an OTOF UTR, the OTOF coding sequence can be divided in a different position. For example, in a trans-splicing dual vector system in which the first nucleic acid vector contains a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-25 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a splice donor sequence 3′ of the polynucleotide sequence; and the second nucleic acid vector contains a splice acceptor sequence 5′ of exons 26-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence), the second nucleic acid can also contain a full length OTOF 3′ UTR (e.g., the 1035 bp human OTOF 3′ UTR). For mouse OTOF, the trans-splicing dual vector system can contain a 3′ UTR if the first nucleic acid vector contains a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-24 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a splice donor sequence 3′ of the polynucleotide sequence; and the second nucleic acid vector contains a splice acceptor sequence 5′ of exons 25-48 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6) and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). In this dual vector system, the second nucleic acid can also contain a full length OTOF 3′ UTR (e.g., the 1001 bp mouse OTOF 3′ UTR). A short promoter (e.g., a CMV promoter, smCBA promoter, CAG promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the foregoing dual vector systems designed for large promoters. If these dual vector systems contain a short promoter, they may also include a 5′ OTOF UTR in the first vector.
Dual Hybrid Vectors
[0223] A third approach for expressing large proteins in mammalian cells involves the use of dual hybrid vectors. This approach combines elements of the overlapping dual vector strategy and the trans-splicing strategy in that it features both an overlapping region at which homologous recombination can occur and splice donor and splice acceptor sequences. In dual hybrid vector systems, the overlapping region is a recombinogenic region that is contained in both the first and second nucleic acid vectors, rather than a portion of the polynucleotide sequence encoding the protein of interest—the polynucleotide encoding the N-terminal portion of the protein of interest and the polynucleotide encoding the C-terminal portion of the protein of interest do not overlap in this approach. The recombinogenic region is 3′ of the splice donor sequence in the first nucleic acid vector and 5′ of the splice acceptor sequence in the second nucleic acid sequence. The first and second nucleic acid sequences can then join to form a single sequence based on one of two mechanisms: 1) recombination at the overlapping region, or 2) concatemerization of the ITRs. The remaining recombinogenic region(s) and/or the concatemerized ITRs can be removed by splicing, leading to the formation of a contiguous polynucleotide sequence that encodes the full-length protein of interest.
[0224] Recombinogenic regions that can be used in the compositions and methods described herein include the F1 phage AK gene having a sequence of: GGGATTTTGCCGATTTCGGCCTATTGGTTAA AAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAAT (SEQ ID NO: 19) and alkaline phosphatase (AP) gene fragments as described in U.S. Pat. No. 8,236,557, which are incorporated herein by reference. In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00006 (SEQ ID NO: 62) CCCCGGGTGCGCGGCGTCGGTGGTGCCGGCGGGGGGCGCCAGGTCGCAGGC GGTGTAGGGCTCCAGGCAGGCGGCGAAGGCCATGACGTGCGCTATGAAGGT CTGCTCCTGCACGCCGTGAACCAGGTGCGCCTGCGGGCCGCGCGCGAACAC CGCCACGTCCTCGCCTGCGTGGGTCTCTTCGTCCAGGGGCACTGCTGACTG CTGCCGATACTCGGGGCTCCCGCTCTCGCTCTCGGTAACATCCGGCCGGGC GCCGTCCTTGAGCACATAGCCTGGACCGTTTCCGTATAGGAGGACCGTGTA GGCCTTCCTGTCCCGGGCCTTGCCAGCGGCCAGCCCGATGAAGGAGCTCCC TCGCAGGGGGTAGCCTCCGAAGGAGAAGACGTGGGAGTGGTCGGCAGTGAC GAGGCTCAGCGTGTCCTCCTCGCTGGTGAGCTGGCCCGCCCTCTCAATGGC GTCGTCGAACATGATCGTCTCAGTCAGTGCCCGGTAAGCCCTGCTTTCATG ATGACCATGGTCGATGCGACCACCCTCCACGAAGAGGAAGAAGCCGCGGGG GTGTCTGCTCAGCAGGCGCAGGGCAGCCTCTGTCATCTCCATCAGGGAGGG GTCCAGTGTGGAGTCTCGGTGGATCTCGTATTTCATGTCTCCAGGCTCAAA GAGACCCATGAGATGGGTCACAGACGGGTCCAGGGAAGCCTGCATGAGCTC AGTGCGGTTCCACACGTACCGGGCACCCTGGCGTTCGCCGAGCCATTCCTG CACCAGATTCTTCCCGTCCAGCCTGGTCCCACCTTGGCTGTAGTCATCTGG GTACTCAGGGTCTGGGGTTCCCATGCGAAACATGTACTTTCGGCCTCCA.
In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00007 (SEQ ID NO: 63) CCCCGGGTGCGCGGCGTCGGTGGTGCCGGCGGGGGGCGCCAGGTCGCAGGC GGTGTAGGGCTCCAGGCAGGCGGCGAAGGCCATGACGTGCGCTATGAAGGT CTGCTCCTGCACGCCGTGAACCAGGTGCGCCTGCGGGCCGCGCGCGAACAC CGCCACGTCCTCGCCTGCGTGGGTCTCTTCGTCCAGGGGCACTGCTGACTG CTGCCGATACTCGGGGCTCCCGCTCTCGCTCTCGGTAACATCCGGCCGGGC GCCGTCCTTGAGCACATAGCCTGGACCGTTTCCGTATAGGAGGACCGTGTA GGCCTTCCTGTCCCGGGCCTTGCCAGCGGCCAGCCCGATGAAGGAGCTCCC TCGCAGGGGGTAGCCTCCGAAGGAGAAGACGTGGGAGTGGTCGGCAGTGAC GAGGCTCAGCGTGTCCTCCTCG CTGGTGA.
In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00008 (SEQ ID NO: 64) GCTGGCCCGCCCTCTCAATGGCGTCGTCGAACATGATCGTCTCAGTCAGTG CCCGGTAAGCCCTGCTTTCATGATGACCATGGTCGATGCGACCACCCTCCA CGAAGAGGAAGAAGCCGCGGGGGTGTCTGCTCAGCAGGCGCAGGGCAGCCT CTGTCATCTCCATCAGGGAGGGGTCCAGTGTGGAGTCTCGGTGGATCTCGT ATTTCATGTCTCCAGGCTCAAAGAGACCCATGAGATGGGTCACAGACGGGT CCAGGGAAGCCTGCATGAGCTCAGTGCGGTTCCACACGTACCGGGCACCCT GGCGTTCGCCGAGCCATTCCTGCACCAGATTCTTCCCGTCCAGCCTGGTCC CACCTTGGCTGTAGTCATCTGGGTACTCAGGGTCTGGGGTTCCCATGCGAA ACATGTACTTTCGGCCTCCA.
In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00009 (SEQ ID NO: 65) CCCCGGGTGCGCGGCGTCGGTGGTGCCGGCGGGGGGCGCCAGGTCGCAGGC GGTGTAGGGCTCCAGGCAGGCGGCGAAGGCCATGACGTGCGCTATGAAGGT CTGCTCCTGCACGCCGTGAACCAGGTGCGCCTGCGGGCCGCGCGCGAACAC CGCCACGTCCTCGCCTGCGTGGGTCTCTTCGTCCAGGGGCACTGCTGACTG CTGCCGATACTCGGGGCTCCCGCTCTCGCTCTCGGTAACATCCGGCCGGGC GCCGTCCTTGAGCACATAGCCTGGACCGTTTC
In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00010 (SEQ ID NO: 66) CGTATAGGAGGACCGTGTAGGCCTTCCTGTCCCGGGCCTTGCCAGCGGCCA GCCCGATGAAGGAGCTCCCTCGCAGGGGGTAGCCTCCGAAGGAGAAGACGT GGGAGTGGTCGGCAGTGACGAGGCTCAGCGTGTCCTCCTCGCTGGTGAGCT GGCCCGCCCTCTCAATGGCGTCGTCGAACATGATCGTCTCAGTCAGTGCCC GGTAAGCCCTGCTTTCATGATGACCATGGTCGATGCGACCACCCTCCACGA AGAGGAAGAAGCCGCGGGGGTGTCTGCTCAGCAGG.
In some embodiments, the AP gene fragment has the sequence of:
TABLE-US-00011 (SEQ ID NO: 67) CGCAGGGCAGCCTCTGTCATCTCCATCAGGGAGGGGTCCAGTGTGGAGTCT CGGTGGATCTCGTATTTCATGTCTCCAGGCTCAAAGAGACCCATGAGATGG GTCACAGACGGGTCCAGGGAAGCCTGCATGAGCTCAGTGCGGTTCCACACG TACCGGGCACCCTGGCGTTCGCCGAGCCATTCCTGCACCAGATTCTTCCCG TCCAGCCTGGTCCCACCTTGGCTGTAGTCATCTGGGTACTCAGGGTCTGGG GTTCCCATGCGAAACATGTACTTTCGGCCTCCA.
[0225] An exemplary splice donor sequence for use in the methods and compositions described herein (e.g., in trans-splicing and dual hybrid approaches) has the sequence: GTAAGTATCAAGGTTACAAGAC AGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCT (SEQ ID NO: 20). An exemplary splice acceptor sequence for use in the methods and compositions described herein (e.g., in trans-splicing and dual hybrid approaches) has the sequence: GATAGGCACCTATTGG TCTTACTGACATCCACTTTGCCTTTCTCTCCACAG (SEQ ID NO: 21). The splice donor sequence GTAAGTATCAAGGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAGAGAA GACTCTTGCGTTTCTGA (SEQ ID NO: 68) and the splice acceptor sequence TAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACAG (SEQ ID NO: 69) can also be used in the methods and compositions described herein. Additional examples of splice donor and splice acceptor sequences are known in the art.
[0226] Dual hybrid vectors for use in the methods and compositions described herein are designed such that approximately half of the OTOF gene is contained within each vector (e.g., each vector contains a polynucleotide that encodes approximately half of the OTOF protein). The determination of how to split the polynucleotide sequence between the two nucleic acid vectors is made based on the size of the promoter and the locations of the portions of the polynucleotide that encode the OTOF C2 domains. When a short promoter is used in the dual hybrid vector system (e.g., a promoter that is 1 kb or shorter, e.g., approximately 1 kb, 950 bp, 900 bp, 850 bp, 800 bp, 750 bp, 700 bp, 650 bp, 600 bp, 550 bp 500 bp, 450 bp, 400 bp, 350 bp, 300 bp or shorter), such as CAG, CMV, smCBA, or a Myo15 promoter having a sequence that is 1 kb or shorter (e.g., a Myo15 promoter described hereinabove, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60), the OTOF polynucleotide sequence is divided between the two nucleic acid vectors at an exon boundary that occurs after the portion of the polynucleotide that encodes the C2D domain and before the portion of the polynucleotide that encodes C2E domain, for example, the exon 26/27 boundary. The nucleic acid vectors containing promoters of this size can optionally contain OTOF UTRs (e.g., full-length 5′ and 3′ UTRs). When a long promoter is used in the dual hybrid vector system (e.g., a promoter that is longer than 1 kb, e.g., 1.1 kb, 1.25 kb, 1.5 kb, 1.75 kb, 2 kb, 2.5 kb, 3 kb or longer), such as a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36), the OTOF polynucleotide sequence will be divided between the two nucleic acid vectors at an exon boundary that occurs after the portion of the polynucleotide that encodes the C2C domain, and either before the portion of the polynucleotide that encodes the C2D domain, such as the exon 19/20 boundary, or within the portion of the polynucleotide that encodes the C2D domain, such as the exon 25/26 boundary. A short promoter (e.g., a CMV promoter, CAG promoter, smCBA promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the dual vector systems designed for large promoters, in which case additional elements (e.g., OTOF UTR sequences) may be included in the first vector (e.g., the vector containing the portion of the polynucleotide the encodes the C2C domain).
[0227] One exemplary dual hybrid vector system that uses a short promoter includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-26 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and a second nucleic acid vector containing a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 27-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). The first and second nucleic acid vectors can also contain the full length 5′ and 3′ OTOF UTRs, respectively (e.g., the 127 bp human OTOF 5′ UTR can be included in the first nucleic acid vector, and the 1035 bp human OTOF 3′ UTR can be included in the second nucleic acid vector). Another exemplary dual hybrid vector system that uses a short promoter includes a first nucleic acid vector containing a CAG promoter operably linked to exons 1-28 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and a second nucleic acid vector containing a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 29-48 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). The first and second nucleic acid vectors can also contain the full length 5′ and 3′ OTOF UTRs, respectively (e.g., the 134 bp mouse OTOF 5′ UTR can be included in the first nucleic acid vector, and the 1001 bp mouse OTOF 3′ UTR can be included in the second nucleic acid vector). The CMV promoter, smCBA promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter (e.g., a Myo15 promoter described hereinabove, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can be used in place of the CAG promoter either of the foregoing dual vector systems.
[0228] An exemplary dual hybrid vector system that uses a long promoter includes a first nucleic acid vector containing a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-19 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and a second nucleic acid vector containing a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 20-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). Another exemplary dual hybrid vector system that uses a long promoter includes a first nucleic acid vector containing a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-20 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and a second nucleic acid vector containing a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 21-48 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). Neither the first nor the second nucleic acid vector in either of the foregoing Myo15 promoter dual hybrid vector systems contains an OTOF UTR. A short promoter (e.g., a CMV promoter, smCBA promoter, CAG promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the foregoing dual vector systems designed for large promoters. If these dual vector systems contain a short promoter, they may also include a 5′ OTOF UTR in the first vector.
[0229] To accommodate an OTOF UTR, the OTOF coding sequence can be divided in a different position. For example, in a dual hybrid vector system in which the first nucleic acid vector contains a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-25 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and the second nucleic acid vector contains a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 26-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence), the second nucleic acid can also contain a full-length OTOF 3′ UTR (e.g., the 1035 bp human OTOF UTR). For mouse OTOF, the dual hybrid vector system can contain a 3′ UTR if the first nucleic acid vector contains a Myo15 promoter that is longer than 1 kb (e.g., SEQ ID NO: 36) operably linked to exons 1-24 of a polynucleotide encoding an OTOF protein (e.g., mouse OTOF, e.g., SEQ ID NO: 6), a splice donor sequence 3′ of the polynucleotide sequence, and a recombinogenic region 3′ of the splice donor sequence; and the second nucleic acid vector contains a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 25-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), and a poly(A) sequence (e.g., a bGH poly(A) signal sequence). In this dual hybrid vector system, the second nucleic acid can also contain a full-length OTOF 3′ UTR (e.g., the 1001 bp mouse OTOF UTR). A short promoter (e.g., a CMV promoter, smCBA promoter, CAG promoter, or a Myo15 promoter having a sequence that is 1 kb or shorter, e.g., a Myo15 promoter having the sequence of any one of SEQ ID NOs: 38, 39, or 49-60) can also be used in the foregoing dual vector systems designed for large promoters. If these dual vector systems contain a short promoter, they may also include a 5′ OTOF UTR in the first vector.
[0230] The dual hybrid vectors used in the methods and compositions described herein can optionally include a degradation signal sequence in both the first and second nucleic acid vectors. The degradation signal sequence can be included to prevent or reduce the expression of portions of the OTOF protein from polynucleotides that failed to recombine and/or undergo splicing. The degradation signal sequence is positioned 3′ of the recombinogenic region in the first nucleic acid vector, and is positioned between the recombinogenic region and the splice acceptor in the second nucleic acid vector. A degradation signal sequence that can be used in the compositions and methods described herein has the sequence of:
TABLE-US-00012 (SEQ ID NO: 22) GCCTGCAAGAACTGGTTCAGCAGCCTGAGCCACTTCGTGATCCACCTG.
[0231] Exemplary pairs of overlapping, trans-splicing, and dual hybrid vectors are described in Table 4 below.
TABLE-US-00013 TABLE 4 Exemplary pairs of overlapping, trans-splicing, and hybrid dual vectors for use in the methods and compositions described herein Vector Pair Number Vector Type Vector Pair 1 Overlapping First nucleic acid vector contains: CAG promoter operably linked to exons 1-24 and the 500 kb 3′ of the exon 24/25 boundary of a polynucleotide encoding a human OTOF protein Second nucleic acid vector contains: the 500 kb 5′ of the exon 24/25 boundary and exons 25-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 2 Overlapping First nucleic acid vector contains: CAG promoter operably linked to exons 1-28 and the 500 kb 3′ of the exon 28/29 boundary of a polynucleotide encoding a human OTOF protein Second nucleic acid vector contains: the 500 kb 5′ of the exon 28/29 boundary and exons 29-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 3 Overlapping First nucleic acid vector contains: Myo15 promoter operably linked to exons 1-21 and the 500 kb 3′ of the exon 21/22 boundary of a polynucleotide encoding a human OTOF protein Second nucleic acid vector contains: the 500 kb 5′ of the exon 21/22 boundary and exons 22-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 4 Trans- First nucleic acid vector contains: CAG promoter operably linked to exons splicing 1-26 of a polynucleotide encoding a human OTOF protein and a splice donor sequence 3′ of the polynucleotide Second nucleic acid vector contains: a splice acceptor sequence 5′ of exons 27-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 5 Trans- First nucleic acid vector contains: Myo15 promoter operably linked to splicing exons 1-19 of a polynucleotide encoding a human OTOF protein and a splice donor sequence 3′ of the polynucleotide Second nucleic acid vector contains: a splice acceptor sequence 5′ of exons 20-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 6 Trans- First nucleic acid vector contains: Myo15 promoter operably linked to splicing exons 1-25 of a polynucleotide encoding a human OTOF protein and a splice donor sequence 3′ of the polynucleotide Second nucleic acid vector contains: a splice acceptor sequence 5′ of exons 26-48 of a polynucleotide encoding a human OTOF protein and a bGH poly(A) sequence 7 Hybrid First nucleic acid vector contains: CAG promoter operably linked to exons 1-26 of a polynucleotide encoding a human OTOF protein, a splice donor sequence 3′ of the polynucleotide, and a recombinogenic region 3′ of the splice donor sequence Second nucleic acid vector contains: a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, exons 27-48 of a polynucleotide encoding a human OTOF protein 3′ of the splice acceptor sequence, and a bGH poly(A) sequence 8 Hybrid First nucleic acid vector contains: Myo15 promoter operably linked to exons 1-19 of a polynucleotide encoding a human OTOF protein, a splice donor sequence 3′ of the polynucleotide, and a recombinogenic region 3′ of the splice donor sequence Second nucleic acid vector contains: a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, exons 20-48 of a polynucleotide encoding a human OTOF protein 3′ of the splice acceptor sequence, and a bGH poly(A) sequence 9 Hybrid First nucleic acid vector contains: Myo15 promoter operably linked to exons 1-25 of a polynucleotide encoding a human OTOF protein, a splice donor sequence 3′ of the polynucleotide, and a recombinogenic region 3′ of the splice donor sequence Second nucleic acid vector contains: a recombinogenic region, a splice acceptor sequence 3′ of the recombinogenic region, exons 26-48 of a polynucleotide encoding a human OTOF protein 3′ of the splice acceptor sequence, and a bGH poly(A) sequence
Vectors for the Expression of OTOF
[0232] In addition to achieving high rates of transcription and translation, stable expression of an exogenous gene in a mammalian cell can be achieved by integration of the polynucleotide containing the gene into the nuclear genome of the mammalian cell. A variety of vectors for the delivery and integration of polynucleotides encoding exogenous proteins into the nuclear DNA of a mammalian cell have been developed. Examples of expression vectors are disclosed in, e.g., WO 1994/011026 and are incorporated herein by reference. Expression vectors for use in the compositions and methods described herein contain a polynucleotide sequence that encodes a portion of OTOF, as well as, e.g., additional sequence elements used for the expression of these agents and/or the integration of these polynucleotide sequences into the genome of a mammalian cell. Certain vectors that can be used for the expression of OTOF include plasmids that contain regulatory sequences, such as promoter and enhancer regions, which direct gene transcription. Other useful vectors for expression of OTOF contain polynucleotide sequences that enhance the rate of translation of these genes or improve the stability or nuclear export of the mRNA that results from gene transcription. These sequence elements include, e.g., 5′ and 3′ untranslated regions and a polyadenylation signal site in order to direct efficient transcription of the gene carried on the expression vector. The expression vectors suitable for use with the compositions and methods described herein may also contain a polynucleotide encoding a marker for selection of cells that contain such a vector. Examples of a suitable marker include genes that encode resistance to antibiotics, such as ampicillin, chloramphenicol, kanamycin, or nourseothricin.
AAV Vectors for Nucleic Acid Delivery
[0233] In some embodiments, nucleic acids of the compositions and methods described herein are incorporated into recombinant AAV (rAAV) vectors and/or virions in order to facilitate their introduction into a cell. rAAV vectors useful in the compositions and methods described herein are recombinant nucleic acid constructs that include (1) a heterologous sequence to be expressed (e.g., a polynucleotide encoding an N-terminal or C-terminal portion of an OTOF protein) and (2) viral sequences that facilitate stability and expression of the heterologous genes. The viral sequences may include those sequences of AAV that are required in cis for replication and packaging (e.g., functional ITRs) of the DNA into a virion. Such rAAV vectors may also contain marker or reporter genes. Useful rAAV vectors have one or more of the AAV WT genes deleted in whole or in part but retain functional flanking ITR sequences. The AAV ITRs may be of any serotype suitable for a particular application. For use in the methods and compositions described herein, the ITRs can be AAV2 ITRs. Methods for using rAAV vectors are described, for example, in Tal et al., J. Biomed. Sci. 7:279 (2000), and Monahan and Samulski, Gene Delivery 7:24 (2000), the disclosures of each of which are incorporated herein by reference as they pertain to AAV vectors for gene delivery.
[0234] The nucleic acids and vectors described herein can be incorporated into a rAAV virion in order to facilitate introduction of the nucleic acid or vector into a cell. The capsid proteins of AAV compose the exterior, non-nucleic acid portion of the virion and are encoded by the AAV cap gene. The cap gene encodes three viral coat proteins, VP1, VP2 and VP3, which are required for virion assembly. The construction of rAAV virions has been described, for instance, in U.S. Pat. Nos. 5,173,414; 5,139,941; 5,863,541; 5,869,305; 6,057,152; and 6,376,237; as well as in Rabinowitz et al., J. Virol. 76:791 (2002) and Bowles et al., J. Virol. 77:423 (2003), the disclosures of each of which are incorporated herein by reference as they pertain to AAV vectors for gene delivery.
[0235] rAAV virions useful in conjunction with the compositions and methods described herein include those derived from a variety of AAV serotypes including AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, and PHP.S. For targeting cochlear hair cells, AAV1, AAV2, AAV6, AAV9, Anc80, Anc80L65, DJ/9, 7m8, and PHP.B may be particularly useful. Serotypes evolved for transduction of the retina may also be used in the methods and compositions described herein. The first and second nucleic acid vectors in the compositions and methods described herein may have the same serotype or different serotypes. Construction and use of AAV vectors and AAV proteins of different serotypes are described, for instance, in Chao et al., Mol. Ther. 2:619 (2000); Davidson et al., Proc. Nati. Acad. Sci. USA 97:3428 (2000); Xiao et al., J. Virol. 72:2224 (1998); Halbert et al., J. Virol. 74:1524 (2000); Halbert et al., J. Virol. 75:6615 (2001); and Auricchio et al., Hum. Molec. Genet. 10:3075 (2001), the disclosures of each of which are incorporated herein by reference as they pertain to AAV vectors for gene delivery.
[0236] Also useful in conjunction with the compositions and methods described herein are pseudotyped rAAV vectors. Pseudotyped vectors include AAV vectors of a given serotype (e.g., AAV9) pseudotyped with a capsid gene derived from a serotype other than the given serotype (e.g., AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, etc.). Techniques involving the construction and use of pseudotyped rAAV virions are known in the art and are described, for instance, in Duan et al., J. Virol. 75:7662 (2001); Halbert et al., J. Virol. 74:1524 (2000); Zolotukhin et al., Methods, 28:158 (2002); and Auricchio et al., Hum. Molec. Genet. 10:3075 (2001).
[0237] AAV virions that have mutations within the virion capsid may be used to infect particular cell types more effectively than non-mutated capsid virions. For example, suitable AAV mutants may have ligand insertion mutations for the facilitation of targeting AAV to specific cell types. The construction and characterization of AAV capsid mutants including insertion mutants, alanine screening mutants, and epitope tag mutants is described in Wu et al., J. Virol. 74:8635 (2000). Other rAAV virions that can be used in methods described herein include those capsid hybrids that are generated by molecular breeding of viruses as well as by exon shuffling. See, e.g., Soong et al., Nat. Genet., 25:436 (2000) and Kolman and Stemmer, Nat. Biotechnol. 19:423 (2001).
Pharmaceutical Compositions
[0238] The nucleic acid vectors described herein may be incorporated into a vehicle for administration into a patient, such as a human patient suffering from sensorineural hearing loss or auditory neuropathy, as described herein. Pharmaceutical compositions containing vectors, such as viral vectors, that contain a polynucleotide encoding a portion of an OTOF protein can be prepared using methods known in the art. For example, such compositions can be prepared using, e.g., physiologically acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980); incorporated herein by reference), and in a desired form, e.g., in the form of lyophilized formulations or aqueous solutions.
[0239] Mixtures of the nucleic acid vectors (e.g., viral vectors) described herein may be prepared in water suitably mixed with one or more excipients, carriers, or diluents. Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations may contain a preservative to prevent the growth of microorganisms. The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions (described in U.S. Pat. No. 5,466,468, the disclosure of which is incorporated herein by reference). In any case the formulation may be sterile and may be fluid to the extent that easy syringability exists. Formulations may be stable under the conditions of manufacture and storage and may be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.
[0240] For example, a solution containing a pharmaceutical composition described herein may be suitably buffered, if necessary, and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, subcutaneous, and intraperitoneal administration. In this connection, sterile aqueous media that can be employed will be known to those of skill in the art in light of the present disclosure. For example, one dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion. Some variation in dosage will necessarily occur depending on the condition of the subject being treated. For local administration to the inner ear, the composition may be formulated to contain a synthetic perilymph solution. An exemplary synthetic perilymph solution includes 20-200 mM NaCl, 1-5 mM KCl, 0.1-10 mM CaCl.sub.2, 1-10 mM glucose, and 2-50 mM HEPEs, with a pH between about 6 and 9 and an osmolality of about 300 mOsm/kg. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject. Moreover, for human administration, preparations may meet sterility, pyrogenicity, general safety, and purity standards as required by FDA Office of Biologics standards.
Methods of Treatment
[0241] The compositions described herein may be administered to a subject with sensorineural hearing loss or auditory neuropathy by a variety of routes, such as local administration to the inner ear (e.g., administration into the perilymph or endolymph, e.g., through the oval window, round window, or horizontal canal, e.g., administration to a cochlear hair cell), intravenous, parenteral, intradermal, transdermal, intramuscular, intranasal, subcutaneous, percutaneous, intratracheal, intraperitoneal, intraarterial, intravascular, inhalation, perfusion, lavage, and oral administration. The most suitable route for administration in any given case will depend on the particular composition administered, the patient, pharmaceutical formulation methods, administration methods (e.g., administration time and administration route), the patient's age, body weight, sex, severity of the disease being treated, the patient's diet, and the patient's excretion rate. Compositions may be administered once, or more than once (e.g., once annually, twice annually, three times annually, bi-monthly, monthly, or bi-weekly). In some embodiments, the first and second nucleic acid vectors are administered simultaneously (e.g., in one composition). In some embodiments, the first and second nucleic acid vectors are administered sequentially (e.g., the second nucleic acid vector is administered immediately after the first nucleic acid vector, or 5 minutes, 10 minutes, 15 minutes, 20 minutes, 25 minutes, 30 minutes, 45 minutes, 1 hour, 2 hours, 8 hours, 12 hours, 1 day, 2 days, 7 days, two weeks, 1 month or more after the first nucleic acid vector). The first and second nucleic acid vector can have the same serotype or different serotypes (e.g., AAV serotypes).
[0242] Subjects that may be treated as described herein are subjects having or at risk of developing sensorineural hearing loss or auditory neuropathy. The compositions and methods described herein can be used to treat subjects having a mutation in OTOF (e.g., a mutation that reduces OTOF function or expression, or an OTOF mutation associated with sensorineural hearing loss), subjects having a family history of autosomal recessive sensorineural hearing loss or deafness (e.g., a family history of OTOF-related hearing loss), or subjects whose OTOF mutational status and/or OTOF activity level is unknown. The methods described herein may include a step of screening a subject for a mutation in OTOF prior to treatment with or administration of the compositions described herein. A subject can be screened for an OTOF mutation using standard methods known to those of skill in the art (e.g., genetic testing). The methods described herein may also include a step of assessing hearing in a subject prior to treatment with or administration of the compositions described herein. Hearing can be assessed using standard tests, such as audiometry, ABR, electrocochleography (ECOG), and otoacoustic emissions. The compositions and methods described herein may also be administered as a preventative treatment to patients at risk of developing hearing loss or auditory neuropathy, e.g., patients who have a family history of inherited hearing loss or patients carrying an OTOF mutation who do not yet exhibit hearing loss or impairment.
[0243] Treatment may include administration of a composition containing the nucleic acid vectors (e.g., AAV viral vectors) described herein in various unit doses. Each unit dose will ordinarily contain a predetermined-quantity of the therapeutic composition. The quantity to be administered, and the particular route of administration and formulation, are within the skill of those in the clinical arts. A unit dose need not be administered as a single injection but may include continuous infusion over a set period of time. Dosing may be performed using a syringe pump to control infusion rate in order to minimize damage to the cochlea. In cases in which the nucleic acid vectors are AAV vectors (e.g., AAV1, AAV2, AAV2quad(Y-F), AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, rh10, rh39, rh43, rh74, Anc80, Anc80L65, DJ/8, DJ/9, 7m8, PHP.B, PHP.eb, or PHP.S vectors), the viral vectors may be administered to the patient at a dose of, for example, from about 1×10.sup.10 vector genomes (VG) to 1×10.sup.15 VG (e.g., 1×10.sup.10 VG, 2×10.sup.10 VG, 3×10.sup.10 VG, 4×10.sup.10 VG, 5×10.sup.10 VG, 6×10.sup.10 VG, 7×10.sup.10 VG, 8×10.sup.10 VG, 9×10.sup.10 VG, 1×10.sup.11 VG, 2×10.sup.11 VG, 3×10.sup.11 VG, 4×10.sup.11 VG, 5×10.sup.11 VG, 6×10.sup.11 VG, 7×10.sup.11 VG, 8×10.sup.11 VG, 9×10.sup.11 VG, 1×10.sup.12 VG, 2×10.sup.12 VG, 3×10.sup.12 VG, 4×10.sup.12 VG, 5×10.sup.12 VG, 6×10.sup.12 VG, 7×10.sup.12 VG, 8×10.sup.12 VG, 9×10.sup.12 VG, 1×10.sup.13VG, 2×10.sup.13 VG, 3×10.sup.13 VG, 4×10.sup.13 VG, 5×10.sup.13 VG, 6×10.sup.13 VG, 7×10.sup.13 VG, 8×10.sup.13VG, 9×10.sup.13VG, 1×10.sup.14VG, 2×10.sup.14VG, 3×10.sup.14VG, 4×10.sup.14 VG, 5×10.sup.14VG, 6×10.sup.14VG, 7×10.sup.14VG, 8×10.sup.14VG, 9×10.sup.14VG, 1×10.sup.15 VG) in a volume of 1 μL to 200 μL (e.g., 1, 2, 3, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, or 200 μL).
[0244] The compositions described herein are administered in an amount sufficient to improve hearing, increase WT OTOF expression (e.g., expression in a cochlear hair cell, e.g., an inner hair cell), or increase OTOF function. Hearing may be evaluated using standard hearing tests (e.g., audiometry, ABR, electrocochleography (ECOG), and otoacoustic emissions) and may be improved by 5% or more (e.g., 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or more) compared to hearing measurements obtained prior to treatment. In some embodiments, the compositions are administered in an amount sufficient to improve the subject's ability to understand speech. The compositions described herein may also be administered in an amount sufficient to slow or prevent the development or progression of sensorineural hearing loss or auditory neuropathy (e.g., in subjects who carry a mutation in OTOF or have a family history of autosomal recessive hearing loss but do not exhibit hearing impairment, or in subjects exhibiting mild to moderate hearing loss). OTOF expression may be evaluated using immunohistochemistry, Western blot analysis, quantitative real-time PCR, or other methods known in the art for detection protein or mRNA, and may be increased by 5% or more (e.g., 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or more) compared to OTOF expression prior to administration of the compositions described herein. OTOF function may be evaluated directly (e.g., using electrophysiological methods or imaging methods to assess exocytosis) or indirectly based on hearing tests, and may be increased by 5% or more (e.g., 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or more) compared to OTOF function prior to administration of the compositions described herein. These effects may occur, for example, within 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 7 weeks, 8 weeks, 9 weeks, 10 weeks, 15 weeks, 20 weeks, 25 weeks, or more, following administration of the compositions described herein. The patient may be evaluated 1 month, 2 months, 3 months, 4 months, 5 months, 6 months or more following administration of the composition depending on the dose and route of administration used for treatment. Depending on the outcome of the evaluation, the patient may receive additional treatments.
Kits
[0245] The compositions described herein can be provided in a kit for use in treating sensorineural hearing loss or auditory neuropathy (e.g., hearing loss associated with a mutation in OTOF). Compositions may include nucleic acid vectors described herein (e.g., a first nucleic acid vector containing a polynucleotide that encodes and N-terminal portion of an OTOF protein and a second nucleic acid vector containing a polynucleotide that encodes a C-terminal portion of an OTOF protein), optionally packaged in an AAV virus capsid (e.g., AAV1, AAV9, Anc80L65, DJ/9, or 7m8). The kit can further include a package insert that instructs a user of the kit, such as a physician, to perform the methods described herein. The kit may optionally include a syringe or other device for administering the composition.
EXAMPLES
[0246] The following examples are put forth so as to provide those of ordinary skill in the art with a description of how the compositions and methods described herein may be used, made, and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention.
Example 1: Generation of Nucleic Acid Vectors that Recombine to Produce Full-Length OTOF
[0247] Gene fragments were synthesized and sub-cloned into an AAV2/cis-plasmid using restriction enzyme sites. Plasmids were maxi prepped to generate 1 mg of transfection grade plasmid. Inner ear-derived HEI-OC1 cells were seeded into a 12-well tissue culture dish 24 hours before plasmid transfection at a density of 200,000 cells/well. One microgram of each plasmid was transfected using Lipofectamine 3000 according to standard manufacturer's protocol. For wells that received both 5′ and 3′ plasmids, 1 μg of each was transfected for a total of 2 μg of DNA. As a positive control, full-length Otoferlin cDNA was also transfected. Cells were incubated with plasmid for 48 hours.
[0248] For PCR to check for recombination at the DNA level, genomic DNA was extracted from each well using Qiagen's DNeasy Blood and Tissue kit, according to standard manufacturer's protocol. PCR primers were designed to anneal to the plasmid outside of the region of overlap or splicing to generate an amplicon of ˜1200 bp. PCR was performed using My Taq 2x mastermix according to manufacturer's recommendations: annealing temperature of 58° C., elongation step of 30 seconds, and cycle number of 35×. Ten microliters of PCR product was run on a pre-cast 1.2% agarose E-gel and imaged on a BioRad gel doc imaging station. Both dual hybrid vectors (
[0249] For immunofluorescence to check for recombination and generation of protein, cells were fixed with cold 4% PFA for 20 minutes at room temperature. Cells were washed three times with PBS and then permeabilized in a blocking solution of PBS with 10% normal donkey serum and 0.01% TritonX100. Cells were incubated in primary antibody overnight (mouse-anti-Otoferlin, Abcam ab53233) at a concentration of 1:1000 at 4° C. Cells were washed three times with PBS and incubated in secondary antibody for three hours at room temperature (donkey-anti-mouse Alexa Fluor 647, ThermoFisher A-31571). Cells were washed three times in PBS and stained with DAPI for 15 minutes at room temperature. Cells were imaged using a Zeiss inverted Apotome microscope. Increased staining was observed in cells transfected with both 5′ and 3′ plasmids compared to transfection of the 5′ or 3′ plasmid alone, indicating that the dual hybrid vector (
Example 2: Administration of a Composition Containing Dual Hybrid Vectors that Express OTOF to Mice Restores Electrophysiological Signatures of Hearing Function
[0250] Homozygous (HOM) OTOF-0828X mice (a mouse model of human OTOF mutation p.Gln828Ter) were either left untreated or treated (by injection through the round window membrane) with 4E10 (4×10.sup.10) vector genomes (vg)/ear of an AAV1-Myo15 (SEQ ID NO: 38)-hOTOF (isoform 5, SEQ ID NO: 5) dual hybrid vector system in which exons 1-20 of the polynucleotide encoding the N-terminal portion of the OTOF protein and exons 21-46 and 48 of the polynucleotide encoding the C-terminal portion of the OTOF protein were delivered in separate vectors (
[0251] In a separate set of experiments, homozygous OTOF-Q828X mice were either left untreated or treated (by injection through the round window membrane) with 4E10 (4×10.sup.10) vg/ear of an AAV1-truncated chimeric CMV-chicken β-actin (smCBA)-hOTOF (isoform 5, SEQ ID NO: 5) dual hybrid vector system as described above (
[0252] In yet another set of experiments, homozygous OTOF-Q828X mice were either left untreated or treated (by injection through the round window membrane) with an AAV1-smCBA-hOTOF (isoform 5, SEQ ID NO: 5) dual hybrid vector, as described above, at either 8E9 (8×109) vg/ear (low dose), 1.6E10 (1.6×10.sup.10) vg vg/ear (mid dose), or 6.4E10 (6.4×10.sup.10) vg/ear (high dose). The first vector contained exons 1-20 of the polynucleotide encoding the N-terminal portion of the OTOF protein and exons 21-46 and 48 of the polynucleotide encoding the C-terminal portion of the OTOF protein and both vectors contained an AP recombinogenic region (SEQ ID NO: 65). ABR thresholds were used to assess hearing function at four weeks and eight weeks post-treatment (
Example 3: Administration of a Composition Containing Overlapping Dual Vectors that Express OTOF to a Subject with Sensorineural Hearing Loss
[0253] According to the methods disclosed herein, a physician of skill in the art can treat a patient, such as a human patient, with sensorineural hearing loss (e.g., sensorineural hearing loss associated with a mutation in OTOF) so as to improve or restore hearing. To this end, a physician of skill in the art can administer to the human patient a composition containing a first AAV vector (e.g., AAV1 or AAV9) containing a Myo15 promoter (e.g., SEQ ID NO: 36) operably linked to exons 1-21 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and the 500 kb immediately 3′ of the exon 21/22 boundary, and a second AAV vector (e.g., AAV1 or AAV9) containing the 500 kb immediately 5′ of the exon 21/22 boundary and exons 22-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a bGH poly(A) sequence. The composition containing the overlapping dual AAV vectors may be administered to the patient, for example, by local administration to the inner ear (e.g., injection into the perilymph), to treat sensorineural hearing loss.
[0254] Following administration of the composition to a patient, a practitioner of skill in the art can monitor the expression of OTOF, and the patient's improvement in response to the therapy, by a variety of methods. For example, a physician can monitor the patient's hearing by performing standard tests, such as audiometry, ABR, electrocochleography (ECOG), and otoacoustic emissions following administration of the composition. A finding that the patient exhibits improved hearing in one or more of the tests following administration of the composition compared to hearing test results prior to administration of the composition indicates that the patient is responding favorably to the treatment. Subsequent doses can be determined and administered as needed.
Example 4: Administration of a Composition Containing Trans-Splicing Dual Vectors that Express OTOF to a Subject with Sensorineural Hearing Loss
[0255] According to the methods disclosed herein, a physician of skill in the art can treat a patient, such as a human patient, with sensorineural hearing loss (e.g., sensorineural hearing loss associated with a mutation in OTOF) so as to improve or restore hearing. To this end, a physician of skill in the art can administer to the human patient a composition containing a first AAV vector (e.g., AAV1 or AAV9) containing a Myo15 promoter (e.g., SEQ ID NO: 36) operably linked to exons 1-19 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a splice donor sequence (e.g., SEQ ID NO: 20 or SEQ ID NO: 68) 3′ of the polynucleotide sequence, and a second AAV vector (e.g., AAV1 or AAV9) containing a splice acceptor sequence (e.g., SEQ ID NO: 21 or SEQ ID NO: 69) 5′ of exons 20-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5) and a bGH poly(A) sequence. The composition containing the trans-splicing dual AAV vectors may be administered to the patient, for example, by local administration to the inner ear (e.g., injection into the perilymph), to treat sensorineural hearing loss.
[0256] Following administration of the composition to a patient, a practitioner of skill in the art can monitor the expression of OTOF, and the patient's improvement in response to the therapy, by a variety of methods. For example, a physician can monitor the patient's hearing by performing standard tests, such as audiometry, ABR, electrocochleography (ECOG), and otoacoustic emissions following administration of the composition. A finding that the patient exhibits improved hearing in one or more of the tests following administration of the composition compared to hearing test results prior to administration of the composition indicates that the patient is responding favorably to the treatment. Subsequent doses can be determined and administered as needed.
Example 5: Administration of a Composition Containing Dual Hybrid Vectors that Express OTOF to a Subject with Sensorineural Hearing Loss
[0257] According to the methods disclosed herein, a physician of skill in the art can treat a patient, such as a human patient, with sensorineural hearing loss (e.g., sensorineural hearing loss associated with a mutation in OTOF) so as to improve or restore hearing. To this end, a physician of skill in the art can administer to the human patient a composition containing a first AAV vector (e.g., AAV1 or AAV9) containing a Myo15 promoter (e.g., SEQ ID NO: 36) operably linked to exons 1-19 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), a splice donor sequence (e.g., SEQ ID NO: 20 or SEQ ID NO: 68) 3′ of the polynucleotide sequence, and an F1 phage recombinogenic region (e.g., an F1 phage AK gene, SEQ ID NO: 19) 3′ of the splice donor sequence, and a second nucleic acid vector containing an F1 phage recombinogenic region (e.g., an F1 phage AK gene, SEQ ID NO: 19), a splice acceptor sequence (e.g., SEQ ID NO: 21 or SEQ ID NO: 69) 3′ of the recombinogenic region, a polynucleotide 3′ of the splice acceptor sequence that contains exons 20-48 of a polynucleotide encoding an OTOF protein (e.g., human OTOF, e.g., SEQ ID NO: 1 or SEQ ID NO: 5), and a bGH poly(A) sequence. The first and second dual hybrid AAV vectors can optionally include a degradation signal sequence (e.g., SEQ ID NO: 22) positioned 3′ of the recombinogenic region in the first nucleic acid vector, and positioned between the recombinogenic region and the splice acceptor sequence in the second nucleic acid vector. The composition containing the dual hybrid AAV vectors may be administered to the patient, for example, by local administration to the inner ear (e.g., injection into the perilymph), to treat sensorineural hearing loss.
Following administration of the composition to a patient, a practitioner of skill in the art can monitor the expression of OTOF, and the patient's improvement in response to the therapy, by a variety of methods. For example, a physician can monitor the patient's hearing by performing standard tests, such as audiometry, ABR, electrocochleography (ECOG), and otoacoustic emissions following administration of the composition. A finding that the patient exhibits improved hearing in one or more of the tests following administration of the composition compared to hearing test results prior to administration of the composition indicates that the patient is responding favorably to the treatment. Subsequent doses can be determined and administered as needed.
Other Embodiments
[0258] Various modifications and variations of the described invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the art are intended to be within the scope of the invention. Other embodiments are in the claims.