METHODS OF TREATING HEARING LOSS USING A SECRETED TARGET PROTEIN
20220396806 · 2022-12-15
Inventors
- Emmanuel J. Simons (Brookline, MA, US)
- Robert Ng (Newton, MA, US)
- Danielle R. Lenz (Brookline, MA, US)
- Hao Chiang (Somerville, MA, US)
Cpc classification
C12N2750/14143
CHEMISTRY; METALLURGY
C12N2830/42
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
International classification
Abstract
Provided herein are compositions that include a single nucleic acid vector or two different nucleic acid vectors, and the use of these compositions to treat hearing loss in a subject.
Claims
1. A construct comprising a coding sequence operably linked to a promoter, wherein the coding sequence encodes a secreted target protein.
2. The construct of claim 1, wherein the coding sequence is an NDP gene.
3.-7. (canceled)
8. The construct of claim 1, wherein the coding sequence encodes a heat shock protein.
9. The construct of claim 1, wherein the coding sequence is an HSPA1A gene.
10.-15. (canceled)
16. The construct of claim 8, wherein the heat shock protein is a member of the Hsp40/DNAJ family.
17.-18. (canceled)
19. The construct of claim 1, wherein the coding sequence is an DNAJB1 gene.
20.-22. (canceled)
23. The construct of claim 1, wherein the coding sequence is an DNAJB5 gene.
24.-28. (canceled)
29. The construct of claim 1, wherein the secreted target protein is a Norrin Cystine Knot Growth Factor (NDP) protein.
30.-31. (canceled)
32. The construct of claim 1, wherein the secreted target protein is a Heat Shock Protein Family A (HSP) Member 1a (HSPA1A) protein.
33.-36. (canceled)
37. The construct of claim 1, wherein the secreted target protein is Heat Shock Protein Family Type 2 subfamily B protein.
38. (canceled)
39. The construct of claim 1, wherein the promoter is an inducible promoter, a constitutive promoter, or a tissue-specific promoter.
40.-43. (canceled)
44. The construct of claim 1, further comprising two AAV inverted terminal repeats (ITRs), wherein the two AAV ITRs flank the coding sequence and promoter.
45.-54. (canceled)
55. An AAV particle comprising the construct of claim 1.
56.-57. (canceled)
58. A composition comprising the construct of claim 1.
59. A composition comprising the AAV particle of claim 55.
60. The composition of claim 58, wherein the composition is a pharmaceutical composition.
61. The composition of claim 60, further comprising a pharmaceutically acceptable carrier.
62. An ex vivo cell comprising a composition of claim 58.
63. A method comprising, transfecting an ex vivo cell with: (i) a construct of claim 44; and (ii) one or more helper plasmids collectively comprising an AAV Rep gene, AAV Cap gene, AAV VA gene, AAV E2a gene, and AAV E4 gene.
64. (canceled)
65. A method of treatment comprising: introducing a composition of claim 60 into the inner ear of a subject.
66.-68. (canceled)
69. The method claim 65, further comprising measuring a hearing level of the subject.
70. (canceled)
71. The method of claim 69, further comprising comparing the hearing level of the subject to a reference hearing level.
72.-73. (canceled)
74. The method of claim 65, further comprising measuring a level of secreted target protein in the subject.
75.-76. (canceled)
77. The method of claim 74, further comprising comparing the level of secreted target protein in the subject to a reference secreted target protein level.
78.-83. (canceled)
Description
BRIEF DESCRIPTION OF DRAWING
[0194]
[0195]
[0196]
[0197]
[0198]
[0199]
[0200]
[0201]
[0202]
[0203]
[0204]
[0205]
[0206]
[0207]
[0208]
[0209]
[0210]
[0211]
[0212]
[0213]
[0214]
DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS
Hearing Loss
[0215] Generally, an ear can be described as including: an outer ear, middle ear, inner ear, hearing (acoustic) nerve, and auditory system (which processes sound as it travels from the ear to the brain). In addition to detecting sound, ears also help to maintain balance. Thus, in some embodiments, disorders of the inner ear can cause hearing loss, tinnitus, vertigo, imbalance, or combinations thereof.
[0216] Hearing loss can be the result of genetic factors, environmental factors, or a combination of genetic and environmental factors. About half of all people who have tinnitus—phantom noises in their auditory system (ringing, buzzing, chirping, humming, or beating)—also have an over-sensitivity to/reduced tolerance for certain sound frequency and volume ranges, known as hyperacusis (also spelled hyperacousis). A variety of non-syndromic and syndromic-related hearing losses will be known to those of skill in the art (e.g., DFNB4, and Pendred syndrome, respectively). Environmental causes of hearing impairment or loss may include, e.g., certain medications, specific infections before or after birth, and/or exposure to loud noise over an extended period. In some embodiments, hearing loss can result from noise, ototoxic agents, presbycusis, disease, infection or cancers that affect specific parts of the ear. In some embodiments, ischemic damage can cause hearing loss via pathophysiological mechanisms. In some embodiments, intrinsic abnormalities, like congenital mutations to genes that play an important role in cochlear anatomy or physiology, or genetic or anatomical changes in supporting and/or hair cells can be responsible for or contribute to hearing loss.
[0217] Hearing loss and/or deafness is one of the most common human sensory deficits, and can occur for many reasons. In some embodiments, a subject may be born with hearing loss or without hearing, while others may lose hearing slowly over time. Approximately 36 million American adults report some degree of hearing loss, and one in three people older than 60 and half of those older than 85 experience hearing loss. Approximately 1.5 in 1,000 children are born with profound hearing loss, and another two to three per 1,000 children are born with partial hearing loss (Smith et al., 2005, Lancet 365:879-890, which is incorporated in its entirety herein by reference). More than half of these cases are attributed to a genetic basis (Di Domenico, et al., 2011, J. Cell. Physiol. 226:2494-2499, which is incorporated in its entirety herein by reference).
[0218] Treatments for hearing loss currently consist of hearing amplification for mild to severe losses and cochlear implantation for severe to profound losses (Kral and O'Donoghue, 2010, N. Engl. J. Med. 363:1438-1450, which is incorporated in its entirety herein by reference). Recent research in this arena has focused on cochlear hair cell regeneration, applicable to the most common forms of hearing loss, including presbycusis, noise damage, infection, and ototoxicity. There remains a need for effective treatments, such as gene therapy, which can repair and/or mitigate a source of a hearing problem (see e.g., WO 2018/039375, WO 2019/165292, and PCT filing application US2019/060328, each of which is incorporated in its entirety herein by reference).
[0219] In some embodiments, non-syndromic hearing loss and/or deafness is not associated with other signs and symptoms. In some embodiments, syndromic hearing loss and/or deafness occurs in conjunction with abnormalities in other parts of the body. Approximately 70 percent to 80 percent of genetic hearing loss and/or deafness cases are non-syndromic; remaining cases are often caused by specific genetic syndromes. Non-syndromic deafness and/or hearing loss can have different patterns of inheritance, and can occur at any age. Types of non-syndromic deafness and/or hearing loss are generally named according to their inheritance patterns. For example, autosomal dominant forms are designated DFNA, autosomal recessive forms are DFNB, and X-linked forms are DFN. Each type is also numbered in the order in which it was first described. For example, DFNA1 was the first described autosomal dominant type of non-syndromic deafness. Between 75 percent and 80 percent of genetically causative hearing loss and/or deafness cases are inherited in an autosomal recessive pattern, which means both copies of the gene in each cell have mutations. Usually, each parent of an individual with autosomal recessive hearing loss and/or deafness is a carrier of one copy of the mutated gene, but is not affected by this form of hearing loss. Another 20 percent to 25 percent of non-syndromic hearing loss and/or deafness cases are autosomal dominant, which means one copy of the altered gene in each cell is sufficient to result in deafness and/or hearing loss. People with autosomal dominant deafness and/or hearing loss most often inherit an altered copy of the gene from a parent who is deaf and/or has hearing loss. Between 1 to 2 percent of cases of deafness and/or hearing loss show an X-linked pattern of inheritance, which means the mutated gene responsible for the condition is located on the X chromosome (one of the two sex chromosomes). Males with X-linked non-syndromic hearing loss and/or deafness tend to develop more severe hearing loss earlier in life than females who inherit a copy of the same gene mutation. A characteristic of X-linked inheritance is that fathers cannot pass X-linked traits to their sons. Mitochondrial non-syndromic deafness, which results from changes to mitochondrial DNA, occurs in less than one percent of cases in the United States. The altered mitochondrial DNA is passed from a mother to all of her sons and daughters. This type of deafness is not inherited from fathers. The causes of syndromic and non-syndromic deafness and/or hearing loss are complex. Researchers have identified more than 30 genes that, when altered, are associated with syndromic and/or non-syndromic deafness and/or hearing loss; however, some of these genes have not been fully characterized. Different mutations in the same gene can be associated with different types of deafness and/or hearing loss, and some genes are associated with both syndromic and non-syndromic deafness and/or hearing loss.
[0220] In some embodiments, deafness and/or hearing loss can be conductive (arising from the ear canal or middle ear), sensorineural (arising from the inner ear or auditory nerve), or mixed. In some embodiments, non-syndromic deafness and/or hearing loss is associated with permanent hearing loss caused by damage to structures in the inner ear (sensorineural deafness). In some embodiments, sensorineural hearing loss can be due to poor hair cell function. In some embodiments, sensorineural hearing impairments involve the eighth cranial nerve (the vestibulocochlear nerve) or the auditory portions of the brain. In some such embodiments, only the auditory centers of the brain are affected. In such a situation, cortical deafness may occur, where sounds may be heard at normal thresholds, but quality of sound perceived is so poor that speech cannot be understood. Hearing loss that results from changes in the middle ear is called conductive hearing loss. Some forms of non-syndromic deafness and/or hearing loss involve changes in both the inner ear and the middle ear, called mixed hearing loss. Hearing loss and/or deafness that is present before a child learns to speak can be classified as prelingual or congenital. Hearing loss and/or deafness that occurs after the development of speech can be classified as postlingual. Most autosomal recessive loci related to syndromic or non-syndromic hearing loss cause prelingual severe-to-profound hearing loss.
[0221] As is known to those of skill in the art, hair cells are sensory receptors for both auditory and vestibular systems of vertebrate ears. Hair cells detect movement in the environment and, in mammals, hair cells are located within the cochlea of the ear, in the organ of Corti. Mammalian ears are known to have two types of hair cells—inner hair cells and outer hair cells. Outer hair cells can amplify low level sound frequencies, either through mechanical movement of hair cell bundles or electrically-driven movement of hair cell soma. Inner hair cells transform vibrations in cochlear fluid into electrical signals that the auditory nerve transmits to the brain. In some embodiments, hair cells may be abnormal at birth, or damaged during the lifetime of an individual. In some embodiments, outer hair cells may be able to regenerate. In some embodiments, inner hair cells are not capable of regeneration after illness or injury. In some embodiments, sensorineural hearing loss is due to abnormalities in hair cells.
[0222] As is known to those of skill in the art, hair cells do not occur in isolation, and their function is supported by a wide variety of cells which can collectively be referred to as supporting cells. Supporting cells may fulfil numerous functions, and include a number of cell types, including but not limited to Hensen's cells, Deiters' cells, pillar cells, Claudius cells, inner phalangeal cells, and border cells. In some embodiments, sensorineural hearing loss is due to abnormalities in supporting cells. In some embodiments, supporting cells may be abnormal at birth, or damaged during the lifetime of an individual. In some embodiments, supporting cells may be able to regenerate. In some embodiments, certain supporting cells may not be capable of regeneration.
[0223] As described herein, mutations in an NDP gene that encodes the “norrin cysteine knot growth factor” (NDP) protein may cause hearing loss and vision loss. For example, mutations in NDP lead to Norrie disease pseudoglioma.
[0224] Provided herein are composition including a single adeno-associated virus (AAV) vector, wherein the single AAV vector that includes a nucleic acid sequence that encodes a secreted target protein; and when introduced into a primate cell, a nucleic acid encoding a full-length secreted target protein is generated at the locus of the secreted target protein, and the primate cell expresses and secretes the secreted target protein.
[0225] Also provided herein are compositions including at least two different nucleic acid vectors, wherein: each of the at least two different vectors includes a coding sequence that encodes a different portion of a secreted target protein, each of the encoded portions being at least 30 amino acid residues in length, wherein the amino acid sequence of each of the encoded portions may optionally partially overlap with the amino acid sequence of a different one of the encoded portions; no single vector of the at least two different vectors encodes a full-length secreted target protein; at least one of the coding sequences includes a nucleotide sequence spanning two neighboring exons of the secreted target protein genomic DNA, and lacks an intronic sequence between the two neighboring exons; and when introduced into a mammalian cell the at least two different vectors undergo concatamerization or homologous recombination with each other, thereby forming a recombined nucleic acid that encodes a full-length secreted target protein.
[0226] Also provided herein are compositions including two different nucleic acid vectors, wherein: a first nucleic acid vector of the two different nucleic acid vectors includes a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ of the promoter, and a splicing donor signal sequence positioned at the 3′ end of the first coding sequence; and a second nucleic acid vector of the two different nucleic acid vectors includes a splicing acceptor signal sequence, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the amino acid sequences of the encoded portions do not overlap, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell, to produce RNA transcripts, splicing occurs between the splicing donor signal sequence on one transcript and the splicing acceptor signal sequence on the other transcript, thereby forming a recombined RNA molecule that encodes a full-length secreted target protein.
[0227] Also provided herein are compositions including: a first nucleic acid vector including a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ of the promoter, a splicing donor signal sequence positioned at the 3′ end of the first coding sequence, and a first detectable marker gene positioned 3′ of the splicing donor signal sequence; and a second nucleic acid vector, different from the first nucleic acid vector, including a second detectable marker gene, a splicing acceptor signal sequence positioned 3′ of the second detectable marker gene, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence positioned at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the respective amino acid sequences of the encoded portions do not overlap with each other, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell to produce RNA transcripts, splicing occurs between the splicing donor signal on one transcript and the splicing acceptor signal on the other transcript, thereby forming a recombined RNA molecule that encodes a full-length secreted target protein.
[0228] Also provided herein are compositions including: a first nucleic acid vector including a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ to the promoter, a splicing donor signal sequence positioned at the 3′ end of the first coding sequence, and a F1 phage recombinogenic region positioned 3′ to the splicing donor signal sequence; and a second nucleic acid vector, different from the first nucleic acid vector, including a second F1 phage recombinogenic region, a splicing acceptor signal sequence positioned 3′ of the second F1 phage recombinogenic region, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence positioned at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the respective amino acid sequences of the encoded portions do not overlap with each other, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell to produce RNA transcripts, splicing occurs between the splicing donor signal one transcript and the splicing acceptor signal on the other transcript, thereby forming a recombined RNA molecule that encodes a full-length secreted target protein.
[0229] Also provided herein are methods including introducing into a cochlea of a mammal a therapeutically effective amount of any of the compositions described herein.
[0230] Also provided herein are methods of increasing expression of a full-length secreted target protein in a mammalian cell that include introducing any of the compositions described herein into the mammalian cell.
[0231] Also provided herein are methods of increasing expression of a full-length secreted target protein in an inner hair cell in a cochlea of a mammal that include introducing into the cochlea a therapeutically effective amount of any of the compositions described herein.
[0232] Also provided herein are methods of treating syndromic and non-syndromic sensorineural hearing loss in a subject identified as having a defective secreted target gene that include: administering a therapeutically effective amount of any of the compositions described herein into the cochlea of the subject.
[0233] Also provided herein are methods of treating or preventing vision loss in a subject identified as having a defective NDP gene that include: administering a therapeutically effective amount of any of the compositions described herein into an inner ear or central nervous system of the subject, or systemically administering a therapeutically effective amount of any of the compositions described herein to the subject.
[0234] Also provided herein are pharmaceutical compositions and kits that include any of the compositions or AAV vectors described herein.
[0235] Additional non-limiting aspects of the compositions, kits and methods are described herein and can be used in any combination without limitation.
Secreted Target Proteins
[0236] The term “secreted target protein” refers to a protein encoded by DNA that if expressed in a cell (e.g., any of the exemplary cells described herein) can be secreted. The term “secreted target protein” also refers to a protein that includes a secretion signal. Non-limiting examples of secreted target proteins and secretion signals are described herein. In some examples, the secreted target protein is a NDP protein (e.g., any of the NDP proteins described herein). In some examples, the secreted target protein is an HSPA1A protein (e.g., any of the HPSA1A proteins described herein).
[0237] In some embodiments, the term “secreted target protein” means a protein that is expressed and secreted by a cell in a primate, and functionally contributes, at least in part, to the hearing in the primate. Non-limiting examples of secreted target proteins include NDP, HSPA1A, DNAJB1, or DNAJB5.
[0238] The term “mutation in a secreted target gene” refers to a modification in a wildtype secreted target gene that results in the production of a secreted target protein having one or more of: a deletion in one or more amino acids, one or more amino acid substitutions, and one or more amino acid insertions as compared to the wildtype secreted target protein, and/or results in a decrease in the expressed level of the encoded secreted target protein in a primate cell as compared to the expressed level of the encoded secreted target protein in a primate cell not having a mutation. In some embodiments, a mutation can result in the production of a secreted target protein having a deletion in one or more amino acids (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acids). In some embodiments, the mutation can result in a frameshift in the secreted target gene.
Norrin Cystine Knot Growth Factor (NDP)
[0239] The NDP gene encodes “norrin cystine knot growth factor” (NDP). The human ND gene is located on chromosome Xp11.3. It contains 3 exons, encompassing ˜ kilobases (kb) (NCBI Accession No. NG_009832.1). NDP encodes secreted norrin protein that plays a role in retina vascularization of Wnt signaling pathway through (FZDA) and (LRP5) coreceptor. Mutations in NDP have been associated with Norrie Disease Pseudoglioma, an X-linked recessive syndromic hearing loss that is characterized by early childhood retinopathy. About one third of individuals with Norrie disease develop progressive hearing loss, and more than half experience developmental delays in motor skills.
[0240] Various mutations in the NDP gene have been associated with hearing loss (e.g., Norrie Disease Pseudoglioma). For example, the p.His4ArgfsX21, p.Asp23GlufsX9, p.Arg38Cys, p.Ile48ValfsX55, p.His50Asp, p.Ser57*, p.Cys93Arg, p.Lys104Gln, p.Gly113Asp, p.Arg121Gln, and p.Cys126Arg mutation have each been associated with Norrie disease pseudoglioma. See, e.g., Musada et al., Mol. Vis. 22:491-502, 2016; Chamney et al., Eye (Lond) 25(12):1658, 2011; Liu et al., J Chin. Med. Assoc. 79(11):633-638, 2016; and Parzefall et al., Audiol. Neurootol. 19(3):203-209, 2014, each of which is hereby incorporated by reference in its entirety.
[0241] As used herein, the term “active NDP protein” means a protein encoded by DNA that, if substituted for both wildtype alleles encoding full-length NDP protein in auditory hair cells, or ocular cells, of what is otherwise a wildtype mammal, and if expressed in the auditory hair cells, or ocular cells, of that mammal, results in that mammal's having a level of hearing, or vision, approximating the normal level of hearing, or vision, of a similar mammal that is entirely wildtype. Non-limiting examples of active NDP proteins are full-length NDP proteins (e.g., any of the full-length NDP proteins described herein).
[0242] For example, an active NDP protein can include a sequence of a wildtype, full-length NDP protein (e.g., a wildtype, human, full-length NDP protein) including about 1 to about 24 amino acid substitutions (e.g., about 1 to about 22 amino acid substitutions, about 1 to about 20 amino acid substitutions, about 1 to about 18 amino acid substitutions, about 1 to about 16 amino acid substitutions, about 1 to about 15 amino acid substitutions, about 1 to about 14 amino acid substitutions, about 1 to about 12 amino acid substitutions, about 1 to about 10 amino acid substitutions, about 1 to about 8 amino acid substitutions, about 1 to about 6 amino acid substitutions, about 1 to about 5 amino acid substitutions, about 1 to about 4 amino acid substitutions, about 1 to about 2 amino acid substitutions, about 2 to about 24 amino acid substitutions, about 2 to about 22 amino acid substitutions, about 2 to about 20 amino acid substitutions, about 2 to about 18 amino acid substitutions, about 2 to about 16 amino acid substitutions, about 2 to about 15 amino acid substitutions, about 2 to about 14 amino acid substitutions, about 2 to about 12 amino acid substitutions, about 2 to about 10 amino acid substitutions, about 2 to about 8 amino acid substitutions, about 2 to about 6 amino acid substitutions, about 2 to about 5 amino acid substitutions, about 2 to about 4 amino acid substitutions, about 5 to about 24 amino acid substitutions, about 5 to about 22 amino acid substitutions, about 5 to about 20 amino acid substitutions, about 5 to about 18 amino acid substitutions, about 5 to about 16 amino acid substitutions, about 5 to about 15 amino acid substitutions, about 5 to about 14 amino acid substitutions, about 5 to about 12 amino acid substitutions, about 5 to about 10 amino acid substitutions, about 5 to about 8 amino acid substitutions, about 5 to about 6 amino acid substitutions, about 6 to about 24 amino acid substitutions, about 6 to about 22 amino acid substitutions, about 6 to about 20 amino acid substitutions, about 6 to about 18 amino acid substitutions, about 6 to about 16 amino acid substitutions, about 6 to about 15 amino acid substitutions, about 6 to about 14 amino acid substitutions, about 6 to about 12 amino acid substitutions, about 6 to about 10 amino acid substitutions, about 6 to about 8 amino acid substitutions, about 8 to about 24 amino acid substitutions, about 8 to about 22 amino acid substitutions, about 8 to about 20 amino acid substitutions, about 8 to about 18 amino acid substitutions, about 8 to about 16 amino acid substitutions, about 8 to about 15 amino acid substitutions, about 8 to about 14 amino acid substitutions, about 8 to about 12 amino acid substitutions, about 8 to about 10 amino acid substitutions, about 10 to about 24 amino acid substitutions, about 10 to about 22 amino acid substitutions, about 10 to about 20 amino acid substitutions, about 10 to about 18 amino acid substitutions, about 10 to about 16 amino acid substitutions, about 10 to about 15 amino acid substitutions, about 10 to about 14 amino acid substitutions, about 10 to about 12 amino acid substitutions, about 12 to about 24 amino acid substitutions, about 12 to about 22 amino acid substitutions, about 12 to about 20 amino acid substitutions, about 12 to about 18 amino acid substitutions, about 12 to about 16 amino acid substitutions, about 12 to about 15 amino acid substitutions, about 12 to about 14 amino acid substitutions, about 14 to about 24 amino acid substitutions, about 14 to about 22 amino acid substitutions, about 14 to about 20 amino acid substitutions, about 14 to about 18 amino acid substitutions, about 14 to about 16 amino acid substitutions, about 14 to about 15 amino acid substitutions, about 15 to about 24 amino acid substitutions, about 15 to about 22 amino acid substitutions, about 15 to about 20 amino acid substitutions, about 15 to about 18 amino acid substitutions, about 15 to about 16 amino acid substitutions, about 16 to about 24 amino acid substitutions, about 16 to about 22 amino acid substitutions, about 16 to about 20 amino acid substitutions, about 16 to about 18 amino acid substitutions, about 18 to about 24 amino acid substitutions, about 18 to about 22 amino acid substitutions, about 18 to about 20 amino acid substitutions, about 20 to about 24 amino acid substitutions, about 20 to about 22 amino acid substitutions, or about 22 to about 24 amino acid substitutions).
[0243] One skilled in the art would appreciate that amino acids that are not conserved between wildtype NDP proteins from different species can be mutated without losing activity, while those amino acids that are conserved between wildtype NDP proteins from different species should not be mutated as they are more likely (than amino acids that are not conserved between different species) to be involved in activity.
[0244] An active NDP protein can include, e.g., a sequence of a wildtype, full-length NDP protein (e.g., a wildtype, human, full-length NDP protein) that has about 1 to about 80 amino acids (e.g., about 1 to about 75 amino acids, about 1 to about 70 amino acids, about 1 to about 65 amino acids, about 1 to about 60 amino acids, about 1 to about 55 amino acids, about 1 to about 50 amino acids, about 1 to about 45 amino acids, about 1 to about 40 amino acids, about 1 to about 35 amino acids, about 1 to about 30 amino acids, about 1 to about 25 amino acids, about 1 to about 20 amino acids, about 1 to about 15 amino acids, about 1 to about 10 amino acids, about 1 to about 5 amino acids, about 5 to about 80 amino acids, about 5 to about 75 amino acids, about 5 to about 70 amino acids, about 5 to about 65 amino acids, about 5 to about 60 amino acids, about 5 to about 55 amino acids, about 5 to about 50 amino acids, about 5 to about 45 amino acids, about 5 to about 40 amino acids, about 5 to about 35 amino acids, about 5 to about 30 amino acids, about 5 to about 25 amino acids, about 5 to about 20 amino acids, about 5 to about 15 amino acids, about 5 to about 10 amino acids, about 10 to about 80 amino acids, about 10 to about 75 amino acids, about 10 to about 70 amino acids, about 10 to about 65 amino acids, about 10 to about 60 amino acids, about 10 to about 55 amino acids, about 10 to about 50 amino acids, about 10 to about 45 amino acids, about 10 to about 40 amino acids, about 10 to about 35 amino acids, about 10 to about 30 amino acids, about 10 to about 25 amino acids, about 10 to about 20 amino acids, about 10 to about 15 amino acids, about 15 to about 80 amino acids, about 15 to about 75 amino acids, about 15 to about 70 amino acids, about 15 to about 65 amino acids, about 15 to about 60 amino acids, about 15 to about 55 amino acids, about 15 to about 50 amino acids, about 15 to about 45 amino acids, about 15 to about 40 amino acids, about 15 to about 35 amino acids, about 15 to about 30 amino acids, about 15 to about 25 amino acids, about 15 to about 20 amino acids, about 20 to about 80 amino acids, about 20 to about 75 amino acids, about 20 to about 70 amino acids, about 20 to about 65 amino acids, about 20 to about 60 amino acids, about 20 to about 55 amino acids, about 20 to about 50 amino acids, about 20 to about 45 amino acids, about 20 to about 40 amino acids, about 20 to about 35 amino acids, about 20 to about 30 amino acids, about 20 to about 25 amino acids, about 25 to about 80 amino acids, about 25 to about 75 amino acids, about 25 to about 70 amino acids, about 25 to about 65 amino acids, about 25 to about 60 amino acids, about 25 to about 55 amino acids, about 25 to about 50 amino acids, about 25 to about 45 amino acids, about 25 to about 40 amino acids, about 25 to about 35 amino acids, about 25 to about 30 amino acids, about 30 to about 80 amino acids, about 30 to about 75 amino acids, about 30 to about 70 amino acids, about 30 to about 65 amino acids, about 30 to about 60 amino acids, about 30 to about 55 amino acids, about 30 to about 50 amino acids, about 30 to about 45 amino acids, about 30 to about 40 amino acids, about 30 to about 35 amino acids, about 35 to about 80 amino acids, about 35 to about 75 amino acids, about 35 to about 70 amino acids, about 35 to about 65 amino acids, about 35 to about 60 amino acids, about 35 to about 55 amino acids, about 35 to about 50 amino acids, about 35 to about 45 amino acids, about 35 to about 40 amino acids, about 40 to about 80 amino acids, about 40 to about 75 amino acids, about 40 to about 70 amino acids, about 40 to about 65 amino acids, about 40 to about 60 amino acids, about 40 to about 55 amino acids, about 40 to about 50 amino acids, about 40 to about 45 amino acids, about 45 to about 80 amino acids, about 45 to about 75 amino acids, about 45 to about 70 amino acids, about 45 to about 65 amino acids, about 45 to about 60 amino acids, about 45 to about 55 amino acids, about 45 to about 50 amino acids, about 50 to about 80 amino acids, about 50 to about 75 amino acids, about 50 to about 70 amino acids, about 50 to about 65 amino acids, about 50 to about 60 amino acids, about 50 to about 55 amino acids, about 55 to about 80 amino acids, about 55 to about 75 amino acids, about 55 to about 70 amino acids, about 55 to about 65 amino acids, about 55 to about 60 amino acids, about 60 to about 80 amino acids, about 60 to about 75 amino acids, about 60 to about 70 amino acids, about 60 to about 65 amino acids, about 65 to about 80 amino acids, about 65 to about 75 amino acids, about 65 to about 70 amino acids, about 70 to about 80 amino acids, about 70 to about 75 amino acids, or about 75 to about 80 amino acids), removed from its N-terminus and/or 1 amino acid to 80 amino acids (or any of the subranges of this range described herein) removed from its C-terminus.
[0245] In some embodiments, an active NDP protein can, e.g., include the sequence of a wildtype, full-length NDP protein where 1 amino acid to 50 amino acids, 1 amino acid to 45 amino acids, 1 amino acid to 40 amino acids, 1 amino acid to 35 amino acids, 1 amino acid to 30 amino acids, 1 amino acid to 25 amino acids, 1 amino acid to 20 amino acids, 1 amino acid to 15 amino acids, 1 amino acid to 10 amino acids, 1 amino acid to 9 amino acids, 1 amino acid to 8 amino acids, 1 amino acid to 7 amino acids, 1 amino acid to 6 amino acids, 1 amino acid to 5 amino acids, 1 amino acid to 4 amino acids, 1 amino acid to 3 amino acids, about 2 amino acids to 50 amino acids, about 2 amino acids to 45 amino acids, about 2 amino acids to 40 amino acids, about 2 amino acids to 35 amino acids, about 2 amino acids to 30 amino acids, about 2 amino acids to 25 amino acids, about 2 amino acids to 20 amino acids, about 2 amino acids to 15 amino acids, about 2 amino acids to 10 amino acids, about 2 amino acids to 9 amino acids, about 2 amino acids to 8 amino acids, about 2 amino acids to 7 amino acids, about 2 amino acids to 6 amino acids, about 2 amino acids to 5 amino acids, about 2 amino acids to 4 amino acids, about 3 amino acids to 50 amino acids, about 3 amino acids to 45 amino acids, about 3 amino acids to 40 amino acids, about 3 amino acids to 35 amino acids, about 3 amino acids to 30 amino acids, about 3 amino acids to 25 amino acids, about 3 amino acids to 20 amino acids, about 3 amino acids to 15 amino acids, about 3 amino acids to 10 amino acids, about 3 amino acids to 9 amino acids, about 3 amino acids to 8 amino acids, about 3 amino acids to 7 amino acids, about 3 amino acids to 6 amino acids, about 3 amino acids to 5 amino acids, about 4 amino acids to 50 amino acids, about 4 amino acids to 45 amino acids, about 4 amino acids to 40 amino acids, about 4 amino acids to 35 amino acids, about 4 amino acids to 30 amino acids, about 4 amino acids to 25 amino acids, about 4 amino acids to 20 amino acids, about 4 amino acids to 15 amino acids, about 4 amino acids to 10 amino acids, about 4 amino acids to 9 amino acids, about 4 amino acids to 8 amino acids, about 4 amino acids to 7 amino acids, about 4 amino acids to 6 amino acids, about 5 amino acids to 50 amino acids, about 5 amino acids to 45 amino acids, about 5 amino acids to 40 amino acids, about 5 amino acids to 35 amino acids, about 5 amino acids to 30 amino acids, about 5 amino acids to 25 amino acids, about 5 amino acids to 20 amino acids, about 5 amino acids to 15 amino acids, about 5 amino acids to 10 amino acids, about 5 amino acids to 9 amino acids, about 5 amino acids to 8 amino acids, about 5 amino acids to 7 amino acids, about 6 amino acids to 50 amino acids, about 6 amino acids to 45 amino acids, about 6 amino acids to 40 amino acids, about 6 amino acids to 35 amino acids, about 6 amino acids to 30 amino acids, about 6 amino acids to 25 amino acids, about 6 amino acids to 20 amino acids, about 6 amino acids to 15 amino acids, about 6 amino acids to 10 amino acids, about 6 amino acids to 9 amino acids, about 6 amino acids to 8 amino acids, about 7 amino acids to 50 amino acids, about 7 amino acids to 45 amino acids, about 7 amino acids to 40 amino acids, about 7 amino acids to 35 amino acids, about 7 amino acids to 30 amino acids, about 7 amino acids to 25 amino acids, about 7 amino acids to 20 amino acids, about 7 amino acids to 15 amino acids, about 7 amino acids to 10 amino acids, about 7 amino acids to 9 amino acids, about 8 amino acids to 50 amino acids, about 8 amino acids to 45 amino acids, about 8 amino acids to 40 amino acids, about 8 amino acids to 35 amino acids, about 8 amino acids to 30 amino acids, about 8 amino acids to 25 amino acids, about 8 amino acids to 20 amino acids, about 8 amino acids to 15 amino acids, about 8 amino acids to 10 amino acids, about 10 amino acids to 50 amino acids, about 10 amino acids to 45 amino acids, about 10 amino acids to 40 amino acids, about 10 amino acids to 35 amino acids, about 10 amino acids to 30 amino acids, about 10 amino acids to 25 amino acids, about 10 amino acids to 20 amino acids, about 10 amino acids to 15 amino acids, about 15 amino acids to 50 amino acids, about 15 amino acids to 45 amino acids, about 15 amino acids to 40 amino acids, about 15 amino acids to 35 amino acids, about 15 amino acids to 30 amino acids, about 15 amino acids to 25 amino acids, about 15 amino acids to 20 amino acids, about 20 amino acids to 50 amino acids, about 20 amino acids to 45 amino acids, about 20 amino acids to 40 amino acids, about 20 amino acids to 35 amino acids, about 20 amino acids to 30 amino acids, about 20 amino acids to 25 amino acids, about 25 amino acids to 50 amino acids, about 25 amino acids to 45 amino acids, about 25 amino acids to 40 amino acids, about 25 amino acids to 35 amino acids, about 25 amino acids to 30 amino acids, about 30 amino acids to 50 amino acids, about 30 amino acids to 45 amino acids, about 30 amino acids to 40 amino acids, about 30 amino acids to 35 amino acids, about 35 amino acids to 50 amino acids, about 35 amino acids to 45 amino acids, about 35 amino acids to 40 amino acids, about 40 amino acids to 50 amino acids, about 40 amino acids to 45 amino acids, or about 45 amino acids to about 50 amino acids, are inserted. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) can be inserted as a contiguous sequence into the sequence of a wildtype, full-length NDP protein. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) are inserted in multiple, non-contiguous places in the sequence of a wildtype, full-length NDP protein. As can be appreciated in the art, the 1 amino acid to 50 amino acids can be inserted into a portion of the sequence of a wildtype, full-length NDP protein that is not well-conserved between species.
[0246] Methods of detecting mutations in a gene are well-known in the art. Non-limiting examples of such techniques include: real-time polymerase chain reaction (RT-PCR), PCR, sequencing, Southern blotting, and Northern blotting.
[0247] An exemplary human wildtype NDP protein is or includes the sequence of SEQ ID NO: 1, 33, 35, 36 or 37. Non-limiting examples of nucleic acid encoding a wildtype NDP protein are or include SEQ ID NO: 2 or SEQ ID NO: 34. As can be appreciated in the art, at least some or all of the codons in SEQ ID NO: 2 or SEQ ID NO: 34 can be codon-optimized to allow for optimal expression in a non-human mammal.
[0248] Exemplary wildtype NDP protein sequences are or include SEQ ID NO: 1, 33, 35, 36, and 37. Exemplary DNA sequences that encode an NDP protein and exemplary polypeptides encoded by an NDP gene are shown below.
NDP Polynucleotides
[0249] Among other things, the present disclosure provides polynucleotides, e.g., polynucleotides comprising an NDP gene or characteristic portion thereof, as well as compositions including such polynucleotides and methods utilizing such polynucleotides and/or compositions.
[0250] In some embodiments, a polynucleotide comprising an NDP gene or characteristic portion thereof can be DNA or RNA. In some embodiments, DNA can be genomic DNA or cDNA. In some embodiments, RNA can be an mRNA. In some embodiments, a polynucleotide comprises exons and/or introns of an NDP gene.
[0251] In some embodiments, a gene product is expressed from a polynucleotide comprising an NDP gene or characteristic portion thereof. In some embodiments, expression of such a polynucleotide can utilize one or more control elements (e.g., promoters, enhancers, splice sites, poly-adenylation sites, translation initiation sites, etc.). Thus, in some embodiments, a polynucleotide provided herein can include one or more control elements.
[0252] In some embodiments, an NDP gene is a mammalian NDP gene. In some embodiments, an NDP gene is a murine NDP gene. In some embodiments, an NDP gene is a primate NDP gene. In some embodiments, a NDP gene is a human NDP gene. An exemplary human NDP cDNA sequence is or includes the sequence of SEQ ID NO: 57. An exemplary human NDP genomic DNA sequence can be found in SEQ ID NO: 79. An exemplary human NDP cDNA sequence including untranslated regions is or includes the sequence of SEQ ID NO: 114.
TABLE-US-00002 Exemplary Human Mature NDP cDNA (SEQ ID NO: 57) ATGAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATG TGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTGCGA GGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAGCAA CCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGCTGC GATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGGTACATCCTCTCCTGTCACTGCGAGGA ATGCAATTCC Exemplary Human NDP cDNA including untranslated regions (SEQ ID NO: 114) AGAAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAAAGTCTGAGAAGA AAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATACTGTGATGATGGATTGCAAGTGCAAAG AGTAAGACAAAACTCCAGCACATAAAGGACAATGACAACCAGAAAGCTTCAGCCCGATCCTGCC CTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAAGCCATTTCCAATTTTTTGTCCTCTGCCTC CCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACAACAATGAGAAAACATGTACTAGCTGCATC CTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACAGACAGTAAAACGGACAGCTCATTC ATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATGTGGATTCTATCAGTCACCCAT TGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTGCGAGGGGCACTGCAGCCAGGCGTC ACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAGCAACCCTTCCGTTCCTCCTGTCAC TGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGCTGCGATGCTCAGGGGGCATGCGAC TCACTGCCACCTACCGGTACATCCTCTCCTGTCACTGCGAGGAATGCAATTCCTGAGGCCCGCT GCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGCCAGGGAAAGACTGGC AAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCTGCATATTCTAGTAAT AAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTC CTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTCCCCCTTGGCTCTCA ATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCCCTCATACACAAGCGT GTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAG CTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTGGTCTAAGG TTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCATGTGGATATGTTTAG CTACGTTTACTCACAGCGAGCGAACTGAGATTAAAATAACTAACAAACAGATTCTTTTATGTGA TGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGAGTTTTTGAAAGTAAAAGCAGCAT AAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGTAAGGGAGCAGAC TTTTAAAGACTTGCACAAATACGGATCCTGCACTGAGTCTGGAAAAGGCATATATGTACTAGTG GCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGG TGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCATGTGAAACA TGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAATGTGTTTTGGTCCTGG AGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATATTCAAAA
[0253] A non-limiting example of a human wildtype NDP genomic DNA sequence is SEQ ID NO: 79. The exons in SEQ ID NO: 79 are: nucleotide positions 1-87 (exon 1); nucleotide positions 88-468 (exon 2) and nucleotide positions 469-1719 (exon 3).
TABLE-US-00003 Exemplary Human NDP Gene Sequence (NCBI Reference Sequence: NC_000023.11) (SEQ ID NO: 79) AGAAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAAAGTCTGAGAAGA AAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATACTGTGATGATGGATTGCAAGTGCAAAG AGTAAGACAAAACTCCAGCACATAAAGGACAATGACAACCAGAAAGCTTCAGCCCGATCCTGCC CTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAAGCCATTTCCAATTTTTTGTCCTCTGCCTC CCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACAACAATGAGAAAACATGTACTAGCTGCATC CTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACAGACAGTAAAACGGACAGCTCATTC ATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATGTGGATTCTATCAGTCACCCAT TGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTGCGAGGGGCACTGCAGCCAGGCGTC ACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAGCAACCCTTCCGTTCCTCCTGTCAC TGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGCTGCGATGCTCAGGGGGCATGCGAC TCACTGCCACCTACCGGTACATCCTCTCCTGTCACTGCGAGGAATGCAATTCCTGAGGCCCGCT GCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGCCAGGGAAAGACTGGC AAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCTGCATATTCTAGTAAT AAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTC CTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTCCCCCTTGGCTCTCA ATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCCCTCATACACAAGCGT GTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAG CTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTGGTCTAAGG TTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCATGTGGATATGTTTAG CTACGTTTACTCACAGCGAGCGAACTGAGATTAAAATAACTAACAAACAGATTCTTTTATGTGA TGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGAGTTTTTGAAAGTAAAAGCAGCAT AAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGTAAGGGAGCAGAC TTTTAAAGACTTGCACAAATACGGATCCTGCACTGAGTCTGGAAAAGGCATATATGTACTAGTG GCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGG TGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCATGTGAAACA TGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAATGTGTTTTGGTCCTGG AGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATATTCAAAA Exemplary Mouse Mature NDP cDNA (SEQ ID NO: 81) AAAACAGACAGTTCATTTCTGATGGAGTCTCAACGCTGCATGAGACACCATTATGTCGATTCTA TCAGTCACCCACTGTACAAATGTAGCTCAAAGATGGTGCTCCTGGCCAGATGTGAGGGGCACTG CAGCCAGGCATCACGCTCTGAGCCCTTGGTGTCCTTCAGCACTGTCCTCAAGCAACCTTTCCGT TCCTCCTGTCACTGCTGCCGACCCCAGACTTCCAAGCTGAAGGCTCTGCGTCTGCGCTGCTCAG GGGGCATGCGACTTACTGCCACTTACCGGTACATCCTCTCCTGTCACTGTGAGGAATGCAGCTC C
Polypeptides Encoded by NDP Gene
[0254] Among other things, the present disclosure provides polypeptides encoded by an NDP gene or characteristic portion thereof. In some embodiments, an NDP gene is a mammalian NDP gene. In some embodiments, an NDP gene is a murine NDP gene. In some embodiments, an NDP gene is a primate NDP gene. In some embodiments, a NDP gene is a human NDP gene.
[0255] In some embodiments, a polypeptide comprises a NDP protein or characteristic portion thereof. In some embodiments, a NDP protein or characteristic portion thereof is mammalian NDP protein or characteristic portion thereof, e.g., primate NDP protein or characteristic portion thereof. In some embodiments, a NDP protein or characteristic portion thereof is a human NDP protein or characteristic portion thereof.
[0256] In some embodiments, a polypeptide provided herein comprises post-translational modifications. In some embodiments, a NDP protein or characteristic portion thereof provided herein comprises post-translational modifications. In some embodiments, post-translational modifications can comprise but is not limited to glycosylation (e.g., N-linked glycosylation, O-linked glycosylation), phosphorylation, acetylation, amidation, hydroxylation, methylation, ubiquitylation, sulfation, and/or a combination thereof.
[0257] An exemplary human NDP protein sequence is or includes the sequence of SEQ ID NO: 56. An exemplary human NDP protein sequence with a c-terminal flag tag is or includes the sequence of SEQ ID NO: 94. As exemplary mouse NDP protein sequence is or includes the sequence of SEQ ID NO: 80. As exemplary rhesus monkey NDP protein sequence is or includes the sequence of SEQ ID NO: 82. As exemplary rat NDP protein sequence is or includes the sequence of SEQ ID NO: 83. As exemplary chimpanzee NDP protein sequence is or includes the sequence of SEQ ID NO: 84.
TABLE-US-00004 Exemplary Human Mature NDP Protein (NCBI Accession No. NP_000257.1) (SEQ ID NO: 56) MKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQ PFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS Exemplary Human NDP Protein Sequence with C-terminal Flag Tag (SEQ ID NO: 94) MKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQ PFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNSGSRADYKDHDGDYKDHDI DYKDDDDK Exemplary Mouse Mature NDP Protein (NCBI Accession No. NP_035013.1) (SEQ ID NO: 80) KTDSSFLMDSQRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFR SSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECSS Exemplary Rhesus Monkey Mature NDP Protein (NCBI Accession No. NP_001253901.1) (SEQ ID NO: 82) MRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLAR CEGHCSQASRSEPLVSFSTVLKQPFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHC EECNS Exemplary Rat Mature NDP Protein (NCBI Accession No. NP_001102284.1) (SEQ ID NO: 83) KTDSSFLMDSQRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFR SSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECSS Exemplary Chimpanzee NDP Protein (NCBI Accession No. XP_016799622.1) (SEQ ID NO: 84) KTDSSFVMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQP FRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS
Heat Shock Proteins
[0258] In some embodiments of any of the compositions described herein, a secreted target protein is a heat shock protein. In some embodiments, a heat shock protein is a heat shock protein family A (Hsp70) member 1A (HSPA1A). In some embodiments, a heat shock protein is a heat shock protein 40 (Hsp40)/DNJ family member, e.g., Hsp40.
Heat Shock Protein Family A (Hsn70) member 1A (HSPA1A)
[0259] The HSPA1A gene encodes “heat shock protein family A member 1A” (HSPA1A). The human HSPA1A gene is located on chromosome 6p21.33. It contains 1 exon, encompassing ˜2400 bp (NCBI Accession No. NC_000006.12). HSPA1A encodes a 70 kDa heat shock protein, which stabilizes proteins against protein aggregation and is involved in protein folding.
[0260] Hair cells are susceptible to death when exposed to therapeutic drugs (such as aminoglycoside antibiotics or cisplatin) with ototoxic side effects. The induction of heat shock proteins (such as heat shock protein 70 (“HSP70”) or such as heme oxygenase-1 (“HO-1” or “HSP32”)) in response to cellular stress is a highly conserved response that can significantly inhibit cell death (such as aminoglycoside-induced hair cell death) in a variety of systems. For example, adenovirus-mediated infection of inner ear supporting cells with HSP70 was shown to inhibit hair cell death (see, e.g., Lindsay May et al., J Clin Invest. 2013; 123(8):3577-3587, the contents of which is hereby incorporated by reference herein in its entirety). In addition, other heat shock proteins such as HO-1 or HSP32 have been found to offer significant protection against cisplatin-induced hair cell death (see, e.g., Tiffany Baker et al., JARO 16: 67-80 (2015), the contents of which is hereby incorporated by reference herein in its entirety). As described herein, the present disclosure surprisingly found that viral transduction of an IL2ss sequence upstream of a sequence encoding an Hsp70 protein promotes a high level of secretion (see
[0261] As described herein, in some embodiments, polymorphisms in Hsp70 (such as rs1043618, rs1061581 and rs2227956 in HSP70-1, HSP70-2 and HSP70-hom, respectively) may render the cochlea susceptible to hearing loss (such as a polymorphism described by Konings et al., “Variations in HSP70 genes associated with noise-induced hearing loss in two independent populations”, Eur J Hum Genet. 2009 March; 17(3): 329-33, the contents of which is hereby incorporated by reference in its entirety).
[0262] As used herein, the term “active HSPA1A protein” means a protein encoded by DNA that, if substituted for both wildtype alleles encoding full-length HSPA1A protein in auditory hair cells, or ocular cells, of what is otherwise a wildtype mammal, and if expressed in the auditory hair cells, or ocular cells, of that mammal, results in that mammal's having a level of hearing, or vision, approximating the normal level of hearing, or vision, of a similar mammal that is entirely wildtype. Non-limiting examples of active HSPA1A proteins are full-length HSPA1A proteins (e.g., any of the full-length HSPA1A proteins described herein).
[0263] For example, an active HSPA1A protein can include a sequence of a wildtype, full-length HSPA1A protein (e.g., a wildtype, human, full-length HSPA1A protein) including about 1 to about 200 amino acid substitutions (e.g., about 1 to 190 amino acid substitutions, about 1 to about 180 amino acid substitutions, about 1 to about 160 amino acid substitutions, about 1 to about 150 amino acid substitutions, about 1 to about 140 amino acid substitutions, about 1 to about 130 amino acid substitutions, about 1 to about 120 amino acid substitutions, about 1 to about 110 amino acid substitutions, about 1 to about 100 amino acid substitutions, about 1 to about 90 amino acid substitutions, about 1 to about 80 amino acid substitutions, about 1 to about 70 amino acid substitutions, about 1 to about 60 amino acid substitutions, about 1 to about 50 amino acid substitutions, about 1 to about 40 amino acid substitutions, about 1 to about 30 amino acid substitutions, about 1 to about 25 amino acid substitutions, about 1 to about 20 amino acid substitutions, about 1 to about 10 amino acid substitutions, about 1 to about 5 amino acid substitutions, about 10 to about 200 amino acid substitutions, about 10 to about 180 amino acid substitutions, about 10 to about 160 amino acid substitutions, about 10 to about 150 amino acid substitutions, about 10 to about 140 amino acid substitutions, about 10 to about 120 amino acid substitutions, about 10 to about 100 amino acid substitutions, about 10 to about 80 amino acid substitutions, about 10 to about 60 amino acid substitutions, about 10 to about 50 amino acid substitutions, about 10 to about 40 amino acid substitutions, about 10 to about 20 amino acid substitutions, about 20 to about 200 amino acid substitutions, about 20 to about 180 amino acid substitutions, about 20 to about 160 amino acid substitutions, about 20 to about 150 amino acid substitutions, about 20 to about 140 amino acid substitutions, about 20 to about 120 amino acid substitutions, about 20 to about 100 amino acid substitutions, about 20 to about 80 amino acid substitutions, about 20 to about 60 amino acid substitutions, about 20 to about 50 amino acid substitutions, about 20 to about 40 amino acid substitutions, about 40 to about 200 amino acid substitutions, about 40 to about 180 amino acid substitutions, about 40 to about 160 amino acid substitutions, about 40 to about 150 amino acid substitutions, about 40 to about 140 amino acid substitutions, about 40 to about 120 amino acid substitutions, about 40 to about 100 amino acid substitutions, about 40 to about 80 amino acid substitutions, about 40 to about 60 amino acid substitutions, about 40 to about 50 amino acid substitutions, about 50 to about 200 amino acid substitutions, about 50 to about 180 amino acid substitutions, about 50 to about 160 amino acid substitutions, about 50 to about 150 amino acid substitutions, about 50 to about 140 amino acid substitutions, about 50 to about 120 amino acid substitutions, about 50 to about 100 amino acid substitutions, about 50 to about 80 amino acid substitutions, about 50 to about 60 amino acid substitutions, about 60 to about 200 amino acid substitutions, about 60 to about 180 amino acid substitutions, about 60 to about 160 amino acid substitutions, about 60 to about 150 amino acid substitutions, about 60 to about 140 amino acid substitutions, about 60 to about 120 amino acid substitutions, about 60 to about 100 amino acid substitutions, about 60 to about 80 amino acid substitutions, about 80 to about 200 amino acid substitutions, about 80 to about 180 amino acid substitutions, about 80 to about 160 amino acid substitutions, about 80 to about 150 amino acid substitutions, about 80 to about 140 amino acid substitutions, about 80 to about 120 amino acid substitutions, about 80 to about 100 amino acid substitutions, about 100 to about 200 amino acid substitutions, about 100 to about 180 amino acid substitutions, about 100 to about 160 amino acid substitutions, about 100 to about 150 amino acid substitutions, about 100 to about 140 amino acid substitutions, about 100 to about 120 amino acid substitutions, about 120 to about 200 amino acid substitutions, about 120 to about 180 amino acid substitutions, about 120 to about 160 amino acid substitutions, about 120 to about 150 amino acid substitutions, about 120 to about 140 amino acid substitutions, about 140 to about 200 amino acid substitutions, about 140 to about 180 amino acid substitutions, about 140 to about 160 amino acid substitutions, about 140 to about 150 amino acid substitutions, about 150 to about 200 amino acid substitutions, about 150 to about 180 amino acid substitutions, about 150 to about 160 amino acid substitutions, about 160 to about 200 amino acid substitutions, about 160 to about 180 amino acid substitutions, or about 180 to about 200 amino acid substitutions).
[0264] One skilled in the art would appreciate that amino acids that are not conserved between wildtype HSPA1A proteins from different species can be mutated without losing activity, while those amino acids that are conserved between wildtype HSPA1A proteins from different species should not be mutated as they are more likely (than amino acids that are not conserved between different species) to be involved in activity.
[0265] An active HSPA1A protein can include, e.g., a sequence of a wildtype, full-length HSPA1A protein (e.g., a wildtype, human, full-length HSPA1A protein) that has about 1 to about 100 amino acids (e.g., about 1 to about 95 amino acids, about 1 to about 90 amino acids, about 1 to about 85 amino acids, about 1 to about 80 amino acids, about 1 to about 75 amino acids, about 1 to about 70 amino acids, about 1 to about 65 amino acids, about 1 to about 60 amino acids, about 1 to about 55 amino acids, about 1 to about 50 amino acids, about 1 to about 45 amino acids, about 1 to about 40 amino acids, about 1 to about 35 amino acids, about 1 to about 30 amino acids, about 1 to about 25 amino acids, about 1 to about 20 amino acids, about 1 to about 15 amino acids, about 1 to about 10 amino acids, about 1 to about 5 amino acids, about 5 to about 100 amino acids, about 5 to about 95 amino acids, about 5 to about 90 amino acids, about 5 to about 85 amino acids, about 5 to about 80 amino acids, about 5 to about 75 amino acids, about 5 to about 70 amino acids, about 5 to about 65 amino acids, about 5 to about 60 amino acids, about 5 to about 55 amino acids, about 5 to about 50 amino acids, about 5 to about 45 amino acids, about 5 to about 40 amino acids, about 5 to about 35 amino acids, about 5 to about 30 amino acids, about 5 to about 25 amino acids, about 5 to about 20 amino acids, about 5 to about 15 amino acids, about 5 to about 10 amino acids, about 10 to about 100 amino acids, about 10 to about 95 amino acids, about 10 to about 90 amino acids, about 10 to about 85 amino acids, about 10 to about 80 amino acids, about 10 to about 80 amino acids, about 10 to about 75 amino acids, about 10 to about 70 amino acids, about 10 to about 65 amino acids, about 10 to about 60 amino acids, about 10 to about 55 amino acids, about 10 to about 50 amino acids, about 10 to about 45 amino acids, about 10 to about 40 amino acids, about 10 to about 35 amino acids, about 10 to about 30 amino acids, about 10 to about 25 amino acids, about 10 to about 20 amino acids, about 10 to about 15 amino acids, about 15 to about 100 amino acids, about 15 to about 95 amino acids, about 15 to about 90 amino acids, about 15 to about 85 amino acids, about 15 to about 80 amino acids, about 15 to about 75 amino acids, about 15 to about 70 amino acids, about 15 to about 65 amino acids, about 15 to about 60 amino acids, about 15 to about 55 amino acids, about 15 to about 50 amino acids, about 15 to about 45 amino acids, about 15 to about 40 amino acids, about 15 to about 35 amino acids, about 15 to about 30 amino acids, about 15 to about 25 amino acids, about 15 to about 20 amino acids, about 20 to about 100 amino acids, about 20 to about 95 amino acids, about 20 to about 90 amino acids, about 20 to about 85 amino acids, about 20 to about 80 amino acids, about 20 to about 75 amino acids, about 20 to about 70 amino acids, about 20 to about 65 amino acids, about 20 to about 60 amino acids, about 20 to about 55 amino acids, about 20 to about 50 amino acids, about 20 to about 45 amino acids, about 20 to about 40 amino acids, about 20 to about 35 amino acids, about 20 to about 30 amino acids, about 20 to about 25 amino acids, about 25 to about 100 amino acids, about 25 to about 95 amino acids, about 25 to about 90 amino acids, about 25 to about 85 amino acids, about 25 to about 80 amino acids, about 25 to about 75 amino acids, about 25 to about 70 amino acids, about 25 to about 65 amino acids, about 25 to about 60 amino acids, about 25 to about 55 amino acids, about 25 to about 50 amino acids, about 25 to about 45 amino acids, about 25 to about 40 amino acids, about 25 to about 35 amino acids, about 25 to about 30 amino acids, about 30 to about 100 amino acids, about 30 to about 95 amino acids, about 30 to about 90 amino acids, about 30 to about 85 amino acids, about 30 to about 80 amino acids, about 30 to about 75 amino acids, about 30 to about 70 amino acids, about 30 to about 65 amino acids, about 30 to about 60 amino acids, about 30 to about 55 amino acids, about 30 to about 50 amino acids, about 30 to about 45 amino acids, about 30 to about 40 amino acids, about 30 to about 35 amino acids, about 35 to about 100 amino acids, about 35 to about 95 amino acids, about 35 to about 90 amino acids, about 35 to about 85 amino acids, about 35 to about 80 amino acids, about 35 to about 75 amino acids, about 35 to about 70 amino acids, about 35 to about 65 amino acids, about 35 to about 60 amino acids, about 35 to about 55 amino acids, about 35 to about 50 amino acids, about 35 to about 45 amino acids, about 35 to about 40 amino acids, about 40 to about 100 amino acids, about 40 to about 95 amino acids, about 40 to about 90 amino acids, about 40 to about 85 amino acids, about 40 to about 80 amino acids, about 40 to about 75 amino acids, about 40 to about 70 amino acids, about 40 to about 65 amino acids, about 40 to about 60 amino acids, about 40 to about 55 amino acids, about 40 to about 50 amino acids, about 40 to about 45 amino acids, about 45 to about 100 amino acids, about 45 to about 95 amino acids, about 45 to about 90 amino acids, about 45 to about 85 amino acids, about 45 to about 80 amino acids, about 45 to about 75 amino acids, about 45 to about 70 amino acids, about 45 to about 65 amino acids, about 45 to about 60 amino acids, about 45 to about 55 amino acids, about 45 to about 50 amino acids, about 50 to about 100 amino acids, about 50 to about 95 amino acids, about 50 to about 90 amino acids, about 50 to about 85 amino acids, about 50 to about 80 amino acids, about 50 to about 75 amino acids, about 50 to about 70 amino acids, about 50 to about 65 amino acids, about 50 to about 60 amino acids, about 50 to about 55 amino acids, about 55 to about 100 amino acids, about 55 to about 95 amino acids, about 55 to about 90 amino acids, about 55 to about 85 amino acids, about 55 to about 80 amino acids, about 55 to about 75 amino acids, about 55 to about 70 amino acids, about 55 to about 65 amino acids, about 55 to about 60 amino acids, about 60 to about 100 amino acids, about 60 to about 95 amino acids, about 60 to about 90 amino acids, about 60 to about 85 amino acids, about 60 to about 80 amino acids, about 60 to about 75 amino acids, about 60 to about 70 amino acids, about 60 to about 65 amino acids, about 65 to about 100 amino acids, about 65 to about 95 amino acids, about 65 to about 90 amino acids, about 65 to about 85 amino acids, about 65 to about 80 amino acids, about 65 to about 75 amino acids, about 65 to about 70 amino acids, about 70 to about 100 amino acids, about 70 to about 95 amino acids, about 70 to about 90 amino acids, about 70 to about 85 amino acids, about 70 to about 80 amino acids, about 70 to about 75 amino acids, about 75 to about 100 amino acids, about 75 to about 95 amino acids, about 75 to about 90 amino acids, about 75 to about 85 amino acids, about 75 to about 80 amino acids, about 80 to about 100 amino acids, about 80 to about 95 amino acids, about 80 to about 90 amino acids, about 80 to about 85 amino acids, about 85 to about 100 amino acids, about 85 to about 95 amino acids, about 85 to about 90 amino acids, about 90 to about 100 amino acids, about 90 to about 95 amino acids, or about 95 to about 100 amino acids), removed from its N-terminus and/or 1 amino acid to 80 amino acids (or any of the subranges of this range described herein) removed from its C-terminus.
[0266] In some embodiments, an active HSPA1A protein can, e.g., include the sequence of a wildtype, full-length HSPA1A protein where 1 amino acid to 50 amino acids, 1 amino acid to 45 amino acids, 1 amino acid to 40 amino acids, 1 amino acid to 35 amino acids, 1 amino acid to 30 amino acids, 1 amino acid to 25 amino acids, 1 amino acid to 20 amino acids, 1 amino acid to 15 amino acids, 1 amino acid to 10 amino acids, 1 amino acid to 9 amino acids, 1 amino acid to 8 amino acids, 1 amino acid to 7 amino acids, 1 amino acid to 6 amino acids, 1 amino acid to 5 amino acids, 1 amino acid to 4 amino acids, 1 amino acid to 3 amino acids, about 2 amino acids to 50 amino acids, about 2 amino acids to 45 amino acids, about 2 amino acids to 40 amino acids, about 2 amino acids to 35 amino acids, about 2 amino acids to 30 amino acids, about 2 amino acids to 25 amino acids, about 2 amino acids to 20 amino acids, about 2 amino acids to 15 amino acids, about 2 amino acids to 10 amino acids, about 2 amino acids to 9 amino acids, about 2 amino acids to 8 amino acids, about 2 amino acids to 7 amino acids, about 2 amino acids to 6 amino acids, about 2 amino acids to 5 amino acids, about 2 amino acids to 4 amino acids, about 3 amino acids to 50 amino acids, about 3 amino acids to 45 amino acids, about 3 amino acids to 40 amino acids, about 3 amino acids to 35 amino acids, about 3 amino acids to 30 amino acids, about 3 amino acids to 25 amino acids, about 3 amino acids to 20 amino acids, about 3 amino acids to 15 amino acids, about 3 amino acids to 10 amino acids, about 3 amino acids to 9 amino acids, about 3 amino acids to 8 amino acids, about 3 amino acids to 7 amino acids, about 3 amino acids to 6 amino acids, about 3 amino acids to 5 amino acids, about 4 amino acids to 50 amino acids, about 4 amino acids to 45 amino acids, about 4 amino acids to 40 amino acids, about 4 amino acids to 35 amino acids, about 4 amino acids to 30 amino acids, about 4 amino acids to 25 amino acids, about 4 amino acids to 20 amino acids, about 4 amino acids to 15 amino acids, about 4 amino acids to 10 amino acids, about 4 amino acids to 9 amino acids, about 4 amino acids to 8 amino acids, about 4 amino acids to 7 amino acids, about 4 amino acids to 6 amino acids, about 5 amino acids to 50 amino acids, about 5 amino acids to 45 amino acids, about 5 amino acids to 40 amino acids, about 5 amino acids to 35 amino acids, about 5 amino acids to 30 amino acids, about 5 amino acids to 25 amino acids, about 5 amino acids to 20 amino acids, about 5 amino acids to 15 amino acids, about 5 amino acids to 10 amino acids, about 5 amino acids to 9 amino acids, about 5 amino acids to 8 amino acids, about 5 amino acids to 7 amino acids, about 6 amino acids to 50 amino acids, about 6 amino acids to 45 amino acids, about 6 amino acids to 40 amino acids, about 6 amino acids to 35 amino acids, about 6 amino acids to 30 amino acids, about 6 amino acids to 25 amino acids, about 6 amino acids to 20 amino acids, about 6 amino acids to 15 amino acids, about 6 amino acids to 10 amino acids, about 6 amino acids to 9 amino acids, about 6 amino acids to 8 amino acids, about 7 amino acids to 50 amino acids, about 7 amino acids to 45 amino acids, about 7 amino acids to 40 amino acids, about 7 amino acids to 35 amino acids, about 7 amino acids to 30 amino acids, about 7 amino acids to 25 amino acids, about 7 amino acids to 20 amino acids, about 7 amino acids to 15 amino acids, about 7 amino acids to 10 amino acids, about 7 amino acids to 9 amino acids, about 8 amino acids to 50 amino acids, about 8 amino acids to 45 amino acids, about 8 amino acids to 40 amino acids, about 8 amino acids to 35 amino acids, about 8 amino acids to 30 amino acids, about 8 amino acids to 25 amino acids, about 8 amino acids to 20 amino acids, about 8 amino acids to 15 amino acids, about 8 amino acids to 10 amino acids, about 10 amino acids to 50 amino acids, about 10 amino acids to 45 amino acids, about 10 amino acids to 40 amino acids, about 10 amino acids to 35 amino acids, about 10 amino acids to 30 amino acids, about 10 amino acids to 25 amino acids, about 10 amino acids to 20 amino acids, about 10 amino acids to 15 amino acids, about 15 amino acids to 50 amino acids, about 15 amino acids to 45 amino acids, about 15 amino acids to 40 amino acids, about 15 amino acids to 35 amino acids, about 15 amino acids to 30 amino acids, about 15 amino acids to 25 amino acids, about 15 amino acids to 20 amino acids, about 20 amino acids to 50 amino acids, about 20 amino acids to 45 amino acids, about 20 amino acids to 40 amino acids, about 20 amino acids to 35 amino acids, about 20 amino acids to 30 amino acids, about 20 amino acids to 25 amino acids, about 25 amino acids to 50 amino acids, about 25 amino acids to 45 amino acids, about 25 amino acids to 40 amino acids, about 25 amino acids to 35 amino acids, about 25 amino acids to 30 amino acids, about 30 amino acids to 50 amino acids, about 30 amino acids to 45 amino acids, about 30 amino acids to 40 amino acids, about 30 amino acids to 35 amino acids, about 35 amino acids to 50 amino acids, about 35 amino acids to 45 amino acids, about 35 amino acids to 40 amino acids, about 40 amino acids to 50 amino acids, about 40 amino acids to 45 amino acids, or about 45 amino acids to about 50 amino acids, are inserted. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) can be inserted as a contiguous sequence into the sequence of a wildtype, full-length HSPA1A protein. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) are inserted in multiple, non-contiguous places in the sequence of a wildtype, full-length HSPA1A protein. As can be appreciated in the art, the 1 amino acid to 50 amino acids can be inserted into a portion of the sequence of a wildtype, full-length HSPA1A protein that is not well-conserved between species.
[0267] Methods of detecting mutations in a gene are well-known in the art. Non-limiting examples of such techniques include: real-time polymerase chain reaction (RT-PCR), PCR, sequencing, Southern blotting, and Northern blotting.
[0268] Exemplary wildtype HSPA1A protein sequences are or include SEQ ID NO: 85, 88, 90, 91, and 92. Exemplary DNA sequences that encode a NDP protein and exemplary polypeptides encoded by an NDP gene are shown below.
HSPA1A Polynucleotides
[0269] Among other things, the present disclosure provides polynucleotides, e.g., polynucleotides comprising an HSPA1A gene or characteristic portion thereof, as well as compositions including such polynucleotides and methods utilizing such polynucleotides and/or compositions.
[0270] In some embodiments, a polynucleotide comprising an HSPA1A gene or characteristic portion thereof can be DNA or RNA. In some embodiments, DNA can be genomic DNA or cDNA. In some embodiments, RNA can be an mRNA. In some embodiments, a polynucleotide comprises exons and/or introns of an HSPA1A gene.
[0271] In some embodiments, a gene product is expressed from a polynucleotide comprising an HSPA1A gene or characteristic portion thereof. In some embodiments, expression of such a polynucleotide can utilize one or more control elements (e.g., promoters, enhancers, splice sites, poly-adenylation sites, translation initiation sites, etc.). Thus, in some embodiments, a polynucleotide provided herein can include one or more control elements.
[0272] In some embodiments, an HSPA1A gene is a mammalian HSPA1A gene. In some embodiments, an HSPA1A gene is a murine HSPA1A gene. In some embodiments, an HSPA1A gene is a primate HSPA1A gene. In some embodiments, a HSPA1A gene is a human HSPA1A gene. An exemplary human HSPA1A cDNA sequence is or includes the sequence of SEQ ID NO: 86. An exemplary human HSPA1A genomic DNA sequence can be found in SEQ ID NO: 40. An exemplary human HSPA1A cDNA sequence including untranslated regions is or includes the sequence of SEQ ID NO: 115.
TABLE-US-00005 Exemplary Human Mature HSPA1A cDNA (SEQ ID NO: 86) ATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCC AACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGC CTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAG AACACCGTGTTTGACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGG ACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTA CAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAG GAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACT TCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCG GATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAG CGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACG ACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAA CAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAAC AAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCA CCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAG GGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCT CTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCC GCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCAT CAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAG TCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGG CCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGAT CTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCC ATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGG GCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCAC GGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAG GAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCG AGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGA TGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAG GTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGG AGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCC TGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTG GATTAG Exemplary Mature HSPA1A cDNA including untranslated regions (SEQ ID NO: 115) AACGGCTAGCCTGAGGAGCTGCTGCGACAGTCCACTACCTTTTTCGAGAGTGACTCCCGTTGTC CCAAGGCTTCCCAGAGCGAACCTGTGCGGCTGCAGGCACCGGCGCGTCGAGTTTCCGGCGTCCG GAAGGACCGAGCTCTTCTCGCGGATCCAGTGTTCCGTTTCCAGCCCCCAATCTCAGAGCGGAGC CGACAGAGAGCAGGGAACCGGCATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACC TACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACC GCACCACCCCCAGCTACGTGGCCTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAA CCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTGACGCGAAGCGGCTGATTGGCCGCAAGTTC GGCGACCCGGTGGTGCAGTCGGACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACA AGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTC CATGGTGCTGACCAAGATGAAGGAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCG GTGATCACCGTGCCGGCCTACTTCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGA TCGCGGGGCTCAACGTGCTGCGGATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCT GGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGAC GTGTCCATCCTGACGATCGACGACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACC TGGGTGGGGAGGACTTTGACAACAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACA CAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCC AAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCG ACTTCTACACGTCCATCACCAGGGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCAC CCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTG GTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACG GGCGCGACCTGAACAAGAGCATCAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGC GGCCATCCTGATGGGGGACAAGTCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCC CTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCA TCCCCACCAAGCAGACGCAGATCTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCA GGTGTACGAGGGCGAGAGGGCCATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGC GGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACG GCATCCTGAACGTCACGGCCACGGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAA CGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAA GCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCA ACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAA GGTGCTGGACAAGTGTCAAGAGGTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGAC GAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACC AGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTC AGGCCCCACCATTGAGGAGGTAGATTAGGGGCCTTTCCAAGATTGCTGTTTTTGTTTTGGAGCT TCAAGACTTTGCATTTCCTAGTATTTCTGTTTGTCAGTTCTCAATTTCCTGTGTTTGCAATGTT GAAATTTTTTGGTGAAGTACTGAACTTGCTTTTTTTCCGGTTTCTACATGCAGAGATGAATTTA TACTGCCATCTTACGACTATTTCTTCTTTTTAATACACTTAACTCAGGCCATTTTTTAAGTTGG TTACTTCAAAGTAAATAAACTTTAAAATTCAA
[0273] A non-limiting example of a human wildtype HSPA1A genomic DNA sequence is SEQ ID NO: 87.
TABLE-US-00006 Exemplary Human HSPA1A Gene Sequence (NCBI Reference Sequence NC 000006.12) (SEQ ID NO: 87) AACGGCTAGCCTGAGGAGCTGCTGCGACAGTCCACTACCTTTTTCGAGAGTGACTCCCGTTGTC CCAAGGCTTCCCAGAGCGAACCTGTGCGGCTGCAGGCACCGGCGCGTCGAGTTTCCGGCGTCCG GAAGGACCGAGCTCTTCTCGCGGATCCAGTGTTCCGTTTCCAGCCCCCAATCTCAGAGCGGAGC CGACAGAGAGCAGGGAACCGGCATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACC TACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACC GCACCACCCCCAGCTACGTGGCCTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAA CCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTGACGCGAAGCGGCTGATTGGCCGCAAGTTC GGCGACCCGGTGGTGCAGTCGGACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACA AGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTC CATGGTGCTGACCAAGATGAAGGAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCG GTGATCACCGTGCCGGCCTACTTCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGA TCGCGGGGCTCAACGTGCTGCGGATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCT GGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGAC GTGTCCATCCTGACGATCGACGACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACC TGGGTGGGGAGGACTTTGACAACAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACA CAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCC AAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCG ACTTCTACACGTCCATCACCAGGGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCAC CCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTG GTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACG GGCGCGACCTGAACAAGAGCATCAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGC GGCCATCCTGATGGGGGACAAGTCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCC CTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCA TCCCCACCAAGCAGACGCAGATCTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCA GGTGTACGAGGGCGAGAGGGCCATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGC GGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACG GCATCCTGAACGTCACGGCCACGGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAA CGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAA GCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCA ACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAA GGTGCTGGACAAGTGTCAAGAGGTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGAC GAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACC AGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTC AGGCCCCACCATTGAGGAGGTAGATTAGGGGCCTTTCCAAGATTGCTGTTTTTGTTTTGGAGCT TCAAGACTTTGCATTTCCTAGTATTTCTGTTTGTCAGTTCTCAATTTCCTGTGTTTGCAATGTT GAAATTTTTTGGTGAAGTACTGAACTTGCTTTTTTTCCGGTTTCTACATGCAGAGATGAATTTA TACTGCCATCTTACGACTATTTCTTCTTTTTAATACACTTAACTCAGGCCATTTTTTAAGTTGG TTACTTCAAAGTAAATAAACTTTAAAATTCAA Exemplary Mouse Mature HSPA1A cDNA (SEQ ID NO: 89) ATGGCCAAGAACACGGCGATCGGCATCGACCTGGGCACCACCTACTCGTGCGTGGGCGTGTTCC AGCACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACCGCACGACCCCCAGCTACGTGGC CTTCACCGACACCGAGCGCCTCATCGGAGACGCCGCCAAGAACCAGGTGGCGCTGAACCCGCAG AACACCGTGTTCGACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGATGCGGTGGTGCAGTCCG ACATGAAGCACTGGCCCTTCCAGGTGGTGAACGACGGCGACAAGCCCAAGGTGCAGGTGAACTA CAAGGGCGAGAGCCGGTCGTTCTTCCCGGAGGAGATCTCGTCCATGGTGCTGACGAAGATGAAG GAGATCGCTGAGGCGTACCTGGGCCACCCGGTGACCAACGCGGTGATCACGGTGCCCGCCTACT TCAACGACTCTCAGCGGCAGGCCACCAAGGACGCGGGCGTGATCGCCGGTCTAAACGTGCTGCG GATCATCAACGAGCCCACGGCGGCCGCCATCGCCTACGGGCTGGACCGGACCGGCAAGGGCGAG CGCAACGTGCTCATCTTCGACCTGGGGGGCGGCACGTTCGACGTGTCCATCCTGACGATCGACG ACGGCATCTTCGAGGTGAAGGCCACGGCGGGCGACACGCACCTGGGAGGGGAGGACTTCGACAA CCGGCTGGTGAGCCACTTCGTGGAGGAGTTCAAGAGGAAGCACAAGAAGGACATCAGCCAGAAC AAGCGCGCGGTGCGGCGGCTGCGCACTGCGTGTGAGAGGGCCAAGAGGACGCTGTCGTCCAGCA CCCAGGCCAGCCTGGAGATCGACTCTCTGTTCGAGGGCATCGACTTCTACACATCCATCACGCG GGCGCGGTTCGAAGAGCTGTGCTCAGACCTGTTCCGCGGCACGCTGGAGCCCGTGGAGAAGGCC CTGCGCGACGCCAAGATGGACAAGGCGCAGATCCACGACCTGGTGCTGGTGGGCGGCTCGACGC GCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCAT CAACCCGGACGAGGCGGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAG TCGGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCGCCGCTGTCGCTGGGCCTGGAGACTG CGGGCGGCGTGATGACGGCGCTCATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGAC CTTCACCACCTACTCGGACAACCAGCCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCC ATGACGCGCGACAACAACCTGCTGGGGCGCTTCGAACTGAGCGGCATCCCGCCGGCGCCCAGGG GCGTGCCACAGATCGAGGTGACCTTCGACATCGACGCCAACGGCATCCTGAACGTCACGGCCAC CGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAG GAGGAGATCGAGCGCATGGTGCAGGAGGCCGAGCGCTACAAGGCCGAGGACGAGGTGCAGCGCG ACAGGGTGGCCGCCAAGAACGCGCTCGAATCCTATGCCTTCAACATGAAGAGCGCCGTGGAGGA CGAGGGTCTCAAGGGCAAGCTCAGCGAGGCTGACAAGAAGAAGGTGCTGGACAAGTGCCAGGAG GTCATCTCCTGGCTGGACTCCAACACGCTGGCCGACAAGGAGGAGTTCGTGCACAAGCGGGAGG AGCTGGAGCGGGTGTGCAGCCCCATCATCAGTGGGCTGTACCAGGGTGCGGGTGCTCCTGGGGC TGGGGGCTTCGGGGCCCAGGCGCCCAAGGGAGCCTCTGGCTCAGGACCCACCATCGAGGAGGTG GATTAGA
Polypeptides Encoded by HSPA1A Gene
[0274] Among other things, the present disclosure provides polypeptides encoded by an NDP gene or characteristic portion thereof. In some embodiments, an HSPA1A gene is a mammalian HSPA1A gene. In some embodiments, an HSPA1A gene is a murine HSPA1A gene. In some embodiments, an HSPA1A gene is a primate HSPA1A gene. In some embodiments, a HSPA1A gene is a human HSPA1A gene.
[0275] In some embodiments, a polypeptide comprises a HSPA1A protein or characteristic portion thereof. In some embodiments, a HSPA1A protein or characteristic portion thereof is mammalian HSPA1A protein or characteristic portion thereof, e.g., primate HSPA1A protein or characteristic portion thereof. In some embodiments, a HSPA1A protein or characteristic portion thereof is a human HSPA1A protein or characteristic portion thereof.
[0276] In some embodiments, a polypeptide provided herein comprises post-translational modifications. In some embodiments, a HSPA1A protein or characteristic portion thereof provided herein comprises post-translational modifications. In some embodiments, post-translational modifications can comprise but is not limited to glycosylation (e.g., N-linked glycosylation, O-linked glycosylation), phosphorylation, acetylation, amidation, hydroxylation, methylation, ubiquitylation, sulfation, and/or a combination thereof.
[0277] An exemplary human HSPA1A protein sequence is or includes the sequence of SEQ ID NO: 85. An exemplary human HSPA1A protein sequence with a c-terminal flag tag is or includes the sequence of SEQ ID NO: 95. As exemplary mouse HSPA1A protein sequence is or includes the sequence of SEQ ID NO: 88. As exemplary rhesus monkey HSPA1A protein sequence is or includes the sequence of SEQ ID NO: 90. As exemplary rat HSPA1A protein sequence is or includes the sequence of SEQ ID NO: 91. As exemplary cattle HSPA1A protein sequence is or includes the sequence of SEQ ID NO: 92.
TABLE-US-00007 Exemplary Human Mature HSPA1A Protein (NCBI Accession No. NP_005336.3) (SEQ ID NO: 85) MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTEDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGAQGPKGGSGSGPTIEEV D Exemplary Human HSPA1A Protein Sequence with C-terminal Flag Tag (SEQ ID NO: 95) MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTEDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGAQGPKGGSGSGPTIEEV DGSRADYKDHDGDYKDHDIDYKDDDDK Exemplary Mouse Mature HSPA1A Protein (NCBI Accession No. NP_034609.2) (SEQ ID NO: 88) MAKNTAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDAVVQSDMKHWPFQVVNDGDKPKVQVNYKGESRSFFPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRGTLEPVEKA LRDAKMDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQTFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAERYKAEDEVQRDRVAAKNALESYAFNMKSAVEDEGLKGKLSEADKKKVLDKCQE VISWLDSNTLADKEEFVHKREELERVCSPIISGLYQGAGAPGAGGFGAQAPKGASGSGPTIEEV D Exemplary Rhesus Monkey Mature HSPA1A Protein (NCBI Accession No. XP_014991489.2) (SEQ ID NO: 90) MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTEDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGAQGPKGGSGSGPTIEEV D Exemplary Rat Mature HSPA1A Protein (NCBI Accession No. NP_114177.2) (SEQ ID NO: 91) MAKKTAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVVNDGDKPKVQVNYKGENRSFYPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRGTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQTFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAERYKAEDEVQRERVAAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDSNTLAEKEEFVHKREELERVCNPIISGLYQGAGAPGAGGFGAQAPKGGSGSGPTIEEV D Exemplary Cattle HSPAIA Protein (NCBI Accession No. NP_976067.3) (SEQ ID NO: 92) MAKNMAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFRVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTEDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISRLYQGAGGPGAGGFGAQGPKGGSGSGPTIEEV D
Hsp40/DNAJ Family of Proteins
[0278] A secreted protein described herein can be a heat shock protein from the Hsp40/DNAJ family. Proteins in the Hsp40/DNAJ family comprise a 70 amino-acid consensus sequence known as the J domain, which interacts with a HSP70 protein. Without being bound to any particular theory, it is believed that proteins in the Hsp40/DNAJ family play a role in regulating the adenosine triphosphatase (ATPase) activity of HSP70 heat-shock proteins (e.g., HSPA1A).
[0279] In some embodiments, a protein in the Hsp40/DNAJ family is encoded by a Type 1 (subfamily A) gene. In some embodiments, a protein in the Hsp40/DNAJ family is encoded by a Type 2 (subfamily B) gene. In some embodiments, a protein in the Hsp40/DNAJ family is encoded by a Type 3 (subfamily C) gene. In some embodiments, a protein in the Hsp40/DNAJ family is encoded by a J-like gene. In some embodiments, a protein in the Hsp40/DNAJ family is encoded by a gene listed in Table 1. Exemplary sequences for the genes can be found by reference to the Gene ID in Table 1. In some embodiments, a protein in the Hsp40/DNAJ family is listed in Table 1. Exemplary sequences for the proteins can be found by reference to the UniProt ID in Table 1.
TABLE-US-00008 TABLE 1 Gene Protein UniProt ID Gene ID Type 1 (subfamily A) DNAJA1 DnaJA1 P31689 3301 DNAJA2 DnaJA2 O60884 10294 DNAJA3 DnaJA3 Q96EY1 9093 DNAJA4 DnaJA4 Q8WW22 55466 Type 2 (subfamily B) DNAJB1 DnaJB1 P25685 3337 DNAJB2 DnaJB2 P25686 3300 DNAJB3 DnaJB3 Q8WWF6 414061 DNAJB4 DnaJB4 Q9UDY4 11080 DNAJB5 DnaJB5 O75953 25822 DNAJB6 DnaJB6 O75190 10049 DNAJB7 DnaJB7 Q7Z6W7 150353 DNAJB8 DnaJB8 Q8NHSO 165721 DNAJB9 DnaJB9 Q9UBS3 4189 DNAJB11 DnaJB11 Q9UBS4 51726 DNAJB12 DnaJB12 Q9NXW2 54788 DNAJB13 DnaJB13 P59910 374407 DNAJB14 DnaJB14 Q8TBM8 79982 Type 3 (subfamily C) DNAJC1 DnaJC1 Q96KC8 64215 DNAJC2 DnaJC2 Q99543 27000 DNAJC3 DnaJC3 Q13217 5611 DNAJC4 DnaJC4 Q9NNZ3 3338 DNAJC5 DnaJC5 Q9H3Z4 80331 DNAJC5B DnaJC5B Q9UF47 85479 DNAJC5G DnaJC5G Q8N7S2 285126 DNAJC6 DnaJC6 O75061 9829 DNAJC7 DnaJC7 Q99615 7266 DNAJC8 DnaJC8 O75937 22826 DNAJC9 DnaJC9 Q8WXX5 23234 DNAJC10 DnaJC10 Q8IXB1 54431 DNAJC11 DnaJC11 Q9NVH1 55735 DNAJC12 DnaJC12 Q9UKB3 56521 DNAJC13 DnaJC13 O75165 23317 DNAJC14 DnaJC14 Q6Y2X3 85406 DNAJC15 DnaJC15 Q9Y5T4 29103 DNAJC16 DnaJC16 Q9Y2G8 23341 DNAJC17 DnaJC17 Q9NVM6 55192 DNAJC18 DnaJC18 Q9H819 202052 DNAJC19 DnaJC19 Q96DA6 131118 DNAJC20 DnaJC20 Q8IWL3 150274 DNAJC21 DnaJC21 Q5F1R6 134218 DNAJC22 DnaJC22 Q8N4W6 79962 J-like DNAJC23 DnaJC23 Q9UGP8 11231 DNAJC24 DnaJC24 Q6P3W2 120526 DNAJC25 DnaJC25 Q9H1X3 548645 DNAJC26 DnaJC26 O14976 2580 DNAJC27 DnaJC27 Q9NZQ0 51277 DNAJC28 DnaJC28 Q9NX36 54943 DNAJC29 DnaJC29 Q9NZJ4 26278 DNAJC30 DnaJC30 Q96LL9 84277
Heat Shock Protein 40 (Hsp40) (DnaJ Homolog Subfamily B Member 1) (DNAJB1)
[0280] In some embodiments, an Hsp40 protein is encoded by a DNAJB1 gene. The human DNAJB1 gene is located on chromosome Chr19:14, 514, 770-14,529,770. It contains 7 exons (NCBI Accession No. NC_000019.10). DNAJB1 encodes a 40 kDa heat shock protein.
[0281] As used herein, the term “active DNAJB1 protein” means a protein encoded by DNA that, if substituted for both wildtype alleles encoding full-length DNAJB1 protein in auditory hair cells, or ocular cells, of what is otherwise a wildtype mammal, and if expressed in the auditory hair cells, or ocular cells, of that mammal, results in that mammal's having a level of hearing, or vision, approximating the normal level of hearing, or vision, of a similar mammal that is entirely wildtype. Non-limiting examples of active DNAJB1 proteins are full-length DNAJB1 proteins (e.g., any of the full-length DNAJB1 proteins described herein).
[0282] For example, an active DNAJB1 protein can include a sequence of a wildtype, full-length DNAJB1 protein (e.g., a wildtype, human, full-length DNAJB1 protein) including about 1 to about 200 amino acid substitutions (e.g., about 1 to 190 amino acid substitutions, about 1 to about 180 amino acid substitutions, about 1 to about 160 amino acid substitutions, about 1 to about 150 amino acid substitutions, about 1 to about 140 amino acid substitutions, about 1 to about 130 amino acid substitutions, about 1 to about 120 amino acid substitutions, about 1 to about 110 amino acid substitutions, about 1 to about 100 amino acid substitutions, about 1 to about 90 amino acid substitutions, about 1 to about 80 amino acid substitutions, about 1 to about 70 amino acid substitutions, about 1 to about 60 amino acid substitutions, about 1 to about 50 amino acid substitutions, about 1 to about 40 amino acid substitutions, about 1 to about 30 amino acid substitutions, about 1 to about 25 amino acid substitutions, about 1 to about 20 amino acid substitutions, about 1 to about 10 amino acid substitutions, about 1 to about 5 amino acid substitutions, about 10 to about 200 amino acid substitutions, about 10 to about 180 amino acid substitutions, about 10 to about 160 amino acid substitutions, about 10 to about 150 amino acid substitutions, about 10 to about 140 amino acid substitutions, about 10 to about 120 amino acid substitutions, about 10 to about 100 amino acid substitutions, about 10 to about 80 amino acid substitutions, about 10 to about 60 amino acid substitutions, about 10 to about 50 amino acid substitutions, about 10 to about 40 amino acid substitutions, about 10 to about 20 amino acid substitutions, about 20 to about 200 amino acid substitutions, about 20 to about 180 amino acid substitutions, about 20 to about 160 amino acid substitutions, about 20 to about 150 amino acid substitutions, about 20 to about 140 amino acid substitutions, about 20 to about 120 amino acid substitutions, about 20 to about 100 amino acid substitutions, about 20 to about 80 amino acid substitutions, about 20 to about 60 amino acid substitutions, about 20 to about 50 amino acid substitutions, about 20 to about 40 amino acid substitutions, about 40 to about 200 amino acid substitutions, about 40 to about 180 amino acid substitutions, about 40 to about 160 amino acid substitutions, about 40 to about 150 amino acid substitutions, about 40 to about 140 amino acid substitutions, about 40 to about 120 amino acid substitutions, about 40 to about 100 amino acid substitutions, about 40 to about 80 amino acid substitutions, about 40 to about 60 amino acid substitutions, about 40 to about 50 amino acid substitutions, about 50 to about 200 amino acid substitutions, about 50 to about 180 amino acid substitutions, about 50 to about 160 amino acid substitutions, about 50 to about 150 amino acid substitutions, about 50 to about 140 amino acid substitutions, about 50 to about 120 amino acid substitutions, about 50 to about 100 amino acid substitutions, about 50 to about 80 amino acid substitutions, about 50 to about 60 amino acid substitutions, about 60 to about 200 amino acid substitutions, about 60 to about 180 amino acid substitutions, about 60 to about 160 amino acid substitutions, about 60 to about 150 amino acid substitutions, about 60 to about 140 amino acid substitutions, about 60 to about 120 amino acid substitutions, about 60 to about 100 amino acid substitutions, about 60 to about 80 amino acid substitutions, about 80 to about 200 amino acid substitutions, about 80 to about 180 amino acid substitutions, about 80 to about 160 amino acid substitutions, about 80 to about 150 amino acid substitutions, about 80 to about 140 amino acid substitutions, about 80 to about 120 amino acid substitutions, about 80 to about 100 amino acid substitutions, about 100 to about 200 amino acid substitutions, about 100 to about 180 amino acid substitutions, about 100 to about 160 amino acid substitutions, about 100 to about 150 amino acid substitutions, about 100 to about 140 amino acid substitutions, about 100 to about 120 amino acid substitutions, about 120 to about 200 amino acid substitutions, about 120 to about 180 amino acid substitutions, about 120 to about 160 amino acid substitutions, about 120 to about 150 amino acid substitutions, about 120 to about 140 amino acid substitutions, about 140 to about 200 amino acid substitutions, about 140 to about 180 amino acid substitutions, about 140 to about 160 amino acid substitutions, about 140 to about 150 amino acid substitutions, about 150 to about 200 amino acid substitutions, about 150 to about 180 amino acid substitutions, about 150 to about 160 amino acid substitutions, about 160 to about 200 amino acid substitutions, about 160 to about 180 amino acid substitutions, or about 180 to about 200 amino acid substitutions).
[0283] One skilled in the art would appreciate that amino acids that are not conserved between wildtype DNAJB1 proteins from different species can be mutated without losing activity, while those amino acids that are conserved between wildtype DNAJB1 proteins from different species should not be mutated as they are more likely (than amino acids that are not conserved between different species) to be involved in activity.
[0284] An active DNAJB1 protein can include, e.g., a sequence of a wildtype, full-length DNAJB1 protein (e.g., a wildtype, human, full-length DNAJB1 protein) that has about 1 to about 100 amino acids (e.g., about 1 to about 95 amino acids, about 1 to about 90 amino acids, about 1 to about 85 amino acids, about 1 to about 80 amino acids, about 1 to about 75 amino acids, about 1 to about 70 amino acids, about 1 to about 65 amino acids, about 1 to about 60 amino acids, about 1 to about 55 amino acids, about 1 to about 50 amino acids, about 1 to about 45 amino acids, about 1 to about 40 amino acids, about 1 to about 35 amino acids, about 1 to about 30 amino acids, about 1 to about 25 amino acids, about 1 to about 20 amino acids, about 1 to about 15 amino acids, about 1 to about 10 amino acids, about 1 to about 5 amino acids, about 5 to about 100 amino acids, about 5 to about 95 amino acids, about 5 to about 90 amino acids, about 5 to about 85 amino acids, about 5 to about 80 amino acids, about 5 to about 75 amino acids, about 5 to about 70 amino acids, about 5 to about 65 amino acids, about 5 to about 60 amino acids, about 5 to about 55 amino acids, about 5 to about 50 amino acids, about 5 to about 45 amino acids, about 5 to about 40 amino acids, about 5 to about 35 amino acids, about 5 to about 30 amino acids, about 5 to about 25 amino acids, about 5 to about 20 amino acids, about 5 to about 15 amino acids, about 5 to about 10 amino acids, about 10 to about 100 amino acids, about 10 to about 95 amino acids, about 10 to about 90 amino acids, about 10 to about 85 amino acids, about 10 to about 80 amino acids, about 10 to about 80 amino acids, about 10 to about 75 amino acids, about 10 to about 70 amino acids, about 10 to about 65 amino acids, about 10 to about 60 amino acids, about 10 to about 55 amino acids, about 10 to about 50 amino acids, about 10 to about 45 amino acids, about 10 to about 40 amino acids, about 10 to about 35 amino acids, about 10 to about 30 amino acids, about 10 to about 25 amino acids, about 10 to about 20 amino acids, about 10 to about 15 amino acids, about 15 to about 100 amino acids, about 15 to about 95 amino acids, about 15 to about 90 amino acids, about 15 to about 85 amino acids, about 15 to about 80 amino acids, about 15 to about 75 amino acids, about 15 to about 70 amino acids, about 15 to about 65 amino acids, about 15 to about 60 amino acids, about 15 to about 55 amino acids, about 15 to about 50 amino acids, about 15 to about 45 amino acids, about 15 to about 40 amino acids, about 15 to about 35 amino acids, about 15 to about 30 amino acids, about 15 to about 25 amino acids, about 15 to about 20 amino acids, about 20 to about 100 amino acids, about 20 to about 95 amino acids, about 20 to about 90 amino acids, about 20 to about 85 amino acids, about 20 to about 80 amino acids, about 20 to about 75 amino acids, about 20 to about 70 amino acids, about 20 to about 65 amino acids, about 20 to about 60 amino acids, about 20 to about 55 amino acids, about 20 to about 50 amino acids, about 20 to about 45 amino acids, about 20 to about 40 amino acids, about 20 to about 35 amino acids, about 20 to about 30 amino acids, about 20 to about 25 amino acids, about 25 to about 100 amino acids, about 25 to about 95 amino acids, about 25 to about 90 amino acids, about 25 to about 85 amino acids, about 25 to about 80 amino acids, about 25 to about 75 amino acids, about 25 to about 70 amino acids, about 25 to about 65 amino acids, about 25 to about 60 amino acids, about 25 to about 55 amino acids, about 25 to about 50 amino acids, about 25 to about 45 amino acids, about 25 to about 40 amino acids, about 25 to about 35 amino acids, about 25 to about 30 amino acids, about 30 to about 100 amino acids, about 30 to about 95 amino acids, about 30 to about 90 amino acids, about 30 to about 85 amino acids, about 30 to about 80 amino acids, about 30 to about 75 amino acids, about 30 to about 70 amino acids, about 30 to about 65 amino acids, about 30 to about 60 amino acids, about 30 to about 55 amino acids, about 30 to about 50 amino acids, about 30 to about 45 amino acids, about 30 to about 40 amino acids, about 30 to about 35 amino acids, about 35 to about 100 amino acids, about 35 to about 95 amino acids, about 35 to about 90 amino acids, about 35 to about 85 amino acids, about 35 to about 80 amino acids, about 35 to about 75 amino acids, about 35 to about 70 amino acids, about 35 to about 65 amino acids, about 35 to about 60 amino acids, about 35 to about 55 amino acids, about 35 to about 50 amino acids, about 35 to about 45 amino acids, about 35 to about 40 amino acids, about 40 to about 100 amino acids, about 40 to about 95 amino acids, about 40 to about 90 amino acids, about 40 to about 85 amino acids, about 40 to about 80 amino acids, about 40 to about 75 amino acids, about 40 to about 70 amino acids, about 40 to about 65 amino acids, about 40 to about 60 amino acids, about 40 to about 55 amino acids, about 40 to about 50 amino acids, about 40 to about 45 amino acids, about 45 to about 100 amino acids, about 45 to about 95 amino acids, about 45 to about 90 amino acids, about 45 to about 85 amino acids, about 45 to about 80 amino acids, about 45 to about 75 amino acids, about 45 to about 70 amino acids, about 45 to about 65 amino acids, about 45 to about 60 amino acids, about 45 to about 55 amino acids, about 45 to about 50 amino acids, about 50 to about 100 amino acids, about 50 to about 95 amino acids, about 50 to about 90 amino acids, about 50 to about 85 amino acids, about 50 to about 80 amino acids, about 50 to about 75 amino acids, about 50 to about 70 amino acids, about 50 to about 65 amino acids, about 50 to about 60 amino acids, about 50 to about 55 amino acids, about 55 to about 100 amino acids, about 55 to about 95 amino acids, about 55 to about 90 amino acids, about 55 to about 85 amino acids, about 55 to about 80 amino acids, about 55 to about 75 amino acids, about 55 to about 70 amino acids, about 55 to about 65 amino acids, about 55 to about 60 amino acids, about 60 to about 100 amino acids, about 60 to about 95 amino acids, about 60 to about 90 amino acids, about 60 to about 85 amino acids, about 60 to about 80 amino acids, about 60 to about 75 amino acids, about 60 to about 70 amino acids, about 60 to about 65 amino acids, about 65 to about 100 amino acids, about 65 to about 95 amino acids, about 65 to about 90 amino acids, about 65 to about 85 amino acids, about 65 to about 80 amino acids, about 65 to about 75 amino acids, about 65 to about 70 amino acids, about 70 to about 100 amino acids, about 70 to about 95 amino acids, about 70 to about 90 amino acids, about 70 to about 85 amino acids, about 70 to about 80 amino acids, about 70 to about 75 amino acids, about 75 to about 100 amino acids, about 75 to about 95 amino acids, about 75 to about 90 amino acids, about 75 to about 85 amino acids, about 75 to about 80 amino acids, about 80 to about 100 amino acids, about 80 to about 95 amino acids, about 80 to about 90 amino acids, about 80 to about 85 amino acids, about 85 to about 100 amino acids, about 85 to about 95 amino acids, about 85 to about 90 amino acids, about 90 to about 100 amino acids, about 90 to about 95 amino acids, or about 95 to about 100 amino acids), removed from its N-terminus and/or 1 amino acid to 80 amino acids (or any of the subranges of this range described herein) removed from its C-terminus.
[0285] In some embodiments, an active DNAJB1 protein can, e.g., include the sequence of a wildtype, full-length DNAJB1 protein where 1 amino acid to 50 amino acids, 1 amino acid to 45 amino acids, 1 amino acid to 40 amino acids, 1 amino acid to 35 amino acids, 1 amino acid to 30 amino acids, 1 amino acid to 25 amino acids, 1 amino acid to 20 amino acids, 1 amino acid to 15 amino acids, 1 amino acid to 10 amino acids, 1 amino acid to 9 amino acids, 1 amino acid to 8 amino acids, 1 amino acid to 7 amino acids, 1 amino acid to 6 amino acids, 1 amino acid to 5 amino acids, 1 amino acid to 4 amino acids, 1 amino acid to 3 amino acids, about 2 amino acids to 50 amino acids, about 2 amino acids to 45 amino acids, about 2 amino acids to 40 amino acids, about 2 amino acids to 35 amino acids, about 2 amino acids to 30 amino acids, about 2 amino acids to 25 amino acids, about 2 amino acids to 20 amino acids, about 2 amino acids to 15 amino acids, about 2 amino acids to 10 amino acids, about 2 amino acids to 9 amino acids, about 2 amino acids to 8 amino acids, about 2 amino acids to 7 amino acids, about 2 amino acids to 6 amino acids, about 2 amino acids to 5 amino acids, about 2 amino acids to 4 amino acids, about 3 amino acids to 50 amino acids, about 3 amino acids to 45 amino acids, about 3 amino acids to 40 amino acids, about 3 amino acids to 35 amino acids, about 3 amino acids to 30 amino acids, about 3 amino acids to 25 amino acids, about 3 amino acids to 20 amino acids, about 3 amino acids to 15 amino acids, about 3 amino acids to 10 amino acids, about 3 amino acids to 9 amino acids, about 3 amino acids to 8 amino acids, about 3 amino acids to 7 amino acids, about 3 amino acids to 6 amino acids, about 3 amino acids to 5 amino acids, about 4 amino acids to 50 amino acids, about 4 amino acids to 45 amino acids, about 4 amino acids to 40 amino acids, about 4 amino acids to 35 amino acids, about 4 amino acids to 30 amino acids, about 4 amino acids to 25 amino acids, about 4 amino acids to 20 amino acids, about 4 amino acids to 15 amino acids, about 4 amino acids to 10 amino acids, about 4 amino acids to 9 amino acids, about 4 amino acids to 8 amino acids, about 4 amino acids to 7 amino acids, about 4 amino acids to 6 amino acids, about 5 amino acids to 50 amino acids, about 5 amino acids to 45 amino acids, about 5 amino acids to 40 amino acids, about 5 amino acids to 35 amino acids, about 5 amino acids to 30 amino acids, about 5 amino acids to 25 amino acids, about 5 amino acids to 20 amino acids, about 5 amino acids to 15 amino acids, about 5 amino acids to 10 amino acids, about 5 amino acids to 9 amino acids, about 5 amino acids to 8 amino acids, about 5 amino acids to 7 amino acids, about 6 amino acids to 50 amino acids, about 6 amino acids to 45 amino acids, about 6 amino acids to 40 amino acids, about 6 amino acids to 35 amino acids, about 6 amino acids to 30 amino acids, about 6 amino acids to 25 amino acids, about 6 amino acids to 20 amino acids, about 6 amino acids to 15 amino acids, about 6 amino acids to 10 amino acids, about 6 amino acids to 9 amino acids, about 6 amino acids to 8 amino acids, about 7 amino acids to 50 amino acids, about 7 amino acids to 45 amino acids, about 7 amino acids to 40 amino acids, about 7 amino acids to 35 amino acids, about 7 amino acids to 30 amino acids, about 7 amino acids to 25 amino acids, about 7 amino acids to 20 amino acids, about 7 amino acids to 15 amino acids, about 7 amino acids to 10 amino acids, about 7 amino acids to 9 amino acids, about 8 amino acids to 50 amino acids, about 8 amino acids to 45 amino acids, about 8 amino acids to 40 amino acids, about 8 amino acids to 35 amino acids, about 8 amino acids to 30 amino acids, about 8 amino acids to 25 amino acids, about 8 amino acids to 20 amino acids, about 8 amino acids to 15 amino acids, about 8 amino acids to 10 amino acids, about 10 amino acids to 50 amino acids, about 10 amino acids to 45 amino acids, about 10 amino acids to 40 amino acids, about 10 amino acids to 35 amino acids, about 10 amino acids to 30 amino acids, about 10 amino acids to 25 amino acids, about 10 amino acids to 20 amino acids, about 10 amino acids to 15 amino acids, about 15 amino acids to 50 amino acids, about 15 amino acids to 45 amino acids, about 15 amino acids to 40 amino acids, about 15 amino acids to 35 amino acids, about 15 amino acids to 30 amino acids, about 15 amino acids to 25 amino acids, about 15 amino acids to 20 amino acids, about 20 amino acids to 50 amino acids, about 20 amino acids to 45 amino acids, about 20 amino acids to 40 amino acids, about 20 amino acids to 35 amino acids, about 20 amino acids to 30 amino acids, about 20 amino acids to 25 amino acids, about 25 amino acids to 50 amino acids, about 25 amino acids to 45 amino acids, about 25 amino acids to 40 amino acids, about 25 amino acids to 35 amino acids, about 25 amino acids to 30 amino acids, about 30 amino acids to 50 amino acids, about 30 amino acids to 45 amino acids, about 30 amino acids to 40 amino acids, about 30 amino acids to 35 amino acids, about 35 amino acids to 50 amino acids, about 35 amino acids to 45 amino acids, about 35 amino acids to 40 amino acids, about 40 amino acids to 50 amino acids, about 40 amino acids to 45 amino acids, or about 45 amino acids to about 50 amino acids, are inserted. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) can be inserted as a contiguous sequence into the sequence of a wildtype, full-length DNAJB1 protein. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) are inserted in multiple, non-contiguous places in the sequence of a wildtype, full-length DNAJB1 protein. As can be appreciated in the art, the 1 amino acid to 50 amino acids can be inserted into a portion of the sequence of a wildtype, full-length DNAJB1 protein that is not well-conserved between species.
[0286] Methods of detecting mutations in a gene are well-known in the art. Non-limiting examples of such techniques include: real-time polymerase chain reaction (RT-PCR), PCR, sequencing, Southern blotting, and Northern blotting.
[0287] Exemplary wildtype DNAJB1 protein sequences are or include SEQ ID NO: 38, 41 43, 44, and 45. Exemplary DNA sequences that encode a NDP protein and exemplary polypeptides encoded by an NDP gene are shown below.
DNAJB1 Polynucleotides
[0288] Among other things, the present disclosure provides polynucleotides, e.g., polynucleotides comprising an DNAJB1 gene or characteristic portion thereof, as well as compositions including such polynucleotides and methods utilizing such polynucleotides and/or compositions.
[0289] In some embodiments, a polynucleotide comprising an DNAJB1 gene or characteristic portion thereof can be DNA or RNA. In some embodiments, DNA can be genomic DNA or cDNA. In some embodiments, RNA can be an mRNA. In some embodiments, a polynucleotide comprises exons and/or introns of an DNAJB1 gene.
[0290] In some embodiments, a gene product is expressed from a polynucleotide comprising an DNAJB1 gene or characteristic portion thereof. In some embodiments, expression of such a polynucleotide can utilize one or more control elements (e.g., promoters, enhancers, splice sites, poly-adenylation sites, translation initiation sites, etc.). Thus, in some embodiments, a polynucleotide provided herein can include one or more control elements.
[0291] In some embodiments, an DNAJB1 gene is a mammalian DNAJB1 gene. In some embodiments, an DNAJB1 gene is a murine DNAJB1 gene. In some embodiments, an DNAJB1 gene is a primate DNAJB1 gene. In some embodiments, a DNAJB1 gene is a human DNAJB1 gene. An exemplary human DNAJB1 genomic sequence is or includes SEQ ID NO: 112. An exemplary human DNAJB1 cDNA sequence is or includes the sequence of SEQ ID NO: 156 or 158. An exemplary human DNAJB1 cDNA sequence including untranslated regions is or includes the sequence of SEQ ID NO: 113, 157, or 159.
TABLE-US-00009 Exemplary Human DNAJB1 Genomic Sequence (SEQ ID NO: 112) ACTTTATTCATCCTTACAAACAGGAATAATAATAGTTCTTATACAAGGTAACAGAATCTCACCA GCACAAGAGTAAGCACTTGCTAAATTGTTATTTTGAATTTTAATAACAAATTTTATTATTTGAT GATAGTTAGTTAAAAGTCAGCAATAATTCAGGGATAATTGGTTGAATTATTAATATTATATTAT CACCTAGGGCGATAAGCACATCAGAAGACTTCGTTTTAAGGGAAAAAGAGACCCAAGTTAGGAG TCAGAAGACTTGGGCTTCAAGAGTGGCTGTGGGTGGGTATAAGATTTCTCCTTGGTGACACCCA AGGGTTGACGTAGATGACCTCTTTGATAACCAGAAAGCAAGATCAGCTTCGCTGCTCTGCGAAA GCCAGCCGTGGAGCTGCACCTCCCTCACCGCCTAAGTCTCCCGGGAACGTCCCGGGCGAGGGAA GGGGTCCAAAGGTCGTACACAAGAATACTGATAGCAAAGCCCCTTCCTACGCTGGTGACGGCGG CGTGGCGCAAGATTTGTGCAAGACTCCCTCGGATTGGGAGCAAGGGTCCCGCACCTTCGTGGCT CCCATGACAAAGTCTGCAACTCAGCCCGCTGGGGGAGCTGCAAGGAACCTGCAAGCGTCCCAGC CCCTGCAGAGACCCCCAAGAAAAAGTACCCCCAAAACAGCCTGGGTTGCAGATTTATAAATACT TCCTGCGCACATGCGCATTGGAATTTACGAGTGGCCCGGGGCGCAACGCTTGCCCACTGCGCCT GCGCCGTCTGGCTCTTCCTTCGGGGCACAGGACCAGAAAGTGGGGTCCCGTGGTCCCGCAAAGA AGGAAAAAGAATGGGTCAGCAGTGGCCACACGGCCCCTGTTTCTCGCTTCTTACCTCGTAATGC TTCATGGCTCCCTCCCACGGCGGCTACTGCTCCGCGGCTGCTGCTGCCTAACTGCGCGGCACAG CACAGGCTCCCTACAGCGCTCGCAACCGCAGCGATAGACTAAACGCGGCTCTGCGTCCGCCCCG CCCCTCCAGGCCGCGCGCGCCCCGTCGGCCAATCAGGGCACGAGGCGCCCACGTTCGCCCCTGC TCCTTGGGGCCGGTCCGAACCAAGACTAGGACTAGCACCAGGGGATTGACCAATCAACTTGCGA GACCAATCGAAGGGGGCGGAACGCCTGTAGGAGGGGCTAAAGAGAAGGGGCTTGACCATCCACC AATCCAAAGGAGGTCTCTGCCCCGCGCGTCCCTTTGCCACGCCCCCTGATGGCGTCGCTGTGGA AACCAAGGTAAGCGACGGTTAGGCCAGACGCGGGGGCGGGGTAAGAAGTTGAGTGACAGGCAAG GCAGCATCCACATGACAGGCGGCGCCGAAGGGGTAAATTCTGAGGTGGGCGGCCCCCGGGGTAC ACCGGGAACAGCAGGAGAGGGCTAGGGGCTGGGGTCGCGGGCTGCGGAGTCTGGTTGGCGAGGA GGTCACTATGGGAGGAGACTCTTGAGTGGGAGGGAAGGAAAGGCGAAACGGAGTCGCGAAAAGC TCCCCATTTGGGAACCCCCAACCTGCAAGAACCTTGAGCCCCAGCCCCATTCTGGGGTGGCTTC ACTGCCGTTTTTAATGAAAGCCCCGCCCCACTTTGTTTTTCTGTTTTGTTTTGTTTTTGAAACA ATCTCGTTCTGTCCCCCAGCTGGAGTGCAGTGGCGCAATGACGGCTCACGCAACCTCCGCCTCC CGGGTTCAAGCGATTCTCATGCCTCAGCCTCCCAAGTAGATGAGATTACAGGCACCCGCCACCA CGCCCGACTAATTTTTGTATTTTTTTTAGTAGAGACGGGGTTTCCCCATGGTTGGCCAGGCTGG TCTTGAACTCCTGACCTCAGGTGATCCTCCCGCCTCAGCCTCCCAAAGTGCTAGGATTACAGGC CTGAGCCACCCCGCCCGGCCCCCCTCCACACTTTGCTCCGCCCTATGGACAGGGGAACTACATT GGTCTCTGCCTGACGTCCAGAATCGTGTCTAGTGGCTCCAATGGGGCCACTGAGCCCTTGAAAT GGGGCTGGTCCAGGCCGGGAACGATGGCTCACACCTGTAATCCTAGCACCTTGGGAGGCCGAGG GGGGCAGATCACGAGGTGAGGAGTTCGAGACCAGCCTGGCAGATATGGTGAAACCTCATCGCTA CTAAAAATACAAAAATTAGCCGGGCGTGGTGGTACACACCTGTAGTCCCAGCTAATCAGGAGGC TGAGGCAGAAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATCTCGCCACTA CACTCCAGCCTGAGCAACAGACCGTCTCAAAAAAAAAAAAAGAAAAAAGAAAAAGAAATGGGAC TGGTCCAAATTGAGATGTGCTGTAAATGAAAAACACATAAATTTCTGGCCAGGCACGGTGGCTC ACACCTGTAATCCCAGCACTTCGGGAGGCCAAGACAGGCAGATCACATGAGGTCAGGAATTCAA GACCAGCCTGGACAACGTGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTA ATGGTGGGCACCTGTAATCCCAGCTACTAGGGAGGCTGAGAATCGCTTGAACCCGGGAGGTGGA GGTTGCAGTGAGCCAAGATCGCACCACTGCACTCCAGCCTGGGCGACAGAGTGAGATTCTGTCT CAAAAAAAAAAGAAAAAAAAAGAAAAGACTTTTGACAGACAAAGGAACAGGCAGTACCTGGCAC ACTTCTTCACCTCCTCTCCTTACTCTTGTTTCCAGATCCTGCCCCTGAGCTTTCATGAGCTGTT GAACCATCTGGAATTCACAGGCCTGTCATGAGAGACACGATGAGAAGTCCTTAAAGGTAGATCA CTGATTCACAGGGGAGCAGGCGGAGGCAAGGGTGAGTCAGTGCTTGGAACTCAGTCATCCAGAT TTGGCTCTGGAAACTTCTGAAGCTGTAGCCTTTGGGGATCCCTGACTGCGAGTACAGGAAGCCA ACGCTATGTGGTCTTCTGGAAACTCATTATCTTTTTCACTGGTGCTATCTGGGAAAAACAGATG AAAACCTGAAGGTGTTCTGTATGTGTGCTTTCAAAAGCAAGGATCTGGCCGGACGCAGTGGCTC AGGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGAGGATCACCTGAGGTCAGGAGTTTGA GACCAGCTTGGCCAACATGGCGAAACCATCTCTACTAAAAGTACAAAAATTATCTGGGTGTGGT GGTGGGCACCTGTAATCACAGCTACTCAAGTAGCTGAGGCAGAAGAATCAGTTGAACCCAGGAG GCAGAGGTTGCAGTGAGCAGAGATCACACCACTGCACTCCAGCCTGGGTGACAAGAATGAAACT CCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAGAGATATTTTTCTGGTGGGAG TTTGAGTTTTGAGGCCTTCTGTGTCATGACAGGCCTCACTCCCTGCCCTTCTTGGGTGCTGGGG GAGAGTATTTGCTTCCAGGCCACAAGTTAATTTTTTAGTAGAGATGGGATCTCATTATGTTGCC CAGGCTGCTCTGGAACTCCTGGCCTCAAGTGATCCTCCCTCCTCGGCCTCCCAAAGCGCTGGGA TTACAGGCATGAGCCATCCCACTCGGCCTGAGACCACAGTTAAAATCCTATCAGATTGAGTTAC CTGAGGTACTTGATTGTGAGGGGCCATGGGGATAGGGTTGAAACCCTAAGACCCATCCATGTGT TGCTCCTTCACCAGTTATGCAAATTTTCAACTTATTTAATGTGATCTGAAGAGGTGAGTCTTCC ATCTCTGGGTTGCAGTTAGATAGGGCATGATGGTGTACAAGTAGGTGTCATTTTGAAGAGATTC CATACATATTCCCATAAACTCAGAACGTTTTGGAAATTCCCTTTTGAGACTGCTTTTCTGAGCC CCACACTTACTCAACAGGCACCAAATCATCCTCATGCACCTACTATGCTGCGTGTCAACCGGAA CATACAAATAATTTCCACCCCCTCCTCACCCATAAGAAGCTGGCAGAAAGGCAAGATCCAGCCT CCAAAGTAGAATTCCTCTTTGTTTTTGAGGGGTGTTGCAGGCTTGAGGAACCCCTTGTTTGCCC ATCTGGCTTCCGGTAACCAGGCTGTCAGCAGAAATGAAACCCACCCTTGGAATGAAGACTGGCC ACTTTGTTCAAGCCACTGAGGACACTGTACTGTGGCCCCCTGACAGTGCCTGTCTGTAGGGACA GCTAGATCCTGCTGGTCCCTTCACAGTCCCAGCAGTGGCTGATTCACAATGGTGACTGGTAGAT CTGTTGGGTCGTTGTTGAGCTGATGGTAACTAGCTCCTTTATTAAAGTAATTTCCAAGGGAGAA ACTGCAAGAGACGGCCTAAGCTGCGGCAGCTTCCAAGGATGAGCTTTTCTGGACCTCAGGTTTG GGGGGAACCCGAGACCATTTGAATGCCAGAGATGTGATGTGTTTGGTAATGTCGTGTCTCACAA GCTCCTTTTGTATTTCCCATGGGCAGATCAGGTCGGATCTTGACTTTTTCCAGAACCTGGCTTG GCTGGTTTGGGTTGTGTATGAATGAGCTGGCTCGCTGGGTGTGTGAGGGGAGGTGAAGCTGGGA CTAGACAAGCCAGGGCTCGTTAGCAAGTTTCTGCTGTGTCAGGAATCCCAGAGAGAGGTTTCCC AAGCCCTATCCTGTCTTGCCTGGGTTGTTTGGTTTTTCTCCTTCTTTTTTTTTTTTTCATTGAA GTGTAATTGACATGTAGTAAAATTCACCCATTTTAGTGTAGAGCTCTGAACACATACGGTCATG CAACCACCACTAGAATCAAGATACAGAGCATTTACACCAACCCAAAAAATTCTCCAAACTACTC CATTTGTTGCAGGTCCCAGCCCCTGGCAAGCATTCATCTGCTTTCTGTCCTGATACTTTTGCCT TTTCCAGAAGGTTGAGTGAAATGGAATCATGCAGCCTTTTGAGCCTGGCTTCTTTCACCCACAT AATGCATATGAGGTTCATCTGTAATGTTGTTTTTTTGTATCCGTAGCTCATTCCTTTTTATAGC TGAATAGTATTCCATTGTGTGGATGGATCCCATTTGTTTATCCATTCATCCATTGAAGGACAGT TAGGGTTTAACTGCTGTTGGTAATAACTAATAAAGCCTCAGCAAACATTCGCTCATGAGACTCC ATCTCAAAAAAAAAAACAAAAAAAGTAGATTTTACCCTGCTGGTGTTGCTATGTTGATGAAATT GAGAGGAGCCAGGAAATGAACATTATAATAGTATCTTTTTTATTTTTATTTTTGAGATGGAGTC TCGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCAACCTCCACCTTCT GGGTTCAAGCGATTCTCCTGCATCAGCCTCCCGAATAGCTGGGTTACAGGTGTGCACCACCATG CCCAGCTAATTTTTGTGTTTTTAGTACAGACGGAATTTCACCATGTTGGTCAGGATGGTCTCCA TCTCTTGACCTTGTGATCCTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCA CCGCACCCAGCCATGAATTTTTATATATGAACAAAGGTTTTCATTTCACATGGGTATCCCACTG GGAGTGGGATTGTTGGCTCATATGGCTATAGCGCTTTTTGAAGGTCTTTTTTTTTTGAGAACGA GTCTCGCTCTGTTGTCCAGGCTGGAGTGCAGTGGCACCATCTCGGCTCATTGCAACCTCCACCT CCTAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAATAGCTGGGATTACAGGTGCACGCCAC CATGCCTGACTAATTTTTGCATTTTTAATAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTC TCGAACTCCTGACTTCAAGGGATCCACCTGCCTCAGCCTCCCAAAGTGCTGAGATTACAGGCAT GAGCCACCGCACCCAGCCTTGAAGATCTTTTCCAAATTTATTTGGATTTTTATTTATTTTTGAG ACAGGGTCTCACTGTCTCTCAGGCTAGTGTGCAGTGGCACACTCATAGCTCACTGCAGCTTGGA ATTCCTGAGCTCAAGTGATCCTCCCACCTCAGCCTCCCAAGTAGCTGGCACGTGCCACCATGCC CAGCTTATTTCTTTGTCATTCTTTGCAGATATGGGGTCTCACTATGTGGGCCAGGCTGATCTGG AACTCCTGGGCTCAAGTGATCCTCCTGCCTTGGCCTCCAAAAGTGCTGGGATTACAGATGTGAG CCACCGCACCTGGCCTGAAGGTCTTTTTTTCAACAGGTGAAGAGCTGAAGACAGGAGTAATGCA ATTTTCTTAAAATTAAGGCTTTATGATCATCTGAGTCACATTATACACAAGCTGAGTGTCTAGT GTAAGCACTCAGTATGCTAAACACTTCAGCATTTTACTCTCTGGGTGACCCTGGGGAAGTCACA TGACCTCTCAGAATCTCCATCTCCCCACCTGTAAAATGGGTGTAATGATAGCCCAACCTCATAG GGCTGTTGTGACAAGTATATGAGTTAATATTCATCGAGTACTTGGAACAGGCTTGGCCATGTCA GTGTTAGATGTTATTATAGGCCAGACATGGTGGCTTATGCCTGTAATCCCAACACTTGGGGAGG CCAAGGCCGGCGGATCACCTGAGCTCAGGTGTTCGAGACCAACCTGAGCAACATGGCAAAACCC CATCTCTACCAAAACCATAATACAAAAAATTAGCCGGGCATGGTGGCAGGCACCTGTGATCCCA ACTACTCAGGAGGCTGGGGCAGGAGGATCATTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCA AGATTGTGCCACTGTGCTCCAGCATGGGTGACAGTGTGAGACCCCATCTCAAAAAAAAAAAAAA ACAAAAACAAGGCCGGGTGTAGGGTTCAACCCTTGTAATCCCAGCACTTTGGGAGGCCAAGGTG GGTGGATCATGAGATCAGGAGTTCGAGATTAGCCTGGCCAACATGGTGAAACCCAGTCTCTACT AAAAATACAAAAATTAGCCAGGTGTGGTGGTAGGCACCTATAATCCCAGCTACTCAGGAGGCTG AGGCAGAGAATCGCTTGAACCCAGGAGGCGGAAGTTGCAGTGAGCTGAGATCACACCACTGCAC TCCAGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAATAAAAATAAAAATGTTATTATAATGT TCATTTCCTGTTCACTTCCTGTTCATTTCCTCTCAATTTCATCCACATAGCAACACCAGCAGCG TAAAATCTACTCTTTTCTTTTCTTTTCTTTTCGAGACAGAGTCTGGCTCTGTTGCCCAGGCTGG AGTACAGTGGCGTGATCTTGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGCGATTCTCCTGC CTCAGCCTCCCGAGTAGCTGGGAGTATGGGCACATGCCACCATGCCCAGCTAATTTTTGTGTTT TTAGTAGAGACACCACGTTGGACAGGCTGGTCTCAAACTCCTGACCTCAAGTGATCCACCCGCC TCAGCTTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCACCCGGCCTAATCTACTCTTAT CTCCAATTTATGGAGGACAGACCTAATGCTCCCAGAGATCAAGCAGGCTGTGGAAGATCACACA CGTAGGAAATAAGGAAACAGTTGGTCCAGGATTTGAACTAAAGCAACTGTCCTCAGACTCACTT GCATAGTTCCTGTATCAGTCAGGATTTTACCAAAGACACAGAGCCAGAGCCAGTAGGAGTGTGT GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTGTGTCTGTGTTTGT GTGCAGAGATTTATTTCAAGGAATTGGCTTACATGCTTGTGGGGGCTGGCTAGACAACACCAAA ATCTGCTGGGTGGGCTAGCAGGCTGGAAACTTAGGCAGGAGCTGATGCTTTTTCTGCCGGGAAA CCTCAGTGTTTGCTTTCAGTGTTGGCTTTTTTTTTTTTTTTTTTTTCTTCGAGACAGAGTCTTG CTCTGTCACCAGGCTGGAGTGCAGCGGCCTGATCTTGGCTCACTGCAACCTCTGCCTCCTGGGT TCAAGTAATTCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGACCCACCACACCC GGCCTCAGTGTTGGCTTTTTAAAGTCTCTAGACTGATTGGATGAGGCCCGCTTACATCCTCACA TCCTTGAGGGTCATGTCCTGTTTTTAAAGTCAAGCAACTGTAGACGTGCTAACCTAACCGCATC AACATAATAACTTCACAGCAACACCTAGATTAGTGTTTAGTTGAATAACTAGGTTCTAGAGCTT AGCCACGCTGACACAACAAACAACTATAACAGCTTTTCCAAGTTAAGAGCCCAATATCTGCTGA AGTTGACGCAGAGGCCATTTTTTCAGTAATATTCTGAAGTTCTCAGTGAGTGTTCAGGTCACAG TACCATGTCCATTAGGAGAGAGTTACTGGGTTTGTTTGTTTGTTTAATTTCCTGGCTCCTAGAA CTGATTAAAAAAAAAAAAATTAAAACTTCCCGAACTGGCTGGGCGTGGTGGCTCACGCCTGTAA TCCCAGCACCTTGGGAGATCAAGGCGTGTTGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTG GCCAACATGGTGAAACCCCATCTCTACTAAAACTACAAAAATTAGCCAGGCATGGTCGCAAGCA CCTGTAGTCCTAGCTACTTGGGAGGCTAAGGCAGGAGAATCACTTGAACCCAGGAGGCGGAGGT TGCAGTGAGCTGAGATCGTGCCACTGCACTCCAGCCTGGGCTACAAGAGTGAAACTCCATCTCA AAACAAACAAACAAAACTTCCTGAACATGACCCAGGCTTACTCAAGGAGGGGAGAGTTCCCAGC TCTAACACTGACCCCAAGCACAAAGACATCCCTTCCGCAGTGTTTTTAGCAAAATGTTCCCCTC GAGTGTGTCAAGAAACTTTTATTATTATTATTATTATTATTATTATTATTATTTTGAAACTCTC ACTCTGTCACCCAGTCTGGAGCACAGTGGCACGATCTTGGCTCACTGCAACCTCTGCCTCTCGG GTTGAAGCGAGTCTCATGCCTCAGCCTCCAGCGTAGCTGGGATTACAGGCACCTGCCACCGCAC CCGACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCACGTTGGCCAGACTGGTCTCGAA CTCCTGACCTCAGGTGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTATAGGCGTTGAGT CACCGTGCCTGGCCTATGGCAAGAAACTTTAGAACAACCAGAAAGGCCCACCCAGAGCCATGTT AAAGGCTTTGGAGGCTGCTGCACTTGTAATTCTAAATTAAATTTATATTATAAAATACTAGTTT ATAAAGTCACAAGCTCTGAGTTTTTCCCCATAATGCTCCTGATAGTTATTTATATAATTTTTTT ATGGGGAGGTTCAGGATCTCACTGTCATCCAGGCTGGAGAGCAGTGGCACAATCATAGCTGACT GCAGCCTCAAATTCCTGGGCTCAAGCTATCCTCCTACCTTGGTCCACTGAGTAGCTGGGACTAC AGGCTCACATCATTGCACCCGGCTAATTTTTTTTATTTTTAATTTTTTGTATAGATGGGGTCTC ACTATGTTGCCCAGGCTGGAACTCCTGGCCTCAAAAGATCCTCCCACCTCACCTTCCCAGAATG CTGGGATGACAGGCGTGAGCCACTGTGCCCAGCCTAGTAACTTTTTAAAAAATATCAAGTTCTC TCTCCCCATAATTGCTTCAGGACCCCTGCTCAAGTATGGTTCTCGATGTCTAGGGAATTTCAGC CCCATCCTCCCGGGTTGTCCAGTTCTAGTCCAGAGTTCTTGCTGATGCAGTTACCTCAGAGATC ATTGAGTGGGCCAAGAGAACCAGATCTGCCTGCTGGATGCTCCTGATAGTTATTTATGTAACTA TCCGCCATGATGCTCCTGATAGTTATTTATGTAACTTTTTTATGGGGAGGCTGAGGGTCTCACT GTCACCCAGGCTGGAGTGCAGTGGCACAATCATAGCTGCCTGCAGCCTCAAATTCCTGGGCTCA AGAGATCCTCCCGCCTCAGGAGGCAGGAGGATGGTCTTCTTCTTCAGGAGTCCCTGTCTGCCCA GGGGCAGCTGTCACTGGAATCTTTTAGCTGCCAGGAATGAGGTGATGGCTGAAGAGCCCACCAC TAAGAAAATGGTTAGTCACCCCTTTGACAATGCTTCCTTTGTGCCATGCCTGCTCTGAGCACTT TGCAAATGTTGACTCATATAAACTTCAAAGCCCTCATGATGTGGGCACAACTGCTATCCTCATT CCTCATTGTGCAGATGAGGAAGCTAAGGCCAGCAAAGGCCTAGTAACTAACCCAAGGTTACCCA GCCGGGAAATGCTGGAGCTGGGAATGAAGCCCAGCTGTCTGGTTCCAGAGCCAGTGTTACATGG CTCTGACCTCCCAGGCAGTTCTTCCAGGAACCTCCCCCACTTCCCTGGGCCCTTCCCAGGTGGA ACCCCTTTCTGAGCCTCCTTTTCCCATCAGCTCTGTCCTTGACAGAGGCACCCCAGACTTTCAG AGCCAAACGCGATCGTGTTGCCTACAATTGAGGAAACAGAGGCCTTGAAAGGTGTGTGACACAC CTATGTCCCCCCAAATTATAAAGGTAGTTGGGGAAGAAAAAGAAGCACGTGTGTAAGAACGGGG AATATTTATGAGGTTCTCACTGCGTTCCAGGCACTGGGGACCCAGTGCAGACAGGACCTGGCCC CTGCCCAGGTGGCGATGTCAGACTGGGAGGGAAGACACATTGAAAGCCAAGAGACAAGATAAAT TTCAAATTTAGCGGCAAGTGATCTTGAGAGGGCAGTAAAGGGAAGAAGGCGGGACTGGGAGGAC CTTGGGAGATGCGCTTCCGTCCAGATCCTCACCTTGCTTTAGGAGGTGAGAACAGAGGCAGTCT TCCGTCCCAAAATCCTTCTCCCCACGGTTGGGAGAATAGCGCTGGTGGGTTTTTTGTTCATTTT ACTTTATTTTACTTATTTTATTTATTTATGAGACGGAGTCTCGCTCTGTTGCCGAGGCTGGAGT GCAACGGCCGGATCTCGGCTCACTGCAACCTCCGCCTCCCCGATTCAAGCTATTCTCAGGCCTC AATCTCCTGAGTAGCTGGGATTACAGGTTCGTGCCACCAACGCCCGGGTAATTTTTATATTTTT AGCAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCG CCCGCCTCGGCCTCCCAAAATGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCTGTTTGTT AATCTTAAATAGAAACGGAGTCTCCTTATGTTGCCTAGGCTGGTCTCCAACTCCTAGGCTCAAG AGATCCTCCTGTCTTGGCCTCAGGAAGTGATGGGATTACAGGCATGAGGCTCCGCGCCCGGCCT GATAGGTTCTCTTATTGCCATGTTGTAGATAGAAGGGGGCCCCTGGGTTAGGCAGACACAGGTT AGGTAGTTCGTCCGGCGTCACCGGGCGGGAGCCCACCGCAGGCCCCACGAGACAAGGGCGAGCA GGTTCATCGCCTCCCGGGGAAAGGACTGGGTAGTCCCAGGCCCCACCCCTTCCTGGAACAGCAA GTTCGCTGTGGGTCCCGACCTACTGACCCAGGCGGAGGGCGGGACGCAGGGTAGCTGGGGCCGC GTAGAGAGGGAAAGAAGGTCGCGATTGGCTCGTCCCGAAAGGTGGGCGGGGCCTCCTCCGACCT GTGCGCGCGCGCCGCGGGGAGGTGTTGCCGAGGGCGGAGCGGGAGGGGGCGTGGCCCCGCGCGG GCGGCCGTTGACGGCGAGGCCCCGCCCCGGATGTCGCGTGTCGCTGAAAGGGCGGCGGCGATTG GCCGGCGCCGCGGGGGCGGGCGGGGCGGAAGGTTCTGGAGGGGGCTGGCGGGCTCTGGAAGCTT CCGCCGGACGGGTATATAGAGTCCGGGACTGGTCGGCGGCGGAGCCGGGGGACGGCGACAGCGG GTCGGCGGGCCGCAGGAGGGGGTCATGGGTAAAGACTACTACCAGACGTTGGGCCTGGCCCGCG GCGCGTCGGACGAGGAGATCAAGCGGGCCTACCGCCGCCAGGCGCTGCGCTACCACCCGGACAA GAACAAGGAGCCCGGCGCCGAGGAGAAGTTCAAGGAGATCGCTGAGGCCTACGACGTGCTCAGC GACCCGCGCAAGCGCGAGATCTTCGACCGCTACGGGGAGGAAGGTGTGTGCTCGCGGCCAGGGG CGCGGCCCCGCCTTTTGACAGACAGGGAAACTGAGGCACGCTGGCCCCTCTCGGCGGCCCCCCG GGAAGGACGCCCCGGGCCGTGGGGCCGAGCTGCCCCGCCCTCCGGCCTCGCGGGCCTCCGCCCT CGGATTGGCGGCCGCGCGGTGGGAGGAGGAGCCTTGGCCCAGCCGCTCGGCCAGAAGCTTCTAG CATGTCTGGGGCTGCCCCTCCCCGGGCGCCTCCTCCCGCGGCCCCGAGCAGCCCCGGGGTGCGG TGCATGGGGGTGGGGGAGCCGGGGGTGACGACGGGGACGGCGCGGCGGAGCCCGCTGCGGACCC GGGCTCACCTGGGCTCGGCCGCCGGGGTCCGCGGGGCGGCGCCTCCGGCTCAGCTGCGGGGCGA GGGGTTGTGAATGCAGGAGCCGACCCCGTTCGTGGGCTTGGGGGCTGGGTTGGGATAATTCCCG GGAAGTGATGACCTGGCCCCGGCAGGGCACGCAGGAGGCAGCGGCCGCCCAGGTCCGGAGAGCG GGGCCGCCTCGGAGGGTAAGGAGAGCAGGGCTTTTTGTTCCTGTGCCCGTTGATGATTCAGGCT GGCTTCTGGGCTTGATCTGGAGCTGACTTTTTGTGCAAGGATGGTGGCAGTGAATGCTAATGTG TGAGGATTTAAAAACTCCCTATTTATTTTTGTTTTTTTTTTTCTTCTCTTCCTACGCACCATTA GCCCTAAAACGTTGAGAGTAAACACTTACCGAAAGCCGACCCTGGACCATGTGTCTTTAGCCAT CTCTTGGTGATGTTGCCAGGAGGATGGGTCAGTCCAGGAGTGGCAGGGCCTAGGTGGCACCTGG AGGTGAACACCTAACCGCATGAGTCCTTTTCCACTGTCAGGCTACGCGGAAATACTGTTGGTAG GGGGGCGTCTCTGCTGTTGGCAGTGAGAGGTAGAAACTTGGGGTTCTTCCTTGCTCTCCCCTGA CAGACAAAAAAGTTCCTCCCCCATGGCTTCCAAGCTGTTTCCCCCTGTAAGGGAAGCTCGAGAG AGAGTCTGCTGTTCAGCCATCTGACTGCTTCTAAATCTGCTTTTCAGGCCTAAAGGGGAGTGGC CCCAGTGGCGGTAGCGGCGGTGGTGCCAATGGTACCTCTTTCAGCTACACATTCCATGGAGACC CTCATGCCATGTTTGCTGAGTTCTTCGGTGGCAGAAATCCCTTTGACACCTTTTTTGGGCAGCG GAACGGGGAGGAAGGCATGGACATTGATGACCCATTCTCTGGCTTCCCTATGGGCATGGGTGGC TTCACCAACGTGAACTTTGGCCGCTCCCGCTCTGCCCAAGAGCCCGCCCGAAAGAAGCAAGATC CCCCAGTCACCCACGACCTTCGAGTCTCCCTTGAAGAGATCTACAGCGGCTGTACCAAGAAGAT GAAAATCTCCCACAAGCGGCTAAACCCCGACGGAAAGAGCATTCGAAACGAAGACAAAATATTG ACCATCGAAGTGAAGAAGGGGTGGAAAGAAGGAACCAAAATCACTTTCCCCAAGGAAGGAGACC AGACCTCCAACAACATTCGAGCTGATATCGTCTTTGTTTTAAAGGACAAGCCCCACAATATCTT TAAGAGAGATGGCTCTGATGTCATTTATCCTGCCAGGATCAGCCTCCGGGAGGTAAGGTGCCAG GTGGGGCGGTGTCTGTAGAGGGCGATGGCTGCTTTTTGAAGCAGTTTAGTATTTGCTGAACCAA TGTGTTGGGGTGTGTGGTGCGTGCCAGGCTTGGGGCACAACAGTGAACAGGACTGACCAGGGCG CTGTTGTGGAGCTTTCGTGGTGGGGACGTGTAGATTTGGGGGCCAGGTCTTAGCTGCAGAGGCC TGATGGGTCTTATCTATGGCAGACGGCCTCTGGGCAAGAGCTGAGCCTCTTGAGCCAGCTTCCA TCTAATGCTGTCCTTTTGCTTTTCAAGGCTCTGTGTGGCTGCACAGTGAACGTCCCCACTCTGG ACGGCAGGACGATACCCGTCGTATTCAAAGATGTTATCAGGCCTGGCATGCGGCGAAAAGTTCC TGGAGAAGGCCTCCCCCTCCCCAAAACACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAA GTGATCTTCCCCGAAAGGATTCCCCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATAT AGCTATCTGAGCTCCCCAAGGACTGACCAGGGACCTTTCCAGAGCTCAAGGATTTCTGGACCTT TCTACCAGTTGTGGACCATGAGAGGGTGGGAGGGCCCAGGGAGGGCTTTCGTACTGCTGAATGT TTTCCAGAGCATATATTACAATCTTTCAAAGTCGCACACTAGACTTCAGTGGTTTTTCGAGCTA TAGGGCATCAGGTGGTGGGAACAGCAGGAAAAGGCATTCCAGTCTGCCCCACTGGGTCTGGCAG CCCTCCCGGGATGGGCCCACATCCACCTCCAGTCCCTGGCCAGGGGTGAGAGGCAGACCAGCAG ATGGACTTGATCCCTCTGTGTCTTTTTGCTTCTGGCTGGTAGATAATGTCAACCTGCAGTCTTG ATTCCCAGACCCTGTACACTCCTCCTTTTCTGTTGTGTGATCAGTTTGTGCTTTATTCTGTATT TGTCTCCCATGTCTTGCTCTTCTCCTGGAGAATTCTGTCTTCTCTTTGGCCATCTCAAATTGAG AACCTAAACTATTCCTGCAGAACTGCCTGGTTGGCGTCCACAAGCAATACCTCTCGTTCCAGCA GGACCAAGGGAGCCAGCCTCCAGTGAGTGACTCCAGCAAGTGCAGCCACCTCTCCCTTGATGGT CTGGGAGCCTGGCCTCAGCAAGGGGCCTTCCTGACCTCTGGCTCCAGTGAAGCTGAATGTCCTC ACTTTGTGGGTCACACTCTTTACATTTCTGTAAGGCAATCTTGGCACACGTGGGGCTTACCAGT GGCCCAGGTAATTTTTTGTTTCATGGACTATGGACTCTTTCAAAGGGATCTGATCCTTTTGAAT TTTGCACAGCCCTAGATACAATCCCTTTTGATAAAAGGGTCTTTGCTTCTGATTACAGGAGCAC TGTGGAACGTCTGTAAATATGTTTTTATAATTCCATGTATAGTTGGTGTACACTCAAAACCTGT CCCCGGCAGCCAGTGCTCTCTGTATAGGGCCATAATGGAATTCTGAAGAAATCTTGGGGAGGGA AGGGGAGTTGGAACAAATGTCTGTTCCCTGGAGGCCAGTCCAGTGCTCAGACCTTTAGACTCAT TGTAAGTTGCCACTGCCAACATGAGACCAAAGTGTGTGACTAGTCAATGAAGTGCGACAGCATT AAAGACTGATGCTAAACCTCAGGGGA Exemplary Human DNAJB1 cDNA including untranslated regions Variant 1 (SEQ ID NO: 113) GGAGCCGGGGGACGGCGACAGCGGGTCGGCGGGCCGCAGGAGGGGGTCATGGGTAAAGACTACT ACCAGACGTTGGGCCTGGCCCGCGGCGCGTCGGACGAGGAGATCAAGCGGGCCTACCGCCGCCA GGCGCTGCGCTACCACCCGGACAAGAACAAGGAGCCCGGCGCCGAGGAGAAGTTCAAGGAGATC GCTGAGGCCTACGACGTGCTCAGCGACCCGCGCAAGCGCGAGATCTTCGACCGCTACGGGGAGG AAGGCCTAAAGGGGAGTGGCCCCAGTGGCGGTAGCGGCGGTGGTGCCAATGGTACCTCTTTCAG CTACACATTCCATGGAGACCCTCATGCCATGTTTGCTGAGTTCTTCGGTGGCAGAAATCCCTTT GACACCTTTTTTGGGCAGCGGAACGGGGAGGAAGGCATGGACATTGATGACCCATTCTCTGGCT TCCCTATGGGCATGGGTGGCTTCACCAACGTGAACTTTGGCCGCTCCCGCTCTGCCCAAGAGCC CGCCCGAAAGAAGCAAGATCCCCCAGTCACCCACGACCTTCGAGTCTCCCTTGAAGAGATCTAC AGCGGCTGTACCAAGAAGATGAAAATCTCCCACAAGCGGCTAAACCCCGACGGAAAGAGCATTC GAAACGAAGACAAAATATTGACCATCGAAGTGAAGAAGGGGTGGAAAGAAGGAACCAAAATCAC TTTCCCCAAGGAAGGAGACCAGACCTCCAACAACATTCCAGCTGATATCGTCTTTGTTTTAAAG GACAAGCCCCACAATATCTTTAAGAGAGATGGCTCTGATGTCATTTATCCTGCCAGGATCAGCC TCCGGGAGGCTCTGTGTGGCTGCACAGTGAACGTCCCCACTCTGGACGGCAGGACGATACCCGT CGTATTCAAAGATGTTATCAGGCCTGGCATGCGGCGAAAAGTTCCTGGAGAAGGCCTCCCCCTC CCCAAAACACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAAGTGATCTTCCCCGAAAGGA TTCCCCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATATAGCTATCTGAGCTCCCCAA GGACTGACCAGGGACCTTTCCAGAGCTCAAGGATTTCTGGACCTTTCTACCAGTTGTGGACCAT GAGAGGGTGGGAGGGCCCAGGGAGGGCTTTCGTACTGCTGAATGTTTTCCAGAGCATATATTAC AATCTTTCAAAGTCGCACACTAGACTTCAGTGGTTTTTCGAGCTATAGGGCATCAGGTGGTGGG AACAGCAGGAAAAGGCATTCCAGTCTGCCCCACTGGGTCTGGCAGCCCTCCCGGGATGGGCCCA CATCCACCTCCAGTCCCTGGCCAGGGGTGAGAGGCAGACCAGCAGATGGACTTGATCCCTCTGT GTCTTTTTGCTTCTGGCTGGTAGATAATGTCAACCTGCAGTCTTGATTCCCAGACCCTGTACAC TCCTCCTTTTCTGTTGTGTGATCAGTTTGTGCTTTATTCTGTATTTGTCTCCCATGTCTTGCTC TTCTCCTGGAGAATTCTGTCTTCTCTTTGGCCATCTCAAATTGAGAACCTAAACTATTCCTGCA GAACTGCCTGGTTGGCGTCCACAAGCAATACCTCTCGTTCCAGCAGGACCAAGGGAGCCAGCCT CCAGTGAGTGACTCCAGCAAGTGCAGCCACCTCTCCCTTGATGGTCTGGGAGCCTGGCCTCAGC AAGGGGCCTTCCTGACCTCTGGCTCCAGTGAAGCTGAATGTCCTCACTTTGTGGGTCACACTCT TTACATTTCTGTAAGGCAATCTTGGCACACGTGGGGCTTACCAGTGGCCCAGGTAATTTTTTGT TTCATGGACTATGGACTCTTTCAAAGGGATCTGATCCTTTTGAATTTTGCACAGCCCTAGATAC AATCCCTTTTGATAAAAGGGTCTTTGCTTCTGATTACAGGAGCACTGTGGAACGTCTGTAAATA TGTTTTTATAATTCCATGTATAGTTGGTGTACACTCAAAACCTGTCCCCGGCAGCCAGTGCTCT CTGTATAGGGCCATAATGGAATTCTGAAGAAATCTTGGGGAGGGAAGGGGAGTTGGAACAAATG TCTGTTCCCTGGAGGCCAGTCCAGTGCTCAGACCTTTAGACTCATTGTAAGTTGCCACTGCCAA CATGAGACCAAAGTGTGTGAGTAGTCAATGAAGTGCGACAGCATTAAAGACTGATGCTAAACCT CA Exemplary Human DNAJB1 cDNA coding sequence Variant 1 and Variant 3 (SEQ ID NO: 156) ATGGGTAAAGACTACTACCAGACGTTGGGCCTGGCCCGCGGCGCGTCGGACGAGGAGATCAAGC GGGCCTACCGCCGCCAGGCGCTGCGCTACCACCCGGACAAGAACAAGGAGCCCGGCGCCGAGGA GAAGTTCAAGGAGATCGCTGAGGCCTACGACGTGCTCAGCGACCCGCGCAAGCGCGAGATCTTC GACCGCTACGGGGAGGAAGGCCTAAAGGGGAGTGGCCCCAGTGGCGGTAGCGGCGGTGGTGCCA ATGGTACCTCTTTCAGCTACACATTCCATGGAGACCCTCATGCCATGTTTGCTGAGTTCTTCGG TGGCAGAAATCCCTTTGACACCTTTTTTGGGCAGCGGAACGGGGAGGAAGGCATGGACATTGAT GACCCATTCTCTGGCTTCCCTATGGGCATGGGTGGCTTCACCAACGTGAACTTTGGCCGCTCCC GCTCTGCCCAAGAGCCCGCCCGAAAGAAGCAAGATCCCCCAGTCACCCACGACCTTCGAGTCTC CCTTGAAGAGATCTACAGCGGCTGTACCAAGAAGATGAAAATCTCCCACAAGCGGCTAAACCCC GACGGAAAGAGCATTCGAAACGAAGACAAAATATTGACCATCGAAGTGAAGAAGGGGTGGAAAG AAGGAACCAAAATCACTTTCCCCAAGGAAGGAGACCAGACCTCCAACAACATTCCAGCTGATAT CGTCTTTGTTTTAAAGGACAAGCCCCACAATATCTTTAAGAGAGATGGCTCTGATGTCATTTAT CCTGCCAGGATCAGCCTCCGGGAGGCTCTGTGTGGCTGCACAGTGAACGTCCCCACTCTGGACG GCAGGACGATACCCGTCGTATTCAAAGATGTTATCAGGCCTGGCATGCGGCGAAAAGTTCCTGG AGAAGGCCTCCCCCTCCCCAAAACACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAAGTG ATCTTCCCCGAAAGGATTCCCCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATATAG Exemplary Human DNAJB1 cDNA including untranslated regions Variant 2 (SEQ ID NO: 157) GTGCATGGGGGTGGGGGAGCCGGGGGTGACGACGGGGACGGCGCGGCGGAGCCCGCTGCGGACC CGGGCTCACCTGGGCTCGGCCGCCGGGGTCCGCGGGGCGGCGCCTCCGGCTCAGCTGCGGGGCG AGGGGTTGTGAATGCAGGAGCCGACCCCGTTCGTGGGCTTGGGGGCTGGGTTGGGATAATTCCC GGGAAGTGATGACCTGGCCCCGGCAGGGCACGCAGGAGGCAGCGGCCGCCCAGGTCCGGAGAGC GGGGCCGCCTCGGAGGGCCTAAAGGGGAGTGGCCCCAGTGGCGGTAGCGGCGGTGGTGCCAATG GTACCTCTTTCAGCTACACATTCCATGGAGACCCTCATGCCATGTTTGCTGAGTTCTTCGGTGG CAGAAATCCCTTTGACACCTTTTTTGGGCAGCGGAACGGGGAGGAAGGCATGGACATTGATGAC CCATTCTCTGGCTTCCCTATGGGCATGGGTGGCTTCACCAACGTGAACTTTGGCCGCTCCCGCT CTGCCCAAGAGCCCGCCCGAAAGAAGCAAGATCCCCCAGTCACCCACGACCTTCGAGTCTCCCT TGAAGAGATCTACAGCGGCTGTACCAAGAAGATGAAAATCTCCCACAAGCGGCTAAACCCCGAC GGAAAGAGCATTCGAAACGAAGACAAAATATTGACCATCGAAGTGAAGAAGGGGTGGAAAGAAG GAACCAAAATCACTTTCCCCAAGGAAGGAGACCAGACCTCCAACAACATTCGAGCTGATATCGT CTTTGTTTTAAAGGACAAGCCCCACAATATCTTTAAGAGAGATGGCTCTGATGTCATTTATCCT GCCAGGATCAGCCTCCGGGAGGCTCTGTGTGGCTGCACAGTGAACGTCCCCACTCTGGACGGCA GGACGATACCCGTCGTATTCAAAGATGTTATCAGGCCTGGCATGCGGCGAAAAGTTCCTGGAGA AGGCCTCCCCCTCCCCAAAACACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAAGTGATC TTCCCCGAAAGGATTCCCCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATATAGCTAT CTGAGCTCCCCAAGGACTGACCAGGGACCTTTCCAGAGCTCAAGGATTTCTGGACCTTTCTACC AGTTGTGGACCATGAGAGGGTGGGAGGGCCCAGGGAGGGCTTTCGTACTGCTGAATGTTTTCCA GAGCATATATTACAATCTTTCAAAGTCGCACACTAGACTTCAGTGGTTTTTCGAGCTATAGGGC ATCAGGTGGTGGGAACAGCAGGAAAAGGCATTCCAGTCTGCCCCACTGGGTCTGGCAGCCCTCC CGGGATGGGCCCACATCCACCTCCAGTCCCTGGCCAGGGGTGAGAGGCAGACCAGCAGATGGAC TTGATCCCTCTGTGTCTTTTTGCTTCTGGCTGGTAGATAATGTCAACCTGCAGTCTTGATTCCC AGACCCTGTACACTCCTCCTTTTCTGTTGTGTGATCAGTTTGTGCTTTATTCTGTATTTGTCTC CCATGTCTTGCTCTTCTCCTGGAGAATTCTGTCTTCTCTTTGGCCATCTCAAATTGAGAACCTA AACTATTCCTGCAGAACTGCCTGGTTGGCGTCCACAAGCAATACCTCTCGTTCCAGCAGGACCA AGGGAGCCAGCCTCCAGTGAGTGACTCCAGCAAGTGCAGCCACCTCTCCCTTGATGGTCTGGGA GCCTGGCCTCAGCAAGGGGCCTTCCTGACCTCTGGCTCCAGTGAAGCTGAATGTCCTCACTTTG TGGGTCACACTCTTTACATTTCTGTAAGGCAATCTTGGCACACGTGGGGCTTACCAGTGGCCCA GGTAATTTTTTGTTTCATGGACTATGGACTCTTTCAAAGGGATCTGATCCTTTTGAATTTTGCA CAGCCCTAGATACAATCCCTTTTGATAAAAGGGTCTTTGCTTCTGATTACAGGAGCACTGTGGA ACGTCTGTAAATATGTTTTTATAATTCCATGTATAGTTGGTGTACACTCAAAACCTGTCCCCGG CAGCCAGTGCTCTCTGTATAGGGCCATAATGGAATTCTGAAGAAATCTTGGGGAGGGAAGGGGA GTTGGAACAAATGTCTGTTCCCTGGAGGCCAGTCCAGTGCTCAGACCTTTAGACTCATTGTAAG TTGCCACTGCCAACATGAGACCAAAGTGTGTGACTAGTCAATGAAGTGCGACAGCATTAAAGAC TGATGCTAAACCTCAGGGGAAAAAAAAAAA Exemplary Human DNAJB1 cDNA coding sequence Variant 2 (SEQ ID NO: 158) ATGTTTGCTGAGTTCTTCGGTGGCAGAAATCCCTTTGACACCTTTTTTGGGCAGCGGAACGGGG AGGAAGGCATGGACATTGATGACCCATTCTCTGGCTTCCCTATGGGCATGGGTGGCTTCACCAA CGTGAACTTTGGCCGCTCCCGCTCTGCCCAAGAGCCCGCCCGAAAGAAGCAAGATCCCCCAGTC ACCCACGACCTTCGAGTCTCCCTTGAAGAGATCTACAGCGGCTGTACCAAGAAGATGAAAATCT CCCACAAGCGGCTAAACCCCGACGGAAAGAGCATTCGAAACGAAGACAAAATATTGACCATCGA AGTGAAGAAGGGGTGGAAAGAAGGAACCAAAATCACTTTCCCCAAGGAAGGAGACCAGACCTCC AACAACATTCCAGCTGATATCGTCTTTGTTTTAAAGGACAAGCCCCACAATATCTTTAAGAGAG ATGGCTCTGATGTCATTTATCCTGCCAGGATCAGCCTCCGGGAGGCTCTGTGTGGCTGCACAGT GAACGTCCCCACTCTGGACGGCAGGACGATACCCGTCGTATTCAAAGATGTTATCAGGCCTGGC ATGCGGCGAAAAGTTCCTGGAGAAGGCCTCCCCCTCCCCAAAACACCCGAGAAACGTGGGGACC TCATTATTGAGTTTGAAGTGATCTTCCCCGAAAGGATTCCCCAGACATCAAGAACCGTACTTGA GCAGGTTCTTCCAATATAG Exemplary Human DNAJB1 cDNA including untranslated regions Variant 3 (SEQ ID NO: 159) GTGGAAACCAAGGTAAGCGACGGTTAGGCCAGACGCGGGGGCGGGGTAAGAAGTTGAGTGACAG GCAAGGCAGCATCCACATGACAGGCGGCGCCGAAGGGGTAAATTCTGAGATCCTGCCCCTGAGC TTTCATGAGCTGTTGAACCATCTGGAATTCACAGGCCTGTCATGAGAGACACGATGAGAAGTCC TTAAAGGCCTAAAGGGGAGTGGCCCCAGTGGCGGTAGCGGCGGTGGTGCCAATGGTACCTCTTT CAGCTACACATTCCATGGAGACCCTCATGCCATGTTTGCTGAGTTCTTCGGTGGCAGAAATCCC TTTGACACCTTTTTTGGGCAGCGGAACGGGGAGGAAGGCATGGACATTGATGACCCATTCTCTG GCTTCCCTATGGGCATGGGTGGCTTCACCAACGTGAACTTTGGCCGCTCCCGCTCTGCCCAAGA GCCCGCCCGAAAGAAGCAAGATCCCCCAGTCACCCACGACCTTCGAGTCTCCCTTGAAGAGATC TACAGCGGCTGTACCAAGAAGATGAAAATCTCCCACAAGCGGCTAAACCCCGACGGAAAGAGCA TTCGAAACGAAGACAAAATATTGACCATCGAAGTGAAGAAGGGGTGGAAAGAAGGAACCAAAAT CACTTTCCCCAAGGAAGGAGACCAGACCTCCAACAACATTCCAGCTGATATCGTCTTTGTTTTA AAGGACAAGCCCCACAATATCTTTAAGAGAGATGGCTCTGATGTCATTTATCCTGCCAGGATCA GCCTCCGGGAGGCTCTGTGTGGCTGCACAGTGAACGTCCCCACTCTGGACGGCAGGACGATACC CGTCGTATTCAAAGATGTTATCAGGCCTGGCATGCGGCGAAAAGTTCCTGGAGAAGGCCTCCCC CTCCCCAAAACACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAAGTGATCTTCCCCGAAA GGATTCCCCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATATAGCTATCTGAGCTCCC CAAGGACTGACCAGGGACCTTTCCAGAGCTCAAGGATTTCTGGACCTTTCTACCAGTTGTGGAC CATGAGAGGGTGGGAGGGCCCAGGGAGGGCTTTCGTACTGCTGAATGTTTTCCAGAGCATATAT TACAATCTTTCAAAGTCGCACACTAGACTTCAGTGGTTTTTCGAGCTATAGGGCATCAGGTGGT GGGAACAGCAGGAAAAGGCATTCCAGTCTGCCCCACTGGGTCTGGCAGCCCTCCCGGGATGGGC CCACATCCACCTCCAGTCCCTGGCCAGGGGTGAGAGGCAGACCAGCAGATGGACTTGATCCCTC TGTGTCTTTTTGCTTCTGGCTGGTAGATAATGTCAACCTGCAGTCTTGATTCCCAGACCCTGTA CACTCCTCCTTTTCTGTTGTGTGATCAGTTTGTGCTTTATTCTGTATTTGTCTCCCATGTCTTG CTCTTCTCCTGGAGAATTCTGTCTTCTCTTTGGCCATCTCAAATTGAGAACCTAAACTATTCCT GCAGAACTGCCTGGTTGGCGTCCACAAGCAATACCTCTCGTTCCAGCAGGACCAAGGGAGCCAG CCTCCAGTGAGTGACTCCAGCAAGTGCAGCCACCTCTCCCTTGATGGTCTGGGAGCCTGGCCTC AGCAAGGGGCCTTCCTGACCTCTGGCTCCAGTGAAGCTGAATGTCCTCACTTTGTGGGTCACAC TCTTTACATTTCTGTAAGGCAATCTTGGCACACGTGGGGCTTACCAGTGGCCCAGGTAATTTTT TGTTTCATGGACTATGGACTCTTTCAAAGGGATCTGATCCTTTTGAATTTTGCACAGCCCTAGA TACAATCCCTTTTGATAAAAGGGTCTTTGCTTCTGATTACAGGAGCACTGTGGAACGTCTGTAA ATATGTTTTTATAATTCCATGTATAGTTGGTGTACACTCAAAACCTGTCCCCGGCAGCCAGTGC TCTCTGTATAGGGCCATAATGGAATTCTGAAGAAATCTTGGGGAGGGAAGGGGAGTTGGAACAA ATGTCTGTTCCCTGGAGGCCAGTCCAGTGCTCAGACCTTTAGACTCATTGTAAGTTGCCACTGC CAACATGAGACCAAAGTGTGTGACTAGTCAATGAAGTGCGACAGCATTAAAGACTGATGCTAAA CCTCAGGGGA Polypeptides Encoded by DNAJB1 Gene Exemplary Human Mature DNAJB1 Protein Isoform 1 and Isoform 3 (SEQ ID NO: 160) MGKDYYQTLGLARGASDEEIKRAYRRQALRYHPDKNKEPGAEEKFKEIAEAYDVLSDPRKREIF DRYGEEGLKGSGPSGGSGGGANGTSFSYTFHGDPHAMFAEFFGGRNPFDTFFGQRNGEEGMDID DPFSGFPMGMGGFTNVNFGRSRSAQEPARKKQDPPVTHDLRVSLEEIYSGCTKKMKISHKRLNP DGKSIRNEDKILTIEVKKGWKEGTKITFPKEGDQTSNNIPADIVFVLKDKPHNIFKRDGSDVIY PARISLREALCGCTVNVPTLDGRTIPVVFKDVIRPGMRRKVPGEGLPLPKTPEKRGDLIIEFEV IFPERIPQTSRTVLEQVLPI Exemplary Human Mature DNAJB1 Protein Isoform 2 (SEQ ID NO: 161) MFAEFFGGRNPFDTFFGQRNGEEGMDIDDPFSGFPMGMGGFTNVNFGRSRSAQEPARKKQDPPV THDLRVSLEEIYSGCTKKMKISHKRLNPDGKSIRNEDKILTIEVKKGWKEGTKITFPKEGDQTS NNIPADIVFVLKDKPHNIFKRDGSDVIYPARISLREALCGCTVNVPTLDGRTIPVVFKDVIRPG MRRKVPGEGLPLPKTPEKRGDLIIEFEVIFPERIPQTSRTVLEQVLPI
Heat Shock Protein 40 (Hsp40) (DnaJ Homolog Subfamily B Member 5) (DNAJB5)
[0292] In some embodiments, an Hsp40 protein is encoded by a DNAJB5 gene. The human DNAJB5 gene is located on chromosome Chr19:14, 514, 770-14,529,770. It contains 7 exons (NCBI Accession No. NC_000019.10). DNAJB5 encodes a 40 kDa heat shock protein.
[0293] As used herein, the term “active DNAJB5 protein” means a protein encoded by DNA that, if substituted for both wildtype alleles encoding full-length DNAJB5 protein in auditory hair cells, or ocular cells, of what is otherwise a wildtype mammal, and if expressed in the auditory hair cells, or ocular cells, of that mammal, results in that mammal's having a level of hearing, or vision, approximating the normal level of hearing, or vision, of a similar mammal that is entirely wildtype. Non-limiting examples of active DNAJB5 proteins are full-length DNAJB5 proteins (e.g., any of the full-length DNAJB5 proteins described herein).
[0294] For example, an active DNAJB5 protein can include a sequence of a wildtype, full-length DNAJB5 protein (e.g., a wildtype, human, full-length DNAJB5 protein) including about 1 to about 200 amino acid substitutions (e.g., about 1 to 190 amino acid substitutions, about 1 to about 180 amino acid substitutions, about 1 to about 160 amino acid substitutions, about 1 to about 150 amino acid substitutions, about 1 to about 140 amino acid substitutions, about 1 to about 130 amino acid substitutions, about 1 to about 120 amino acid substitutions, about 1 to about 110 amino acid substitutions, about 1 to about 100 amino acid substitutions, about 1 to about 90 amino acid substitutions, about 1 to about 80 amino acid substitutions, about 1 to about 70 amino acid substitutions, about 1 to about 60 amino acid substitutions, about 1 to about 50 amino acid substitutions, about 1 to about 40 amino acid substitutions, about 1 to about 30 amino acid substitutions, about 1 to about 25 amino acid substitutions, about 1 to about 20 amino acid substitutions, about 1 to about 10 amino acid substitutions, about 1 to about 5 amino acid substitutions, about 10 to about 200 amino acid substitutions, about 10 to about 180 amino acid substitutions, about 10 to about 160 amino acid substitutions, about 10 to about 150 amino acid substitutions, about 10 to about 140 amino acid substitutions, about 10 to about 120 amino acid substitutions, about 10 to about 100 amino acid substitutions, about 10 to about 80 amino acid substitutions, about 10 to about 60 amino acid substitutions, about 10 to about 50 amino acid substitutions, about 10 to about 40 amino acid substitutions, about 10 to about 20 amino acid substitutions, about 20 to about 200 amino acid substitutions, about 20 to about 180 amino acid substitutions, about 20 to about 160 amino acid substitutions, about 20 to about 150 amino acid substitutions, about 20 to about 140 amino acid substitutions, about 20 to about 120 amino acid substitutions, about 20 to about 100 amino acid substitutions, about 20 to about 80 amino acid substitutions, about 20 to about 60 amino acid substitutions, about 20 to about 50 amino acid substitutions, about 20 to about 40 amino acid substitutions, about 40 to about 200 amino acid substitutions, about 40 to about 180 amino acid substitutions, about 40 to about 160 amino acid substitutions, about 40 to about 150 amino acid substitutions, about 40 to about 140 amino acid substitutions, about 40 to about 120 amino acid substitutions, about 40 to about 100 amino acid substitutions, about 40 to about 80 amino acid substitutions, about 40 to about 60 amino acid substitutions, about 40 to about 50 amino acid substitutions, about 50 to about 200 amino acid substitutions, about 50 to about 180 amino acid substitutions, about 50 to about 160 amino acid substitutions, about 50 to about 150 amino acid substitutions, about 50 to about 140 amino acid substitutions, about 50 to about 120 amino acid substitutions, about 50 to about 100 amino acid substitutions, about 50 to about 80 amino acid substitutions, about 50 to about 60 amino acid substitutions, about 60 to about 200 amino acid substitutions, about 60 to about 180 amino acid substitutions, about 60 to about 160 amino acid substitutions, about 60 to about 150 amino acid substitutions, about 60 to about 140 amino acid substitutions, about 60 to about 120 amino acid substitutions, about 60 to about 100 amino acid substitutions, about 60 to about 80 amino acid substitutions, about 80 to about 200 amino acid substitutions, about 80 to about 180 amino acid substitutions, about 80 to about 160 amino acid substitutions, about 80 to about 150 amino acid substitutions, about 80 to about 140 amino acid substitutions, about 80 to about 120 amino acid substitutions, about 80 to about 100 amino acid substitutions, about 100 to about 200 amino acid substitutions, about 100 to about 180 amino acid substitutions, about 100 to about 160 amino acid substitutions, about 100 to about 150 amino acid substitutions, about 100 to about 140 amino acid substitutions, about 100 to about 120 amino acid substitutions, about 120 to about 200 amino acid substitutions, about 120 to about 180 amino acid substitutions, about 120 to about 160 amino acid substitutions, about 120 to about 150 amino acid substitutions, about 120 to about 140 amino acid substitutions, about 140 to about 200 amino acid substitutions, about 140 to about 180 amino acid substitutions, about 140 to about 160 amino acid substitutions, about 140 to about 150 amino acid substitutions, about 150 to about 200 amino acid substitutions, about 150 to about 180 amino acid substitutions, about 150 to about 160 amino acid substitutions, about 160 to about 200 amino acid substitutions, about 160 to about 180 amino acid substitutions, or about 180 to about 200 amino acid substitutions).
[0295] One skilled in the art would appreciate that amino acids that are not conserved between wildtype DNAJB5 proteins from different species can be mutated without losing activity, while those amino acids that are conserved between wildtype DNAJB5 proteins from different species should not be mutated as they are more likely (than amino acids that are not conserved between different species) to be involved in activity.
[0296] An active DNAJB5 protein can include, e.g., a sequence of a wildtype, full-length DNAJB5 protein (e.g., a wildtype, human, full-length DNAJB5 protein) that has about 1 to about 100 amino acids (e.g., about 1 to about 95 amino acids, about 1 to about 90 amino acids, about 1 to about 85 amino acids, about 1 to about 80 amino acids, about 1 to about 75 amino acids, about 1 to about 70 amino acids, about 1 to about 65 amino acids, about 1 to about 60 amino acids, about 1 to about 55 amino acids, about 1 to about 50 amino acids, about 1 to about 45 amino acids, about 1 to about 40 amino acids, about 1 to about 35 amino acids, about 1 to about 30 amino acids, about 1 to about 25 amino acids, about 1 to about 20 amino acids, about 1 to about 15 amino acids, about 1 to about 10 amino acids, about 1 to about 5 amino acids, about 5 to about 100 amino acids, about 5 to about 95 amino acids, about 5 to about 90 amino acids, about 5 to about 85 amino acids, about 5 to about 80 amino acids, about 5 to about 75 amino acids, about 5 to about 70 amino acids, about 5 to about 65 amino acids, about 5 to about 60 amino acids, about 5 to about 55 amino acids, about 5 to about 50 amino acids, about 5 to about 45 amino acids, about 5 to about 40 amino acids, about 5 to about 35 amino acids, about 5 to about 30 amino acids, about 5 to about 25 amino acids, about 5 to about 20 amino acids, about 5 to about 15 amino acids, about 5 to about 10 amino acids, about 10 to about 100 amino acids, about 10 to about 95 amino acids, about 10 to about 90 amino acids, about 10 to about 85 amino acids, about 10 to about 80 amino acids, about 10 to about 80 amino acids, about 10 to about 75 amino acids, about 10 to about 70 amino acids, about 10 to about 65 amino acids, about 10 to about 60 amino acids, about 10 to about 55 amino acids, about 10 to about 50 amino acids, about to about 45 amino acids, about 10 to about 40 amino acids, about 10 to about 35 amino acids, about 10 to about 30 amino acids, about 10 to about 25 amino acids, about 10 to about 20 amino acids, about 10 to about 15 amino acids, about 15 to about 100 amino acids, about 15 to about 95 amino acids, about 15 to about 90 amino acids, about 15 to about 85 amino acids, about 15 to about 80 amino acids, about 15 to about 75 amino acids, about 15 to about 70 amino acids, about to about 65 amino acids, about 15 to about 60 amino acids, about 15 to about 55 amino acids, about 15 to about 50 amino acids, about 15 to about 45 amino acids, about 15 to about 40 amino acids, about 15 to about 35 amino acids, about 15 to about 30 amino acids, about 15 to about 25 amino acids, about 15 to about 20 amino acids, about 20 to about 100 amino acids, about 20 to about 95 amino acids, about 20 to about 90 amino acids, about 20 to about 85 amino acids, about to about 80 amino acids, about 20 to about 75 amino acids, about 20 to about 70 amino acids, about 20 to about 65 amino acids, about 20 to about 60 amino acids, about 20 to about 55 amino acids, about 20 to about 50 amino acids, about 20 to about 45 amino acids, about 20 to about 40 amino acids, about 20 to about 35 amino acids, about 20 to about 30 amino acids, about 20 to about 25 amino acids, about 25 to about 100 amino acids, about 25 to about 95 amino acids, about 25 to about 90 amino acids, about 25 to about 85 amino acids, about 25 to about 80 amino acids, about 25 to about 75 amino acids, about 25 to about 70 amino acids, about 25 to about 65 amino acids, about 25 to about 60 amino acids, about 25 to about 55 amino acids, about 25 to about 50 amino acids, about 25 to about 45 amino acids, about 25 to about 40 amino acids, about 25 to about 35 amino acids, about 25 to about 30 amino acids, about 30 to about 100 amino acids, about 30 to about 95 amino acids, about 30 to about 90 amino acids, about 30 to about 85 amino acids, about 30 to about 80 amino acids, about 30 to about 75 amino acids, about 30 to about 70 amino acids, about 30 to about 65 amino acids, about 30 to about 60 amino acids, about 30 to about 55 amino acids, about 30 to about 50 amino acids, about 30 to about 45 amino acids, about 30 to about 40 amino acids, about 30 to about 35 amino acids, about 35 to about 100 amino acids, about 35 to about 95 amino acids, about 35 to about 90 amino acids, about 35 to about 85 amino acids, about 35 to about 80 amino acids, about 35 to about 75 amino acids, about 35 to about 70 amino acids, about 35 to about 65 amino acids, about 35 to about 60 amino acids, about 35 to about 55 amino acids, about 35 to about 50 amino acids, about 35 to about 45 amino acids, about 35 to about 40 amino acids, about 40 to about 100 amino acids, about 40 to about 95 amino acids, about 40 to about 90 amino acids, about 40 to about 85 amino acids, about 40 to about 80 amino acids, about 40 to about 75 amino acids, about 40 to about 70 amino acids, about 40 to about 65 amino acids, about 40 to about 60 amino acids, about 40 to about 55 amino acids, about 40 to about 50 amino acids, about 40 to about 45 amino acids, about 45 to about 100 amino acids, about 45 to about 95 amino acids, about 45 to about 90 amino acids, about 45 to about 85 amino acids, about 45 to about 80 amino acids, about 45 to about 75 amino acids, about 45 to about 70 amino acids, about 45 to about 65 amino acids, about 45 to about 60 amino acids, about 45 to about 55 amino acids, about 45 to about 50 amino acids, about 50 to about 100 amino acids, about 50 to about 95 amino acids, about 50 to about 90 amino acids, about 50 to about 85 amino acids, about 50 to about 80 amino acids, about 50 to about 75 amino acids, about 50 to about 70 amino acids, about 50 to about 65 amino acids, about 50 to about 60 amino acids, about 50 to about 55 amino acids, about 55 to about 100 amino acids, about 55 to about 95 amino acids, about 55 to about 90 amino acids, about 55 to about 85 amino acids, about 55 to about 80 amino acids, about 55 to about 75 amino acids, about 55 to about 70 amino acids, about 55 to about 65 amino acids, about 55 to about 60 amino acids, about 60 to about 100 amino acids, about 60 to about 95 amino acids, about 60 to about 90 amino acids, about 60 to about 85 amino acids, about 60 to about 80 amino acids, about 60 to about 75 amino acids, about 60 to about 70 amino acids, about 60 to about 65 amino acids, about 65 to about 100 amino acids, about 65 to about 95 amino acids, about 65 to about 90 amino acids, about 65 to about 85 amino acids, about 65 to about 80 amino acids, about 65 to about 75 amino acids, about 65 to about 70 amino acids, about 70 to about 100 amino acids, about 70 to about 95 amino acids, about 70 to about 90 amino acids, about 70 to about 85 amino acids, about 70 to about 80 amino acids, about 70 to about 75 amino acids, about 75 to about 100 amino acids, about 75 to about 95 amino acids, about 75 to about 90 amino acids, about 75 to about 85 amino acids, about 75 to about 80 amino acids, about 80 to about 100 amino acids, about 80 to about 95 amino acids, about 80 to about 90 amino acids, about 80 to about 85 amino acids, about 85 to about 100 amino acids, about 85 to about 95 amino acids, about 85 to about 90 amino acids, about 90 to about 100 amino acids, about 90 to about 95 amino acids, or about 95 to about 100 amino acids), removed from its N-terminus and/or 1 amino acid to 80 amino acids (or any of the subranges of this range described herein) removed from its C-terminus.
[0297] In some embodiments, an active DNAJB5 protein can, e.g., include the sequence of a wildtype, full-length DNAJB5 protein where 1 amino acid to 50 amino acids, 1 amino acid to 45 amino acids, 1 amino acid to 40 amino acids, 1 amino acid to 35 amino acids, 1 amino acid to 30 amino acids, 1 amino acid to 25 amino acids, 1 amino acid to 20 amino acids, 1 amino acid to 15 amino acids, 1 amino acid to 10 amino acids, 1 amino acid to 9 amino acids, 1 amino acid to 8 amino acids, 1 amino acid to 7 amino acids, 1 amino acid to 6 amino acids, 1 amino acid to 5 amino acids, 1 amino acid to 4 amino acids, 1 amino acid to 3 amino acids, about 2 amino acids to 50 amino acids, about 2 amino acids to 45 amino acids, about 2 amino acids to 40 amino acids, about 2 amino acids to 35 amino acids, about 2 amino acids to 30 amino acids, about 2 amino acids to 25 amino acids, about 2 amino acids to 20 amino acids, about 2 amino acids to 15 amino acids, about 2 amino acids to 10 amino acids, about 2 amino acids to 9 amino acids, about 2 amino acids to 8 amino acids, about 2 amino acids to 7 amino acids, about 2 amino acids to 6 amino acids, about 2 amino acids to 5 amino acids, about 2 amino acids to 4 amino acids, about 3 amino acids to 50 amino acids, about 3 amino acids to 45 amino acids, about 3 amino acids to 40 amino acids, about 3 amino acids to 35 amino acids, about 3 amino acids to 30 amino acids, about 3 amino acids to 25 amino acids, about 3 amino acids to 20 amino acids, about 3 amino acids to 15 amino acids, about 3 amino acids to 10 amino acids, about 3 amino acids to 9 amino acids, about 3 amino acids to 8 amino acids, about 3 amino acids to 7 amino acids, about 3 amino acids to 6 amino acids, about 3 amino acids to 5 amino acids, about 4 amino acids to 50 amino acids, about 4 amino acids to 45 amino acids, about 4 amino acids to 40 amino acids, about 4 amino acids to 35 amino acids, about 4 amino acids to 30 amino acids, about 4 amino acids to 25 amino acids, about 4 amino acids to 20 amino acids, about 4 amino acids to 15 amino acids, about 4 amino acids to 10 amino acids, about 4 amino acids to 9 amino acids, about 4 amino acids to 8 amino acids, about 4 amino acids to 7 amino acids, about 4 amino acids to 6 amino acids, about 5 amino acids to 50 amino acids, about 5 amino acids to 45 amino acids, about 5 amino acids to 40 amino acids, about 5 amino acids to 35 amino acids, about 5 amino acids to 30 amino acids, about 5 amino acids to 25 amino acids, about 5 amino acids to 20 amino acids, about 5 amino acids to 15 amino acids, about 5 amino acids to 10 amino acids, about 5 amino acids to 9 amino acids, about 5 amino acids to 8 amino acids, about 5 amino acids to 7 amino acids, about 6 amino acids to 50 amino acids, about 6 amino acids to 45 amino acids, about 6 amino acids to 40 amino acids, about 6 amino acids to 35 amino acids, about 6 amino acids to 30 amino acids, about 6 amino acids to 25 amino acids, about 6 amino acids to 20 amino acids, about 6 amino acids to 15 amino acids, about 6 amino acids to 10 amino acids, about 6 amino acids to 9 amino acids, about 6 amino acids to 8 amino acids, about 7 amino acids to 50 amino acids, about 7 amino acids to 45 amino acids, about 7 amino acids to 40 amino acids, about 7 amino acids to 35 amino acids, about 7 amino acids to 30 amino acids, about 7 amino acids to 25 amino acids, about 7 amino acids to 20 amino acids, about 7 amino acids to 15 amino acids, about 7 amino acids to 10 amino acids, about 7 amino acids to 9 amino acids, about 8 amino acids to 50 amino acids, about 8 amino acids to 45 amino acids, about 8 amino acids to 40 amino acids, about 8 amino acids to 35 amino acids, about 8 amino acids to 30 amino acids, about 8 amino acids to 25 amino acids, about 8 amino acids to 20 amino acids, about 8 amino acids to 15 amino acids, about 8 amino acids to 10 amino acids, about 10 amino acids to 50 amino acids, about 10 amino acids to 45 amino acids, about 10 amino acids to 40 amino acids, about 10 amino acids to 35 amino acids, about 10 amino acids to 30 amino acids, about 10 amino acids to 25 amino acids, about 10 amino acids to 20 amino acids, about 10 amino acids to 15 amino acids, about 15 amino acids to 50 amino acids, about 15 amino acids to 45 amino acids, about 15 amino acids to 40 amino acids, about 15 amino acids to 35 amino acids, about 15 amino acids to 30 amino acids, about 15 amino acids to 25 amino acids, about 15 amino acids to 20 amino acids, about 20 amino acids to 50 amino acids, about 20 amino acids to 45 amino acids, about 20 amino acids to 40 amino acids, about 20 amino acids to 35 amino acids, about 20 amino acids to 30 amino acids, about 20 amino acids to 25 amino acids, about 25 amino acids to 50 amino acids, about 25 amino acids to 45 amino acids, about 25 amino acids to 40 amino acids, about 25 amino acids to 35 amino acids, about 25 amino acids to 30 amino acids, about 30 amino acids to 50 amino acids, about 30 amino acids to 45 amino acids, about 30 amino acids to 40 amino acids, about 30 amino acids to 35 amino acids, about 35 amino acids to 50 amino acids, about 35 amino acids to 45 amino acids, about 35 amino acids to 40 amino acids, about 40 amino acids to 50 amino acids, about 40 amino acids to 45 amino acids, or about 45 amino acids to about 50 amino acids, are inserted. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) can be inserted as a contiguous sequence into the sequence of a wildtype, full-length DNAJB5 protein. In some examples, the 1 amino acid to 50 amino acids (or any subrange thereof) are inserted in multiple, non-contiguous places in the sequence of a wildtype, full-length DNAJB5 protein. As can be appreciated in the art, the 1 amino acid to 50 amino acids can be inserted into a portion of the sequence of a wildtype, full-length DNAJB5 protein that is not well-conserved between species.
[0298] Methods of detecting mutations in a gene are well-known in the art. Non-limiting examples of such techniques include: real-time polymerase chain reaction (RT-PCR), PCR, sequencing, Southern blotting, and Northern blotting.
[0299] Exemplary wildtype DNAJB5 protein sequences are or include SEQ ID NO: 38, 41 43, 44, and 45. Exemplary DNA sequences that encode a NDP protein and exemplary polypeptides encoded by an NDP gene are shown below.
DNAJB5 Polynucleotides
[0300] Among other things, the present disclosure provides polynucleotides, e.g., polynucleotides comprising an DnaJ homolog subfamily B member 5 (DNAJB5) gene or characteristic portion thereof, as well as compositions including such polynucleotides and methods utilizing such polynucleotides and/or compositions.
[0301] In some embodiments, a polynucleotide comprising an DNAJB5 gene or characteristic portion thereof can be DNA or RNA. In some embodiments, DNA can be genomic DNA or cDNA. In some embodiments, RNA can be an mRNA. In some embodiments, a polynucleotide comprises exons and/or introns of an DNAJB5 gene.
[0302] In some embodiments, a gene product is expressed from a polynucleotide comprising an DNAJB5 gene or characteristic portion thereof. In some embodiments, expression of such a polynucleotide can utilize one or more control elements (e.g., promoters, enhancers, splice sites, poly-adenylation sites, translation initiation sites, etc.). Thus, in some embodiments, a polynucleotide provided herein can include one or more control elements.
[0303] In some embodiments, an DNAJB5 gene is a mammalian DNAJB5 gene. In some embodiments, an DNAJB5 gene is a murine DNAJB5 gene. In some embodiments, an DNAJB5 gene is a primate DNAJB5 gene. In some embodiments, a DNAJB5 gene is a human DNAJB5 gene. An exemplary human DNAJB5 cDNA sequence is or includes the sequence of SEQ ID NO: 165 or 168 or 171. An exemplary human DNAJB5 genomic DNA sequence can be found in SEQ ID NO: 162. An exemplary human DNAJB5 cDNA sequence including untranslated regions is or includes the sequence of SEQ ID NO: 163, 164, 166, 167, 169, or 170.
TABLE-US-00010 Exemplary Human DNAJB5 Genomic Sequence (SEQ ID NO: 162) CTGAGCTGAGTGACAGGAAGACGGTTCAATGGGCGGCGCGGAGGCGGAGCCGTGGGGGCGGGCT CCCGGTCCCGCTATCGGCGGCCGGCGGGCAGGCGACTCCTGTCCCGGGTGGAGGCGGCGGAGCC GGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACGGCGGCGCCCGCAGCTCCTCACCGG TGAGGGCGCCAAGCCAGGACTCGGGGGTCCCGGGAGCGGGGCGTCGTGAAGCCAGGGCTCCTGG TGTTGGGGGGTTGGGACACTGAGGGCCCGTAGGCTCTGCTGCCTTGGGGATGCGGGGCCCCGGG GTCTGCAGGGTGCTCGGAGTCCTAGACCAGGGCTTGGCTGTCACAGGGGGCAGCCAGCGTCTGA GTCGGGGAGGGGAGGGGAGGGGAGGGTCGAGTCGGACCGGACCAGATTGGGTTCTGTGGGGCGG AGGCATCTGTGAGCAGACCAGCCAGCCAGCGCGGGTGACATCACCGACCACCCTCCCCCGCCCG AGCCCCCTCCCCTCCTCTCCTCGCCGCGCCTTTTGTCCCGGCCGAGCTCCGCTCTGCCCCGCCC ATCTGCGAGGGAGGAGACTCCCGTCAGTGACTTCATTGAGTAGGTTCTTGGGGATTGGGGTCGG GTCCTCCCCAGTGAGAGGCGACAGGAGCTCACTGCCTCTCGGGCCCTCACAATCACACCTGTCA CAGGTATACACGCTGCCACTCACAAACACACGCTTGCACTCCCAGCCTCACTGACACGCTGCCA TACAGCCGCTCACAGCCAGCCAGACACTGCTGCACCTGAGAGCAGGTGGCCGCGGGTCTTCTCC CTTCACTCTCCACATTTGGGGATCAGGTCCGGCACCTGCTGCGCCAGACAGGCCTTGCTGGGCA CGCGCCTCTGAGAGTCCAAGGAGGTGGCTTCCAGAGCTGCAGCACTTCAGGCCGGCTCCGGTGG AGCGATCAGAGGTGAGGGGCTTGGCTGGGCATGTTTAAGCGCACAGTGCTCTCCTGCCCACCCC CAGCAGCACCCCCACTGCAGGCCCGAGGAGCTTTCCGGAGCTTCCCACACTCCTGGGGAGAAGA CTTCTTAGCCAGCTTGATGTTTAAAATTCAGCTGGAGCCCTTAAAACTTCGAGCGTGGACGCTG AATGGGTTTGTAAAGTTTCGGTAAGTCCCTCCGGAGAAGGGCACTCACACCCATACCCAGTCAA ACCCTCCACGCGGCCATCCCCTCCATTGCCTTCCTGCTTCCTCCAACTCATCCTCATCCCCTCT GTGAGATACAGGAACCCCCCTCCGGCCTCACGGAGATATTTGAATGAATTGCACCCCTTATCAT TCCTTTTCTCAAACCCTTCAGGTATCCATTTTCCACAGACATTTCCTCCATTATTTCAACTCTC CACATTCCTCCCTTTCAACCCATCATTAACCCATTTTCCACCTCAGCCATTTCCATCTGTCACT CATCCCCCTCCATCAGTACCTCATTTTCTGTATCAGCCATCTCCCCTCCCTGTACCTCAACAGT CCCCCTCGCCATTTTCTCCCCATTTTTCCTTCCTGCCTCTACCTTAACATCCTCCTCATTCTTC TCCATCAAAGGCAGTGCCATGCCATGGGGCTCCTTTGTCATTCCAACACGTGGCCTGTGGGTAG GGGGGAAGGGAAGGAGGGAGGAAAGCAGAAAGAGAAAAAGGAGGCAGATCCTGAAAGAGCTTGC CCTGTGTCTGCGGAGGCAAGAGATGTATGGGCCTAGAGCAGCTAGCAGGTGGGTACCAGGGAGG AAGCTGGCCGGGTGTGGAGGTAATGCAGGGACACCAAAAGGAGGTGGGTGGTTGGTGAGCCGTT CCACTGGCCCTCTGCCTGGAAGCTGCTCTCGTTCTCTAGGGGCACCCTCTTGGTGTGCTAGGGT TCTTTCCCAGTCCCAGGGGAAAGAAGGTGTGGGTTGAGGCAGAAATGGCAGACCACCCCCCCCC GCTCCCTCTCAGACGGCTTTTGCTGCCATTTCCGAAAACGGTTGCCCCCCCCCCCAACGAGGTG CTCTTTCTTCTCTGCCTCTCCCAACCCCACCCCCCACCCCATCTCCCCCACAGACGTGACATGC TGGTGGGGGGCAATGGGGGAGTACCCGCCTTTGTGGAGCCTTTTCTTCGTCAGAGAAAGGTGAC TCCCAGAACCATGAGTCAACCCAGCCACCTGCAGCAGGGCCTGTGGGGGCAGCAGTGAGGGCCA TTGTCCCTGCCCACCTAGGGCCTTGGGCGCCTCCCCCTTACCCGGTTACAGGCCTCACTCCTGG AAGGACACCCTGTACACCTCCCTGCTGTGAAGCAGTCAGAGGCAGAAAGGCAGGTCTCAGCTGT GGCTGCCAGCCTAGGACATTGCTAGAAAGACTCTGGTTCTCCTGGCTTTGGCAGGGGTTGGCCA AGTCTCAGAAGGGCTGAGAATGGCTTCTGGCCATCCCCCAGAGAGCTCTAGCTACAGGAAAGTC CCCCTTCCCCTGATATCCACCACAGACACACCGATAGAGGGAGACTGGAATGGAAGAATTCGGG TTGTACGTTTCTCTTATAACAAATGCTCCTGCACCCACCTTAAAAGGGACTCAGAGGAATATGG GTCTGATGATGGGCCGGATCCTAAACCACAGGGTGGACTGGGGGCGTAGGGTGAGGCTAGGAGA GGTATTAAGGAAATGGTGGTTGGTTAGTGCCCAATTGTCTCCTAACAGCTGGGGTTGAAAAAGT CCTGTCCCCAGGATGCTTTTGTTCACTCACACTGGTGCTTAGGCGGCACAGCCCTCACAGTTTT GTGTGTAGACGCTGGCAAATGAGTAAAGGGAACTGTTGCCCTGGTTGGGGGAAGGATAGTGTCG CCCCCCCTGCTAATGAGGGAGTTGGAAACAGCAGTTTCCAAAGGCTGTGAGTCACCCTGTGATC TGAGTCACACGTGAGGCTTAGCATCGTCAGAGGGGTGGGGGCGGGGAGGCCTGGGGTCTCCGAT GGCAGACAGGACTCCAGACACCCATTCCCTGACTCAGTCTGGCTGGCCAGGACAGCTGGCGGAA GACCCAGGCCTGTTCCCGGTCCTGGGCCACTGCCCCAGCTGCCCTCAGCCTGGGCACCATGCCT GAGGCGTGGCGTAGGAGACCACGATGCTCAGGTGACCTCACCCGTTACACTGACCTTCGCCAGC TCCTGTGTGTGGGTCGCTCAGTCCATGGTGCCACCTCGCTCAGCAGGGCCTCTCAGGGGTCCTC AGTCTGGCCCAAGGACCAGGGCCTCAGCCTAGCCCCAAGAGTGGGCCACAACCTTCAGCTTCCT GCTGGGAAAACCCAGGAGGTGAGGACGGAATCACAGAATTCTAGAATGTCAGAGCCAGAAGAGA TCATCTAGTCCAAACCTTTCATTTTACAGATGGGGGGACGCAGGCCCACAGAACAGGCATGACC TGCTGAAGGTTACCCAGGGAGTCATTTAGGGATTGCAAGATCATCAGGAACAGGGCTGGGGCAG AAGAGAAGGCATCAAGTCTTGGCCCTGGGTTTCTTTCAGAAACAAGGAGACCAGTGCTGGTCCA GTGGCTGTGATGGGAAAAGATTATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGATG AGATCAAGAAAGCCTACCGGAAGATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCAA CGCTGAGGAGAAGTTTAAGGAGATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGG GGCCTGTATGACCAGTATGGGGAGGAAGGTAAGAGGGCAGCACTCCAGCCCAATCCCGGACCCC TCCGCTTGGTAGGGGTCCCAGGTGCACCAGTTTGTGTAGCGGGGAACTGGAGGCTGTGGAGGGG GTGAGGGCAGCAAGGGTTTCCAGCACTGTCACCCAGAGAAGAGAGAAGCCCACCCACTACCCAG CTCAGCTCACCCTAGTTCTCCCCGCCTCTCTCTGCCCCCTCCCTCCTCGGGTTCAGGCCTTAGA GAGGTCTGAAGCAAGAGCCAGAGGCCTTCCCCGCCCTTCTTTCCTTCCCAGCCTTATTAGTCCT GCCTGAAGTCTCCGCTGCTTCATTGGCTGATGAGGAAGGTTGGGGAGAGGGAGTGTGTGGCTCT CTGAGGACAGAGCATTTACATTCATCAGCATAAAAGTCTGGCCCTTAAGCAGCCTGGGAATTAA CTACCAGAGCTCTGAGCAGAAATGGTTTTCCCCTTCTCTCCCCCGGCCCTGCCTGCTTCTGGCC AGCAGAACAGAGGTGGAGCTTTGCCTCCCAGAGGAAGTGGCAGGGATCCAAGGGTGGACGGGGA ACTGCTGCTGGCATCTTCCCCCTTCCCTGGGGTAGCTGGTGGCCATGGGGTGGGAGGCAGGAAA GGCCCAAATAAAGCCTCTGAGAGGGAGGCCCCTCCCAGCACATCCCACGGCCATGATGTAGCTG CTAATTCTGGGCCTGCCCCTCAGGGCTGCCTGCCACCCGCCAGCCACAGGCACCTGTCAGGCTA TATTAAGAGACATTGTCACGCTGAGGACTCTCCTTCCTGGGAAGAACTGTCCTTCCTCCCCAAC AGAATCCACCTTTCTTTCACATTCTCCTCCTCTTCACCCTCCCCTTTCTATCCTCAAAACTAAA TGGGCTAATCATATGTAGATAAGAAGGTTATTTTCTTCCCCTCCTCCTGGCCTTGTGTTCCATG CCCCTCTGTGAACCCTCATCCTCCCAACTCCCTTTTCCTATTCTCCATGCTGCCGGTTCCTTCC ATTCTTCCTGCCTACCTCCTAGACGTGCCAACTCCCAGATTCACCAGATACCGGCTGGAACCCC CAGTATTCTTTCAGCTAAGACATTTTTCTCTCTGGCATGGAGGGATGGGAGGGATAGGAGTGGA AAAGAGGGAAGCAGTGCAGACACCATGCCACCATGTGACCTGTCTTGGAATTGCACCCATTCCT CCATTCAGCCTCACCTTTGTTTAGAAAGAAGGCAGGAGCTGGGAGAGCTCCCTTGAGATACTTC ACCCCTTCTGCCTAAGTGAAGGGGTCCAGACACCAGGTTCTGATGCAGAACCCCATCCCAGGAA GGCCCCTGAGGATGCGGCATGGCCTTAAGGGGACCTCCATGGCCTGGGGAGGGAGAAGCAGCTT AGAAATGTTGGTTATATTTCTGTCCACCCTGCCCAATCCCCAGCAGCCATTTATTATAGAGCAG TTATACACACAGACTCTGTGTGCAGCTCCAGCTAGGGACAAGTTGCTCCCAGCTTGAAAAGGGG GTGGTAGTGGAAGAATAGAAATCCCTATAGAGAGAATTACCCCTTCCCATCATCATGAGTCGAG CCTCCCTCCCTCTCTCACTGAGCTCTGCCAGGCTGCTTGAACTCGCTTGATAATCAGCCCCTCG ATAGAAGGAAAGGAGAGATCCTAGGTCTTGGGGGAGGGGTGAAGAGGCAGCTGCCACAAGTCCC TGAACCGTCTGCCACTTGGCCTCTCTAGTGAAACTCAAATCTGCCCTTGTGGTCCCAGGAAATA AGAGGCTAATTTAGGCTCAGATCTCCATGATCAGTTGAAATTCTAGATAAATGGCCTCAAAGAG AAGGGTCTCTTTCCTAGCCTGGACCCCACCCATAGTCTCTGTATTCCGGAGCAGTTCTCAGTGT GAACCGTCTCCCCACATCCCCCACTACTGCCAGCACCCCCTCCTGTGCATGAAGGCTGCCTCCA GGAATCTGAGCCCCTTAGACAGGACCTGACAGAGAGAGCAGCTAAGGTCACCACTGCCATGTGC CAGCCCATTCATGCATTTTGGTTGCTCAACATTAGGTGTGGGGTATGGCGTTTGCTGTTGTATA ACTCCAGGGAGCTCCATTCCCATTGTAGTCTATGTCTGTACAATGTGTGAATGACACTCCCTAA AGTTGTGTAACACCGAGCAGAGTGTGGGAACAAAGCATTTAAAAACTTTTTCTAGAAAAATAGC TCTTCATCTCCAAGCTAATGAGCTTTCCCACTTTTTAAGGCTTCCTGATTTCCCGTTTGAGCAG GGGCCAGGGAAAGAGGAAACACTGCTAGAAGCAGAGTGAAAGTGGTCTTCCTCCCGGGTTGGAT CCCTAGCCTATTTGCGACTGTTCACTTACCAGCCCCCTCTGTGGTCACATCCTGTCTGATGGCC CCTTGTCCTATATGTCTTTGGAGATAAGCAAAGATTTGGGAACCAAGTATCACTTGTAATGTGA CACCAGGCTAGTCATTCAAGGCTTTTTGAGACATCTTTGGACCATGAGTATGTGGAAAAGATTA CAGGACTGGAAGTCACATCCTCTAAGTCCTCTAGTCCTGGCTTTTCCACCTATTGGGTAAGCAA GCCTCTTTGAATCTCAAGTTGTGCATCGTTATTAAATATATAATTAGGAATGGAAGATAATCTT GCCAGAAGCCTTTCTTACTCTATTAGCCTGGCCCTCTCTGTGGGATTTAAAGGTTGCTTCTTTC TTTAAGCCTGTGAACTGGGAGCTAGAGGGCAGGGGGAAGACACTGGGATTGGGGCCAGAATAAG CCACACTGCCCCTAACTTCTCCTCCCCTCTAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGC TCCAGTGGCTCCTTTCACTACACCTTTCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTG GTGGCTCCAACCCCTTCGATATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTT TGACCCAGATGACATGGATGTGGATGAAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGC TTCAATGGGCTGAGTAGGGGTCCAAGGCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGC AGGACCCCCCAGTGGTGCACGAGCTGCGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAA GCGCATGAAGATCACAAGGCGTCGCCTCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAG ATCCTGCACATAGTCATCAAGCGTGGCTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAG GCGACGCCACACCTGACAACATCCCTGCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGC ACACTTCCGCCGAGATGGCACCAACGTGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGTGGGG CCTAGTCAGGCTGTGTGTGTGTGCTGGGGAGATGGTGGGGACATTCCCTCTCTTCCCGCCAGCT GGCACATTCTTTCCCCACCTCAGTCTATTTCCTTCCTTCCCACTCACCTTACCCCACCTTTTCC TCACTTTCTGCTTCGTCTTTCCCAGGCGCTGTGTGGCTGCACTGTGAACATTCCCACTATCGAC GGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGACTCCGTG GGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTGTTGAGTTCAAAGT TCGCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCACCTACCCTGTTCC TAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGCAATACCCCCAACACTCACTCCACTCAA TGTGCACCCAGCTTGATGTCCACTGGACACTGGCAACTTTTTCTAAAATGCAAAAAAAAGCCAC TGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTTTAGAGCTGGGCTGCCTGGGGGGAGTGG GAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCAATGTTCATGTCACTAGCTTTCAATCCA GTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCTACTTGAAGATATGTAGAGATTCCTTAT CCATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCTTTGCTAATACCAGTCCCCTCCCTTCCC TTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAAAGGACATTGCTTTCTCAGCCCCACCCC CAGGTCCACAGTGTCCAAGGTAATGGCACACATATTCATGCACAAGAAGCACTCACCTAGAGCT GTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGAAGGGAAAGTTCACAACCTGTGAACAGG GACTTGAGCACGAGACACTTTTCATCTGGAGCAGGGGGGTGAGCTCCTCAGCAGCCTCCGTAAC AGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTGTCCTCCCTGCTGCTCTCAGGAGTGTTC AAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAAGGCTAAGAGGTGGGGGACAGGGCCCAT CTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGGTCAGGTGGCAGCTATTCCCCACCAAGA GATTCAGGGTCACAGGTTTTTCCCCACACCTCTGAACTCAGGGCCTAGCCACCCCCAAACTTCC AAATCCCACTTTTGTGATATGTGAAGCTACTCATTTCTTACCCTTGGAGGCTGTGTGGGGAATT TCCAGCCCTTTTATGCCTGTGTGATCCCACCCCACCCCCATAGTTGTATAAAGGTCATAGTGAA GAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGGTGGGGTCTTTAGGTTTTCTCACTTTGA CAACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATTTTTTTGTAATGACAGTATTATGTAAAA AATAAAGTATTTTAAAAATATGGCA Exemplary Human DNAJB5 cDNA including untranslated regions Variant 1 (SEQ ID NO: 163) AGTGAGAGGCGACAGGAGCTCACTGCCTCTCGGGCCCTCACAATCACACCTGTCACAGGTATAC ACGCTGCCACTCACAAACACACGCTTGCACTCCCAGCCTCACTGACACGCTGCCATACAGCCGC TCACAGCCAGCCAGACACTGCTGCACCTGAGAGCAGGTGGCCGCGGGTCTTCTCCCTTCACTCT CCACATTTGGGGATCAGGTCCGGCACCTGCTGCGCCAGACAGGCCTTGCTGGGCACGCGCCTCT GAGAGTCCAAGGAGGTGGCTTCCAGAGCTGCAGCACTTCAGGCCGGCTCCGGTGGAGCGATCAG AGGTGAGGGGCTTGGCTGGGCATGTTTAAGCGCACAGTGCTCTCCTGCCCACCCCCAGCAGCAC CCCCACTGCAGGCCCGAGGAGCTTTCCGGAGCTTCCCACACTCCTGGGGAGAAGACTTCTTAGC CAGCTTGATGTTTAAAATTCAGCTGGAGCCCTTAAAACTTCGAGCGTGGACGCTGAATGGGTTT GTAAAGTTTCGAAACAAGGAGACCAGTGCTGGTCCAGTGGCTGTGATGGGAAAAGATTATTACA AGATTCTTGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGAAAGCCTACCGGAAGATGGC CTTGAAGTACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGAGAAGTTTAAGGAGATTGCA GAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTATGACCAGTATGGGGAGGAAG GCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCTTTCACTACACCTTTCATGG GGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCCCTTCGATATCTTCTTTGCC AGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGACATGGATGTGGATGAAGATG AGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGAGTAGGGGTCCAAGGCGAGC CCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGTGGTGCACGAGCTGCGGGTG TCCCTGGAGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATCACAAGGCGTCGCCTCAACC CTGATGGGCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAGTCATCAAGCGTGGCTGGAA GGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACCTGACAACATCCCTGCTGAC ATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGAGATGGCACCAACGTGCTCT ACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTGTGAACATTCCCACTATCGA CGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGACTCCGT GGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTGTTGAGTTCAAAG TTCGCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCACCTACCCTGTTC CTAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGCAATACCCCCAACACTCACTCCACTCA ATGTGCACCCAGCTTGATGTCCACTGGACACTGGCAACTTTTTCTAAAATGCAAAAAAAAGCCA CTGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTTTAGAGCTGGGCTGCCTGGGGGGAGTG GGAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCAATGTTCATGTCACTAGCTTTCAATCC AGTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCTACTTGAAGATATGTAGAGATTCCTTA TCCATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCTTTGCTAATACCAGTCCCCTCCCTTCC CTTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAAAGGACATTGCTTTCTCAGCCCCACCC CCAGGTCCACAGTGTCCAAGGTAATGGCACACATATTCATGCACAAGAAGCACTCACCTAGAGC TGTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGAAGGGAAAGTTCACAACCTGTGAACAG GGACTTGAGCACGAGACACTTTTCATCTGGAGCAGGGGGGTGAGCTCCTCAGCAGCCTCCGTAA CAGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTGTCCTCCCTGCTGCTCTCAGGAGTGTT CAAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAAGGCTAAGAGGTGGGGGACAGGGCCCA TCTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGGTCAGGTGGCAGCTATTCCCCACCAAG AGATTCAGGGTCACAGGTTTTTCCCCACACCTCTGAACTCAGGGCCTAGCCACCCCCAAACTTC CAAATCCCACTTTTGTGATATGTGAAGCTACTCATTTCTTACCCTTGGAGGCTGTGTGGGGAAT TTCCAGCCCTTTTATGCCTGTGTGATCCCACCCCACCCCCATAGTTGTATAAAGGTCATAGTGA AGAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGGTGGGGTCTTTAGGTTTTCTCACTTTG ACAACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATTTTTTTGTAATGACAGTATTATGTAAA AAATAAAGTATTTTAAAAATATGGCATCTGAGCAGGAGCAACAAACCTGGGATGGGGGTGGCTG AGGAGGGCCACTGTCATCCTCCCTCCCGGGCTCTGGTCACCTTTGAGAAGCCCAAGCAGGCCCT CAGTATAAGCTGAAGCTGACCTCTGCCTTCCTCGAAGCCTCCTGGGATTTCTAAAACTTATACT TCAAACACAGCACAGACAGAAAGTACCACTCAGCTATTAGAAGAAACATCTATTTGGGAAGGAA AAATATCCCTGCTCATGAGAGACAGAGACCATTGTCTTCAGATGTGCTAGCATGAAAACAGATT TTCTCTTCTTGTCTAAATTTTCTCTGAGTCAGACAAATTTTCCTTCTAGGAGAAAATTTTTTTC TAGGAGGGGGTGGAGAACTTTTTTTTTTTTTTTTTTTTTTTGACACGGAGTCTTGCTCTGTCGC CCAGGCTGGAGTGCAGTGATGCGATCTTGGTTCACTGCAGCCTCT Exemplary Human DNAJB5 cDNA including untranslated regions Variant 4 (SEQ ID NO: 164) GTCCCGGGTGGAGGCGGCGGAGCCGGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACG GCGGCGCCCGCAGCTCCTCACCGGTCCGGCACCTGCTGCGCCAGACAGGCCTTGCTGGGCACGC GCCTCTGAGAGTCCAAGGAGGTGGCTTCCAGAGCTGCAGCACTTCAGGCCGGCTCCGGTGGAGC GATCAGAGGTGAGGGGCTTGGCTGGGCATGTTTAAGCGCACAGTGCTCTCCTGCCCACCCCCAG CAGCACCCCCACTGCAGGCCCGAGGAGCTTTCCGGAGCTTCCCACACTCCTGGGGAGAAGACTT CTTAGCCAGCTTGATGTTTAAAATTCAGCTGGAGCCCTTAAAACTTCGAGCGTGGACGCTGAAT GGGTTTGTAAAGTTTCGAAACAAGGAGACCAGTGCTGGTCCAGTGGCTGTGATGGGAAAAGATT ATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGAAAGCCTACCGGAA GATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGAGAAGTTTAAGGAG ATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTATGACCAGTATGGGG AGGAAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCTTTCACTACACCTT TCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCCCTTCGATATCTTC TTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGACATGGATGTGGATG AAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGAGTAGGGGTCCAAG GCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGTGGTGCACGAGCTG CGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATCACAAGGCGTCGCC TCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAGTCATCAAGCGTGG CTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACCTGACAACATCCCT GCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGAGATGGCACCAACG TGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTGTGAACATTCCCAC TATCGACGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGA CTCCGTGGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTGTTGAGT TCAAAGTTCGCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCACCTACC CTGTTCCTAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGCAATACCCCCAACACTCACTC CACTCAATGTGCACCCAGCTTGATGTCCACTGGACACTGGCAACTTTTTCTAAAATGCAAAAAA AAGCCACTGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTTTAGAGCTGGGCTGCCTGGGG GGAGTGGGAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCAATGTTCATGTCACTAGCTTT CAATCCAGTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCTACTTGAAGATATGTAGAGAT TCCTTATCCATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCTTTGCTAATACCAGTCCCCTC CCTTCCCTTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAAAGGACATTGCTTTCTCAGCC CCACCCCCAGGTCCACAGTGTCCAAGGTAATGGCACACATATTCATGCACAAGAAGCACTCACC TAGAGCTGTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGAAGGGAAAGTTCACAACCTGT GAACAGGGACTTGAGCACGAGACACTTTTCATCTGGAGCAGGGGGGTGAGCTCCTCAGCAGCCT CCGTAACAGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTGTCCTCCCTGCTGCTCTCAGG AGTGTTCAAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAAGGCTAAGAGGTGGGGGACAG GGCCCATCTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGGTCAGGTGGCAGCTATTCCCC ACCAAGAGATTCAGGGTCACAGGTTTTTCCCCACACCTCTGAACTCAGGGCCTAGCCACCCCCA AACTTCCAAATCCCACTTTTGTGATATGTGAAGCTACTCATTTCTTACCCTTGGAGGCTGTGTG GGGAATTTCCAGCCCTTTTATGCCTGTGTGATCCCACCCCACCCCCATAGTTGTATAAAGGTCA TAGTGAAGAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGGTGGGGTCTTTAGGTTTTCTC ACTTTGACAACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATTTTTTTGTAATGACAGTATTA TGTAAAAAATAAAGTATTTTAAAAATATGGCATCTGAGCAGGAGCAACAAACCTGGGATGGGGG TGGCTGAGGAGGGCCACTGTCATCCTCCCTCCCGGGCTCTGGTCACCTTTGAGAAGCCCAAGCA GGCCCTCAGTATAAGCTGAAGCTGACCTCTGCCTTCCTCGAAGCCTCCTGGGATTTCTAAAACT TATACTTCAAACACAGCACAGACAGAAAGTACCACTCAGCTATTAGAAGAAACATCTATTTGGG AAGGAAAAATATCCCTGCTCATGAGAGACAGAGACCATTGTCTTCAGATGTGCTAGCATGAAAA CAGATTTTCTCTTCTTGTCTAAATTTTCTCTGAGTCAGACAAATTTTCCTTCTAGGAGAAAATT TTTTTCTAGGAGGGGGTGGAGAACTTTTTTTTTTTTTTTTTTTTTTTGACACGGAGTCTTGCTC TGTCGCCCAGGCTGGAGTGCAGTGATGCGATCTTGGTTCACTGCAGCCTCT Exemplary Human DNAJB5 cDNA coding sequence Variant 1 and Variant 4 (SEQ ID NO: 165) ATGTTTAAGCGCACAGTGCTCTCCTGCCCACCCCCAGCAGCACCCCCACTGCAGGCCCGAGGAG CTTTCCGGAGCTTCCCACACTCCTGGGGAGAAGACTTCTTAGCCAGCTTGATGTTTAAAATTCA GCTGGAGCCCTTAAAACTTCGAGCGTGGACGCTGAATGGGTTTGTAAAGTTTCGAAACAAGGAG ACCAGTGCTGGTCCAGTGGCTGTGATGGGAAAAGATTATTACAAGATTCTTGGGATCCCATCGG GGGCCAACGAGGATGAGATCAAGAAAGCCTACCGGAAGATGGCCTTGAAGTACCACCCAGACAA GAATAAAGAACCCAACGCTGAGGAGAAGTTTAAGGAGATTGCAGAGGCCTATGATGTGCTAAGT GACCCCAAGAAACGGGGCCTGTATGACCAGTATGGGGAGGAAGGCCTGAAGACCGGCGGTGGCA CATCAGGTGGCTCCAGTGGCTCCTTTCACTACACCTTTCATGGGGACCCCCATGCCACCTTTGC CTCCTTCTTTGGTGGCTCCAACCCCTTCGATATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCC TTCAGTGGCTTTGACCCAGATGACATGGATGTGGATGAAGATGAGGACCCATTTGGCGCTTTCG GCCGTTTTGGCTTCAATGGGCTGAGTAGGGGTCCAAGGCGAGCCCCAGAACCACTGTACCCTCG GCGCAAGGTGCAGGACCCCCCAGTGGTGCACGAGCTGCGGGTGTCCCTGGAGGAGATCTACCAT GGCTCCACCAAGCGCATGAAGATCACAAGGCGTCGCCTCAACCCTGATGGGCGAACTGTGCGCA CCGAGGACAAGATCCTGCACATAGTCATCAAGCGTGGCTGGAAGGAAGGCACCAAGATCACCTT CCCCAAAGAAGGCGACGCCACACCTGACAACATCCCTGCTGACATCGTCTTTGTGCTCAAAGAC AAGCCCCATGCACACTTCCGCCGAGATGGCACCAACGTGCTCTACAGTGCCCTGATCAGCCTCA AGGAGGCGCTGTGTGGCTGCACTGTGAACATTCCCACTATCGACGGCCGAGTGATCCCTTTGCC CTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGACTCCGTGGGGAGGGCCTTCCCTTCCCC AAAGTGCCAACTGAGCGAGGAGACCTCATTGTTGAGTTCAAAGTTCGCTTCCCAGACAGATTAA CACCACAGACAAGACAGATCCTTAAGCAGCACCTACCCTGTTCCTAG Exemplary Human DNAJB5 cDNA including untranslated regions Variant 2 (SEQ ID NO: 166) GTCCCGGGTGGAGGCGGCGGAGCCGGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACG GCGGCGCCCGCAGCTCCTCACCGCAGCACCCCCACTGCAGGCCCGAGGAGCTTTCCGGAGCTTC CCACACTCCTGGGGAGAAGACTTCTTAGCCAGCTTGATGTTTAAAATTCAGCTGGAGCCCTTAA AACTTCGAGCGTGGACGCTGAATGGGTTTGTAAAGTTTCGAAACAAGGAGACCAGTGCTGGTCC AGTGGCTGTGATGGGAAAAGATTATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGAT GAGATCAAGAAAGCCTACCGGAAGATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCA ACGCTGAGGAGAAGTTTAAGGAGATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACG GGGCCTGTATGACCAGTATGGGGAGGAAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCC AGTGGCTCCTTTCACTACACCTTTCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTG GCTCCAACCCCTTCGATATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGA CCCAGATGACATGGATGTGGATGAAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTC AATGGGCTGAGTAGGGGTCCAAGGCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGG ACCCCCCAGTGGTGCACGAGCTGCGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAAGCG CATGAAGATCACAAGGCGTCGCCTCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAGATC CTGCACATAGTCATCAAGCGTGGCTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCG ACGCCACACCTGACAACATCCCTGCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACA CTTCCGCCGAGATGGCACCAACGTGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGT GGCTGCACTGTGAACATTCCCACTATCGACGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCA TCAAGCCAGGCACCGTGAAGAGACTCCGTGGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCA GCGAGGAGACCTCATTGTTGAGTTCAAAGTTCGCTTCCCAGACAGATTAACACCACAGACAAGA CAGATCCTTAAGCAGCACCTACCCTGTTCCTAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCAC AGCAATACCCCCAACACTCACTCCACTCAATGTGCACCCAGCTTGATGTCCACTGGACACTGGC AACTTTTTCTAAAATGCAAAAAAAAGCCACTGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCC TTTTAGAGCTGGGCTGCCTGGGGGGAGTGGGAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGG TCAATGTTCATGTCACTAGCTTTCAATCCAGTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGT GCTACTTGAAGATATGTAGAGATTCCTTATCCATGCCTGTACATAGCATGTCCTCCTCCCCTCA GCTTTGCTAATACCAGTCCCCTCCCTTCCCTTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTA GAAAGGACATTGCTTTCTCAGCCCCACCCCCAGGTCCACAGTGTCCAAGGTAATGGCACACATA TTCATGCACAAGAAGCACTCACCTAGAGCTGTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATA AGAAGGGAAAGTTCACAACCTGTGAACAGGGACTTGAGCACGAGACACTTTTCATCTGGAGCAG GGGGGTGAGCTCCTCAGCAGCCTCCGTAACAGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTT CTGTCCTCCCTGCTGCTCTCAGGAGTGTTCAAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTAT TAAGGCTAAGAGGTGGGGGACAGGGCCCATCTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTG GGGTCAGGTGGCAGCTATTCCCCACCAAGAGATTCAGGGTCACAGGTTTTTCCCCACACCTCTG AACTCAGGGCCTAGCCACCCCCAAACTTCCAAATCCCACTTTTGTGATATGTGAAGCTACTCAT TTCTTACCCTTGGAGGCTGTGTGGGGAATTTCCAGCCCTTTTATGCCTGTGTGATCCCACCCCA CCCCCATAGTTGTATAAAGGTCATAGTGAAGAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTG GGGTGGGGTCTTTAGGTTTTCTCACTTTGACAACCCCCGAATGTTTTTATAGTAGTTTTTTTGT ATTTTTTTGTAATGAGAGTATTATGTAAAAAATAAAGTATTTTAAAAATA Exemplary Human DNAJB5 cDNA including untranslated regions Variant 6 (SEQ ID NO: 167) GTCCCGGGTGGAGGCGGCGGAGCCGGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACG GCGGCGCCCGCAGCTCCTCACCGCACCCCCACTGCAGGCCCGAGGAGCTTTCCGGAGCTTCCCA CACTCCTGGGGAGAAGACTTCTTAGCCAGCTTGATGTTTAAAATTCAGCTGGAGCCCTTAAAAC TTCGAGCGTGGACGCTGAATGGGTTTGTAAAGTTTCGAAACAAGGAGACCAGTGCTGGTCCAGT GGCTGTGATGGGAAAAGATTATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGATGAG ATCAAGAAAGCCTACCGGAAGATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCAACG CTGAGGAGAAGTTTAAGGAGATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGGGG CCTGTATGACCAGTATGGGGAGGAAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGT GGCTCCTTTCACTACACCTTTCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCT CCAACCCCTTCGATATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCC AGATGACATGGATGTGGATGAAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAAT GGGCTGAGTAGGGGTCCAAGGCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGGACC CCCCAGTGGTGCACGAGCTGCGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAAGCGCAT GAAGATCACAAGGCGTCGCCTCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAGATCCTG CACATAGTCATCAAGCGTGGCTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCGACG CCACACCTGACAACATCCCTGCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACACTT CCGCCGAGATGGCACCAACGTGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGC TGCACTGTGAACATTCCCACTATCGACGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCA AGCCAGGCACCGTGAAGAGACTCCGTGGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCG AGGAGACCTCATTGTTGAGTTCAAAGTTCGCTTCCCAGACAGATTAACACCACAGACAAGACAG ATCCTTAAGCAGCACCTACCCTGTTCCTAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGC AATACCCCCAACACTCACTCCACTCAATGTGCACCCAGCTTGATGTCCACTGGACACTGGCAAC TTTTTCTAAAATGCAAAAAAAAGCCACTGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTT TAGAGCTGGGCTGCCTGGGGGGAGTGGGAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCA ATGTTCATGTCACTAGCTTTCAATCCAGTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCT ACTTGAAGATATGTAGAGATTCCTTATCCATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCT TTGCTAATACCAGTCCCCTCCCTTCCCTTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAA AGGACATTGCTTTCTCAGCCCCACCCCCAGGTCCACAGTGTCCAAGGTAATGGCACACATATTC ATGCACAAGAAGCACTCACCTAGAGCTGTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGA AGGGAAAGTTCACAACCTGTGAACAGGGACTTGAGCACGAGACACTTTTCATCTGGAGCAGGGG GGTGAGCTCCTCAGCAGCCTCCGTAACAGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTG TCCTCCCTGCTGCTCTCAGGAGTGTTCAAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAA GGCTAAGAGGTGGGGGACAGGGCCCATCTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGG TCAGGTGGCAGCTATTCCCCACCAAGAGATTCAGGGTCACAGGTTTTTCCCCACACCTCTGAAC TCAGGGCCTAGCCACCCCCAAACTTCCAAATCCCACTTTTGTGATATGTGAAGCTACTCATTTC TTACCCTTGGAGGCTGTGTGGGGAATTTCCAGCCCTTTTATGCCTGTGTGATCCCACCCCACCC CCATAGTTGTATAAAGGTCATAGTGAAGAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGG TGGGGTCTTTAGGTTTTCTCACTTTGACAACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATT TTTTTGTAATGAGAGTATTATGTAAAAAATAAAGTATTTTAAAAATA Exemplary Human DNAJB5 cDNA coding sequence Variant 2 and Variant 6 (SEQ ID NO: 168) ATGTTTAAAATTCAGCTGGAGCCCTTAAAACTTCGAGCGTGGACGCTGAATGGGTTTGTAAAGT TTCGAAACAAGGAGACCAGTGCTGGTCCAGTGGCTGTGATGGGAAAAGATTATTACAAGATTCT TGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGAAAGCCTACCGGAAGATGGCCTTGAAG TACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGAGAAGTTTAAGGAGATTGCAGAGGCCT ATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTATGACCAGTATGGGGAGGAAGGCCTGAA GACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCTTTCACTACACCTTTCATGGGGACCCC CATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCCCTTCGATATCTTCTTTGCCAGCAGCC GCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGACATGGATGTGGATGAAGATGAGGACCC ATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGAGTAGGGGTCCAAGGCGAGCCCCAGAA CCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGTGGTGCACGAGCTGCGGGTGTCCCTGG AGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATCACAAGGCGTCGCCTCAACCCTGATGG GCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAGTCATCAAGCGTGGCTGGAAGGAAGGC ACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACCTGACAACATCCCTGCTGACATCGTCT TTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGAGATGGCACCAACGTGCTCTACAGTGC CCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTGTGAACATTCCCACTATCGACGGCCGA GTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGACTCCGTGGGGAGG GCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTGTTGAGTTCAAAGTTCGCTT CCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCACCTACCCTGTTCCTAG Exemplary Human DNAJB5 cDNA including untranslated regions Variant 3 (SEQ ID NO: 169) GTCCCGGGTGGAGGCGGCGGAGCCGGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACG GCGGCGCCCGCAGCTCCTCACCGAAACAAGGAGACCAGTGCTGGTCCAGTGGCTGTGATGGGAA AAGATTATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGAAAGCCTA CCGGAAGATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGAGAAGTTT AAGGAGATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTATGACCAGT ATGGGGAGGAAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCTTTCACTA CACCTTTCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCCCTTCGAT ATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGACATGGATG TGGATGAAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGAGTAGGGG TCCAAGGCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGTGGTGCAC GAGCTGCGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATCACAAGGC GTCGCCTCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAGTCATCAA GCGTGGCTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACCTGACAAC ATCCCTGCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGAGATGGCA CCAACGTGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTGTGAACAT TCCCACTATCGACGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTG AAGAGACTCCGTGGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTG TTGAGTTCAAAGTTCGCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCA CCTACCCTGTTCCTAGGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGCAATACCCCCAACAC TCACTCCACTCAATGTGCACCCAGCTTGATGTCCACTGGACACTGGCAACTTTTTCTAAAATGC AAAAAAAAGCCACTGGTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTTTAGAGCTGGGCTGC CTGGGGGGAGTGGGAGGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCAATGTTCATGTCACT AGCTTTCAATCCAGTTTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCTACTTGAAGATATGT AGAGATTCCTTATCCATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCTTTGCTAATACCAGT CCCCTCCCTTCCCTTTGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAAAGGACATTGCTTTC TCAGCCCCACCCCCAGGTCCACAGTGTCCAAGGTAATGGCACACATATTCATGCACAAGAAGCA CTCACCTAGAGCTGTCTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGAAGGGAAAGTTCACA ACCTGTGAACAGGGACTTGAGCACGAGACACTTTTCATCTGGAGCAGGGGGGTGAGCTCCTCAG CAGCCTCCGTAACAGCTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTGTCCTCCCTGCTGCT CTCAGGAGTGTTCAAGGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAAGGCTAAGAGGTGGG GGACAGGGCCCATCTACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGGTCAGGTGGCAGCTA TTCCCCACCAAGAGATTCAGGGTCACAGGTTTTTCCCCACACCTCTGAACTCAGGGCCTAGCCA CCCCCAAACTTCCAAATCCCACTTTTGTGATATGTGAAGCTACTCATTTCTTACCCTTGGAGGC TGTGTGGGGAATTTCCAGCCCTTTTATGCCTGTGTGATCCCACCCCACCCCCATAGTTGTATAA AGGTCATAGTGAAGAAGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGGTGGGGTCTTTAGGT TTTCTCACTTTGACAACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATTTTTTTGTAATGACA GTATTATGTAAAAAATAAAGTATTTTAAAAATATGGCATCTGAGCAGGAGCAACAAACCTGGGA TGGGGGTGGCTGAGGAGGGCCACTGTCATCCTCCCTCCCGGGCTCTGGTCACCTTTGAGAAGCC CAAGCAGGCCCTCAGTATAAGCTGAAGCTGACCTCTGCCTTCCTCGAAGCCTCCTGGGATTTCT AAAACTTATACTTCAAACACAGCACAGACAGAAAGTACCACTCAGCTATTAGAAGAAACATCTA TTTGGGAAGGAAAAATATCCCTGCTCATGAGAGACAGAGACCATTGTCTTCAGATGTGCTAGCA TGAAAACAGATTTTCTCTTCTTGTCTAAATTTTCTCTGAGTCAGACAAATTTTCCTTCTAGGAG AAAATTTTTTTCTAGGAGGGGGTGGAGAACTTTTTTTTTTTTTTTTTTTTTTTGACACGGAGTC TTGCTCTGTCGCCCAGGCTGGAGTGCAGTGATGCGATCTTGGTTCACTGCAGCCTCT Exemplary Human DNAJB5 cDNA including untranslated regions Variant 5 (SEQ ID NO: 170) GTCCCGGGTGGAGGCGGCGGAGCCGGAGCCGGGGGAGGGGGCAGCGGCTGTCTCACGGACCACG GCGGCGCCCGCAGCTCCTCACCGGTCCGGCACCTGCTGCGCCAGACAGGCCTTGCTGGGCACGC GCCTCTGAGAGTCCAAGGAGGTGGCTTCCAGAGCTGCAGCACTTCAGGCCGGCTCCGGTGGAGC GATCAGAGAAACAAGGAGACCAGTGCTGGTCCAGTGGCTGTGATGGGAAAAGATTATTACAAGA TTCTTGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGAAAGCCTACCGGAAGATGGCCTT GAAGTACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGAGAAGTTTAAGGAGATTGCAGAG GCCTATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTATGACCAGTATGGGGAGGAAGGCC TGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCTTTCACTACACCTTTCATGGGGA CCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCCCTTCGATATCTTCTTTGCCAGC AGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGACATGGATGTGGATGAAGATGAGG ACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGAGTAGGGGTCCAAGGCGAGCCCC AGAACCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGTGGTGCACGAGCTGCGGGTGTCC CTGGAGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATCACAAGGCGTCGCCTCAACCCTG ATGGGCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAGTCATCAAGCGTGGCTGGAAGGA AGGCACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACCTGACAACATCCCTGCTGACATC GTCTTTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGAGATGGCACCAACGTGCTCTACA GTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTGTGAACATTCCCACTATCGACGG CCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGGCACCGTGAAGAGACTCCGTGGG GAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGACCTCATTGTTGAGTTCAAAGTTC GCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTAAGCAGCACCTACCCTGTTCCTA GGCTCTGCCCCAGCCAGTCCAGAGCCTACCACAGCAATACCCCCAACACTCACTCCACTCAATG TGCACCCAGCTTGATGTCCACTGGACACTGGCAACTTTTTCTAAAATGCAAAAAAAAGCCACTG GTTTTCAGGAAAATGTTCCTGTCCCTGACCCCTTTTAGAGCTGGGCTGCCTGGGGGGAGTGGGA GGGAGGTGGGGAGAGCTAGCCCAGGCCAGGGGTCAATGTTCATGTCACTAGCTTTCAATCCAGT TTCCACTGCAGTGGCTGGAGAGTGACCTGAGTGCTACTTGAAGATATGTAGAGATTCCTTATCC ATGCCTGTACATAGCATGTCCTCCTCCCCTCAGCTTTGCTAATACCAGTCCCCTCCCTTCCCTT TGGCTTCCTGCTGTTGGGGGTGGAAAAGACTAGAAAGGACATTGCTTTCTCAGCCCCACCCCCA GGTCCACAGTGTCCAAGGTAATGGCACACATATTCATGCACAAGAAGCACTCACCTAGAGCTGT CTGTGCCTCTCTGGGAGAAAGGAGAGAGGATAAGAAGGGAAAGTTCACAACCTGTGAACAGGGA CTTGAGCACGAGACACTTTTCATCTGGAGCAGGGGGGTGAGCTCCTCAGCAGCCTCCGTAACAG CTGCCCTGCCTACACCTGCAGAGCTGGAGGTTCTGTCCTCCCTGCTGCTCTCAGGAGTGTTCAA GGGTGGAGGGCAGGGAATGGGGCCTGAGGTATTAAGGCTAAGAGGTGGGGGACAGGGCCCATCT ACCAGCCTTCATCAGGAAGGGAAAGGGCTTTGGGGTCAGGTGGCAGCTATTCCCCACCAAGAGA TTCAGGGTCACAGGTTTTTCCCCACACCTCTGAACTCAGGGCCTAGCCACCCCCAAACTTCCAA ATCCCACTTTTGTGATATGTGAAGCTACTCATTTCTTACCCTTGGAGGCTGTGTGGGGAATTTC CAGCCCTTTTATGCCTGTGTGATCCCACCCCACCCCCATAGTTGTATAAAGGTCATAGTGAAGA AGCTGGGGGAAGACTGCTTCAGCCAGATCCTGGGGTGGGGTCTTTAGGTTTTCTCACTTTGACA ACCCCCGAATGTTTTTATAGTAGTTTTTTTGTATTTTTTTGTAATGACAGTATTATGTAAAAAA TAAAGTATTTTAAAAATATGGCATCTGAGCAGGAGCAACAAACCTGGGATGGGGGTGGCTGAGG AGGGCCACTGTCATCCTCCCTCCCGGGCTCTGGTCACCTTTGAGAAGCCCAAGCAGGCCCTCAG TATAAGCTGAAGCTGACCTCTGCCTTCCTCGAAGCCTCCTGGGATTTCTAAAACTTATACTTCA AACACAGCACAGACAGAAAGTACCACTCAGCTATTAGAAGAAACATCTATTTGGGAAGGAAAAA TATCCCTGCTCATGAGAGACAGAGACCATTGTCTTCAGATGTGCTAGCATGAAAACAGATTTTC TCTTCTTGTCTAAATTTTCTCTGAGTCAGACAAATTTTCCTTCTAGGAGAAAATTTTTTTCTAG GAGGGGGTGGAGAACTTTTTTTTTTTTTTTTTTTTTTTGACACGGAGTCTTGCTCTGTCGCCCA GGCTGGAGTGCAGTGATGCGATCTTGGTTCACTGCAGCCTCT Exemplary Human DNAJB5 cDNA coding sequence Variant 3 and Variant 5 (SEQ ID NO: 171) ATGGGAAAAGATTATTACAAGATTCTTGGGATCCCATCGGGGGCCAACGAGGATGAGATCAAGA AAGCCTACCGGAAGATGGCCTTGAAGTACCACCCAGACAAGAATAAAGAACCCAACGCTGAGGA GAAGTTTAAGGAGATTGCAGAGGCCTATGATGTGCTAAGTGACCCCAAGAAACGGGGCCTGTAT GACCAGTATGGGGAGGAAGGCCTGAAGACCGGCGGTGGCACATCAGGTGGCTCCAGTGGCTCCT TTCACTACACCTTTCATGGGGACCCCCATGCCACCTTTGCCTCCTTCTTTGGTGGCTCCAACCC CTTCGATATCTTCTTTGCCAGCAGCCGCTCCACTCGGCCCTTCAGTGGCTTTGACCCAGATGAC ATGGATGTGGATGAAGATGAGGACCCATTTGGCGCTTTCGGCCGTTTTGGCTTCAATGGGCTGA GTAGGGGTCCAAGGCGAGCCCCAGAACCACTGTACCCTCGGCGCAAGGTGCAGGACCCCCCAGT GGTGCACGAGCTGCGGGTGTCCCTGGAGGAGATCTACCATGGCTCCACCAAGCGCATGAAGATC ACAAGGCGTCGCCTCAACCCTGATGGGCGAACTGTGCGCACCGAGGACAAGATCCTGCACATAG TCATCAAGCGTGGCTGGAAGGAAGGCACCAAGATCACCTTCCCCAAAGAAGGCGACGCCACACC TGACAACATCCCTGCTGACATCGTCTTTGTGCTCAAAGACAAGCCCCATGCACACTTCCGCCGA GATGGCACCAACGTGCTCTACAGTGCCCTGATCAGCCTCAAGGAGGCGCTGTGTGGCTGCACTG TGAACATTCCCACTATCGACGGCCGAGTGATCCCTTTGCCCTGCAATGATGTCATCAAGCCAGG CACCGTGAAGAGACTCCGTGGGGAGGGCCTTCCCTTCCCCAAAGTGCCAACTCAGCGAGGAGAC CTCATTGTTGAGTTCAAAGTTCGCTTCCCAGACAGATTAACACCACAGACAAGACAGATCCTTA AGCAGCACCTACCCTGTTCCTAG Polypeptides Encoded by DNAJB5 Gene Exemplary Human Mature DNAJB5 Protein Isoform 1 (encoded by Variant 1 and Variant 4) (SEQ ID NO: 172) MFKRTVLSCPPPAAPPLQARGAFRSFPHSWGEDFLASLMFKIQLEPLKLRAWTLNGFVKFRNKE TSAGPVAVMGKDYYKILGIPSGANEDEIKKAYRKMALKYHPDKNKEPNAEEKFKEIAEAYDVLS DPKKRGLYDQYGEEGLKTGGGTSGGSSGSFHYTFHGDPHATFASFFGGSNPFDIFFASSRSTRP FSGFDPDDMDVDEDEDPFGAFGRFGFNGLSRGPRRAPEPLYPRRKVQDPPVVHELRVSLEEIYH GSTKRMKITRRRLNPDGRTVRTEDKILHIVIKRGWKEGTKITFPKEGDATPDNIPADIVFVLKD KPHAHFRRDGTNVLYSALISLKEALCGCTVNIPTIDGRVIPLPCNDVIKPGTVKRLRGEGLPFP KVPTQRGDLIVEFKVRFPDRLTPQTRQILKQHLPCS Exemplary Human Mature DNAJB5 Protein Isoform 2 (encoded by Variant 2 and Variant 6) (SEQ ID NO: 173) MFKIQLEPLKLRAWTLNGFVKFRNKETSAGPVAVMGKDYYKILGIPSGANEDEIKKAYRKMALK YHPDKNKEPNAEEKFKEIAEAYDVLSDPKKRGLYDQYGEEGLKTGGGTSGGSSGSFHYTFHGDP HATFASFFGGSNPFDIFFASSRSTRPFSGFDPDDMDVDEDEDPFGAFGRFGFNGLSRGPRRAPE PLYPRRKVQDPPVVHELRVSLEEIYHGSTKRMKITRRRLNPDGRTVRTEDKILHIVIKRGWKEG TKITFPKEGDATPDNIPADIVFVLKDKPHAHFRRDGTNVLYSALISLKEALCGCTVNIPTIDGR VIPLPCNDVIKPGTVKRLRGEGLPFPKVPTQRGDLIVEFKVRFPDRLTPQTRQILKQHLPCS Exemplary Human Mature DNAJB5 Protein Isoform 3 (encoded by Variant 3 and Variant 5) (SEQ ID NO: 174) MGKDYYKILGIPSGANEDEIKKAYRKMALKYHPDKNKEPNAEEKFKEIAEAYDVLSDPKKRGLY DQYGEEGLKTGGGTSGGSSGSFHYTFHGDPHATFASFFGGSNPFDIFFASSRSTRPFSGFDPDD MDVDEDEDPFGAFGRFGFNGLSRGPRRAPEPLYPRRKVQDPPVVHELRVSLEEIYHGSTKRMKI TRRRLNPDGRTVRTEDKILHIVIKRGWKEGTKITFPKEGDATPDNIPADIVFVLKDKPHAHFRR DGTNVLYSALISLKEALCGCTVNIPTIDGRVIPLPCNDVIKPGTVKRLRGEGLPFPKVPTQRGD LIVEFKVRFPDRLTPQTRQILKQHLPCS
Constructs
[0304] Among other things, the present disclosure provides that some polynucleotides as described herein are polynucleotide constructs. Polynucleotide constructs according to the present disclosure include all those known in the art, including cosmids, plasmids (e.g., naked or contained in liposomes) and viral constructs (e.g., lentiviral, retroviral, adenoviral, and adeno-associated viral constructs) that incorporate a polynucleotide comprising an NDP gene or characteristic portion thereof. In addition, polynucleotide constructs according to the present disclosure include all those known in the art, including cosmids, plasmids (e.g., naked or contained in liposomes) and viral constructs (e.g., lentiviral, retroviral, adenoviral, and adeno-associated viral constructs) that incorporate a polynucleotide comprising an HSPA1A gene or characteristic portion thereof. Moreover, polynucleotide constructs according to the present disclosure include all those known in the art, including cosmids, plasmids (e.g., naked or contained in liposomes) and viral constructs (e.g., lentiviral, retroviral, adenoviral, and adeno-associated viral constructs) that incorporate a polynucleotide comprising a gene encoding a secreted target protein (e.g., a NDP gene (e.g., any of the exemplary NDP genes described herein), a HSPA1A gene (e.g., any of the exemplary HSPA1A genes described herein)) gene or characteristic portion thereof.
[0305] Those of skill in the art will be capable of selecting suitable constructs, as well as cells, for making any of the polynucleotides described herein. In some embodiments, a construct is a plasmid (i.e., a circular DNA molecule that can autonomously replicate inside a cell). In some embodiments, a construct can be a cosmid (e.g., pWE or sCos series).
[0306] In some embodiments, a construct is a viral construct. In some embodiments, a viral construct is a lentivirus, retrovirus, adenovirus, or adeno-associated virus construct. In some embodiments, a construct is an adeno-associated virus (AAV) construct (see, e.g., Asokan et al., Mol. Ther. 20: 699-7080, 2012, which is incorporated in its entirety herein by reference). In some embodiments, a viral construct is an adenovirus construct. In some embodiments, a viral construct may also be based on or derived from an alphavirus. Alphaviruses include Sindbis (and VEEV) virus, Aura virus, Babanki virus, Barmah Forest virus, Bebaru virus, Cabassou virus, Chikungunya virus, Eastern equine encephalitis virus, Everglades virus, Fort Morgan virus, Getah virus, Highlands J virus, Kyzylagach virus, Mayaro virus, Me Tri virus, Middelburg virus, Mosso das Pedras virus, Mucambo virus, Ndumu virus, O'nyong-nyong virus, Pixuna virus, Rio Negro virus, Ross River virus, Salmon pancreas disease virus, Semliki Forest virus, Southern elephant seal virus, Tonate virus, Trocara virus, Una virus, Venezuelan equine encephalitis virus, Western equine encephalitis virus, and Whataroa virus. Generally, the genome of such viruses encode nonstructural (e.g., replicon) and structural proteins (e.g., capsid and envelope) that can be translated in the cytoplasm of the host cell. Ross River virus, Sindbis virus, Semliki Forest virus (SFV), and Venezuelan equine encephalitis virus (VEEV) have all been used to develop viral constructs for coding sequence delivery. Pseudotyped viruses may be formed by combining alphaviral envelope glycoproteins and retroviral capsids. Examples of alphaviral constructs can be found in U.S. Publication Nos. 20150050243, 20090305344, and 20060177819; constructs and methods of their making are incorporated herein by reference to each of the publications in its entirety.
[0307] Constructs provided herein can be of different sizes. In some embodiments, a construct is a plasmid and can include a total length of up to about 1 kb, up to about 2 kb, up to about 3 kb, up to about 4 kb, up to about 5 kb, up to about 6 kb, up to about 7 kb, up to about 8kb, up to about 9 kb, up to about 10 kb, up to about 11 kb, up to about 12 kb, up to about 13 kb, up to about 14 kb, or up to about 15 kb. In some embodiments, a construct is a plasmid and can have a total length in a range of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 1 kb to about 9 kb, about 1 kb to about 10 kb, about 1 kb to about 11 kb, about 1 kb to about 12 kb, about 1 kb to about 13 kb, about 1 kb to about 14 kb, or about 1 kb to about 15 kb.
[0308] In some embodiments, a construct is a viral construct and can have a total number of nucleotides of up to 10 kb. In some embodiments, a viral construct can have a total number of nucleotides in the range of about 1 kb to about 2 kb, 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 1 kb to about 9 kb, about 1 kb to about 10 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 2 kb to about 9 kb, about 2 kb to about 10 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 3 kb to about 9 kb, about 3 kb to about 10 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 4 kb to about 9 kb, about 4 kb to about 10 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 5 kb to about 9 kb, about 5 kb to about 10 kb, about 6 kb to about 7 kb, about 6 kb to about 8 kb, about 6 kb to about 9 kb, about 6 kb to about 10 kb, about 7 kb to about 8 kb, about 7 kb to about 9 kb, about 7 kb to about 10 kb, about 8 kb to about 9 kb, about 8 kb to about 10 kb, or about 9 kb to about 10 kb.
[0309] In some embodiments, a construct is a lentivirus construct and can have a total number of nucleotides of up to 8 kb. In some examples, a lentivirus construct can have a total number of nucleotides of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 6 kb to about 8kb, about 6 kb to about 7 kb, or about 7 kb to about 8 kb
[0310] In some embodiments, a construct is an adenovirus construct and can have a total number of nucleotides of up to 8 kb. In some embodiments, an adenovirus construct can have a total number of nucleotides in the range of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 6 kb to about 7 kb, about 6 kb to about 8 kb, or about 7 kb to about 8 kb.
[0311] Any of the constructs described herein can further include a control sequence, e.g., a control sequence selected from the group of a transcription initiation sequence, a transcription termination sequence, a promoter sequence, an enhancer sequence, an RNA splicing sequence, a polyadenylation (poly(A)) sequence, a Kozak consensus sequence, and/or additional untranslated regions which may house pre- or post-transcriptional regulatory and/or control elements. In some embodiments, a promoter can be a native promoter, a constitutive promoter, an inducible promoter, and/or a tissue-specific promoter. Non-limiting examples of control sequences are described herein.
AAV Particles
[0312] Among other things, the present disclosure provides AAV particles that comprise a construct encoding an NDP gene or characteristic portion thereof described herein, and a capsid described herein. In addition, the present disclosure provides AAV particles that comprise a construct encoding an HSPA1A gene or characteristic portion thereof described herein, and a capsid described herein. In addition, the present disclosure provides AAV particles that comprise a construct encoding a gene encoding a secreted target protein (e.g., a NDP gene (e.g., any of the exemplary NDP genes described herein), a HSPA1A gene (e.g., any of the exemplary HSPA1A genes described herein)) gene or characteristic portion thereof described herein, and a capsid described herein.
[0313] In some embodiments, AAV particles can be described as having a serotype, which is a description of the construct strain and the capsid strain. For example, in some embodiments an AAV particle may be described as AAV2, wherein the particle has an AAV2 capsid and a construct that comprises characteristic AAV2 Inverted Terminal Repeats (ITRs). In some embodiments, an AAV particle may be described as AAV1, In some embodiments, an AAV particle may be described as AAV2, In some embodiments, an AAV particle may be described as AAV3. In some embodiments, an AAV particle may be described as AAV4. In some embodiments, an AAV particle may be described as AAV5. In some embodiments, an AAV particle may be described as AAV6. In some embodiments, an AAV particle may be described as AAV7. In some embodiments, an AAV particle may be described as AAV8. In some embodiments, an AAV particle may be described as AAV9. In some embodiments, an AAV particle may be AAV-rh8. In some embodiments, an AAV particle may comprise an AAV-rh10 capsid. In some embodiments, an AAV particle may comprise an AAV-rh39 capsid. In some embodiments, an AAV particle may comprise an AAV-rh43 capsid. In some embodiments, an AAV particle may comprise an AAV Anc80 capsid. In some embodiments, an AAV particle may be described as a pseudotype, wherein the capsid and construct are derived from different AAV strains as described herein, for example, AAV2/9 would refer to an AAV particle that comprises a construct utilizing the AAV2 ITRs and an AAV9 capsid. In some embodiments, an AAV particle may be described as other AAV variants and serotypes as described herein.
AAV Construct
[0314] The present disclosure provides polynucleotide constructs that comprise a gene encoding a secreted target protein (e.g., a NDP gene (e.g., any of the exemplary NDP genes described herein), a HSPA1A gene (e.g., any of the exemplary HSPA1A genes described herein)) gene or characteristic portion thereof. For example, the present disclosure provides polynucleotide constructs that comprise an NDP gene or characteristic portion thereof. In some embodiments described herein, a polynucleotide comprising an NDP gene or characteristic portion thereof can be included in an AAV particle. As another example, the present disclosure also provides polynucleotide constructs that comprise an HSPA1A gene or characteristic portion thereof. In some embodiments described herein, a polynucleotide comprising an HSPA1A gene or characteristic portion thereof can be included in an AAV particle.
[0315] In some embodiments, a polynucleotide construct comprises one or more components derived from or modified from a naturally occurring AAV genomic construct. In some embodiments, a sequence derived from an AAV construct is an AAV1 construct, an AAV2 construct, an AAV3 construct, an AAV4 construct, an AAV5 construct, an AAV6 construct, an AAV7 construct, an AAV8 construct, an AAV9 construct, an AAV2.7m8 construct, an AAV8BP2 construct, an AAV293 construct, or AAV Anc80 construct. Additional exemplary AAV constructs that can be used herein are known in the art. See, e.g., Kanaan et al., Mol. Ther. Nucleic Acids 8:184-197, 2017; Li et al., Mol. Ther. 16(7): 1252-1260, 2008; Adachi et al., Nat. Commun. 5: 3075, 2014; Isgrig et al., Nat. Commun. 10(1): 427, 2019; and Gao et al., J. Virol. 78(12): 6381-6388, 2004; each of which is incorporated in its entirety herein by reference.
[0316] In some embodiments, provided constructs comprise coding sequence, e.g., an gene encoding a secreted target protein (e.g., an NDP gene, e.g., an HSPA1A gene) or a characteristic portion thereof, one or more regulatory and/or control sequences, and optionally 5′ and 3′ AAV derived inverted terminal repeats (ITRs). In some embodiments wherein a 5′ and 3′ AAV derived ITR is utilized, the polynucleotide construct may be referred to as an AAV construct. In some embodiments, provided AAV constructs are packaged into an AAV capsid to form an AAV particle.
[0317] In some embodiments, AAV derived sequences (which are comprised in a polynucleotide construct) typically include the cis-acting 5′ and 3′ ITR sequences (see, e.g., B. J. Carter, in “Handbook of Parvoviruses,” ed., P. Tijsser, CRC Press, pp. 155 168, 1990, which is incorporated herein by reference in its entirety). Typical AAV2-derived ITR sequences are about 145 nucleotides in length. In some embodiments, at least 80% of a typical ITR sequence (e.g., at least 85%, at least 90%, or at least 95%) is incorporated into a construct provided herein. The ability to modify these ITR sequences is within the skill of the art. (See, e.g., texts such as Sambrook et al., “Molecular Cloning. A Laboratory Manual”, 2d ed., Cold Spring Harbor Laboratory, New York, 1989; and K. Fisher et al., J Virol. 70:520 532, 1996, each of which is incorporated in its entirety by reference). In some embodiments, any of the coding sequences and/or constructs described herein are flanked by 5′ and 3′ AAV ITR sequences. The AAV ITR sequences may be obtained from any known AAV, including presently identified AAV types.
[0318] In some embodiments, polynucleotide constructs described in accordance with this disclosure and in a pattern known to the art (see, e.g., Asokan et al., Mol. Ther. 20: 699-7080, 2012, which is incorporated herein by reference in its entirety) are typically comprised of, a coding sequence or a portion thereof, at least one and/or control sequence, and optionally 5′ and 3′ AAV inverted terminal repeats (ITRs). In some embodiments, provided constructs can be packaged into a capsid to create an AAV particle. An AAV particle may be delivered to a selected target cell. In some embodiments, provided constructs comprise an additional optional coding sequence that is a nucleic acid sequence (e.g., inhibitory nucleic acid sequence), heterologous to the construct sequences, which encodes a polypeptide, protein, functional RNA molecule (e.g., miRNA, miRNA inhibitor) or other gene product, of interest. In some embodiments, a nucleic acid coding sequence is operatively linked to and/or control components in a manner that permits coding sequence transcription, translation, and/or expression in a cell of a target tissue.
[0319] As shown in
[0320] In some embodiments, an AAV construct is a recombinant AAV construct or “rAAV” construct. In some embodiments, an AAV construct can include at least 500 bp, at least 1 kb, at least 1.5 kb, at least 2 kb, at least 2.5 kb, at least 3 kb, at least 3.5 kb, at least 4 kb, or at least 4.5 kb. In some embodiments, an AAV construct can include at most 7.5 kb, at most 7 kb, at most 6.5 kb, at most 6 kb, at most 5.5 kb, at most 5 kb, at most 4.5 kb, at most 4 kb, at most 3.5 kb, at most 3 kb, or at most 2.5 kb. In some embodiments, an AAV construct can include about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, or about 4 kb to about 5 kb. AAV constructs are typically composed of, at a minimum, a transgene or a portion thereof and a regulatory sequence, and optionally 5′ and 3′ AAV inverted terminal repeats (ITRs). Such an AAV construct is packaged into a capsid and delivered to a selected target cell (e.g., a cochlear hair cell).
[0321] Any of the constructs described herein can further include regulatory and/or control sequences, e.g., a control sequence selected from the group of a transcription initiation sequence, a transcription termination sequence, a promoter sequence, an enhancer sequence, an RNA splicing sequence, a polyadenylation (poly(A)) sequence, a Kozak consensus sequence, and/or any combination thereof. In some embodiments, a promoter can be a native promoter, a constitutive promoter, an inducible promoter, and/or a tissue-specific promoter. Non-limiting examples of control sequences are described herein.
[0322] The AAV sequences of the vector typically comprise the cis-acting 5′ and 3′ ITR sequences (See, e.g., B. J. Carter, in “Handbook of Parvoviruses”, ed., P. Tijsser, CRC Press, pp. 155 168, 1990, the contents of which is hereby incorporated by reference herein in its entirety). Typical AAV ITR sequences are about 145 nucleotides in length. In some embodiments, at least 75% of a typical ITR sequence (e.g., at least 80%, at least 85%, at least 90%, or at least 95%) is incorporated into the AAV vector. The ability to modify these ITR sequences is within the skill of the art. (See, e.g., texts such as Sambrook et al., “Molecular Cloning. A Laboratory Manual”, 2d ed., Cold Spring Harbor Laboratory, New York, 1989; and K. Fisher et al., J Virol. 70:520 532, 1996, each of which is incorporated in its entirety herein by reference). In some embodiments, any of the coding sequences described herein is flanked by 5′ and 3′ AAV ITR sequences in the AAV vectors. The AAV ITR sequences may be obtained from any known AAV, including presently identified AAV types.
[0323] AAV vectors as described herein may include any of the regulatory elements described herein (e.g., one or more of a promoter, a polyadenylation (poly(A)) signal sequence, and an IRES).
[0324] In some embodiments, the AAV vector is selected from the group consisting of: an AAV1 vector, an AAV2 vector, an AAV3 vector, an AAV4 vector, an AAV5 vector, an AAV6 vector, an AAV7 vector, an AAV8 vector, an AAV9 vector, an AAV2.7m8 vector, an AAV8BP2 vector, and an AAV293 vector. Additional exemplary AAV vectors that can be used herein are known in the art. See, e.g., Kanaan et al., Mol. Ther. Nucleic Acids 8:184-197, 2017; Li et al., Mol. Ther. 16(7): 1252-1260; Adachi et al., Nat. Commun. 5: 3075, 2014; Isgrig et al., Nat. Commun. 10(1): 427, 2019; and Gao et al., J. Virol. 78(12): 6381-6388, each of which is incorporated in its entirety herein by reference.
[0325] In some embodiments, an AAV vector provided herein includes or consists of a sequence that is at least 80% identical (e.g., at least 82%, at least 84%, at least 85%, at least 86%, at least 88%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at least 98%, at least 99%, or 100% identical) to SEQ ID NO: 9, 10, 11, 12, or 13.
[0326] In some embodiments, the vector(s) is an adenovirus vector (see, e.g., Dmitriev et al. (1998) J. Virol. 72: 9706-9713; and Poulin et al., J. Virol 8: 10074-10086, 2010, each of which is incorporated in its entirety herein by reference). In some embodiments, the vector(s) is a retrovirus (see, e.g., Maier et al. (2010) Future Microbiol 5: 1507-23, the contents of which is incorporated in its entirety herein).
[0327] The vectors provided herein can be of different sizes. The choice of vector that is used in any of the compositions, kits, and methods described herein may depend on the size of the vector.
[0328] In some embodiments, the vector(s) can have a total number of nucleotides of up to 10 kb. In some embodiments, the viral vector(s) can have a total number of nucleotides in the range of about 1 kb to about 2 kb, 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 1 kb to about 9 kb, about 1 kb to about 10 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 2 kb to about 9 kb, about 2 kb to about 10 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 3 kb to about 9 kb, about 3 kb to about 10 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 4 kb to about 9 kb, about 4 kb to about 10 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 5 kb to about 9 kb, about 5 kb to about 10 kb, about 6 kb to about 7 kb, about 6 kb to about 8 kb, about 6 kb to about 9 kb, about 6 kb to about 10 kb, about 7 kb to about 8 kb, about 7 kb to about 9 kb, about 7 kb to about 10 kb, about 8 kb to about 9 kb, about 8 kb to about 10 kb, or about 9 kb to about 10 kb.
[0329] In some embodiments, the vector(s) is a lentivirus and can have a total number of nucleotides of up to 8 kb. In some examples, the lentivirus(es) can have a total number of nucleotides of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 6 kb to about 8kb, about 6 kb to about 7 kb, or about 7 kb to about 8 kb.
[0330] In some embodiments, the vector(s) is an adenovirus and can have a total number of nucleotides of up to 8 kb. In some embodiments, the adenovirus(es) can have a total number of nucleotides in the range of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 1 kb to about 6 kb, about 1 kb to about 7 kb, about 1 kb to about 8 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5 kb, about 2 kb to about 6 kb, about 2 kb to about 7 kb, about 2 kb to about 8 kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, about 3 kb to about 6 kb, about 3 kb to about 7 kb, about 3 kb to about 8 kb, about 4 kb to about 5 kb, about 4 kb to about 6 kb, about 4 kb to about 7 kb, about 4 kb to about 8 kb, about 5 kb to about 6 kb, about 5 kb to about 7 kb, about 5 kb to about 8 kb, about 6 kb to about 7 kb, about 6 kb to about 8 kb, or about 7 kb to about 8 kb.
[0331] In some embodiments, the vector(s) is an adeno-associated virus (AAV vector) and can include a total number of nucleotides of up to 5 kb. In some embodiments, the AAV vector(s) can include a total number of nucleotides in the range of about 1 kb to about 2 kb, about 1 kb to about 3 kb, about 1 kb to about 4 kb, about 1 kb to about 5 kb, about 2 kb to about 3 kb, about 2 kb to about 4 kb, about 2 kb to about 5kb, about 3 kb to about 4 kb, about 3 kb to about 5 kb, or about 4 kb to about 5 kb.
[0332] Provided herein are exemplary vectors that can be used in any of the compositions and methods described herein. See, e.g.,
[0333] A variety of different methods known in the art can be used to introduce any of vectors disclosed herein into a mammalian cell (e.g., an inner ear cell, a cochlear inner hair cell). Non-limiting examples of methods for introducing nucleic acid into a mammalian cell include: lipofection, transfection (e.g., calcium phosphate transfection, transfection using highly branched organic compounds, transfection using cationic polymers, dendrimer-based transfection, optical transfection, particle-based transfection (e.g., nanoparticle transfection), or transfection using liposomes (e.g., cationic liposomes)), microinjection, electroporation, cell squeezing, sonoporation, protoplast fusion, impalefection, hydrodynamic delivery, gene gun, magnetofection, viral transfection, and nucleofection.
[0334] Any of the vectors described herein can further include a control sequence, e.g., a control sequence selected from the group of a transcription initiation sequence, a transcription termination sequence, a promoter sequence, an enhancer sequence, an RNA splicing sequence, a polyadenylation (polyA) signal, and a Kozak consensus sequence. Non-limiting examples of these control sequences are described herein. In some embodiments, a promoter can be a native promoter, a constitutive promoter, an inducible promoter, and/or a tissue-specific promoter.
TABLE-US-00011 Exemplary Construct 1 (SEQ ID NO: 9) CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCTAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGT TCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCC GGGGACTCTGCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGG GAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTC AGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAA AAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTT ACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTT TTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGA TACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAAC TAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATG ACTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATG GTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCT GGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACA ACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTA ATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTT GGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAA AGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCC CGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATT GCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGG GGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCAATTCAGAC TCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTGGTC CAAAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAGAAA GGGGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTCCCTCAAGCAA ATAGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCTGAC CAGGGTCCTATTCCAAGACAACTCTTCTACTCTTCCAGCAGACTCTTATAGAAAGAAAGATTCT GGGGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCACTTC TAACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTC AGTCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGACGTG CCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCT CACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGC GAGCGAGCGCGCAGCTGCCTGCAGG Exemplary Construct 2 (SEQ ID NO: 10) CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCTAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCC CCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAAT TGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAG GGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGGTTGACTAA AGATTGAACCTTATTCAAAGTTAGACTCTCTTTGTTAAAGACAAACAAAACTTCCAATAATTGA GCAACTAATAGAAAGACTCAACTTGTGAGCGAGTACTTATTAATTGAGAATAGTGAGTGAGTCT GGCAAAATATATTAGAAAGTGACACTGAGTTTAAAAAAGTGACACCTTTGATTCTGAACAGTGA ACTTTGAGACTGAAAAACTACAGCTTTGAGGCTAAATACTTGATCAAATAAAACTAGTTAGTCA AAAAACTGAGTGAAAGTCCAACAGAAAAAAGAGGGCCCACTTCCACCAGTGACACAAAATCCAG ATTGATCGTTCTTTAAGTGACTATTCTTGCCAGAATCAGCAAGGTGGATACAAAGGACTCTGAG AAAGAACTCTCTGAACTCTGGGCAAGGCCCCAGTCCAAAGCAATTAGTATCCTTAGGACCAGAA AAATCTGTGGAAGGTCAGAATTTCTTGTCTGAGAAAAACAAAGCAATTCAGACTCCCACTAGAA GAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTGGTCCAAAAACTGAA AAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAGAAAGGGGCACTTAC TCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTAGTAGACTCCCTCAAGCAAATAGATCTCAC TTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCTGACCAGGGTCCTAT TCCAAGACAACTCTTCTAGTCTTCCAGCAGACTCTTATAGAAAGAAAGATTCTGGGGTCCAAGA AAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCACTTCTAACCTTGGAG TGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTCAGTCAACTACA AGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGACGTGCCTCGGACCGC TAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCG GGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCG CAGCTGCCTGCAGG Exemplary Construct 3 (SEQ ID NO: 11) CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCGGCTCCGGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAG AATCCTGGCCCAATGGAGAGCGACGAGAGCGGCCTGCCCGCCATGGAGATCGAGTGCCGCATCA CCGGCACCCTGAACGGCGTGGAGTTCGAGCTGGTGGGCGGCGGAGAGGGCACCCCCGAGCAGGG CCGCATGACCAACAAGATGAAGAGCACCAAAGGCGCCCTGACCTTCAGCCCCTACCTGCTGAGC CACGTGATGGGCTACGGCTTCTACCACTTCGGCACCTACCCCAGCGGCTACGAGAACCCCTTCC TGCACGCCATCAACAACGGCGGCTACACCAACACCCGCATCGAGAAGTACGAGGACGGCGGCGT GCTGCACGTGAGCTTCAGCTACCGCTACGAGGCCGGCCGCGTGATCGGCGACTTCAAGGTGATG GGCACCGGCTTCCCCGAGGACAGCGTGATCTTCACCGACAAGATCATCCGCAGCAACGCCACCG TGGAGCACCTGCACCCCATGGGCGATAACGATCTGGATGGCAGCTTCACCCGCACCTTCAGCCT GCGCGACGGCGGCTACTACAGCTCCGTGGTGGACAGCCACATGCACTTCAAGAGCGCCATCCAC CCCAGCATCCTGCAGAACGGGGGCCCCATGTTCGCCTTCCGCCGCGTGGAGGAGGATCACAGCA ACACCGAGCTGGGCATCGTGGAGTACCAGCACGCCTTCAAGACCCCGGATGCAGATGCCGGTGA AGAATAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGC CAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCT GCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTT TGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTT CCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCC CTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAA AGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGT CATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCA TGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAACTAACAAACA GATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGACTTTTTGA AAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTT TGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCTGGAAAAGGC ATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTAT GAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACT TGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAAT GTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATAT TCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTC CTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCAT TGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATT GGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGAAGCTTGAATTCAGCTGAC GTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTG AGCGAGCGAGCGCGCAGCTGCCTGCAGG Exemplary Construct 4 (SEQ ID NO: 12) CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCGGCTCCGGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAG AATCCTGGCCCAATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCG AGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCAC CTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACC CTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGC ACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGA CGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATC GAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACT ACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCA AAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCAC TCTCGGCATGGACGAGCTGTACAAGTAATAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGAC AACTGTAGAGGCAGTTCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGG ATGCAACAATTCTCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAG AGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGG TTCATTTTCAGATTCAGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTC AGCATTAGTGGGAAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCAC GCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTAT TGTTGATGTTTTTTTTTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCG ATTTCAAAGTCCAGATACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAAC TGACATTAAAATAACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAAT TATTATTCAGAAATGAGTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGT CTCAGATAAATTATGGTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGA TCCTGCACTGACTCTGGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATG CATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCAC GGGCATCATTCTCTAATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTT GTCTTCTGAACGTTTGGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCA CGTTTTGAAATAAAAAGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGT TGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAA TAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGG GGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTC TATGGAAGCTTGAATTCAGCTGACGTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGC CACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCG GGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGG Exemplary Construct 5 (SEQ ID NO: 13) CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTAAGATGCT CCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCGATAACGAGCGCCTCACATTTCC GTGGCATTCCCATTTGCTAGTGCGCTGCTGCGGCCGCACGCCTGATTGATATATGACTGCAATG GCACTTTTCCATTTGACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCTCTCTCCCT CTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTTGCAAATATAAAAGG CAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGC TCTCAGAAAAGTCTGAGAAGAAAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATACTGTGA TGATGGATTGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGACAACCAGA AAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAAGCCATTTC CAATTTTTTGTCCTCTGCCTCCCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACAACAGCCAC CATGAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGAT ACAGACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACC ACTATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAG GTGCGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTC AAGCAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGC GGCTGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTG CGAGGAATGCAATTCCTAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGC AGTTCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTC TCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTAGATGCTTGTTGACAGAGAGAGATACTC TGGGAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGA TTCAGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGG AAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAA TTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTT TTTTTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCC AGATACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAAT AACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAA ATGACTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATT ATGGTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGAC TCTGGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAG ACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCT CTAATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACG TTTGGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATA AAAAGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTC CCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAA ATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCA AGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCAATTCA GACTCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTG GTCCAAAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAG AAAGGGGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTCCCTCAAG CAAATAGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCT GAGGAGGGTCCTATTCCAAGACAACTCTTCTACTCTTCGAGCAGACTCTTATAGAAAGAAAGAT TCTGGGGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCAC TTCTAACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAA TTCAGTCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGAC GTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTG AGCGAGCGAGCGCGCAGCTGCCTGCAGG Exemplary Construct 6 (SEQ ID NO: 96) CTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCA GTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA CGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGC CCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACG ACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCA TTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCAT ATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGT ACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCAT GGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAAT TTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGC GCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCC AATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATA AAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGC CGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGAC GGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGC TGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTG CGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGC TGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGG TGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGG GGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAG TTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGT GCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAG GGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCA GCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGC GGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGC GCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCC CTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGG GTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTC TTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCATGTACCGGATGC AGCTGCTGAGCTGTATCGCCCTGTCTCTGGCCCTGGTCACCAATTCTGCCAAAGCCGCGGCGAT CGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGTGGAGATC ATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGCCTTCACGGACACCGAGCGGC TCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTGACGCGAA GCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGGACATGAAGCACTGGCCTTTC CAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACCAAGGCAT TCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAGGAGATCGCCGAGGCGTACCT GGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACTTCAACGACTCGCAGCGCCAG GCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCGGATCATCAACGAGCCCACGG CCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCATCTTTGA CCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACGACGGCATCTTCGAGGTGAAG GCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAACAGGCTGGTGAACCACTTCG TGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGAGGCGGCT GCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCTGGAGATC GACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAGGGCGAGGTTCGAGGAGCTGT GCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCAAGCTGGA CAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGTGCAGAAG CTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCATCAACCCCGACGAGGCTGTGG CCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAGTCCGAGAACGTGCAGGACCT GCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGATGACTGCC CTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGATCTTCACCACCTACTCCGAGA ACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCCATGACGAAAGACAACAATCT GTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGATCGAGGTG ACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCACGGACAAGAGCACCGGCAAGG CCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGCGCATGGT GCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGCCAAGAAC GCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAGGGCAAGA TCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAGGTCATCTCGTGGCTGGACGC CAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGTGTGTAAC CCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGGGCTCAGG GTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTGGATTAAGAGCTCGCTGATCA GCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGA CCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCT GAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAA GACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGAAGCTTGAATTCAGCTGACGTGCC TCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCA CTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGA GCGAGCGCGCAG
TABLE-US-00012 Components Position SEQ (5′ to 3′ Size (in ID order) (nt) Origins and Notes construct) NO 5′ITR 119 AAV ITR (AF043303.1) 1-119 116 Cloning site 14 N/A (/NotI/MluI) 120-133 117 CMV 381 NCBI K03104.1 (140 . . . 520) 134-514 118 enhancer CBA 279 NCBI X00182.1 (268 . . . 543) 515-791 119 promoter Chimeric 1013 NCBI X00182.1 (544 . . . 1503) 792-1804 120 intron NCBI V00882.1 (1250 . . . 1317); XbaI cloning site Cloning site 39 N/A; AgeI 1805-1843 121 IL2ss 60 NCBI AAB86861.1 1844-1903 122 Hsp70 1920 NCBI NM_005345 1904-3823 123 Cloning site 24 Stop codon/SacI 3824-3847 124 BGHpA 225 NCBI M57764.1 (2326 . . . 2550); 3848-4072 125 termination site Cloning site 34 N/A; HindIII/EcoRI/PvuII/RsrII 4073-4106 126 3′ITR 130 AAV ITR (AF043303.1) 4107-4236 127
TABLE-US-00013 Exemplary Construct 7 (SEQ ID NO: 97) CTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCA GTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA CGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGC CCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACG ACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCA TTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCAT ATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGT ACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCAT GGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAAT TTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGC GCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCC AATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATA AAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGC CGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGAC GGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGC TGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTG CGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGC TGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGG TGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGG GGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAG TTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGT GCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAG GGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCA GCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGC GGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGC GCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCC CTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGG GTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTC TTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCATGGCCAAAGCCG CGGCGATCGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGT GGAGATCATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGCCTTCACGGACACC GAGCGGCTCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTG ACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGGACATGAAGCACTG GCCTTTCCAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACC AAGGCATTCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAGGAGATCGCCGAGG CGTACCTGGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACTTCAACGACTCGCA GCGCCAGGCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCGGATCATCAACGAG CCCACGGCCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCA TCTTTGACCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACGACGGCATCTTCGA GGTGAAGGCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAACAGGCTGGTGAAC CACTTCGTGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGA GGCGGCTGCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCT GGAGATCGACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAGGGCGAGGTTCGAG GAGCTGTGCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCA AGCTGGACAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGT GCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCATCAACCCCGACGAG GCTGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAGTCCGAGAACGTGC AGGACCTGCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGAT GACTGCCCTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGATCTTCACCACCTAC TCCGACAACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCCATGACGAAAGACA ACAATCTGTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGAT CGAGGTGACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCACGGACAAGAGCACC GGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGC GCATGGTGCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGC CAAGAACGCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAG GGCAAGATCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAGGTCATCTCGTGGC TGGACGCCAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGT GTGTAACCCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGG GCTCAGGGTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTGGATTAAGAGCTCG CTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCT TCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGC ATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGA TTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGAAGCTTGAATTCAGCTG ACGTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGC TCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAG TGAGCGAGCGAGCGCGCAG
TABLE-US-00014 Components Position SEQ (5′ to 3′ Size (in ID order) (nt) Origins and Notes construct) NO 5′ITR 119 AAV ITR (AF043303.1) 1-119 128 Cloning site 14 N/A (/NotI/MluI) 120-133 129 CMV 381 NCBI K03104.1 (140 . . . 520) 134-514 130 enhancer CBA 279 NCBI X00182.1 (268 . . . 543) 515-791 131 promoter Chimeric 1013 NCBI X00182.1 (544 . . . 1503) 792-1804 132 intron NCBI V00882.1 (1250 . . . 1317); XbaI cloning site Cloning site 39 N/A; AgeI 1805-1843 133 Hsp70 1923 NCBI NM_005345 1844-3766 134 Cloning site 24 Stop codon/SacI 3767-3790 135 BGHpA 225 NCBI M57764.1 (2326 . . . 2550); 3791-4015 136 termination site Cloning site 34 N/A; HindIII/EcoRI/PvuII/RsrII 4016-4049 137 3′ITR 130 AAV2AAV ITR (AF043303.1) 4050-4179 138
TABLE-US-00015 Exemplary Construct 8 (SEQ ID NO: 98) CTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCA GTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA CGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGC CCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACG ACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCA TTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCAT ATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGT ACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCAT GGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAAT TTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGC GCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCC AATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATA AAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGC CGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGAC GGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGC TGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTG CGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGC TGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGG TGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGG GGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAG TTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGT GCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAG GGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCA GCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGC GGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGC GCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCC CTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGG GTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTC TTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCATGGCCAAAGCCG CGGCGATCGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGT GGAGATCATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGCCTTCACGGACACC GAGCGGCTCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTG ACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGGACATGAAGCACTG GCCTTTCCAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACC AAGGCATTCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAGGAGATCGCCGAGG CGTACCTGGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACTTCAACGACTCGCA GCGCCAGGCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCGGATCATCAACGAG CCCACGGCCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCA TCTTTGACCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACGACGGCATCTTCGA GGTGAAGGCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAACAGGCTGGTGAAC CACTTCGTGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGA GGCGGCTGCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCT GGAGATCGACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAGGGCGAGGTTCGAG GAGCTGTGCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCA AGCTGGACAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGT GCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCATCAACCCCGACGAG GCTGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAGTCCGAGAACGTGC AGGACCTGCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGAT GACTGCCCTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGATCTTCACCACCTAC TCCGACAACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCCATGACGAAAGACA ACAATCTGTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGAT CGAGGTGACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCACGGACAAGAGCACC GGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGC GCATGGTGCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGC CAAGAACGCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAG GGCAAGATCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAGGTCATCTCGTGGC TGGACGCCAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGT GTGTAACCCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGG GCTCAGGGTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTGGATGGATCCCGGG CTGAGTACAAAGACCATGACGGTGATTATAAAGATCATGACATCGACTACAAGGATGACGATGA CAAGGGCTCCGGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGC CCAATGGAGAGCGACGAGAGCGGCCTGCCCGCCATGGAGATCGAGTGCCGCATCACCGGCACCC TGAACGGCGTGGAGTTCGAGCTGGTGGGCGGCGGAGAGGGCACCCCCGAGCAGGGCCGCATGAC CAACAAGATGAAGAGCACCAAAGGCGCCCTGACCTTCAGCCCCTACCTGCTGAGCCACGTGATG GGCTACGGCTTCTACCACTTCGGCACCTACCCCAGCGGCTACGAGAACCCCTTCCTGCACGCCA TCAACAACGGCGGCTACACCAACACCCGCATCGAGAAGTACGAGGACGGCGGCGTGCTGCACGT GAGCTTCAGCTACCGCTACGAGGCCGGCCGCGTGATCGGCGACTTCAAGGTGATGGGCACCGGC TTCCCCGAGGACAGCGTGATCTTCACCGACAAGATCATCCGCAGCAACGCCACCGTGGAGCACC TGCACCCCATGGGCGATAACGATCTGGATGGCAGCTTCACCCGCACCTTCAGCCTGCGCGACGG CGGCTACTACAGCTCCGTGGTGGACAGCCACATGCACTTCAAGAGCGCCATCCACCCCAGCATC CTGCAGAACGGGGGCCCCATGTTCGCCTTCCGCCGCGTGGAGGAGGATCACAGCAACACCGAGC TGGGCATCGTGGAGTACCAGCACGCCTTCAAGACCCCGGATGCAGATGCCGGTGAAGAATAAGA GCTCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCG TGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGC ATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGG GAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGAAGCTTGAATTC AGCTGACGTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCG CTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGC CTCAGTGAGCGAGCGAGCGCGCAG
TABLE-US-00016 Components Position SEQ (5′ to 3′ Size (in ID order) (nt) Origins and Notes construct) NO 5′ITR 119 AAV2 ITR (AF043303.1) 1-119 139 Cloning site 14 N/A (/NotI/MluI) 120-133 140 CMV 381 NCBI K03104.1 (140 . . . 520) 134-514 141 enhancer CBA 279 NCBI X00182.1 (268 . . . 543) 515-791 142 promoter Chimeric 1013 NCBI X00182.1 (544 . . . 1503) 792-1804 143 intron NCBI V00882.1 (1250 . . . 1317); XbaI cloning site Cloning site 39 N/A; AgeI 1805-1843 144 Hsp70 1923 NCBI NM_005345 1844-3766 145 Linker 12 BamHI/GSRA 3767-3778 146 3xFLAG 66 NCBI AVE26326.1 3779-3844 147 Linker 9 GSG linker 3845-3853 148 T2A 54 NCBI YP_009665206.1 3854-3907 149 tGFP 696 NCBI ADD23343.1 3908-4603 150 Cloning site 24 Stop codon/SacI 3767-3790 151 BGHpA 225 NCBI M57764.1 (2326 . . .2550); 3791-4015 152 termination site Cloning site 34 N/A; HindIII/EcoRI/PvuII/RsrII 4016-4049 153 3′ITR 130 AAV2 ITR (AF043303.1) 4050-4179 154
Exemplary Construct Components
[0335] Inverted Terminal Repeat Sequences (ITRs)
[0336] AAV derived sequences of a construct typically comprises the cis-acting 5′ and 3′ ITRs (see, e.g., B. J. Carter, in “Handbook of Parvoviruses”, ed., P. Tijsser, CRC Press, pp. 155 168 (1990), which is incorporated in its entirety herein by reference). Generally, ITRs are able to form a hairpin. The ability to form a hairpin can contribute to an ITRs ability to self-prime, allowing primase-independent synthesis of a second DNA strand. ITRs can also aid in efficient encapsidation of an AAV construct in an AAV particle.
[0337] An rAAV particle (e.g., an AAV2/Anc80 particle) of the present disclosure can comprise a rAAV construct comprising a coding sequence (e.g., a gene encoding a secreted target protein or any characteristic portion thereof, e.g., an NDP gene or any characteristic portion thereof, e.g., an HSPA1A gene or any characteristic portion thereof) and associated elements flanked by a 5′ and a 3′ AAV ITR sequences. In some embodiments, an ITR is or comprises about 145 nucleic acids. In some embodiments, all or substantially all of a sequence encoding an ITR is used. An AAV ITR sequence may be obtained from any known AAV, including presently identified mammalian AAV types. In some embodiments an ITR is an AAV2 ITR.
[0338] An example of a construct molecule employed in the present disclosure is a “cis-acting” construct containing a transgene, in which the selected transgene sequence and associated regulatory elements are flanked by 5′ or “left” and 3′ or “right” AAV ITR sequences. 5′ and left designations refer to a position of an ITR sequence relative to an entire construct, read left to right, in a sense direction. For example, in some embodiments, a 5′ or left ITR is an ITR that is closest to a promoter (as opposed to a polyadenylation sequence) for a given construct, when a construct is depicted in a sense orientation, linearly. Concurrently, 3′ and right designations refer to a position of an ITR sequence relative to an entire construct, read left to right, in a sense direction. For example, in some embodiments, a 3′ or right ITR is an ITR that is closest to a polyadenylation sequence (as opposed to a promoter sequence) for a given construct, when a construct is depicted in a sense orientation, linearly. ITRs as provided herein are depicted in 5′ to 3′ order in accordance with a sense strand. Accordingly, one of skill in the art will appreciate that a 5′ or “left” orientation ITR can also be depicted as a 3′ or “right” ITR when converting from sense to antisense direction. Further, it is well within the ability of one of skill in the art to transform a given sense ITR sequence (e.g., a 5′/left AAV ITR) into an antisense sequence (e.g., 3′/right ITR sequence). One of ordinary skill in the art would understand how to modify a given ITR sequence for use as either a 5′/left or 3′/right ITR, or an antisense version thereof.
[0339] For example, an ITR (e.g., a 5′ ITR) can have a sequence according to SEQ ID NO: 60, 62, 116, 128, or 139. In some embodiments, an ITR (e.g., a 3′ ITR) can have a sequence according to SEQ ID NO: 61, 63, 127, 138, or 152. In some embodiments, an ITR includes one or more modifications, e.g., truncations, deletions, substitutions or insertions, as is known in the art. In some embodiments, an ITR comprises fewer than 145 nucleotides, e.g., 127, 130, 134 or 141 nucleotides. For example, in some embodiments, an ITR comprises 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123,124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143 144, or 145 nucleotides. In some embodiments, an ITR (e.g., a 5′ ITR) can have a sequence according to SEQ ID NO: 62. In some embodiments, an ITR (e.g., a 3′ ITR) can have a sequence according to SEQ ID NO: 63.
[0340] A non-limiting example of a 5′ AAV ITR sequence is SEQ ID NO: 60. A non-limiting example of a 3′ AAV ITR sequence is SEQ ID NO: 61. In some embodiments, rAAV constructs of the present disclosure comprise a 5′ AAV ITR and/or a 3′ AAV ITR. In some embodiments, a 5′ AAV ITR sequence is SEQ ID NO: 62. In some embodiments, a 3′ AAV ITR sequence is SEQ ID NO: 63. In some embodiments, the 5′ and a 3′ AAV ITRs (e.g., SEQ ID NOs: 60 and 61, or 62 and 63) flank a portion of a coding sequence, e.g., all or a portion of an NDP gene or HSPA1A gene (e.g., SEQ ID NO: 57 or 86). The ability to modify these ITR sequences is within the skill of the art. (See, e.g., texts such as Sambrook et al. “Molecular Cloning. A Laboratory Manual”, 2d ed., Cold Spring Harbor Laboratory, New York (1989); and K. Fisher et al., J Virol., 70:520 532 (1996), each of which is incorporated in its entirety herein by reference). In some embodiments, a 5′ ITR sequence is at least 85%, 90%, 95%, 98% or 99% identical to a 5′ ITR sequence represented by SEQ ID NO: 60 or 62. In some embodiments, a 3′ ITR sequence is at least 85%, 90%, 95%, 98% or 99% identical to a 3′ ITR sequence represented by SEQ ID NO: 61 or 63.
TABLE-US-00017 Exemplary 5′ AAV ITR (SEQ ID NO: 60) TTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGG CAAAGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGA GCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCT Exemplary 3′ AAV ITR (SEQ ID NO: 61) AGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCC CGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAA Exemplary 5′ AAV ITR (SEQ ID NO: 62) CTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTT GGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCA ACTCCATCACTAGGGGTTCCT Exemplary 3′ AAV ITR (SEQ ID NO: 63) AGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCC CGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAG
[0341] Promoter
[0342] The term “promoter” means a DNA sequence recognized by enzymes/proteins in a mammalian cell required to initiate the transcription of a specific gene (e.g., a secreted target gene (e.g., a NDP gene (e.g., any of the exemplary NDP genes described herein), a HSPA1A gene (e.g., any of the exemplary HSPA1A genes described herein)). A promoter typically refers to, e.g., a nucleotide sequence to which an RNA polymerase and/or any associated factor binds and at which transcription is initiated. Non-limiting examples of promoters are described herein. Additional examples of promoters are known in the art.
[0343] In some embodiments, a vector encoding an N-terminal portion of secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein) can include a promoter and/or an enhancer. The vector encoding the N-terminal portion of the secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein) can include any of the promoters and/or enhancers described herein or known in the art.
[0344] In some embodiments, the promoter is an inducible promoter, a constitutive promoter, a mammalian cell promoter, a viral promoter, a chimeric promoter, an engineered promoter, a tissue-specific promoter, or any other type of promoter known in the art. In some embodiments, the promoter is a RNA polymerase II promoter, such as a mammalian RNA polymerase II promoter. In some embodiments, the promoter is a RNA polymerase III promoter, including, but not limited to, a H1 promoter, a human U6 promoter, a mouse U6 promoter, or a swine U6 promoter. The promoter will generally be one that is able to promote transcription in an inner hair cell In some examples, the promoter is a cochlea-specific promoter or a cochlea-oriented promoter.
[0345] A variety of promoters are known in the art that can be used herein. Non-limiting examples of promoters that can be used herein include: human EF1a, human cytomegalovirus (CMV) (U.S. Pat. No. 5,168,062), human ubiquitin C (UBC), mouse phosphoglycerate kinase 1, polyoma adenovirus, simian virus 40 (SV40), β-globin, β-actin, α-fetoprotein, γ-globin, β-interferon, γ-glutamyl transferase, mouse mammary tumor virus (MMTV), Rous sarcoma virus, rat insulin, glyceraldehyde-3-phosphate dehydrogenase, metallothionein II (MT II), amylase, cathepsin, MI muscarinic receptor, retroviral LTR (e.g. human T-cell leukemia virus HTLV), AAV ITR, interleukin-2, collagenase, platelet-derived growth factor, adenovirus 5 E2, stromelysin, murine MX gene, glucose regulated proteins (GRP78 and GRP94), α-2-macroglobulin, vimentin, MHC class I gene H-2κ b, HSP70, proliferin, tumor necrosis factor, thyroid stimulating hormone α gene, immunoglobulin light chain, T-cell receptor, HLA DQα and DQβ, interleukin-2 receptor, MHC class IL, MHC class II HLA-DRα, muscle creatine kinase, prealbumin (transthyretin), elastase I, albumin gene, c-fos, c-HA-ras, neural cell adhesion molecule (NCAM), H2B (TH2B) histone, rat growth hormone, human serum amyloid (SAA), troponin I (TN I), duchenne muscular dystrophy, human immunodeficiency virus, and Gibbon Ape Leukemia Virus (GALV) promoters. Additional examples of promoters are known in the art. See, e.g., Lodish, Molecular Cell Biology, Freeman and Company, New York 2007. In some embodiments, the promoter is the CMV immediate early promoter. In some embodiments, the promoter is a CAG promoter or a CAG/CBA promoter. In some embodiments, the promoter is a CBA promoter, e.g., a CBA promoter comprising or consisting of SEQ ID NO: 18.
[0346] The term “constitutive” promoter refers to a nucleotide sequence that, when operably linked with a nucleic acid encoding a protein (e.g., a secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein)), causes RNA to be transcribed from the nucleic acid in a mammalian cell under most or all physiological conditions.
[0347] Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter, the cytomegalovirus (CMV) promoter (see, e.g., Boshart et al, Cell 41:521-530, 1985), the SV40 promoter, the dihydrofolate reductase promoter, the beta-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1-alpha promoter (Invitrogen).
[0348] Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech, and Ariad. Additional examples of inducible promoters are known in the art.
[0349] Examples of inducible promoters regulated by exogenously supplied compounds include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system (WO 98/10088); the ecdysone insect promoter (No et al, Proc. Natl. Acad. Sci. U.S.A. 93:3346-3351, 1996), the tetracycline-repressible system (Gossen et al, Proc. Natl. Acad. Sci. U.S.A. 89:5547-5551, 1992), the tetracycline-inducible system (Gossen et al, Science 268:1766-1769, 1995, see also Harvey et al, Curr. Opin. Chem. Biol. 2:512-518, 1998), the RU486-inducible system (Wang et al, Nat. Biotech. 15:239-243, 1997) and Wang et al, Gene Ther. 4:432-441, 1997), and the rapamycin-inducible system (Magari et al. J. Clin. Invest. 100:2865-2872, 1997), each of which is incorporated by reference in their entireties herein.
[0350] The term “tissue-specific” promoter refers to a promoter that is active only in certain specific cell types and/or tissues (e.g., transcription of a specific gene occurs only within cells expressing transcription regulatory proteins that bind to the tissue-specific promoter).
[0351] In some embodiments, the regulatory sequences impart tissue-specific gene expression capabilities. In some cases, the tissue-specific regulatory sequences bind tissue-specific transcription factors that induce transcription in a tissue-specific manner.
[0352] Exemplary tissue-specific promoters include but are not limited to the following: a liver-specific thyroxin binding globulin (TBG) promoter, an insulin promoter, a glucagon promoter, a somatostatin promoter, a pancreatic polypeptide (PPY) promoter, a synapsin-1 (Syn) promoter, a creatine kinase (MCK) promoter, a mammalian desmin (DES) promoter, an alpha-myosin heavy chain (a-MHC) promoter, and a cardiac Troponin T (cTnT) promoter. Additional exemplary promoters include Beta-actin promoter, hepatitis B virus core promoter (Sandig et al., Gene Ther. 3:1002-1009, 1996), alpha-fetoprotein (AFP) promoter (Arbuthnot et al., Hum. Gene Ther. 7:1503-1514, 1996), bone osteocalcin promoter (Stein et al., Mol. Biol. Rep. 24:185-196, 1997); bone sialoprotein promoter (Chen et al., J. Bone Miner. Res. 11:654-664, 1996), CD2 promoter (Hansal et al., J. Immunol. 161:1063-1068, 1998); immunoglobulin heavy chain promoter; T cell receptor alpha-chain promoter, neuronal such as neuron-specific enolase (NSE) promoter (Andersen et al., Cell. Mol. Neurobiol. 13:503-515, 1993), neurofilament light-chain gene promoter (Piccioli et al., Proc. Natl. Acad. Sci. U.S.A. 88:5611-5615, 1991), and the neuron-specific vgf gene promoter (Piccioli et al., Neuron 15:373-384, 1995), each of which is incorporated by reference in their entireties herein.
[0353] In some embodiments, the tissue-specific promoter is a cochlea-specific promoter. In some embodiments, the tissue-specific promoter is a cochlear hair cell-specific promoter. Non-limiting examples of cochlear hair cell-specific promoters include but are not limited to: a ATOH1 promoter, a POU4F3 promoter, a LHX3 promoter, a MYO7A promoter, a MYO6 promoter, a α9ACHR promoter, and a α10ACHR promoter. In some embodiments, the promoter is an cochlear hair cell-specific promoter such as a PRESTIN promoter or an ONCOMOD promoter. See, e.g., Zheng et al., Nature 405:149-155, 2000; Tian et al. Dev. Dyn. 231:199-203, 2004; and Ryan et al., Adv. Otorhinolaryngol. 66: 99-115, 2009, each of which is incorporated by reference in their entireties herein.
[0354] In some embodiments, a tissue-specific promoter is an ear cell specific promoter. In some embodiments, a tissue-specific promoter is an inner ear cell specific promoter. Non-limiting examples of inner ear non-sensory cell-specific promoters include but are not limited to: GJB2, GJB6, SLC26A4, TECTA, DFNA5, COCH, NDP, SYN1, GFAP, PLP, TAK1, or SOX21. In some embodiments, a cochlear non-sensory cell specific promoter may be an inner ear supporting cell specific promoter. Non-limiting examples of inner ear supporting cell specific promoters include but are not limited to: SOX2, FGFR3, PROX1, GLAST1, LGR5, HES1, HES5, NOTCH1, JAG1, CDKN1A, CDKN1B, SOX10, P75, CD44, HEY2, LFNG, or S100b.
[0355] In some embodiments, provided AAV constructs comprise a promoter sequence selected from a CAG, a CBA, a CMV, or a CB7 promoter. In some embodiments of any of the therapeutic compositions described herein, the first or sole AAV construct further includes at least one promoter sequence selected from Cochlea and/or inner ear specific promoters.
TABLE-US-00018 Exemplary CBA promoter (SEQ ID NO: 64) GTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTT TGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGC GCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCC AATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATA AAAAGCGAAGCGCGCGGCGGGCG Exemplary CBA promoter (SEQ ID NO: 65) GTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTT TGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGCGC CAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAA TCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAA AAGCGAAGCGCGCGGCGGGCG Exemplary CMV/CBA enhancer/promoter (SEQ ID NO: 66) GACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATA TATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC GTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATG ACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGGTC GAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTTTGT ATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGCGCC AGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAAT CAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAAA AGCGAAGCGCGCGGCGGGCG Exemplary CMV/CBA enhancer/promoter (SEQ ID NO: 67) GACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATA TATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC GTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATG ACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGGTC GAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTTTGT ATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGCGCCAG GCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAATCA GAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAAAAG CGAAGCGCGCGGCGGGCG Exemplary CAG enhancer/promoter (SEQ ID NO: 101) GACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATA TATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC GTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATG ACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGGTC GAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTTTGT ATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGCGCC AGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAAT CAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAAA AGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGCCGC CTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGACGGC CCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGCTGC GTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTGCGT GCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGCTGC GGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGGTGC CCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGGGGT GAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAGTTG CTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGTGCC GGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAGGGC TCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCAGCC ATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGCGGA GCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGCGCC GGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCCCTC TCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGGGTT CGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTCTTT TTCCTAGAG Exemplary CAG enhancer/promoter (SEQ ID NO: 102) GAGATTGATTATTGAGTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATA TATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC CGCCCATTGAGGTCAATAATGAGGTATGTTCCCATAGTAACGCCAATAGGGAGTTTCCATTGAG GTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATG ACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGGTC GAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTTTGT ATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGCGCCAG GCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAATCA GAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAAAAG CGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGCCGCCT CGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGACGGCCC TTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGCTGCGT GAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTGCGTGC GTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGCTGCGG GCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGGTGCCC CGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGGGGTGA GCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAGTTGCT GAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGTGCCGG GCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAGGGCTC GGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCAGCCAT TGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGCGGAGC CGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGCGCCGG CAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCCCTCTC CAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGGGTTCG GCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTCTTTTT CCTACAG
[0356] In certain embodiments, a promoter is an endogenous human ATOH1 enhancer-promoter as set forth in SEQ ID NO: 68. In some embodiments, an enhancer-promoter sequence is at least 85%, 90%, 95%, 98% or 99% identical to enhancer-promoter sequence represented by SEQ ID NO: 18.
TABLE-US-00019 Exemplary Human ATOH1 enhancer-promoter (SEQ ID NO: 68) CTATGGAGTTTGCATAACAAACGTTTGGCAGCTCGCTCTCTTACACTCCATTAACAAGCTGTAA CATATAGCTGCAGGTTGCTATAATCTCATTAATATTTTGGAAACTTGAATATTGAGTATTTCTG AGTGCTCATTCCCCATATGCCAGCCACTTCTGCCATGCTGACTGGTTCCTTTCTCTCCATTATT AGCAATTAGCTTCTTACCTTCCAAAGTCAGATCCAAGGTATCCAAGATACTAGCAAAGGAATCA ACTATGTGTGCAAGTTAAGCATGCTTAATATCACCCAAACAAACAAAGAGGCAGCATTTCTTAA AGTAATGAAGATAGATAAATCGGGTTAGTCCTTTGCGACACTGCTGGTGCTTTCTAGAGTTTTA TATATTTTAAGCAGCTTGCTTTATATTCTGTCTTTGCCTCCCACCCCACCAGCACTTTTATTTG TGGAGGGTTTTGGCTCGCCACACTTTGGGAAACTTATTTGATTTCACGGAGAGCTGAAGGAAGA TCATTTTTGGCAACAGACAAGTTTAAACACGATTTCTATGGGACATTGCTAACTGGGGCCCCTA AGGAGAAAGGGGAAACTGAGCGGAGAATGGGTTAAATCCTTGGAAGCAGGGGAGAGGCAGGGGA GGAGAGAAGTCGGAGGAGTATAAAGAAAAGGACAGGAACCAAGAAGCGTGGGGGTGGTTTGCCG TAATGTGAGTGTTTCTTAATTAGAGAACGGTTGACAATAGAGGGTCTGGCAGAGGCTCCTGGCC GCGGTGCGGAGCGTCTGGAGCGGAGCACGCGCTGTCAGCTGGTGAGCGCACTCTCCTTTCAGGC AGCTCCCCGGGGAGCTGTGCGGCCACATTTAACACCATCATCACCCCTCCCCGGCCTCCTCAAC CTCGGCCTCCTCCTCGTCGACAGCCTTCCTTGGCCCCCACCAGCAGAGCTCACAGTAGCGAGCG TCTCTCGCCGTCTCCCGCACTCGGCCGGGGCCTCTCTCCTCCCCCAGCTGCGCAGCGGGAGCCG CCACTGCCCACTGCACCTCCCAGCAACCAGCCCAGCACGCAAAGAAGCTGCGCAAAGTTAAAGC CAAGCAATGCCAAGGGGAGGGGAAGCTGGAGGCGGGCTTTGAGTGGCTTCTGGGCGCCTGGCGG GTCCAGAATCGCCCAGAGCCGCCCGCGGTCGTGCACATCTGACCCGAGTCAGCTTGGGCACCAG CCGAGAGCCGGCTCCGCACCGCTCCCGCACCCCAGCCGCCGGGGTGGTGACACACACCGGAGTC GAATTACAGGCCTGCAATTAACATATGAATCTGAGGAATTTAAAAGAAGGAAAAAAAAAAAAAA ACCTGAGCAGGCTTGGGAGTCCTCTGCACACAAGAACTTTTCTCGGGGTGTAAAAACTCTTTGA TTGGCTGCTCGCACGCGCCTGCCCGCGCCCTCCATTGGCTGAGAAGACACGCGACCGGCGCGAG GAGGGGGTTGGGAGAGGAGCGGGGGGAGACTGAGTGGCGCGTGCCGCTTTTTAAAGGGGCGCAG CGCCTTCAGCAACCGGAGAAGCATAGTTGCACGCGACCTGGTGTGTGATCTCCGAGTGGGTGGG GGAGGGTCGAGGAGGGAAAAAAAAATAAGACGTTGCAGAAGAGACCCGGAAAGGGCCTTTTTTT TGGTTGAGCTGGTGTCCCAGTGCTGCCTCCGATCCTGAGCCTCCGAGCCTTTGCAGTGCAA
[0357] In certain embodiments, a promoter is an endogenous human SLC26A4 immediate promoter as set forth in SEQ ID NO: 103 or 93. In certain embodiments, a promoter is an endogenous human SLC26A4 enhancer-promoter as set forth in SEQ ID NO: 47, 48 or 50. In some embodiments, an enhancer-promoter sequence is at least 85%, 90%, 95%, 98% or 99% identical to a promoter or enhancer-promoter sequence represented by SEQ ID NO: 103, 93, 47, 48, or 50. In certain embodiments, a promoter is a human SLC26A4 endogenous enhancer-promoter sequence comprised within SEQ ID NO: 47, 48, or 50.
TABLE-US-00020 Exemplary Human SLC26A4 immediate promoter (SEQ ID NO: 103) CTGCCTTCTGAGAGCGCTATAAAGGCAGCGGAAGGGTAGTCCGCGGGGCATTCCGGGCGG Exemplary Human SLC26A4 immediate promoter (SEQ ID NO: 93) CTCTAGGCGGGCTCTGCTCTTCTTTAAGGAGTCCCACAGGGCCTGGCCCGCCCCTGACCT Exemplary Human SLC26A4 enhancer-promoter (SEQ ID NO: 47) TAAAGAGTTGTGAGTTGTGTAGGTGAGTTGCCATGGAGCTACAAATATGAGTTGATATTCTGAA ATCCTAGACAGCCATCTCCAAGGTTAAGAAAAATCCTTATGCACTCACTTGCAAAGATATCCAC AGCATGCTCTTAATGGAGAAAAACAAAGCCTTAGATCAAATATGTAAAGTAATTTTTAGTTTTT TGAAAAGGTATGTTTGGGCTATAGATAAATCTGTTCAAAAAACATGAGAGAAGATAATAATGGT TGAAAGGAGACACAGTGCTTGCCCTCAAGAAGTTTTTGTCTAGTGAGGGAGAGAGAACTTGTAT GTAAATAAAATTGTGTTACTAAGGTAGATAGTGAGAAGTAACTTAAGAGAGGATCAGATAAGGT ATTAAGAGAATACAGAAAAGGGTCTGGATTAATTCTGAACAGCATCAAAGAATGTTCTTGCAAG AGATAGTGTTTTCACCAGATCTTGAAGGTATGGATGAGGGTATACAGAGTGAGTATATTCAGAT TCTACTTTAAAACAAATACTTTCCTCTGTTGTAGTGGAGTTGAGCTATACATCCAACAATAATG AAAAAATACACGCATATATACATATATGGAGAGAGATACATATTTTAGTAGATGTAGCAATTGA TTAATAAATGTACAGTTTAAGTCGCATGCAAAACCTTGGAGTGATAGCAAACTTCATTGTAGGA TGTTTAGCAGCATCTCTGGTCTCTACTCACTAGATCCCAATAGCATCTCCCTAGGTGTGACAAC CAAAAATGTCTCCAGGCATTGACCTCTGGAGGCAAAAAAAGCCCTTTATTAAGAACCAGTGGTA TAGATAAGTAAAACATACACAAGAGATTCCTCCCCTCTTCTCTGTATGTGAATAAAAATTGCAA AGTTCATGACCTGGATTTTCCTTTTAGGTTTCTTCTTTAGTGGTTCTTAACTTCATTGGGTGAA GTAAGCCTTTGAAGATCTGTTGAAAGCTGTTGACTCATTCACTTCTCAGGAAAACGCACATGCT GACTACCATTTCAGAGAATTTGCATCAGGGTTCTCTGGGGAGGAGTTCTGAGTTCTGTTTCCAG GAGCTCGTAGAATTGTCATGGTCTGCATATGCAAGGCAGGTGGATTACGGAAGGTTGATGTACA GAGGTCTGTATTTTGGAGCCTCTTCTGTATTTACTTCAGAACACTAACAATCAGGCGAGAATGT TCTGGTTTATCAAACCCTTCCTTCTGCCTTTCATCTTAACCATGCATTAGTTTTAACAAAGTTC ATCCCAACAGAAGACAAAACACTGATGAGGTAGGATAGCTCCAGCTCCTCCTCCCTCTCTTCTA GTCTTGATTTCCATGTAGTCCAGTTTATTCCTTCCCTGATTGTCCAGGAGAATGAGAAAAAGAA AAAACAGAGTCTAGTGGGTAAGAAAGGGCCACCTGGACGGCTTGATTTGGATTGTGAAATAAAA CACACACACATGCACACGTAGAATAAGTGGCTAAAATCTGAGTAAATCGTGAACTCTCTGTATC CTCCACCCATTGAATACTCCTAAAAGACTTTCTAGAAATTCAAGGACTTATTAATATAGAAACC TGGCCATTGTTCCTCTTCTCCTCCCCATGTGGTATGAGAGCACCTGTGGCAGGCTCCCAGAGAC CACGGACCTCTTCCTCTAGGCGGGCTCTGCTCTTCTTTAAGGAGTCCCACAGGGCCTGGCCCGC CCCTGACCTCGCAACCCTTGAGATTAGTAACGGGATGAGTGAGGATCCGGGTGGCCCCTGCGTG GCAGCCAGTAAGAGTCTCAGCCTTCCCGGTTCGGGAAAGGGGAAGAATGCAGGAGGGGTAGGAT TTCTTTCCTGATAGGATCGGTTGGGAAAGACCGCAGCCTGTGTGTGTCTTTCCCTTCGACCAAG GTGTCTGTTGCTCCGTAAATAAAACGTCCCACTGCCTTCTGAGAGCGCTATAAAGGCAGCGGAA GGGTAGTCCGCGGGGC Exemplary Human SLC26A4 enhancer-promoter (SEQ ID NO: 48) GGCTGCTCGGAAAACAGGACGAGGGGAGAGACTTGCTCAATAAGCTGAAAGTTCTGCCCCCGAG AGGGCTGCGACAGCTGCTGGAATGTGCCTGCAGCGTCCGCCTCTTGGGGACCCGCGGAGCGCGC CCTGACGGTTCCACGCCTGGCCCGGGGGTCTGCACCTCTCCTCCAGTGCGCACCTGGAGCTGCG TCCCGGGTCAGGTGCGGGGAGGGAGGGAATCTCAGTGTCCCCTTCCAGCCTTGCAAGCGCCTTT GGCCCCTGCCCCAGCCCCTCGGTTTGGGGGAGATTTCAGAACGCGGACAGCGCCCTGGCTGCGG GCCATAGGGGACTGGGTGGAACTCGGGAAGCCCCCAGAGCAGGGGCTTACTCGCTTCAAGTTTG GGGAACCCCGGGCAGCGGGTGCAGGCCACGAGACCCGAAGGTTCTCAGGTGCCCCCCTGCAGGC TGGCCGTGCGCGCCGTGGGGCGCTTGTCGCGAGCGCCGAGGGCTGCAGGACGCGGACCAGACTC GCGGTGCAGGGGGGCCTGGCTGCAGCTAACAGGTGATCCCGTTCTTTCTGTTCCTCGCTCTTCC CCTCCGATCGTCCTCGCTTACCGCGTGTCCTCCCTCCTCGCTGTCCTCTGGCTCGCAGGTCATG GCAGCGCCAGGCGGCAGGTCGGAGCCGCCGCAGCTCCCCGAGTACAGCTGCAGCTACATGGTGT CGCGGCCGGTCTACAGCGAGCTCGCTTTCCAGCAACAGCACGAGCGGCGCCTGCAGGAGCGCAA GACGCTGCGGGAGAGCCTGGCCAAGTGCTGCAGGTAGCGGCCGCGCGGGCCTGCGTAGAGAGAA GCGGAGCGGGGCGTCCACGCCTTGGGGAGGGAAGGGCGTCCCCAGCGGGCGAGAGTGGGGTGCG GGCGGCGGAGCCCCTGGGCGCCAGCTGCTTCTCCCAGAGGCCCGACTTTCGGTCTCCGGTCCTC CACGCCGCCCTTCTGGTGGGAGGGTGGCTCCATCAGTCTCGGGCCCGAAATGAACTTACCTGGG AAACTCGCCTTTGGGGAGAGTGGGTTCTAGGAGCCCCGTCTCTCTTTTTCCTCTCTGAAGGAAA CTTGGAGTGCCTCTTGGGGTACAGTGGGTCCCTGTTGCCTTCTTGGGAGCTTGTTTAAATGAAA TGAATAGGGAAACCCAGCTCTTGACCAGGAGGAGTCCTTGAAACACTCAAGCTAAGTAGGCGGG CTACCATTCAGTTAGAGACCAGGATGCAAGCTAGAACCCAGGGGAGCGCGGGGTGTGCCAAGTA CTTCATCAGCAGGCTGTGGGACCCCTGGGGAAAGCCACCCTCAGTCTCTAAACCCAAACATGCC GTAACTAGATGTCACAAACATAAAGAAATTAGAGTTTCTAAAACCTTTCATTATAG Exemplary Human SLC26A4 enhancer-promoter (SEQ ID NO: 50) CGGAAGGTTGATGTACAGAGGTCTGTATTTTGGAGCCTCTTCTGTATTTACTTCAGAACACTAA CAATCAGGCGAGAATGTTCTGGTTTATCAAACCCTTCCTTCTGCCTTTCATCTTAACCATGCAT TAGTTTTAACAAAGTTCATCCCAACAGAAGACAAAACACTGATGAGGTAGGATAGCTCCAGCTC CTCCTCCCTCTCTTCTAGTCTTGATTTCCATGTAGTCCAGTTTATTCCTTCCCTGATTGTCCAG GAGAATGAGAAAAAGAAAAAACAGAGTCTAGTGGGTAAGAAAGGGCCACCTGGACGGCTTGATT TGGATTGTGAAATAAAACACACACACATGCACACGTAGAATAAGTGGCTAAAATCTGAGTAAAT CGTGAACTCTCTGTATCCTCCACCCATTGAATACTCCTAAAAGACTTTCTAGAAATTCAAGGAC TTATTAATATAGAAACCTGGCCATTGTTCCTCTTCTCCTCCCCATGTGGTATGAGAGCACCTGT GGCAGGCTCCCAGAGACCACGGACCTCTTCCTCTAGGCGGGCTCTGCTCTTCTTTAAGGAGTCC CACAGGGCCTGGCCCGCCCCTGACCTCGCAACCCTTGAGATTAGTAACGGGATGAGTGAGGATC CGGGTGGCCCCTGCGTGGCAGCCAGTAAGAGTCTCAGCCTTCCCGGTTCGGGAAAGGGGAAGAA TGCAGGAGGGGTAGGATTTCTTTCCTGATAGGATCGGTTGGGAAAGACCGCAGCCTGTGTGTGT CTTTCCCTTCGACCAAGGTGTCTGTTGCTCCGTAAATAAAACGTCCCACTGCCTTCTGAGAGCG CTATAAAGGCAGCGGAAGGGTAGTCCGCGGGGCATTCCGGGCGGGGCGCGAGCAGAGACAGGTG AGTT
[0358] In certain embodiments, a promoter is a human LGR5 enhancer-promoter as set forth in SEQ ID NO: 51. In some embodiments, an enhancer-promoter sequence is at least 85%, 90%, 95%, 98% or 99% identical to enhancer-promoter sequence represented by SEQ ID NO: 51. In some embodiments, a promoter is a human LGR5 endogenous enhancer-promoter sequence comprised within SEQ ID NO: 51.
TABLE-US-00021 Exemplary Human LGR5 enhancer-promoter (SEQ ID NO: 51) AGGGCTATTTGTACCTCAACGAGGGCTTCTCTCCAAGAAAGCCCTGAATC CTTTTCCTCCTTTTTCCTGCAGATTCACTATAGGACACTTTTTGAAGCAA GAGCATGCATTTTCCCCCTGGCGCTCTGCAGCGGTTCTCAGAGCCCAGTG TCACTCACATAGGTGGGACTGCTCTCAGTTCAGAGAGCGCTGGGACACTT AAGATGAAAAGTCCCTGGAAGTTAGCAAACAGCCATCTGTCACTCTGGCA TCGATTTAGTAAAAGTGAGTTCTAGGGTATTCTAAACGAGTTTTAAAAAA CAAATGAGTGAGTTCGAGTTCCTCACCCCGCAAGAGATAGGAAGGCAGCA GTGGAGTGCTCGCTCAGGAGCTGTATTTGTTTAGCGATTAGCCTAGAGCT TTGATTTTAGGGCAAAAGCGAGCCAGACAGTGCGGCAGACGTAAGGATCA AAAAGGCCACCTATCATTCGCCGGGGACGCCTGCCTCCTTACCCTGATAA CGTAACTATTTCTCTGCATAGGATTTTAGTTTTTGTGTTTTTGTTTTGTT TTATTCTGTTTAATCACTTCAAGTATCTCATCCATTATTTGAAGCGGGCT CGGAGGAAACGTGCCGCATCCTCCAGTCCTTGTGCGTCTGTTTAGGTCTC TCCGAAGCAGGTCCCTCTCGACTCTTAGATCTGGGTCTCCAGCACGCATG AAGGGGTAAGGGTGGGGGGGTCCCCTATTCCGGCGCGCGGCGTTGAGCAC TGAATCTTCCAGGCGGAGGCTCAGTGGGAGCGCCGAGAACTCGCCAGTAC CGCGCGCTGCCTGCTGCCTGCTGCCTCCCAGCCCAGGACTTGGGAAAGGA GGGAGGGGACAAGTGGAGGGAAAGTGGGGCCGGGCGGGGGGTGCCTGGGA AGCCAGGCTGCGCTGACGTCACTGGGCGCGCAATTCGGGCTGGAGCGCTT TAAAAAACGAGCGTGCAAGCAGAGATGCTGCTCCACACCGCTCAGGCCGC GAGCAGCAGCAAGGCGCACCGCCACTGTCGCCGCTGCAGCCAGGGCTGCT CCGAAGGCCGGCGTGGCGGCAACCGGCACCTCTGTCCCCGCCGCGCTTCT CCTCGCCGCCCACGCCGTGGGGTCAGGAACGCGGCGTCTGGCGCTGCAGA CGCCCGCTGAGTTGCAGAAGCCCACGGAGCGGCGCCCGGCGCGCCACGGC CCGTAGCAGTCCGGTGCTGCTCTCCGCCCGCGTCCGGCTCGTGGCCCCCT ACTTCGGGCACCGACCGGT
[0359] In certain embodiments, a promoter is a human SYN1 enhancer-promoter as set forth in SEQ ID NO: 52. In some embodiments, an enhancer-promoter sequence is at least 85%, 90%, 95%, 98% or 99% identical to enhancer-promoter sequence represented by SEQ ID NO: 52. In some embodiments, a promoter is a human SYN1 endogenous enhancer-promoter sequence comprised within SEQ ID NO: 52.
TABLE-US-00022 Exemplary Human SYN1 enhancer-promoter (SEQ ID NO: 52) TGCGTATGAGTGCAAGTGGGTTTTAGGACCAGGATGAGGCGGGGTGGGGG TGCCTACCTGACGACCGACCCCGACCCACTGGACAAGCACCCAACCCCCA TTCCCCAAATTGCGCATCCCCTATCAGAGAGGGGGAGGGGAAACAGGATG CGGCGAGGCGCGTGCGCACTGCCAGCTTCAGCACCGCGGACAGTGCCTTC GCCCCCGCCTGGCGGCGCGCGCCACCGCCGCCTCAGCACTGAAGGCGCGC TGACGTCACTCGCCGGTCCCCCGCAAACTCCCCTTCCCGGCCACCTTGGT CGCGTCCGCGCCGCCGCCGGCCCAGCCGGACCGCACCACGCGAGGCGCGA GATAGGGGGGCACGGGCGCGACCATCTGCGCTGCGGCGCCGGCGACTCAG CGCTGCCTCAGTCTGCGGTGGGCAGCGGAGGAGTCGTGTCGTGCCTGAGA GCGCAGTCGAGAA
[0360] In certain embodiments, a promoter is a human GFAP enhancer-promoter as set forth in SEQ ID NO: 53. In some embodiments, an enhancer-promoter sequence is at least 85%, 90%, 95%, 98% or 99% identical to enhancer-promoter sequence represented by SEQ ID NO: 53. In some embodiments, a promoter is a human GFAP endogenous enhancer-promoter sequence comprised within SEQ ID NO: 53.
TABLE-US-00023 Exemplary Human GFAP enhancer-promoter (SEQ ID NO: 53) CCCACCTCCCTCTCTGTGCTGGGACTCACAGAGGGAGACCTCAGGAGGCA GTCTGTCCATCACATGTCCAAATGCAGAGCATACCCTGGGCTGGGCGCAG TGGCGCACAACTGTAATTCCAGCACTTTGGGAGGCTGATGTGGAAGGATC ACTTGAGCCCAGAAGTTCTAGACCAGCCTGGGCAACATGGCAAGACCCTA TCTCTACAAAAAAAGTTAAAAAATCAGCCACGTGTGGTGAGACACACCTG TAGTCCCAGCTATTCAGGAGGCTGAGGTGAGGGGATCACTTAAGGCTGGG AGGTTGAGGCTGCAGTGAGTCGTGGTTGCGCCACTGCACTCCAGCCTGGG CAACAGTGAGACCCTGTCTCAAAAGACAAAAAAAAAAAAAAAAAAAAAAA GAACATATCCTGGTGTGGAGTAGGGGACGCTGCTCTGACAGAGGCTCGGG GGCCTGAGCTGGCTCTGTGAGCTGGGGAGGAGGCAGACAGCCAGGCCTTG TCTGCAAGCAGACCTGGCAGCATTGGGCTGGCCGCCCCCCAGGGCCTCCT CTTCATGCCCAGTGAATGACTCACCTTGGCACAGACACAATGTTCGGGGT GGGCACAGTGCCTGCTTCCCGCCGCACCCCAGCCCCCCTCAAATGCCTTC CGAGAAGCCCATTGAGCAGGGGGCTTGCATTGCACCCCAGCCTGACAGCC TGGCATCTTGGGATAAAAGCAGCACAGCCCCCTAGGGGCTGCCCTTGCTG TGTGGCGCCACCGGCGGTGGAGAACAAGGCTCTATTCAGCCTGTGCCCAG GAAAGGGGATCAGGGGATGCCCAGGCATGGACAGTGGGTGGCAGGGGGGG AGAGGAGGGCTGTCTGCTTCCCAGAAGTCCAAGGACACAAATGGGTGAGG GGACTGGGCAGGGTTCTGACCCTGTGGGACCAGAGTGGAGGGCGTAGATG GACCTGAAGTCTCCAGGGACAACAGGGCCCAGGTCTCAGGCTCCTAGTTG GGCCCAGTGGCTCCAGCGTTTCCAAACCCATCCATCCCCAGAGGTTCTTC CCATCTCTCCAGGCTGATGTGTGGGAACTCGAGGAAATAAATCTCCAGTG GGAGACGGAGGGGTGGCCAGGGAAACGGGGCGCTGCAGGAATAAAGACGA GCCAGCACAGCCAGCTCATGTGTAACGGCTTTGTGGAGCTGTCAAGGCCT GGTCTCTGGGAGAGAGGCACAGGGAGGCCAGACAAGGAAGGGGTGACCTG GAGGGACAGATCCAGGGGCTAAAGTCCTGATAAGGCAAGAGAGTGCCGGC CCCCTCTTGCCCTATCAGGACCTCCACTGCCACATAGAGGCCATGATTGA CCCTTAGACAAAGGGCTGGTGTCCAATCCCAGCCCCCAGCCCCAGAACTC CAGGGAATGAATGGGCAGAGAGCAGGAATGTGGGACATCTGTGTTCAAGG GAAGGACTCCAGGAGTCTGCTGGGAATGAGGCCTAGTAGGAAATGAGGTG GCCCTTGAGGGTACAGAACAGGTTCATTCTTCGCCAAATTCCCAGCACCT TGCAGGCACTTACAGCTGAGTGAGATAATGCCTGGGTTATGAAATCAAAA AGTTGGAAAGCAGGTCAGAGGTCATCTGGTACAGCCCTTCCTTCCCTTTT TTTTTTTTTTTTTTGTGAGACAAGGTCTCTCTCTGTTGCCCAGGCTGGAG TGGCGCAAACACAGCTCACTGCAGCCTCAACCTACTGGGCTCAAGCAATC CTCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAAGCATGAGCCACCCC ACTCAGCCCTTTCCTTCCTTTTTAATTGATGCATAATAATTGTAAGTATT CATCATGGTCCAACCAACCCTTTCTTGACCCACCTTCCTAGAGAGAGGGT CCTCTTGCTTCAGCGGTCAGGGCCCCAGACCCATGGTCTGGCTCCAGGTA CCACCTGCCTCATGCAGGAGTTGGCGTGCCCAGGAAGCTCTGCCTCTGGG CACAGTGACCTCAGTGGGGTGAGGGGAGCTCTCCCCATAGCTGGGCTGCG GCCCAACCCCACCCCCTCAGGCTATGCCAGGGGGTGTTGCCAGGGGCACC CGGGCATCGCCAGTCTAGCCCACTCCTTCATAAAGCCCTCGCATCCCAGG AGCGAGCAGAGCCAGAGCAGGTTGGAGAGGAGACGCATCACCTCCGCTGC TCGC Exemplary NDP_promoter_NG_009832.1 (SEQ ID NO: 99) AGCTGCAGGTATTTAACCCACACACATAGTTACCTTTGTCACTTTTCTAC TAGTTGTTGTGTAATGGGTTAGCTTTATCTATTAGTTTCTCCTTCATATA GCTCAAGGAGTCAAGGAATACACGTTCTCATTGTTTCTAAATCAGACCTA AAGTTTTATTCTAAATGATGCAGATGAAGGGGCTTATTAAAGTGCCATTG TGAATTTAATCATGTATATGCTGCTAAAAATCATTTAATGGATGAACAAC CCGGAAAAAAAGAACTCTCACCACCCACACACATCGTGATAAAATAGGGC AGCGTTTTGCACTTGCTTTAACAGCATCGCCTGAGAAAAAAATTTCTGGT TCCCATCTTTCCCCTCTCTTACTTAAAATTTCAACTTCATCACAGTCAGC TGCCGAATCGTTCAACAGAATGCCACACTGCCTTTGTATTTCCAAAACTA TACGTCTCTATTGCGGACGGCACATCTTTATGGCAGCCACATGCTTGAAA AAGAATTAAATTCAGAATATTCATTTGGCCTCTTATTAGTTCCATAATAC CATTAAAAAAGAAAGAAAGAAAGAAACTTCCTCGCCCTTGTTCTGCTACG CTGTTCCCATCGTAAGATGCTCCGTGGAAGGGAGCCGAGCGGTGGGCAGA GGCTGAGTCCCCGATAACGAGCGCCTCACATTTCCGTGGCATTCCCATTT GCTAGTGCGCTGCTGCGGCCGCACGCCTGATTGATATATGACTGCAATGG CACTTTTCCATTTGACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCC TCTCTCTCTCCCTCTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTA ACTTTTGTGTGTTGCAAATATAAAAGGCAAGCCATGTGACAGAGGGACAG AAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAA AGTCTGAGAAGAAAGGAGCCCTGCGTTCCCCTAAGGTAAGCAGGAAAACA AGCG
[0361] Enhancers
[0362] In some instances, a vector can include an enhancer sequence. The term “enhancer” refers to a nucleotide sequence that can increase the level of transcription of a nucleic acid encoding a protein of interest (e.g., a secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein))). Enhancer sequences (50-1500 basepairs in length) generally increase the level of transcription by providing additional binding sites for transcription-associated proteins (e.g., transcription factors). In some embodiments, an enhancer sequence is found within an intronic sequence. Unlike promoter sequences, enhancer sequences can act at much larger distance away from the transcription start site (e.g., as compared to a promoter). Non-limiting examples of enhancers include a RSV enhancer, a CMV enhancer, and a SV40 enhancer. In some embodiments, the CMV enhancer sequence comprises or consists of SEQ ID NO: 17. In some embodiments, a construct comprises a CMV enhancer exemplified by SEQ ID NO: 69. In some embodiments, an enhancer sequence is at least 85%, 90%, 95%, 98% or 99% identical to the enhancer sequence represented by SEQ ID NO: 69. In some embodiments, an SV-40 derived enhancer is the SV-40 T intron sequence, which is exemplified by SEQ ID NO: 70. In some embodiments, a an enhancer sequence is at least 85%, 90%, 95%, 98% or 99% identical to the enhancer sequence represented by SEQ ID NO: 70.
TABLE-US-00024 Exemplary CMV enhancer (SEQ ID NO: 69) GACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTA GTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGG CCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGA CGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGG GTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCA TATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCT GGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTAC ATCTACGTATTAGTCATCGCTATTACCATGG Exemplary SV-40 synthetic intron (SEQ ID NO: 70) GGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGCCGCCTCGCG CCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGC GGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGG CTCGTTTCTTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGG CCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTGCGTGCGTGTGTGTGTGC GTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGCTGC GGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGC TGCGTGCGGGGTGTGTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGC GGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAGTTGCTGAGCAC GGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGC CGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGC CGCCTCGGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGC GCCGGCGGCTGTCGAGGCGCGGCGAGCCGCAGCCATTGCCTTTTATGGTA ATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGCGGAGC CGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAG CGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTC GCCGCGCCGCCGTCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGG GGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGGGTTCGGCTTCTGG CGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTT CTTTTTCCTACAG
[0363] Splice Sites
[0364] In some embodiments, any of the constructs provided herein can include splice donor and/or splice acceptor sequences, which are functional during RNA processing occurring during transcription. In some embodiments, splice sites are involved in trans-splicing.
TABLE-US-00025 Exemplary splice donor intron (SEQ ID NO: 155) GTAAGTATCAAGGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGG CTTGTCGAGACAGAGAAGACTCTTGCGTTTCT Exemplary splice acceptor intron (SEQ ID NO: 104) GATAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACA G
[0365] Polyadenylation Sequences
[0366] In some embodiments, any of the vectors provided herein can include a polyadenylation (poly(A)) signal sequence. Most nascent eukaryotic mRNAs possess a poly(A) tail at their 3′ end which is added during a complex process that includes cleavage of the primary transcript and a coupled polyadenylation reaction driven by the poly(A) signal sequence (see, e.g., Proudfoot et al., Cell 108:501-512, 2002). The poly(A) tail confers mRNA stability and transferability (Molecular Biology of the Cell, Third Edition by B. Alberts et al., Garland Publishing, 1994). In some embodiments, the poly(A) signal sequence is positioned 3′ to the nucleic acid sequence encoding the C-terminus of the secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein)).
[0367] As used herein, “polyadenylation” refers to the covalent linkage of a polyadenylyl moiety, or its modified variant, to a messenger RNA molecule. In eukaryotic organisms, most messenger RNA (mRNA) molecules are polyadenylated at the 3′ end. The 3′ poly(A) tail is a long sequence of adenine nucleotides (e.g., 50, 60, 70, 100, 200, 500, 1000, 2000, 3000, 4000, or 5000 (SEQ ID NO: 100)) added to the pre-mRNA through the action of an enzyme, polyadenylate polymerase. In higher eukaryotes, the poly(A) tail is added onto transcripts that contain a specific sequence, the polyadenylation (or poly(A)) signal. The poly(A) tail and the protein bound to it aid in protecting mRNA from degradation by exonucleases. Polyadenylation is also important for transcription termination, export of the mRNA from the nucleus, and translation. Polyadenylation occurs in the nucleus immediately after transcription of DNA into RNA, but also can occur later in the cytoplasm. After transcription has been terminated, the mRNA chain is cleaved through the action of an endonuclease complex associated with RNA polymerase. The cleavage site is usually characterized by the presence of the base sequence AAUAAA near the cleavage site. After the mRNA has been cleaved, adenosine residues are added to the free 3′ end at the cleavage site.
[0368] As used herein, a “poly(A) signal sequence” or “polyadenylation signal sequence” is a sequence that triggers the endonuclease cleavage of an mRNA and the addition of a series of adenosines to the 3′ end of the cleaved mRNA.
[0369] There are several poly(A) signal sequences that can be used, including those derived from bovine growth hormone (bgh) (Woychik et al., Proc. Natl. Acad. Sci. U.S.A. 81(13):3944-3948, 1984; U.S. Pat. No. 5,122,458, each of which is incorporated herein by reference in its entirety), mouse-β-globin, mouse-α-globin (Orkin et al., EMBO J. 4(2):453-456, 1985; Thein et al., Blood 71(2):313-319, 1988, each of which is incorporated herein by reference in its entirety), human collagen, polyoma virus (Batt et al., Mol. Cell Biol. 15(9):4783-4790, 1995, which is incorporated herein by reference in its entirety), the Herpes simplex virus thymidine kinase gene (HSV TK), IgG heavy-chain gene polyadenylation signal (US 2006/0040354, which is incorporated herein by reference in its entirety), human growth hormone (hGH) (Szymanski et al., Mol. Therapy 15(7):1340-1347, 2007, which is incorporated herein by reference in its entirety), the group consisting of SV40 poly(A) site, such as the SV40 late and early poly(A) site (Schek et al., Mol. Cell Biol. 12(12):5386-5393, 1992, which is incorporated herein by reference in its entirety).
[0370] The poly(A) signal sequence can be AATAAA. The AATAAA sequence may be substituted with other hexanucleotide sequences with homology to AATAAA and that are capable of signaling polyadenylation, including ATTAAA, AGTAAA, CATAAA, TATAAA, GATAAA, ACTAAA, AATATA, AAGAAA, AATAAT, AAAAAA, AATGAA, AATCAA, AACAAA, AATCAA, AATAAC, AATAGA, AATTAA, or AATAAG (see, e.g., WO 06/12414, which is incorporated herein by reference in its entirety).
[0371] In some embodiments, the poly(A) signal sequence can be a synthetic polyadenylation site (see, e.g., the pCl-neo expression vector of Promega that is based on Levitt el al, Genes Dev. 3(7):1019-1025, 1989, which is incorporated herein by reference in its entirety). In some embodiments, the poly(A) signal sequence is the polyadenylation signal of bovine growth hormone (SEQ ID NO: 23). In some embodiments, the poly(A) signal sequence is the polyadenylation signal of soluble neuropilin-1 (sNRP) (AAATAAAATACGAAATG (SEQ ID NO: 46)) (see, e.g., WO 05/073384, which is incorporated herein by reference in its entirety). In some embodiments, a poly(A) signal sequence comprises or consists of the SV40 poly(A) site. In some embodiments, a poly(A) signal comprises or consists of SEQ ID NO: 72. In some embodiments, a poly(A) signal sequence comprises or consists of bGHpA. In some embodiments, a poly(A) signal comprises or consists of SEQ ID NO: 71. Additional examples of poly(A) signal sequences are known in the art. In some embodiments, a poly(A) sequence is at least 85%, 90%, 95%, 98% or 99% identical to the poly(A) sequence represented by SEQ ID NO: 71 or 72.
TABLE-US-00026 Exemplary bGH poly(A) signal sequence (SEQ ID NO: 71) CTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCT TCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGA GGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTG GGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGCAT GCTGGGGATGCGGTGGGCTCTATGG Exemplary SV40 poly(A) signal sequence (SEQ ID NO: 72) AACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCAC AAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGT CCAAACTCATCAATGTATCTTA
Internal Ribosome Entry Sites (IRES)
[0372] In some embodiments, a vector encoding the C-terminal portion of the secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein)) can include a polynucleotide internal ribosome entry site (IRES). An IRES sequence is used to produce more than one polypeptide from a single gene transcript. An IRES forms a complex secondary structure that allows translation initiation to occur from any position with an mRNA immediately downstream from where the IRES is located (see, e.g., Pelletier and Sonenberg, Mol. Cell. Biol. 8(3):1103-1112, 1988, which is incorporated herein in its entirety by reference).
[0373] There are several IRES sequences known to those in skilled in the art, including those from, e.g., foot and mouth disease virus (FMDV), encephalomyocarditis virus (EMCV), human rhinovirus (HRV), cricket paralysis virus, human immunodeficiency virus (HIV), hepatitis A virus (HAV), hepatitis C virus (HCV), and poliovirus (PV). See e.g., Alberts, Molecular Biology of the Cell, Garland Science, 2002; and Hellen et al., Genes Dev. 15(13):1593-612, 2001, each of which is incorporated herein in their entireties by reference.
[0374] In some embodiments, the IRES sequence that is incorporated into the vector that encodes the C-terminal portion of a secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein)) is the foot and mouth disease virus (FMDV) 2A sequence. The Foot and Mouth Disease Virus 2A sequence is a small peptide (approximately 18 amino acids in length) that has been shown to mediate the cleavage of polyproteins (Ryan, M D et al., EMBO 4:928-933, 1994; Mattion et al., J. Virology 70:8124-8127, 1996; Furler et al., Gene Therapy 8:864-873, 2001; and Halpin et al., Plant Journal 4:453-459, 1999, each of which is incorporated in its entirety herein by reference). The cleavage activity of the 2A sequence has previously been demonstrated in artificial systems including plasmids and gene therapy vectors (AAV and retroviruses) (Ryan et al., EMBO 4:928-933, 1994; Mattion et al., J. Virology 70:8124-8127, 1996; Furler et al., Gene Therapy 8:864-873, 2001; and Halpin et al., Plant Journal 4:453-459, 1999; de Felipe et al., Gene Therapy 6:198-208, 1999; de Felipe et al., Human Gene Therapy 11:1921-1931, 2000; and Klump et al., Gene Therapy 8:811-817, 2001, each of which is incorporated in its entirety herein by reference).
[0375] An IRES can be utilized in an AAV construct. In some embodiments, a construct encoding the C-terminal portion of the secreted target protein (e.g., a NDP protein (e.g., any of the exemplary NDP proteins described herein), a HSPA1A protein (e.g., any of the exemplary HSPA1A proteins described herein)) can include a polynucleotide internal ribosome entry site (IRES). In some embodiments, an IRES can be part of a composition comprising more than one construct. In some embodiments, an IRES is used to produce more than one polypeptide from a single gene transcript.
[0376] Reporter Sequences or Elements
[0377] In some embodiments, constructs provided herein can optionally include a sequence encoding a reporter polypeptide and/or protein (“a reporter sequence”). Non-limiting examples of reporter sequences include DNA sequences encoding: a beta-lactamase, a beta-galactosidase (LacZ), an alkaline phosphatase, a thymidine kinase, a green fluorescent protein (GFP), a red fluorescent protein, an mCherry fluorescent protein, a yellow fluorescent protein, a chloramphenicol acetyltransferase (CAT), and a luciferase. Additional examples of reporter sequences are known in the art. When associated with regulatory elements which drive their expression, the reporter sequence can provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence, or other spectrographic assays; fluorescent activating cell sorting (FACS) assays; immunological assays (e.g., enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and immunohistochemistry).
[0378] In some embodiments, the reporter sequence is tGFP (SEQ ID NO: 27). In some embodiments, the reporter sequence is eGFP (SEQ ID NO: 28 or SEQ ID NO: 29). In some embodiments, the reporter sequence is the LacZ gene, and the presence of a vector carrying the LacZ gene in a mammalian cell (e.g., a cochlear hair cell, an ocular cell, such as a retinal cell) is detected by assays for beta-galactosidase activity. When the reporter is a fluorescent protein (e.g., green fluorescent protein) or luciferase, the presence of a vector carrying the fluorescent protein or luciferase in a mammalian cell (e.g., a cochlear hair cell, an ocular cell, such as a retinal cell) may be measured by fluorescent techniques (e.g., fluorescent microscopy or FACS) or light production in a luminometer (e.g., a spectrophotometer or an IVIS imaging instrument). In some embodiments, the reporter sequence can be used to verify the tissue-specific targeting capabilities and tissue-specific promoter regulatory activity of any of the vectors described herein.
[0379] In some embodiments, a reporter sequence is the LacZ gene, and the presence of a construct carrying the LacZ gene in a mammalian cell (e.g., a cochlear hair cell) is detected by assays for beta-galactosidase activity. When the reporter is a fluorescent protein (e.g., green fluorescent protein) or luciferase, the presence of a construct carrying the fluorescent protein or luciferase in a mammalian cell (e.g., a cochlear hair cell) may be measured by fluorescent techniques (e.g., fluorescent microscopy or FACS) or light production in a luminometer (e.g., a spectrophotometer or an IVIS imaging instrument). In some embodiments, a reporter sequence can be used to verify the tissue-specific targeting capabilities and tissue-specific promoter regulatory and/or control activity of any of the constructs described herein.
[0380] In some embodiments, a reporter sequence is a FLAG tag (e.g., a 3×FLAG tag), and the presence of a construct carrying the FLAG tag in a mammalian cell (e.g., an inner ear cell, e.g., a cochlear hair or supporting cell) is detected by protein binding or detection assays (e.g., Western blots, immunohistochemistry, radioimmunoassay (RIA), mass spectrometry). An exemplary 3×FLAG tag sequence is provided as SEQ ID NO: 105.
TABLE-US-00027 Exemplary 3xFLAG tag sequence (SEQ ID NO: 105) GGATCCCGGGCTGACTACAAAGACCATGACGGTGATTATAAAGATCATGA CATCGACTACAAGGATGACGATGACAAG
[0381] Flanking Untranslated Regions, 5′ UTRs and 3′ UTRs
[0382] In some embodiments, any of the vectors described herein (e.g., any of the at least two different vectors) can include an untranslated region, such as a 5′UTR or a 3′ UTR.
[0383] Untranslated regions (UTRs) of a gene are transcribed but not translated. A 5′ UTR starts at the transcription start site and continues to the start codon but does not include the start codon. A 3′ UTR starts immediately following the stop codon and continues until the transcriptional termination signal. There is a growing body of evidence about the regulatory roles played by the UTRs in terms of stability of the nucleic acid molecule and translation. The regulatory and/or control features of a UTR can be incorporated into any of the vectors, constructs, compositions, kits, or methods as described herein to otherwise modulate the expression of a secreted target protein (e.g., a NDP protein, a HSPA1A protein).
[0384] Natural 5′ UTRs include a sequence that plays a role in translation initiation. They harbor signatures like Kozak sequences, which are commonly known to be involved in the process by which the ribosome initiates translation of many genes. Kozak sequences have the consensus sequence CCR(A/G)CCAUGG, where R is a purine (A or G) three bases upstream of the start codon (AUG), and the start codon is followed by another “G”. The 5′UTRs have also been known to form secondary structures that are involved in elongation factor binding.
[0385] In some embodiments, a 5′UTR is included in any of the vectors described herein. Non-limiting examples of 5′UTRs, including those from the following genes: albumin, serum amyloid A, Apolipoprotein A/B/E, transferrin, alpha fetoprotein, erythropoietin, and Factor VIII, can be used to enhance expression of a nucleic acid molecule, such as an mRNA.
[0386] In some embodiments, a 5′UTR from an mRNA that is transcribed by a cell in the cochlea or retina can be included in any of the vectors, compositions, kits, and methods described herein. In some embodiments, a 5′ UTR is derived from the endogenous SLC26A4 gene loci and may include all or part of the endogenous sequence exemplified by SEQ ID NO: 21. In some embodiments, a 5′ UTR sequence is at least 85%, 90%, 95%, 98% or 99% identical to the 5′ UTR sequence represented by SEQ ID NO: 21.
[0387] 3′ UTRs are found immediately 3′ to the stop codon of the gene of interest. In some embodiments, a 3′ UTR from an mRNA that is transcribed by a cell in the cochlea can be included in any of the constructs, compositions, kits, and methods described herein. In some embodiments, a 3′ UTR is derived from the endogenous SLC26A4 gene loci and may include all or part of the endogenous sequence exemplified by SEQ ID NO: 22. In some embodiments, a 3′ UTR sequence is at least 85%, 90%, 95%, 98% or 99% identical to the 3′ UTR sequence represented by SEQ ID NO: 22.
[0388] 3′UTRs are known to have stretches of adenosines and uridines (in the RNA form) or thymidines (in the DNA form) embedded in them. These AU-rich signatures are particularly prevalent in genes with high rates of turnover. Based on their sequence features and functional properties, the AU-rich elements (AREs) can be separated into three classes (Chen et al., Mol. Cell. Biol. 15:5777-5788, 1995; Chen et al., Mol. Cell. Biol. 15:2010-2018, 1995, each of which is incorporated herein by reference in its entirety): Class I AREs contain several dispersed copies of an AUUUA motif within U-rich regions. For example, c-Myc and MyoD mRNAs contain class I AREs. Class II AREs possess two or more overlapping UUAUUUA(U/A) (U/A) nonamers. GM-CSF and TNF-alpha mRNAs are examples that contain class II AREs. Class III AREs are less well defined. These U-rich regions do not contain an AUUUA motif. Two well-studied examples of this class are c-Jun and myogenin mRNAs.
[0389] Most proteins that bind to the AREs are known to destabilize the messenger, whereas members of the ELAV family, most notably HuR, have been documented to increase the stability of mRNA. HuR binds to AREs of all the three classes. Engineering the HuR specific binding sites into the 3′UTR of nucleic acid molecules will lead to HuR binding and thus, stabilization of the message in vivo.
[0390] An exemplary human wildtype 5′UTR is or includes the sequence of SEQ ID NO: 14 or 30. An exemplary human wildtype 3′UTR is or includes the sequence of SEQ ID NO: 15 or 22.
[0391] In some embodiments of any of the compositions described herein, a 5′ untranslated region (UTR), a 3′UTR, or both are included in a vector (e.g., any of the vectors described herein). For example, any of the 5′UTRs described herein can be operatively linked to the start codon in any of the coding sequences described herein. For example, any of the 3′UTRs can be operably linked to the 3′-terminal codon (last codon) in any of the coding sequences described herein.
[0392] In some embodiments of any of the compositions described herein, the 5′UTR includes at least 10 contiguous (e.g., at least 15 contiguous, at least 20 contiguous, at least 25 contiguous, at least 30 contiguous, at least 35 contiguous, at least 40 contiguous, at least 45 contiguous, at least 50 contiguous, at least 55 contiguous, at least 60 contiguous, at least 65 contiguous, at least 70 contiguous, at least 75 contiguous, at least 80 contiguous, at least 85 contiguous, at least 90 contiguous, at least 95 contiguous, at least 100 contiguous, at least 105 contiguous, at least 110 contiguous, at least 115 contiguous, at least 120 contiguous, at least 125 contiguous, at least 130 contiguous, at least 135 contiguous, at least 140 contiguous, at least 145 contiguous, at least 150 contiguous, at least 155 contiguous, at least 160 contiguous, at least 165 contiguous, at least 170 contiguous, at least 175 contiguous, at least 180 contiguous, at least 185 contiguous, at least 190 contiguous, at least 195 contiguous, at least 200 contiguous, at least 205 contiguous, at least 210 contiguous, at least 215 contiguous, at least 220 contiguous, at least 225 contiguous, at least 230 contiguous, at least 235 contiguous, at least 240 contiguous, at least 245 contiguous, at least 250 contiguous, at least 255 contiguous, at least 260 contiguous, at 265 contiguous, at least 270 contiguous, at least 275 contiguous, at least 280 contiguous, at least 285 contiguous, at least 290 contiguous, at least 295 contiguous, at least 300 contiguous, at least 305 contiguous, at least 310 contiguous, at least 315 contiguous, at least 320 contiguous, at least 325 contiguous, at least 330 contiguous, at least 335 contiguous, at least 340 contiguous, at least 345 contiguous, at least 350 contiguous, at least 355 contiguous, at least 360 contiguous, at least 365 contiguous, at least 370 contiguous, at least 375 contiguous, at least 380 contiguous, at least 385 contiguous, at least 390 contiguous, at least 395 contiguous, at least 400 contiguous, at least 405 contiguous, at least 410 contiguous, at least 415 contiguous, at least 420 contiguous, at least 425 contiguous, at least 430 contiguous, at least 435 contiguous, at least 440 contiguous, at least 445 contiguous, at least 450 contiguous, at least 455 contiguous, at least 460 contiguous, at least 465 contiguous, at least 470 contiguous, at least 475 contiguous, at least 480 contiguous, at least 485 contiguous, at least 490 contiguous, at least 495 contiguous, at least 500 contiguous, at least 505 contiguous, at least 510 contiguous, at least 515 contiguous, at least 520 contiguous, at least 525 contiguous, at least 530 contiguous, at least 535 contiguous, at least 540 contiguous, at least 545 contiguous, or at least 550 contiguous) nucleotides from anywhere within SEQ ID NO: 14 or SEQ ID NO: 31.
[0393] For example, a 5′UTR can include or consist of one or more of: nucleotide positions 1 to 550, nucleotide positions 1 to 500, nucleotide positions 1 to 450, nucleotide positions 1 to 400, nucleotide positions 1 to 350, nucleotide positions 1 to 300, nucleotide positions 1 to 250, nucleotide positions 1 to 200, nucleotide positions 1 to 150, nucleotide positions 1 to 100, nucleotide positions 1 to 50, nucleotide positions 50 to 550, nucleotide positions 50 to 500, nucleotide positions 50 to 450, nucleotide positions 50 to 400, nucleotide positions 50 to 350, nucleotide positions 50 to 300, nucleotide positions 50 to 250, nucleotide positions 50 to 200, nucleotide positions 50 to 150, nucleotide positions 50 to 100, nucleotide positions 100 to 550, nucleotide positions 100 to 500, nucleotide positions 100 to 450, nucleotide positions 100 to 400, nucleotide positions 100 to 350, nucleotide positions 100 to 300, nucleotide positions 100 to 250, nucleotide positions 100 to 200, nucleotide positions 100 to 150, nucleotide positions 150 to 550, nucleotide positions 150 to 500, nucleotide positions 150 to 450, nucleotide positions 150 to 400, nucleotide positions 150 to 350, nucleotide positions 150 to 300, nucleotide positions 150 to 250, nucleotide positions 150 to 200, nucleotide positions 200 to 550, nucleotide positions 200 to 500, nucleotide positions 200 to 450, nucleotide positions 200 to 400, nucleotide positions 200 to 350, nucleotide positions 200 to 300, nucleotide positions 200 to 250, nucleotide positions 250 to 550, nucleotide positions 250 to 500, nucleotide positions 250 to 450, nucleotide positions 250 to 400, nucleotide positions 250 to 350, nucleotide positions 250 to 300, nucleotide positions 300 to 550, nucleotide positions 300 to 500, nucleotide positions 300 to 450, nucleotide positions 300 to 400, nucleotide positions 300 to 350, nucleotide positions 350 to 550, nucleotide positions 350 to 500, nucleotide positions 350 to 450, nucleotide positions 350 to 400, nucleotide positions 400 to 550, nucleotide positions 400 to 500, nucleotide positions 400 to 450, nucleotide positions 450 to 500, or nucleotide positions 500 to 550, of SEQ ID NO: 14 or SEQ ID NO: 31.
[0394] In some embodiments of any of the compositions described herein, the 5′UTR includes a sequence that is at least 70% (e.g., at least 72%, at least 74%, at least 75%, at least 76%, at least 78%, at least 80%, at least 82%, at least 84%, at least 85%, at least 86%, at least 88%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to SEQ ID NO: 14 or SEQ ID NO: 31.
TABLE-US-00028 Exemplary 5′ UTR Sequence (SEQ ID NO: 14) AAGATGCTCCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCG ATAACGAGCGCCTCACATTTCCGTGGCATTCCCATTTGCTAGTGCGCTGC TGCGGCCGCACGCCTGATTGATATATGACTGCAATGGCACTTTTCCATTT GACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCTCTCTCCCT CTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTT GCAAATATAAAAGGCAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCA TTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAAAGTCTGAGAAGAA AGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATAGTGTGATGATGGAT TGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGACA ACCAGAAAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATC CTAGGAGGTGAAGCCATTTCCAATTTTTTGTCCTCTGCCTCCCTCTGCTG TTCTTCTAGAGAAGTTTTTCCTTACAACA Exemplary 5′ UTR Sequence (SEQ ID NO: 31) AAGATGCTCCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCG ATAACGAGCGCCTCACATTTCCGTGGCATTCCCATTTGCTAGTGCGCTGC TGCGGCCGCACGCCTGATTGATATATGACTGCAATGGCACTTTTCCATTT GACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCTCTCTCCCT CTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTT GCAAATATAAAAGGCAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCA TTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAAAGTCTGAGAAGAA AGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATAGTGTGATGATGGAT TGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGACA ACCAGAAAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATC CTAGGAGGTGAAGCCATTTCCAATTTTTTGTCCTCTGCCTCCCTCTGCTG TTCTTCTAGAGAAGTTTTTCCTTACAACA
[0395] In some embodiments of any of the compositions described herein, the 3′UTR includes at least 10 contiguous (e.g., at least 15 contiguous, at least 20 contiguous, at least 25 contiguous, at least 30 contiguous, at least 35 contiguous, at least 40 contiguous, at least 45 contiguous, at least 50 contiguous, at least 55 contiguous, at least 60 contiguous, at least 65 contiguous, at least 70 contiguous, at least 75 contiguous, at least 80 contiguous, at least 85 contiguous, at least 90 contiguous, at least 95 contiguous, at least 100 contiguous, at least 105 contiguous, at least 110 contiguous, at least 115 contiguous, at least 120 contiguous, at least 125 contiguous, at least 130 contiguous, at least 135 contiguous, at least 140 contiguous, at least 145 contiguous, at least 150 contiguous, at least 155 contiguous, at least 160 contiguous, at least 165 contiguous, at least 170 contiguous, at least 175 contiguous, at least 180 contiguous, at least 185 contiguous, at least 190 contiguous, at least 195 contiguous, at least 200 contiguous, at least 205 contiguous, at least 210 contiguous, at least 215 contiguous, at least 220 contiguous, at least 225 contiguous, at least 230 contiguous, at least 235 contiguous, at least 240 contiguous, at least 245 contiguous, at least 250 contiguous, at least 255 contiguous, at least 260 contiguous, at 265 contiguous, at least 270 contiguous, at least 275 contiguous, at least 280 contiguous, at least 285 contiguous, at least 290 contiguous, at least 295 contiguous, at least 300 contiguous, at least 305 contiguous, at least 310 contiguous, at least 315 contiguous, at least 320 contiguous, at least 325 contiguous, at least 330 contiguous, at least 335 contiguous, at least 340 contiguous, at least 345 contiguous, at least 350 contiguous, at least 355 contiguous, at least 360 contiguous, at least 365 contiguous, at least 370 contiguous, at least 375 contiguous, at least 380 contiguous, at least 385 contiguous, at least 390 contiguous, at least 395 contiguous, at least 400, at least 405 contiguous, at least 410 contiguous, at least 415 contiguous, at least 420 contiguous, at least 425 contiguous, at least 430 contiguous, at least 435 contiguous, at least 440 contiguous, at least 445 contiguous, at least 450 contiguous, contiguous, 455 contiguous, at least 460 contiguous, at least 465 contiguous, at least 470 contiguous, at least 475 contiguous, at least 480 contiguous, at least 485 contiguous, at least 490 contiguous, at least 495 contiguous, at least 500 contiguous, at least 505 contiguous, at least 510 contiguous, at least 515 contiguous, at least 520 contiguous, at least 525 contiguous, at least 530 contiguous, at least 535 contiguous, at least 540 contiguous, at least 545 contiguous, at least 550 contiguous, at least 555 contiguous, at least 560 contiguous, at least 565 contiguous, at least 570 contiguous, at least 575 contiguous, at least 580 contiguous, at least 585 contiguous, at least 590 contiguous, at least 595 contiguous, at least 600 contiguous, at least 605 contiguous, at least 610 contiguous, at least 615 contiguous, at least 620 contiguous, at least 625 contiguous, at least 630 contiguous, at least 635 contiguous, at least 640 contiguous, at least 645 contiguous, at least 650 contiguous, at least 655 contiguous, at least 660 contiguous, at least 665 contiguous, at least 670 contiguous, at least 675 contiguous, at least 680 contiguous, at least 685 contiguous, at least 690 contiguous, at least 695 contiguous, at least 700 contiguous, at least 705 contiguous, at least 710 contiguous, at least 715 contiguous, at least 720 contiguous, at least 725 contiguous, at least 730 contiguous, at least 735 contiguous, at least 740 contiguous, at least 745 contiguous, at least 750 contiguous, at least 755 contiguous, at least 760 contiguous, at least 765 contiguous, at least 770 contiguous, at least 775 contiguous, at least 780 contiguous, at least 785 contiguous, at least 790 contiguous, at least 795 contiguous, at least 800 contiguous, at least 805 contiguous, at least 810 contiguous, at least 815 contiguous, at least 820 contiguous, at least 825 contiguous, at least 830 contiguous, at least 835 contiguous, at least 840 contiguous, at least 845 contiguous, at least 850 contiguous, at least 855 contiguous, at least 860 contiguous, at least 865 contiguous, at least 870 contiguous, at least 875 contiguous, at least 880 contiguous, at least 885 contiguous, at least 890 contiguous, at least 895 contiguous, at least 900 contiguous, at least 905 contiguous, at least 910 contiguous, at least 915 contiguous, at least 920 contiguous, at least 925 contiguous, at least 930 contiguous, at least 935 contiguous, at least 940 contiguous, at least 945 contiguous, at least 950 contiguous, at least 955 contiguous, at least 960 contiguous, at least 965 contiguous, at least 970 contiguous, at least 975 contiguous, at least 980 contiguous, at least 985 contiguous, at least 990 contiguous, at least 995 contiguous, at least 1000 contiguous, at least 1005 contiguous, or at least 1010 contiguous) nucleotides from anywhere within SEQ ID NO: 15 or SEQ ID NO: 22.
[0396] For example, a 3′UTR can include or consist of one or more of: nucleotide positions 1 to 1000, nucleotide positions 1 to 950, nucleotide positions 1 to 900, nucleotide positions 1 to 850, nucleotide positions 1 to 800, nucleotide positions 1 to 750, nucleotide positions 1 to 700, nucleotide positions 1 to 650, nucleotide positions 1 to 600, nucleotide positions 1 to 550, nucleotide positions 1 to 500, nucleotide positions 1 to 450, nucleotide positions 1 to 400, nucleotide positions 1 to 350, nucleotide positions 1 to 300, nucleotide positions 1 to 250, nucleotide positions 1 to 200, nucleotide positions 1 to 150, nucleotide positions 1 to 100, nucleotide positions 1 to 50, nucleotide positions 50 to 1000, nucleotide positions 50 to 950, nucleotide positions 50 to 900, nucleotide positions 50 to 850, nucleotide positions 50 to 800, nucleotide positions 50 to 750, nucleotide positions 50 to 700, nucleotide positions 50 to 650, nucleotide positions 50 to 600, nucleotide positions 50 to 550, nucleotide positions 50 to 500, nucleotide positions 50 to 450, nucleotide positions 50 to 400, nucleotide positions 50 to 350, nucleotide positions 50 to 300, nucleotide positions 50 to 250, nucleotide positions 50 to 200, nucleotide positions 50 to 150, nucleotide positions 50 to 100, nucleotide positions 100 to 1000, nucleotide positions 100 to 950, nucleotide positions 100 to 900, nucleotide positions 100 to 850, nucleotide positions 100 to 800, nucleotide positions 100 to 750, nucleotide positions 100 to 700, nucleotide positions 100 to 650, nucleotide positions 100 to 600, nucleotide positions 100 to 550, nucleotide positions 100 to 500, nucleotide positions 100 to 450, nucleotide positions 100 to 400, nucleotide positions 100 to 350, nucleotide positions 100 to 300, nucleotide positions 100 to 250, nucleotide positions 100 to 200, nucleotide positions 100 to 150, nucleotide positions 150 to 1000, nucleotide positions 150 to 950, nucleotide positions 150 to 900, nucleotide positions 150 to 850, nucleotide positions 150 to 800, nucleotide positions 150 to 750, nucleotide positions 150 to 700, nucleotide positions 150 to 650, nucleotide positions 150 to 600, nucleotide positions 150 to 550, nucleotide positions 150 to 500, nucleotide positions 150 to 450, nucleotide positions 150 to 400, nucleotide positions 150 to 350, nucleotide positions 150 to 300, nucleotide positions 150 to 250, nucleotide positions 150 to 200, nucleotide positions 200 to 1000, nucleotide positions 200 to 950, nucleotide positions 200 to 900, nucleotide positions 200 to 850, nucleotide positions 200 to 800, nucleotide positions 200 to 750, nucleotide positions 200 to 700, nucleotide positions 200 to 650, nucleotide positions 200 to 600, nucleotide positions 200 to 550, nucleotide positions 200 to 500, nucleotide positions 200 to 450, nucleotide positions 200 to 400, nucleotide positions 200 to 350, nucleotide positions 200 to 300, nucleotide positions 200 to 250, nucleotide positions 250 to 1000, nucleotide positions 250 to 950, nucleotide positions 250 to 900, nucleotide positions 250 to 850, nucleotide positions 250 to 800, nucleotide positions 250 to 750, nucleotide positions 250 to 700, nucleotide positions 250 to 650, nucleotide positions 250 to 600, nucleotide positions 250 to 550, nucleotide positions 250 to 500, nucleotide positions 250 to 450, nucleotide positions 250 to 400, nucleotide positions 250 to 350, nucleotide positions 250 to 300, nucleotide positions 300 to 1000, nucleotide positions 300 to 950, nucleotide positions 300 to 900, nucleotide positions 300 to 850, nucleotide positions 300 to 800, nucleotide positions 300 to 750, nucleotide positions 300 to 700, nucleotide positions 300 to 650, nucleotide positions 300 to 600, nucleotide positions 300 to 550, nucleotide positions 300 to 500, nucleotide positions 300 to 450, nucleotide positions 300 to 400, nucleotide positions 300 to 350, nucleotide positions 350 to 1000, nucleotide positions 350 to 950, nucleotide positions 350 to 900, nucleotide positions 350 to 850, nucleotide positions 350 to 800, nucleotide positions 350 to 750, nucleotide positions 350 to 700, nucleotide positions 350 to 650, nucleotide positions 350 to 600, nucleotide positions 350 to 550, nucleotide positions 350 to 500, nucleotide positions 350 to 450, nucleotide positions 350 to 400, nucleotide positions 400 to 1000, nucleotide positions 400 to 950, nucleotide positions 400 to 900, nucleotide positions 400 to 850, nucleotide positions 400 to 800, nucleotide positions 400 to 750, nucleotide positions 400 to 700, nucleotide positions 400 to 650, nucleotide positions 400 to 600, nucleotide positions 400 to 550, nucleotide positions 400 to 500, nucleotide positions 400 to 450, nucleotide positions 450 to 1000, nucleotide positions 450 to 950, nucleotide positions 450 to 900, nucleotide positions 450 to 850, nucleotide positions 450 to 800, nucleotide positions 450 to 750, nucleotide positions 450 to 700, nucleotide positions 450 to 650, nucleotide positions 450 to 600, nucleotide positions 450 to 550, nucleotide positions 450 to 500, nucleotide positions 500 to 1000, nucleotide positions 500 to 950, nucleotide positions 500 to 900, nucleotide positions 500 to 850, nucleotide positions 500 to 800, nucleotide positions 500 to 750, nucleotide positions 500 to 700, nucleotide positions 500 to 650, nucleotide positions 500 to 600, nucleotide positions 500 to 550, nucleotide positions 550 to 1000, nucleotide positions 550 to 950, nucleotide positions 550 to 900, nucleotide positions 550 to 850, nucleotide positions 550 to 800, nucleotide positions 550 to 750, nucleotide positions 550 to 700, nucleotide positions 550 to 650, nucleotide positions 550 to 600, nucleotide positions 600 to 1000, nucleotide positions 600 to 950, nucleotide positions 600 to 900, nucleotide positions 600 to 850, nucleotide positions 600 to 800, nucleotide positions 600 to 750, nucleotide positions 600 to 700, nucleotide positions 600 to 650, nucleotide positions 650 to 1000, nucleotide positions 650 to 950, nucleotide positions 650 to 900, nucleotide positions 650 to 850, nucleotide positions 650 to 800, nucleotide positions 650 to 750, nucleotide positions 650 to 700, nucleotide positions 700 to 1000, nucleotide positions 700 to 950, nucleotide positions 700 to 900, nucleotide positions 700 to 850, nucleotide positions 700 to 800, nucleotide positions 700 to 750, nucleotide positions 750 to 1000, nucleotide positions 750 to 950, nucleotide positions 750 to 900, nucleotide positions 750 to 850, nucleotide positions 750 to 800, nucleotide positions 800 to 1000, nucleotide positions 800 to 950, nucleotide positions 800 to 900, nucleotide positions 800 to 850, nucleotide positions 850 to 1000, nucleotide positions 850 to 950, nucleotide positions 850 to 900, nucleotide positions 900 to 1000, nucleotide positions 900 to 950, or nucleotide positions 950 to 1000, of SEQ ID NO: 15 or SEQ ID NO: 22.
[0397] In some embodiments of any of the compositions described herein, the 3′UTR includes a sequence that is at least 70% (e.g., at least 72%, at least 74%, at least 75%, at least 76%, at least 78%, at least 80%, at least 82%, at least 84%, at least 85%, at least 86%, at least 88%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to SEQ ID NO: 15 or SEQ ID NO: 22.
TABLE-US-00029 Exemplary 3′UTR Sequence, 3′UTR-1023 (SEQ ID NO: 15) GGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTC GACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATG CAACAATTCTCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTACATG CTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTC CTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTC CCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGG GAAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTG CACGCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCT GGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTGGT CTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACC AAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACA TTAAAATAACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGAC AGCTATAATTATTATTCAGAAATGACTTTTTGAAAGTAAAAGCAGCATAA AGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGT AAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGAGT CTGGAAAAGGCATATATGTAGTAGTGGCATGGAGAATGCACCATACTCAT GCATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAG CTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCATGTGA AACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCA AATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTG AAATAAAAAGAGTATATTCAAAA Exemplary 3′UTR Sequence (SEQ ID NO: 22) TGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTC GACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATG CAACAATTCTCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTACATG CTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTC CTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTC CCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGG GAAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTG CACGCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCT GGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTGGT CTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACC AAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACA TTAAAATAACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGAC AGCTATAATTATTATTCAGAAATGACTTTTTGAAAGTAAAAGCAGCATAA AGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGT AAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACT CTGGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCAT GCATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAG CTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCATGTGA AACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCA AATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTG AAATAAAAAGAGTATATTCAAAA
[0398] In some embodiments, the introduction, removal, or modification of 3′UTR AREs can be used to modulate the stability of an mRNA encoding a secreted target protein (e.g., a NDP protein, a HSPA1A protein). In other embodiments, AREs can be removed or mutated to increase the intracellular stability and thus increase translation and production of a secreted target protein (e.g., a NDP protein, a HSPA1A protein).
[0399] In other embodiments, non-ARE sequences may be incorporated into the 5′ or 3′ UTRs. In some embodiments, introns or portions of intron sequences may be incorporated into the flanking regions of the polynucleotides in any of the vectors, compositions, kits, and methods provided herein. Incorporation of intronic sequences may increase protein production as well as mRNA levels.
[0400] In some embodiments of any of the vectors described herein, the vector includes a chimeric intron sequence (SEQ ID NO: 19).
TABLE-US-00030 Exemplary Chimeric Intron cDNA Sequence (SEQ ID NO: 19) GGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGCCGCCTCGCG CCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGC GGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGG CTCGTTTCTTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGG CCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTGCGTGCGTGTGTGTGTGC GTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGCTGC GGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGC TGCGTGCGGGGTGTGTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGC GGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAGTTGCTGAGCAC GGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGC CGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGC CGCCTCGGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGC GCCGGCGGCTGTCGAGGCGCGGCGAGCCGCAGCCATTGCCTTTTATGGTA ATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGCGGAGC CGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGAAG CGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTC GCCGCGCCGCCGTCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGG GGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGGGTTCGGCTTCTGG CGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTT CTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTG
[0401] Additional Sequences
[0402] In some embodiments, constructs of the present disclosure may comprise a T2A element or sequence. In some embodiments, constructs of the present disclosure may include one or more cloning sites. In some such embodiments, cloning sites may not be fully removed prior to manufacturing for administration to a subject. In some embodiments, cloning sites may have functional roles including as linker sequences, or as portions of a Kozak site. As will be appreciated by those skilled in the art, cloning sites may vary significantly in primary sequence while retaining their desired function. In some embodiments, constructs may contain any combination of cloning sites, exemplary cloning sites are represented by SEQ ID NO:73-77, 106, 75 and 107, respectively, in order of appearance.
TABLE-US-00031 Exemplary cloning site A (SEQ ID NO: 73) TTGTCGACGCGGCCGCACGCGT Exemplary cloning site B (SEQ ID NO: 74) CTCCTGGGCAACGTGCTGGTTATTGTGACCGGTCGCTAGCCACC Exemplary cloning site C (SEQ ID NO: 75) TAAGAGCTCGCTGATCAGCCTCGA Exemplary cloning site D (SEQ ID NO: 76) AAGCTTGAATTCAGCTGACGTGCCTCGGACCGTCCTAGG Exemplary cloning site E (SEQ ID NO: 77) GCGGCCGCACGCGT Exemplary cloning site F (SEQ ID NO: 106) CTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACC Exemplary cloning site G (SEQ ID NO: 75) TAAGAGCTCGCTGATCAGCCTCGA Exemplary cloning site H (SEQ ID NO: 107) AAGCTTGAATTCAGCTGACGTGCCTCGGACCGCT
[0403] Destabilization Domains
[0404] In some embodiments, any of the constructs provided herein can optionally include a sequence encoding a destabilizing domain (“a destabilizing sequence”) for temporal control of protein expression. Non-limiting examples of destabilizing sequences include sequences encoding a FK506 sequence, a dihydrofolate reductase (DHFR) sequence, or other exemplary destabilizing sequences.
[0405] In the absence of a stabilizing ligand, a protein sequence operatively linked to a destabilizing sequence is degraded by ubiquitination. In contrast, in the presence of a stabilizing ligand, protein degradation is inhibited, thereby allowing the protein sequence operatively linked to the destabilizing sequence to be actively expressed. As a positive control for stabilization of protein expression, protein expression can be detected by conventional means, including enzymatic, radiographic, colorimetric, fluorescence, or other spectrographic assays; fluorescent activating cell sorting (FACS) assays; immunological assays (e.g., enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and immunohistochemistry).
[0406] Additional examples of destabilizing sequences are known in the art. In some embodiments, the destabilizing sequence is a FK506- and rapamycin-binding protein (FKBP12) sequence, and the stabilizing ligand is Shield-1 (Shld1) (Banaszynski et al. (2012) Cell 126(5): 995-1004, which is incorporated in its entirety herein by reference). In some embodiments, a destabilizing sequence is a DHFR sequence, and a stabilizing ligand is trimethoprim (TMP) (Iwamoto et al. (2010) Chem Biol 17:981-988, which is incorporated in its entirety herein by reference).
[0407] In some embodiments, a destabilizing sequence is a FKBP12 sequence, and a presence of an AAV construct carrying the FKBP12 gene in a subject cell (e.g., a supporting cochlear outer hair cell) is detected by western blotting. In some embodiments, a destabilizing sequence can be used to verify the temporally-specific activity of any of the AAV constructs described herein.
TABLE-US-00032 Exemplary DHFR destabilizing amino acid sequence (SEQ ID NO: 108) MISLIAALAVDYVIGMENAMPWNLPADLAWFKRNTLNKPVIMGR HTWESIGRPLPGRKNIILSSQPSTDDRVTWVKSVDEAIAACGDV PEIMVIGGGRVIEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDD WESVFSEFHDADAQNSHSYCFEILERR Exemplary DHFR destabilizing nucleotide sequence (SEQ ID NO: 109) GGTACCATCAGTCTGATTGCGGCGTTAGCGGTAGATTACGTTAT CGGCATGGAAAACGCCATGCCGTGGAACCTGCCTGCCGATCTCG CCTGGTTTAAACGCAACACCTTAAATAAACCCGTGATTATGGGC CGCCATACCTGGGAATCAATCGGTCGTCCGTTGCCAGGACGCAA AAATATTATCCTCAGCAGTCAACCGAGTACGGACGATCGCGTAA CGTGGGTGAAGTCGGTGGATGAAGCCATCGCGGCGTGTGGTGAC GTACCAGAAATCATGGTGATTGGCGGCGGTCGCGTTATTGAACA GTTCTTGCCAAAAGCGCAAAAACTGTATCTGACGCATATCGACG CAGAAGTGGAAGGCGACACCCATTTCCCGGATTACGAGCCGGAT GACTGGGAATCGGTATTCAGCGAATTCCACGATGCTGATGCGCA GAACTCTCACAGCTATTGCTTTGAGATTCTGGAGCGGCGATAA Exemplary destabilizing domain (SEQ ID NO: 110) ATCAGTCTGATTGCGGCGTTAGCGGTAGATTACGTTATCGGCAT GGAAAACGCCATGCCGTGGAACCTGCCTGCCGATCTCGCCTGGT TTAAACGCAACACCTTAAATAAACCCGTGATTATGGGCCGCCAT ACCTGGGAATCAATCGGTCGTCCGTTGCCAGGACGCAAAAATAT TATCCTCAGCAGTCAACCGAGTACGGACGATCGCGTAACGTGGG TGAAGTCGGTGGATGAAGCCATCGCGGCGTGTGGTGACGTACCA GAAATCATGGTGATTGGCGGCGGTCGCGTTATTGAACAGTTCTT GCCAAAAGCGCAAAAACTGTATCTGACGCATATCGACGCAGAAG TGGAAGGCGACACCCATTTCCCGGATTACGAGCCGGATGACTGG GAATCGGTATTCAGCGAATTCCACGATGCTGATGCGCAGAACTC TCACAGCTATTGCTTTGAGATTCTGGAGCGGCGA Exemplary FKBP12 destabilizing peptide amino acid sequence (SEQ ID NO: 111) MGVEKQVIRPGNGPKPAPGQTVTVHCTGFGKDGDLSQKFWSTKD EGQKPFSFQIGKGAVIKGWDEGVIGMQIGEVARLRCSSDYAYGA GGFPAWGIQPNSVLDFEIEVLSVQ
[0408] AAV Capsids
[0409] The present disclosure provides one or more polynucleotide constructs packaged into an AAV capsid. In some embodiments, an AAV capsid is from or derived from an AAV capsid of an AAV2, 3, 4, 5, 6, 7, 8, 9, 10, rh8, rh10, rh39, rh43 or Anc80 serotype, or one or more hybrids thereof. In some embodiments, an AAV capsid is from an AAV ancestral serotype. In some embodiments, an AAV capsid is an ancestral (Anc) AAV capsid. An Anc capsid is created from a construct sequence that is constructed using evolutionary probabilities and evolutionary modeling to determine a probable ancestral sequence. Thus, an Anc capsid/construct sequence is not known to have existed in nature. For example, in some embodiments, an AAV capsid is an Anc80 capsid (e.g., an Anc80L65 capsid). In some embodiments, an AAV capsid is created using a template nucleotide coding sequence comprising SEQ ID NO: 58. In some embodiments, the capsid comprises a polypeptide represented by SEQ ID NO: 59. In some embodiments, the capsid comprises a polypeptide with at least 85%, 90°/, 95%, 98% or 99% sequence identity to the polypeptide represented by SEQ ID NO: 59.
[0410] As provided herein, any combination of AAV capsids and AAV constructs (e.g., comprising AAV ITRs) may be used in recombinant AAV (rAAV) particles of the present disclosure. For example, wild type or variant AAV2 ITRs and Anc80 capsid, wild type or variant AAV2 ITRs and AAV6 capsid, etc. In some embodiments of the present disclosure, an AAV particle is wholly comprised of AAV2 components (e.g., capsid and ITRs are AAV2 serotype). In some embodiments, an AAV particle is an AAV2/6, AAV2/8 or AAV2/9 particle (e.g., an AAV6, AAV8 or AAV9 capsid with an AAV construct having AAV2 ITRs). In some embodiments of the present disclosure, an AAV particle is an AAV2/Anc80 particle that comprises an Anc80 capsid (e.g., comprising a polypeptide of SEQ ID NO: 58) that encapsulates an AAV construct with AAV2 ITRs (e.g., SEQ ID NOs: 60 and 61) flanking a portion of a coding sequence, for example, a gene encoding a secreted target protein (e.g., an NDP gene, e.g., an HSPA1A gene) or characteristic portion thereof (e.g., SEQ ID NO: 1, 33, 3, 33, or 85). Other AAV particles are known in the art and are described in, e.g., Sharma et al., Brain Res Bull. 2010 Feb. 15; 81(2-3): 273, which is incorporated in its entirety herein by reference. In some embodiments, a capsid sequence is at least 85%, 90%, 95%, 98% or 99% identical to a capsid nucleotide or amino acid sequence represented by SEQ ID NO: 8 or 9, respectively.
TABLE-US-00033 Exemplary AAV Anc80 Capsid DNA Sequence (SEQ ID NO: 58) ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCT CTCTGAGGGCATTCGCGAGTGGTGGGACTTGAAACCTGGAGCCC CGAAACCCAAAGCCAACCAGCAAAAGCAGGACGACGGCCGGGGT CTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACT CGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCG AGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAAT CCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCG TCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAG TCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTT GAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAGAGACCGGTAGA GCAATCACCCCAGGAACCAGACTCCTCTTCGGGCATCGGCAAGA AAGGCCAGCAGCCCGCGAAGAAGAGACTCAACTTTGGGCAGACA GGCGACTCAGAGTCAGTGCCCGACCCTCAACCACTCGGAGAACC CCCCGCAGCCCCCTCTGGTGTGGGATCTAATACAATGGCAGCAG GCGGTGGCGCTCCAATGGCAGACAATAACGAAGGCGCCGACGGA GTGGGTAACGCCTCAGGAAATTGGCATTGCGATTCCACATGGCT GGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTCC CCACCTACAACAACCACCTCTACAAGCAAATCTCCAGCCAATCG GGAGCAAGCACCAACGACAACACCTACTTCGGCTACAGCACCCC CTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCAC CACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGG CCCAAGAGACTCAACTTCAAGCTCTTCAACATCCAGGTCAAGGA GGTCACGACGAATGATGGCACCACGACCATCGCCAATAACCTTA CCAGCACGGTTCAGGTCTTTACGGACTCGGAATACCAGCTCCCG TACGTCCTCGGCTCTGCGCACCAGGGCTGCCTGCCTCCGTTCCC GGCGGACGTCTTCATGATTCCTCAGTACGGGTACCTGACTCTGA ACAATGGCAGTCAGGCCGTGGGCCGTTCCTCCTTCTACTGCCTG GAGTACTTTCCTTCTCAAATGCTGAGAACGGGCAACAACTTTGA GTTCAGCTACACGTTTGAGGACGTGCCTTTTCACAGCAGCTACG CGCACAGCCAAAGCCTGGACCGGCTGATGAACCCCCTCATCGAC CAGTACCTGTACTACCTGTCTCGGACTCAGACCACGAGTGGTAC CGCAGGAAATCGGACGTTGCAATTTTCTCAGGCCGGGCCTAGTA GCATGGCGAATCAGGCCAAAAACTGGCTACCCGGGCCCTGCTAC CGGCAGCAACGCGTCTCCAAGACAGCGAATCAAAATAACAACAG CAACTTTGCCTGGACCGGTGCCACCAAGTATCATCTGAATGGCA GAGACTCTCTGGTAAATCCCGGTCCCGCTATGGCAACCCACAAG GACGACGAAGACAAATTTTTTCCGATGAGCGGAGTCTTAATATT TGGGAAACAGGGAGCTGGAAATAGCAACGTGGACCTTGACAACG TTATGATAACGAGTGAGGAAGAAATTAAAACCACCAACCGAGTG GCCACAGAACAGTACGGCACGGTGGCCACTAACCTGCAATCGTC AAACACCGCTCCTGCTACAGGGACCGTCAACAGTCAAGGAGCCT TACCTGGCATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGT CCTATCTGGGCCAAGATTCCTCACACGGACGGACACTTTCATCC CTCGCCGCTGATGGGAGGCTTTGGACTGAAACACCCGCCTCCTC AGATCCTGATTAAGAATACACCTGTTCCCGCGAATCCTCCAACT ACCTTCAGTCCAGCTAAGTTTGCGTCGTTCATCACGCAGTACAG CACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAAAG AAAACAGCAAACGCTGGAACCCAGAGATTCAATACACTTCCAAC TACAACAAATCTACAAATGTGGACTTTGCTGTTGACACAAATGG CGTTTATTCTGAGCCTCGCCCCATCGGCACCCGTTACCTCACCC GTAATCTG Exemplary AAV Anc80 Capsid Amino Acid Sequence (SEQ ID NO: 59) MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRG LVLPGYKYLGPFNGLDKGEPVNAADAAALEHDKAYDQQLKAGDN PYLRYNHADAEFQERLQEDTSFGGNLGRAVFQAKKRVLEPLGLV EEGAKTAPGKKRPVEQSPQEPDSSSGIGKKGQQPAKKRLNFGQT GDSESVPDPQPLGEPPAAPSGVGSNTMAAGGGAPMADNNEGADG VGNASGNWHCDSTWLGDRVITTSTRTWALPTYNNHLYKQISSQS GASTNDNTYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGFR PKRLNFKLFNIQVKEVTTNDGTTTIANNLTSTVQVFTDSEYQLP YVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQAVGRSSFYCL EYFPSQMLRTGNNFEFSYTFEDVPFHSSYAHSQSLDRLMNPLID QYLYYLSRTQTTSGTAGNRTLQFSQAGPSSMANQAKNWLPGPCY RQQRVSKTANQNNNSNFAWTGATKYHLNGRDSLVNPGPAMATHK DDEDKFFPMSGVLIFGKQGAGNSNVDLDNVMITSEEEIKTTNPV ATEQYGTVATNLQSSNTAPATGTVNSQGALPGMVWQNRDVYLQG PIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPPT TFSPAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSN YNKSTNVDFAVDTNGVYSEPRPIGTRYLTRNL*
[0411] Compositions
[0412] Among other things, the present disclosure provides compositions. In some embodiments, a composition comprises a construct as described herein. In some embodiments, a composition comprises one or more constructs as described herein. In some embodiments, a composition comprises a plurality of constructs as described herein. In some embodiments, when more than one construct is included in the composition, the constructs are each different.
[0413] In some embodiments, a composition comprises an AAV particle as described herein. In some embodiments, a composition comprises one or more AAV particles as described herein. In some embodiments, a composition comprises a plurality of AAV particles. In come embodiments, when more than one AAV particle is included in the composition, the AAV particles are each different.
[0414] In some embodiments, a composition comprises secreted target protein. In some embodiments, a composition comprises a cell.
[0415] In some embodiments, a composition is or comprises a pharmaceutical composition.
[0416] Single AAV Construct Compositions
[0417] In some embodiments, the present disclosure provides compositions or systems comprising AAV particles comprised of a single construct. In some such embodiments, a single construct may deliver a polynucleotide that encodes a functional (e.g., wild type or otherwise functional, e.g., codon optimized) copy of a gene encoding a secreted target protein (e.g., an NDP gene, e.g., an HSPA1A) or characteristic portion thereof. In some embodiments, a construct is or comprises an rAAV construct. In some embodiments described herein, a single rAAV construct is capable of expressing a full-length secreted target protein (e.g., NDP, e.g., HSPA1A) messenger RNA or a characteristic protein thereof in a target cell (e.g., an inner ear cell). In some embodiments, a single construct (e.g., any of the constructs described herein) can include a sequence encoding a functional secreted target protein (e.g., any construct that generates functional secreted target protein). In some embodiments, a single construct (e.g., any of the constructs described herein) can include a sequence encoding a functional secreted target protein (e.g., any construct that generates functional secreted target protein) and optionally additional polypeptide sequences (e.g., regulatory sequences, and/or reporter sequences).
[0418] In some embodiments, a single construct composition or system may comprise any or all of the exemplary construct components described herein. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 9. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 10. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 11. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 12. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 13. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 96. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 97. In some embodiments, an exemplary single construct is represented by SEQ ID NO: 98. In some embodiments, an exemplary single construct is at least 85%, 90%, 95%, 98% or 99% identical to the sequence represented by SEQ ID NO: 9, 10, 11, 13, 96, 97, or 98. One skilled in the art would recognize that constructs may undergo additional modifications including codon-optimization, introduction of novel but functionally equivalent (e.g., silent mutations), addition of reporter sequences, and/or other routine modification.
[0419] In some embodiments, an exemplary construct comprises: a 5′ ITR exemplified by SEQ ID NO: 129, optionally a cloning site exemplified by SEQ ID NO: 129, a CMV enhancer exemplified by SEQ ID NO: 130, a CBA promoter exemplified by SEQ ID NO: 131, a chimeric intron exemplified by SEQ ID NO: 132, optionally a cloning site exemplified by SEQ ID NO: 133, an NDP coding region exemplified by SEQ ID NO: 1 or an HSPA1A coding region exemplified by SEQ ID NO: 134 (optionally, wherein a IL2ss coding region precedes the HSPA1A codon region exemplified by SEQ ID NO: 122), optionally a cloning site exemplified by SEQ ID NO: 135, a poly(A) site exemplified by SEQ ID NO: 136, optionally a cloning site exemplified by SEQ ID NO: 137, and a 3′ ITR exemplified by SEQ ID NO: 138.
[0420] In some embodiments, an exemplary construct comprises: a 5′ ITR exemplified by SEQ ID NO: 139, optionally a cloning site exemplified by SEQ ID NO: 140, a CMV enhancer exemplified by SEQ ID NO: 141, a CBA promoter exemplified by SEQ ID NO: 142, a chimeric intron exemplified by SEQ ID NO: 143, optionally a cloning site exemplified by SEQ ID NO: 144, an NDP coding region exemplified by SEQ ID NO: 1 or an HSPA1A coding region as exemplified by SEQ ID NO: 145, optionally a reporter sequence exemplified by SEQ ID NO: 148, optionally a cloning site exemplified by SEQ ID NO: 149, a poly(A) site exemplified by SEQ ID NO: 150, optionally a cloning site exemplified by SEQ ID NO: 151, and a 3′ ITR exemplified by SEQ ID NO: 152.
[0421] Multiple AAV Construct Compositions
[0422] The present disclosure recognizes that some coding sequences encoding a protein (e.g., secreted target protein) may be delivered by dividing the coding sequence into multiple portions, which are each included in a different construct. In some embodiments, provided herein are compositions or systems comprising at least two different constructs (e.g., two, three, four, five, or six). In some embodiments, each of the at least two different constructs includes a coding sequence that encodes a different portion of a coding region (e.g., encoding a target protein, e.g., an inner ear target protein, e.g., a secreted target protein), each of the encoded portions being at least 10 amino acids (e.g., at least about 10 amino acids, at least about 20 amino acids, at least about 30 amino acids, at least about 60 amino acids, at least about 70 amino acids, at least about 80 amino acids, at least about 90 amino acids, at least about 100 amino acids, at least about 110 amino acids, at least about 120 amino acids, at least about 130 amino acids, at least about 140 amino acids, at least about 150 amino acids, at least about 160 amino acids, at least about 170 amino acids, at least about 180 amino acids, at least about 190 amino acids, at least about 200 amino acids, at least about 210 amino acids, at least about 220 amino acids, at least about 230 amino acids, at least about 240 amino acids, at least about 250 amino acids, at least about 260 amino acids, at least about 270 amino acids, at least about 280 amino acids, at least about 290 amino acids, at least about 300 amino acids, at least about 310 amino acids, at least about 320 amino acids, at least about 330 amino acids, at least about 340 amino acids, at least about 350 amino acids, at least about 360 amino acids, at least about 370 amino acids, at least about 380 amino acids, at least about 390 amino acids, at least about 400 amino acids, at least about 410 amino acids, at least about 420 amino acids, at least about 430 amino acids, at least about 440 amino acids, at least about 450 amino acids, at least about 460 amino acids, at least about 470 amino acids, at least about 480 amino acids, at least about 490 amino acids, at least about 500 amino acids, at least about 510 amino acids, at least about 520 amino acids, at least about 530 amino acids, at least about 540 amino acids, at least about 550 amino acids, at least about 560 amino acids, at least about 570 amino acids, at least about 580 amino acids, at least about 590 amino acids, at least about 600 amino acids, at least about 610 amino acids, at least about 620 amino acids, at least about 630 amino acids, at least about 640 amino acids, at least about 650 amino acids, at least about 660 amino acids, at least about 670 amino acids, at least about 680 amino acids, at least about 690 amino acids, at least about 700 amino acids, at least about 710 amino acids, at least about 720 amino acids, at least about 730 amino acids, at least about 740 amino acids, at least about 750 amino acids, at least about 760 amino acids, at least about 770 amino acids, at least about 780 amino acids, at least about 790 amino acids, at least about 800 amino acids, at least about 810 amino acids, or at least about 820 amino acids) where the amino acid sequence of each of the encoded portions may optionally partially overlap with the amino acid sequence of a different one of the encoded portions; no single construct of the at least two different constructs encodes the active target protein; and, when introduced into a subject cell (e.g., an animal cell, e.g., a primate cell, e.g., a human cell), the at least two different constructs undergo homologous recombination with each other, where the recombined nucleic acid encodes an active target protein (e.g., a gene product encoded by an gene encoding a secreted target protein or a characteristic portion thereof). In some embodiments, one of the nucleic acid constructs can include a coding sequence that encodes a portion of a target protein (e.g., an inner ear target protein, e.g., a secreted target protein), where the encoded portion is at most about 820 amino acids (e.g., at most about 10 amino acids, at most about 20 amino acids, at most about 30 amino acids, at most about 60 amino acids, at most about 70 amino acids, at most about 80 amino acids, at most about 90 amino acids, at most about 100 amino acids, at most about 110 amino acids, at most about 120 amino acids, at most about 130 amino acids, at most about 140 amino acids, at most about 150 amino acids, at most about 160 amino acids, at most about 170 amino acids, at most about 180 amino acids, at most about 190 amino acids, at most about 200 amino acids, at most about 210 amino acids, at most about 220 amino acids, at most about 230 amino acids, at most about 240 amino acids, at most about 250 amino acids, at most about 260 amino acids, at most about 270 amino acids, at most about 280 amino acids, at most about 290 amino acids, at most about 300 amino acids, at most about 310 amino acids, at most about 320 amino acids, at most about 330 amino acids, at most about 340 amino acids, at most about 350 amino acids, at most about 360 amino acids, at most about 370 amino acids, at most about 380 amino acids, at most about 390 amino acids, at most about 400 amino acids, at most about 410 amino acids, at most about 420 amino acids, at most about 430 amino acids, at most about 440 amino acids, at most about 450 amino acids, at most about 460 amino acids, at most about 470 amino acids, at most about 480 amino acids, at most about 490 amino acids, at most about 500 amino acids, at most about 510 amino acids, at most about 520 amino acids, at most about 530 amino acids, at most about 540 amino acids, at most about 550 amino acids, at most about 560 amino acids, at most about 570 amino acids, at most about 580 amino acids, at most about 590 amino acids, at most about 600 amino acids, at most about 610 amino acids, at most about 620 amino acids, at most about 630 amino acids, at most about 640 amino acids, at most about 650 amino acids, at most about 660 amino acids, at most about 670 amino acids, at most about 680 amino acids, at most about 690 amino acids, at most about 700 amino acids, at most about 710 amino acids, at most about 720 amino acids, at most about 730 amino acids, at most about 740 amino acids, at most about 750 amino acids, at most about 760 amino acids, at most about 770 amino acids, at most about 780 amino acids, at most about 790 amino acids, at most about 800 amino acids, at most about 810 amino acids, or at most about 820 amino acids).
[0423] In some embodiments, at least one of the constructs includes a nucleotide sequence spanning two neighboring exons of target genomic DNA (e.g., an inner ear target genomic DNA, e.g., NDP genomic DNA, e.g., HSPA1A genomic DNA), and lacks the intronic sequence that naturally occurs between the two neighboring exons.
[0424] In some embodiments, an amino acid sequence of an encoded portion of each of the constructs does not overlap, even in part, with an amino acid sequence of a different one of the encoded portions. In some embodiments, an amino acid sequence of an encoded portion of a construct partially overlaps with an amino acid sequence of an encoded portion of a different construct. In some embodiments, an amino acid sequence of an encoded portion of each construct partially overlaps with an amino acid sequence of an encoded portion of at least one different construct. In some embodiments, an overlapping amino acid sequence is between about 10 amino acid residues to about 820 amino acids, or any of the subranges of this range (e.g., about 10 amino acids, about 20 amino acids, about 30 amino acids, about 60 amino acids, about 70 amino acids, about 80 amino acids, about 90 amino acids, about 100 amino acids, about 110 amino acids, about 120 amino acids, about 130 amino acids, about 140 amino acids, about 150 amino acids, about 160 amino acids, about 170 amino acids, about 180 amino acids, about 190 amino acids, about 200 amino acids, about 210 amino acids, about 220 amino acids, about 230 amino acids, about 240 amino acids, about 250 amino acids, about 260 amino acids, about 270 amino acids, about 280 amino acids, about 290 amino acids, about 300 amino acids, about 310 amino acids, about 320 amino acids, about 330 amino acids, about 340 amino acids, about 350 amino acids, about 360 amino acids, about 370 amino acids, about 380 amino acids, about 390 amino acids, about 400 amino acids, about 410 amino acids, about 420 amino acids, about 430 amino acids, about 440 amino acids, about 450 amino acids, about 460 amino acids, about 470 amino acids, about 480 amino acids, about 490 amino acids, about 500 amino acids, about 510 amino acids, about 520 amino acids, about 530 amino acids, about 540 amino acids, about 550 amino acids, about 560 amino acids, about 570 amino acids, about 580 amino acids, about 590 amino acids, about 600 amino acids, about 610 amino acids, about 620 amino acids, about 630 amino acids, about 640 amino acids, about 650 amino acids, about 660 amino acids, about 670 amino acids, about 680 amino acids, about 690 amino acids, about 700 amino acids, about 710 amino acids, about 720 amino acids, about 730 amino acids, about 740 amino acids, about 750 amino acids, about 760 amino acids, about 770 amino acids, about 780 amino acids, about 790 amino acids, about 800 amino acids, about 810 amino acids, or about 820 amino acids in length).
[0425] In some examples, a desired gene product (e.g., a therapeutic gene product) is encoded by at least two different constructs. In some embodiments, each of at least two different constructs includes a different segment of an intron, where the intron includes a nucleotide sequence of an intron that is present in a target genomic DNA (e.g., an inner ear cell target genomic DNA (e.g., NDP genomic DNA, e.g., HSPA1A genomic DNA) (e.g., any of the exemplary introns in SEQ ID NO: 3 described herein). In some embodiments, different intron segments overlap. In some embodiments, different intron segments overlap in sequence by at most about 12,000 nucleotides (e.g., at most about 100 nucleotides, at most about 200 nucleotides, at most about 300 nucleotides, at most about 600 nucleotides, at most about 700 nucleotides, at most about 800 nucleotides, at most about 900 nucleotides, at most about 1,000 nucleotides, at most about 1,100 nucleotides, at most about 1,200 nucleotides, at most about 1,300 nucleotides, at most about 1,400 nucleotides, at most about 1,500 nucleotides, at most about 1,600 nucleotides, at most about 1,700 nucleotides, at most about 1,800 nucleotides, at most about 1,900 nucleotides, at most about 2,000 nucleotides, at most about 2,100 nucleotides, at most about 2,200 nucleotides, at most about 2,300 nucleotides, at most about 2,400 nucleotides, at most about 2,500 nucleotides, at most about 2,600 nucleotides, at most about 2,700 nucleotides, at most about 2,800 nucleotides, at most about 2,900 nucleotides, at most about 3,000 nucleotides, at most about 3,100 nucleotides, at most about 3,200 nucleotides, at most about 3,300 nucleotides, at most about 3,400 nucleotides, at most about 3,500 nucleotides, at most about 3,600 nucleotides, at most about 3,700 nucleotides, at most about 3,800 nucleotides, at most about 3,900 nucleotides, at most about 4,000 nucleotides, at most about 4,100 nucleotides, at most about 4,200 nucleotides, at most about 4,300 nucleotides, at most about 4,400 nucleotides, at most about 4,500 nucleotides, at most about 4,600 nucleotides, at most about 4,700 nucleotides, at most about 4,800 nucleotides, at most about 4,900 nucleotides, at most about 5,000 nucleotides, at most about 5,100 nucleotides, at most about 5,200 nucleotides, at most about 5,300 nucleotides, at most about 5,400 nucleotides, at most about 5,500 nucleotides, at most about 5,600 nucleotides, at most about 5,700 nucleotides, at most about 5,800 nucleotides, at most about 5,900 nucleotides, at most about 6,000 nucleotides, at most about 6,100 nucleotides, at most about 6,200 nucleotides, at most about 6,300 nucleotides, at most about 6,400 nucleotides, at most about 6,500 nucleotides, at most about 6,600 nucleotides, at most about 6,700 nucleotides, at most about 6,800 nucleotides, at most about 6,900 nucleotides, at most about 7,000 nucleotides, at most about 7,100 nucleotides, at most about 7,200 nucleotides, at most about 7,300 nucleotides, at most about 7,400 nucleotides, at most about 7,500 nucleotides, at most about 7,600 nucleotides, at most about 7,700 nucleotides, at most about 7,800 nucleotides, at most about 7,900 nucleotides, at most about 8,000 nucleotides, at most about 8,100 nucleotides, at most about 8,200 nucleotides, at most about 8,300 nucleotides, at most about 8,400 nucleotides, at most about 8,500 nucleotides, at most about 8,600 nucleotides, at most about 8,700 nucleotides, at most about 8,800 nucleotides, at most about 8,900 nucleotides, at most about 9,000 nucleotides, at most about 9,100 nucleotides, at most about 9,200 nucleotides, at most about 9,300 nucleotides, at most about 9,400 nucleotides, at most about 9,500 nucleotides, at most about 9,600 nucleotides, at most about 9,700 nucleotides, at most about 9,800 nucleotides, at most about 9,900 nucleotides, at most about 10,000 nucleotides, at most about 10,100 nucleotides, at most about 10,200 nucleotides, at most about 10,300 nucleotides, at most about 10,400 nucleotides, at most about 10,500 nucleotides, at most about 10,600 nucleotides, at most about 10,700 nucleotides, at most about 10,800 nucleotides, at most about 10,900 nucleotides, at most about 11,000 nucleotides, at most about 11,100 nucleotides, at most about 11,200 nucleotides, at most about 11,300 nucleotides, at most about 11,400 nucleotides, at most about 11,500 nucleotides, at most about 11,600 nucleotides, at most about 11,700 nucleotides, at most about 11,800 nucleotides, at most about 11,900 nucleotides, or at most about 12,000 nucleotides) in length. In some embodiments, the overlapping nucleotide sequence in any two of the different constructs can include part or all of one or more exons of a target gene (e.g., an inner ear cell target gene (e.g., a secreted target protein (e.g., NDP, e.g., HSPA1A) gene) (e.g., any one or more of the exemplary exons in SEQ ID NO: 3 described herein).
[0426] In some embodiments, a composition or system is or comprises two, three, four, or five different constructs. In compositions where the number of different constructs in the composition is two, the first of the two different constructs can include a coding sequence that encodes an N-terminal portion of a protein (e.g., secreted target protein), which may be referred to as a lead portion, a first construct, or a 5′ portion (e.g., an N-terminal portion of an inner ear cell protein, e.g., an N-terminal portion of a secreted target protein). In some examples, an N-terminal portion of the target gene is at least about 10 amino acids (e.g., at least about 10 amino acids, at least about 20 amino acids, at least about 30 amino acids, at least about 60 amino acids, at least about 70 amino acids, at least about 80 amino acids, at least about 90 amino acids, at least about 100 amino acids, at least about 110 amino acids, at least about 120 amino acids, at least about 130 amino acids, at least about 140 amino acids, at least about 150 amino acids, at least about 160 amino acids, at least about 170 amino acids, at least about 180 amino acids, at least about 190 amino acids, at least about 200 amino acids, at least about 210 amino acids, at least about 220 amino acids, at least about 230 amino acids, at least about 240 amino acids, at least about 250 amino acids, at least about 260 amino acids, at least about 270 amino acids, at least about 280 amino acids, at least about 290 amino acids, at least about 300 amino acids, at least about 310 amino acids, at least about 320 amino acids, at least about 330 amino acids, at least about 340 amino acids, at least about 350 amino acids, at least about 360 amino acids, at least about 370 amino acids, at least about 380 amino acids, at least about 390 amino acids, at least about 400 amino acids, at least about 410 amino acids, at least about 420 amino acids, at least about 430 amino acids, at least about 440 amino acids, at least about 450 amino acids, at least about 460 amino acids, at least about 470 amino acids, at least about 480 amino acids, at least about 490 amino acids, at least about 500 amino acids, at least about 510 amino acids, at least about 520 amino acids, at least about 530 amino acids, at least about 540 amino acids, at least about 550 amino acids, at least about 560 amino acids, at least about 570 amino acids, at least about 580 amino acids, at least about 590 amino acids, at least about 600 amino acids, at least about 610 amino acids, at least about 620 amino acids, at least about 630 amino acids, at least about 640 amino acids, at least about 650 amino acids, at least about 660 amino acids, at least about 670 amino acids, at least about 680 amino acids, at least about 690 amino acids, at least about 700 amino acids, at least about 710 amino acids, at least about 720 amino acids, at least about 730 amino acids, at least about 740 amino acids, at least about 750 amino acids, at least about 760 amino acids, at least about 770 amino acids, at least about 780 amino acids, at least about 790 amino acids, at least about 800 amino acids, at least about 810 amino acids, or at least about 820 amino acids) in length. In some examples, a first construct includes one or both of a promoter (e.g., any of the promoters described herein or known in the art) and a Kozak sequence (e.g., any of the exemplary Kozak sequences described herein or known in the art). In some examples, a first construct includes a promoter that is an inducible promoter, a constitutive promoter, or a tissue-specific promoter. In some examples, a second of the two different constructs includes a coding sequence that encodes a C-terminal portion of the protein, which may be referred to as a terminal portion, a second construct, or a 3′ portion (e.g., a C-terminal portion of an inner ear cell target protein, e.g., a C-terminal portion of a secreted target protein). In some examples, a C-terminal portion of the target protein is at least about 10 amino acids (e.g., at least about 10 amino acids, at least about 20 amino acids, at least about 30 amino acids, at least about 60 amino acids, at least about 70 amino acids, at least about 80 amino acids, at least about 90 amino acids, at least about 100 amino acids, at least about 110 amino acids, at least about 120 amino acids, at least about 130 amino acids, at least about 140 amino acids, at least about 150 amino acids, at least about 160 amino acids, at least about 170 amino acids, at least about 180 amino acids, at least about 190 amino acids, at least about 200 amino acids, at least about 210 amino acids, at least about 220 amino acids, at least about 230 amino acids, at least about 240 amino acids, at least about 250 amino acids, at least about 260 amino acids, at least about 270 amino acids, at least about 280 amino acids, at least about 290 amino acids, at least about 300 amino acids, at least about 310 amino acids, at least about 320 amino acids, at least about 330 amino acids, at least about 340 amino acids, at least about 350 amino acids, at least about 360 amino acids, at least about 370 amino acids, at least about 380 amino acids, at least about 390 amino acids, at least about 400 amino acids, at least about 410 amino acids, at least about 420 amino acids, at least about 430 amino acids, at least about 440 amino acids, at least about 450 amino acids, at least about 460 amino acids, at least about 470 amino acids, at least about 480 amino acids, at least about 490 amino acids, at least about 500 amino acids, at least about 510 amino acids, at least about 520 amino acids, at least about 530 amino acids, at least about 540 amino acids, at least about 550 amino acids, at least about 560 amino acids, at least about 570 amino acids, at least about 580 amino acids, at least about 590 amino acids, at least about 600 amino acids, at least about 610 amino acids, at least about 620 amino acids, at least about 630 amino acids, at least about 640 amino acids, at least about 650 amino acids, at least about 660 amino acids, at least about 670 amino acids, at least about 680 amino acids, at least about 690 amino acids, at least about 700 amino acids, at least about 710 amino acids, at least about 720 amino acids, at least about 730 amino acids, at least about 740 amino acids, at least about 750 amino acids, at least about 760 amino acids, at least about 770 amino acids, at least about 780 amino acids, at least about 790 amino acids, at least about 800 amino acids, at least about 810 amino acids, or at least about 820 amino acids) in length. In some examples, a second construct further includes a poly(A) sequence.
[0427] In some examples where the number of different constructs in the composition is two, an N-terminal portion encoded by one of the two constructs can include a portion including amino acid position 1 to about amino acid position 820, or any subrange of this range (e.g., amino acid 1 to at least about amino acid 10, amino acid 1 to at least about amino acid 20, amino acid 1 to at least about amino acid 30, amino acid 1 to at least about amino acid 60, amino acid 1 to at least about amino acid 70, amino acid 1 to at least about amino acid 80, amino acid 1 to at least about amino acid 90, amino acid 1 to at least about amino acid 100, amino acid 1 to at least about amino acid 110, amino acid 1 to at least about amino acid 120, amino acid 1 to at least about amino acid 130, amino acid 1 to at least about amino acid 140, amino acid 1 to at least about amino acid 150, amino acid 1 to at least about amino acid 160, amino acid 1 to at least about amino acid 170, amino acid 1 to at least about amino acid 180, amino acid 1 to at least about amino acid 190, amino acid 1 to at least about amino acid 200, amino acid 1 to at least about amino acid 210, amino acid 1 to at least about amino acid 220, amino acid 1 to at least about amino acid 230, amino acid 1 to at least about amino acid 240, amino acid 1 to at least about amino acid 250, amino acid 1 to at least about amino acid 260, amino acid 1 to at least about amino acid 270, amino acid 1 to at least about amino acid 280, amino acid 1 to at least about amino acid 290, amino acid 1 to at least about amino acid 300, amino acid 1 to at least about amino acid 310, amino acid 1 to at least about amino acid 320, amino acid 1 to at least about amino acid 330, amino acid 1 to at least about amino acid 340, amino acid 1 to at least about amino acid 350, amino acid 1 to at least about amino acid 360, amino acid 1 to at least about amino acid 370, amino acid 1 to at least about amino acid 380, amino acid 1 to at least about amino acid 390, amino acid 1 to at least about amino acid 400, amino acid 1 to at least about amino acid 410, amino acid 1 to at least about amino acid 420, amino acid 1 to at least about amino acid 430, amino acid 1 to at least about amino acid 440, amino acid 1 to at least about amino acid 450, amino acid 1 to at least about amino acid 460, amino acid 1 to at least about amino acid 470, amino acid 1 to at least about amino acid 480, amino acid 1 to at least about amino acid 490, amino acid 1 to at least about amino acid 500, amino acid 1 to at least about amino acid 510, amino acid 1 to at least about amino acid 520, amino acid 1 to at least about amino acid 530, amino acid 1 to at least about amino acid 540, amino acid 1 to at least about amino acid 550, amino acid 1 to at least about amino acid 560, amino acid 1 to at least about amino acid 570, amino acid 1 to at least about amino acid 580, amino acid 1 to at least about amino acid 590, amino acid 1 to at least about amino acid 600, amino acid 1 to at least about amino acid 610, amino acid 1 to at least about amino acid 620, amino acid 1 to at least about amino acid 630, amino acid 1 to at least about amino acid 640, amino acid 1 to at least about amino acid 650, amino acid 1 to at least about amino acid 660, amino acid 1 to at least about amino acid 670, amino acid 1 to at least about amino acid 680, amino acid 1 to at least about amino acid 690, amino acid 1 to at least about amino acid 700, amino acid 1 to at least about amino acid 710, amino acid 1 to at least about amino acid 720, amino acid 1 to at least about amino acid 730, amino acid 1 to at least about amino acid 740, amino acid 1 to at least about amino acid 750, amino acid 1 to at least about amino acid 760, amino acid 1 to at least about amino acid 770, amino acid 1 to at least about amino acid 780, amino acid 1 to at least about amino acid 790, amino acid 1 to at least about amino acid 800, amino acid 1 to at least about amino acid 810, or amino acid 1 to at least about amino acid 820) of an inner ear cell target protein (e.g., SEQ ID NO: 6 or 7). In some examples where the number of different constructs in the composition is two, an N-terminal portion of the precursor inner ear cell target protein can include a portion including at most amino acid position 1 to amino acid position 820 or any subrange of this range (e.g., amino acid 1 to at most about amino acid 10, amino acid 1 to at most about amino acid 20, amino acid 1 to at most about amino acid 30, amino acid 1 to at most about amino acid 60, amino acid 1 to at most about amino acid 70, amino acid 1 to at most about amino acid 80, amino acid 1 to at most about amino acid 90, amino acid 1 to at most about amino acid 100, amino acid 1 to at most about amino acid 110, amino acid 1 to at most about amino acid 120, amino acid 1 to at most about amino acid 130, amino acid 1 to at most about amino acid 140, amino acid 1 to at most about amino acid 150, amino acid 1 to at most about amino acid 160, amino acid 1 to at most about amino acid 170, amino acid 1 to at most about amino acid 180, amino acid 1 to at most about amino acid 190, amino acid 1 to at most about amino acid 200, amino acid 1 to at most about amino acid 210, amino acid 1 to at most about amino acid 220, amino acid 1 to at most about amino acid 230, amino acid 1 to at most about amino acid 240, amino acid 1 to at most about amino acid 250, amino acid 1 to at most about amino acid 260, amino acid 1 to at most about amino acid 270, amino acid 1 to at most about amino acid 280, amino acid 1 to at most about amino acid 290, amino acid 1 to at most about amino acid 300, amino acid 1 to at most about amino acid 310, amino acid 1 to at most about amino acid 320, amino acid 1 to at most about amino acid 330, amino acid 1 to at most about amino acid 340, amino acid 1 to at most about amino acid 350, amino acid 1 to at most about amino acid 360, amino acid 1 to at most about amino acid 370, amino acid 1 to at most about amino acid 380, amino acid 1 to at most about amino acid 390, amino acid 1 to at most about amino acid 400, amino acid 1 to at most about amino acid 410, amino acid 1 to at most about amino acid 420, amino acid 1 to at most about amino acid 430, amino acid 1 to at most about amino acid 440, amino acid 1 to at most about amino acid 450, amino acid 1 to at most about amino acid 460, amino acid 1 to at most about amino acid 470, amino acid 1 to at most about amino acid 480, amino acid 1 to at most about amino acid 490, amino acid 1 to at most about amino acid 500, amino acid 1 to at most about amino acid 510, amino acid 1 to at most about amino acid 520, amino acid 1 to at most about amino acid 530, amino acid 1 to at most about amino acid 540, amino acid 1 to at most about amino acid 550, amino acid 1 to at most about amino acid 560, amino acid 1 to at most about amino acid 570, amino acid 1 to at most about amino acid 580, amino acid 1 to at most about amino acid 590, amino acid 1 to at most about amino acid 600, amino acid 1 to at most about amino acid 610, amino acid 1 to at most about amino acid 620, amino acid 1 to at most about amino acid 630, amino acid 1 to at most about amino acid 640, amino acid 1 to at most about amino acid 650, amino acid 1 to at most about amino acid 660, amino acid 1 to at most about amino acid 670, amino acid 1 to at most about amino acid 680, amino acid 1 to at most about amino acid 690, amino acid 1 to at most about amino acid 700, amino acid 1 to at most about amino acid 710, amino acid 1 to at most about amino acid 720, amino acid 1 to at most about amino acid 730, amino acid 1 to at most about amino acid 740, amino acid 1 to at most about amino acid 750, amino acid 1 to at most about amino acid 760, amino acid 1 to at most about amino acid 770, amino acid 1 to at most about amino acid 780, amino acid 1 to at most about amino acid 790, amino acid 1 to at most about amino acid 800, amino acid 1 to at most about amino acid 810, or amino acid 1 to at most about amino acid 820) of an inner ear cell target protein (e.g., SEQ ID NO: 6 or 7)
[0428] In some examples where the number of different constructs in the composition is two, a C-terminal portion encoded by one of the two constructs can include a portion including the final amino acid (e.g., about amino acid position 820) to about amino acid position 1, or any subrange of this range (e.g., amino acid 820 to at least about amino acid 10, amino acid 820 to at least about amino acid 20, amino acid 820 to at least about amino acid 30, amino acid 820 to at least about amino acid 60, amino acid 820 to at least about amino acid 70, amino acid 820 to at least about amino acid 80, amino acid 820 to at least about amino acid 90, amino acid 820 to at least about amino acid 100, amino acid 820 to at least about amino acid 110, amino acid 820 to at least about amino acid 120, amino acid 820 to at least about amino acid 130, amino acid 820 to at least about amino acid 140, amino acid 820 to at least about amino acid 150, amino acid 820 to at least about amino acid 160, amino acid 820 to at least about amino acid 170, amino acid 820 to at least about amino acid 180, amino acid 820 to at least about amino acid 190, amino acid 820 to at least about amino acid 200, amino acid 820 to at least about amino acid 210, amino acid 820 to at least about amino acid 220, amino acid 820 to at least about amino acid 230, amino acid 820 to at least about amino acid 240, amino acid 820 to at least about amino acid 250, amino acid 820 to at least about amino acid 260, amino acid 820 to at least about amino acid 270, amino acid 820 to at least about amino acid 280, amino acid 820 to at least about amino acid 290, amino acid 820 to at least about amino acid 300, amino acid 820 to at least about amino acid 310, amino acid 820 to at least about amino acid 320, amino acid 820 to at least about amino acid 330, amino acid 820 to at least about amino acid 340, amino acid 820 to at least about amino acid 350, amino acid 820 to at least about amino acid 360, amino acid 820 to at least about amino acid 370, amino acid 820 to at least about amino acid 380, amino acid 820 to at least about amino acid 390, amino acid 820 to at least about amino acid 400, amino acid 820 to at least about amino acid 410, amino acid 820 to at least about amino acid 420, amino acid 820 to at least about amino acid 430, amino acid 820 to at least about amino acid 440, amino acid 820 to at least about amino acid 450, amino acid 820 to at least about amino acid 460, amino acid 820 to at least about amino acid 470, amino acid 820 to at least about amino acid 480, amino acid 820 to at least about amino acid 490, amino acid 820 to at least about amino acid 500, amino acid 820 to at least about amino acid 510, amino acid 820 to at least about amino acid 520, amino acid 820 to at least about amino acid 530, amino acid 820 to at least about amino acid 540, amino acid 820 to at least about amino acid 550, amino acid 820 to at least about amino acid 560, amino acid 820 to at least about amino acid 570, amino acid 820 to at least about amino acid 580, amino acid 820 to at least about amino acid 590, amino acid 820 to at least about amino acid 600, amino acid 820 to at least about amino acid 610, amino acid 820 to at least about amino acid 620, amino acid 820 to at least about amino acid 630, amino acid 820 to at least about amino acid 640, amino acid 820 to at least about amino acid 650, amino acid 820 to at least about amino acid 660, amino acid 820 to at least about amino acid 670, amino acid 820 to at least about amino acid 680, amino acid 820 to at least about amino acid 690, amino acid 820 to at least about amino acid 700, amino acid 820 to at least about amino acid 710, amino acid 820 to at least about amino acid 720, amino acid 820 to at least about amino acid 730, amino acid 820 to at least about amino acid 740, amino acid 820 to at least about amino acid 750, amino acid 820 to at least about amino acid 760, amino acid 820 to at least about amino acid 770, amino acid 820 to at least about amino acid 780, amino acid 820 to at least about amino acid 790, amino acid 820 to at least about amino acid 800, amino acid 820 to at least about amino acid 810, or amino acid 820 to at least about amino acid 820) of an inner ear cell target protein (e.g., SEQ ID NO: 6 or 7). In some examples where the number of different constructs in the composition is two, a C-terminal portion of the precursor inner ear cell target protein can include a portion including the final amino acid (e.g., about amino acid position 820) to at most about amino acid position 1, or any subrange of this range (e.g., amino acid 820 to at most about amino acid 10, amino acid 820 to at most about amino acid 20, amino acid 820 to at most about amino acid 30, amino acid 820 to at most about amino acid 60, amino acid 820 to at most about amino acid 70, amino acid 820 to at most about amino acid 80, amino acid 820 to at most about amino acid 90, amino acid 820 to at most about amino acid 100, amino acid 820 to at most about amino acid 110, amino acid 820 to at most about amino acid 120, amino acid 820 to at most about amino acid 130, amino acid 820 to at most about amino acid 140, amino acid 820 to at most about amino acid 150, amino acid 820 to at most about amino acid 160, amino acid 820 to at most about amino acid 170, amino acid 820 to at most about amino acid 180, amino acid 820 to at most about amino acid 190, amino acid 820 to at most about amino acid 200, amino acid 820 to at most about amino acid 210, amino acid 820 to at most about amino acid 220, amino acid 820 to at most about amino acid 230, amino acid 820 to at most about amino acid 240, amino acid 820 to at most about amino acid 250, amino acid 820 to at most about amino acid 260, amino acid 820 to at most about amino acid 270, amino acid 820 to at most about amino acid 280, amino acid 820 to at most about amino acid 290, amino acid 820 to at most about amino acid 300, amino acid 820 to at most about amino acid 310, amino acid 820 to at most about amino acid 320, amino acid 820 to at most about amino acid 330, amino acid 820 to at most about amino acid 340, amino acid 820 to at most about amino acid 350, amino acid 820 to at most about amino acid 360, amino acid 820 to at most about amino acid 370, amino acid 820 to at most about amino acid 380, amino acid 820 to at most about amino acid 390, amino acid 820 to at most about amino acid 400, amino acid 820 to at most about amino acid 410, amino acid 820 to at most about amino acid 420, amino acid 820 to at most about amino acid 430, amino acid 820 to at most about amino acid 440, amino acid 820 to at most about amino acid 450, amino acid 820 to at most about amino acid 460, amino acid 820 to at most about amino acid 470, amino acid 820 to at most about amino acid 480, amino acid 820 to at most about amino acid 490, amino acid 820 to at most about amino acid 500, amino acid 820 to at most about amino acid 510, amino acid 820 to at most about amino acid 520, amino acid 820 to at most about amino acid 530, amino acid 820 to at most about amino acid 540, amino acid 820 to at most about amino acid 550, amino acid 820 to at most about amino acid 560, amino acid 820 to at most about amino acid 570, amino acid 820 to at most about amino acid 580, amino acid 820 to at most about amino acid 590, amino acid 820 to at most about amino acid 600, amino acid 820 to at most about amino acid 610, amino acid 820 to at most about amino acid 620, amino acid 820 to at most about amino acid 630, amino acid 820 to at most about amino acid 640, amino acid 820 to at most about amino acid 650, amino acid 820 to at most about amino acid 660, amino acid 820 to at most about amino acid 670, amino acid 820 to at most about amino acid 680, amino acid 820 to at most about amino acid 690, amino acid 820 to at most about amino acid 700, amino acid 820 to at most about amino acid 710, amino acid 820 to at most about amino acid 720, amino acid 820 to at most about amino acid 730, amino acid 820 to at most about amino acid 740, amino acid 820 to at most about amino acid 750, amino acid 820 to at most about amino acid 760, amino acid 820 to at most about amino acid 770, amino acid 820 to at most about amino acid 780, amino acid 820 to at most about amino acid 790, amino acid 820 to at most about amino acid 800, amino acid 820 to at most about amino acid 810, or amino acid 820 to at most about amino acid 820, or any length sequence there between of an inner ear cell target protein (e.g., SEQ ID NO: 6 or 7).
[0429] In some embodiments, splice sites are involved in trans-splicing. In some embodiments, a splice donor site (Trapani et al. EMBO Mol. Med. 6(2):194-211, 2014, which is incorporated in its entirety herein by reference) follows the coding sequence in the N-terminal construct. In the C-terminal construct, a splice acceptor site may be subcloned just before the coding sequence for NDP or HSPA1A or other secreted target protein. In some embodiments, within the coding sequence, a silent mutation can be introduced, generating an additional site for restriction digestion.
[0430] In some embodiments, any of the constructs provided herein can be included in a composition suitable for administration to an animal for the amelioration of symptoms associated with syndromic and/or non-syndromic hearing loss.
[0431] Pharmaceutical Compositions
[0432] Among other things, the present disclosure provides pharmaceutical compositions. In some embodiments compositions provided herein are suitable for administration to an animal for the amelioration of symptoms associated with syndromic and/or non-syndromic hearing loss.
[0433] In some embodiments, pharmaceutical compositions of the present disclosure may comprise, e.g., a polynucleotide, e.g., one or more constructs, as described herein. In some embodiments, a pharmaceutical composition may comprise one or more AAV particles, e.g., one or more rAAV construct encapsidated by one or more AAV serotype capsids, as described herein.
[0434] In some embodiments, a pharmaceutical composition comprises one or more pharmaceutically or physiologically acceptable carriers, diluents or excipients. As used herein, the term “pharmaceutically acceptable carrier” includes solvents, dispersion media, coatings, antibacterial agents, antifungal agents, and the like that are compatible with pharmaceutical administration. Supplementary active compounds can also be incorporated into any of the compositions described herein. Such compositions may include one or more buffers, such as neutral-buffered saline, phosphate-buffered saline, and the like; one or more carbohydrates, such as glucose, mannose, sucrose, and dextran; mannitol; one or more proteins, polypeptides, or amino acids, such as glycine; one or more antioxidants; one or more chelating agents, such as EDTA or glutathione; and/or one or more preservatives. In some embodiments, formulations are in a dosage forms, such as injectable solutions, injectable gels, drug-release capsules, and the like.
[0435] In some embodiments, compositions of the present disclosure are formulated for intravenous administration. In some embodiments compositions of the present disclosure are formulated for intra-cochlear administration. In some embodiments, a therapeutic composition is formulated to comprise a lipid nanoparticle, a polymeric nanoparticle, a mini-circle DNA and/or a CELiD DNA.
[0436] In some embodiments, a therapeutic composition is formulated to comprise a synthetic perilymph solution. For example, in some embodiments, a synthetic perilymph solution includes 20-200 mM NaCl; 1-5 mM KCl; 0.1-10 mM CaCl.sub.2); 1-10 mM glucose; and 2-50 mM HEPES, with a pH between about 6 and about 9. In some embodiments, a therapeutic composition is formulated to comprise a physiologically suitable solution. For example, in some embodiments, a physiologically suitable solution comprises commercially available 1×PBS with pluronic acid F68, prepared to a final concentration of: 8.10 mM Sodium Phosphate Dibasic, 1.5 mM Monopotassium Phosphate, 2.7 mM Potassium Chloride, 172 mM Sodium Chloride, and 0.001% Pluronic Acid F68). In some embodiments, alternative pluronic acids are utilized. In some embodiments, alternative ion concentrations are utilized.
[0437] In some embodiments, any of the pharmaceutical compositions described herein may further comprise one or more agents that promote the entry of a nucleic acid or any of the constructs described herein into a mammalian cell (e.g., a liposome or cationic lipid).In some embodiments, any of the constructs described herein can be formulated using natural and/or synthetic polymers. Non-limiting examples of polymers that may be included in any of the compositions described herein can include, but are not limited to, DYNAMIC POLYCONJUGATE® (Arrowhead Research Corp., Pasadena, Calif.), formulations from Mirus Bio (Madison, Wis.) and Roche Madison (Madison, Wis.), PhaseRX polymer formulations such as, without limitation, SMARTT POLYMER TECHNOLOGY® (PhaseRX, Seattle, Wash.), DMRI/DOPE, poloxamer, VAXFECTIN® adjuvant from Vical (San Diego, Calif.), chitosan, cyclodextrin from Calando Pharmaceuticals (Pasadena, Calif.), dendrimers and poly (lactic-co-glycolic acid) (PLGA) polymers, RONDEL™ (RNAi/Oligonucleotide Nanoparticle Delivery) polymers (Arrowhead Research Corporation, Pasadena, Calif.), and pH responsive co-block polymers, such as, but not limited to, those produced by PhaseRX (Seattle, Wash.). Many of these polymers have demonstrated efficacy in delivering oligonucleotides in vivo into a mammalian cell (see, e.g., deFougerolles, Human Gene Ther. 19:125-132, 2008; Rozema et al., Proc. Natl. Acad. Sci. U.S.A. 104:12982-12887, 2007; Rozema et al., Proc. Natl. Acad. Sci. U.S.A. 104:12982-12887, 2007; Hu-Lieskovan et al., Cancer Res. 65:8984-8982, 2005; Heidel et al., Proc. Natl. Acad. Sci. U.S.A. 104:5715-5721, 2007, each of which is incorporated in its entirety herein by reference).
[0438] In some embodiments, a composition includes a pharmaceutically acceptable carrier (e.g., phosphate buffered saline, saline, or bacteriostatic water). Upon formulation, solutions will be administered in a manner compatible with a dosage formulation and in such amount as is therapeutically effective. Formulations are easily administered in a variety of dosage forms such as injectable solutions, injectable gels, drug-release capsules, and the like.
[0439] In some embodiments, a composition provided herein can be, e.g., formulated to be compatible with their intended route of administration. A non-limiting example of an intended route of administration is local administration (e.g., intra-cochlear administration).
[0440] In some embodiments, a provided composition comprises one nucleic acid construct. In some embodiments, a provided composition comprises two or more different constructs. In some embodiments, a composition that include a single nucleic acid construct comprising a coding sequence that encodes a secreted target protein and/or a functional characteristic portion thereof. In some embodiments, compositions comprise a single nucleic acid construct comprising a coding sequence that encodes a secreted target protein and/or a functional characteristic portion thereof, which, when introduced into a mammalian cell, that coding sequence is integrated into the genome of the mammalian cell. In some embodiments, a composition comprising at least two different constructs, e.g., constructs comprise coding sequences that encode a different portion of a secreted target protein, the constructs can be combined to generate a sequence encoding an active secreted target protein (e.g., a full-length secreted target protein) in a mammalian cell, and thereby treat associated syndromic or non-syndromic sensorineural hearing loss in a subject in need thereof.
[0441] In some embodiments, a single dose of any of the compositions described herein can include a total sum amount of the at least two different vectors of at least 1 ng, at least 2 ng, at least 4 ng, about 6 ng, about 8 ng, at least 10 ng, at least 20 ng, at least 30 ng, at least 40 ng, at least 50 ng, at least 60 ng, at least 70 ng, at least 80 ng, at least 90 ng, at least 100 ng, at least 200 ng, at least 300 ng, at least 400 ng, at least 500 ng, at least 1 μg, at least 2 μg, at least 4 μg, at least 6 μg, at least 8 μg, at least 10 μg, at least 12 μg, at least 14 μg, at least 16 μg, at least 18 μg, at least 20 μg, at least 22 μg, at least 24 μg, at least 26 μg, at least 28 μg, at least 30 pg at least 32 μg, at least 34 μg, at least 36 μg, at least 38 μg, at least 40 μg, at least 42 μg, at least 44 μg, at least 46 μg, at least 48 μg, at least 50 μg, at least 52 μg, at least 54 μg, at least 56 μg, at least 58 μg, at least 60 μg, at least 62 μg, at least 64 μg, at least 66 μg, at least 68 μg, at least 70 μg, at least 72 μg, at least 74 μg, at least 76 μg, at least 78 μg, at least 80 μg, at least 82 μg, at least 84 μg, at least 86 μg, at least 88 μg, at least 90 μg, at least 92 μg, at least 94 μg, at least 96 μg, at least 98 μg, at least 100 μg, at least 102 μg, at least 104 μg, at least 106 μg, at least 108 μg, at least 110 μg, at least 112 μg, at least 114 μg, at least 116 μg, at least 118 μg, at least 120 μg, at least 122 μg, at least 124 μg, at least 126 μg, at least 128 μg, at least 130 pg at least 132 μg, at least 134 μg, at least 136 μg, at least 138 μg, at least 140 μg, at least 142 μg, at least 144 μg, at least 146 μg, at least 148 μg, at least 150 μg, at least 152 μg, at least 154 μg, at least 156 μg, at least 158 μg, at least 160 μg, at least 162 μg, at least 164 μg, at least 166 μg, at least 168 μg, at least 170 μg, at least 172 μg, at least 174 μg, at least 176 μg, at least 178 μg, at least 180 μg, at least 182 μg, at least 184 μg, at least 186 μg, at least 188 μg, at least 190 μg, at least 192 μg, at least 194 μg, at least 196 μg, at least 198 μg, or at least 200 μg, e.g., in a buffered solution.
[0442] The compositions provided herein can be, e.g., formulated to be compatible with their intended route of administration. A non-limiting example of an intended route of administration is local administration (e.g., intra-cochlear administration).
[0443] In some embodiments, the therapeutic compositions are formulated to include a lipid nanoparticle. In some embodiments, the therapeutic compositions are formulated to include a polymeric nanoparticle. In some embodiments, the therapeutic compositions are formulated to comprise a mini-circle DNA. In some embodiments, the therapeutic compositions are formulated to comprise a CELiD DNA. In some embodiments, the therapeutic compositions are formulated to comprise a synthetic perilymph solution. An exemplary synthetic perilymph solution includes 20-200 mM NaCl; 1-5 mM KCl; 0.1-10 mM CaCl.sub.2; 1-10 mM glucose; 2-50 mM HEPES, having a pH of between about 6 and about 9.
[0444] Also provided are kits including any of the compositions described herein. In some embodiments, a kit can include a solid composition (e.g., a lyophilized composition including the at least two different vectors described herein) and a liquid for solubilizing the lyophilized composition. In some embodiments, a kit can include a pre-loaded syringe including any of the compositions described herein.
[0445] In some embodiments, the kit includes a vial comprising any of the compositions described herein (e.g., formulated as an aqueous composition, e.g., an aqueous pharmaceutical composition).
[0446] In some embodiments, the kits can include instructions for performing any of the methods described herein.
[0447] Also provided are kits including any of the compositions described herein. In some embodiments, a kit can include a solid composition (e.g., a lyophilized composition including the at least two different constructs described herein) and a liquid for solubilizing the lyophilized composition. In some embodiments, a kit can include a pre-loaded syringe including any of the compositions described herein.
Genetically Modified Cells
[0448] The present disclosure also provides a cell (e.g., an animal cell, e.g., a mammalian cell, e.g., a primate cell, e.g., a human cell) that includes any of the nucleic acids, constructs or compositions described herein. In some embodiments, an animal cell is a human cell (e.g., a human supporting cell or a human hair cell). In other embodiments, an animal cell is a non-human mammal (e.g., Simian cell, Felidae cell, Canidae cell etc.). A person skilled in the art will appreciate that the nucleic acids and constructs described herein can be introduced into any animal cell (e.g., the supporting or hair cells of any animal suitable for veterinary intervention). Non-limiting examples of constructs and methods for introducing constructs into animal cells are described herein.
[0449] In some embodiments, an animal cell can be any cell of the inner ear, including hair and/or supporting cells. Non-limiting examples such cells include: Hensen's cells, Deiters' cells, cells of the endolymphatic sac and duct, transitional cells in the saccule, utricle, and ampulla, inner and outer hair cells, spiral ligament cells, spiral ganglion cells, spiral prominence cells, external saccule cells, marginal cells, intermediate cells, basal cells, inner pillar cells, outer pillar cells, Claudius cells, inner border cells, inner phalangeal cells, or cells of the stria vascularis.
[0450] In some embodiments, an animal cell is a specialized cell of the cochlea. In some embodiments, an animal cell is a hair cell. In some embodiments, an animal cell is a cochlear inner hair cell or a cochlear outer hair cell. In some embodiments, an animal cell is a cochlear inner hair cell. In some embodiments, an animal cell is a cochlear outer hair cell. In some embodiments, an animal cell is in vitro. In some embodiments, an animal cell is of a cell type which is endogenously present in an animal, e.g., in a primate and/or human. In some embodiments, an animal cell is an autologous cell obtained from an animal and cultured ex vivo.
[0451] Also provided herein is a cell (e.g., a mammalian cell) that includes any of the nucleic acids, vectors (e.g., at least two different vectors described herein), or compositions described herein. Skilled practitioners will appreciate that the nucleic acids and vectors described herein can be introduced into any mammalian cell. Non-limiting examples of vectors and methods for introducing vectors into mammalian cells are described herein.
[0452] In some embodiments, the cell is a human cell, a mouse cell, a porcine cell, a rabbit cell, a dog cell, a cat cell, a rat cell, or a non-human primate cell. In some embodiments, the cell is a specialized cell of the cochlea. In some embodiments, the cell is a cochlear hair cell, such as a cochlear inner hair cell, a cochlear out hair cell, a supporting cell, a ganglion cell, a clear cell, a cuboidal cell, a cartilage cell, a cell of the tegmentum vasculosum, a homogene cell, a Hensen's cell, a Deiters' cell, a pillar cell, or a border cell. In some embodiments, the cell is an ocular cell (e.g. a retinal cell, a retinal ganglion cell, an amacrine cell, a horizontal cell, a bipolar cell, a photoreceptor cell).
[0453] In some embodiments, the mammalian cell is in vitro. In some embodiments, the mammalian cell is present in a mammal. In some embodiments, the mammalian cell is an autologous cell obtained from a subject and cultured ex vivo.
Methods
[0454] Among other things, the present disclosure provides methods. In some embodiments, a method comprises introducing a composition as described herein into the inner ear (e.g., a cochlea) of a subject. For example, provided herein are methods that in some embodiments include administering to an inner ear (e.g., cochlea) of a subject (e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human) a therapeutically effective amount of any composition described herein. In some embodiments of any of these methods, the subject has been previously identified as having a defective inner ear cell target gene (e.g., a supporting and/or hearing cell target gene having a mutation that results in a decrease in the expression and/or activity of a supporting and/or hearing cell target protein encoded by the gene). Some embodiments of any of these methods further include, prior to the introducing or administering step, determining that the subject has a defective inner ear cell target gene. Some embodiments of any of these methods can further include detecting a mutation in an inner ear cell target gene in a subject. Some embodiments of any of the methods can further include identifying or diagnosing a subject as having non-syndromic or syndromic sensorineural hearing loss.
[0455] In some embodiments, provided herein are methods of correcting an inner ear cell target gene defect (e.g., a defect in a secreted target protein gene) in an inner ear of a subject, e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human. In some embodiments, methods include administering to the inner ear of a subject a therapeutically effective amount of any of the compositions described herein, where the administering repairs and or ameliorates the inner ear cell target gene defect in any cell subset of the inner ear of a subject. In some embodiments, the inner ear target cell may be a sensory cell, e.g., a hair cell, and/or a non-sensory cell, e.g., a supporting cell, and/or all or any subset of inner ear cells.
[0456] Also provided herein are methods that include introducing into an inner ear of a mammal a therapeutically effective amount of any of the compositions described herein.
[0457] Also provided herein are methods of increasing expression of a full-length secreted target protein in any subset of inner ear cells (e.g., mammalian cells, e.g., animal cells, e.g., primate cells, e.g., human cells) that include introducing any of the compositions described herein into the cell.
[0458] Also provided herein are methods of increasing the expression level of a full-length secreted target protein in any subset of inner ear cells of a subject (e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human) that include introducing into the inner ear of the mammal a therapeutically effective amount of any of the compositions described herein. Also provided herein are methods of increasing the expression level of a full-length secreted target protein in any subset of an inner ear cells of a subject (e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human) that include administering to the inner ear of the subject a therapeutically effective amount of any of the compositions described herein, where the administering results in an increase in the expression level of the inner ear cell target protein (e.g., a secreted target protein) in any cell subset of the inner ear of a subject. In some embodiments, the inner ear target cell may be a sensory cell, e.g., a hair cell, and/or a non-sensory cell, e.g., a supporting cell, and/or all or any subset of inner ear cells.
[0459] Also provided herein are methods of treating syndromic and non-syndromic sensorineural hearing loss in a subject identified as having a defective secreted target gene that include: administering a therapeutically effective amount of any of the composition described herein into the inner ear of the subject.
[0460] Also provided herein are methods of treating or preventing hearing loss, e.g., non-syndromic sensorineural hearing loss or syndromic sensorineural hearing loss in a subject (e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human) identified as having a defective NDP gene or defective HSPA1A gene that include: administering a therapeutically effective amount of any of the compositions described herein into an inner ear of the subject.
[0461] Also provided herein are methods of treating or preventing vision loss in a subject (e.g., an animal, e.g., a mammal, e.g., a primate, e.g., a human) identified as having a defective NDP gene or defective HSPA1A gene that include: administering a therapeutically effective amount of any of the compositions described herein into an inner ear or central nervous system of the subject, or systemically administering a therapeutically effective amount of any of these compositions described herein to the subject.
[0462] In some embodiments of any of the methods described herein, the subject (e.g., animal, e.g., mammal, e.g., primate, e.g., human) has been previously identified as having a defective secreted target gene (e.g., a defective NDP gene or a defective HSPA1A gene) (e.g., a NDP gene having a mutation that results in a decrease in the expression and/or activity of a NDP protein encoded by the gene, or a HSPA1A gene having a mutation that results in a decrease in the expression and/or activity of a HSPA1A protein encoded by the gene). Some embodiments of any of these methods further include, prior to the introducing or administering step, determining that the subject has a defective secreted target gene (e.g., a defective NDP gene or a defective HSPA1A gene). Some embodiments of any of these methods can further include detecting a mutation in a secreted target gene (e.g., a NDP gene or a HSPA1A gene) in a subject. Some embodiments of any of the methods can further include identifying or diagnosing a subject as having hearing loss and/or vision loss.
[0463] In some embodiments of any of these methods, two or more doses of any of the compositions described herein are introduced or administered into the cochlea of the mammal or subject. Some embodiments of any of these methods can include introducing or administering a first dose of the composition into the cochlea of the mammal or subject, assessing hearing function of the mammal or subject following the introducing or the administering of the first dose, and administering an additional dose of the composition into the cochlea of the mammal or subject found not to have a hearing function within a normal range (e.g., as determined using any test for hearing known in the art).
[0464] In some embodiments of any of the methods described herein, the composition can be formulated for intra-cochlear administration. In some embodiments of any of the methods described herein, the compositions described herein can be administered via intra-cochlear administration or local administration. In some embodiments of any of the methods described herein, the compositions are administered through the use of a medical device (e.g., any of the exemplary medical devices described herein).
[0465] Also provided herein are methods of restoring synapses and/or preserving spiral ganglion nerves in a subject identified or diagnosed as having an inner ear disorder that include: administering to the inner ear of the subject a therapeutically effective amount of any of the compositions described herein.
[0466] Also provided herein are methods of reducing the size of, and/or restoring the vestibular aqueduct to an appropriate size. Also provided herein are methods of restoring endolymphatic pH to an appropriate and/or acceptable level in a subject identified or diagnosed as having an inner ear disorder that include: administering to the inner ear of the subject a therapeutically effective amount of any of the compositions described herein.
[0467] Also provided herein are methods that include administering to an inner ear of a subject a therapeutically effective amount of any of the compositions described herein.
[0468] Also provided herein are surgical methods for treatment of hearing loss (e.g., non-syndromic sensorineural hearing loss or syndromic sensorineural hearing loss). In some embodiments, the methods include the steps of: introducing into a cochlea of a subject a first incision at a first incision point; and administering intra-cochlearly a therapeutically effective amount of any of the compositions provided herein. In some embodiments, the composition is administered to the subject at the first incision point. In some embodiments, the composition is administered to the subject into or through the first incision.
[0469] In some embodiments of any of the methods described herein, any composition described herein is administered to the subject into or through the cochlea oval window membrane. In some embodiments of any of the methods described herein, any of the compositions described herein is administered to the subject into or through the cochlea round window membrane. In some embodiments of any of the methods described herein, the composition is administered using a medical device capable of creating a plurality of incisions in the round window membrane. In some embodiments, the medical device includes a plurality of micro-needles. In some embodiments, the medical device includes a plurality of micro-needles including a generally circular first aspect, where each micro-needle has a diameter of at least about 10 microns. In some embodiments, the medical device includes a base and/or a reservoir capable of holding the composition. In some embodiments, the medical device includes a plurality of hollow micro-needles individually including a lumen capable of transferring the composition. In some embodiments, the medical device includes a means for generating at least a partial vacuum.
[0470] In some embodiments, technologies of the present disclosure are used to treat subjects with or at risk of hearing loss. For example, in some embodiments, a subject has an autosomal recessive hearing loss attributed to at least one pathogenic variant of NDP or HSPA1A. It will be understood by those in the art that many different mutations in NDP or HSPA1A can result in a pathogenic variant. In some such embodiments, a pathogenic variant causes or is at risk of causing hearing loss.
[0471] In some embodiments, intra-cochlear administration can be performed using any of the methods described herein or known in the art. For example, a composition can be administered or introduced into the cochlea using the following surgical technique: first using visualization with a 0 degree, 2.5-mm rigid endoscope, the external auditory canal is cleared and a round knife is used to sharply delineate an approximately 5-mm tympanomeatal flap. The tympanomeatal flap is then elevated and the middle ear is entered posteriorly. The chorda tympani nerve is identified and divided, and a currette is used to remove the scutal bone, exposing the round window membrane. To enhance apical distribution of the administered or introduced composition, a surgical laser may be used to make a small 2-mm fenestration in the oval window to allow for perilymph displacement during trans-round window membrane infusion of the composition. The microinfusion device is then primed and brought into the surgical field. The device is maneuvered to the round window, and the tip is seated within the bony round window overhang to allow for penetration of the membrane by the microneedle(s). The footpedal is engaged to allow for a measured, steady infusion of the composition. The device is then withdrawn and the round window and stapes foot plate are sealed with a gelfoam patch.
[0472] In some embodiments of any of these methods, two or more doses of any of the compositions described herein are introduced or administered into the eye of the mammal or subject. Some embodiments of any of these methods can include introducing or administering a first dose of the composition into the eye (e.g., intraocular space) of the mammal or subject, assessing hearing function of the mammal or subject following the introducing or the administering of the first dose, and administering an additional dose of the composition into the eye of the mammal or subject found not to have a vision within a normal range (e.g., as determined using any test for vision known in the art).
[0473] In some embodiments of any of the methods described herein, the composition can be formulated for intra-ocular administration. In some embodiments of any of the methods described herein, the compositions described herein can be administered via intra-ocular administration or local administration. In some embodiments of any of the methods described herein, the composition if formulated for systemic administration.
[0474] In some embodiments, intra-ocular administration can be performed using any of the methods described herein or known in the art.
[0475] In some embodiments of any of the methods described herein, the subject or mammal is a rodent, a non-human primate, or a human. In some embodiments of any of the methods described herein, the subject or mammal is an adult, a teenager, a juvenile, a child, a toddler, an infant, or a newborn. In some embodiments of any of the methods described herein, the subject or mammal is 1-5, 1-10, 1-20, 1-30, 1-40, 1-50, 1-60, 1-70, 1-80, 1-90, 1-100, 1-110, 2-5, 2-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-110, 10-30, 10-40, 10-50, 10-60, 10-70, 10-80, 10-90, 10-100, 10-110, 20-40, 20-50, 20-60, 20-70, 20-80, 20-90, 20-100, 20-110, 30-50, 30-60, 30-70, 30-80, 30-90, 30-100, 40-60, 40-70, 40-80, 40-90, 40-100, 50-70, 50-80, 50-90, 50-100, 60-80, 60-90, 60-100, 70-90, 70-100, 70-110, 80-100, 80-110, or 90-110 years of age. In some embodiments of any of the methods described herein, the subject or mammal is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or 11 months of age.
[0476] In some embodiments of any of the methods described herein, the subject or mammal has or is at risk of developing hearing loss and/or vision loss (e.g., Norrie Disease Pseudoglioma). In some embodiments of any of the methods described herein, the subject or mammal has been previously identified as having a mutation in a secreted target gene (e.g., a NDP gene or a HSPA1A gene). In some embodiments of any of the methods described herein, the subject or mammal has any of the mutations in a secreted target gene (e.g., a NDP gene or a HSPA1A gene) that are described herein or are known in the art to be associated with hearing loss and/or vision loss.
[0477] In some embodiments of any of the methods described herein, the subject or mammal has been identified as being a carrier of a mutation in a secreted target gene (e.g., a NDP gene or a HSPA1A gene) (e.g., via genetic testing). In some embodiments of any of the methods described herein, the subject or human has been identified as having a mutation in a secreted target gene (e.g., a NDP gene or a HSPA1A gene) and has been diagnosed with hearing loss and/or vision loss (e.g., Norrie Disease Pseudoglioma).
[0478] In some embodiments, successful treatment of hearing loss (e.g., Usher syndrome type III) can be determined in a subject using any of the conventional functional hearing tests known in the art. Non-limiting examples of functional hearing tests are various types of audiometric assays (e.g., pure-tone testing, speech testing, test of the middle ear, auditory brainstem response, and otoacoustic emissions).
[0479] In some embodiments, successful treatment of vision loss can be determined in a subject using any of the conventional functional vision tests known in the art. Non-limiting examples of functional retinal and vision tests are acuity testing, intraocular pressure (IOP) testing, and an electroretinogram (ERG).
[0480] Also provided herein are methods of increasing expression of an active secreted target protein (e.g., a full-length secreted target protein (e.g., a full-length NDP protein (e.g., any of the exemplary full-length NDP proteins described herein) or a full-length HSPA1A protein (e.g., any of the exemplary full-length HPSA1A proteins described herein)) in a mammalian cell that include introducing any of the compositions described herein into the mammalian cell. In some embodiments of these methods, the mammalian cell is a cochlear hair cell (e.g., an inner hair cell, an outer hair cell) or an ocular cell (e.g., a retinal cell). In some embodiments of these methods, the mammalian cell is a human cell (e.g., a human cochlear hair cell). In some embodiments of these methods, the mammalian cell is in vitro. In some embodiments of these methods, the mammalian cell is in a mammal. In some embodiments of these methods, the mammalian cell is originally obtained from a mammal and is cultured ex vivo. In some embodiments, the mammalian cell has previously been determined to have a defective secreted target gene (e.g., a defective NDP gene or a defective HSPA1A gene).Methods for introducing any of the compositions described herein into a mammalian cell are known in the art (e.g., via lipofection or through the use of a viral vector, e.g., any of the viral vectors described herein).
[0481] An increase in expression of an active secreted target protein (e.g., a full-length secreted target protein (e.g., a full-length NDP protein or a full-length HSPA1A protein)) as described herein is, e.g., as compared to a control or to the level of expression of an active secreted target protein (e.g., a full-length secreted target protein (e.g., a full-length NDP protein or a full-length HSPA1A protein)) prior to the introduction of the vector(s).
[0482] Methods of detecting expression and/or activity of a secreted target protein (e.g., a NDP protein or a HSPA1A protein) are known in the art. In some embodiments, the level of expression of a secreted target protein (e.g., a NDP protein or a HSPA1A protein) can be detected directly (e.g., detecting NDP protein or detecting NDP mRNA, or detecting HSPA1A protein or detecting HSPA1A mRNA). Non-limiting examples of techniques that can be used to detect expression and/or activity of a secreted target protein directly include: real-time PCR, Western blotting, immunoprecipitation, immunohistochemistry, ELISA or immunofluorescence. In some embodiments, expression of a secreted target protein (e.g., a NDP protein or a HSPA1A protein) can be detected indirectly (e.g., through functional hearing tests, functional retinal and vision tests).
[0483] Administration
[0484] Provided herein are therapeutic delivery systems for treating hearing loss and/or vision loss (e.g., Norrie Disease Pseudoglioma). In one aspect, the therapeutic delivery systems include i) a medical device capable of creating one or a plurality of incisions in a round window membrane of an inner ear of a human subject in need thereof, and ii) an effective dose of a composition (e.g., any of the compositions described herein). In some embodiments, the medical device includes a plurality of micro-needles.
[0485] Also provided herein are surgical methods for treatment of hearing loss (e.g., Norrie Disease Pseudoglioma). In some embodiments, the methods include the steps of: introducing into a cochlea of a human subject a first incision at a first incision point; and administering intra-cochlearly a therapeutically effective amount of any of the compositions provided herein. In some embodiments, the composition is administered to the subject at the first incision point. In some embodiments, the composition is administered to the subject into or through the first incision.
[0486] In some embodiments of any of the methods described herein, any of the compositions described herein is administered to the subject into or through the cochlea oval window membrane. In some embodiments of any of the methods described herein, any of the compositions described herein is administered to the subject into or through the cochlea round window membrane. In some embodiments of any of the methods described herein, the composition is administered using a medical device capable of creating a plurality of incisions in the round window membrane. In some embodiments, the medical device includes a plurality of micro-needles. In some embodiments, the medical device includes a plurality of micro-needles including a generally circular first aspect, where each micro-needle has a diameter of at least about 10 microns. In some embodiments, the medical device includes a base and/or a reservoir capable of holding the composition. In some embodiments, the medical device includes a plurality of hollow micro-needles individually including a lumen capable of transferring the composition. In some embodiments, the medical device includes a means for generating at least a partial vacuum.
[0487] Also provided herein are surgical methods for treatment of vision loss (e.g., Norrie Disease Pseudoglioma). In some embodiments, the methods include the steps of: administering intra-ocularly a therapeutically effective amount of any of the compositions provided herein.
[0488] Evaluating Hearing Loss and Recovery
[0489] In some embodiments, hearing function is determined using auditory brainstem response measurements (ABR). In some embodiments, hearing is tested by measuring distortion product optoacoustic emissions (DPOAEs). In some such embodiments, measurements are taken from one or both ears of a subject. In some such embodiments, recordings are compared to prior recordings for the same subject and/or known thresholds on such response measurements used to define, e.g., hearing loss versus acceptable hearing ranges to be defined as normal hearing. In some embodiments, a subject has ABR and/or DPOAE measurements recorded prior to receiving any treatment. In some embodiments, a subject treated with one or more technologies described herein will have improvements on ABR and/or DPOAE measurements after treatment as compared to before treatment. In some embodiments, ABR and/or DPOAE measurements are taken after treatment is administered and at regular follow-up intervals post-treatment.
[0490] In some embodiments, hearing function is determined using speech pattern recognition or is determined by a speech therapist. In some embodiments, hearing function is determined by pure tone testing. In some embodiments, hearing function is determined by bone conduction testing. In some embodiments, hearing function is determined by acoustic reflex testing. In some embodiments hearing function is determined by tympanometry. In some embodiments, hearing function is determined by any combination of hearing analysis known in the art. In some such embodiments, measurements are taken holistically, and/or from one or both ears of a subject. In some such embodiments, recordings and/or professional analysis are compared to prior recordings and/or analysis for the same subject and/or known thresholds on such response measurements used to define, e.g., hearing loss versus acceptable hearing ranges to be defined as normal hearing. In some embodiments, a subject has speech pattern recognition, pure tone testing, bone conduction testing, acoustic reflex testing and/or tympanometry measurements and/or analysis conducted prior to receiving any treatment. In some embodiments a subject treated with one or more technologies described herein will have improvements on speech pattern recognition, pure tone testing, bone conduction testing, acoustic reflex testing and/or tympanometry measurements after treatment as compared to before treatment. In some embodiments, speech pattern recognition, pure tone testing, bone conduction testing, acoustic reflex testing and/or tympanometry measurements are taken after treatment is administered and at regular follow-up intervals post-treatment.
[0491] Methods of Characterizing
[0492] The term “mutation in a secreted target protein (or NDP or HSPA1A) gene” refers to a modification in a known consensus functional secreted target protein (or NDP or HSPA1A) gene that results in the production of a secreted target protein (or NDP protein or HSPA1A protein) having one or more of: a deletion in one or more amino acids, one or more amino acid substitutions, and one or more amino acid insertions as compared to the consensus functional secreted target protein, and/or results in a decrease in the expressed level of the encoded secreted target protein in a mammalian cell as compared to the expressed level of the encoded secreted target protein in a mammalian cell not having a mutation. In some embodiments, a mutation can result in the production of a secreted target protein having a deletion in one or more amino acids (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 16, 17, 18, 19, 20, or more amino acids). In some embodiments, the mutation can result in a frameshift in the NDP gene. The term “frameshift” is known in the art to encompass any mutation in a coding sequence that results in a shift in the reading frame of the coding sequence. In some embodiments, a frameshift can result in a nonfunctional protein. In some embodiments, a point mutation can be a nonsense mutation (i.e., result in a premature stop codon in an exon of the gene). A nonsense mutation can result in the production of a truncated protein (as compared to a corresponding consensus functional protein) that may or may not be functional. In some embodiments, the mutation can result in the loss (or a decrease in the level) of expression of NDP mRNA or secreted target protein or both the mRNA and protein. In some embodiments, the mutation can result in the production of an altered secreted target protein having a loss or decrease in one or more biological activities (functions) as compared to a consensus functional secreted target protein.
[0493] In some embodiments, the mutation is an insertion of one or more nucleotides into a secreted target protein (or NDP or HSPA1A) gene. In some embodiments, the mutation is in a regulatory and/or control sequence of the secreted target gene, i.e., a portion of the gene that is not coding sequence. In some embodiments, a mutation in a regulatory and/or control sequence may be in a promoter or enhancer region and prevent or reduce the proper transcription of the NDP gene or HSPA1A gene. In some embodiments, a mutation is in a known heterologous gene known to interact with a secreted target protein, or the NDP gene or the HSPA1A.
[0494] Methods of genotyping and/or detecting expression or activity of secreted target protein (or NDP or HSPA1A) mRNA and/or secreted target protein are known in the art (see e.g., Ito et al., World J Otorhinolaryngol. 2013 May 28; 3(2): 26-34, and Roesch et al., Int J Mol Sci. 2018 January; 19(1): 209, each of which is incorporated in its entirety herein by reference). In some embodiments, level of expression of secreted target protein (or NDP or HSPA1A) mRNA or secreted target protein may be detected directly (e.g., detecting secreted target protein, e.g., detecting secreted target protein (or NDP or HSPA1A) mRNA etc.). Non-limiting examples of techniques that can be used to detect expression and/or activity of secreted target protein (or NDP or HSPA1A) directly include, e.g., real-time PCR, quantitative real-time PCR, Western blotting, immunoprecipitation, immunohistochemistry, mass spectrometry, or immunofluorescence. In some embodiments, expression of secreted target protein (or NDP or HSPA1A) can be detected indirectly (e.g., through functional hearing tests, ABRs, DPOAEs, etc.).
[0495] In some embodiments, tissue samples (e.g., comprising one or more inner ear cells, e.g., comprising one or more hair cells and/or one or more supporting cells) may be evaluated via morphological analysis to determine morphology of hair cells and/or support cells before and after administration of any agents (e.g., compositions, e.g., compositions comprising constructs, and/or particles, etc.) as described herein. In some such embodiments, standard immunohistochemical or histological analyses may be performed. In some embodiments, if cells are used in vitro or ex vivo, additional immunocytochemical or immunohistochemical analyses may be performed. In some embodiments, one or more assays of one or more proteins or transcripts (e.g., western blot, ELISA, polymerase chain reactions) may be performed on one or more samples from a subject or in vitro cell populations.
[0496] Production Methods
[0497] AAV systems are generally well known in the art (see, e.g., Kelleher and Vos, Biotechniques, 17(6):1110-17 (1994); Cotten et al., P.N.A.S. U.S.A., 89(13):6094-98 (1992); Curiel, Nat Immun, 13(2-3):141-64 (1994); Muzyczka, Curr Top Microbiol Immunol, 158:97-129 (1992); and Asokan A, et al., Mol. Ther., 20(4):699-708 (2012), each of which is incorporated in its entirety herein by reference). Methods for generating and using AAV constructs are described, for example, in U.S. Pat. Nos. 5,139,941, 4,797,368 and PCT filing application US2019/060328, each of which is incorporated in its entirety herein by reference.
[0498] Methods for obtaining viral constructs are known in the art. For example, to produce AAV constructs, the methods typically involve culturing a host cell which contains a nucleic acid sequence encoding an AAV capsid protein or fragment thereof; a functional rep gene; a recombinant AAV construct composed of AAV inverted terminal repeats (ITRs) and a coding sequence; and/or sufficient helper functions to permit packaging of the recombinant AAV construct into the AAV capsid proteins.
[0499] In some embodiments, components to be cultured in a host cell to package an AAV construct in an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more components (e.g., recombinant AAV construct, rep sequences, cap sequences, and/or helper functions) may be provided by a stable host cell that has been engineered to contain one or more such components using methods known to those of skill in the art. In some embodiments, such a stable host cell contains such component(s) under the control of an inducible promoter. In some embodiments, such component(s) may be under the control of a constitutive promoter. In some embodiments, a selected stable host cell may contain selected component(s) under the control of a constitutive promoter and other selected component(s) under the control of one or more inducible promoters. For example, a stable host cell may be generated that is derived from HEK293 cells (which contain E1 helper functions under the control of a constitutive promoter), but that contain the rep and/or cap proteins under the control of inducible promoters. Other stable host cells may be generated by one of skill in the art using routine methods.
[0500] Recombinant AAV construct, rep sequences, cap sequences, and helper functions required for producing an AAV of the disclosure may be delivered to a packaging host cell using any appropriate genetic element (e.g., construct). A selected genetic element may be delivered by any suitable method known in the art, e.g., to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques (see, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y., which is incorporated in its entirety herein by reference). Similarly, methods of generating AAV particles are well known and any suitable method can be used with the present disclosure (see, e.g., K. Fisher et al, J. Virol., 70:520-532 (1993) and U.S. Pat. No. 5,478,745, which are incorporated in their entirety herein by reference).
[0501] In some embodiments, recombinant AAVs may be produced using a triple transfection method (e.g., as described in U.S. Pat. No. 6,001,650, which is incorporated in its entirety herein by reference). In some embodiments, recombinant AAVs are produced by transfecting a host cell with a recombinant AAV construct (comprising a coding sequence) to be packaged into AAV particles, an AAV helper function construct, and an accessory function construct. An AAV helper function construct encodes “AAV helper function” sequences (i.e., rep and cap), which function in trans for productive AAV replication and encapsidation. In some embodiments, the AAV helper function construct supports efficient AAV construct production without generating any detectable wild type AAV particles (i.e., AAV particles containing functional rep and cap genes). Non-limiting examples of constructs suitable for use with the present disclosure include pHLP19 (see, e.g., U.S. Pat. No. 6,001,650, which is incorporated in its entirety herein by reference) and pRep6cap6 construct (see, e.g., U.S. Pat. No. 6,156,303, which is incorporated in its entirety herein by reference). An accessory function construct encodes nucleotide sequences for non-AAV derived viral and/or cellular functions upon which AAV is dependent for replication (i.e., “accessory functions”). Accessory functions may include those functions required for AAV replication, including, without limitation, those moieties involved in activation of AAV gene transcription, stage specific AAV mRNA splicing, AAV DNA replication, synthesis of cap expression products, and AAV capsid assembly. Viral-based accessory functions can be derived from any known helper viruses such as adenovirus, herpesvirus (other than herpes simplex virus type-1), and vaccinia virus.
[0502] Additional methods for generating and isolating AAV viral constructs suitable for delivery to a subject are described in, e.g., U.S. Pat. Nos. 7,790,449; 7,282,199; WO 2003/042397; WO 2005/033321, WO 2006/110689; and U.S. Pat. No. 7,588,772, each of which is incorporated in its entirety herein by reference. In one system, a producer cell line is transiently transfected with a construct that encodes a coding sequence flanked by ITRs and a construct(s) that encodes rep and cap. In another system, a packaging cell line that stably supplies rep and cap is transiently transfected with a construct encoding a coding sequence flanked by ITRs. In each of these systems, AAV particles are produced in response to infection with helper adenovirus or herpesvirus, and AAVs are separated from contaminating virus. Other systems do not require infection with helper virus to recover the AAV—the helper functions (i.e., adenovirus E1, E2a, VA, and E4 or herpesvirus UL5, UL8, UL52, and UL29, and herpesvirus polymerase) are also supplied, in trans, by the system. In such systems, helper functions can be supplied by transient transfection of the cells with constructs that encode the helper functions, or the cells can be engineered to stably contain genes encoding the helper functions, the expression of which can be controlled at the transcriptional or posttranscriptional level.
[0503] In some embodiments, viral construct titers post-purification are determined. In some embodiments, titers are determined using quantitative PCR. In certain embodiments, a TaqMan probe specific to a construct is utilized to determine construct levels. In certain embodiments, the TaqMan probe is represented by SEQ ID NO: 49, while forward and reverse amplifying primers are exemplified by SEQ ID NO: 54 and 55 respectively.
TABLE-US-00034 Exemplary TaqMan probe for quantification of constructs (SEQ ID NO: 49) /56-FAM/TAATTCCAA/ZEN/CCAGCAGAGTCAGGGC/3IABkFQ/ Exemplary forward qPCR primer for quantification of constructs (SEQ ID NO: 54) GATACAGCTAGAGTCCTGATTGC Exemplary reverse qPCR primer for quantification of constructs (SEQ ID NO: 55) GATCTGCCAAGTACCTCACTATG
[0504] As described herein, in some embodiments, a viral construct of the present disclosure is an adeno-associated virus (AAV) construct. Several AAV serotypes have been characterized, including AAV1, AAV2, AAV3 (e.g., AAV3B), AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, and AAV Anc80, as well as variants thereof. In some embodiments, an AAV particle is an AAV2/6, AAV2/8, AAV2/9, or AAV2/Anc80 particle (e.g., with AAV6, AAV8, AAV9 or Anc80 capsid and construct with AAV2 ITR). Other AAV particles and constructs are described in, e.g., Sharma et al., Brain Res Bull. 2010 Feb. 15; 81(2-3): 273, which is incorporated in its entirety herein by reference. Generally, any AAV particle may be used to deliver a coding sequence described herein. However, the serotypes have different tropisms, e.g., they preferentially infect different tissues. In some embodiments, an AAV construct is a self-complementary AAV construct.
[0505] The present disclosure provides, among other things, methods of making AAV-based constructs. In some embodiments, such methods include use of host cells. In some embodiments, a host cell is a mammalian cell. A host cell may be used as a recipient of an AAV helper construct, an AAV minigene plasmid, an accessory function construct, and/or other transfer DNA associated with the production of recombinant AAVs. The term includes the progeny of an original cell that has been transfected. Thus, a “host cell” as used herein may refer to a cell that has been transfected with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate mutation.
[0506] Additional methods for generating and isolating AAV particles suitable for delivery to a subject are described in, e.g., U.S. Pat. Nos. 7,790,449; 7,282,199; WO 2003/042397; WO 2005/033321, WO 2006/110689; and U.S. Pat. No. 7,588,772, each of which is incorporated in its entirety herein by reference. In one system, a producer cell line is transiently transfected with a construct that encodes a coding sequence flanked by ITRs and a construct(s) that encodes rep and cap. In another system, a packaging cell line that stably supplies rep and cap is transiently transfected with a construct encoding a coding sequence flanked by ITRs. In each of these systems, AAV particles are produced in response to infection with helper adenovirus or herpesvirus, and AAV particles are separated from contaminating virus. Other systems do not require infection with helper virus to recover the AAV particles—the helper functions (i.e., adenovirus E1, E2a, VA, and E4 or herpesvirus UL5, UL8, UL52, and UL29, and herpesvirus polymerase) are also supplied, in trans, by the system. In such systems, helper functions can be supplied by transient transfection of the cells with constructs that encode the helper functions, or the cells can be engineered to stably contain genes encoding the helper functions, the expression of which can be controlled at the transcriptional or posttranscriptional level.
[0507] In yet another system, a coding sequence flanked by ITRs and rep/cap genes are introduced into insect host cells by infection with baculovirus-based constructs. Such production systems are known in the art (see generally, e.g., Zhang et al., 2009, Human Gene Therapy 20:922-929, which is incorporated in its entirety herein by reference). Methods of making and using these and other AAV production systems are also described in U.S. Pat. Nos. 5,139,941; 5,741,683; 6,057,152; 6,204,059; 6,268,213; 6,491,907; 6,660,514; 6,951,753; 7,094,604; 7,172,893; 7,201,898; 7,229,823; and 7,439,065, each of which is incorporated in its entirety herein by reference.
EXAMPLES
[0508] The disclosure is further described in detail by reference to the following experimental examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified. Thus, the disclosure should in no way be construed as being limited to the following examples, but rather should be construed to encompass any and all variations that become evident as a result of the teaching provided herein.
[0509] It is believed that one or ordinary skill in the art can, using the preceding description and following Examples, as well as what is known in the art, to make and utilize technologies of the present disclosure.
Example 1. NDP Expression in HEK293FT Cells
[0510] HEK293FT cells were transfected with exemplary NDP vectors using jetprime reagent and were seeded overnight at 7E4 cells/well in a 24-well format (
Example 2. NDP Expression in P2 Cochlear Mouse Explants
[0511] P2 cochlear explants from WT mice were infected 16 hours after plating and were harvested for RNA and immunofluorescence 72 hours after infection. As shown in
Example 3. Promoter Selection
[0512] The present example further confirms use of constructs as described herein in methods treating hearing loss using a secreted target protein in accordance with the present disclosure. HEK293FT cells were transfected with 800ng plasmid DNA (mock, CAG-EGFP, NDP-EGFP) using jetprime reagent and were seeded overnight at 1.5E5 cells/well in a 24-well format (
Example 4. HSP70 Expression in HEK293FT Cells
[0513] The present example further confirms use of constructs as described herein in methods treating hearing loss using a secreted target protein in accordance with the present disclosure. HEK293FT cells were transfected with exemplary HSP70 vectors using jetprime reagent and were seeded overnight at 1.4E5 cells/well in a 24-well format (
Example 5. HSP70 Expression in P2 Cochlear Mouse Explants
[0514] P2 cochlear explants from WT mice were infected 16 hours after plating and were harvested for RNA and immunofluorescence 72 hours after infection. As shown in
[0515] Secreted HSP70 protein was also detected in the supernatant of cochlea explant culture (
EMBODIMENTS
[0516] Embodiment 1. A composition comprising a single adeno-associated virus (AAV) vector, wherein the single AAV vector comprises a nucleic acid sequence that encodes a secreted target protein; and when introduced into a mammalian cell, a nucleic acid encoding a secretion signal sequence operatively linked to the secreted target protein is generated at the locus of the secreted target protein, and the mammalian cell expresses and secretes the secreted target protein.
[0517] Embodiment 2. The composition of embodiment 1, wherein the single AAV vector further comprises a 5′ untranslated region (UTR), a 3′ UTR, or both.
[0518] Embodiment 3. The composition of embodiment 1 or 2, wherein the secreted target protein is norrin cysteine knot growth factor (NDP).
[0519] Embodiment 4. The composition of embodiment 3, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 1.
[0520] Embodiment 5. The composition of embodiment 4, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 1.
[0521] Embodiment 6. The composition of embodiment 5, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 1.
[0522] Embodiment 7. The composition of any one of embodiments 3-6, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 2.
[0523] Embodiment 8. The composition of embodiment 7, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 2.
[0524] Embodiment 9. The composition of embodiment 8, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 2.
[0525] Embodiment The composition of embodiment 1 or 2, wherein the secreted target protein is heat shock protein, optionally a heat shock protein family A (Hsp70) member 1A (HSPA1A) or heat shock protein family 40 (Hsp40)/DnaJ member.
[0526] Embodiment 11. The composition of embodiment 10, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 3.
[0527] Embodiment 12. The composition of embodiment 11, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 3.
[0528] Embodiment 13. The composition of embodiment 12, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 3.
[0529] Embodiment 14. The composition of any one of embodiments 10-13, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 4.
[0530] Embodiment 15. The composition of embodiment 14, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 4.
[0531] Embodiment 16. The composition of embodiment 15, wherein the nucleic acid that encodes the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 4.
[0532] Embodiment 17. The composition of any one of embodiments 1-16, wherein the AAV vector further comprises one or both of a promoter and a Kozak sequence.
[0533] Embodiment 18. The composition of embodiment 17, wherein the AAV vector comprises a promoter that is an inducible promoter, a constitutive promoter, or a tissue-specific promoter.
[0534] Embodiment 19. The composition of any one of embodiments 1-18, wherein the AAV vector further comprises a poly(dA) sequence.
[0535] Embodiment 20. The composition of any one of embodiments 1-19, wherein the secretion signal sequence comprises SEQ ID NO: 5.
[0536] Embodiment 21. The composition of embodiment 20, wherein the sequence encoding the secretion signal sequence comprises SEQ ID NO: 6.
[0537] Embodiment 22. The composition of any one of embodiments 1-19, wherein the secretion signal sequence comprises SEQ ID NO: 7.
[0538] Embodiment 23. The composition of embodiment 22, wherein the sequence encoding the secretion signal sequence comprises SEQ ID NO: 8.
[0539] Embodiment 24. The composition of embodiment 1, wherein the single AAV vector comprises a sequence that is at least 80% identical to SEQ ID NO: 9.
[0540] Embodiment 25. The composition of embodiment 24, wherein the single AAV vector comprises a sequence that is at least 90% identical to SEQ ID NO: 9.
[0541] Embodiment 26. The composition of embodiment 25, wherein the single AAV vector comprises a sequence that is at least 99% identical to SEQ ID NO: 9.
[0542] Embodiment 27. The composition of embodiment 1, wherein the single AAV vector comprises a sequence that is at least 80% identical to SEQ ID NO: 10.
[0543] Embodiment 28. The composition of embodiment 27, wherein the single AAV vector comprises a sequence that is at least 90% identical to SEQ ID NO: 10.
[0544] Embodiment 29. The composition of embodiment 28, wherein the single AAV vector comprises a sequence that is at least 99% identical to SEQ ID NO: 10.
[0545] Embodiment 30. The composition of any one of embodiments 1-29, further comprising a pharmaceutically acceptable excipient.
[0546] Embodiment 31. A kit comprising a composition of any one of embodiments 1-30.
[0547] Embodiment 32. The kit of embodiment 31, wherein the composition is pre-loaded in a syringe.
[0548] Embodiment 33. A method comprising introducing into an inner ear of a mammal a therapeutically effective amount of the composition of any one of embodiments 1-30.
[0549] Embodiment 34. The method of embodiment 33, wherein the mammal is a human or a primate.
[0550] Embodiment 35. The method of embodiment 33 or 34, wherein the mammal has been previously identified as having a defective secreted target gene.
[0551] Embodiment 36. A method of increasing expression of a full-length secreted target protein in a mammalian cell, the method comprising introducing the composition of any one of embodiments 1-30 into the mammalian cell.
[0552] Embodiment 37. The method of embodiment 36, wherein the mammalian cell is a cochlear inner hair cell, a supporting cell, a ganglion cell, a clear cell, a cuboidal cell, a cartilage cell, a cell of the tegmentum vasculosum, a homogene cell, a Hensen's cell, a Deiters' cell, a pillar cell, or a border cell.
[0553] Embodiment 38. The method of embodiment 36 or 37, wherein the mammalian cell is a human cell or a primate cell.
[0554] Embodiment 39. The method of any one of embodiments 36-38, wherein the mammalian cell has previously been determined to have a defective secreted target gene.
[0555] Embodiment 40. A method of increasing the level of a full-length secreted target protein in an inner ear of a mammal, the method comprising introducing into the inner ear of the mammal a therapeutically effective amount of the composition of any one of embodiments 1-30.
[0556] Embodiment 41. The method of embodiment 40, wherein the mammal has been previously identified as having a defective secreted target gene.
[0557] Embodiment 42. The method of embodiment 40 or 41, wherein the mammal is a human or a primate.
[0558] Embodiment 43. A method of treating syndromic and non-syndromic sensorineural hearing loss in a subject identified as having a defective secreted target gene, the method comprising: administering a therapeutically effective amount of a composition of any one of embodiments 1-30 into the inner ear of the subject.
[0559] Embodiment 44. The method of embodiment 43, wherein the subject is a human or a primate.
[0560] Embodiment 45. The method of embodiment 43 or 44, wherein the subject has Norrie disease pseudoglioma or the subject has been identified or diagnosed as having a Hsp70 polymorphism (such as rs1043618, rs1061581 and rs2227956 in HSP70-1, HSP70-2 and HSP70-hom, respectively) that makes the cochlea susceptible to hearing loss (such as a polymorphism as described by Konings et al., “Variations in HSP70 genes associated with noise-induced hearing loss in two independent populations”, Eur J Hum Genet. 2009 March; 17(3): 329-33, the contents of which is hereby incorporated by reference in its entirety).
[0561] Embodiment 46. The method of any one of embodiments 43-45, further comprising, prior to the administering step, determining that the subject has a defective secreted target gene.
[0562] Embodiment 47. A method of treating or preventing hearing loss in a subject identified as having a defective NDP gene or a defective HSPA1A gene, the method comprising: administering a therapeutically effective amount of a composition of any one of embodiments 1-30 into an inner ear of the subject.
[0563] Embodiment 48. The method of embodiment 47, wherein the subject has been identified or diagnosed as having Norrie disease pseudoglioma or the subject has been identified or diagnosed as having a Hsp70 polymorphism (such as rs1043618, rs1061581 and rs2227956 in HSP70-1, HSP70-2 and HSP70-hom, respectively) that makes the cochlea susceptible to hearing loss (such as a polymorphism as described by Konings et al., “Variations in HSP70 genes associated with noise-induced hearing loss in two independent populations”, Eur J Hum Genet. 2009 March; 17(3): 329-33, the contents of which is hereby incorporated by reference in its entirety).
[0564] Embodiment 49. The method of embodiment 47 or 48, wherein the subject is a human or a primate.
[0565] Embodiment 50. The method of any one of embodiments 47-49, further comprising prior to the administering step, determining that the subject has a defective NDP gene or a defective HSPA1A gene.
[0566] Embodiment 51. A composition comprising at least two different nucleic acid vectors, wherein: each of the at least two different vectors comprises a coding sequence that encodes a different portion of a secreted target protein, each of the encoded portions being at least 30 amino acid residues in length, wherein the amino acid sequence of each of the encoded portions may optionally partially overlap with the amino acid sequence of a different one of the encoded portions; no single vector of the at least two different vectors encodes a full-length secreted target protein; at least one of the coding sequences comprises a nucleotide sequence spanning two neighboring exons of the secreted target protein genomic DNA, and lacks an intronic sequence between the two neighboring exons; and when introduced into a mammalian cell the at least two different vectors undergo concatamerization or homologous recombination with each other, thereby forming a recombined nucleic acid that encodes a secretion signal sequence operatively linked to a full-length secreted target protein.
[0567] Embodiment 52. The composition of embodiment 51, wherein each of the at least two different vectors is a plasmid, a transposon, a cosmid, an artificial chromosome, or a viral vector.
[0568] Embodiment 53. The composition of embodiment 51, wherein each of the at least two different vectors is a human artificial chromosome (HAC), yeast artificial chromosome (YAC), bacterial artificial chromosome (BAC), or a P1-derived artificial chromosome (PAC).
[0569] Embodiment 54. The composition of embodiment 51, wherein each of the at least two different vectors is a viral vector selected from an adeno-associated virus (AAV) vector, an adenovirus vector, a lentivirus vector, or a retrovirus vector.
[0570] Embodiment 55. The composition of embodiment 51, wherein each of the at least two different vectors is an AAV vector.
[0571] Embodiment 56. The composition of any one of embodiments 51-55, wherein the amino acid sequence of one of the encoded portions overlaps with the amino acid sequence of a different one of the encoded portions.
[0572] Embodiment 57. The composition of embodiment 56, wherein the amino acid sequence of each of the encoded portions partially overlaps with the amino acid sequence of a different encoded portion.
[0573] Embodiment 58. The composition of embodiment 57, wherein the overlapping amino acid sequence is between about 30 amino acid residues to about 600 amino acid residues in length.
[0574] Embodiment 59. The composition of any one of embodiments 51-55, wherein the vectors include two different vectors, each of which comprises a different segment of an intron, wherein the intron comprises the nucleotide sequence of an intron that is present in the secreted target protein genomic DNA, and wherein the two different segments overlap in sequence by at least 100 nucleotides.
[0575] Embodiment 60. The composition of embodiment 59, wherein the two different segments overlap in sequence by about 100 nucleotides to about 800 nucleotides.
[0576] Embodiment 61. The composition of any one of embodiments 51-60, wherein the nucleotide sequence of each of the at least two different vectors is between about 500 nucleotides to about 10,000 nucleotides in length.
[0577] Embodiment 62. The composition of embodiment 61, wherein the nucleotide sequence of each of the at least two different vectors is between about 500 nucleotides to about 5,000 nucleotides in length.
[0578] Embodiment 63. The composition of any one of embodiments 51-62, wherein the number of different vectors in the composition is two.
[0579] Embodiment 64. The composition of embodiment 63, wherein a first of the two different vectors comprises a coding sequence that encodes an N-terminal portion of the secreted target protein.
[0580] Embodiment 65. The composition of embodiment 64, wherein the N-terminal portion of the secreted target protein is between about 30 amino acids to about 600 amino acids in length.
[0581] Embodiment 66. The composition of embodiment 65, wherein the N-terminal portion of the secreted target protein is between about 100 amino acids to about 500 amino acids in length.
[0582] Embodiment 67. The composition of any one of embodiments 64-66, wherein the first vector further comprises one or both of a promoter and a Kozak sequence.
[0583] Embodiment 68. The composition of embodiment 67, wherein the first vector comprises a promoter that is an inducible promoter, a constitutive promoter, or a tissue-specific promoter.
[0584] Embodiment 69. The composition of any one of embodiments 64-68, wherein the second of the two different vectors comprises a coding sequence that encodes a C-terminal portion of the secreted target protein.
[0585] Embodiment 70. The composition of embodiment 69, wherein the C-terminal portion of the secreted target protein is between about 30 amino acids to about 600 amino acids in length.
[0586] Embodiment 71. The composition of embodiment 70, wherein the C-terminal portion of the secreted target protein is between about 200 amino acids to about 500 amino acids in length.
[0587] Embodiment 72. The composition of any one of embodiments 69-71, wherein the second vector further comprises a poly(dA) sequence.
[0588] Embodiment 73. The composition of any one of embodiments 63-72, wherein the first vector, the second vector, or both vectors further comprises a 5′ untranslated region (UTR), a 3′ UTR, or both.
[0589] Embodiment 74. The composition of any one of embodiments 51-73, wherein the secreted target protein is norrin cysteine knot growth factor (NDP).
[0590] Embodiment 75. The composition of embodiment 74, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 1.
[0591] Embodiment 76. The composition of embodiment 75, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 1.
[0592] Embodiment 77. The composition of embodiment 76, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 1.
[0593] Embodiment 78. The composition of any one of embodiments 51-73, wherein the secreted target protein is heat shock protein family A (Hsp70) member 1A (HSPA1A), a heat shock protein family 40 (Hsp40) DnaJ homolog subfamily B member 1, or a heat shock protein family 40 (Hsp40)/DnaJ member.
[0594] Embodiment 79. The composition of embodiment 78, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 3.
[0595] Embodiment 80. The composition of embodiment 79, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 3.
[0596] Embodiment 81. The composition of embodiment 80, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 3.
[0597] Embodiment 82. The composition of any one of embodiments 51-81, wherein the secretion signal sequence comprises SEQ ID NO: 5.
[0598] Embodiment 83. The composition of any one of embodiments 51-81, wherein the secretion signal sequence comprises SEQ ID NO: 7.
[0599] Embodiment 84. A composition comprising two different nucleic acid vectors, wherein: a first nucleic acid vector of the two different nucleic acid vectors comprises a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ of the promoter, and a splicing donor signal sequence positioned at the 3′ end of the first coding sequence; and a second nucleic acid vector of the two different nucleic acid vectors comprises a splicing acceptor signal sequence, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the amino acid sequences of the encoded portions do not overlap, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell, to produce RNA transcripts, splicing occurs between the splicing donor signal sequence on one transcript and the splicing acceptor signal sequence on the other transcript, thereby forming a recombined RNA molecule that encodes a secretion signal sequence operatively linked to a full-length secreted target protein.
[0600] Embodiment 85. The composition of embodiment 84, wherein the coding sequence of at least one of the vectors comprises a nucleotide sequence spanning two neighboring exons of secreted target genomic DNA, and lacks an intronic sequence between the two neighboring exons.
[0601] Embodiment 86. A composition comprising: a first nucleic acid vector comprising a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ of the promoter, a splicing donor signal sequence positioned at the 3′ end of the first coding sequence, and a first detectable marker gene positioned 3′ of the splicing donor signal sequence; and a second nucleic acid vector, different from the first nucleic acid vector, comprising a second detectable marker gene, a splicing acceptor signal sequence positioned 3′ of the second detectable marker gene, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence positioned at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the respective amino acid sequences of the encoded portions do not overlap with each other, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell to produce RNA transcripts, splicing occurs between the splicing donor signal on one transcript and the splicing acceptor signal on the other transcript, thereby forming a recombined RNA molecule that encodes a secretion signal sequence operatively linked to a full-length secreted target protein.
[0602] Embodiment 87. The composition of embodiment 86, wherein the coding sequence of at least one of the vectors comprises a nucleotide sequence spanning two neighboring exons of secreted target genomic DNA, and lacks an intronic sequence between the two neighboring exons.
[0603] Embodiment 88. The composition of embodiment 87, wherein the first or second detectable marker gene encodes alkaline phosphatase.
[0604] Embodiment 89. The composition of embodiment 87 or 88, wherein the first and second detectable marker genes are the same.
[0605] Embodiment 90. A composition comprising: a first nucleic acid vector comprising a promoter, a first coding sequence that encodes an N-terminal portion of an secreted target protein positioned 3′ to the promoter, a splicing donor signal sequence positioned at the 3′ end of the first coding sequence, and a F1 phage recombinogenic region positioned 3′ to the splicing donor signal sequence; and a second nucleic acid vector, different from the first nucleic acid vector, comprising a second F1 phage recombinogenic region, a splicing acceptor signal sequence positioned 3′ of the second F1 phage recombinogenic region, a second coding sequence that encodes a C-terminal portion of an secreted target protein positioned at the 3′ end of the splicing acceptor signal sequence, and a polyadenylation sequence positioned at the 3′ end of the second coding sequence; wherein each of the encoded portions is at least 30 amino acid residues in length, wherein the respective amino acid sequences of the encoded portions do not overlap with each other, wherein no single vector of the two different vectors encodes a full-length secreted target protein, and, when the coding sequences are transcribed in a mammalian cell to produce RNA transcripts, splicing occurs between the splicing donor signal one transcript and the splicing acceptor signal on the other transcript, thereby forming a recombined RNA molecule that encodes a secretion signal sequence operatively linked to a full-length secreted target protein.
[0606] Embodiment 91. The composition of embodiment 90, wherein the coding sequence of at least one of the vectors comprises a nucleotide sequence spanning two neighboring exons of secreted target genomic DNA, and lacks an intronic sequence between the two neighboring exons.
[0607] Embodiment 92. The composition of any one of embodiments 84-91, wherein the first vector, the second vector, or both vectors further comprises a 5′ untranslated region (UTR), a 3′ UTR, or both.
[0608] Embodiment 93. The composition of any one of embodiments 84-92, wherein the secreted target protein is norrin cysteine knot growth factor (NDP).
[0609] Embodiment 94. The composition of embodiment 93, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 1.
[0610] Embodiment 95. The composition of embodiment 94, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 1.
[0611] Embodiment 96. The composition of embodiment 95, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 1.
[0612] Embodiment 97. The composition of any one of embodiments 84-92, wherein the secreted target protein is heat shock protein family A (Hsp70) member 1A (HSPA1A), a heat shock protein family 40 (Hsp40)/DnaJ member.
[0613] Embodiment 98. The composition of embodiment 97, wherein the secreted target protein comprises a sequence that is at least 80% identical to SEQ ID NO: 3.
[0614] Embodiment 99. The composition of embodiment 98, wherein the secreted target protein comprises a sequence that is at least 90% identical to SEQ ID NO: 3.
[0615] Embodiment 100. The composition of embodiment 99, wherein the secreted target protein comprises a sequence that is at least 99% identical to SEQ ID NO: 3.
[0616] Embodiment 101. The composition of any one of embodiments 84-100, wherein the secretion signal sequence comprises SEQ ID NO: 5.
[0617] Embodiment 102. The composition of any one of embodiments 84-100, wherein the secretion signal sequence comprises SEQ ID NO: 7.
[0618] Embodiment 103. The composition of any one of embodiments 51-102, further comprising a pharmaceutically acceptable excipient.
[0619] Embodiment 104. A kit comprising a composition of any one of embodiments 51-103.
[0620] Embodiment 105. A kit of embodiment 104, further comprising a pre-loaded syringe comprising the composition.
[0621] Embodiment 106. A method comprising introducing into an inner ear of a mammal a therapeutically effective amount of the composition of any one of embodiments 51-103.
[0622] Embodiment 107. The method of embodiment 106, wherein the mammal is a human or a primate.
[0623] Embodiment 108. The method of embodiment 106 or 107, wherein the mammal has been previously identified as having a defective secreted target gene.
[0624] Embodiment 109. A method of increasing expression of a full-length secreted target protein in a mammalian cell, the method comprising introducing the composition of any one of embodiments 51-103 into the mammalian cell.
[0625] Embodiment 110. The method of embodiment 109, wherein the mammalian cell is a cochlear inner hair cell, a supporting cell, a ganglion cell, a clear cell, a cuboidal cell, a cartilage cell, a cell of the tegmentum vasculosum, a homogene cell, a Hensen's cell, a Deiters' cell, a pillar cell, or a border cell.
[0626] Embodiment 111. The method of embodiment 109 or 110, wherein the mammalian cell is a human cell or a primate cell.
[0627] Embodiment 112. The method of any one of embodiments 109-111, wherein the mammalian cell has previously been determined to have a defective secreted target gene.
[0628] Embodiment 113. A method of increasing the level of a full-length secreted target protein in an inner ear of a mammal, the method comprising introducing into the inner ear of the mammal a therapeutically effective amount of the composition of any one of embodiments 51-103.
[0629] Embodiment 114. The method of embodiment 113, wherein the mammal has been previously identified as having a defective secreted target gene.
[0630] Embodiment 115. The method of embodiment 113 or 114, wherein the mammal is a human or a primate.
[0631] Embodiment 116. A method of treating syndromic and non-syndromic sensorineural hearing loss in a subject identified as having a defective secreted target gene, the method comprising:
[0632] administering a therapeutically effective amount of a composition of any one of embodiments 51-103 into the inner ear of the subject.
[0633] Embodiment 117. The method of embodiment 116, wherein the subject is a human or a primate.
[0634] Embodiment 118. The method of embodiment 116 or 117, wherein the subject has Norrie disease pseudoglioma or the subject has been identified or diagnosed as having a Hsp70 polymorphism (such as rs1043618, rs1061581 and rs2227956 in HSP70-1, HSP70-2 and HSP70-hom, respectively) that makes the cochlea susceptible to hearing loss (such as a polymorphism as described by Konings et al., “Variations in HSP70 genes associated with noise-induced hearing loss in two independent populations”, Eur J Hum Genet. 2009 March; 17(3): 329-33, the contents of which is hereby incorporated by reference in its entirety).
[0635] Embodiment 119. The method of any one of embodiments 116-118, further comprising, prior to the administering step, determining that the subject has a defective secreted target gene.
[0636] Embodiment 120. A method of treating or preventing hearing loss in a subject identified as having a defective NDP gene or a defective HSPA1A gene, the method comprising: administering a therapeutically effective amount of a composition of any one of embodiments 51-103 into an inner ear of the subject.
[0637] Embodiment 121. The method of embodiment 120, wherein the subject has been identified or diagnosed as having Norrie disease pseudoglioma or the subject has been identified or diagnosed as having a Hsp70 polymorphism (such as rs1043618, rs1061581 and rs2227956 in HSP70-1, HSP70-2 and HSP70-hom, respectively) that makes the cochlea susceptible to hearing loss (such as a polymorphism as described by Konings et al., “Variations in HSP70 genes associated with noise-induced hearing loss in two independent populations”, Eur J Hum Genet. 2009 March; 17(3): 329-33, the contents of which is hereby incorporated by reference in its entirety).
[0638] Embodiment 122. The method of embodiment 120 or 121, wherein the subject is a human or a primate.
[0639] Embodiment 123. The method of any one of embodiments 120-122, further comprising prior to the administering step, determining that the subject has a defective NDP gene or a defective HSPA1A gene.
[0640] Embodiment 124. A method of treating or preventing vision loss in a subject identified as having a defective NDP gene or a defective HSPA1A gene, the method comprising: administering a therapeutically effective amount of a composition of any one of embodiments 1-30 into an inner ear or central nervous system of the subject, or systemically administering a therapeutically effective amount of a composition of any one of embodiments 1-30 to the subject.
[0641] Embodiment 125. The method of embodiment 124, wherein the subject is a human or a primate.
[0642] Embodiment 126. The method of embodiment 124 or 125, further comprising prior to the administering step, determining that the subject has a defective NDP gene or a defective HSPA1A gene.
[0643] Embodiment 127. A method of treating or preventing vision loss in a subject identified as having a defective NDP gene or a defective HSPA1A gene, the method comprising: administering a therapeutically effective amount of a composition of any one of embodiments 51-103 into an inner ear or central nervous system of the subject, or systemically administering a therapeutically effective amount of a composition of any one of embodiments 51-103 to the subject.
[0644] Embodiment 128. The method of embodiment 127, wherein the subject is a human.
[0645] Embodiment 129. The method of embodiment 127 or 128, further comprising prior to the administering step, determining that the subject has a defective NDP gene or a defective HSPA1A gene.
OTHER EMBODIMENTS
[0646] It is to be understood that while the present disclosure has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the present disclosure, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
[0647] All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. Section headings and any descriptions of materials, methods, and examples are illustrative only and not intended to be limiting.
TABLE-US-00035 Exemplary Sequences mature NDP protein SEQ ID NO: 1 KTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQP FRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS mature NDP cDNA SEQ ID NO: 2 AAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATGTGG ATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTGCGAGGG GCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAGCAACCC TTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGCTGCGAT GCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGAGGAATG CAATTCCTAA mature HSPA1A protein SEQ ID NO: 3 MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTI PTKQTQIFTTYSDNQPGVLIQVYEGERAMTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANG ILNVTATDKSTGKANKITITNDKGRLSKEEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFN MKSAVEDEGLKGKISEADKKKVLDKCQEVISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQ GAGGPGPGGFGAQGPKGGSGSGPTIEEVD mature HSPA1A cDNA SEQ ID NO: 4 ATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCC AACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGC CTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAG AACACCGTGTTTGACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGG ACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTA CAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAG GAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACT TCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCG GATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAG CGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACG ACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAA CAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAAC AAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCA CCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAG GGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCT CTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCC GCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCAT CAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAG TCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGG CCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGAT CTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCC ATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGG GCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCAC GGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAG GAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCG AGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGA TGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAG GTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGG AGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCC TGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTG GATTAG NDP secretion signal protein SEQ ID NO: 5 MRKHVLAASFSMLSLLVIMGDTDS NDP secretion signal cDNA SEQ ID NO: 6 AGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACAG ACAGT Human Norrin secretion signal protein SEQ ID NO: 7 MRKHVLAASFSMLSLLVIMGDTDS Human Norrin secretion signal cDNA SEQ ID NO: 8 ATGAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATA CAGACAGT NDP Construct 1 SEQ ID NO: 9 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCTAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGT TCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCC GGGGACTCTGCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGG GAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTC AGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAA AAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTT ACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTT TTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGA TACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAAC TAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATG ACTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATG GTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCT GGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACA ACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTA ATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTT GGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAA AGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCC CGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATT GCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGG GGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCAATTCAGAC TCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTGGTC CAAAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAGAAA GGGGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTCCCTCAAGCAA ATAGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCTGAC CAGGGTCCTATTCCAAGACAACTCTTCTACTCTTCCAGCAGACTCTTATAGAAAGAAAGATTCT GGGGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCACTTC TAACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTC AGTCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGACGTG CCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCT CACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGC GAGCGAGCGCGCAGCTGCCTGCAGG NDP Construct 2 SEQ ID NO: 10 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCTAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCC CCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAAT TGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAG GGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGGTTGACTAA AGATTGAACCTTATTCAAAGTTAGACTCTCTTTGTTAAAGACAAACAAAACTTCCAATAATTCA GCAACTAATAGAAAGACTCAACTTGTGAGCCACTACTTATTAATTGAGAATAGTCACTGAGTCT GGCAAAATATATTAGAAAGTGAGACTGAGTTTAAAAAAGTGAGACCTTTGATTCTGAACAGTGA ACTTTGAGACTGAAAAACTACAGCTTTGAGGCTAAATACTTGATCAAATAAAACTAGTTAGTGA AAAAACTGAGTGAAAGTCCAACAGAAAAAAGAGGGCCCACTTCCACGAGTGAGACAAAATCCAG ATTGATCGTTCTTTAAGTGACTATTCTTGCCAGAATCAGCAAGGTGGATACAAAGGACTCTGAG AAAGAACTCTCTGAACTCTGGGCAAGGCCCCAGTCCAAAGCAATTAGTATCCTTAGGACCAGAA AAATCTGTGGAAGGTCAGAATTTCTTGTCTGAGAAAAACAAAGCAATTCAGACTCCCACTAGAA GAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTGGTCCAAAAACTGAA AAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAGAAAGGGGCACTTAC TCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTAGTAGACTCCCTCAAGCAAATAGATCTCAC TTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCTGACCAGGGTCCTAT TCCAAGACAACTCTTCTAGTCTTCCAGCAGACTCTTATAGAAAGAAAGATTCTGGGGTCCAAGA AAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCACTTCTAACCTTGGAG TGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTCAGTCAACTACA AGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGACGTGCCTCGGACCGC TAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCG GGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCG CAGCTGCCTGCAGG NDP Construct 3 SEQ ID NO: 11 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCGGCTCCGGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAG AATCCTGGCCCAATGGAGAGCGACGAGAGCGGCCTGCCCGCCATGGAGATCGAGTGCCGCATCA CCGGCACCCTGAACGGCGTGGAGTTCGAGCTGGTGGGCGGCGGAGAGGGCACCCCCGAGCAGGG CCGCATGACCAACAAGATGAAGAGCACCAAAGGCGCCCTGACCTTCAGCCCCTACCTGCTGAGC CACGTGATGGGCTACGGCTTCTACCACTTCGGCACCTACCCCAGCGGCTACGAGAACCCCTTCC TGCACGCCATCAACAACGGCGGCTACACCAACACCCGCATCGAGAAGTACGAGGACGGCGGCGT GCTGCACGTGAGCTTCAGCTACCGCTACGAGGCCGGCCGCGTGATCGGCGACTTCAAGGTGATG GGCACCGGCTTCCCCGAGGACAGCGTGATCTTCACCGACAAGATCATCCGCAGCAACGCCACCG TGGAGCACCTGCACCCCATGGGCGATAACGATCTGGATGGCAGCTTCACCCGCACCTTCAGCCT GCGCGACGGCGGCTACTACAGCTCCGTGGTGGACAGCCACATGCACTTCAAGAGCGCCATCCAC CCCAGCATCCTGCAGAACGGGGGCCCCATGTTCGCCTTCCGCCGCGTGGAGGAGGATCACAGCA ACACCGAGCTGGGCATCGTGGAGTACCAGCACGCCTTCAAGACCCCGGATGCAGATGCCGGTGA AGAATAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGC CAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCT GCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTT TGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTT CCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCC CTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAA AGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGT CATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCA TGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAACTAACAAACA GATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGACTTTTTGA AAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTT TGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCTGGAAAAGGC ATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTAT GAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACT TGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAAT GTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATAT TCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTC CTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCAT TGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATT GGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGAAGCTTGAATTCAGCTGAC GTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTG AGCGAGCGAGCGCGCAGCTGCCTGCAGG NDP Construct 4 SEQ ID NO: 12 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTGCCACCAT GAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACA GACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACT ATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTG CGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAG CAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGC TGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGCGA GGAATGCAATTCCGGCTCCGGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAG AATCCTGGCCCAATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCG AGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCAC CTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACC CTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGC ACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGA CGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATC GAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACT ACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAA GATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCA AAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCAC TCTCGGCATGGACGAGCTGTACAAGTAATAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGAC AACTGTAGAGGCAGTTCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGG ATGCAACAATTCTCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTACATGCTTGTTGACAG AGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGG TTCATTTTCAGATTCAGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTC AGCATTAGTGGGAAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCAC GCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTAT TGTTGATGTTTTTTTTTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCG ATTTCAAAGTCCAGATACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAAC TGACATTAAAATAACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAAT TATTATTCAGAAATGAGTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGT CTCAGATAAATTATGGTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGA TCCTGCACTGACTCTGGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATG CATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCAC GGGCATCATTCTCTAATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTT GTCTTCTGAACGTTTGGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCA CGTTTTGAAATAAAAAGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGT TGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAA TAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGG GGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTC TATGGAAGCTTGAATTCAGCTGACGTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGC CACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCG GGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGG NDP Construct 5 SEQ ID NO: 13 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CTGCGGCCGCACGCGTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCAT TAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTG ACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATA GGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATC AAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCA TTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC GCTATTACCATGGGTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCC CCACCCCCAATTTTGTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGG GGGGGGCGCGCGCCAGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTG CGGCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCG GCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGGGAGTCGCTGCGTTGCCTTCGCCCCGTGCC CCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGCTCTGACTGACCGCGTTACTCCCACAGGTGA GCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAATTAGCGCTTGGTTTAATGACGGCTCGTTTC TTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTCCGGGAGGGCCCTTTGTGCGGGGGGGAGCGG CTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGGAGCGCCGCGTGCGGCCCGCGCTGCCCGGCG GCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTGTGCGCTCCGCGTGTGCGCGAGGGGAGCGCG GCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTGCGAGGGGAACAAAGGCTGCGTGCGGGGTGT GTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCGGCGGTCGGGCTGTAACCCCCCCCTGCACCC CCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCGGGTGCGGGGCTCCGTGCGGGGCGTGGCGCG GGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGGTGGGGGTGCCGGGCGGGGCGGGGCCGCCTC GGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGGCCCCCGGAGCGCCGGCGGCTGTCGAGGCGC GGCGAGCCGCAGCCATTGCCTTTTATGGTAATCGTGCGAGAGGGCGCAGGGACTTCCTTTGTCC CAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCCGCCGCACCCCCTCTAGCGGGCGCGGGGCGA AGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGGGGAGGGCCTTCGTGCGTCGCCGCGCCGCCG TCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCGCGGGGGGACGGCTGCCTTCGGGGGGGACGG GGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCGGCGGCTCTAGAGCCTCTGCTAACCATGTTC ATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAACGTGCTGGTTATTGTGACCGGTAAGATGCT CCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCGATAACGAGCGCCTCACATTTCC GTGGCATTCCCATTTGCTAGTGCGCTGCTGCGGCCGCACGCCTGATTGATATATGACTGCAATG GCACTTTTCCATTTGACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCTCTCTCCCT CTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTTGCAAATATAAAAGG CAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGC TCTCAGAAAAGTCTGAGAAGAAAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATACTGTGA TGATGGATTGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGACAACCAGA AAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAAGCCATTTC CAATTTTTTGTCCTCTGCCTCCCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACAACAGCCAC CATGAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGAT ACAGACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACC ACTATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAG GTGCGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTC AAGCAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGC GGCTGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTG CGAGGAATGCAATTCCTAAGGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGC AGTTCGACCAGCCAGGGAAAGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTC TCCCGGGACTCTGCATATTCTAGTAATAAAGACTCTAGATGCTTGTTGACAGAGAGAGATACTC TGGGAACTTCTTTGCAGTTCCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGA TTCAGGCATTTTCCCCCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGG AAAAAGTGGGCCCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAA TTTACTTTGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTT TTTTTACCTTGTCATTTTGGTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCC AGATACCAAGCATGTGGATATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAAT AACTAACAAACAGATTCTTTTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAA ATGACTTTTTGAAAGTAAAAGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATT ATGGTAAAATTTTGTAAGGGAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGAC TCTGGAAAAGGCATATATGTACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAG ACAACCAAGTATGAATCTATTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCT CTAATATCCACTTGTCCATGTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACG TTTGGTTCAAATGTGTTTTGGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATA AAAAGAGTATATTCAAAAGAGCTCCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTC CCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAA ATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCA AGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCAATTCA GACTCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTG GTCCAAAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAG AAAGGGGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTCCCTCAAG CAAATAGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCT GACCAGGGTCCTATTCCAAGACAACTCTTCTACTCTTCCAGCAGACTCTTATAGAAAGAAAGAT TCTGGGGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCAC TTCTAACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAA TTCAGTCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACCAAGCTTGAATTCAGCTGAC GTGCCTCGGACCGCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTC GCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTG AGCGAGCGAGCGCGCAGCTGCCTGCAGG 5′ UTR SEQ ID NO: 14 AAGATGCTCCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCGATAACGAGCGCCTC ACATTTCCGTGGCATTCCCATTTGCTAGTGCGCTGCTGCGGCCGCACGCCTGATTGATATATGA CTGCAATGGCACTTTTCCATTTGACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCT CTCTCCCTCTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTTGCAAAT ATAAAAGGCAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCATTTGGAAGTAACAGGACCTC TTTCTAGCTCTCAGAAAAGTCTGAGAAGAAAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGA TACTGTGATGATGGATTGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGA CAACCAGAAAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAA GCCATTTCCAATTTTTTGTCCTCTGCCTCCCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACA ACA 3′ UTR 1023 SEQ ID NO: 15 GGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGCCAGGGAA AGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCTGCATATT CTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTT CCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTCCCCCTT GGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCCCTCATAC ACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAAAGTAGAA AAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTG GTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCATGTGGAT ATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAACTAACAAACAGATTCTT TTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGACTTTTTGAAAGTAAA AGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGTAAGG GAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCTGGAAAAGGCATATATG TACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTATGAATCTA TTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCAT GTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAATGTGTTTT GGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATATTCAAAA 5′ITR cDNA sequence SEQ ID NO: 16 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCGTCGGGCGACCTTTGGTC GCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTC CT CMV enhancer cDNA sequence SEQ ID NO: 17 GACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATA TATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC GTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATG ACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGG CBA promoter cDNA sequence SEQ ID NO: 18 TCGAGGTGAGCCCCACGTTCTGCTTCACTCTCCCCATCTCCCCCCCCTCCCCACCCCCAATTTT GTATTTATTTATTTTTTAATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGCGCGCGCC AGGCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGGCGGCAGCCAAT CAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGCGGCGGCGGCGGCGGCCCTATAAAA AGCGAAGCGCGCGGCGGGCG chimeric intron cDNA sequence SEQ ID NO: 19 GGAGTCGCTGCGTTGCCTTCGCCCCGTGCCCCGCTCCGCGCCGCCTCGCGCCGCCCGCCCCGGC TCTGACTGACCGCGTTACTCCCACAGGTGAGCGGGCGGGACGGCCCTTCTCCTCCGGGCTGTAA TTAGCGCTTGGTTTAATGACGGCTCGTTTCTTTTCTGTGGCTGCGTGAAAGCCTTAAAGGGCTC CGGGAGGGCCCTTTGTGCGGGGGGGAGCGGCTCGGGGGGTGCGTGCGTGTGTGTGTGCGTGGGG AGCGCCGCGTGCGGCCCGCGCTGCCCGGCGGCTGTGAGCGCTGCGGGCGCGGCGCGGGGCTTTG TGCGCTCCGCGTGTGCGCGAGGGGAGCGCGGCCGGGGGCGGTGCCCCGCGGTGCGGGGGGGCTG CGAGGGGAACAAAGGCTGCGTGCGGGGTGTGTGCGTGGGGGGGTGAGCAGGGGGTGTGGGCGCG GCGGTCGGGCTGTAACCCCCCCCTGCACCCCCCTCCCCGAGTTGCTGAGCACGGCCCGGCTTCG GGTGCGGGGCTCCGTGCGGGGCGTGGCGCGGGGCTCGCCGTGCCGGGCGGGGGGTGGCGGCAGG TGGGGGTGCCGGGCGGGGCGGGGCCGCCTCGGGCCGGGGAGGGCTCGGGGGAGGGGCGCGGCGG CCCCCGGAGCGCCGGCGGCTGTCGAGGCGCGGCGAGCCGCAGCCATTGCCTTTTATGGTAATCG TGCGAGAGGGCGCAGGGACTTCCTTTGTCCCAAATCTGTGCGGAGCCGAAATCTGGGAGGCGCC GCCGCACCCCCTCTAGCGGGCGCGGGGCGAAGCGGTGCGGCGCCGGCAGGAAGGAAATGGGCGG GGAGGGCCTTCGTGCGTCGCCGCGCCGCCGTCCCCTTCTCCCTCTCCAGCCTCGGGGCTGTCCG CGGGGGGACGGCTGCCTTCGGGGGGGACGGGGCAGGGCGGGGTTCGGCTTCTGGCGTGTGACCG GCGGCTCTAGAGCCTCTGCTAACCATGTTCATGCCTTCTTCTTTTTCCTACAGCTCCTGGGCAA CGTGCTGGTTATTGTG NDP cDNA sequence SEQ ID NO: 20 CTGAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATA CAGACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCA CTATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGG TGCGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCA AGCAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCG GCTGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGC GAGGAATGCAATTCCTAA NDP cDNA sequence SEQ ID NO: 21 CTGAGAAAACATGTACTAGCTGCATCCTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATA CAGACAGTAAAACGGACAGCTCATTCATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCA CTATGTGGATTCTATCAGTCACCCATTGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGG TGCGAGGGGCACTGCAGCCAGGCGTCACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCA AGCAACCCTTCCGTTCCTCCTGTCACTGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCG GCTGCGATGCTCAGGGGGCATGCGACTCACTGCCACCTACCGATACATCCTCTCCTGTCACTGC GAGGAATGCAATTCC 3′UTR cDNA sequence SEQ ID NO: 22 TGCCCGCTGCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGCCAGGGAA AGACTGGCAAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCTGCATATT CTAGTAATAAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTT CCCATCTCCTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTCCCCCTT GGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCCCTCATAC ACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAAAGTAGAA AAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTG GTCTAAGGTTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCATGTGGAT ATGTTTAGCTACGTTTACTCACAGCCAGCGAACTGACATTAAAATAACTAACAAACAGATTCTT TTATGTGATGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGACTTTTTGAAAGTAAA AGCAGCATAAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGTAAGG GAGCAGACTTTTAAAGACTTGCACAAATACGGATCCTGCACTGACTCTGGAAAAGGCATATATG TACTAGTGGCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTATGAATCTA TTTGTGGGTGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCAT GTGAAACATGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAATGTGTTTT GGTCCTGGAGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATATTCAAAA bGHpA cDNA sequence SEQ ID NO: 23 CTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGA AGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATA GCAGGCATGCTGGGGATGCGGTGGGCTCTATGG stuffer cDNA sequence SEQ ID NO: 24 TAATTCAGACTCCCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAA CCCAGTGGTCCAAAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAA GAAGGAGAAAGGGGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTC CCTCAAGCAAATAGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTA TATATCTGACCAGGGTCCTATTCCAAGACAACTCTTCTACTCTTCCAGCAGACTCTTATAGAAA GAAAGATTCTGGGGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCT TTAGCACTTCTAACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTG CCACAAATTCAGTCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACC stuffer cDNA sequence SEQ ID NO: 25 GTTGACTAAAGATTGAACCTTATTCAAAGTTAGACTCTCTTTGTTAAAGACAAACAAAACTTCC AATAATTCAGCAACTAATAGAAAGACTCAACTTGTGAGCCACTACTTATTAATTGAGAATAGTC ACTCAGTCTGGCAAAATATATTAGAAAGTGACACTGAGTTTAAAAAAGTGACACCTTTGATTCT GAACAGTGAACTTTGAGACTGAAAAACTACAGCTTTGAGGCTAAATACTTGATCAAATAAAACT ACTTACTCAAAAAACTGAGTGAAAGTCCAACAGAAAAAAGAGGGCCCACTTCCACCAGTGACAC AAAATCCAGATTGATCGTTCTTTAAGTGACTATTCTTGCCAGAATCAGCAAGGTGGATACAAAG GACTCTGAGAAAGAACTCTCTGAACTCTGGGCAAGGCCCCAGTCCAAAGCAATTAGTATCCTTA GGACCAGAAAAATCTGTGGAAGGTCAGAATTTCTTGTCTGAGAAAAACAAAGCAATTCAGACTC CCACTAGAAGAAACAGAACTTGAAAAAAGGATAATTGTGGTGAACACCTCAACCCAGTGGTCCA AAAACTGAAAAACTTTGACCCCGAGCACCCTCACACAGATAGACTACTGAAAGAAGGAGAAAGG GGCACTTACTCAGTCTCCCTTATCAGATTGCCTTACGAGGAGTACTAGACTCCCTCAAGCAAAT AGATCTCACTTACCACTTGCAAAGGTATACTACTTTCACTCTATTAGACCTATATATCTGACCA GGGTCCTATTCCAAGACAACTCTTCTACTCTTCCAGCAGACTCTTATAGAAAGAAAGATTCTGG GGTCCAAGAAAGCAGTACTTTCTTACAAGGAGCCAAAAAAAATAACCTTTCTTTAGCACTTCTA ACCTTGGAGTGAACTGGTGATCAAAGAGAGGTTGGCTCCCTGGGGACAAGTGCCACAAATTCAG TCAACTACAAGAAAGTTGAGAACACTGTTCTCCCGAAACC T2A cDNA sequence SEQ ID NO: 26 GAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCA tGFP cDNA sequence SEQ ID NO: 27 ATGGAGAGCGACGAGAGCGGCCTGCCCGCCATGGAGATCGAGTGCCGCATCACCGGCACCCTGA ACGGCGTGGAGTTCGAGCTGGTGGGCGGCGGAGAGGGCACCCCCGAGCAGGGCCGCATGACCAA CAAGATGAAGAGCACCAAAGGCGCCCTGACCTTCAGCCCCTACCTGCTGAGCCACGTGATGGGC TACGGCTTCTACCACTTCGGCACCTACCCCAGCGGCTACGAGAACCCCTTCCTGCACGCCATCA ACAACGGCGGCTACACCAACACCCGCATCGAGAAGTACGAGGACGGCGGCGTGCTGCACGTGAG CTTCAGCTACCGCTACGAGGCCGGCCGCGTGATCGGCGACTTCAAGGTGATGGGCACCGGCTTC CCCGAGGACAGCGTGATCTTCACCGACAAGATCATCCGCAGCAACGCCACCGTGGAGCACCTGC ACCCCATGGGCGATAACGATCTGGATGGCAGCTTCACCCGCACCTTCAGCCTGCGCGACGGCGG CTACTACAGCTCCGTGGTGGACAGCCACATGCACTTCAAGAGCGCCATCCACCCCAGCATCCTG CAGAACGGGGGCCCCATGTTCGCCTTCCGCCGCGTGGAGGAGGATCACAGCAACACCGAGCTGG GCATCGTGGAGTACCAGCACGCCTTCAAGACCCCGGATGCAGATGCCGGTGAAGAATAA eGFP cDNA sequence SEQ ID NO: 28 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCG ACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCT GACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACC CTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCA AGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGC ATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA ACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAA CATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGC CCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACG AGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGA CGAGCTGTACAAG eGFP protein SEQ ID NO: 29 MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTT LTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKG IDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDG PVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK 3′ITR cDNA sequence SEQ ID NO: 30 AGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGG GCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGC AGCTGCCTGCAGG 5′ UTR 579 SEQ ID NO: 31 AAGATGCTCCGTGGAAGGGAGCCGAGCGGTGGGCAGAGGCTGAGTCCCCGATAACGAGCGCCTC ACATTTCCGTGGCATTCCCATTTGCTAGTGCGCTGCTGCGGCCGCACGCCTGATTGATATATGA CTGCAATGGCACTTTTCCATTTGACATTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCCCTCTCT CTCTCCCTCTCTCTCTCTCCCTGTGTCGCTTAAACAACAGTCCTAACTTTTGTGTGTTGCAAAT ATAAAAGGCAAGCCATGTGACAGAGGGACAGAAGAACAAAAGCATTTGGAAGTAACAGGACCTC TTTCTAGCTCTCAGAAAAGTCTGAGAAGAAAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGA TAGTGTGATGATGGATTGCAAGTGCAAAGAGTAAGACAAAACTCCAGCACATAAAGGACAATGA CAACCAGAAAGCTTCAGCCCGATCCTGCCCTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAA GCCATTTCCAATTTTTTGTCCTCTGCCTCCCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACA ACA Human NDP Gene SEQ ID NO: 32 AGAAGAACAAAAGCATTTGGAAGTAACAGGACCTCTTTCTAGCTCTCAGAAAAGTCTGAGAAGA AAGGAGCCCTGCGTTCCCCTAAGCTGTGCAGCAGATACTGTGATGATGGATTGCAAGTGCAAAG AGTAAGACAAAACTCCAGCACATAAAGGACAATGACAACCAGAAAGCTTCAGCCCGATCCTGCC CTTTCCTTGAACGGGACTGGATCCTAGGAGGTGAAGCCATTTCCAATTTTTTGTCCTCTGCCTC CCTCTGCTGTTCTTCTAGAGAAGTTTTTCCTTACAACAATGAGAAAACATGTACTAGCTGCATC CTTTTCTATGCTCTCCCTGCTGGTGATAATGGGAGATACAGACAGTAAAACGGACAGCTCATTC ATAATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATGTGGATTCTATCAGTCACCCAT TGTACAAGTGTAGCTCAAAGATGGTGCTCCTGGCCAGGTGCGAGGGGCACTGCAGCCAGGCGTC ACGCTCCGAGCCTTTGGTGTCGTTCAGCACTGTCCTCAAGCAACCCTTCCGTTCCTCCTGTCAC TGCTGCCGGCCCCAGACTTCCAAGCTGAAGGCACTGCGGCTGCGATGCTCAGGGGGCATGCGAC TCACTGCCACCTACCGGTACATCCTCTCCTGTCACTGCGAGGAATGCAATTCCTGAGGCCCGCT GCTGTGTGTGGCTTCTGGATGGGACAACTGTAGAGGCAGTTCGACCAGCCAGGGAAAGACTGGC AAGAAAAGAGTTAAGGCAAAAAAGGATGCAACAATTCTCCCGGGACTCTGCATATTCTAGTAAT AAAGACTCTACATGCTTGTTGACAGAGAGAGATACTCTGGGAACTTCTTTGCAGTTCCCATCTC CTTTCTCTGGTACAATTTCTTTTGGTTCATTTTCAGATTCAGGCATTTTCCCCCTTGGCTCTCA ATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGCCCTCATACACAAGCGT GTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAGAATTTACTTTGGAAAGTAGAAAAGCCCAG CTTTTCCTGGGACATCTTCTGTTATTGTTGATGTTTTTTTTTACCTTGTCATTTTGGTCTAAGG TTGCCATTGCTGCTAAAGGTTACCGATTTCAAAGTCCAGATACCAAGCATGTGGATATGTTTAG CTACGTTTACTCACAGCGAGCGAACTGAGATTAAAATAACTAACAAACAGATTCTTTTATGTGA TGCTGGAACTCTTGACAGCTATAATTATTATTCAGAAATGAGTTTTTGAAAGTAAAAGCAGCAT AAAGAATTTGTCACAGGAAGGCTGTCTCAGATAAATTATGGTAAAATTTTGTAAGGGAGCAGAC TTTTAAAGACTTGCACAAATACGGATCCTGCACTGAGTCTGGAAAAGGCATATATGTACTAGTG GCATGGAGAATGCACCATACTCATGCATGCAAATTAGACAACCAAGTATGAATCTATTTGTGGG TGTGCTATAGCTTTAGCCGTGTCACGGGCATCATTCTCTAATATCCACTTGTCCATGTGAAACA TGTTGCCAAAATGGTGGCCTGGCTTGTCTTCTGAACGTTTGGTTCAAATGTGTTTTGGTCCTGG AGGCTCAAATTTTGAGTTATTCCCACGTTTTGAAATAAAAAGAGTATATTCAAAA Mouse Mature NDP Protein SEQ ID NO: 33 KTDSSFLMDSQRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFR SSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECSS Mouse Mature NDP cDNA SEQ ID NO: 34 AAAACAGACAGTTCATTTCTGATGGAGTCTCAACGCTGCATGAGACACCATTATGTCGATTCTA TCAGTCACCCACTGTACAAATGTAGCTCAAAGATGGTGCTCCTGGCCAGATGTGAGGGGCACTG CAGCCAGGCATCACGCTCTGAGCCCTTGGTGTCCTTCAGCACTGTCCTCAAGCAACCTTTCCGT TCCTCCTGTCACTGCTGCCGACCCCAGACTTCCAAGCTGAAGGCTCTGCGTCTGCGCTGCTCAG GGGGCATGCGACTTACTGCCACTTACCGGTACATCCTCTCCTGTCACTGTGAGGAATGCAGCTC C Rhesus Monkey Mature NDP Protein SEQ ID NO: 35 MRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLAR CEGHCSQASRSEPLVSFSTVLKQPFRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHC EECNS Rat Mature NDP cDNA SEQ ID NO: 36 KTDSSFLMDSQRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFR SSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECSS Chimpanzee Mature NDP Protein SEQ ID NO: 37 KTDSSFVMDSDPRRCMRHHYVDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQP FRSSCHCCRPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS Human Mature HSPA1A Protein SEQ ID NO: 38 MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGAQGPKGGSGSGPTIEEV D Human Mature HSPA1A cDNA SEQ ID NO: 39 ATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACCTACTCCTGCGTGGGGGTGTTCC AACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACCGCACCACCCCCAGCTACGTGGC CTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAACCAGGTGGCGCTGAACCCGCAG AACACCGTGTTTGACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGACCCGGTGGTGCAGTCGG ACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACAAGCCCAAGGTGCAGGTGAGCTA CAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTCCATGGTGCTGACCAAGATGAAG GAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCGGTGATCACCGTGCCGGCCTACT TCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGATCGCGGGGCTCAACGTGCTGCG GATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCTGGACAGAACGGGCAAGGGGGAG CGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGACGTGTCCATCCTGACGATCGACG ACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACCTGGGTGGGGAGGACTTTGACAA CAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACACAAGAAGGACATCAGCCAGAAC AAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCCAAGAGGACCCTGTCGTCCAGCA CCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCGACTTCTACACGTCCATCACCAG GGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCACCCTGGAGCCCGTGGAGAAGGCT CTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTGGTCCTGGTCGGGGGCTCCACCC GCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCAT CAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAG TCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCCCTGTCGCTGGGGCTGGAGACGG CCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGAT CTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCC ATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGCGGCATCCCTCCGGCCCCCAGGG GCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACGGCATCCTGAACGTCACGGCCAC GGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAG GAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAAGCGGAGGACGAGGTGCAGCGCG AGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCAACATGAAGAGCGCCGTGGAGGA TGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAAGGTTCTGGACAAGTGTCAAGAG GTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGACGAGTTTGAGCACAAGAGGAAGG AGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACCAGGGTGCCGGTGGTCCCGGGCC TGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTCAGGCCCCACCATTGAGGAGGTG GATTAG Human HSPA1A Gene Sequence SEQ ID NO: 40 AACGGCTAGCCTGAGGAGCTGCTGCGACAGTCCACTACCTTTTTCGAGAGTGACTCCCGTTGTC CCAAGGCTTCCCAGAGCGAACCTGTGCGGCTGCAGGCACCGGCGCGTCGAGTTTCCGGCGTCCG GAAGGACCGAGCTCTTCTCGCGGATCCAGTGTTCCGTTTCCAGCCCCCAATCTCAGAGCGGAGC CGACAGAGAGCAGGGAACCGGCATGGCCAAAGCCGCGGCGATCGGCATCGACCTGGGCACCACC TACTCCTGCGTGGGGGTGTTCCAACACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACC GCACCACCCCCAGCTACGTGGCCTTCACGGACACCGAGCGGCTCATCGGGGATGCGGCCAAGAA CCAGGTGGCGCTGAACCCGCAGAACACCGTGTTTGACGCGAAGCGGCTGATTGGCCGCAAGTTC GGCGACCCGGTGGTGCAGTCGGACATGAAGCACTGGCCTTTCCAGGTGATCAACGACGGAGACA AGCCCAAGGTGCAGGTGAGCTACAAGGGGGAGACCAAGGCATTCTACCCCGAGGAGATCTCGTC CATGGTGCTGACCAAGATGAAGGAGATCGCCGAGGCGTACCTGGGCTACCCGGTGACCAACGCG GTGATCACCGTGCCGGCCTACTTCAACGACTCGCAGCGCCAGGCCACCAAGGATGCGGGTGTGA TCGCGGGGCTCAACGTGCTGCGGATCATCAACGAGCCCACGGCCGCCGCCATCGCCTACGGCCT GGACAGAACGGGCAAGGGGGAGCGCAACGTGCTCATCTTTGACCTGGGCGGGGGCACCTTCGAC GTGTCCATCCTGACGATCGACGACGGCATCTTCGAGGTGAAGGCCACGGCCGGGGACACCCACC TGGGTGGGGAGGACTTTGACAACAGGCTGGTGAACCACTTCGTGGAGGAGTTCAAGAGAAAACA CAAGAAGGACATCAGCCAGAACAAGCGAGCCGTGAGGCGGCTGCGCACCGCCTGCGAGAGGGCC AAGAGGACCCTGTCGTCCAGCACCCAGGCCAGCCTGGAGATCGACTCCCTGTTTGAGGGCATCG ACTTCTACACGTCCATCACCAGGGCGAGGTTCGAGGAGCTGTGCTCCGACCTGTTCCGAAGCAC CCTGGAGCCCGTGGAGAAGGCTCTGCGCGACGCCAAGCTGGACAAGGCCCAGATTCACGACCTG GTCCTGGTCGGGGGCTCCACCCGCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACG GGCGCGACCTGAACAAGAGCATCAACCCCGACGAGGCTGTGGCCTACGGGGCGGCGGTGCAGGC GGCCATCCTGATGGGGGACAAGTCCGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCTCCC CTGTCGCTGGGGCTGGAGACGGCCGGAGGCGTGATGACTGCCCTGATCAAGCGCAACTCCACCA TCCCCACCAAGCAGACGCAGATCTTCACCACCTACTCCGACAACCAACCCGGGGTGCTGATCCA GGTGTACGAGGGCGAGAGGGCCATGACGAAAGACAACAATCTGTTGGGGCGCTTCGAGCTGAGC GGCATCCCTCCGGCCCCCAGGGGCGTGCCCCAGATCGAGGTGACCTTCGACATCGATGCCAACG GCATCCTGAACGTCACGGCCACGGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAA CGACAAGGGCCGCCTGAGCAAGGAGGAGATCGAGCGCATGGTGCAGGAGGCGGAGAAGTACAAA GCGGAGGACGAGGTGCAGCGCGAGAGGGTGTCAGCCAAGAACGCCCTGGAGTCCTACGCCTTCA ACATGAAGAGCGCCGTGGAGGATGAGGGGCTCAAGGGCAAGATCAGCGAGGCGGACAAGAAGAA GGTGCTGGACAAGTGTCAAGAGGTCATCTCGTGGCTGGACGCCAACACCTTGGCCGAGAAGGAC GAGTTTGAGCACAAGAGGAAGGAGCTGGAGCAGGTGTGTAACCCCATCATCAGCGGACTGTACC AGGGTGCCGGTGGTCCCGGGCCTGGGGGCTTCGGGGCTCAGGGTCCCAAGGGAGGGTCTGGGTC AGGCCCCACCATTGAGGAGGTAGATTAGGGGCCTTTCCAAGATTGCTGTTTTTGTTTTGGAGCT TCAAGACTTTGCATTTCCTAGTATTTCTGTTTGTCAGTTCTCAATTTCCTGTGTTTGCAATGTT GAAATTTTTTGGTGAAGTACTGAACTTGCTTTTTTTCCGGTTTCTACATGCAGAGATGAATTTA TACTGCCATCTTACGACTATTTCTTCTTTTTAATACACTTAACTCAGGCCATTTTTTAAGTTGG TTACTTCAAAGTAAATAAACTTTAAAATTCAA Mouse Mature HSPA1A Protein SEQ ID NO: 41 MAKNTAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDAVVQSDMKHWPFQVVNDGDKPKVQVNYKGESRSFFPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRGTLEPVEKA LRDAKMDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQTFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAERYKAEDEVQRDRVAAKNALESYAFNMKSAVEDEGLKGKLSEADKKKVLDKCQE VISWLDSNTLADKEEFVHKREELERVCSPIISGLYQGAGAPGAGGFGAQAPKGASGSGPTIEEV D Mouse Mature HSPA1A cDNA SEQ ID NO: 42 ATGGCCAAGAACACGGCGATCGGCATCGACCTGGGCACCACCTACTCGTGCGTGGGCGTGTTCC AGCACGGCAAGGTGGAGATCATCGCCAACGACCAGGGCAACCGCACGACCCCCAGCTACGTGGC CTTCACCGACACCGAGCGCCTCATCGGAGACGCCGCCAAGAACCAGGTGGCGCTGAACCCGCAG AACACCGTGTTCGACGCGAAGCGGCTGATCGGCCGCAAGTTCGGCGATGCGGTGGTGCAGTCCG ACATGAAGCACTGGCCCTTCCAGGTGGTGAACGACGGCGACAAGCCCAAGGTGCAGGTGAACTA CAAGGGCGAGAGCCGGTCGTTCTTCCCGGAGGAGATCTCGTCCATGGTGCTGACGAAGATGAAG GAGATCGCTGAGGCGTACCTGGGCCACCCGGTGACCAACGCGGTGATCACGGTGCCCGCCTACT TCAACGACTCTCAGCGGCAGGCCACCAAGGACGCGGGCGTGATCGCCGGTCTAAACGTGCTGCG GATCATCAACGAGCCCACGGCGGCCGCCATCGCCTACGGGCTGGACCGGACCGGCAAGGGCGAG CGCAACGTGCTCATCTTCGACCTGGGGGGCGGCACGTTCGACGTGTCCATCCTGACGATCGACG ACGGCATCTTCGAGGTGAAGGCCACGGCGGGCGACACGCACCTGGGAGGGGAGGACTTCGACAA CCGGCTGGTGAGCCACTTCGTGGAGGAGTTCAAGAGGAAGCACAAGAAGGACATCAGCCAGAAC AAGCGCGCGGTGCGGCGGCTGCGCACTGCGTGTGAGAGGGCCAAGAGGACGCTGTCGTCCAGCA CCCAGGCCAGCCTGGAGATCGACTCTCTGTTCGAGGGCATCGACTTCTACACATCCATCACGCG GGCGCGGTTCGAAGAGCTGTGCTCAGACCTGTTCCGCGGCACGCTGGAGCCCGTGGAGAAGGCC CTGCGCGACGCCAAGATGGACAAGGCGCAGATCCACGACCTGGTGCTGGTGGGCGGCTCGACGC GCATCCCCAAGGTGCAGAAGCTGCTGCAGGACTTCTTCAACGGGCGCGACCTGAACAAGAGCAT CAACCCGGACGAGGCGGTGGCCTACGGGGCGGCGGTGCAGGCGGCCATCCTGATGGGGGACAAG TCGGAGAACGTGCAGGACCTGCTGCTGCTGGACGTGGCGCCGCTGTCGCTGGGCCTGGAGACTG CGGGCGGCGTGATGACGGCGCTCATCAAGCGCAACTCCACCATCCCCACCAAGCAGACGCAGAC CTTCACCACCTACTCGGACAACCAGCCCGGGGTGCTGATCCAGGTGTACGAGGGCGAGAGGGCC ATGACGCGCGACAACAACCTGCTGGGGCGCTTCGAACTGAGCGGCATCCCGCCGGCGCCCAGGG GCGTGCCACAGATCGAGGTGACCTTCGACATCGACGCCAACGGCATCCTGAACGTCACGGCCAC CGACAAGAGCACCGGCAAGGCCAACAAGATCACCATCACCAACGACAAGGGCCGCCTGAGCAAG GAGGAGATCGAGCGCATGGTGCAGGAGGCCGAGCGCTACAAGGCCGAGGACGAGGTGCAGCGCG ACAGGGTGGCCGCCAAGAACGCGCTCGAATCCTATGCCTTCAACATGAAGAGCGCCGTGGAGGA CGAGGGTCTCAAGGGCAAGCTCAGCGAGGCTGACAAGAAGAAGGTGCTGGACAAGTGCCAGGAG GTCATCTCCTGGCTGGACTCCAACACGCTGGCCGACAAGGAGGAGTTCGTGCACAAGCGGGAGG AGCTGGAGCGGGTGTGCAGCCCCATCATCAGTGGGCTGTACCAGGGTGCGGGTGCTCCTGGGGC TGGGGGCTTCGGGGCCCAGGCGCCCAAGGGAGCCTCTGGCTCAGGACCCACCATCGAGGAGGTG GATTAGA Rhesus Monkey Mature HSPA1A Protein SEQ ID NO: 43 MAKAAAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISGLYQGAGGPGPGGFGAQGPKGGSGSGPTIEEV D Rat Mature HSPA1A Protein SEQ ID NO: 44 MAKKTAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFQVVNDGDKPKVQVNYKGENRSFYPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRGTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQTFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAERYKAEDEVQRERVAAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDSNTLAEKEEFVHKREELERVCNPIISGLYQGAGAPGAGGFGAQAPKGGSGSGPTIEEV D Cattle HSPA1A Protein SEQ ID NO: 45 MAKNMAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALNPQ NTVFDAKRLIGRKFGDPVVQSDMKHWPFRVINDGDKPKVQVSYKGETKAFYPEEISSMVLTKMK EIAEAYLGHPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIAYGLDRTGKGE RNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHEVEEFKRKHKKDISQN KRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARFEELCSDLFRSTLEPVEKA LRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDK SENVQDLLLLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQIFTTYSDNQPGVLIQVYEGERA MTRDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKGRLSK EEIERMVQEAEKYKAEDEVQRERVSAKNALESYAFNMKSAVEDEGLKGKISEADKKKVLDKCQE VISWLDANTLAEKDEFEHKRKELEQVCNPIISRLYQGAGGPGAGGFGAQGPKGGSGSGPTIEEV D Soluble neuropilin-1 polyadenylation signal SEQ ID NO: 46 AAATAAAATACGAAATG