CRYSTAL STRUCTURES OF HUMAN TORSIN-A AND METHODS OF DETERMINING AND USING THE SAME
20170248601 · 2017-08-31
Inventors
- Thomas U. Schwartz (Brookline, MA, US)
- F. Esra Demircioglu (Boston, MA, US)
- Brian A. Sosa-Alvarado (Cambridge, MA, US)
Cpc classification
C12Y306/04
CHEMISTRY; METALLURGY
G01N23/20
PHYSICS
G16B15/30
PHYSICS
G01N2500/04
PHYSICS
G16B15/00
PHYSICS
G16B35/00
PHYSICS
International classification
G01N23/20
PHYSICS
Abstract
A protein composition including TorsinA or TorsinA mutant, LULL1, and a nanobody obtained by immunization using TorsinA and LULL1 is used to grow complex crystals, and three dimensional structures are determined using x-ray data of the crystals. A creening platform is built based on the determined three dimensional structures for designing a drug lead to cure dystonia.
Claims
1. A protein composition, comprising: a target protein; a modulator of the target protein; and a nanobody specifically binds to at least one of the target protein and the modulator.
2. The protein composition of claim 1, wherein the target protein is TorsinA, a mutant of TorsinA, or a portion thereof.
3. The protein composition of claim 2, wherein the target protein comprises an amino acid sequence as set forth in at least one of SEQ ID NO: 1-3 or a portion thereof.
4. The protein composition of claim 1, wherein the modulator is LULL1 or a portion thereof.
5. The protein composition of claim 4, wherein the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4.
6. The protein composition of claim 1, wherein the nanobody is specific for a complex comprising the target protein and the modulator.
7. The protein composition of claim 6, wherein the nanobody is obtained by immunization using the target protein comprising the amino acid sequence set forth in the SEQ ID NO: 2 and the modulator comprising the amino acid sequence set forth in the SEQ ID NO: 4.
8. The protein composition of claim 7, wherein the target protein comprises the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or a portion thereof, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 or a portion thereof, the nanobody comprises the amino acid sequence set forth in the SEQ ID NO: 5 or a portion thereof, and the protein composition is co-expressed, and optionally purified together.
9. A kit comprising a first vector, wherein a first nucleotide sequence encoding a target protein and a second nucleotide sequence encoding the modulator are cloned into the first vector, and a second vector, wherein a third nucleotide sequence encoding a nanobody is cloned into the second vector, and wherein the vectors comprise promoter sequences operably linked to the nucleotide sequences.
10. The kit of claim 9, wherein the vectors are configured for eukaryotic transformation and/or expression.
11. The kit of claim 9, wherein the first nucleotide sequence comprises the nucleic acid sequence set forth in at least one of SEQ ID NO: 6-8, the second nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 9, and the third nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 10.
12. The kit of claim 11, wherein the first vector is a modified ampicillin resistant pETDuet-1 vector, the second vector is a pET-30b(+) vector, and the bacteria is E. coli strain LOBSTR(DE3) RIL.
13. The kit of claim 11, wherein the target protein comprises the amino acid sequence set forth in at least one of SEQ ID NO: 1-3, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332).
14. A method of obtaining protein crystals according to the following steps: preparing the protein composition of claim 1 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock; preparing a mother liquor comprising about 13% (w/v) polyethylene glycol (PEG) 6000, about 5% (v/v) 2-methyl-2,4-pentanediol, and about 0.1M MES pH6.5; mixing approximately equal parts of the protein stock with the mother liquor to form a mixture; and inducing crystallization of the protein composition in the mixture by hanging drop/vapor diffusion under about 18° C., wherein the crystals are obtained in about 3-5 days.
15. The method of claim 14, wherein the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with about 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
16. The protein composition of claim 1, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 3, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).
17. A method of obtaining protein crystals according to the following steps: preparing the protein composition of claim 1 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock solution; preparing a mother liquor comprising about 19% (w/v) polyethylene glycol (PEG) 3350, about 0.2M AMSO.sub.4, and about 0.1M Bis-Tris-HCl pH6.5; and mixing approximately equal parts of the protein stock with 1 the mother liquor to form a mixture, and conducting the crystallization using the mixture by hanging drop/vapor diffusion under 18° C., such that the crystals are obtained in about 3-5 days.
18. The protein composition of claim 17, wherein the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
19. A method of determining the three dimensional structure of a crystallized protein composition to a resolution of about 1.4 Å or better, wherein the protein composition comprises a target protein having the amino acid sequence set forth in the SEQ ID NO: 2 or SEQ ID NO: 3, a modulator of the target protein having the amino acid sequence set forth in the SEQ ID NO: 4, and a nanobody specifically binds to at least one of the target protein and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5; and wherein the method comprises: preparing a first nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 7 or the nucleic acid sequence set forth in the SEQ ID NO: 8, a second nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 9, and a third nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 10; cloning the first nucleotide sequence and the second nucleotide sequence to a first vector; cloning the third nucleotide sequence to a second vector; transforming bacteria using the first vector and the second vector; growing the bacteria that expressing the target protein, the modulator and the nanobody; purifying the target protein, the modulator and the nanobody together to obtain a protein composition; crystallizing the protein composition to obtain crystals; collecting x-ray data using one of the obtained crystals; and determining the three dimensional structure from the collected x-ray data.
20. The method of claim 19, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332 with E171Q).
21. The method of claim 19, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q/ΔE303 mutant structure).
22. A method for screening compounds that bind to TorsinA, comprising: providing a protein composition of claim 1 comprising TorsinA, and a library of test compounds; treating the protein composition with a test compound; determining whether the compound binds to TorsinA, wherein a compound that binds TorsinA is indicative of a compound that is a candidate TorsinA agonist or antagonist; and optionally determining a three dimensional crystal structure of TorsinA with and/or without the compound bound to a resolution of about 1.4 Å or better.
23. The method of claim 22, wherein the crystals of TorsinA are grown using a protein composition comprising: the TorsinA having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3, a modulator of the TorsinA having the amino acid sequence set forth in the SEQ ID NO: 4, and a nanobody specifically binds to at least one of the TorsinA and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5.
24. The method of claim 23, wherein the TorsinA comprises TorsinA ΔE303 having the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition having the TorsinA ΔE303, the crystallized protein composition having TorsinA ΔE303 can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q/ΔE303 mutant structure).
25. The method of claim 24, wherein the TorsinA comprises TorsinA E171Q having the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition having TorsinA E171Q can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q).
26. The method of claim 25, wherein a binding location of the modulator is determined by comparing the three dimensional structure of the crystallized protein composition having TorsinA ΔE303 and the three dimensional structure of the crystallized protein composition having TorsinA E171Q.
27. The method of claim 26, wherein the modulator is virtually screened against the binding location of the three dimensional structure of the TorsinA ΔE303.
28. The method of claim 23, wherein the modulator is co-crystallized with the TorsinA ΔE303 and at least one of the modulator and the nanobody to obtain a three dimensional structure having the TorsinA ΔE303 and the modulator, such that optimization of the modulator is conducted based on the three dimensional structure having the TorsinA ΔE303.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0045] The accompanying drawings illustrate one or more embodiments of the invention and together with the written description, serve to explain the principles of the invention. Wherever possible, the same reference numbers are used throughout the drawings to refer to the same or like elements of an embodiment.
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
DETAILED DESCRIPTION
[0064] The present invention is more particularly described in the following examples that are intended as illustrative only since numerous modifications and variations therein will be apparent to those skilled in the art. Various embodiments of the invention are now described in detail. Referring to the drawings, like numbers indicate like components throughout the views. As used in the description herein and throughout the claims that follow, the meaning of “a”, “an”, and “the” includes plural reference unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise. Moreover, titles or subtitles may be used in the specification for the convenience of a reader, which shall have no influence on the scope of the present invention. Additionally, some terms used in this specification are more specifically defined below.
[0065] Some references, which may include patents, patent applications and various publications, are cited and discussed in the description of this invention. The citation and/or discussion of such references is provided merely to clarify the description of the present invention and is not an admission that any such reference is “prior art” to the invention described herein. All references cited and discussed in this specification are incorporated herein by reference in their entireties and to the same extent as if each reference was individually incorporated by reference.
[0066] The terms used in this specification generally have their ordinary meanings in the art, within the context of the invention, and in the specific context where each term is used. Certain terms that are used to describe the invention are discussed below, or elsewhere in the specification, to provide additional guidance to the practitioner regarding the description of the invention. For convenience, certain terms may be highlighted, for example using italics and/or quotation marks. The use of highlighting has no influence on the scope and meaning of a term; the scope and meaning of a term is the same, in the same context, whether or not it is highlighted. It will be appreciated that same thing can be said in more than one way. Consequently, alternative language and synonyms may be used for any one or more of the terms discussed herein, nor is any special significance to be placed upon whether or not a term is elaborated or discussed herein. Synonyms for certain terms are provided. A recital of one or more synonyms does not exclude the use of other synonyms. The use of examples anywhere in this specification including examples of any terms discussed herein is illustrative only, and in no way limits the scope and meaning of the invention or of any exemplified term. Likewise, the invention is not limited to various embodiments given in this specification.
[0067] It will be understood that when an element is referred to as being “on” another element, it can be directly on the other element or intervening elements may be present therebetween. In contrast, when an element is referred to as being “directly on” another element, there are no intervening elements present. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
[0068] It will be understood that, although the terms first, second, third etc. may be used herein to describe various elements, components, regions, layers and/or sections, these elements, components, regions, layers and/or sections should not be limited by these terms. These terms are only used to distinguish one element, component, region, layer or section from another element, component, region, layer or section. Thus, a first element, component, region, layer or section discussed below could be termed a second element, component, region, layer or section without departing from the teachings of the present invention.
[0069] Furthermore, relative terms, such as “lower” or “bottom” and “upper” or “top,” may be used herein to describe one element's relationship to another element as illustrated in the Figures. It will be understood that relative terms are intended to encompass different orientations of the device in addition to the orientation depicted in the Figures. For example, if the device in one of the figures is turned over, elements described as being on the “lower” side of other elements would then be oriented on “upper” sides of the other elements. The exemplary term “lower”, can therefore, encompasses both an orientation of “lower” and “upper,” depending of the particular orientation of the figure. Similarly, if the device in one of the figures is turned over, elements described as “below” or “beneath” other elements would then be oriented “above” the other elements. The exemplary terms “below” or “beneath” can, therefore, encompass both an orientation of above and below.
[0070] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure, and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
[0071] As used herein, “around”, “about” or “approximately” shall generally mean within 20 percent, preferably within 10 percent, and more preferably within 5 percent of a given value or range. Numerical quantities given herein are approximate, meaning that the term “around”, “about” or “approximately” can be inferred if not expressly stated.
[0072] As used herein, “plurality” means two or more.
[0073] As used herein, the terms “comprising”, “including”, “carrying”, “having”, “containing”, “involving”, and the like are to be understood to be open-ended, i.e., to mean including but not limited to.
[0074] The terms “TorsinA” or “Torsin-1A”, as used herein, refer to a protein that in humans that is encoded by the TOR1A gene (also know as DQ2 or DYT1).
[0075] The term “nanobody”, as used herein, refers to a single-domain antibody. The single-domain antibody is an antibody fragment consisting of a single monomeric variable antibody domain. Like a whole antibody, it is able to bind selectively to a specific antigen. With a molecular weight of only 12-15 kDa, single-domain antibodies are much smaller than common antibodies (150-160 kDa) which are composed of two heavy protein chains and two light chains, and even smaller than Fab fragments (˜50 kDa, one light chain and half a heavy chain) and single-chain variable fragments (˜25 kDa, two variable domains, one from a light and one from a heavy chain).
[0076] The term “modulator”, as used herein, refer to a substance influencing the binding of a target protein to its ligand or agonist, or inverse agonist.
[0077] The most common cause of early onset primary dystonia, a neuromuscular disease, is a glutamate deletion (ΔE) at position 302/303 of TorsinA, a AAA+ ATPase that resides in the endoplasmic reticulum including the perinuclear space [1, 2]. While the actual function of TorsinA remains elusive [3-6], the ΔE mutation is known to diminish binding of two TorsinA ATPase activators: lamina-associated protein 1 (LAP1) and its paralog, luminal domain like LAP1 (LULL1) [7-9]. Therefore, ΔE is likely a loss-of-function mutation [10]. A single-chain antibody fragment, a so-called nanobody, which specifically binds the TorsinA LULL1 complex, is generated. The nanobody is called VHH BS2. The resulting trimeric TorsinA(ATP)-LULL1-VHH-BS2 complex is stable in vitro and was crystallized. In addition, and most importantly, VHH-BS2 is able to stabilize the weak TorsinAΔE(ATP).LULL1 interaction, thus a TorsinAΔE(ATP)-LULL1-VHH-BS2 can also be made and was crystallized as well. The ability to stabilize a weak interaction with a reagent like VHH-BS2 is extremely rare. Using a nanobody as a crystallization chaperone, both crystal structures are solved and refined to 1.4 Å resolution. A comparison of these structures at this very high resolution shows, in atomic detail, the subtle differences in activator interactions that separate the healthy wild type from the diseased state DYT1 mutant TorsinA. This structure information may provide a structural platform for drug development, as a small molecule that rescues TorsinAΔE could serve as a cure for primary dystonia.
[0078] In one aspect, the present invention relates to a protein composition. As shown in
[0079] The target protein 110 may be human wild type TorsinA (SEQ ID NO: 1), mutant TorsinA.sub.EQ with a glutamate (E) to Glutamine (Q) mutation at position 171 (SEQ ID NO: 2), mutant TorsinA.sub.EQΔE with a glutamate (E) to Glutamine (Q) mutation at position 171 and a glutamate deletion at position 303 (or 302) (SEQ ID NO: 3), as well as TorsinA mutants having ΔF323-Y328, R288Q, F2051, D194V, ΔA14-P15, E121K, V1291, or D216H mutations, and portions of the above proteins. In certain embodiments, the target protein 110 includes the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 (SEQ ID NO: 1 is human TorsinA 51-332, SEQ ID NO: 2 is TorsinA 51-332 with E171Q; SEQ ID NO: 3 is human TorsinA 51-332 with E171Q and ΔE303) or portions thereof.
[0080] The modulator 130 may be an activator, an agonist, an antagonist, or an inverse agonist, of the target protein 110. When the target protein 110 is TorsinA or a mutant of TorsinA, the modulator 130 may be LAP1, LULL1, a domain or a portion of LAP1 or LULL1. In certain embodiments, the modulator may also be a drug lead that is able to bind to TorsinA or its mutant, and the drug lead may be improved based on the three dimensional complex structure of the TorsinA or its mutant and the drug lead. In certain embodiments, the modulator 130 is LULL1 or portions thereof. In one embodiment, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 (SEQ ID NO: 4 is LULL1 233-470) or portions thereof.
[0081] The nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130. In certain embodiment, the nanobody 150 may be obtained by immunizing a model animal using both the target protein 110 and the modulator 130. In certain embodiments, the nanobody is obtained by immunization using the target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 and the modulator 130 having the amino acid sequence set forth in the SEQ ID NO: 4, or portions thereof. In certain embodiments, the obtained nanobody 150 has the amino acid sequence set forth in the SEQ ID NO:5, or portions thereof.
[0082] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2 or SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, the nanobody 150 includes the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof, and the target protein 110, themodulator 130 and the nanobody 150 in the protein composition are co-expressed and purified together.
[0083] Referring to
[0084] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition 110 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ51-332 structure).
[0085] In certain embodiments, as shown in
[0086] After the crystals are observed and grow to a sufficient size, the crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol. Single crystal is preferably used in the flash-frozen. X-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
[0087] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ51-332 ΔE303 mutant structure).
[0088] In certain embodiments, the protein composition 100 is crystallized to obtain crystals by the following steps: preparing the protein composition 100 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock; preparing a mother liquor comprising about 19% (w/v) polyethylene glycol (PEG) 3350, about 0.2 M AMSO.sub.4, and about 0.1 M Bis-Tris-HCl pH6.5; and mixing 1 μl of the protein stock with 1 μl of the mother liquor to form a second mixture, and inducing crystallization of the protein composition in the mixture by hanging drop/vapor diffusion under about 18° C., such that the crystals are obtained in about 3-5 days.
[0089] In certain embodiments, the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
[0090] In another aspect, the present invention related to a method of determining the three dimensional structure of a crystallized protein composition 100 to a resolution of about 1.4 Å or better. In certain embodiments, the protein composition 100 includes a target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, an modulator 130 of the target protein 110 having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130 and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.
[0091] As shown in
[0092] In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition 100 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 structure).
[0093] In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).
[0094] In a further aspect, the present invention relates to a method for screening compounds that bind to TorsinA. In certain embodiments, the method includes providing a protein composition as described above comprising TorsinA, and a library of test compounds, treating the protein composition with a test compound, determining whether the compound binds to TorsinA, where a compound that binds TorsinA is indicative of a compound that is a candidate TorsinA agonist or antagonist, and optionally determining a three dimensional crystal structure of TorsinA with and/or without the bound compound to a resolution of about 1.4 Å or better.
[0095] The TorsinA structure may include TorsinA.sub.EQ structure, TorsinA.sub.EQΔE303 structure, as well as their complex structures with modulators such as LULL1 or LAP1, and/or ATP. After analyzing one or more of the three dimensional structures of TorsinA, a targeting binding area of TorsinA or a targeting binding interface between TorsinA and its modulator, is chosen for designing a lead as drug candidate. The lead may be rationally designed, virtually screened, or directly screened by activity. The lead is then crystallized, for example using the method as shown in
[0096] In certain embodiments, the crystals of TorsinA are grown using a protein composition 100 including: TorsinA having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, a modulator of TorsinA having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody specifically binds to at least one of TorsinA and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.
[0097] In certain embodiments, TorsinA includes TorsinA.sub.EQ ΔE303 having the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition having the TorsinA.sub.EQΔE303, the crystallized protein composition having TorsinA ΔE303 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).
[0098] In certain embodiments, the TorsinA comprises TorsinA E171Q having the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition having TorsinA E171Q can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q).
[0099] In certain embodiments, a binding location of the modulator is determined by comparing the three dimensional structure of the crystallized protein composition having TorsinA ΔE303 and the three dimensional structure of the crystallized protein composition having TorsinA E171Q.
[0100] In certain embodiments, the modulator is virtually screened against the binding location of the three dimensional structure of the TorsinA.sub.EQ ΔE303.
[0101] In certain embodiments, the modulator is co-crystallized with the TorsinA.sub.EQ ΔE303 and at least one of the modulator and the nanobody to obtain a three dimensional structure having the TorsinA.sub.EQ ΔE303 and the modulator, such that modification of the modulator is conducted based on the three dimensional structure having the TorsinA ΔE303.
[0102] Certain embodiments of the present application, among other things, crystallized TorsinA which is a difficult to crystallize. Using this method, variety of TorsinA mutants and their complex structures can be determined. This is not achieved by any others before this invention.
[0103] Further, by comparing the TorsinA.sub.EQ structure and TorsinA.sub.EQ ΔE303 structure, a novel functional mechanism and novel binding site is determined, which can be used as the basis for structural based rational drug design. This information provides a structural platform to develop drug that can rescue TorsinA ΔE303 or other type of mutants so that the TorsinA ΔE303 become functional. The drug is then useful for cure primary dystonia.
[0104] These and other aspects of the present invention are more specifically described below. Without intent to limit the scope of the invention, exemplary methods and their related results according to the embodiments of the present invention are given below. Note that titles or subtitles may be used in the examples for convenience of a reader, which in no way should limit the scope of the invention. Moreover, certain theories are proposed and disclosed herein; however, in no way they, whether they are right or wrong, should limit the scope of the invention so long as the invention is practiced according to the invention without regard for any particular theory or scheme of action.
EXAMPLES
Example 1: Generation and Selection of Nanobodies
[0105] To investigate the molecular basis for primary dystonia as a result of the glutamate 302/303 deletion in TorsinA, a structural approach is taken. TorsinA is a catalytically inactive AAA+ ATPase [11-13], notoriously ill-behaved in vitro, primarily due to its limited solubility and stability. These problems were partially overcome by stabilizing an ATP-trapped E171Q mutant of human TorsinA (residues 51-332; SEQ ID NO: 2) by co-expressing it with the luminal activation domain of human LULL1 (residues 233-470; SEQ ID NO: 4). This resulted in a better behaved heterodimeric complex (
[0106] Specifically, for obtaining the VHH-BS2 nanobody, purified human TorsinA.sub.EQ-LULL1 complex was injected into a male alpaca (Lama pacos) for immunization. Generation and screening of nanobodies was carried out as previously described [14]. Each of the selected nanobodies was subcloned into a pET-30b(+) vector with a C-terminal His.sub.6-tag. Each nanobody was bacterially expressed and Ni.sup.2+-affinity purified essentially as described (see below). Different from the TorsinA-containing preparations, MgCl.sub.2 and ATP were eliminated from all buffer solutions. The Ni.sup.2+-eluate was purified via size exclusion chromatography on a Superdex S75 column (GE Healthcare) in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl). Nanobody binding was validated by size exclusion chromatography on a 10/300 Superdex S200 column in 10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2 and 0.5 mM ATP. Equimolar amounts of TorsinA.sub.EQ-LULL1 and TorsinA.sub.EQ-LULL1-VHH were loaded and nanobody binding was monitored by a shift in the elution profile and via SDS-PAGE analysis. After validating VHH-BS2 interaction with TorsinA.sub.EQ-LULL1, the C-terminal His.sub.6-tag of VHH-BS2 was removed from the pET-30b(+) vector for co-purification experiments.
Example 2: Constructs, Protein Expression and Purification
[0107] DNA sequences encoding human TorsinA (residues 51-332) and the luminal domain of human LULL1 (residues 233-470) were cloned into a modified ampicillin resistant pETDuet-1 vector (EMD Millipore). TorsinA, N-terminally fused with a human rhinovirus 3C protease cleavable 10xHis-7xArg tag, was inserted into the first multiple cloning site (MCS), whereas the untagged LULL1 was inserted into the second MCS. Mutations on TorsinA and LULL1 were introduced by site-directed mutagenesis. The untagged VHH-BS2 nanobody was cloned into a separate, modified kanamycin resistant pET-30b(+) vector (EMD Biosciences).
[0108] To co-express TorsinA (EQ or EQ/AE), LULL1 and VHH-BS2 for crystallization, the E. coli strain LOBSTR(DE3) RIL (Kerafast) [32] was co-transformed with the two constructs described above. Cells were grown at 37° C. in lysogeny broth (LB) medium supplemented with 100 μg ml.sup.−1 ampicillin, 25 μg ml.sup.−1 kanamaycin and 34 μg ml.sup.−1 hloramphenicol until an optical density (OD.sub.600) of 0.6-0.8 was reached, shifted to 18° C. for 20 min, and induced overnight at 18° C. with 0.2 mM isopropyl β-D-1-thiogalactopyranoside (IPTG). The bacterial cultures were harvested by centrifugation, suspended in lysis buffer (50 mM HEPES/NaOH pH 8.0, 400 mM NaCl, 40 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP) and lysed with a cell disruptor (Constant Systems). The lysate was immediately mixed with 0.1 M phenylmethanesulfonyl fluoride (PMSF) (50 μl per 10 ml lysate) and 250 units of TurboNuclease (Eton Bioscience), and cleared by centrifugation. The soluble fraction was gently mixed with Ni-Sepharose 6 Fast Flow (GE Healthcare) resin for 30 min at 4° C. After washing with the lysis buffer, bound protein was eluted in elution buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 300 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP). The eluted protein complex was immediately purified by size exclusion chromatography on a Superdex S200 column (GE Healthcare) equilibrated in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2, and 0.5 mM ATP). Following the tag removal by 10xHis-7xArg-3C protease, the fusion tags and the protease were separated from the complex by cation-exchange chromatography on a HiTrapS column (GE Healthcare) using a linear NaCl gradient. The flow-through from the cation-exchange chromatography, containing the protein complex, was purified again by size exclusion chromatography on a Superdex S200 column as at the previous step.
[0109] For the non-structural analysis of TorsinA and LULL1 variants, the pETDuet-1-based expression plasmid was transformed into LOBSTR(DE3) RIL cells without co-expressing nanobody VHH-BS2. Ni.sup.2+-affinity purification was performed as described above and bound protein was eluted. Aliquots from the Ni.sup.2+-eluate and the total lysate were collected and analyzed by SDS-PAGE gel electrophoresis.
Example 3: Crystallization
[0110] Purified TorsinA.sub.EQ-LULL1-VHH-BS2 and TorsinA.sub.EQΔE-LULL1-VHH-BS2 complexes were concentrated up to 4-4.5 mg/ml and supplemented with 2 mM ATP prior to crystallization. The TorsinA.sub.EQ containing complex crystallized in 13% (w/v) polyethylene glycol (PEG) 6000, 5% (v/v) 2-Methyl-2,4-pentanediol, and 0.1 M MES pH 6.5. The TorsinA.sub.EQAE containing complex crystallized in 19% (w/v) PEG 3350, 0.2 M AmSO4, and 0.1 M Bis-Tris-HCl pH 6.5. Crystals of both complexes grew at 18° C. in hanging drops containing 1 μl of protein and 1 μl of mother liquor. Clusters of diffraction quality, rod-shaped crystals formed within 3-5 days. Single crystals were briefly soaked in mother liquor supplemented with 20% (v/v) glycerol for cryoprotection and flash-frozen in liquid nitrogen.
Example 4: Data Collection and Structure Determination
[0111] X-ray data were collected at NE-CAT beamline 24-ID-C at Argonne National Laboratory. Data reduction was performed with the HKL2000 package [33], and all subsequent data-processing steps were carried out using programs provided through SBGrid [34]. The structure of the TorsinA.sub.EQ-LULL1-VHH-BS2 complex was solved by molecular replacement (MR) using the Phaser-MR tool from the PHENIX suite [35]. A three-part MR solution was easily obtained using a sequential search for models of LULL1, VHH-BS2, and TorsinA. The LULL1 model was generated based on the published human LAP1 structure (PDB 4TVS, chain A), using the Sculptor utility of the PHENIX suite (LULL.sub.1241-470 and LAP.sub.1356-583 share 64% sequence identity). The VHH-BS2 model was based on VHH-BS1 (PDB 4TVS, chain a) after removing the complementarity determining regions (CDRs). The poly-Ala model of TorsinA was generated based on E. coli ClpA (PDB 1R6B) using the MODELLER tool of the HHpred server [36]. The asymmetric unit contains one TorsinA.sub.EQ-LULL1-VHH-BS2 complex. Iterative model building and refinement steps gradually improved the electron density maps and the model statistics. The stereochemical quality of the final model was validated by Molprobity [37]. TorsinA.sub.EQΔE-LULL1-VHH-BS2 crystallized in the same unit cell. Model building was carried starting from a truncated TorsinA.sub.EQ-LULL1-VHH-BS2 structure. All manual model building steps were carried out with Coot [38], and phenix.refine was used for iterative refinement. Two alternate conformations of a loop in LULL1 (residues 428-438) were detected in the Fo-Fc difference electron density maps of both structures, and they were partially built. For comparison, the cysteine residues of TorsinA at the catalytic site (residues 280 and 319 in the TorsinA.sub.EQ structure) were built in the reduced and the oxidized states, respectively. Building them as oxidized, disulfide-bridged residues consistently produced substantial residual Fo-Fc difference density, which disappeared assuming a reduced state. Statistical parameters of data collection and refinement are all given in Table 1 in
Example 5: Bioinformatic Analysis
[0112] Torsin and LAP1/LULL1 sequences were obtained via PSI-BLAST [39] and Backphyre searches [40]. Transmembrane domains were predicted using the HMMTOP tool [41]. LAP1/LULL1 proteins were distinguished based on the calculated isoelectric point (pI) of their extra-luminal portions. The intranuclear domain of LAP1 has a characteristically high pI of ˜8.5-10 due to a clustering of basic residues, while the cytoplasmic domain of LULL1 is distinctively more acidic. Multiple sequence alignments were performed using MUSCLE [42], and visualized by Jalview [43]. To illustrate evolutionary conservation on TorsinA and LULL1 surfaces, conservation scores for each residue were calculated using the ConSurf server with default parameters [44].
[0113] The sequences, which were used to generate the multiple sequence alignments, were also used for preparing the sequence logos of Torsins and LAP1/LULL1 in
Example 6: Structure Analysis
[0114] A stable, heterotrimeric complex of TorsinA.sub.EQ(ATP)-LULL1-VHH-BS2 was readily crystallized in the presence of ATP. A 1.4 Å dataset was collected and the structure was solved by molecular replacement, using the LULL1-homolog LAP1 and a VHH template as search models [14] (Example 4, and Table 1 in
[0115] AAA+ ATPases are organized into a number of structurally defined clades [12, 18], distinguished by shared structural elements. Comparison with other AAA+ ATPase structures shows that TorsinA fits best into a clade that also contains the bacterial proteins HslU, ClpA/B, ClpX, and Lon, all of which are involved in protein degradation or remodeling [13]. These AAA+ family members share a β-hairpin insertion that precedes the sensor-I region (
[0116] The interaction of TorsinA with its ATPase activators LULL1 and LAP1 is of particularly importance, as a prominent mutation causing primary dystonia—the deletion of glutamate 302 or 303—weakens these interaction [7-9]. But why and how? The TorsinA-LULL1 interface extends over an area of 1527 Å.sup.2. The main structural elements involved in this interaction are the nucleotide-binding region as well as the small domain of TorsinA, and helices α0, α2, α4 and α5 of LULL1 (
[0117] To investigate the atomic details of the weakened binding of TorsinAΔE to LAP1/LULL1, and thus the molecular basis of primary dystonia, we made use of the observation that VHH-BS2 also stabilizes the TorsinA.sub.EQAE(ATP)-LULL1 interaction. We were able to crystallize TorsinA.sub.EQAE(ATP)-LULL1-VHH-BS2 and determine its structure at a resolution of 1.4 Å. Not surprisingly, the overall structure is almost identical to the wild type protein (0.34 Å rmsd over 274 Ca atoms for TorsinA, 0.26 Å rmsd over 229 Ca atoms for LULL1), except for critical differences in the TorsinA-LULL1 interface (
[0118] Although TorsinAΔE303 is the most prevalent mutation that causes primary dystonia, it is not the only one [5, 6]. We examined the structural consequence of all known mutations (
[0119] The biological function of TorsinA remains enigmatic [24-28]. Because TorsinA belongs to the AAA+ ATPase superfamily, with specific homology to the bacterial proteins HslU, ClpX, ClpA/B and Lon, it is generally assumed that TorsinA is involved in protein remodeling or protein degradation [5, 6]. However, a substrate of TorsinA has yet to be identified.
[0120] The TorsinA structure enables a more thorough comparison to other AAA+ ATPases. After the discovery that LAP1/LULL1 are Arg-finger containing TorsinA activators, it seemed reasonable to suggest that TorsinA and LAP1/LULL1 likely form heterohexameric rings ((TorsinA-ATP-LAP1/LULL1).sub.3) in order to function [14, 16]. However, the predominant oligomeric form of the TorsinA-ATP-LAP1/LULL1 complex in solution is largely heterodimeric, with the heterohexameric form present as only a small fraction [14, 16, 29-31]. Our structure now raises doubts about the physiological relevance of a heterohexameric ring (
[0121] The foregoing description of the exemplary embodiments of the invention has been presented only for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in light of the above teaching.
[0122] The embodiments were chosen and described in order to explain the principles of the invention and their practical application so as to enable others skilled in the art to utilize the invention and various embodiments and with various modifications as are suited to the particular use contemplated. Alternative embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its spirit and scope. Accordingly, the scope of the present invention is defined by the appended claims rather than the foregoing description and the exemplary embodiments described therein.
REFERENCE LIST
[0123] The following references are incorporated herein by reference in their entirety for all purposes. [0124] [1] Ozelius, L. J. et al. The early-onset torsion dystonia gene (DYT1) encodes an ATP-binding protein. Nat. Genet. 17, 40-48 (1997). [0125] [2] Breakefield, X. O. et al. The pathophysiological basis of dystonias. Nat. Rev. Neurosci. 9, 222-234 (2008). [0126] [3] Granata, A. & Warner, T. T. The role of torsinA in dystonia. Eur. J. Neurol. 17 Suppl 1, 81-87 (2010). [0127] [4] McCullough, J. & Sundquist, W. I. Putting a finger in the ring. Nat. Struct. Mol. Biol. 21, 1025-1027 (2014). [0128] [5] Rose, A. E., Brown, R. S. H. & Schlieker, C. Torsins: not your typical AAA+ ATPases. Crit. Rev. Biochem. Mol. Biol. 50, 532-549 (2015). [0129] [6] Laudermilch, E. & Schlieker, C. TorsinATPases: structural insights and functional perspectives. Curr. Opin. Cell Biol. 40, 1-7 (2016). [0130] [7] Naismith, T. V., Dalal, S. & Hanson, P. I. Interaction of torsinA with its major binding partners is impaired by the dystonia-associated DeltaGAG deletion. J. Biol. Chem. 284, 27866-27874 (2009). [0131] [8] Zhu, L., Millen, L., Mendoza, J. L. & Thomas, P. J. A unique redox-sensing sensor II motif in TorsinA plays a critical role in nucleotide and partner binding. J. Biol. Chem. 285, 37271-37280 (2010). [0132] [9] Zhao, C., Brown, R. S. H., Chase, A. R., Eisele, M. R. & Schlieker, C. Regulation of TorsinATPases by LAP1 and LULL1. Proc. Natl. Acad. Sci. U.S.A. 110, E1545-54 (2013). [0133] [10] Goodchild, R. E., Kim, C. E. & Dauer, W. T. Loss of the dystonia-associated protein torsinA selectively disrupts the neuronal nuclear envelope. Neuron 48, 923-932 (2005). [0134] [11] Hanson, P. I. & Whiteheart, S. W. AAA+ proteins: have engine, will work. Nat. Rev. Mol. Cell Biol. 6, 519-529 (2005). [0135] [12] Erzberger, J. P. & Berger, J. M. Evolutionary relationships and structural mechanisms of AAA+ proteins. Annu Rev Biophys Biomol Struct 35, 93-114 (2006). [0136] [13] Olivares, A. O., Baker, T. A. & Sauer, R. T. Mechanistic insights into bacterial AAA+ proteases and protein-remodelling machines. Nat. Rev. Microbiol. 14, 33-44 (2016). [0137] [14] Sosa, B. A. et al. How lamina-associated polypeptide 1 (LAP1) activates Torsin. Elife 3, e03239 (2014). [0138] [15] Wendler, P., Ciniawsky, S., Kock, M. & Kube, S. Structure and function of the AAA+ nucleotide binding pocket. Biochim. Biophys. Acta 1823, 2-14 (2012). [0139] [16] Brown, R. S. H., Zhao, C., Chase, A. R., Wang, J. & Schlieker, C. The mechanism of TorsinATPase activation. Proc. Natl. Acad. Sci. U.S.A. 111, E4822-31 (2014). [0140] [17] Muyldermans, S. Nanobodies: natural single-domain antibodies. Annu. Rev. Biochem. 82, 775-797 (2013). [0141] [18] Iyer, L. M., Leipe, D. D., Koonin, E. V. & Aravind, L. Evolutionary history and higher order classification of AAA+ ATPases. J. Struct. Biol. 146, 11-31 (2004). [0142] [19] Sauer, R. T. & Baker, T. A. AAA+ proteases: ATP-fueled machines of protein destruction. Annu. Rev. Biochem. 80, 587-612 (2011). [0143] [20] Zhu, L., Wrabl, J. O., Hayashi, A. P., Rose, L. S. & Thomas, P. J. The torsin-family AAA+protein OOC-5 contains a critical disulfide adjacent to Sensor-II that couples redox state to nucleotide binding. Mol. Biol. Cell 19, 3599-3612 (2008). [0144] [21] Goodchild, R. E. & Dauer, W. T. The AAA+ protein torsinA interacts with a conserved domain present in LAP1 and a novel ER protein. J. Cell Biol. 168, 855-862 (2005). [0145] [22] Goodchild, R. E. & Dauer, W. T. Mislocalization to the nuclear envelope: an effect of the dystonia-causing torsinA mutation. Proc. Natl. Acad. Sci. U.S.A. 101, 847-852 (2004). [0146] [23] Kim, C. E., Perez, A., Perkins, G., Ellisman, M. H. & Dauer, W. T. A molecular mechanism underlying the neural-specific defect in torsinA mutant mice. Proc. Natl. Acad. Sci. U.S.A. 107, 9861-9866 (2010). [0147] [24] Nery, F. C. et al. TorsinA binds the KASH domain of nesprins and participates in linkage between nuclear envelope and cytoskeleton. J. Cell. Sci. 121, 3476-3486 (2008). [0148] [25] Granata, A., Koo, S. J., Haucke, V., Schiavo, G. & Warner, T. T. CSN complex controls the stability of selected synaptic proteins via a torsinA-dependent process. EMBO J. 30, 181-193 (2011). [0149] [26] Nery, F. C. et al. TorsinA participates in endoplasmic reticulum-associated degradation. Nat Commun 2, 393 (2011). [0150] [27] Jokhi, V. et al. Torsin mediates primary envelopment of large ribonucleoprotein granules at the nuclear envelope. Cell Rep 3, 988-995 (2013). [0151] [28] Liang, C.-C., Tanabe, L. M., Jou, S., Chi, F. & Dauer, W. T. TorsinA hypofunction causes abnormal twisting movements and sensorimotor circuit neurodegeneration. J. Clin. Invest. 124, 3080-3092 (2014). [0152] [29] Jungwirth, M., Dear, M. L., Brown, P., Holbrook, K. & Goodchild, R. Relative tissue expression of homologous torsinB correlates with the neuronal specific importance of DYT1 dystonia-associated torsinA. Hum. Mol. Genet. 19, 888-900 (2010). [0153] [30] Vander Heyden, A. B., Naismith, T. V., Snapp, E. L., Hodzic, D. & Hanson, P. I. LULL1 retargets TorsinA to the nuclear envelope revealing an activity that is impaired by the DYT1 dystonia mutation. Mol. Biol. Cell 20, 2661-2672 (2009). [0154] [31] Goodchild, R. E. et al. Access of torsinA to the inner nuclear membrane is activity dependent and regulated in the endoplasmic reticulum. J. Cell. Sci. 128, 2854-2865 (2015). [0155] [32] Andersen, K. R., Leksa, N. C. & Schwartz, T. U. Optimized E. coli expression strain LOBSTR eliminates common contaminants from His-tag purification. Proteins 81, 1857-1861 (2013). [0156] [33] Otwinowski, Z. & Minor, W. [20] Processing of X-ray diffraction data collected in oscillation mode. Methods in Enzymology 276, 307-326 (Elsevier, 1997). [0157] [34] Morin, A. et al. Collaboration gets the most out of software. Elife 2, e01456 (2013). [0158] [35] Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213-221 (2010). [0159] [36] Söding, J., Biegert, A. & Lupas, A. N. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 33, W244-8 (2005). [0160] [37] Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 66, 12-21 (2010). [0161] [38] Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486-501 (2010). [0162] [39] Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402 (1997). [0163] [40] Kelley, L. A. & Sternberg, M. J. E. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc 4, 363-371 (2009). [0164] [41] Tusnády, G. E. & Simon, I. The HMMTOP transmembrane topology prediction server.
[0165] Bioinformatics 17, 849-850 (2001). [0166] [42] Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113 (2004). [0167] [43] Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189-1191 (2009). [0168] [44] Glaser, F. et al. ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information. Bioinformatics 19, 163-164 (2003). [0169] [45] Crooks, G. E., Hon, G., Chandonia, J.-M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188-1190 (2004). [0170] [46] Zeymer, C., Barends, T. R. M., Werbeck, N. D., Schlichting, I. & Reinstein, J. Elements in nucleotide sensing and hydrolysis of the AAA+ disaggregation machine ClpB: a structure-based mechanistic dissection of a molecular motor. Acta Crystallogr. D Biol. Crystallogr. 70, 582-595 (2014). [0171] [47] Leung, J. C. et al. Novel mutation in the TOR1A (DYT1) gene in atypical early onset dystonia and polymorphisms in dystonia and early onset parkinsonism. Neurogenetics 3, 133-143 (2001). [0172] [48] Zirn, B. et al. Novel TOR1A mutation p.Arg288Gln in early-onset dystonia (DYT1). J. Neurol. Neurosurg. Psychiatr. 79, 1327-1330 (2008). [0173] [49] Calakos, N. et al. Functional evidence implicating a novel TOR1A mutation in idiopathic, late-onset focal dystonia. J. Med. Genet. 47, 646-650 (2010). [0174] [50] Cheng, F.-B. et al. Combined occurrence of a novel TOR1A and a THAP1 mutation in primary dystonia. Mov. Disord. 29, 1079-1083 (2014). [0175] [51] Vulinovic, F. et al. Unraveling cellular phenotypes of novel TorsinA/TOR1A mutations. Hum. Mutat. 35, 1114-1122 (2014). [0176] [52] Dobri{hacek over (c)}i{umlaut over (c)}, V. et al. Phenotype of non-c.907_909delGAG mutations in TOR1A: DYT1 dystonia revisited. Parkinsonism Relat. Disord. 21, 1256-1259 (2015). [0177] [53] Kock, N. et al. Effects of genetic variations in the dystonia protein torsinA: identification of polymorphism at residue 216 as protein modifier. Hum. Mol. Genet. 15, 1355-1364 (2006). [0178] [54] Kamm, C. et al. Susceptibility to DYT1 dystonia in European patients is modified by the D216H polymorphism. Neurology 70, 2261-2262 (2008). [0179] [55] Kayman-Kurekci, G. et al. Mutation in TOR1AIP1 encoding LAP1B in a form of muscular dystrophy: a novel gene related to nuclear envelopathies. Neuromuscul. Disord. 24, 624-633 (2014). [0180] [56] Dorboz, I. et al. Severe dystonia, cerebellar atrophy, and cardiomyopathy likely caused by a missense mutation in TOR1AIP1. Orphanet J Rare Dis 9, 174 (2014).
TABLE-US-00001 SEQUENCE LISTING SEQ ID NO: 1 TorsinA (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDEMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 6 TorsinA (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATGAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAA AGGCTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 2 TorsinA E171Q (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDQMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 7 TorsinA E171Q (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATCAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAA AGGCTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 3 TorsinA E171Q ΔE (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDQMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 8 TorsinA E171Q ΔE (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATCAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAAAGG CTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 4 LULL1 (233-470) Protein SSVNSYYSSPAQQVPKNPALEAFLAQFSQLEDKFPGQSSFLWQRGRKFLQ KHLNASNPTEPATIIFTAAREGRETLKCLSHHVADAYTSSQKVSPIQIDG AGRTWQDSDTVKLLVDLELSYGFENGQKAAVVHHFESFPAGSTLIFYKYC DHENAAFKDVALVLTVLLEEETLEASVGPRETEEKVRDLLWAKFTNSDTP TSFNHMDSDKLSGLWSRISHLVLPVQPVSSIEEQ GCLF- SEQ ID NO: 9 LULL1 (233-470) Nucleic acid AGTTCTGTGAATAGCTACTATTCCTCTCCAGCCCAGCAAGTGCCCAAAAA TCCAGCTTTGGAGGCCTTTTTGGCCCAGTTTAGCCAATTGGAAGATAAAT TTCCAGGCCAGAGTTCCTTCCTGTGGCAGAGAGGACGGAAGTTTCTCCAG AAGCACCTCAATGCTTCCAACCCCACTGAGCCAGCCACCATCATATTTAC AGCAGCTCGGGAGGGAAGAGAGACCCTGAAGTGCCTGAGCCACCATGTTG CAGATGCCTACACCTCTTCCCAGAAAGTCTCTCCCATTCAGATTGATGGG GCTGGAAGGACCTGGCAGGACAGTGACACGGTCAAGCTGTTGGTTGACCT GGAGCTGAGCTATGGGTTTGAGAATGGCCAGAAGGCTGCTGTGGTACACC ACTTCGAATCCTTCCCTGCCGGCTCCACTTTGATCTTCTATAAGTATTGT GATCATGAGAATGCTGCCTTTAAAGATGTGGCCCTGGTCCTGACTGTTCT GCTAGAGGAGGAAACATTAGAAGCAAGTGTAGGCCCAAGGGAAACGGAAG AAAAAGTGAGAGACTTACTCTGGGCCAAGTTTACCAACTCTGACACTCCC ACCTCCTTCAACCACATGGACTCAGACAAATTGAGTGGGCTGTGGAGCCG AATTTCACACCTGGTACTGCCAGTCCAGCCAGTGAGTAGCATAGAAGAAC AGGGGTGCCTTTTCTAA SEQ ID NO: 5 VHH-BS2 (1-123) Protein MQVQLVETGGGLVQAGGSLRLSCAASGNIFSFNVMGWYRQAPGKQRELVA AITSGDTTTYADSVQGRFTISRDNAKNAVYLQMNSLTPEDTAVYFCNARR NPINGPYYTTAYWGQGTQVTVSS- SEQ ID NO: 10 VHH-BS2 (1-123) Nucleic acid ATGCAGGTGCAGCTCGTGGAGACAGGCGGGGGGTTGGTGCAGGCTGGGGG CTCTCTGAGGCTCTCCTGTGCAGCCTCTGGAAACATCTTCAGTTTCAATG TCATGGGCTGGTACCGCCAGGCTCCAGGGAAGCAGCGCGAGTTGGTCGCA GCGATCACGAGTGGTGATACGACAACCTATGCAGACTCCGTGCAGGGCCG ATTCACCATCTCCAGAGACAATGCCAAGAACGCGGTGTATCTGCAAATGA ACAGCCTGACACCTGAGGACACGGCCGTCTATTTCTGTAATGCGCGGCGC AATCCGATTAATGGTCCTTACTACACCACAGCCTACTGGGGCCAGGGGAC CCAGGTCACCGTCTCCTCATGA