CRYSTAL STRUCTURES OF HUMAN TORSIN-A AND METHODS OF DETERMINING AND USING THE SAME

20170248601 · 2017-08-31

    Inventors

    Cpc classification

    International classification

    Abstract

    A protein composition including TorsinA or TorsinA mutant, LULL1, and a nanobody obtained by immunization using TorsinA and LULL1 is used to grow complex crystals, and three dimensional structures are determined using x-ray data of the crystals. A creening platform is built based on the determined three dimensional structures for designing a drug lead to cure dystonia.

    Claims

    1. A protein composition, comprising: a target protein; a modulator of the target protein; and a nanobody specifically binds to at least one of the target protein and the modulator.

    2. The protein composition of claim 1, wherein the target protein is TorsinA, a mutant of TorsinA, or a portion thereof.

    3. The protein composition of claim 2, wherein the target protein comprises an amino acid sequence as set forth in at least one of SEQ ID NO: 1-3 or a portion thereof.

    4. The protein composition of claim 1, wherein the modulator is LULL1 or a portion thereof.

    5. The protein composition of claim 4, wherein the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4.

    6. The protein composition of claim 1, wherein the nanobody is specific for a complex comprising the target protein and the modulator.

    7. The protein composition of claim 6, wherein the nanobody is obtained by immunization using the target protein comprising the amino acid sequence set forth in the SEQ ID NO: 2 and the modulator comprising the amino acid sequence set forth in the SEQ ID NO: 4.

    8. The protein composition of claim 7, wherein the target protein comprises the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or a portion thereof, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 or a portion thereof, the nanobody comprises the amino acid sequence set forth in the SEQ ID NO: 5 or a portion thereof, and the protein composition is co-expressed, and optionally purified together.

    9. A kit comprising a first vector, wherein a first nucleotide sequence encoding a target protein and a second nucleotide sequence encoding the modulator are cloned into the first vector, and a second vector, wherein a third nucleotide sequence encoding a nanobody is cloned into the second vector, and wherein the vectors comprise promoter sequences operably linked to the nucleotide sequences.

    10. The kit of claim 9, wherein the vectors are configured for eukaryotic transformation and/or expression.

    11. The kit of claim 9, wherein the first nucleotide sequence comprises the nucleic acid sequence set forth in at least one of SEQ ID NO: 6-8, the second nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 9, and the third nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 10.

    12. The kit of claim 11, wherein the first vector is a modified ampicillin resistant pETDuet-1 vector, the second vector is a pET-30b(+) vector, and the bacteria is E. coli strain LOBSTR(DE3) RIL.

    13. The kit of claim 11, wherein the target protein comprises the amino acid sequence set forth in at least one of SEQ ID NO: 1-3, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332).

    14. A method of obtaining protein crystals according to the following steps: preparing the protein composition of claim 1 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock; preparing a mother liquor comprising about 13% (w/v) polyethylene glycol (PEG) 6000, about 5% (v/v) 2-methyl-2,4-pentanediol, and about 0.1M MES pH6.5; mixing approximately equal parts of the protein stock with the mother liquor to form a mixture; and inducing crystallization of the protein composition in the mixture by hanging drop/vapor diffusion under about 18° C., wherein the crystals are obtained in about 3-5 days.

    15. The method of claim 14, wherein the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with about 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.

    16. The protein composition of claim 1, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 3, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).

    17. A method of obtaining protein crystals according to the following steps: preparing the protein composition of claim 1 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock solution; preparing a mother liquor comprising about 19% (w/v) polyethylene glycol (PEG) 3350, about 0.2M AMSO.sub.4, and about 0.1M Bis-Tris-HCl pH6.5; and mixing approximately equal parts of the protein stock with 1 the mother liquor to form a mixture, and conducting the crystallization using the mixture by hanging drop/vapor diffusion under 18° C., such that the crystals are obtained in about 3-5 days.

    18. The protein composition of claim 17, wherein the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.

    19. A method of determining the three dimensional structure of a crystallized protein composition to a resolution of about 1.4 Å or better, wherein the protein composition comprises a target protein having the amino acid sequence set forth in the SEQ ID NO: 2 or SEQ ID NO: 3, a modulator of the target protein having the amino acid sequence set forth in the SEQ ID NO: 4, and a nanobody specifically binds to at least one of the target protein and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5; and wherein the method comprises: preparing a first nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 7 or the nucleic acid sequence set forth in the SEQ ID NO: 8, a second nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 9, and a third nucleotide sequence comprising the nucleic acid sequence set forth in the SEQ ID NO: 10; cloning the first nucleotide sequence and the second nucleotide sequence to a first vector; cloning the third nucleotide sequence to a second vector; transforming bacteria using the first vector and the second vector; growing the bacteria that expressing the target protein, the modulator and the nanobody; purifying the target protein, the modulator and the nanobody together to obtain a protein composition; crystallizing the protein composition to obtain crystals; collecting x-ray data using one of the obtained crystals; and determining the three dimensional structure from the collected x-ray data.

    20. The method of claim 19, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332 with E171Q).

    21. The method of claim 19, wherein the target protein comprises the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q/ΔE303 mutant structure).

    22. A method for screening compounds that bind to TorsinA, comprising: providing a protein composition of claim 1 comprising TorsinA, and a library of test compounds; treating the protein composition with a test compound; determining whether the compound binds to TorsinA, wherein a compound that binds TorsinA is indicative of a compound that is a candidate TorsinA agonist or antagonist; and optionally determining a three dimensional crystal structure of TorsinA with and/or without the compound bound to a resolution of about 1.4 Å or better.

    23. The method of claim 22, wherein the crystals of TorsinA are grown using a protein composition comprising: the TorsinA having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3, a modulator of the TorsinA having the amino acid sequence set forth in the SEQ ID NO: 4, and a nanobody specifically binds to at least one of the TorsinA and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5.

    24. The method of claim 23, wherein the TorsinA comprises TorsinA ΔE303 having the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition having the TorsinA ΔE303, the crystallized protein composition having TorsinA ΔE303 can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q/ΔE303 mutant structure).

    25. The method of claim 24, wherein the TorsinA comprises TorsinA E171Q having the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition having TorsinA E171Q can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q).

    26. The method of claim 25, wherein a binding location of the modulator is determined by comparing the three dimensional structure of the crystallized protein composition having TorsinA ΔE303 and the three dimensional structure of the crystallized protein composition having TorsinA E171Q.

    27. The method of claim 26, wherein the modulator is virtually screened against the binding location of the three dimensional structure of the TorsinA ΔE303.

    28. The method of claim 23, wherein the modulator is co-crystallized with the TorsinA ΔE303 and at least one of the modulator and the nanobody to obtain a three dimensional structure having the TorsinA ΔE303 and the modulator, such that optimization of the modulator is conducted based on the three dimensional structure having the TorsinA ΔE303.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    [0045] The accompanying drawings illustrate one or more embodiments of the invention and together with the written description, serve to explain the principles of the invention. Wherever possible, the same reference numbers are used throughout the drawings to refer to the same or like elements of an embodiment.

    [0046] FIG. 1 shows a protein composition according to one embodiment of the present invention.

    [0047] FIG. 2A shows a schematic diagram of TorsinA according to one embodiment of the present invention. FIG. 2B shows a schematic diagram of LULL1 according to one embodiment of the present invention. The gray areas mark the crystallized segments. Large domain of TorsinA is darker than the small domains of TorsinA. SS, signal sequence; H, hydrophobic region; TM, transmembrane helix.

    [0048] FIG. 3 shows a flowchart of preparing a TorsinA protein composition according to one embodiment of the present invention.

    [0049] FIG. 4 shows a flowchart of obtaining crystals from a TorsinA protein composition according to one embodiment of the present invention.

    [0050] FIG. 5 shows a flowchart of determining three dimensional structures of TorsinA according to one embodiment of the present invention.

    [0051] FIG. 6 is a cartoon representation of the TorsinA-LULL1 complex in two orientations. Large domain of TorsinA is darker than the small domains of TorsinA. A nanobody (VHH-BS2, grey; with complementarity determining regions darker) was used as a crystallization chaperone. Numbers refer to secondary structure elements.

    [0052] FIG. 7 is a close-up of the ATP binding site. Key residues are labeled. 2Fo-Fc electron density contoured at 2τ displayed as grey mesh.

    [0053] FIG. 8 is a close-up of the proximal cysteines 280 and 319 next to the adenine base of the bound ATP. 2Fo-Fc electron density is contoured at 1α. The cysteine pair adopts three alternate conformations, but remains reduced in all of them.

    [0054] FIGS. 9, 10A and 10B are Analysis of the TorsinA-LULL1 interface. FIG. 9 is side-by-side comparison of TorsinA-ATP-LULL1 (left) and TorsinAΔE-ATP-LULL1 (right). Zoomed insets show the atomic details of the interactions between TorsinA/TorsinAΔE and LULL1, with a focus on the ΔE303 area. FIGS. 10A and 10B are mutational analysis of the TorsinA-LULL1 interface. Substitution or deletion of residues involved in TorsinA-LULL1 binding were probed using a Ni-affinity co-purification assay with recombinant, bacterial-expressed protein. Only TorsinA is His-tagged. SDS-PAGE analysis is shown. Lack of binding is observed by the absence of complex (uncomplexed His-tagged TorsinA is insoluble). t, total lysate, e, Ni eluate. Asterisk denotes an unrelated contaminant.

    [0055] FIGS. 11A and 11B shows oligomerization of TorsinA-LULL1. FIG. 11A, Left, Schematic representation of a hypothetical heterohexameric (TorsinA-LULL1).sub.3 ring model, in analogy to canonical AAA+ ATPases. White star represents ATP. Since LULL1 cannot bind a nucleotide, there would be three catalytic (nucleotide-bound) and three non-catalytic interfaces per ring. Open-book representation of the catalytic interface between TorsinA and LULL1, as seen in this study. Black line marks the outline of the interface. Color gradient marks conservation across diverse eukaryotes. FIG. 11B, the same analysis as in a, but for the hypothetical ‘non-catalytic’ interface. The interface model on the right is based on swapping the TorsinA and LULL1 positions in the TorsinA-LULL1 complex.

    [0056] FIGS. 12A and 12B show structure comparisons. FIG. 12A, human TorsinA-ATP (left) displayed as a cartoon, compared to the D2 domain of the double-ringed AAA+ ATPase ClpB-AMPPCP from Thermus thermophilus [46] (PDB code 4LJ9; right) in the same orientation. Important structure motifs are labeled. FIG. 12B, human LULL1 (orange) superposed on human LAP1 (grey, PDB code 4TVS). The one region of major structural difference is labeled.

    [0057] FIG. 13 shows phylogenetic analysis of Torsins. Maximally diverged torsins are aligned. Secondary structure elements of human TorsinA are displayed above the alignment. Important sequence motifs are boxed. LULL1 contacts, red circles, conserved cysteines, yellow circles. Proximal cysteines 280 and 319 connected with a dashed yellow line. Asterisk denotes putative torsin homologs based on sequence analysis. hs, Homo sapiens; oa, Ornithorhynchus anatinus; gg, Gallus gallus; tr, Takifugu rubripes; dr, Danio rerio; nv, Nematostella vectensis; bf, Branchiostoma floridae; stp, Strongylocentrotus purpuratus; ci, Ciona intestinalis; ce, Caenorhabditis elegans; dm, Drosophila melanogaster; ta, Trichoplax adherens.

    [0058] FIG. 14 shows phylogenetic analysis of LAP1/LULL1. Maximally diverged LAP1 and LULL1 sequences are aligned. If not experimentally confirmed, sequences were assigned as LAP1 or LULL1 based on the presence of an N-terminal, extraluminal domain with basic signature, characteristic of LAP1. Secondary structure elements of human LULL1 are displayed above the alignment. The strictly conserved Arg-finger is boxed. TorsinA contacts, red circles, conserved cysteines, yellow circles. Disulfide bridge depicted as a yellow line. hs, Homo sapiens; oa, Ornithorhynchus anatinus; gg, Gallus gallus; tr, Takifugu rubripes; dr, Danio rerio; nv, Nematostella vectensis; bf, Branchiostoma floridae; stp, Strongylocentrotus purpuratus; ci, Ciona intestinalis; ce, Caenorhabditis elegans; dm, Drosophila melanogaster; ta, Trichoplax adherens.

    [0059] FIG. 15 shows nanobody interaction. The heterotrimeric TorsinA(ATP)-LULL1-VHH-BS2 complex is shown in two orientations. Nanobody and interacting secondary structure elements of TorsinA and LULL1 are shown in full color, non-interacting elements in faded colors. Complementarity determining regions (CDRs) in red. Insets show close-ups with important interacting residues labeled.

    [0060] FIG. 16 shows comparison of sequence motifs of AAA+ ATPases. Torsins and LAP1/LULL1 sequences are compared to the HCLR clade, the most similar branch within the AAA+ ATPase family [12,18]. Sequence elements characteristic for each of the 3 groups are displayed as WebLogos [45]. Numbering refers to ClpB-D2 from Thermus thermophilus for the HCLR class, human TorsinA for Torsins, and human LULL1 for LAP1/LULL1. Grey bars indicate the characteristic motif or residue, surrounded by a few adjacent residues to emphasize the distinct conservation. All three groups have elements that can be used to distinguish them among each other.

    [0061] FIG. 17 shows dystonia mutations. All known point mutations and deletions that lead to dystonia are marked as green dots and shown in light green color, respectively, on the TorsinA-ATP-LULL1 structure. A modifier TorsinA mutation, D216H, is marked as a blue dot. The structural equivalent of the LAP1 missense mutation (E482A) would be the LULL1 E368A, marked as a green dot. See Table 2 in FIG. 19 for an explanation of the likely structural consequence.

    [0062] FIG. 18 shows data collection and refinement statistics for TorsinA-LULL1.sub.233-470 and TorsinAΔE-LULL1.sub.233-470.

    [0063] FIG. 19 shows a table of tystonia mutations.

    DETAILED DESCRIPTION

    [0064] The present invention is more particularly described in the following examples that are intended as illustrative only since numerous modifications and variations therein will be apparent to those skilled in the art. Various embodiments of the invention are now described in detail. Referring to the drawings, like numbers indicate like components throughout the views. As used in the description herein and throughout the claims that follow, the meaning of “a”, “an”, and “the” includes plural reference unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise. Moreover, titles or subtitles may be used in the specification for the convenience of a reader, which shall have no influence on the scope of the present invention. Additionally, some terms used in this specification are more specifically defined below.

    [0065] Some references, which may include patents, patent applications and various publications, are cited and discussed in the description of this invention. The citation and/or discussion of such references is provided merely to clarify the description of the present invention and is not an admission that any such reference is “prior art” to the invention described herein. All references cited and discussed in this specification are incorporated herein by reference in their entireties and to the same extent as if each reference was individually incorporated by reference.

    [0066] The terms used in this specification generally have their ordinary meanings in the art, within the context of the invention, and in the specific context where each term is used. Certain terms that are used to describe the invention are discussed below, or elsewhere in the specification, to provide additional guidance to the practitioner regarding the description of the invention. For convenience, certain terms may be highlighted, for example using italics and/or quotation marks. The use of highlighting has no influence on the scope and meaning of a term; the scope and meaning of a term is the same, in the same context, whether or not it is highlighted. It will be appreciated that same thing can be said in more than one way. Consequently, alternative language and synonyms may be used for any one or more of the terms discussed herein, nor is any special significance to be placed upon whether or not a term is elaborated or discussed herein. Synonyms for certain terms are provided. A recital of one or more synonyms does not exclude the use of other synonyms. The use of examples anywhere in this specification including examples of any terms discussed herein is illustrative only, and in no way limits the scope and meaning of the invention or of any exemplified term. Likewise, the invention is not limited to various embodiments given in this specification.

    [0067] It will be understood that when an element is referred to as being “on” another element, it can be directly on the other element or intervening elements may be present therebetween. In contrast, when an element is referred to as being “directly on” another element, there are no intervening elements present. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.

    [0068] It will be understood that, although the terms first, second, third etc. may be used herein to describe various elements, components, regions, layers and/or sections, these elements, components, regions, layers and/or sections should not be limited by these terms. These terms are only used to distinguish one element, component, region, layer or section from another element, component, region, layer or section. Thus, a first element, component, region, layer or section discussed below could be termed a second element, component, region, layer or section without departing from the teachings of the present invention.

    [0069] Furthermore, relative terms, such as “lower” or “bottom” and “upper” or “top,” may be used herein to describe one element's relationship to another element as illustrated in the Figures. It will be understood that relative terms are intended to encompass different orientations of the device in addition to the orientation depicted in the Figures. For example, if the device in one of the figures is turned over, elements described as being on the “lower” side of other elements would then be oriented on “upper” sides of the other elements. The exemplary term “lower”, can therefore, encompasses both an orientation of “lower” and “upper,” depending of the particular orientation of the figure. Similarly, if the device in one of the figures is turned over, elements described as “below” or “beneath” other elements would then be oriented “above” the other elements. The exemplary terms “below” or “beneath” can, therefore, encompass both an orientation of above and below.

    [0070] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure, and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.

    [0071] As used herein, “around”, “about” or “approximately” shall generally mean within 20 percent, preferably within 10 percent, and more preferably within 5 percent of a given value or range. Numerical quantities given herein are approximate, meaning that the term “around”, “about” or “approximately” can be inferred if not expressly stated.

    [0072] As used herein, “plurality” means two or more.

    [0073] As used herein, the terms “comprising”, “including”, “carrying”, “having”, “containing”, “involving”, and the like are to be understood to be open-ended, i.e., to mean including but not limited to.

    [0074] The terms “TorsinA” or “Torsin-1A”, as used herein, refer to a protein that in humans that is encoded by the TOR1A gene (also know as DQ2 or DYT1).

    [0075] The term “nanobody”, as used herein, refers to a single-domain antibody. The single-domain antibody is an antibody fragment consisting of a single monomeric variable antibody domain. Like a whole antibody, it is able to bind selectively to a specific antigen. With a molecular weight of only 12-15 kDa, single-domain antibodies are much smaller than common antibodies (150-160 kDa) which are composed of two heavy protein chains and two light chains, and even smaller than Fab fragments (˜50 kDa, one light chain and half a heavy chain) and single-chain variable fragments (˜25 kDa, two variable domains, one from a light and one from a heavy chain).

    [0076] The term “modulator”, as used herein, refer to a substance influencing the binding of a target protein to its ligand or agonist, or inverse agonist.

    [0077] The most common cause of early onset primary dystonia, a neuromuscular disease, is a glutamate deletion (ΔE) at position 302/303 of TorsinA, a AAA+ ATPase that resides in the endoplasmic reticulum including the perinuclear space [1, 2]. While the actual function of TorsinA remains elusive [3-6], the ΔE mutation is known to diminish binding of two TorsinA ATPase activators: lamina-associated protein 1 (LAP1) and its paralog, luminal domain like LAP1 (LULL1) [7-9]. Therefore, ΔE is likely a loss-of-function mutation [10]. A single-chain antibody fragment, a so-called nanobody, which specifically binds the TorsinA LULL1 complex, is generated. The nanobody is called VHH BS2. The resulting trimeric TorsinA(ATP)-LULL1-VHH-BS2 complex is stable in vitro and was crystallized. In addition, and most importantly, VHH-BS2 is able to stabilize the weak TorsinAΔE(ATP).LULL1 interaction, thus a TorsinAΔE(ATP)-LULL1-VHH-BS2 can also be made and was crystallized as well. The ability to stabilize a weak interaction with a reagent like VHH-BS2 is extremely rare. Using a nanobody as a crystallization chaperone, both crystal structures are solved and refined to 1.4 Å resolution. A comparison of these structures at this very high resolution shows, in atomic detail, the subtle differences in activator interactions that separate the healthy wild type from the diseased state DYT1 mutant TorsinA. This structure information may provide a structural platform for drug development, as a small molecule that rescues TorsinAΔE could serve as a cure for primary dystonia.

    [0078] In one aspect, the present invention relates to a protein composition. As shown in FIG. 1, in certain embodiments, the protein composition 100 includes a target protein 110, a modulator 130 of the target protein, a nanobody 150 specifically binds to at least one of the target protein and the modulator.

    [0079] The target protein 110 may be human wild type TorsinA (SEQ ID NO: 1), mutant TorsinA.sub.EQ with a glutamate (E) to Glutamine (Q) mutation at position 171 (SEQ ID NO: 2), mutant TorsinA.sub.EQΔE with a glutamate (E) to Glutamine (Q) mutation at position 171 and a glutamate deletion at position 303 (or 302) (SEQ ID NO: 3), as well as TorsinA mutants having ΔF323-Y328, R288Q, F2051, D194V, ΔA14-P15, E121K, V1291, or D216H mutations, and portions of the above proteins. In certain embodiments, the target protein 110 includes the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 (SEQ ID NO: 1 is human TorsinA 51-332, SEQ ID NO: 2 is TorsinA 51-332 with E171Q; SEQ ID NO: 3 is human TorsinA 51-332 with E171Q and ΔE303) or portions thereof. FIG. 2A is schematically diagram of TorsinA, where important residues and sequence motis are indicated. For example, SS is signal sequence, and H is hydrophobic region.

    [0080] The modulator 130 may be an activator, an agonist, an antagonist, or an inverse agonist, of the target protein 110. When the target protein 110 is TorsinA or a mutant of TorsinA, the modulator 130 may be LAP1, LULL1, a domain or a portion of LAP1 or LULL1. In certain embodiments, the modulator may also be a drug lead that is able to bind to TorsinA or its mutant, and the drug lead may be improved based on the three dimensional complex structure of the TorsinA or its mutant and the drug lead. In certain embodiments, the modulator 130 is LULL1 or portions thereof. In one embodiment, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 (SEQ ID NO: 4 is LULL1 233-470) or portions thereof. FIG. 2A is schematically diagram of LULL1, where important residues and sequence motis are indicated. For example, TM is transmembrane helix.

    [0081] The nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130. In certain embodiment, the nanobody 150 may be obtained by immunizing a model animal using both the target protein 110 and the modulator 130. In certain embodiments, the nanobody is obtained by immunization using the target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 and the modulator 130 having the amino acid sequence set forth in the SEQ ID NO: 4, or portions thereof. In certain embodiments, the obtained nanobody 150 has the amino acid sequence set forth in the SEQ ID NO:5, or portions thereof.

    [0082] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2 or SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, the nanobody 150 includes the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof, and the target protein 110, themodulator 130 and the nanobody 150 in the protein composition are co-expressed and purified together.

    [0083] Referring to FIG. 3, in certain embodiments, the protein composition 100 is obtained through steps 301 to 309. In step 301, a first nucleotide sequence encoding the target protein 110 and a second nucleotide sequence encoding the modulator 130 are cloned into a first vector, and in step 303 a third nucleotide sequence encoding the nanobody 150 is cloned into a second vector. At step 305, the first vector and the second vector are used to transform bacteria. In certain embodiments, the first nucleotide sequence comprises the nucleic acid sequence set forth in at least one of SEQ ID NO: 6-8 (encoding the proteins SEQ ID NO: 1-3, respectively), the second nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 9 (encoding the protein SEQ ID NO: 4), and the third nucleotide sequence comprises the nucleic acid sequence set forth in the SEQ ID NO: 10 (encoding the protein SEQ ID NO: 5). In certain embodiments, the first vector is a modified ampicillin resistant pETDuet-1 vector, the second vector is a pET-30b(+) vector, and the bacteria is E. coli strain LOBSTR(DE3) RIL. At step 307, the bacteria is cultured and the expression of the target protein 110, the modulator 130 and the nanobody 150 is induced. At step 309, the bacteria culture is harvested and the target protein 110, the modulator 130 and the nanobody 150 are purified together.

    [0084] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition 110 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ51-332 structure).

    [0085] In certain embodiments, as shown in FIG. 4, the protein composition 100 is crystallized to obtain crystals by the following steps. At step 401, the protein composition 100 is prepared, for example, by concentrating, to a concentration of about 4-4.5 mg/ml. At step 403, about 2 mM ATP is added to the concentrated protein composition 110 to form a protein stock solution. At step 405, a mother liquor is prepared. The mother liquor used in this embodiment includes 13% (w/v) polyethylene glycol (PEG) 6000, 5% (v/v) 2-methyl-2,4-pentanediol, and 0.1M MES pH6.5. At step 407, crystals are grow using hanging drop/vapor diffusion method. In this embodiment, approximately equal amount of protein stock solution and the mother liquor, for example 1 μl of the protein stock solution and 1 μl of the mother liquor, are mixed and disposed on a cover slip, and the cover slip is inverted, sealed, and covered on a reservoir having certain amount of the mother liquor, such as 0.5 ml or 1 ml. The vapor diffusion process between the hanging drop and the mother lique is conducted under a temperature, such as 18° C. or room temperature, and crystals are obtained in about a few days, such as 3-5 days.

    [0086] After the crystals are observed and grow to a sufficient size, the crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol. Single crystal is preferably used in the flash-frozen. X-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.

    [0087] In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ51-332 ΔE303 mutant structure).

    [0088] In certain embodiments, the protein composition 100 is crystallized to obtain crystals by the following steps: preparing the protein composition 100 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock; preparing a mother liquor comprising about 19% (w/v) polyethylene glycol (PEG) 3350, about 0.2 M AMSO.sub.4, and about 0.1 M Bis-Tris-HCl pH6.5; and mixing 1 μl of the protein stock with 1 μl of the mother liquor to form a second mixture, and inducing crystallization of the protein composition in the mixture by hanging drop/vapor diffusion under about 18° C., such that the crystals are obtained in about 3-5 days.

    [0089] In certain embodiments, the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.

    [0090] In another aspect, the present invention related to a method of determining the three dimensional structure of a crystallized protein composition 100 to a resolution of about 1.4 Å or better. In certain embodiments, the protein composition 100 includes a target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, an modulator 130 of the target protein 110 having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130 and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.

    [0091] As shown in FIG. 5, the method includes the following steps. At steps 501, preparing a first nucleotide sequence having the nucleic acid sequence set forth in at least one of SEQ ID NO: 6-8 (encoding the proteins of SEQ ID NO: 1-3, respectively), a second nucleic acid sequence set forth in the SEQ ID NO: 9 (encoding the protein SEQ ID NO: 4) is prepared, and a third nucleotide sequence having the nucleic acid sequence set forth in the SEQ ID NO: 10 (encoding the protein SEQ ID NO: 5) is prepared. The preparation may be performed by direct synthesis or from PCR. Then the first nucleotide sequence and the second nucleotide sequence is cloent to a first vector, and the third nucleotide sequence is cloned to a second vector. At step 503, bacteria are transformed using the first vector and the second vector. At step 505, the bacteria are grown to express the target protein, the modulator and the nanobody. At step 507, the target protein, the modulator and the nanobody are purified together to obtain a protein composition. At step 509, the protein composition is crystallized to obtain crystals. At step 511, x-ray data is collected using one of the obtained crystals. At step 513, the three dimensional structure from the collected x-ray data is determined.

    [0092] In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition 100 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 structure).

    [0093] In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).

    [0094] In a further aspect, the present invention relates to a method for screening compounds that bind to TorsinA. In certain embodiments, the method includes providing a protein composition as described above comprising TorsinA, and a library of test compounds, treating the protein composition with a test compound, determining whether the compound binds to TorsinA, where a compound that binds TorsinA is indicative of a compound that is a candidate TorsinA agonist or antagonist, and optionally determining a three dimensional crystal structure of TorsinA with and/or without the bound compound to a resolution of about 1.4 Å or better.

    [0095] The TorsinA structure may include TorsinA.sub.EQ structure, TorsinA.sub.EQΔE303 structure, as well as their complex structures with modulators such as LULL1 or LAP1, and/or ATP. After analyzing one or more of the three dimensional structures of TorsinA, a targeting binding area of TorsinA or a targeting binding interface between TorsinA and its modulator, is chosen for designing a lead as drug candidate. The lead may be rationally designed, virtually screened, or directly screened by activity. The lead is then crystallized, for example using the method as shown in FIG. 5, with TorsinA. Then the TorsinA/lead complex structure is determined, and the structure information can be used for further optimization of the lead. A drug may be obtained for iterary optimization of the lead.

    [0096] In certain embodiments, the crystals of TorsinA are grown using a protein composition 100 including: TorsinA having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, a modulator of TorsinA having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody specifically binds to at least one of TorsinA and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.

    [0097] In certain embodiments, TorsinA includes TorsinA.sub.EQ ΔE303 having the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 Å, b=88.1 Å, and c=105.4 Å such that the three dimensional structure of the crystallized protein composition having the TorsinA.sub.EQΔE303, the crystallized protein composition having TorsinA ΔE303 can be determined to a resolution of about 1.4 Å or better (TorsinA.sub.EQ 51-332 ΔE303 mutant structure).

    [0098] In certain embodiments, the TorsinA comprises TorsinA E171Q having the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 Å, b=90.7 Å, and c=105.1 Å such that the three dimensional structure of the crystallized protein composition having TorsinA E171Q can be determined to a resolution of about 1.4 Å or better (TorsinA 51-332/E171Q).

    [0099] In certain embodiments, a binding location of the modulator is determined by comparing the three dimensional structure of the crystallized protein composition having TorsinA ΔE303 and the three dimensional structure of the crystallized protein composition having TorsinA E171Q.

    [0100] In certain embodiments, the modulator is virtually screened against the binding location of the three dimensional structure of the TorsinA.sub.EQ ΔE303.

    [0101] In certain embodiments, the modulator is co-crystallized with the TorsinA.sub.EQ ΔE303 and at least one of the modulator and the nanobody to obtain a three dimensional structure having the TorsinA.sub.EQ ΔE303 and the modulator, such that modification of the modulator is conducted based on the three dimensional structure having the TorsinA ΔE303.

    [0102] Certain embodiments of the present application, among other things, crystallized TorsinA which is a difficult to crystallize. Using this method, variety of TorsinA mutants and their complex structures can be determined. This is not achieved by any others before this invention.

    [0103] Further, by comparing the TorsinA.sub.EQ structure and TorsinA.sub.EQ ΔE303 structure, a novel functional mechanism and novel binding site is determined, which can be used as the basis for structural based rational drug design. This information provides a structural platform to develop drug that can rescue TorsinA ΔE303 or other type of mutants so that the TorsinA ΔE303 become functional. The drug is then useful for cure primary dystonia.

    [0104] These and other aspects of the present invention are more specifically described below. Without intent to limit the scope of the invention, exemplary methods and their related results according to the embodiments of the present invention are given below. Note that titles or subtitles may be used in the examples for convenience of a reader, which in no way should limit the scope of the invention. Moreover, certain theories are proposed and disclosed herein; however, in no way they, whether they are right or wrong, should limit the scope of the invention so long as the invention is practiced according to the invention without regard for any particular theory or scheme of action.

    EXAMPLES

    Example 1: Generation and Selection of Nanobodies

    [0105] To investigate the molecular basis for primary dystonia as a result of the glutamate 302/303 deletion in TorsinA, a structural approach is taken. TorsinA is a catalytically inactive AAA+ ATPase [11-13], notoriously ill-behaved in vitro, primarily due to its limited solubility and stability. These problems were partially overcome by stabilizing an ATP-trapped E171Q mutant of human TorsinA (residues 51-332; SEQ ID NO: 2) by co-expressing it with the luminal activation domain of human LULL1 (residues 233-470; SEQ ID NO: 4). This resulted in a better behaved heterodimeric complex (FIGS. 2A and 2B), but was still recalcitrant to initial crystallization efforts. To facilitate crystallization, isolated a nanobody (VHH-BS2; SEQ ID NO: 5) was isolated from an alpaca immunized with the TorsinA.sub.EQ-LULL1 complex. A stable, heterotrimeric complex of TorsinA.sub.EQ(ATP)-LULL1-VHH-BS2 was readily crystallized in the presence of ATP.

    [0106] Specifically, for obtaining the VHH-BS2 nanobody, purified human TorsinA.sub.EQ-LULL1 complex was injected into a male alpaca (Lama pacos) for immunization. Generation and screening of nanobodies was carried out as previously described [14]. Each of the selected nanobodies was subcloned into a pET-30b(+) vector with a C-terminal His.sub.6-tag. Each nanobody was bacterially expressed and Ni.sup.2+-affinity purified essentially as described (see below). Different from the TorsinA-containing preparations, MgCl.sub.2 and ATP were eliminated from all buffer solutions. The Ni.sup.2+-eluate was purified via size exclusion chromatography on a Superdex S75 column (GE Healthcare) in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl). Nanobody binding was validated by size exclusion chromatography on a 10/300 Superdex S200 column in 10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2 and 0.5 mM ATP. Equimolar amounts of TorsinA.sub.EQ-LULL1 and TorsinA.sub.EQ-LULL1-VHH were loaded and nanobody binding was monitored by a shift in the elution profile and via SDS-PAGE analysis. After validating VHH-BS2 interaction with TorsinA.sub.EQ-LULL1, the C-terminal His.sub.6-tag of VHH-BS2 was removed from the pET-30b(+) vector for co-purification experiments.

    Example 2: Constructs, Protein Expression and Purification

    [0107] DNA sequences encoding human TorsinA (residues 51-332) and the luminal domain of human LULL1 (residues 233-470) were cloned into a modified ampicillin resistant pETDuet-1 vector (EMD Millipore). TorsinA, N-terminally fused with a human rhinovirus 3C protease cleavable 10xHis-7xArg tag, was inserted into the first multiple cloning site (MCS), whereas the untagged LULL1 was inserted into the second MCS. Mutations on TorsinA and LULL1 were introduced by site-directed mutagenesis. The untagged VHH-BS2 nanobody was cloned into a separate, modified kanamycin resistant pET-30b(+) vector (EMD Biosciences).

    [0108] To co-express TorsinA (EQ or EQ/AE), LULL1 and VHH-BS2 for crystallization, the E. coli strain LOBSTR(DE3) RIL (Kerafast) [32] was co-transformed with the two constructs described above. Cells were grown at 37° C. in lysogeny broth (LB) medium supplemented with 100 μg ml.sup.−1 ampicillin, 25 μg ml.sup.−1 kanamaycin and 34 μg ml.sup.−1 hloramphenicol until an optical density (OD.sub.600) of 0.6-0.8 was reached, shifted to 18° C. for 20 min, and induced overnight at 18° C. with 0.2 mM isopropyl β-D-1-thiogalactopyranoside (IPTG). The bacterial cultures were harvested by centrifugation, suspended in lysis buffer (50 mM HEPES/NaOH pH 8.0, 400 mM NaCl, 40 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP) and lysed with a cell disruptor (Constant Systems). The lysate was immediately mixed with 0.1 M phenylmethanesulfonyl fluoride (PMSF) (50 μl per 10 ml lysate) and 250 units of TurboNuclease (Eton Bioscience), and cleared by centrifugation. The soluble fraction was gently mixed with Ni-Sepharose 6 Fast Flow (GE Healthcare) resin for 30 min at 4° C. After washing with the lysis buffer, bound protein was eluted in elution buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 300 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP). The eluted protein complex was immediately purified by size exclusion chromatography on a Superdex S200 column (GE Healthcare) equilibrated in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2, and 0.5 mM ATP). Following the tag removal by 10xHis-7xArg-3C protease, the fusion tags and the protease were separated from the complex by cation-exchange chromatography on a HiTrapS column (GE Healthcare) using a linear NaCl gradient. The flow-through from the cation-exchange chromatography, containing the protein complex, was purified again by size exclusion chromatography on a Superdex S200 column as at the previous step.

    [0109] For the non-structural analysis of TorsinA and LULL1 variants, the pETDuet-1-based expression plasmid was transformed into LOBSTR(DE3) RIL cells without co-expressing nanobody VHH-BS2. Ni.sup.2+-affinity purification was performed as described above and bound protein was eluted. Aliquots from the Ni.sup.2+-eluate and the total lysate were collected and analyzed by SDS-PAGE gel electrophoresis.

    Example 3: Crystallization

    [0110] Purified TorsinA.sub.EQ-LULL1-VHH-BS2 and TorsinA.sub.EQΔE-LULL1-VHH-BS2 complexes were concentrated up to 4-4.5 mg/ml and supplemented with 2 mM ATP prior to crystallization. The TorsinA.sub.EQ containing complex crystallized in 13% (w/v) polyethylene glycol (PEG) 6000, 5% (v/v) 2-Methyl-2,4-pentanediol, and 0.1 M MES pH 6.5. The TorsinA.sub.EQAE containing complex crystallized in 19% (w/v) PEG 3350, 0.2 M AmSO4, and 0.1 M Bis-Tris-HCl pH 6.5. Crystals of both complexes grew at 18° C. in hanging drops containing 1 μl of protein and 1 μl of mother liquor. Clusters of diffraction quality, rod-shaped crystals formed within 3-5 days. Single crystals were briefly soaked in mother liquor supplemented with 20% (v/v) glycerol for cryoprotection and flash-frozen in liquid nitrogen.

    Example 4: Data Collection and Structure Determination

    [0111] X-ray data were collected at NE-CAT beamline 24-ID-C at Argonne National Laboratory. Data reduction was performed with the HKL2000 package [33], and all subsequent data-processing steps were carried out using programs provided through SBGrid [34]. The structure of the TorsinA.sub.EQ-LULL1-VHH-BS2 complex was solved by molecular replacement (MR) using the Phaser-MR tool from the PHENIX suite [35]. A three-part MR solution was easily obtained using a sequential search for models of LULL1, VHH-BS2, and TorsinA. The LULL1 model was generated based on the published human LAP1 structure (PDB 4TVS, chain A), using the Sculptor utility of the PHENIX suite (LULL.sub.1241-470 and LAP.sub.1356-583 share 64% sequence identity). The VHH-BS2 model was based on VHH-BS1 (PDB 4TVS, chain a) after removing the complementarity determining regions (CDRs). The poly-Ala model of TorsinA was generated based on E. coli ClpA (PDB 1R6B) using the MODELLER tool of the HHpred server [36]. The asymmetric unit contains one TorsinA.sub.EQ-LULL1-VHH-BS2 complex. Iterative model building and refinement steps gradually improved the electron density maps and the model statistics. The stereochemical quality of the final model was validated by Molprobity [37]. TorsinA.sub.EQΔE-LULL1-VHH-BS2 crystallized in the same unit cell. Model building was carried starting from a truncated TorsinA.sub.EQ-LULL1-VHH-BS2 structure. All manual model building steps were carried out with Coot [38], and phenix.refine was used for iterative refinement. Two alternate conformations of a loop in LULL1 (residues 428-438) were detected in the Fo-Fc difference electron density maps of both structures, and they were partially built. For comparison, the cysteine residues of TorsinA at the catalytic site (residues 280 and 319 in the TorsinA.sub.EQ structure) were built in the reduced and the oxidized states, respectively. Building them as oxidized, disulfide-bridged residues consistently produced substantial residual Fo-Fc difference density, which disappeared assuming a reduced state. Statistical parameters of data collection and refinement are all given in Table 1 in FIG. 18. Structure figures were created in PyMOL (Schrödinger LLC).

    Example 5: Bioinformatic Analysis

    [0112] Torsin and LAP1/LULL1 sequences were obtained via PSI-BLAST [39] and Backphyre searches [40]. Transmembrane domains were predicted using the HMMTOP tool [41]. LAP1/LULL1 proteins were distinguished based on the calculated isoelectric point (pI) of their extra-luminal portions. The intranuclear domain of LAP1 has a characteristically high pI of ˜8.5-10 due to a clustering of basic residues, while the cytoplasmic domain of LULL1 is distinctively more acidic. Multiple sequence alignments were performed using MUSCLE [42], and visualized by Jalview [43]. To illustrate evolutionary conservation on TorsinA and LULL1 surfaces, conservation scores for each residue were calculated using the ConSurf server with default parameters [44].

    [0113] The sequences, which were used to generate the multiple sequence alignments, were also used for preparing the sequence logos of Torsins and LAP1/LULL1 in FIG. 16. To obtain the sequence logo of the HCLR clade AAA+ ATPases, Escherichia coli ClpA-D2 (residues 458-758), Escherichia coli ClpB-D2 (residues 568-857), Bacillus subtilis ClpE-D2 (residues 409-699), Saccharomyces cerevisiae Hsp104-D2 (residues 578-868), Escherichia coli HslU (residues 13-443), Bacillus subtilis HslU (residues 15-455), Streptomyces coelicolor ClpX (residues 71-409), Drosophila melanogaster ClpX (residues 199-634), Escherichia coli Lon (residues 320-580), Caenorhabditis elegans Lon (residues 476-771), Thermus thermophilus ClpB-D2 (residues 536-845), Escherichia coli ClpX (residues 64-403), Helicobacter pylori ClpX (residues 77-430), Haemophilus influenza HslU (1-444), Bacillus subtilis Lon (residues 300-590), Bacillus subtilis ClpC-D2 (residues 486-802), Saccharomyces cerevisiae Hsp78-D2 (residues 482-794) and Arabidopsis thaliana Hsp101-D2 (residues 547-849) sequences were used. All sequence logos were generated using WebLogo [45].

    Example 6: Structure Analysis

    [0114] A stable, heterotrimeric complex of TorsinA.sub.EQ(ATP)-LULL1-VHH-BS2 was readily crystallized in the presence of ATP. A 1.4 Å dataset was collected and the structure was solved by molecular replacement, using the LULL1-homolog LAP1 and a VHH template as search models [14] (Example 4, and Table 1 in FIG. 18). TorsinA.sub.EQ adopts a typical AAA+ ATPase fold (FIG. 6 and FIG. 12). The N-terminal nucleotide-binding or large domain (residues 55-271) is composed of a central five-stranded, parallel β-sheet surrounded by 8 α-helices. A small three-helix bundle at its C-terminus (residues 272-332), forms critical contacts with LULL1. The ATP molecule is bound in the manner characteristic of P-loop NTPases [15]. The Walker A and B motifs are positioned to mediate the requisite nucleotide interactions, with sensor 1 and sensor 2 regions sensing the γ-phosphate and thus the nucleotide-binding state (FIG. 7). The luminal LULL1 activation domain (residues 236-470) adopts an AAA+-like conformation, very similar to its paralog LAP1 (rmsd 1.05 Å over 213 Ca positions, FIG. 12). The AAA+-like domain comprises a central β-sheet embedded within six α-helices (FIG. 6). A C-terminal small domain is not found. Characteristically, LULL1 lacks nucleotide binding due to the absence of Walker A and B motifs [14]. LULL1 forms a composite nucleotide-binding site with TorsinA by providing arginine residue 449 (‘arginine finger’) at the base of helix α5 (FIG. 7). The arginine finger activates ATP hydrolysis by TorsinA [14,16]. The small domain of TorsinA, including helix α7 featuring glutamates 302 and 303, is intimately involved in LULL1 binding. Nanobody VHH-BS2 binds both TorsinA and LULL1 at a shallow groove (FIG. 6 and FIG. 15). Nanobodies contain three complementarity determining regions (CDRs), with CDR3 most often making critical contacts with the antigen [17]. Indeed, the long CDR3 of VHH-BS2 (residues 101-109) is the main binding element in the complex.

    [0115] AAA+ ATPases are organized into a number of structurally defined clades [12, 18], distinguished by shared structural elements. Comparison with other AAA+ ATPase structures shows that TorsinA fits best into a clade that also contains the bacterial proteins HslU, ClpA/B, ClpX, and Lon, all of which are involved in protein degradation or remodeling [13]. These AAA+ family members share a β-hairpin insertion that precedes the sensor-I region (FIG. 12). TorsinA also contains this structural element, but it adopts a distinctly different orientation compared to other members of the clade. However, the pre-sensor I region may be affected by crystal packing in our structure. Two other distinct regions are present. The protein degrading or remodeling AAA+ ATPases all form hexameric rings with a central pore [11,13], and ‘pore loops’ in each subunit, conserved elements positioned between strand β2 and helix α2, are critical for threading the protein substrates through the ring [19]. In Torsins, this pore loop is not conserved (FIG. 13 and FIG. 16). TorsinA has two cysteines (Cys280, and Cys 319 in close proximity in the sensor-II motif), positioned near the adenine base of the ATP molecule (FIG. 8). These cysteines do not form a disulfide bridge in our structure. However, the conservation of Cys280 and the Gly-Cys-Lys sensor-II motif at position 318-320 (FIG. 13 and FIG. 16) indicates an important functional role. A redox activity as part of the ATPase cycle therefore seems highly likely, as has been previously speculated [8, 20].

    [0116] The interaction of TorsinA with its ATPase activators LULL1 and LAP1 is of particularly importance, as a prominent mutation causing primary dystonia—the deletion of glutamate 302 or 303—weakens these interaction [7-9]. But why and how? The TorsinA-LULL1 interface extends over an area of 1527 Å.sup.2. The main structural elements involved in this interaction are the nucleotide-binding region as well as the small domain of TorsinA, and helices α0, α2, α4 and α5 of LULL1 (FIGS. 6-9, 13 and 14). The exact position of the small domain of TorsinA relative to the large domain is likely dictated by the sensor II motif, preceding α8, which directly contacts the γ-phosphate of ATP through Lys 320, thus serving as an anchor point. A switch to ADP presumably weakens this connection, such that the small domain would become more loosely attached to the large domain. This could explain the observed ATP-dependency of LAP1/LULL1 binding [7-9, 21]. Within the small domain, helix α7, the following loop, and the terminal helix α8 contain all the critical residues. Glutamate 302 and 303 are positioned at the very end of helix α7, and both are involved in TorsinA contacts. Specifically, Glu 303 forms a prominent charge interaction with Arg 276 of LULL1. TorsinA Lys113-LULL1 Glu385, TorsinA Asp316-LULL1 Arg419, TorsinA Lys317-LULL1 Glu415 are additional charge interactions.

    [0117] To investigate the atomic details of the weakened binding of TorsinAΔE to LAP1/LULL1, and thus the molecular basis of primary dystonia, we made use of the observation that VHH-BS2 also stabilizes the TorsinA.sub.EQAE(ATP)-LULL1 interaction. We were able to crystallize TorsinA.sub.EQAE(ATP)-LULL1-VHH-BS2 and determine its structure at a resolution of 1.4 Å. Not surprisingly, the overall structure is almost identical to the wild type protein (0.34 Å rmsd over 274 Ca atoms for TorsinA, 0.26 Å rmsd over 229 Ca atoms for LULL1), except for critical differences in the TorsinA-LULL1 interface (FIG. 9). The principal difference is that helix α7 is shortened due to the missing Glu 303, with a slight—but significant—restructuring of the loop that follows to establish the connection with helix α8. For future reference, we suggest renaming the ΔE mutation ΔE303, rather than ΔE302/303, since the position of Glu 302 is effectively unchanged. In the dystonia mutant, the TorsinA Glu 303-LULL1 Arg 276 charge interaction is lost, and the hydrogen-bonding network involving TorsinA Glu 302, Phe 306 and Arg312, as well as LULL1 Arg412 and Glu416 is disrupted (FIG. 9). To determine the importance of different TorsinA residues for LULL1 binding, we performed a co-purification assay (FIGS. 10A and 10B). His-tagged, ATP-trapped TorsinAEQ (residues 51-332) and mutants thereof were recombinantly co-expressed with LULL1 (residues 233-470), but without VHH-BS2, in bacteria. Binding was tested in a co-purification assay using Ni-affinity. The Torsin.sub.EQΔE303 mutation abolishes binding in this assay, as expected (FIG. 10A). Since unbound TorsinA.sub.EQ is largely insoluble, absence of binding is not registered as an appearance of TorsinA.sub.EQ alone, but rather as a lack of eluted protein complex altogether Eliminating the salt bridge between TorsinA Glu303 and LULL1 Arg276 does not disrupt the TorsinA-LULL1 interaction (FIG. 10A). However, ΔMet304 and ΔThr305 both phenocopy ΔE303 in abolishing LULL1 binding (FIG. 10B). This is in full agreement with published in vivo data using similar mutants [22]. The intricate network of interactions of the α7-α8 loop of TorsinA is crucial for LULL1 binding. Since the ΔE mutation results in a local change only of the surface of Torsin's small domain rather than protein misfolding, it may be possible to rescue binding by developing a small molecule that resurrect the weakened TorsinAΔE-LAP1/LULL1 interaction.

    [0118] Although TorsinAΔE303 is the most prevalent mutation that causes primary dystonia, it is not the only one [5, 6]. We examined the structural consequence of all known mutations (FIG. 19 Table 2, and FIG. 17). Most mutations appear to cause protein misfolding or weaken or abolish LAP1/LULL1 binding. Conversely, the two dystonia-mutations found in LAP1 likely affect Torsin interaction. Improper Torsin activation is therefore the likely cause of the disease [23].

    [0119] The biological function of TorsinA remains enigmatic [24-28]. Because TorsinA belongs to the AAA+ ATPase superfamily, with specific homology to the bacterial proteins HslU, ClpX, ClpA/B and Lon, it is generally assumed that TorsinA is involved in protein remodeling or protein degradation [5, 6]. However, a substrate of TorsinA has yet to be identified.

    [0120] The TorsinA structure enables a more thorough comparison to other AAA+ ATPases. After the discovery that LAP1/LULL1 are Arg-finger containing TorsinA activators, it seemed reasonable to suggest that TorsinA and LAP1/LULL1 likely form heterohexameric rings ((TorsinA-ATP-LAP1/LULL1).sub.3) in order to function [14, 16]. However, the predominant oligomeric form of the TorsinA-ATP-LAP1/LULL1 complex in solution is largely heterodimeric, with the heterohexameric form present as only a small fraction [14, 16, 29-31]. Our structure now raises doubts about the physiological relevance of a heterohexameric ring (FIGS. 11A and 11B). First, we note that the small domain of TorsinA is essential for LAP1/LULL1 binding. Neither LAP1 nor LULL1 harbor a small domain, arguing against formation of a stable heteromeric ring, or, alternatively, suggesting a ring of substantially different architecture. Second, ring formation is important for AAA+ ATPases that thread their protein substrate through a central pore for refolding or for degradation. This central pore is lined with conserved ‘pore loops’ that are essential for function. Neither TorsinA and its homologs, nor LAP1/LULL1 have ‘pore loop’ equivalents (FIG. 16). TorsinA is therefore unlikely to actually employ a peptide threading mechanism that involves a central pore. Third, the surface conservation of LAP1/LULL1 also argues against a heteromeric ring assembly. Although the catalytic, ATP-containing interface with TorsinA is well-conserved, the presumptive non-catalytic, nucleotide-free interface is not (FIG. 3B). The same analysis for TorsinA shows that its ‘backside’ is conserved. TorsinA may therefore interact in homotypic fashion with TorsinA, with other Torsin homologs, or even with a third player. The physiologically relevant oligomeric state of TorsinA thus remains a matter of speculation. Given the unique properties of TorsinA, keeping an open-mind about TorsinA assembly into its functional state is called for, as it may well differ more than anticipated from well-studied AAA+ ATPase systems.

    [0121] The foregoing description of the exemplary embodiments of the invention has been presented only for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in light of the above teaching.

    [0122] The embodiments were chosen and described in order to explain the principles of the invention and their practical application so as to enable others skilled in the art to utilize the invention and various embodiments and with various modifications as are suited to the particular use contemplated. Alternative embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its spirit and scope. Accordingly, the scope of the present invention is defined by the appended claims rather than the foregoing description and the exemplary embodiments described therein.

    REFERENCE LIST

    [0123] The following references are incorporated herein by reference in their entirety for all purposes. [0124] [1] Ozelius, L. J. et al. The early-onset torsion dystonia gene (DYT1) encodes an ATP-binding protein. Nat. Genet. 17, 40-48 (1997). [0125] [2] Breakefield, X. O. et al. The pathophysiological basis of dystonias. Nat. Rev. Neurosci. 9, 222-234 (2008). [0126] [3] Granata, A. & Warner, T. T. The role of torsinA in dystonia. Eur. J. Neurol. 17 Suppl 1, 81-87 (2010). [0127] [4] McCullough, J. & Sundquist, W. I. Putting a finger in the ring. Nat. Struct. Mol. Biol. 21, 1025-1027 (2014). [0128] [5] Rose, A. E., Brown, R. S. H. & Schlieker, C. Torsins: not your typical AAA+ ATPases. Crit. Rev. Biochem. Mol. Biol. 50, 532-549 (2015). [0129] [6] Laudermilch, E. & Schlieker, C. TorsinATPases: structural insights and functional perspectives. Curr. Opin. Cell Biol. 40, 1-7 (2016). [0130] [7] Naismith, T. V., Dalal, S. & Hanson, P. I. Interaction of torsinA with its major binding partners is impaired by the dystonia-associated DeltaGAG deletion. J. Biol. Chem. 284, 27866-27874 (2009). [0131] [8] Zhu, L., Millen, L., Mendoza, J. L. & Thomas, P. J. A unique redox-sensing sensor II motif in TorsinA plays a critical role in nucleotide and partner binding. J. Biol. Chem. 285, 37271-37280 (2010). [0132] [9] Zhao, C., Brown, R. S. H., Chase, A. R., Eisele, M. R. & Schlieker, C. Regulation of TorsinATPases by LAP1 and LULL1. Proc. Natl. Acad. Sci. U.S.A. 110, E1545-54 (2013). [0133] [10] Goodchild, R. E., Kim, C. E. & Dauer, W. T. Loss of the dystonia-associated protein torsinA selectively disrupts the neuronal nuclear envelope. Neuron 48, 923-932 (2005). [0134] [11] Hanson, P. I. & Whiteheart, S. W. AAA+ proteins: have engine, will work. Nat. Rev. Mol. Cell Biol. 6, 519-529 (2005). [0135] [12] Erzberger, J. P. & Berger, J. M. Evolutionary relationships and structural mechanisms of AAA+ proteins. Annu Rev Biophys Biomol Struct 35, 93-114 (2006). [0136] [13] Olivares, A. O., Baker, T. A. & Sauer, R. T. Mechanistic insights into bacterial AAA+ proteases and protein-remodelling machines. Nat. Rev. Microbiol. 14, 33-44 (2016). [0137] [14] Sosa, B. A. et al. How lamina-associated polypeptide 1 (LAP1) activates Torsin. Elife 3, e03239 (2014). [0138] [15] Wendler, P., Ciniawsky, S., Kock, M. & Kube, S. Structure and function of the AAA+ nucleotide binding pocket. Biochim. Biophys. Acta 1823, 2-14 (2012). [0139] [16] Brown, R. S. H., Zhao, C., Chase, A. R., Wang, J. & Schlieker, C. The mechanism of TorsinATPase activation. Proc. Natl. Acad. Sci. U.S.A. 111, E4822-31 (2014). [0140] [17] Muyldermans, S. Nanobodies: natural single-domain antibodies. Annu. Rev. Biochem. 82, 775-797 (2013). [0141] [18] Iyer, L. M., Leipe, D. D., Koonin, E. V. & Aravind, L. Evolutionary history and higher order classification of AAA+ ATPases. J. Struct. Biol. 146, 11-31 (2004). [0142] [19] Sauer, R. T. & Baker, T. A. AAA+ proteases: ATP-fueled machines of protein destruction. Annu. Rev. Biochem. 80, 587-612 (2011). [0143] [20] Zhu, L., Wrabl, J. O., Hayashi, A. P., Rose, L. S. & Thomas, P. J. The torsin-family AAA+protein OOC-5 contains a critical disulfide adjacent to Sensor-II that couples redox state to nucleotide binding. Mol. Biol. Cell 19, 3599-3612 (2008). [0144] [21] Goodchild, R. E. & Dauer, W. T. The AAA+ protein torsinA interacts with a conserved domain present in LAP1 and a novel ER protein. J. Cell Biol. 168, 855-862 (2005). [0145] [22] Goodchild, R. E. & Dauer, W. T. Mislocalization to the nuclear envelope: an effect of the dystonia-causing torsinA mutation. Proc. Natl. Acad. Sci. U.S.A. 101, 847-852 (2004). [0146] [23] Kim, C. E., Perez, A., Perkins, G., Ellisman, M. H. & Dauer, W. T. A molecular mechanism underlying the neural-specific defect in torsinA mutant mice. Proc. Natl. Acad. Sci. U.S.A. 107, 9861-9866 (2010). [0147] [24] Nery, F. C. et al. TorsinA binds the KASH domain of nesprins and participates in linkage between nuclear envelope and cytoskeleton. J. Cell. Sci. 121, 3476-3486 (2008). [0148] [25] Granata, A., Koo, S. J., Haucke, V., Schiavo, G. & Warner, T. T. CSN complex controls the stability of selected synaptic proteins via a torsinA-dependent process. EMBO J. 30, 181-193 (2011). [0149] [26] Nery, F. C. et al. TorsinA participates in endoplasmic reticulum-associated degradation. Nat Commun 2, 393 (2011). [0150] [27] Jokhi, V. et al. Torsin mediates primary envelopment of large ribonucleoprotein granules at the nuclear envelope. Cell Rep 3, 988-995 (2013). [0151] [28] Liang, C.-C., Tanabe, L. M., Jou, S., Chi, F. & Dauer, W. T. TorsinA hypofunction causes abnormal twisting movements and sensorimotor circuit neurodegeneration. J. Clin. Invest. 124, 3080-3092 (2014). [0152] [29] Jungwirth, M., Dear, M. L., Brown, P., Holbrook, K. & Goodchild, R. Relative tissue expression of homologous torsinB correlates with the neuronal specific importance of DYT1 dystonia-associated torsinA. Hum. Mol. Genet. 19, 888-900 (2010). [0153] [30] Vander Heyden, A. B., Naismith, T. V., Snapp, E. L., Hodzic, D. & Hanson, P. I. LULL1 retargets TorsinA to the nuclear envelope revealing an activity that is impaired by the DYT1 dystonia mutation. Mol. Biol. Cell 20, 2661-2672 (2009). [0154] [31] Goodchild, R. E. et al. Access of torsinA to the inner nuclear membrane is activity dependent and regulated in the endoplasmic reticulum. J. Cell. Sci. 128, 2854-2865 (2015). [0155] [32] Andersen, K. R., Leksa, N. C. & Schwartz, T. U. Optimized E. coli expression strain LOBSTR eliminates common contaminants from His-tag purification. Proteins 81, 1857-1861 (2013). [0156] [33] Otwinowski, Z. & Minor, W. [20] Processing of X-ray diffraction data collected in oscillation mode. Methods in Enzymology 276, 307-326 (Elsevier, 1997). [0157] [34] Morin, A. et al. Collaboration gets the most out of software. Elife 2, e01456 (2013). [0158] [35] Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213-221 (2010). [0159] [36] Söding, J., Biegert, A. & Lupas, A. N. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 33, W244-8 (2005). [0160] [37] Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 66, 12-21 (2010). [0161] [38] Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486-501 (2010). [0162] [39] Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402 (1997). [0163] [40] Kelley, L. A. & Sternberg, M. J. E. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc 4, 363-371 (2009). [0164] [41] Tusnády, G. E. & Simon, I. The HMMTOP transmembrane topology prediction server.

    [0165] Bioinformatics 17, 849-850 (2001). [0166] [42] Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113 (2004). [0167] [43] Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189-1191 (2009). [0168] [44] Glaser, F. et al. ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information. Bioinformatics 19, 163-164 (2003). [0169] [45] Crooks, G. E., Hon, G., Chandonia, J.-M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188-1190 (2004). [0170] [46] Zeymer, C., Barends, T. R. M., Werbeck, N. D., Schlichting, I. & Reinstein, J. Elements in nucleotide sensing and hydrolysis of the AAA+ disaggregation machine ClpB: a structure-based mechanistic dissection of a molecular motor. Acta Crystallogr. D Biol. Crystallogr. 70, 582-595 (2014). [0171] [47] Leung, J. C. et al. Novel mutation in the TOR1A (DYT1) gene in atypical early onset dystonia and polymorphisms in dystonia and early onset parkinsonism. Neurogenetics 3, 133-143 (2001). [0172] [48] Zirn, B. et al. Novel TOR1A mutation p.Arg288Gln in early-onset dystonia (DYT1). J. Neurol. Neurosurg. Psychiatr. 79, 1327-1330 (2008). [0173] [49] Calakos, N. et al. Functional evidence implicating a novel TOR1A mutation in idiopathic, late-onset focal dystonia. J. Med. Genet. 47, 646-650 (2010). [0174] [50] Cheng, F.-B. et al. Combined occurrence of a novel TOR1A and a THAP1 mutation in primary dystonia. Mov. Disord. 29, 1079-1083 (2014). [0175] [51] Vulinovic, F. et al. Unraveling cellular phenotypes of novel TorsinA/TOR1A mutations. Hum. Mutat. 35, 1114-1122 (2014). [0176] [52] Dobri{hacek over (c)}i{umlaut over (c)}, V. et al. Phenotype of non-c.907_909delGAG mutations in TOR1A: DYT1 dystonia revisited. Parkinsonism Relat. Disord. 21, 1256-1259 (2015). [0177] [53] Kock, N. et al. Effects of genetic variations in the dystonia protein torsinA: identification of polymorphism at residue 216 as protein modifier. Hum. Mol. Genet. 15, 1355-1364 (2006). [0178] [54] Kamm, C. et al. Susceptibility to DYT1 dystonia in European patients is modified by the D216H polymorphism. Neurology 70, 2261-2262 (2008). [0179] [55] Kayman-Kurekci, G. et al. Mutation in TOR1AIP1 encoding LAP1B in a form of muscular dystrophy: a novel gene related to nuclear envelopathies. Neuromuscul. Disord. 24, 624-633 (2014). [0180] [56] Dorboz, I. et al. Severe dystonia, cerebellar atrophy, and cardiomyopathy likely caused by a missense mutation in TOR1AIP1. Orphanet J Rare Dis 9, 174 (2014).

    TABLE-US-00001 SEQUENCE LISTING SEQ ID NO: 1 TorsinA (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDEMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 6 TorsinA (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATGAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAA AGGCTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 2 TorsinA E171Q (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDQMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 7 TorsinA E171Q (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATCAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAA AGGCTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 3 TorsinA E171Q ΔE (51-332) Protein GQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSL HGWTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQ LQLWIRGNVSACARSIFIFDQMDKMHAGLIDAIKPFLDYYDLVDGVSYQK AMFIFLSNAGAERITDVALDFWRSGKQREDIKLKDIEHALSVSVFNNKNS GFWHSSLIDRNLIDYFVPFLPLEYKHLKMCIRVEMQSRGYEIDEDIVSRV AEMTFFPKEERVFSDKGCKTVFTKLDYYYDD- SEQ ID NO: 8 TorsinA E171Q ΔE (51-332) Nucleic acid GGGCAGAAGCGGAGCCTTAGCCGGGAGGCACTGCAGAAGGATCTGGACGA CAACCTCTTTGGACAGCATCTTGCAAAGAAAATCATCTTAAATGCCGTGT TTGGTTTCATAAACAACCCAAAGCCCAAGAAACCTCTCACGCTCTCCCTG CACGGGTGGACAGGCACCGGCAAAAATTTCGTCAGCAAGATCATCGCAGA GAATATTTACGAGGGTGGTCTGAACAGTGACTATGTCCACCTGTTTGTGG CCACATTGCACTTTCCACATGCTTCAAACATCACCTTGTACAAGGATCAG TTACAGTTGTGGATTCGAGGCAACGTGAGTGCCTGTGCGAGGTCCATCTT CATATTTGATCAAATGGATAAGATGCATGCAGGCCTCATAGATGCCATCA AGCCTTTCCTCGACTATTATGACCTGGTGGATGGGGTCTCCTACCAGAAA GCCATGTTCATATTTCTCAGCAATGCTGGAGCAGAAAGGATCACAGATGT GGCTTTGGATTTCTGGAGGAGTGGAAAGCAGAGGGAAGACATCAAGCTCA AAGACATTGAACACGCGTTGTCTGTGTCGGTTTTCAATAACAAGAACAGT GGCTTCTGGCACAGCAGCTTAATTGACCGGAACCTCATTGATTATTTTGT TCCCTTCCTCCCCCTGGAATACAAACACCTAAAAATGTGTATCCGAGTGG AAATGCAGTCCCGAGGCTATGAAATTGATGAAGACATTGTAAGCAGAGTG GCTGAGATGACATTTTTCCCCAAAGAGGAGAGAGTTTTCTCAGATAAAGG CTGCAAAACGGTGTTCACCAAGTTAGATTATTACTACGATGATTGA SEQ ID NO: 4 LULL1 (233-470) Protein SSVNSYYSSPAQQVPKNPALEAFLAQFSQLEDKFPGQSSFLWQRGRKFLQ KHLNASNPTEPATIIFTAAREGRETLKCLSHHVADAYTSSQKVSPIQIDG AGRTWQDSDTVKLLVDLELSYGFENGQKAAVVHHFESFPAGSTLIFYKYC DHENAAFKDVALVLTVLLEEETLEASVGPRETEEKVRDLLWAKFTNSDTP TSFNHMDSDKLSGLWSRISHLVLPVQPVSSIEEQ GCLF- SEQ ID NO: 9 LULL1 (233-470) Nucleic acid AGTTCTGTGAATAGCTACTATTCCTCTCCAGCCCAGCAAGTGCCCAAAAA TCCAGCTTTGGAGGCCTTTTTGGCCCAGTTTAGCCAATTGGAAGATAAAT TTCCAGGCCAGAGTTCCTTCCTGTGGCAGAGAGGACGGAAGTTTCTCCAG AAGCACCTCAATGCTTCCAACCCCACTGAGCCAGCCACCATCATATTTAC AGCAGCTCGGGAGGGAAGAGAGACCCTGAAGTGCCTGAGCCACCATGTTG CAGATGCCTACACCTCTTCCCAGAAAGTCTCTCCCATTCAGATTGATGGG GCTGGAAGGACCTGGCAGGACAGTGACACGGTCAAGCTGTTGGTTGACCT GGAGCTGAGCTATGGGTTTGAGAATGGCCAGAAGGCTGCTGTGGTACACC ACTTCGAATCCTTCCCTGCCGGCTCCACTTTGATCTTCTATAAGTATTGT GATCATGAGAATGCTGCCTTTAAAGATGTGGCCCTGGTCCTGACTGTTCT GCTAGAGGAGGAAACATTAGAAGCAAGTGTAGGCCCAAGGGAAACGGAAG AAAAAGTGAGAGACTTACTCTGGGCCAAGTTTACCAACTCTGACACTCCC ACCTCCTTCAACCACATGGACTCAGACAAATTGAGTGGGCTGTGGAGCCG AATTTCACACCTGGTACTGCCAGTCCAGCCAGTGAGTAGCATAGAAGAAC AGGGGTGCCTTTTCTAA SEQ ID NO: 5 VHH-BS2 (1-123) Protein MQVQLVETGGGLVQAGGSLRLSCAASGNIFSFNVMGWYRQAPGKQRELVA AITSGDTTTYADSVQGRFTISRDNAKNAVYLQMNSLTPEDTAVYFCNARR NPINGPYYTTAYWGQGTQVTVSS- SEQ ID NO: 10 VHH-BS2 (1-123) Nucleic acid ATGCAGGTGCAGCTCGTGGAGACAGGCGGGGGGTTGGTGCAGGCTGGGGG CTCTCTGAGGCTCTCCTGTGCAGCCTCTGGAAACATCTTCAGTTTCAATG TCATGGGCTGGTACCGCCAGGCTCCAGGGAAGCAGCGCGAGTTGGTCGCA GCGATCACGAGTGGTGATACGACAACCTATGCAGACTCCGTGCAGGGCCG ATTCACCATCTCCAGAGACAATGCCAAGAACGCGGTGTATCTGCAAATGA ACAGCCTGACACCTGAGGACACGGCCGTCTATTTCTGTAATGCGCGGCGC AATCCGATTAATGGTCCTTACTACACCACAGCCTACTGGGGCCAGGGGAC CCAGGTCACCGTCTCCTCATGA