Crystal structures of human Torsin-A and methods of determining and using the same
09823250 ยท 2017-11-21
Assignee
Inventors
- Thomas U. Schwartz (Brookline, MA, US)
- F. Esra Demircioglu (Boston, MA, US)
- Brian A. Sosa-Alvarado (Cambridge, MA, US)
Cpc classification
C12Y306/04
CHEMISTRY; METALLURGY
G01N23/20
PHYSICS
G16B15/30
PHYSICS
G01N2500/04
PHYSICS
G16B15/00
PHYSICS
G16B35/00
PHYSICS
International classification
G01N23/20
PHYSICS
Abstract
A protein composition including TorsinA or TorsinA mutant, LULL1, and a nanobody obtained by immunization using TorsinA and LULL1 is used to grow complex crystals, and three dimensional structures are determined using x-ray data of the crystals. A screening platform is built based on the determined three dimensional structures for designing a drug lead to cure dystonia.
Claims
1. A protein composition, comprising: a target protein; a modulator of the target protein; and a nanobody comprising a polypeptide as set forth in SEQ ID NO: 5 or a portion thereof, wherein said nanobody or said portion thereof specifically binds to at least one of the target protein and the modulator.
2. The protein composition of claim 1, wherein the target protein is TorsinA, a mutant of TorsinA, or a portion thereof.
3. The protein composition of claim 2, wherein the target protein comprises an amino acid sequence as set forth in at least one of SEQ ID NO: 1-3 or a portion thereof.
4. The protein composition of claim 1, wherein the modulator is LULL1 or a portion thereof.
5. The protein composition of claim 4, wherein the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4.
6. The protein composition of claim 1, wherein the nanobody is specific for a complex comprising the target protein and the modulator.
7. The protein composition of claim 6, wherein the nanobody is obtained by immunization using the target protein comprising the amino acid sequence set forth in the SEQ ID NO: 2 and the modulator comprising the amino acid sequence set forth in the SEQ ID NO: 4.
8. The protein composition of claim 7, wherein the target protein comprises the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or a portion thereof, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 or a portion thereof, the nanobody comprises the amino acid sequence set forth in the SEQ ID NO: 5 or a portion thereof, and the protein composition is co-expressed, and optionally purified together.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The accompanying drawings illustrate one or more embodiments of the invention and together with the written description, serve to explain the principles of the invention. Wherever possible, the same reference numbers are used throughout the drawings to refer to the same or like elements of an embodiment.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
DETAILED DESCRIPTION
(19) The present invention is more particularly described in the following examples that are intended as illustrative only since numerous modifications and variations therein will be apparent to those skilled in the art. Various embodiments of the invention are now described in detail. Referring to the drawings, like numbers indicate like components throughout the views. As used in the description herein and throughout the claims that follow, the meaning of a, an, and the includes plural reference unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of in includes in and on unless the context clearly dictates otherwise. Moreover, titles or subtitles may be used in the specification for the convenience of a reader, which shall have no influence on the scope of the present invention. Additionally, some terms used in this specification are more specifically defined below.
(20) Some references, which may include patents, patent applications and various publications, are cited and discussed in the description of this invention. The citation and/or discussion of such references is provided merely to clarify the description of the present invention and is not an admission that any such reference is prior art to the invention described herein. All references cited and discussed in this specification are incorporated herein by reference in their entireties and to the same extent as if each reference was individually incorporated by reference.
(21) The terms used in this specification generally have their ordinary meanings in the art, within the context of the invention, and in the specific context where each term is used. Certain terms that are used to describe the invention are discussed below, or elsewhere in the specification, to provide additional guidance to the practitioner regarding the description of the invention. For convenience, certain terms may be highlighted, for example using italics and/or quotation marks. The use of highlighting has no influence on the scope and meaning of a term; the scope and meaning of a term is the same, in the same context, whether or not it is highlighted. It will be appreciated that same thing can be said in more than one way. Consequently, alternative language and synonyms may be used for any one or more of the terms discussed herein, nor is any special significance to be placed upon whether or not a term is elaborated or discussed herein. Synonyms for certain terms are provided. A recital of one or more synonyms does not exclude the use of other synonyms. The use of examples anywhere in this specification including examples of any terms discussed herein is illustrative only, and in no way limits the scope and meaning of the invention or of any exemplified term. Likewise, the invention is not limited to various embodiments given in this specification.
(22) It will be understood that when an element is referred to as being on another element, it can be directly on the other element or intervening elements may be present therebetween. In contrast, when an element is referred to as being directly on another element, there are no intervening elements present. As used herein, the term and/or includes any and all combinations of one or more of the associated listed items.
(23) It will be understood that, although the terms first, second, third etc. may be used herein to describe various elements, components, regions, layers and/or sections, these elements, components, regions, layers and/or sections should not be limited by these terms. These terms are only used to distinguish one element, component, region, layer or section from another element, component, region, layer or section. Thus, a first element, component, region, layer or section discussed below could be termed a second element, component, region, layer or section without departing from the teachings of the present invention.
(24) Furthermore, relative terms, such as lower or bottom and upper or top, may be used herein to describe one element's relationship to another element as illustrated in the Figures. It will be understood that relative terms are intended to encompass different orientations of the device in addition to the orientation depicted in the Figures. For example, if the device in one of the figures is turned over, elements described as being on the lower side of other elements would then be oriented on upper sides of the other elements. The exemplary term lower, can therefore, encompasses both an orientation of lower and upper, depending of the particular orientation of the figure. Similarly, if the device in one of the figures is turned over, elements described as below or beneath other elements would then be oriented above the other elements. The exemplary terms below or beneath can, therefore, encompass both an orientation of above and below.
(25) Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure, and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
(26) As used herein, around, about or approximately shall generally mean within 20 percent, preferably within 10 percent, and more preferably within 5 percent of a given value or range. Numerical quantities given herein are approximate, meaning that the term around, about or approximately can be inferred if not expressly stated.
(27) As used herein, plurality means two or more.
(28) As used herein, the terms comprising, including, carrying, having, containing, involving, and the like are to be understood to be open-ended, i.e., to mean including but not limited to.
(29) The terms TorsinA or Torsin-1A, as used herein, refer to a protein that in humans that is encoded by the TOR1A gene (also know as DQ2 or DYT1).
(30) The term nanobody, as used herein, refers to a single-domain antibody. The single-domain antibody is an antibody fragment consisting of a single monomeric variable antibody domain. Like a whole antibody, it is able to bind selectively to a specific antigen. With a molecular weight of only 12-15 kDa, single-domain antibodies are much smaller than common antibodies (150-160 kDa) which are composed of two heavy protein chains and two light chains, and even smaller than Fab fragments (50 kDa, one light chain and half a heavy chain) and single-chain variable fragments (25 kDa, two variable domains, one from a light and one from a heavy chain).
(31) The term modulator, as used herein, refer to a substance influencing the binding of a target protein to its ligand or agonist, or inverse agonist.
(32) The most common cause of early onset primary dystonia, a neuromuscular disease, is a glutamate deletion (E) at position 302/303 of TorsinA, a AAA+ ATPase that resides in the endoplasmic reticulum including the perinuclear space [1, 2]. While the actual function of TorsinA remains elusive [3-6], the E mutation is known to diminish binding of two TorsinA ATPase activators: lamina-associated protein 1 (LAP1) and its paralog, luminal domain like LAP1 (LULL1) [7-9]. Therefore, E is likely a loss-of-function mutation [10]. A single-chain antibody fragment, a so-called nanobody, which specifically binds the TorsinA LULL1 complex, is generated. The nanobody is called VHH BS2. The resulting trimeric TorsinA(ATP)-LULL1-VHH-BS2 complex is stable in vitro and was crystallized. In addition, and most importantly, VHH-BS2 is able to stabilize the weak TorsinAE(ATP)LULL1 interaction, thus a TorsinAE(ATP)-LULL1-VHH-BS2 can also be made and was crystallized as well. The ability to stabilize a weak interaction with a reagent like VHH-BS2 is extremely rare. Using a nanobody as a crystallization chaperone, both crystal structures are solved and refined to 1.4 resolution. A comparison of these structures at this very high resolution shows, in atomic detail, the subtle differences in activator interactions that separate the healthy wild type from the diseased state DYT1 mutant TorsinA. This structure information may provide a structural platform for drug development, as a small molecule that rescues TorsinAE could serve as a cure for primary dystonia.
(33) In one aspect, the present invention relates to a protein composition. As shown in
(34) The target protein 110 may be human wild type TorsinA (SEQ ID NO: 1), mutant TorsinA.sub.EQ with a glutamate (E) to Glutamine (Q) mutation at position 171 (SEQ ID NO: 2), mutant TorsinA.sub.EQE with a glutamate (E) to Glutamine (Q) mutation at position 171 and a glutamate deletion at position 303 (or 302) (SEQ ID NO: 3), as well as TorsinA mutants having F323-Y328, R288Q, F2051, D194V, A14-P15, E121K, V129I, or D216H mutations, and portions of the above proteins. In certain embodiments, the target protein 110 includes the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 (SEQ ID NO: 1 is human TorsinA 51-332, SEQ ID NO: 2 is TorsinA 51-332 with E171Q; SEQ ID NO: 3 is human TorsinA 51-332 with E171Q and E303) or portions thereof.
(35) The modulator 130 may be an activator, an agonist, an antagonist, or an inverse agonist, of the target protein 110. When the target protein 110 is TorsinA or a mutant of TorsinA, the modulator 130 may be LAP1, LULL1, a domain or a portion of LAP1 or LULL1. In certain embodiments, the modulator may also be a drug lead that is able to bind to TorsinA or its mutant, and the drug lead may be improved based on the three dimensional complex structure of the TorsinA or its mutant and the drug lead. In certain embodiments, the modulator 130 is LULL1 or portions thereof. In one embodiment, the modulator comprises the amino acid sequence set forth in the SEQ ID NO: 4 (SEQ ID NO: 4 is LULL1 233-470) or portions thereof.
(36) The nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130. In certain embodiment, the nanobody 150 may be obtained by immunizing a model animal using both the target protein 110 and the modulator 130. In certain embodiments, the nanobody is obtained by immunization using the target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 and the modulator 130 having the amino acid sequence set forth in the SEQ ID NO: 4, or portions thereof. In certain embodiments, the obtained nanobody 150 has the amino acid sequence set forth in the SEQ ID NO:5, or portions thereof.
(37) In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2 or SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, the nanobody 150 includes the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof, and the target protein 110, themodulator 130 and the nanobody 150 in the protein composition are co-expressed and purified together.
(38) Referring to
(39) In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 2, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 , b=90.7 , and c=105.1 such that the three dimensional structure of the crystallized protein composition 110 can be determined to a resolution of about 1.4 or better (TorsinA.sub.EQ51-332 structure).
(40) In certain embodiments, as shown in
(41) After the crystals are observed and grow to a sufficient size, the crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol. Single crystal is preferably used in the flash-frozen. X-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
(42) In certain embodiments, the target protein 110 includes the amino acid sequence set forth in the SEQ ID NO: 3 or portions thereof, the modulator 130 includes the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 , b=88.1 , and c=105.4 such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 or better (TorsinA.sub.EQ51-332 E303 mutant structure).
(43) In certain embodiments, the protein composition 100 is crystallized to obtain crystals by the following steps: preparing the protein composition 100 at about 4-4.5 mg/ml; adding about 2 mM ATP to the prepared protein composition to form a protein stock; preparing a mother liquor comprising about 19% (w/v) polyethylene glycol (PEG) 3350, about 0.2 M AMSO.sub.4, and about 0.1 M Bis-Tris-HCl pH6.5; and mixing 1 l of the protein stock with 1 l of the mother liquor to form a second mixture, and inducing crystallization of the protein composition in the mixture by hanging drop/vapor diffusion under about 18 C., such that the crystals are obtained in about 3-5 days.
(44) In certain embodiments, the obtained crystals are cryoprotected by flash-frozen in liquid nitrogen after soaking in the mother liquor supplemented with 20% (v/v) glycerol, x-ray data are collected using one of the obtained crystals, and the structure of the crystallized protein composition is determined based on the collected x-ray data.
(45) In another aspect, the present invention related to a method of determining the three dimensional structure of a crystallized protein composition 100 to a resolution of about 1.4 or better. In certain embodiments, the protein composition 100 includes a target protein 110 having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, an modulator 130 of the target protein 110 having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody 150 specifically binds to at least one of the target protein 110 and the modulator 130 and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.
(46) As shown in
(47) In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 , b=90.7 , and c=105.1 such that the three dimensional structure of the crystallized protein composition 100 can be determined to a resolution of about 1.4 or better (TorsinA.sub.EQ 51-332 structure).
(48) In certain embodiments, the target protein 110 comprises the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition 100 is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 , b=88.1 , and c=105.4 such that the three dimensional structure of the crystallized protein composition can be determined to a resolution of about 1.4 or better (TorsinA.sub.EQ 51-332 E303 mutant structure).
(49) In a further aspect, the present invention relates to a method for screening compounds that bind to TorsinA. In certain embodiments, the method includes providing a protein composition as described above comprising TorsinA, and a library of test compounds, treating the protein composition with a test compound, determining whether the compound binds to TorsinA, where a compound that binds TorsinA is indicative of a compound that is a candidate TorsinA agonist or antagonist, and optionally determining a three dimensional crystal structure of TorsinA with and/or without the bound compound to a resolution of about 1.4 or better.
(50) The TorsinA structure may include TorsinA.sub.EQ structure, TorsinA.sub.EQE303 structure, as well as their complex structures with modulators such as LULL1 or LAP1, and/or ATP. After analyzing one or more of the three dimensional structures of TorsinA, a targeting binding area of TorsinA or a targeting binding interface between TorsinA and its modulator, is chosen for designing a lead as drug candidate. The lead may be rationally designed, virtually screened, or directly screened by activity. The lead is then crystallized, for example using the method as shown in
(51) In certain embodiments, the crystals of TorsinA are grown using a protein composition 100 including: TorsinA having the amino acid sequence set forth in at least one of SEQ ID NO: 1-3 or portions thereof, a modulator of TorsinA having the amino acid sequence set forth in the SEQ ID NO: 4 or portions thereof, and a nanobody specifically binds to at least one of TorsinA and the modulator and having the amino acid sequence set forth in the SEQ ID NO: 5 or portions thereof.
(52) In certain embodiments, TorsinA includes TorsinA.sub.EQ E303 having the amino acid sequence set forth in the SEQ ID NO: 3, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.5 , b=88.1 , and c=105.4 such that the three dimensional structure of the crystallized protein composition having the TorsinA.sub.EQ E303, the crystallized protein composition having TorsinA E303 can be determined to a resolution of about 1.4 or better (TorsinA.sub.EQ 51-332 E303 mutant structure).
(53) In certain embodiments, the TorsinA comprises TorsinA E171Q having the amino acid sequence set forth in the SEQ ID NO: 2, and the protein composition is crystallized to obtain crystals of space group P2.sub.12.sub.12.sub.1 with approximate a=75.7 , b=90.7 , and c=105.1 such that the three dimensional structure of the crystallized protein composition having TorsinA E171Q can be determined to a resolution of about 1.4 or better (TorsinA 51-332/E171Q).
(54) In certain embodiments, a binding location of the modulator is determined by comparing the three dimensional structure of the crystallized protein composition having TorsinA E303 and the three dimensional structure of the crystallized protein composition having TorsinA E171Q.
(55) In certain embodiments, the modulator is virtually screened against the binding location of the three dimensional structure of the TorsinA.sub.EQ E303.
(56) In certain embodiments, the modulator is co-crystallized with the TorsinA.sub.EQ E303 and at least one of the modulator and the nanobody to obtain a three dimensional structure having the TorsinA.sub.EQ E303 and the modulator, such that modification of the modulator is conducted based on the three dimensional structure having the TorsinA E303.
(57) Certain embodiments of the present application, among other things, crystallized TorsinA which is a difficult to crystalize. Using this method, variety of TorsinA mutants and their complex structures can be determined. This is not achieved by any others before this invention.
(58) Further, by comparing the TorsinA.sub.EQ structure and TorsinA.sub.EQ E303 structure, a novel funcartional mechanism and novel binding site is determined, which can be used as the basis for structural based rational drug design. This information provides a structural platform to develop drug that can rescue TorsinA E303 or other type of mutants so that the TorsinA E303 become funcational. The drug is then useful for cure primary dystonia.
(59) These and other aspects of the present invention are more specifically described below. Without intent to limit the scope of the invention, exemplary methods and their related results according to the embodiments of the present invention are given below. Note that titles or subtitles may be used in the examples for convenience of a reader, which in no way should limit the scope of the invention. Moreover, certain theories are proposed and disclosed herein; however, in no way they, whether they are right or wrong, should limit the scope of the invention so long as the invention is practiced according to the invention without regard for any particular theory or scheme of action.
EXAMPLES
Example 1: Generation and Selection of Nanobodies
(60) To investigate the molecular basis for primary dystonia as a result of the glutamate 302/303 deletion in TorsinA, a structural approach is taken. TorsinA is a catalytically inactive AAA+ ATPase [11-13], notoriously ill-behaved in vitro, primarily due to its limited solubility and stability. These problems were partially overcome by stabilizing an ATP-trapped E171Q mutant of human TorsinA (residues 51-332; SEQ ID NO: 2) by co-expressing it with the luminal activation domain of human LULL1 (residues 233-470; SEQ ID NO: 4). This resulted in a better behaved heterodimeric complex (
(61) Specifically, for obtaining the VHH-BS2 nanobody, purified human TorsinA.sub.EQ-LULL1 complex was injected into a male alpaca (Lama pacos) for immunization. Generation and screening of nanobodies was carried out as previously described [14]. Each of the selected nanobodies was subcloned into a pET-30b(+) vector with a C-terminal His.sub.6-tag. Each nanobody was bacterially expressed and Ni.sup.2+-affinity purified essentially as described (see below). Different from the TorsinA-containing preparations, MgCl.sub.2 and ATP were eliminated from all buffer solutions. The Ni.sup.2+-eluate was purified via size exclusion chromatography on a Superdex S75 column (GE Healthcare) in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl). Nanobody binding was validated by size exclusion chromatography on a 10/300 Superdex S200 column in 10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2 and 0.5 mM ATP.
(62) Equimolar amounts of TorsinA.sub.EQ-LULL1 and TorsinA.sub.EQ-LULL1-VHH were loaded and nanobody binding was monitored by a shift in the elution profile and via SDS-PAGE analysis. After validating VHH-BS2 interaction with TorsinA.sub.EQ-LULL1, the C-terminal His.sub.6-tag of VHH-BS2 was removed from the pET-30b(+) vector for co-purification experiments.
Example 2: Constructs, Protein Expression and Purification
(63) DNA sequences encoding human TorsinA (residues 51-332) and the luminal domain of human LULL1 (residues 233-470) were cloned into a modified ampicillin resistant pETDuet-1 vector (EMD Millipore). TorsinA, N-terminally fused with a human rhinovirus 3C protease cleavable 10xHis-7xArg tag, was inserted into the first multiple cloning site (MCS), whereas the untagged LULL1 was inserted into the second MCS. Mutations on TorsinA and LULL1 were introduced by site-directed mutagenesis. The untagged VHH-BS2 nanobody was cloned into a separate, modified kanamycin resistant pET-30b(+) vector (EMD Biosciences).
(64) To co-express TorsinA (EQ or EQ/AE), LULL1 and VHH-BS2 for crystallization, the E. coli strain LOBSTR(DE3) RIL (Kerafast) [32] was co-transformed with the two constructs described above. Cells were grown at 37 C. in lysogeny broth (LB) medium supplemented with 100 g ml.sup.1 ampicillin, 25 g ml.sup.1 kanamaycin and 34 g ml.sup.1 hloramphenicol until an optical density (OD.sub.600) of 0.6-0.8 was reached, shifted to 18 C. for 20 min, and induced overnight at 18 C. with 0.2 mM isopropyl -D-1-thiogalactopyranoside (IPTG). The bacterial cultures were harvested by centrifugation, suspended in lysis buffer (50 mM HEPES/NaOH pH 8.0, 400 mM NaCl, 40 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP) and lysed with a cell disruptor (Constant Systems). The lysate was immediately mixed with 0.1 M phenylmethanesulfonyl fluoride (PMSF) (50 l per 10 ml lysate) and 250 units of TurboNuclease (Eton Bioscience), and cleared by centrifugation. The soluble fraction was gently mixed with Ni-Sepharose 6 Fast Flow (GE Healthcare) resin for 30 min at 4 C. After washing with the lysis buffer, bound protein was eluted in elution buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 300 mM imidazole, 10 mM MgCl.sub.2, and 1 mM ATP). The eluted protein complex was immediately purified by size exclusion chromatography on a Superdex S200 column (GE Healthcare) equilibrated in running buffer (10 mM HEPES/NaOH pH 8.0, 150 mM NaCl, 10 mM MgCl.sub.2, and 0.5 mM ATP). Following the tag removal by 10xHis-7xArg-3C protease, the fusion tags and the protease were separated from the complex by cation-exchange chromatography on a HiTrapS column (GE Healthcare) using a linear NaCl gradient. The flow-through from the cation-exchange chromatography, containing the protein complex, was purified again by size exclusion chromatography on a Superdex S200 column as at the previous step.
(65) For the non-structural analysis of TorsinA and LULL1 variants, the pETDuet-1-based expression plasmid was transformed into LOBSTR(DE3) RIL cells without co-expressing nanobody VHH-BS2. Ni.sup.2+-affinity purification was performed as described above and bound protein was eluted. Aliquots from the Ni.sup.2+-eluate and the total lysate were collected and analyzed by SDS-PAGE gel electrophoresis.
Example 3: Crystallization
(66) Purified TorsinA.sub.EQ-LULL1-VHH-BS2 and TorsinA.sub.EQE-LULL1-VHH-BS2 complexes were concentrated up to 4-4.5 mg/ml and supplemented with 2 mM ATP prior to crystallization. The TorsinA.sub.EQ containing complex crystallized in 13% (w/v) polyethylene glycol (PEG) 6000, 5% (v/v) 2-Methyl-2,4-pentanediol, and 0.1 M MES pH 6.5. The TorsinA.sub.EQE containing complex crystallized in 19% (w/v) PEG 3350, 0.2 M AmSO4, and 0.1 M Bis-Tris-HCl pH 6.5. Crystals of both complexes grew at 18 C. in hanging drops containing 1 l of protein and 1 l of mother liquor. Clusters of diffraction quality, rod-shaped crystals formed within 3-5 days. Single crystals were briefly soaked in mother liquor supplemented with 20% (v/v) glycerol for cryoprotection and flash-frozen in liquid nitrogen.
Example 4: Data Collection and Structure Determination
(67) X-ray data were collected at NE-CAT beamline 24-ID-C at Argonne National Laboratory. Data reduction was performed with the HKL2000 package [33], and all subsequent data-processing steps were carried out using programs provided through SBGrid [34]. The structure of the TorsinA.sub.EQ-LULL1-VHH-BS2 complex was solved by molecular replacement (MR) using the Phaser-MR tool from the PHENIX suite [35]. A three-part MR solution was easily obtained using a sequential search for models of LULL1, VHH-BS2, and TorsinA. The LULL1 model was generated based on the published human LAP1 structure (PDB 4TVS, chain A), using the Sculptor utility of the PHENIX suite (LULL.sub.1241-470 and LAP.sub.1356-583 share 64% sequence identity). The VHH-BS2 model was based on VHH-BS1 (PDB 4TVS, chain a) after removing the complementarity determining regions (CDRs). The poly-Ala model of TorsinA was generated based on E. coli ClpA (PDB 1R6B) using the MODELLER tool of the HHpred server [36]. The asymmetric unit contains one TorsinA.sub.EQ-LULL1-VHH-BS2 complex. Iterative model building and refinement steps gradually improved the electron density maps and the model statistics. The stereochemical quality of the final model was validated by Molprobity [37]. TorsinA.sub.EQE-LULL1-VHH-BS2 crystallized in the same unit cell. Model building was carried starting from a truncated TorsinA.sub.EQ-LULL1-VHH-BS2 structure. All manual model building steps were carried out with Coot [38], and phenix.refine was used for iterative refinement. Two alternate conformations of a loop in LULL1 (residues 428-438) were detected in the Fo-Fc difference electron density maps of both structures, and they were partially built. For comparison, the cysteine residues of TorsinA at the catalytic site (residues 280 and 319 in the TorsinA.sub.EQ structure) were built in the reduced and the oxidized states, respectively. Building them as oxidized, disulfide-bridged residues consistently produced substantial residual Fo-Fc difference density, which disappeared assuming a reduced state. Statistical parameters of data collection and refinement are all given in Table 1 in
Example 5: Bioinformatic Analysis
(68) Torsin and LAP1/LULL1 sequences were obtained via PSI-BLAST [39] and Backphyre searches [40]. Transmembrane domains were predicted using the HMMTOP tool [41]. LAP1/LULL1 proteins were distinguished based on the calculated isoelectric point (pI) of their extra-luminal portions. The intranuclear domain of LAP1 has a characteristically high pI of 8.5-10 due to a clustering of basic residues, while the cytoplasmic domain of LULL1 is distinctively more acidic. Multiple sequence alignments were performed using MUSCLE [42], and visualized by Jalview [43]. To illustrate evolutionary conservation on TorsinA and LULL1 surfaces, conservation scores for each residue were calculated using the ConSurf server with default parameters [44].
Example 6: Structure Analysis
(69) A stable, heterotrimeric complex of TorsinA.sub.EQ(ATP)-LULL1-VHH-BS2 was readily crystallized in the presence of ATP. A 1.4 dataset was collected and the structure was solved by molecular replacement, using the LULL1-homolog LAP1 and a VHH template as search models [14] (Example 4, and Table 1 in
(70) AAA+ ATPases are organized into a number of structurally defined clades [12, 18], distinguished by shared structural elements. Comparison with other AAA+ ATPase structures shows that TorsinA fits best into a clade that also contains the bacterial proteins Hs1U, ClpA/B, ClpX, and Lon, all of which are involved in protein degradation or remodeling [13]. These AAA+ family members share a -hairpin insertion that precedes the sensor-I region (
(71) The interaction of TorsinA with its ATPase activators LULL1 and LAP1 is of particularly importance, as a prominent mutation causing primary dystoniathe deletion of glutamate 302 or 303weakens these interaction [7-9]. But why and how? The TorsinA-LULL1 interface extends over an area of 1527 .sup.2. The main structural elements involved in this interaction are the nucleotide-binding region as well as the small domain of TorsinA, and helices 0, 2, 4 and 5 of LULL1 (
(72) To investigate the atomic details of the weakened binding of TorsinAAE to LAP1/LULL1, and thus the molecular basis of primary dystonia, we made use of the observation that VHH-BS2 also stabilizes the TorsinA.sub.EQAE(ATP)-LULL1 interaction. We were able to crystallize TorsinA.sub.EQAE(ATP)-LULL1-VHH-BS2 and determine its structure at a resolution of 1.4 . Not surprisingly, the overall structure is almost identical to the wild type protein (0.34 rmsd over 274 C atoms for TorsinA, 0.26 rmsd over 229 C atoms for LULL1), except for critical differences in the TorsinA-LULL1 interface (
(73) Although TorsinAE303 is the most prevalent mutation that causes primary dystonia, it is not the only one [5, 6]. We examined the structural consequence of all known mutations (
(74) The biological function of TorsinA remains enigmatic [24-28]. Because TorsinA belongs to the AAA+ ATPase superfamily, with specific homology to the bacterial proteins Hs1U, ClpX, ClpA/B and Lon, it is generally assumed that TorsinA is involved in protein remodeling or protein degradation [5, 6]. However, a substrate of TorsinA has yet to be identified.
(75) The TorsinA structure enables a more thorough comparison to other AAA+ ATPases. After the discovery that LAP1/LULL1 are Arg-finger containing TorsinA activators, it seemed reasonable to suggest that TorsinA and LAP1/LULL1 likely form heterohexameric rings ((TorsinA-ATP-LAP1/LULL1).sub.3) in order to function [14, 16]. However, the predominant oligomeric form of the TorsinA-ATP-LAP1/LULL1 complex in solution is largely heterodimeric, with the heterohexameric form present as only a small fraction [14, 16, 29-31]. Our structure now raises doubts about the physiological relevance of a heterohexameric ring (
(76) The foregoing description of the exemplary embodiments of the invention has been presented only for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in light of the above teaching.
(77) The embodiments were chosen and described in order to explain the principles of the invention and their practical application so as to enable others skilled in the art to utilize the invention and various embodiments and with various modifications as are suited to the particular use contemplated. Alternative embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its spirit and scope. Accordingly, the scope of the present invention is defined by the appended claims rather than the foregoing description and the exemplary embodiments described therein.
REFERENCE LIST
(78) The following references are incorporated herein by reference in their entirety for all purposes. [1] Ozelius, L. J. et al. The early-onset torsion dystonia gene (DYT1) encodes an ATP-binding protein. Nat. Genet. 17, 40-48 (1997). [2] Breakefield, X. O. et al. The pathophysiological basis of dystonias. Nat. Rev. Neurosci. 9, 222-234 (2008). [3] Granata, A. & Warner, T. T. The role of torsinA in dystonia. Eur. J. Neurol. 17 Suppl 1, 81-87 (2010). [4] McCullough, J. & Sundquist, W. I. Putting a finger in the ring. Nat. Struct. Mol. Biol. 21, 1025-1027 (2014). [5] Rose, A. E., Brown, R. S. H. & Schlieker, C. Torsins: not your typical AAA+ ATPases. Crit. Rev. Biochem. Mol. Biol. 50, 532-549 (2015). [6] Laudermilch, E. & Schlieker, C. TorsinATPases: structural insights and functional perspectives. Curr. Opin. Cell Biol. 40, 1-7 (2016). [7] Naismith, T. V., Dalal, S. & Hanson, P. I. Interaction of torsinA with its major binding partners is impaired by the dystonia-associated DeltaGAG deletion. J. Biol. Chem. 284, 27866-27874 (2009). [8] Zhu, L., Millen, L., Mendoza, J. L. & Thomas, P. J. A unique redox-sensing sensor II motif in TorsinA plays a critical role in nucleotide and partner binding. J. Biol. Chem. 285, 37271-37280 (2010). [9] Zhao, C., Brown, R. S. H., Chase, A. R., Eisele, M. R. & Schlieker, C. Regulation of TorsinATPases by LAP1 and LULL1. Proc. Natl. Acad. Sci. U.S.A. 110, E1545-54 (2013). [10] Goodchild, R. E., Kim, C. E. & Dauer, W. T. Loss of the dystonia-associated protein torsinA selectively disrupts the neuronal nuclear envelope. Neuron 48, 923-932 (2005). [11] Hanson, P. I. & Whiteheart, S. W. AAA+ proteins: have engine, will work. Nat. Rev. Mol. Cell Biol. 6, 519-529 (2005). [12] Erzberger, J. P. & Berger, J. M. Evolutionary relationships and structural mechanisms of AAA+ proteins. Annu Rev Biophys Biomol Struct 35, 93-114 (2006). [13] Olivares, A. O., Baker, T. A. & Sauer, R. T. Mechanistic insights into bacterial AAA+ proteases and protein-remodelling machines. Nat. Rev. Microbiol. 14, 33-44 (2016). [14] Sosa, B. A. et al. How lamina-associated polypeptide 1 (LAP1) activates Torsin. Elife 3, e03239 (2014). [15] Wendler, P., Ciniawsky, S., Kock, M. & Kube, S. Structure and function of the AAA+ nucleotide binding pocket. Biochim. Biophys. Acta 1823, 2-14 (2012). [16] Brown, R. S. H., Zhao, C., Chase, A. R., Wang, J. & Schlieker, C. The mechanism of TorsinATPase activation. Proc. Natl. Acad. Sci. U.S.A. 111, E4822-31 (2014). [17] Muyldermans, S. Nanobodies: natural single-domain antibodies. Annu. Rev. Biochem. 82, 775-797 (2013). [18] Iyer, L. M., Leipe, D. D., Koonin, E. V. & Aravind, L. Evolutionary history and higher order classification of AAA+ ATPases. J. Struct. Biol. 146, 11-31 (2004). [19] Sauer, R. T. & Baker, T. A. AAA+ proteases: ATP-fueled machines of protein destruction. Annu. Rev. Biochem. 80, 587-612 (2011). [20] Zhu, L., Wrabl, J. O., Hayashi, A. P., Rose, L. S. & Thomas, P. J. The torsin-family AAA+ protein OOC-5 contains a critical disulfide adjacent to Sensor-II that couples redox state to nucleotide binding. Mol. Biol. Cell 19, 3599-3612 (2008). [21] Goodchild, R. E. & Dauer, W. T. The AAA+ protein torsinA interacts with a conserved domain present in LAP1 and a novel ER protein. J. Cell Biol. 168, 855-862 (2005). [22] Goodchild, R. E. & Dauer, W. T. Mislocalization to the nuclear envelope: an effect of the dystonia-causing torsinA mutation. Proc. Natl. Acad. Sci. U.S.A. 101, 847-852 (2004). [23] Kim, C. E., Perez, A., Perkins, G., Ellisman, M. H. & Dauer, W. T. A molecular mechanism underlying the neural-specific defect in torsinA mutant mice. Proc. Natl. Acad. Sci. U.S.A. 107, 9861-9866 (2010). [24] Nery, F. C. et al. TorsinA binds the KASH domain of nesprins and participates in linkage between nuclear envelope and cytoskeleton. J. Cell. Sci. 121, 3476-3486 (2008). [25] Granata, A., Koo, S. J., Haucke, V., Schiavo, G. & Warner, T. T. CSN complex controls the stability of selected synaptic proteins via a torsinA-dependent process. EMBO J. 30, 181-193 (2011). [26] Nery, F. C. et al. TorsinA participates in endoplasmic reticulum-associated degradation. Nat Commun 2, 393 (2011). [27] Jokhi, V. et al. Torsin mediates primary envelopment of large ribonucleoprotein granules at the nuclear envelope. Cell Rep 3, 988-995 (2013). [28] Liang, C.-C., Tanabe, L. M., Jou, S., Chi, F. & Dauer, W. T. TorsinA hypofunction causes abnormal twisting movements and sensorimotor circuit neurodegeneration. J. Clin. Invest. 124, 3080-3092 (2014). [29] Jungwirth, M., Dear, M. L., Brown, P., Holbrook, K. & Goodchild, R. Relative tissue expression of homologous torsinB correlates with the neuronal specific importance of DYT1 dystonia-associated torsinA. Hum. Mol. Genet. 19, 888-900 (2010). [30] Vander Heyden, A. B., Naismith, T. V., Snapp, E. L., Hodzic, D. & Hanson, P. I. LULL1 retargets TorsinA to the nuclear envelope revealing an activity that is impaired by the DYT1 dystonia mutation. Mol. Biol. Cell 20, 2661-2672 (2009). [31] Goodchild, R. E. et al. Access of torsinA to the inner nuclear membrane is activity dependent and regulated in the endoplasmic reticulum. J. Cell. Sci. 128, 2854-2865 (2015). [32] Andersen, K. R., Leksa, N. C. & Schwartz, T. U. Optimized E. coli expression strain LOBSTR eliminates common contaminants from His-tag purification. Proteins 81, 1857-1861 (2013). [33] Otwinowski, Z. & Minor, W. [20] Processing of X-ray diffraction data collected in oscillation mode. Methods in Enzymology 276, 307-326 (Elsevier, 1997). [34] Morin, A. et al. Collaboration gets the most out of software. Elife 2, e01456 (2013). [35] Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213-221 (2010). [36] Sding, J., Biegert, A. & Lupas, A. N. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 33, W244-8 (2005). [37] Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 66, 12-21 (2010). [38] Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486-501 (2010). [39] Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402 (1997). [40] Kelley, L. A. & Sternberg, M. J. E. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc 4, 363-371 (2009). [41] Tusndy, G. E. & Simon, I. The HMMTOP transmembrane topology prediction server. Bioinformatics 17, 849-850 (2001). [42] Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113 (2004). [43] Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189-1191 (2009). [44] Glaser, F. et al. ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information. Bioinformatics 19, 163-164 (2003). [45] Crooks, G. E., Hon, G., Chandonia, J.-M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188-1190 (2004). [46] Zeymer, C., Barends, T. R. M., Werbeck, N. D., Schlichting, I. & Reinstein, J. Elements in nucleotide sensing and hydrolysis of the AAA+ disaggregation machine ClpB: a structure-based mechanistic dissection of a molecular motor. Acta Crystallogr. D Biol. Crystallogr. 70, 582-595 (2014). [47] Leung, J. C. et al. Novel mutation in the TOR1A (DYT1) gene in atypical early onset dystonia and polymorphisms in dystonia and early onset parkinsonism. Neurogenetics 3, 133-143 (2001). [48] Zirn, B. et al. Novel TOR1A mutation p.Arg288Gln in early-onset dystonia (DYT1). J. Neurol. Neurosurg. Psychiatr. 79, 1327-1330 (2008). [49] Calakos, N. et al. Functional evidence implicating a novel TOR1A mutation in idiopathic, late-onset focal dystonia. J. Med. Genet. 47, 646-650 (2010). [50] Cheng, F.-B. et al. Combined occurrence of a novel TOR1A and a THAP1 mutation in primary dystonia. Mov. Disord. 29, 1079-1083 (2014). [51] Vulinovic, F. et al. Unraveling cellular phenotypes of novel TorsinA/TOR1A mutations. Hum. Mutat. 35, 1114-1122 (2014). [52] Dobri{hacek over (c)}i, V. et al. Phenotype of non-c.907_909delGAG mutations in TOR1A: DYT1 dystonia revisited. Parkinsonism Relat. Disord. 21, 1256-1259 (2015). [53] Kock, N. et al. Effects of genetic variations in the dystonia protein torsinA: identification of polymorphism at residue 216 as protein modifier. Hum. Mol. Genet. 15, 1355-1364 (2006). [54] Kamm, C. et al. Susceptibility to DYT1 dystonia in European patients is modified by the D216H polymorphism. Neurology 70, 2261-2262 (2008). [55] Kayman-Kurekci, G. et al. Mutation in TOR1AIP1 encoding LAP1B in a form of muscular dystrophy: a novel gene related to nuclear envelopathies. Neuromuscul. Disord. 24, 624-633 (2014). [56] Dorboz, I. et al. Severe dystonia, cerebellar atrophy, and cardiomyopathy likely caused by a missense mutation in TOR1AIP1. Orphanet J Rare Dis 9, 174 (2014).