PRO-MACROBODIES FOR THE ENHANCEMENT OF STRUCTURE RESEARCH
20230072235 · 2023-03-09
Inventors
Cpc classification
C07K2319/35
CHEMISTRY; METALLURGY
C07K2317/569
CHEMISTRY; METALLURGY
A61K39/00
HUMAN NECESSITIES
C07K2319/24
CHEMISTRY; METALLURGY
G01N33/6803
PHYSICS
C07K19/00
CHEMISTRY; METALLURGY
A61P25/28
HUMAN NECESSITIES
International classification
C07K19/00
CHEMISTRY; METALLURGY
Abstract
The present invention relates to a research tool for structural biology, in particular to enhance the determination of the three-dimensional structure of biological macromolecules. More specifically, the invention serves to improve the overall feasibility of structure determination providing higher resolution and better quality of the three-dimensional structures of proteins by complex-formation with a novel fusion polypeptide.
Claims
1. A fusion polypeptide comprising a first polypeptide which is an antigen-binding domain and a second polypeptide which is a polypeptide-scaffold, wherein the polypeptide-scaffold comprises parallel or antiparallel beta-strands and the antigen binding domain is linked by a peptide linker at its C-terminus to the N-terminus of the polypeptide-scaffold and wherein said peptide linker consists of one or more proline residues.
2. A fusion polypeptide according to claim 1, wherein the polypeptide-scaffold is a maltose binding protein.
3. A fusion polypeptide according to claim 1 or 2 wherein the polypeptide-scaffold is Escherichia coli maltose binding protein, Uniprot entry P0AEX9.
4. A fusion polypeptide according to any preceding claim, wherein the peptide linker consists of one, two, three or four proline residues.
5. A fusion polypeptide according to any preceding claim, wherein the polypeptide-scaffold comprises a polypeptide of the superfamily of periplasmic binding proteins of Interpro entry IPR025997, or a polypeptide of the superfamily of periplasmic binding protein-like I integral of Interpro entry IPR028082, or a periplasmic binding protein domain Pfam domain Peripla_BP_4of PF13407.
6. A fusion polypeptide according to any preceding claim, wherein the antigen binding domain comprises an immunoglobulin-like fold.
7. A fusion polypeptide according to any preceding claim, wherein the antigen-binding domain comprises an immunoglobulin (Ig) domain.
8. A fusion polypeptide according to any preceding claim, wherein the antigen-binding domain is a VHH antigen-binding domain.
9. A fusion polypeptide according to any preceding claim, wherein the antigen-binding domain comprises a camelid VHH or elasmobranch VHH, or shark VHH, or ray VHH or skate VHH or sawfish VHH or VHH domains from heavy or light chain of mammalian antibodies or monobodies.
10. An amino acid sequence encoding the fusion polypeptide of anyone of claims 1-10.
11. An amino acid sequence according to claim 10, comprising Sequence ID NO: 001 or Sequence ID NO: 002.
12. A complex comprising: i) the fusion polypeptide of any one of claims 1-9, and ii) a target protein, wherein said target protein is specifically bound to said fusion polypeptide.
13. The use of the fusion polypeptide according to anyone of claims 1-9, the amino acid sequence according to claims 10 and 11, and the complex according to claim 12 for structural analyses of a target protein.
14. The use of the fusion polypeptide according to any one of claims 1-9 as a medicine.
15. The use of the fusion polypeptide according to any one of claims 1-9 for diagnostic purposes.
Description
DESCRIPTION OF FIGURES
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
DETAILED DESCRIPTION OF THE INVENTION
[0057] Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. The herein described and disclosed embodiments, preferred embodiments and very preferred embodiments should apply to all aspects and other embodiments, preferred embodiments and very preferred embodiments irrespective of whether it is specifically again referred to or its repetition is avoided for the sake of conciseness. The articles “a” and “an”, as used herein, refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. The term “or”, as used herein, should be understood to mean “and/or”, unless the context clearly indicates otherwise.
[0058] In an aspect, the present invention provides a fusion polypeptide comprising a first polypeptide which is an antigen-binding domain and a second polypeptide which is a polypeptide-scaffold, wherein the polypeptide-scaffold comprises parallel or antiparallel beta-strands and the antigen binding domain is linked by a peptide linker at its C-terminus to the N-terminus of the polypeptide-scaffold and wherein said peptide linker consists of one or more proline residues.
[0059] As used herein, the “antigen-binding domain” is limited only by binding to the target antigen. As the antigen binding domain, domains of any structure can be used as long as they bind to the target antigen. One suitable example of the antigen binding domain of the present invention is a single domain antibody (sdAb).
[0060] The term “polypeptide” refers to any sequence of two or more amino acids, regardless of length, post-translation modification, or function. Polypeptides can include natural amino acids and non-natural amino acids. In one embodiment the fusion protein comprises adjacent strands of the beta sheet structure and may be parallel or anti-parallel to each other.
[0061] As used herein, the term “linker” refers to a linkage between two elements, e.g., protein domains. A linker can be a covalent bond or a spacer. The term “bond” refers to a chemical bond, e.g., an amide bond or a disulfide bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation. The term “spacer” refers to a moiety (e.g., a polyethylene glycol (PEG) polymer) or an amino acid sequence (e.g., a 3-200 amino acid, 3-150 amino acid, or 3-100 amino acid sequence) occurring between two polypeptides or polypeptide domains to provide space and/or flexibility between the two polypeptides or polypeptide domains.
[0062] The “C terminus” or “carboxy terminus” are used herein interchangeably and are defined herein as they are normally used in the art. An amino acid structure contains a carbon atom known as the carbon to which is bonded an amine group, a carboxylic acid group and a side chain. The N-terminus (also known as the amino-terminus, NH.sub.2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide referring to the free amine group (−NH.sub.2) located at the end of a polypeptide.
[0063] In one embodiment the present invention provides the fusion polypeptide wherein the polypeptide-scaffold is a maltose binding protein.
[0064] In another embodiment the present invention provides the fusion polypeptide wherein the polypeptide-scaffold is Escherichia coli maltose binding protein, Uniprot entry P0AEX9.
[0065] In another embodiment the present invention provides the fusion polypeptide wherein the peptide linker consists of one, two, three or four proline residues.
[0066] In another embodiment the present invention provides the fusion polypeptide wherein the peptide linker consists of two proline residues. As indicated, such a rigid linker of two proline residues between the C-terminal end of an antigen binding domain, preferably a VHH antigen binding domain, and the polypeptide-scaffold, preferably a second domain that begins with a beta-sheet architecture such as the truncated N-terminus of the bacterial periplasmic Maltose Binding Proteins (MBP).
[0067] In another embodiment, the present invention provides the fusion polypeptide, wherein the C-terminal end of said antigen binding domain consists of the amino acid sequence VTV. Such C-terminus sequence is typically present in all VHH domains of heavy chain antibodies from camelids or elasmobranchs or synthetic libraries derived thereof, or the VHH domain of classical two-chain antibodies (such as IgGs) or synthetic antigen binding domains with immunoglobulin-architecture like monobodies. In another embodiment, the present invention provides the fusion polypeptide wherein, the C-terminal end of said antigen binding domain consists of the amino acid sequence VTV, wherein said antigen binding domain is a VHH antigen binding domain.
[0068] In another embodiment, the present invention provides the fusion polypeptide wherein, the N-terminal end of said polypeptide-scaffold consists of the amino acid sequence LVI. Preferably, said polypeptide-scaffold, said amino acid sequence LVI is comprised by and typically begins a beta-sheet architecture such as the truncated N-terminus of the bacterial periplasmic Maltose Binding Proteins (MBP). The scaffold fusion partner polypeptide at the C-terminus of the antigen binding domain is the truncated N-terminus of bacterial periplasmic Maltose Binding Proteins (MBP) in a stricter sense and, in a wider sense all periplasmic binding proteins but in particular the protein malE of Escherichia coli (Uniprot entry P0AEX9). In another embodiment, the present invention provides the fusion polypeptide wherein, the N-terminal end of said polypeptide-scaffold consists of the sequence LVI, wherein said polypeptide-scaffold is a Escherichia coli maltose binding protein, Uniprot entry P0AEX9, and wherein preferably said polypeptide-scaffold is a truncated N-terminus of the bacterial periplasmic Maltose Binding Proteins (MBP).
[0069] In another embodiment, the present invention provides the fusion polypeptide wherein the C-terminal end of said antigen binding domain is linked by said peptide linker, which consists of two prolines to the N-terminus of the polypeptide-scaffold, and wherein the C-terminal end of said antigen binding domain linked by said peptide linker to the to the N-terminus of the polypeptide-scaffold comprises the amino acid sequence VTVPPLVI (SEQ ID NO: 10).
[0070] In another embodiment, the present invention provides the fusion polypeptide comprises the amino acid sequence VTVPPLVI (SEQ ID NO: 10).
[0071] In another embodiment, the present invention provides the fusion polypeptide, wherein the C-terminal end of said antigen binding domain, wherein said antigen binding domain is a VHH antigen binding domain, is linked by said peptide linker, which consists of two prolines, i.e. wherein said peptide linker is di-amino acid PP-linker, to the N-terminus of the polypeptide-scaffold, wherein said polypeptide-scaffold is a Escherichia coli maltose binding protein, Uniprot entry P0AEX9, and preferably wherein said polypeptide-scaffold is a truncated N-terminus of the bacterial periplasmic Maltose Binding Proteins (MBP), wherein the C-terminal end of said antigen binding domain linked by said peptide linker to the to the N-terminus of the polypeptide-scaffold comprises the amino acid sequence VTVPPLVI (SEQ ID NO: 10). A rigidification and linear connection between the two proteins is achieved by two precisely positioned proline-residues. The resulting fusion protein, where the VHH antigen binding domain can be replaced with a VHH of any given specificity, serves as a molecular chaperone to enlarge small proteins, in particular membrane proteins. Surprisingly the modification with a di-proline linker results in a major conformational and structural reorganization of the domains relative to each other compared to the structure of the macrobodies lacking the two proline linker. Most remarkably, this new preferred fusion protein linkage is highly rigidified with respect to relative motions between the VHH antigen binding domain and the polypeptide-scaffold, preferably the MBP moiety.
[0072] In another embodiment, the present invention provides a fusion polypeptide, wherein the peptide linker connects the C-terminal portion of the antigen-binding domain with the N-terminal portion of the polypeptide-scaffold.
[0073] In another embodiment, the present invention provides, a fusion polypeptide, wherein the peptide linker connects the conserved and truncated carboxy-terminus of single chain antibody VHH domains of camelids after the conserved sequence Valine-Threonine-Valine (VTV) at the end of the beta-strand G to the truncated amino-terminus of the Escherichia coli maltose binding protein starting from the amino acid position leucine 7, the first amino acid in the beta-strand A of maltose binding protein.
[0074] As used herein, the term “amino acid positions” refers to the position numbers of amino acids in a protein or protein domain.
[0075] In another embodiment, the present invention provides a fusion polypeptide, wherein the C-terminus of the VHH of the antigen binding domain comprises three amino acids Valine-Threonine-Valine and the N-terminus of the maltose binding protein comprises three amino acids Leucine-Valine-Isoleucine.
[0076] In another embodiment, the present invention provides a fusion polypeptide, wherein the carboxy terminus of the peptide linker is fused to the first amino acid of the most amino-terminal beta-strand of the polypeptide-scaffold.
[0077] In another embodiment the present invention provides a fusion polypeptide, wherein the polypeptide-scaffold comprises a polypeptide of the superfamily of periplasmic binding proteins of Interpro entry IPR025997, or a polypeptide of the superfamily of periplasmic binding protein-like I integral of Interpro entry IPR028082, or a periplasmic binding protein domain Pfam domain Peripla_BP_4 of PF13407.
[0078] In another embodiment the present invention provides a fusion polypeptide of Interpro superfamilies IPR025997 and IPR028082 or Pfam domain PF13407 wherein there is at least one amino acid substitution of the original database entries occurring in this scaffold.
[0079] In another embodiment the present invention provides a fusion polypeptide comprising additional fusion polypeptides that are integrated, either internally at any position or fused to the carboxy-terminal end of the polypeptide-scaffold.
[0080] In another embodiment the present invention provides a fusion polypeptide wherein the fused polypeptide belonging to the Interpro superfamilies IPR025997 and IPR028082 or containing a Pfam domain PF13407, are truncated before its native carboxy-terminal end.
[0081] In another embodiment, the present invention provides a fusion polypeptide, wherein said peptide linker and said polypeptide-scaffold comprises, preferably consist of, the amino acid sequence SEQ ID NO:11.
[0082] In another embodiment, the present invention provides a fusion polypeptide, wherein said fusion polypeptide comprises the amino acid sequence SEQ ID NO:11.
TABLE-US-00001 (SEQ ID NO: 11) PPLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGD GPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAY PIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFT WPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADT DYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFV GVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEE LAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEAL KDAQTPG.
[0083] In another embodiment, the present invention provides a fusion polypeptide, wherein said antigen-binding domain, peptide linker and said polypeptide-scaffold comprises the amino acid sequence SEQ ID NO:12:
[0084] In another embodiment, the present invention provides a fusion polypeptide, wherein said fusion polypeptide comprises the amino acid sequence SEQ ID NO:12.
TABLE-US-00002 (SEQ ID NO: 11) VTVPPLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAA TGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKL IAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMN ADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSK PFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSY EEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVD EALKDAQTPG.
[0085] In another embodiment the present invention provides the fusion polypeptide, wherein the antigen binding domain comprises an immunoglobulin-like fold.
[0086] The immunoglobulin (Ig)-like domain is a protein domain that is similar in amino acid sequence and structure to the Ig domains of immunoglobulins (Edelman, 1987; Williams & Barclay, 1988). Structurally, Ig domains possess a distinctive immunoglobulin fold composed of 70-110 amino acids.
[0087] In another embodiment the present invention provides the fusion polypeptide, wherein the antigen-binding domain comprises an immunoglobulin (Ig) domain.
[0088] Ig domains are 70-110 amino acids in length with a distinct overall structure referred to as the Ig fold. This fold consists of two β sheets each made up of short antiparallel β strands. In the majority of Ig domains, the two β sheets are joined by a disulphide linkage.
[0089] In another embodiment the present invention provides the fusion polypeptide, wherein the antigen-binding domain is a VHH antigen-binding domain.
[0090] In another embodiment the present invention provides the fusion polypeptide, wherein the antigen-binding domain comprises a camelid VHH or elasmobranch VHH, or shark VHH, or ray VHH or skate VHH or sawfish VHH or VHH domains from heavy or light chain of mammalian antibodies or monobodies.
[0091] In another embodiment the present invention provides a fusion polypeptide, wherein the VHH domain originates from a single-chain antibody that was isolated from a member of the family of camelidae (i.e. Llama spp., Vicugna spp., or Camelus spp.).
[0092] In another embodiment the present invention provides a fusion polypeptide, wherein the VHH-antigen binding domain is a VHH domain of a heavy chain or light chain of a classical vertebrate Ig-antibody.
[0093] In another embodiment the present invention provides a fusion polypeptide, wherein the VHH-antigen binding domain is replaced with a monobody (synthetic binders built on a fibronectin type III domain) or is an immunoglobulin (Ig).
[0094] In another embodiment the present invention provides a fusion polypeptide, wherein the VHH-antigen binding domain is selected from a synthetic library that was generated by recombinant technology, or PCR or presented in recombinant Saccharomyces cerevisiae or that is presented in recombinant phages.
[0095] In another aspect the invention provides an amino acid sequence encoding the fusion polypeptide of the present invention. In one embodiment the amino acid sequence comprises Sequence ID NO: 001 or Sequence ID NO: 002.
[0096] The amino acid sequence ID NO: 001:
TABLE-US-00003 GPSQVQLVESGGGLVQAGGSLRLSCAASGFPVKYEHMYWYRQAPGKEREWV AAINSAGNETHYADSVKGRFTISRDNAKNTVYLQMNSLKPEDTAVYYCNVK DIGWWAAYDYWGQGTQVTVPPLVIWINGDKGYNGLAEVGKKFEKDTGIKVT VEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQD KLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKE LKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAK AGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKV NYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLE AVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYA VRTAVINAASGRQTVDEALKDAQTPG
[0097] The amino acid sequence ID NO: 002:
TABLE-US-00004 GPSQVQLVESGGGSVQAGGSLRLSCAASGSISSITYLGWFRQAPGKEREGV AALATYYGHTYYADSVKGRFTVSLDNAKNTVYLQMNSLKPEDTALYYCAAA YSGIWTPLGVWATYEYWGQGTQVTVPPLVIWINGDKGYNGLAEVGKKFEKD TGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITP DKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEI PALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGV DNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSN IDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLL TDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQM SAFWYAVRTAVINAASGRQTVDEALKDAQTPG
[0098] In a further aspect the invention provides a chemically modified, covalently labeled fusion polypeptide of the present invention.
[0099] In another aspect the present invention provides a complex comprising: [0100] i) the fusion polypeptide of the present invention, and [0101] ii) a target protein, wherein said target protein is specifically bound to said fusion polypeptide.
[0102] In one embodiment the complex of the present invention, wherein said target protein is bound to the antigen binding domain of said fusion polypeptide.
[0103] In another embodiment the complex of the present invention, wherein the target protein has a molecular weight of less than 100 kDa. In another embodiment the complex of the present invention, wherein the target protein has a molecular weight of less than 90 kDa, or 80 kDa, or 70 kDa or 60 kDa or 50 kDa or 40 kDa or 30 kDa, or 10 kDa.
[0104] In another aspect the invention provides the use of the fusion polypeptide, the amino acid sequence and the complex of the present invention for structural analyses of a target protein.
[0105] In one embodiment said structural analysis comprises single particle cryo-electron microscopy (cryo-EM) or X-ray crystallography. In another embodiment said structural analysis comprises single particle cryo-EM or X-ray crystallography or negative staining TEM or electron diffraction or NMR or any other structure determination technology.
[0106] In another aspect the invention provides the use of the fusion polypeptide according to the present invention as a medicine.
[0107] In another aspect the invention provides the use of the fusion polypeptide according to the present invention for diagnostic purposes.
[0108] In another aspect the present invention provides a fusion polypeptide of the present invention comprising Interpro entries [0109] i) Periplasmic binding protein/LacI sugar binding domain (IPR001761) [0110] ii) Receptor, ligand binding region (IPR001828) [0111] iii) Leucine-binding protein domain (IPR028081) [0112] iv) Arabinose metabolism transcriptional repressor, ligand-binding domain
EXAMPLE I
Evaluation of a Chimeric Fusion Polypeptide (Macrobody) and Design of New Chimeric Fusion Polypeptides (Pro-Macrobody) for Increased Rigidity by Computer Simulations
[0113] Described is the workflow for analysis of motions in chimeric fusion polypeptides using computer simulations (all-atom molecular dynamics simulations) and the simulation of in-silico amino acid exchanges (mutants) in the linker-region. The chimeric fusion polypeptides with amino exchanges in the linker region were subjected to all atom-molecular dynamics simulations (MD simulations) and their motions were quantified. All MD simulations are fully atomistic (ie. Each atom is simulated and not groups of atoms/molecules) and were conducted with Desmond and the OPLS3 force field with explicit solvent at room temperature (300K, or 26.85° C.). For all simulations a prior equilibration of 300 ns was conducted prior to 500 ns of production.
[0114] Simulation Models
[0115] Four different VHH-MBP fusion polypeptide simulation models were built starting from two different sets of coordinates. The first set of coordinates was built from the PDB data base code 6HD8.pdb structure (macrobody, here VK structure), which was simulated with a Val121 and Lys122 linker (from now on VK-linker, numbering refers to the Protein Data Bank (PDB) entry 6HD8). The second set of coordinates was built from a new structure simulated (description below) of a VHH-MBP fusion polypeptide where the antigen is the protein LptD of Neisseria gonnorhoeae (Uniprot accession A0A1D3INQ1) (Pro-macrobody, here PP structure) and a Pro122 Pro123 linker (from now on PP-linker). The VK structure was simulated either in its original state or after changing its linker to a PP-linker. Similarly, the PP structure was simulated either in its original state or after changing its linker to the VK-linker.
[0116] The exact design of the chimeric VHH-MBP fusion polypeptide comprising a di-Proline linker with all important boundaries is depicted in
[0117] Results
[0118] MD simulations initiated from both VK and PP structures and with both VK and PP linkers showed the great impact of the linker residue type on the dynamics of the VHH-MBP fusion polypeptides. Initial results were obtained for the VK structure and suggested that the PP-linker could lead to a significantly more rigid chaperone. Subsequently, a VHH-MBP fusion polypeptide was generated in silico with a PP-linker to test this hypothesis and a new VHH-MBP fusion polypeptide was produced in E. coli where the VHH antigen binding domain is binding the said LptD protein. The X-rays structure for this designed Pro-macrobody was solved (Section III) and shown to be in agreement with the predicted structure from MD simulations (RMSD<2 Å). This set of coordinates (PP structure) was then seeded to run additional MD simulations, including for comparison a simulation model where the VK-linker was re-introduced. Clear agreement was obtained in both cases, showing that the linker is able to modify the dynamics irrespectively of the initial set of coordinates chosen to run the simulations.
[0119] In
[0120] In
[0121]
[0122] In
[0123] Evaluation of Results
[0124] The analysis revealed that the two fused polypeptides with a VK linker have a large conformational freedom and can rotate around the polypeptide backbone at the valine-lysine linker, even at timescales of 500 ns. Both, Val121 and Lysine 6 of MBP (Lys122 in the chimeric fusion polypeptide) were not or at least insufficiently stabilized by hydrogen bond interactions within beta-sheets to prevent rotations. In solution, other than in a protein crystal, the chimeric fusion polypeptide from the structure 6HD8 (
[0125] Proline is restricted in its rotation due to its unique linkage of the side chain to the backbone. It is an amino acid with exceptional conformational rigidity. Due to the linkage of the α-amino group directly to the main chain, the α-carbon becomes a direct substituent of the side-chain. L-Proline has therefore no freedom of rotation like all other natural amino acids. L-Proline would, from the standpoint of its rigidity, be a suitable amino acid to be tested. However, L-Proline in polypeptides is introducing kinks and its nitrogen-atom cannot contribute as a donor in hydrogen-bonds, only as an acceptor. This is leading to destabilization or termination of alpha-helical or beta-sheet secondary structures. The impact of L-Proline residue to the stability of the linkage between a VHH antigen-binding domain and the MBP domain was unclear, especially due to its influence on secondary structure elements that are essential in the said fusion-polypeptide but also due to the introduced kink(s) that would impact the desired elongated shape of the fusion-polypeptide.
[0126] When the linker sequence was changed from . . . VTVVKLI . . . as shown in
[0127] To our surprise, the modification with a di-proline linker resulted in a major conformational and structural change compared to the structure of the fusion polypeptide in the structure 6HD8. Most remarkably, this new protein is very stable with respect to relative motions between the VHH antigen binding domain and the MBP. This is apparent when comparing the trajectories in molecular dynamics simulations of the original chimeric fusion polypeptide with a VK linker and from the new fusion polypeptide with a di-Proline (PP) linker (
EXAMPLE II
Structure Determination of a Chimeric Fusion Polypeptide with a Di-Proline Linker Between the Fused Moieties by X-Ray Crystallography
[0128] Two confirm the predicted structure, the new fusion polypeptide (
EXAMPLE III
The Chimeric Fusion Polypeptide with a Di-Proline Linker Retains the Antigen Binding Properties of the Coupled VHH Antigen Binding Domain
[0129] Binding of the VHH antigen binding domain to the respective antigen was measured with waveguide interferometry, a biophysical method to investigate the direct ligand binding to an immobilized molecular target protein enabling evaluation of affinity, stoichiometry and kinetics of binding. The binding kinetics of the unfused VHH antigen binding domains was measured towards the antigen which was coupled to the sensor chip by a biotin-neutravidin interaction according to manufacturer's instructions (Creoptix, Wädenswil, Switzerland). The binding kinetics of the corresponding VHH antigen binding domain in a chimeric fusion protein with MBP and connected by a di-proline linker were also measured and compared (
EXAMPLE FOR THE APPLICATION OF PRO-MACROBODIES
EXAMPLE IV
A Complex of a Small Membrane Protein and the Chimeric Fusion Polypeptide With a Di-Proline Linker was Purified and Analyzed By Single Particle Cryo-EM
[0130] A complex of a chimeric fusion polypeptide with the di proline linker and the corresponding antigen (a bacterial transporter for lipopolysaccharides (LPS)) was prepared. The bacterial LPS transporter-complex LptDE of Neisseria gonnorhoeae (consisting of the proteins LptD (Uniprot) and LptE (Uniprot) was selected as example (NgLptDE). The protein is a relatively small membrane protein target for cryo-EM with only 110 kDa, asymmetric and built almost exclusively of β-strands. These properties make NgLptDE a very suitable example for examining positive impact of Pro-macrobodies regarding structure determination. A low resolution structure that was acquired beforehand without Pro-Macrobodies served as a control for direct comparison. VHH antigen binding domains binding NgLptDE were raised and already characterized beforehand. Two specific VHH antigen binding domains were chosen and transformed to Pro-macrobodies. These are clones L21 and clone L51.
[0131] The complex of NgLptDE and the Pro-Macrobodies was purified—after mixing the two components—by size-exclusion chromatography (
[0132] The invention enables higher resolution EM maps for target proteins. A difference of 1.2 Angstrom could be observed for the resolution of density maps of the bacterial transporter in the absence or presence of the specific chimeric fusion polypeptide (i.e. uncomplexed compared to complexed transporter with two Pro-Macrobodies). Applying the gold standard fourier shell correlation (GSFSC), the standard measure to obtain the global resolution of a given density map from electron microscopic volume reconstructions (at a FSC cutoff of 0.143) is 4.6 Å for the uncomplexed sample and 3.4 Å for the complex of the transporter with the chimeric fusion polypeptide (
[0133] The fusion protein with the di-proline linker as described in examples I-IV is thus a unique design because one domain (the VHH domain) can be exchanged without sacrificing the rigidity of the fusion protein. It can be assumed that a single polypeptide linker is always flexible, unless the two connected domains show specific interactions at the interface, such that attempts to stabilize would fail. Using a fusion protein with the di-proline linker we generated rigidly connected two non-interacting proteins and the double proline linker identified ideally serves the purpose of (i) a rigid connection, and (ii) keeping the VHH domain freely exchangeable.
[0134] It is important to note that the rigidity of the chaperone becomes increasingly relevant with a decrease of the protein target's size demanding a highly rigid chaperone. The chimeric fusion polypeptide with a di-proline linker described here represents a significant technical improvement of molecular chaperones for structural biology.
[0135] The full amino-acid sequence of the chimeric fusion polypeptide of a VHH antigen binding domain (clone L21) and MBP with the di proline linker (Pro-Macrobody 21) is SEQ ID 001:
TABLE-US-00005 GPSQVQLVESGGGLVQAGGSLRLSCAASGFPVKYEHMYWYRQAPGKEREWV AAINSAGNETHYADSVKGRFTISRDNAKNTVYLQMNSLKPEDTAVYYCNVK DIGWWAAYDYWGQGTQVTVPPLVIWINGDKGYNGLAEVGKKFEKDTGIKVT VEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQD KLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKE LKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAK AGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKV NYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLE AVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYA VRTAVINAASGRQTVDEALKDAQTPG
[0136] The full amino-acid sequence of the chimeric fusion polypeptide of a VHH antigen binding domain (clone L51) and MBP with the di proline linker (Pro-Macrobody 51) is SEQ ID 002:
TABLE-US-00006 GPSQVQLVESGGGSVQAGGSLRLSCAASGSISSITYLGWFRQAPGKEREGV AALATYYGHTYYADSVKGRFTVSLDNAKNTVYLQMNSLKPEDTALYYCAAA YSGIWTPLGVWATYEYWGQGTQVTVPPLVIWINGDKGYNGLAEVGKKFEKD TGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITP DKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEI PALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGV DNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSN IDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLL TDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQM SAFWYAVRTAVINAASGRQTVDEALKDAQTPG
[0137] The VHH domain (antigen-binding) is shown in italics and can be exchanged for any other VHH because of the conserved structure. The linker is shown in bold and underlined. The remaining sequence is the enlarging domain (malE from E. coli). The VHH domain shown here is the antigen-binding domain clone 21 that was crystallized and used for electron microscopy.
[0138] In a final experiment the Pro-Macrobodies were compared with macrobodies (comprising a VK-linker as in the PDB entry 6HD8). For this NgLptDE was complexed with macrobodies 21 and 51 (identical VHH domains as in the respective Pro-Macrobodies), purified by chromatography in the same manner and subjected to analysis by cryo-EM. It became apparent that macrobodies could not improve the resolution significantly in comparison to uncomplexed NgLptDE as shown in