MULTIMERIC PROTEIN COMPLEXES AS ANTIBODY SUBSTITUTES FOR NEUTRALIZATION OF VIRAL PATHOGENS IN PROPHYLACTIC AND THERAPEUTIC APPLICATIONS
20230382977 · 2023-11-30
Inventors
- Daniel L. COX (San Francisco, CA, US)
- Richard L. DAVIS (Penn Valley, PA, US)
- Michael D. TONEY (Davis, CA, US)
Cpc classification
C07K2317/76
CHEMISTRY; METALLURGY
C07K2318/20
CHEMISTRY; METALLURGY
C07K2319/31
CHEMISTRY; METALLURGY
C07K16/1003
CHEMISTRY; METALLURGY
C07K2317/92
CHEMISTRY; METALLURGY
International classification
Abstract
The present patent consists of an engineered multimeric protein complex as antibody substitute composed of human proteins, with an m-fold symmetry, with each m-fold element containing a modified monomeric protein derived from a symmetric human multimeric protein complex fused to a module containing n fused, modified human beta solenoid proteins (mBSP), and that fused to a human derived pathogen binding domain (PBD), as well as a separate antibody substitute composed of P human PBD complexes. The invention may find application in prophylactic and therapeutic treatments for viral infections, especially for COVID19 by neutralizing the SARS-CoV-2 virus.
Claims
1. A multimeric protein complex as antibody substitute complex comprising a plurality (e.g., m≥2) of monomeric proteins with modular protein domains of the form (α-β.sub.n-γ.sub.p).sub.m or (γ.sub.p-β.sub.n-α).sub.m wherein the monomeric proteins α-β.sub.n-γ.sub.p or γ.sub.p-β.sub.n-α comprise fused protein domains wherein: α is a monomeric protein from a symmetric human multimeric protein complex of point group symmetry C.sub.m or D.sub.m, β is a fused domain of n modified beta solenoid proteins (mBSPs) with n≥0, and γ.sub.p is a complex of p pathogen binding domains (PBDs) either fused or bound to each other by intermolecular forces and wherein p≥1.
2. The multimeric protein complex as antibody substitute of claim 1, wherein the multimeric protein complex as antibody substitute is symmetrical.
3. The multimeric protein complex as antibody substitute of claim 2, wherein the multimeric protein complex as antibody substitute has two-fold symmetry, three-fold symmetry, four-fold symmetry, five-fold symmetry, six-fold symmetry, or twelve-fold symmetry.
4-8. (canceled)
9. The multimeric protein complex as antibody substitute of claim 1, wherein the modular protein domain a is a monomeric protein from a wild type symmetric human multimeric protein complex α.sub.m.
10. (canceled)
11. The multimeric protein complex as antibody substitute of claim 1, wherein α is a monomeric protein from: (a) the m=3 human collagen trimerization domain which is at least 90%, 95%, 98%, or 99% identical to SEQ ID NO: 9. or (b) the m=3 human growth factor EDA-Al which is at least 90%, 95%, 98%, or 99% identical to SEQ ID NO: 10; or (c) the m=3 human Langerin which is at least 90%, 95%, 98%, or 99% identical to SEQ ID NO: 11; or (d) the m=4 human tetrameric diubiquitin which is at least 90%, 95%, 98%, or 99% identical to SEQ ID NO: 12.
12-14. (canceled)
15. The multimeric protein complex as antibody substitute of claim 1, where n=0, p=1 and the protein binding domain (PBD) γ is at least 90%, 95%, 98% or 99% identical to the N-terminus domain (residues 19-85 or residues 19-91) of the human ACE2 receptor protein of SEQ ID NO: 6 or SEQ ID NO: 7.
16. The multimeric protein complex as antibody substitute of claims 1, wherein the monomeric protein sequence is at least 90%, 95%, 98% or 99% identical to SEQ ID NO:1, 2 or 3.
17. (canceled)
18. (canceled)
19. The multimeric protein complex as antibody substitute of claim 1, wherein the modified human beta solenoid (mBSP) is: (a) at least 80%, 90%, 95%, 98%, or 99% identical to the dynactin p27 domain (3VTO) of SEQ ID NO: 8. or (b) modified to be at least 80%, 90%, 95%, 98% or 99% identical to the human Retinitis Pigmentosa Protein 2 (RP2) (2BX6).
20. The multimeric protein complex as antibody substitute of claim 1, wherein the monomeric sequence is at least 90%, 95%, 98% or 99% identical to SEQ ID NO: 4 and n=1 and p=1, or to SEQ ID NO: 5, and n=4 and p=1.
21. (canceled)
22. The multimeric protein complex as antibody substitute of claim 1, wherein the sequence is of the form y2 and y is at least 90%, 95%, 98% or 99% identical to SEQ ID NO: 6 or SEQ ID NO: 7.
23. (canceled)
24. The multimeric protein complex as antibody substitute of claim 1, wherein the pathogen binding domain is at least 90%, 95%, 98% or 99% identical to the helix-turn-helix (HTH) complex from the N-terminus of the ACE2 receptor protein of SEQ ID NO: 6 or SEQ ID NO: 7.
25. The multimeric protein complex as antibody substitute of claim 1, wherein at least one (e.g., 1, 2, 3, 4, 5, or more) amino acid of the a or modified human beta solenoid domain is modified to allow attachment to a nanoparticle, a solid support, or other biological molecule.
26. The multimeric protein complex as antibody substitute of claim 1, wherein the multimeric protein complex is attached to a nanoparticle, a solid support, or other biological molecule.
27. The multimeric protein complex as antibody substitute of claim 1, wherein the multimeric protein complex is attached to human serum albumin.
28. The pathogen binding domain of the multimeric protein complex as antibody substitute of claim 1, wherein at least one (e.g., 1, 2, 3, 4, or more) amino acid of one or more of the module domains is modified to allow attachment to a nanoparticle, a solid support, or other biological molecule.
29. A multimeric protein complex as antibody substitute comprising a plurality of pathogen binding domains (e.g., 2, 3, 4, or more).
30. The multimeric protein complex as antibody substitute of claim 29, wherein the pathogen binding domain is modified to be at least 90%, 95%, 98% or 99% identical to the HTH domain (residues 19-85 or 19-91) of the N-terminus of the ACE2 receptor protein of SEQ ID NO: 6 or SEQ ID NO: 7.
31. The multimeric protein complex as antibody substitute of claim 30, wherein at least one (e.g., 1, 2, 3, 4, or more) amino acid of either or all pathogen binding domains are modified to allow attachment to a nanoparticle, a solid support, or other biological molecule.
32. A method for neutralizing a pathogen comprising contacting said pathogen with the multimeric protein complex as antibody substitute of claim 1, wherein one or more pathogen binding domains binds to one or more sites on the pathogen.
33. A method for immobilizing a pathogen comprising contacting said pathogen with the multimeric protein complex as antibody substitute of claim 1, wherein one or more pathogen binding domains binds to one or more sites on the pathogen.
34. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0068]
[0069] In this case the multimeric protein complex is a trimer (m=3) comprised of a monomeric protein from a symmetry human trimer such as the human collagen trimerization domain (PDB ID 3N3F), fused to a modified beta solenoid (mBSP) domain in which n copies of a single mBSP such as the p27 domain from dynactin (PDB ID 3VT0) are fused together, and p copies of a pathogen binding domain (PBD) such as residues 19-91 of the human ACE2 receptor protein. If the sequence begins with the α-monomer, it will have the form (α-β.sub.n-γ.sub.p).sub.m with α-β.sub.n-γ.sub.p the fused (by peptide bond) monomeric protein of the α, β.sub.n, and γ.sub.p domains. If the sequence begins with the PBD, it will have the form (γ.sub.p-β.sub.n-α).sub.m. The multimeric value m is chosen to match the symmetry of the corresponding viral envelope protein (VEP).sub.m where, e.g., VEP can be a monomeric protein of the Spike trimer from the SARS-CoV-2 virus.
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
DETAILED DESCRIPTION OF THE INVENTION
[0077] The inventors have discovered that engineered protein trimers and dimers composed of linked modular sequences from human proteins can neutralize viral pathogens in a prophylactic or therapeutic context to form a multimeric protein complex as antibody substitute. By fusing a purely human multimeric protein complex comprised of m monomeric proteins α.sub.m to mBSPs and PBDs to match the m-fold symmetry and geometry of the viral envelope protein, it is possible to neutralize the multimeric VEP protein with a multimeric protein as an antibody substitute properly attuned to the VEP symmetry. In contrast monoclonal antibodies which at best exhibit dimeric (m=2) protein binding, but without a geometry tuned to the VEP.sub.m geometry they typically exhibit monomeric protein binding. The multimeric protein binding increases the net binding affinity to the VEP.sub.m complex. The inventors have previously demonstrated, for example, that a similar binding domain to that of the PBD from the ACE2 of SEQ ID NOS:1-6 attached to mBSP polymers can successfully bind vascular endothelial growth factor (VEGF) protein. The symmetric human multimeric protein complexes expressible in E. coli can be, for example, the human collagen trimerization domain (PDB code 3N3F) or other multimeric human protein complexes including but not limited to trimeric EDA-1 (PDB code 1RJ7), trimeric Langerin (3KQG), and tetrameric diubiquitin (2XEW), as detailed in SEQ ID NO: 9, 10, 11, and 12.
[0078] The inventors have discovered that in the case of the PBD in the embodiment of the N-terminus sequence (see, e.g., SEQ ID NO: 6 and SEQ ID NO: 7) from the ACE2 protein that the binding is effective and essentially unchanged from the ACE2 dimer itself (
[0079] The inventors have discovered through extensive simulations using the YASARA molecular dynamics program.sup.63 that mutations associated with extensive variants of the SARS-CoV-2 virus do not significantly alter the binding of the PBD from residues 19-91 of the ACE2 to the RBD of the SARS-CoV-2 spike protein (
[0080] The inventors have discovered that the multimeric protein complex as antibody substitute designs of SEQ ID NO: 2 and SEQ ID NO: 3 are expressible in P. pastoris with the trimers secreted from the cells and needing no post-translation modification, confirming the advantage of the invention over monoclonal antibodies. (
[0081] The inventors have discovered that by inserting a human β.sub.n=mBSP.sub.n domain consisting of n fused copies of a single mBSP domain between a monomeric proteins and the γ=PBD element of the multimeric protein complex as antibody substitute that they can adapt the size of the multimeric protein complex as antibody substitute to match the size of the VEPm to bind to more than one VEP element at once (
[0082] The inventors have discovered that in the case of the N-terminus ACE2 HTH example of SEQ ID NO: 6 or SEQ ID NO: 7 for a PBD, that there is a tendency to dimerize when detached from the ACE2 protein (
[0083] By making use of purely or nearly (see the following paragraph) human proteins in each of the modular domains of (α-β.sub.n-γ.sub.p) or (γ.sub.p-β.sub.n-α) for the multimeric protein complex as antibody substitute, the invention avoids immunogenic response of the human host in prophylactic or therapeutic applications.
[0084] Immunogenic tolerance of the host to these multimeric protein complexes as antibody substitutes can be maintained by modifying up to 5 residues of the PBD to either (a) increase the innate binding strength to the VEP complex, or (b) to inhibit the dimerization of a PBD such as the HTH PBD construct from the ACE2 protein. For example, the substitutions of lysines or arginines at positions 62,69 of the HTH PBD in SEQ ID NO: 2 or 3 helps to significantly diminish the hydrophobic dimerization tendency.
[0085] The probability for attachment of N-linked glycans to the multimeric proteins as antibody substitutes can be substantially reduced by modifying NIT sequences to QIT, as for example in SEQ ID NO:2 or 3.
[0086] Exemplary multimeric protein complex as antibody substitute sequences of the form (α-β.sub.n-γ.sub.p).sub.m or (γ.sub.p-β.sub.n-α).sub.m include but are not limited to polypeptides comprising an amino acid sequence at least 90%, 95%, 98%, 99% or 100% identical any one of SEQ ID NOS: 1, 2, 3, 4, 5, 6, or 7 as provided below (numbers at right are amino acid count, beginning with N-terminal).
TABLE-US-00001 ((γ-α).sub.m, m = 3, n = 0, p = 1, α is a monomeric protein of the human collagen trimerization domain 3N3F, γ is residues 19-85 of the human ACE2 receptor with the mutations S19G, T20C, P84A, L85C) This is the monomeric sequence γ-α. SEQ ID NO: 1 GCIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQST (60) LAQMYACGGGNLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDGWKKLQLGELI (120) PIPA (124) ((γ-α).sub.m, m = 3, n = 0, p = 1, α is a monomeric protein of the human collagen trimerization domain 3N3F, γ is residues 19-91 of the human ACE2 receptor with the mutations N52Q, M62K, M69K) This is the monomeric sequence γ-α. SEQ ID NO: 2 STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTQITEENVQNKNNAGDKKSAFLKEQST (60) LAQMYPLQEIQNLGGGNLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDGWKKL (120) QLGELIPIPA (130) ((γ-α).sub.m, m = 3, n = 0, p = 1, α is a monomeric protein of the human collagen trimerization domain 3N3F, γ is residues 19-91 of the human ACE2 receptor with the mutations N52Q, M62R, M69R) This is the monomeric sequence γ-α. SEQ ID NO: 3 STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTQITEENVQNRNNAGDKRSAFLKEQST (60) LAQMYPLQEIQNLGGGNLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDGWKKL. (120) QLGELIPIPA (130) ((α-β-γ).sub.3, m = 3, n = 1, m = 1, α is a monomeric protein of the 1RJ7 human trimer, β = mBSP is from the 3VT0, γ is residues 19-83 from the ACE2 with the mutations S19G. This is the monomeric sequence α-β-γ. SEQ ID NO: 4 QPAVVHLQGQGSAIQVKNDLSGGVLNDWSRITMNPKVFKLHPRSGELEVLVDGTYFIYSQ (60) VEVYYINFTDFASYEVVVDEKPFLQCTRSIETGKTNYNTCYTAGVCLLKARQKIAVKMVH (120) ADISINMSKHTTFFGAIRLGEAPAPGAVVCVESEIRGDVTIGPRTVIHPKARIIAEAGPI (180) VIGEGNLIEEQALIINAYPDNIKPMIIGTNNVFEVGCYSQAMKMGDNNVIESKAYVGRNV (240) ILTSGCIIGACCNLNTFEVIPSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEE (300) NVQNMNNAGDKWSAFLKEQSTLAQMYG (327) ((γ-β.sub.4-α).sub.3, m = 3, n = 4, m = 1, α is a monomeric protein of the 1RJ7 human trimer, β = mBSP is the 3VT0, γ from residues 19-83 of the ACE2. This is the monomeric sequence γ-β.sub.4-α. SEQ ID NO: 5 STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQST (60) LAQMYGGYPDNIKPMIIGTNNVFEVGCYSQAMKMGDNNVIESKAYVGRNVILTSGCIIGA (120) CCNLNTFEVIPENRTVIHPKARIIAEAGPIVIGEGNLIEEQALIINAYPDNIKPMIIGTN (180) NVFEVGCYSQAMKMGDNNVIESKAYVGRNVILTSGCIIGACCNLNTFEVIPENRTVIHPK (240) ARIIAEAGPIVIGEGNLIEEQALIINAYPDNIKPMIIGTNNVFEVGCYSQAMKMGDNNVI (300) ESKAYVGRNVILTSGCIIGACCNLNTFEVIPENRTVIHPKARIIAEAGPIVIGEGNLIEE (360) QALIINAYPDNIKPMIIGTNNVFEVGCYSQAMKMGDNNVIESKAYVGRNVILTSGCIIGA (420) CCNLNTFEVIPENRTVIHPKARIIAEAGPIVIGEGNLIEEQALIINAYPDNQPAVVHLQG (480) QGSAIQVKNDLSGGVLNDWSRITMNPKVFKLHPRSGELEVLVDGTYFIYSQVEVYYINFT (540) DFASYEVVVDEKPFLQCTRSIETGKTNYNTCYTAGVCLLKARQKIAVKMVHADISINMSK (600) HTTFFGAIRLGEAP (614) EXEMPLARY PBD SEQUENCES (One PBD in γ.sub.p, p = 2, from ACE2 N-terminus HTH, residues 19-83 of ACE2) SEQ ID NO: 6 STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQST (60) LAQMYP (66) (One PBD in γ.sub.p, p = 1, from ACE2 N-terminus HTH, residues 19-91 of ACE2) SEQ ID NO: 7 STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQST (60) LAQMYPLQEIQNLG (74) EXEMPLARY mBSP SEQUENCES (mBSP from dynactin p27 BSP, 3VT0) SEQ ID NO: 8 GAVVCVESEIRGDVTIGPRTVIHPKARIIAEAGPIVIGEGNLIEEQALIINAYPDNIKPM (60) IIGTNNVFEVGCYSQAMKMGDNNVIESKAYVGRNVILTSGCIIGACCNLNTFEVIPEP (118)
EXEMPLARY SYMMETRIC MULTIMERIC PROTEIN SEQUENCES
[0087]
TABLE-US-00002 (α monomeric protein from om human collagen trimerization domain, m = 3, 3N3F) SEQ ID NO: 9 NLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDGWKKLQLGELIPIPA (54) (α monomeric protein from α.sub.m human growth factor EDA-A1, m = 3, 1RJ7) SEQ ID NO: 10 GSHMGPSGAADKAGTRENQPAVVHLQGQGSAIQVKNDLSGGVLNDWSRITMNPKVFKLHP (60) RSGELEVLVDGTYFIYSQVEVYYINFTDFASYEVVVDEKPFLQCTRSIETGKTNYNTCYT. (120) AGVCLLKARQKIAVKMVHADISINMSKHTTFFGAIRLGEAPAS (163) (α monomeric protein from α.sub.m human langerin, m = 3, 3KQG) SEQ ID NO: 11 ASTLNAQIPELKSDLEKASALNTKIRALQGSLENMSKLLKRQNDILQVVSQGWKYFKGNF (60) YYFSLIPKTWYSAEQFCVSRNSHLTSVTSESEQEFLYKTAGGLIYWIGLTKAGMEGDWSW (120) VDDTPFNKVQSARFWIPGEPNNAGNNEHCGNIKAPSLQAWNDAPCDKTFLFICKRPYVPS (180) EP (182) (α monomeric protein from α.sub.m human tetrameric diubiquitin, m = 4, 2XEW) SEQ ID NO: 12 MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN (60) IQKESTLHLVLRLRGG (76)
EXAMPLES
Example 1
[0088] We have developed antibody replacements for viral neutralization from modular domain designs of proteins into multimeric proteins as antibody substitutes, extending work the inventors have previously demonstrated skill in developing for basic science.sup.64-70 and for applications.sup.53. The invention is to identify human symmetric multimeric protein complexes (α.sub.m) such as the human collagen trimerization domain (SEQ ID NO: 9 and PDB code 3N3F, M=3), with an example shown in
[0089] As example 1, we consider the multimeric protein complex as antibody substitute design of SEQ ID 1 and
where m is the number of binders (identical to the multimeric number m in this example) and ΔG.sub.B is the (negative) binding free energy associated with a single binding event, the dissociation constant of the trimeric antibody substitute of SEQ ID NOS: 1-4 is likely to be in the femtomolar regime when all three RBDs are simultaneously bound by a single trimer. This avidity concept is borne out by considerable theoretical evidence and argument.sup.71-75 and by recent evidence of engineered nanobody binding to the down domain of the spike protein where trimers have likely femtomolar affinity.sup.76.
ΔΔG.sub.Mut−DMS=0.03437*(ΔΔG.sub.Mut−HawkDock−3.8)
from which we can predict, with the theoretical ΔΔG.sub.Mut−HawkDock, the dissociation constants relative to wildtype (WT) as
where T is taken at room temperature and R=8.29 J/mole-K is the ideal gas constant. This is the third column of
[0090] By working with human constructs for the multimeric proteins as antibody substitutes containing a minor number (≤4-5) of point mutations, it is anticipated that our modular protein designs will be resistant to immunogenic response/rejection, and proteolytic degradation inside the body. Moreover, by using common domains from ubiquitous human proteins, the potential for autoimmune response is avoided.
[0091] By choosing protein domains for our multimeric proteins as antibody substitutes which have been expressed from well proven, scalable, industrialized prokaryotic or single celled eukaryotic processes, we save the expense and unpredictability of monoclonal antibody production.
[0092] The molecular weight of each monomeric protein in our multimeric protein complex as antibody substitute designs is relatively small compared to antibodies. The design of SEQ ID 1 and
Example 2
[0093] In example 2, we consider the somewhat larger construct of
[0094] This design can access the RBD domain even when it is flipped out away from the spike complex, and thus is amenable to a wider range of binding conformations than the smaller design of
Example 3
[0095] The much larger multimeric protein complex as antibody substitute design of
Example 4
[0096] The HTH complex itself, in dimer form, can be a potent multimeric protein complex as antibody substitute neutralizing agent. The dimer construct of
[0097] This multimeric protein complex as antibody substitute complex can be mutated by up to 4 amino acids to achieve higher affinity binding with the RBD without inducing immunogenic response, and such mutations can enhance the affinity per HTH to sub-nanomolar KD values.sup.78.
[0098] If necessary, particularly in attachment to the complexes of EXAMPLES 1-3, we can modify the hydrophobic “underside” of the HTH complex by 1-4 residues to prevent self-association.
Example 5
[0099] By fusing a specific human serum albumin binding sequence to the side of the trimers in the multimeric protein complex as antibody substitutes discussed in EXAMPLES 1-3 opposite the binding face to the spike proteins, we can attach to albumin in the blood. For example, the SA2 peptide invented by Genentech.sup.79-82, has specific binding to serum albumin. This provides steric hindrance to viral binding in addition to the explicit blocking of the spike proteins and engenders enhanced lifetimes in vivo.
Example 6
[0100] The multimeric protein complex as antibody substitute inventions herein, while using specific examples of binding to the SARS-CoV-2 virus, are not solely restricted to this application. For example, the general multimeric protein complex as antibody substitute schema (α-β.sub.n-γ.sub.p).sub.m or (γ.sub.p-β.sub.n-α).sub.m can be extended to develop neutralization agents for the trimeric haemagglutinin VEPs on the surface of influenza virions, the tetrameric neuraminidase complexes on influenza virions, or the trimeric gp120 VEPs on the surface of HIV virions.
[0101] Additionally, the general (α-β.sub.n-γ.sub.p).sub.m or (γ.sub.p-β.sub.n-α).sub.m scheme can be extended to binding to multimeric fusion complexes on microorganisms or tumor cells.
REFERENCES
[0102] 1. M.-Y. Chou, C.-Y. Fan, C.-C. Huang and H.-C. Li, U.S. Pat. No. 10,183,986B2 (2019). [0103] 2. P. Bruenker, C. F. Koller, S. Grau-Richards, E. Moessner and P. Umana, U.S. Pat. No. 9,975,958B2 (2018). [0104] 3. V. J. Sanchez-Arevalo Lobo, A. M. Cuesta, L. Sanz, M. Compte, P. Garcia, J. Prieto, F. J. Blanco and L. Alvarez-Vallina, International Journal of Cancer 119 (2), 455-462 (2006). [0105] 4. A. M. Cuesta, D. Sanchez-Martin, A. Blanco-Toribio, M. Villate, K. Enciso-Alvarez, A. Alvarez-Cienfuegos, N. Sainz-Pastor, L. Sanz, F. J. Blanco and L. Alvarez-Vallina, Mabs 4 (2), 226-232 (2012). [0106] 5. A. M. Cuesta, N. Sainz-Pastor, J. Bonet, B. Oliva and L. Alvarez-Vallina, Trends in Biotechnology 28 (7), 355-362 (2010). [0107] 6. C. B. Shoemaker and H. Feng, (2015). [0108] 7. J. A. Wirz, S. P. Boudko, T. F. Lerch, M. S. Chapman and H. P. Baechinger, Matrix Biology 30 (1), 9-15 (2011). [0109] 8. S. G. Hymowitz, D. M. Compaan, M. H. Yan, H. J. A. Wallweber, V. M. Dixit, M. A. Starovasnik and A. M. de Vos, Structure 11 (12), 1513-1520 (2003). [0110] 9. S. G. Hymowitz, D. M. Compaan, M. Yan, H. Ackerly, V. M. Dixit, M. A. Starovasnik and A. M. De Vos, Worldwide Protein Data Bank (2003).
[0111] 10. H. Feinberg, A. S. Powlesland, M. E. Taylor and W. I. Weis, Journal of Biological Chemistry 285 (17), 13285-13293 (2010). [0112] 11. H. Feinberg, A. S. Powlesland, M. E. Taylor and W. I. Weis, Worldwide Protein Data Bank (2010). [0113] 12. A. Bremm, S. M. V. Freund and D. Komander, Worldwide Protein Data Bank (2010). [0114] 13. A. Bremm, S. M. V. Freund and D. Komander, Nature Structural & Molecular Biology 17 (8), 939-U947 (2010). [0115] 14. Z. S. Derewenda, U. Derewenda and A. Kowalska, Worldwide Protein Data Bank (2012). [0116] 15. K. Kuhnel, S. Veltel, I. Schlichting and A. Wittinghofer, Structure 14 (2), 367-378 (2006). [0117] 16. K. Kuhnel, S. Veltel, I. Schlichting and A. Wittinghofer, Worldwide Protein Data Bank (2006). [0118] 17. J. Lan, J. Ge, J. Yu, S. Shan, H. Zhou, S. Fan, Q. Zhang, X. Shi, Q. Wang, L. Zhang and X. Wang, Nature (2020). [0119] 18. X. Wang, J. Lan, J. Ge, J. Yu and S. Shan, Worldwide Protein Data Bank (2020). [0120] 19. L. A. Graham, W. S. Qin, S. C. Lougheed, P. L. Davies and V. K. Walker, Journal of Molecular Evolution 64 (4), 387-398 (2007). [0121] 20. S. P. Graether and B. D. Sykes, European Journal of Biochemistry 271 (16), 3285-3296 (2004). [0122] 21. E. K. Leinala, P. L. Davies and Z. C. Jia, Structure 10 (5), 619-627 (2002). [0123] 22. Y. C. Liou, A. Tocilj, P. L. Davies and Z. C. Jia, Nature 406 (6793), 322-324 (2000). [0124] 23. Y. F. Mok, F. H. Lin, L. A. Graham, Y. Celik, I. Braslaysky and P. L. Davies, Biochemistry 49 (11), 2593-2603 (2010). [0125] 24. B. L. Pentelute, Z. P. Gates, V. Tereshko, J. L. Dashnau, J. M. Vanderkooi, A. A. Kossiakoff and S. B. H. Kent, J. Am. Chem. Soc. 130 (30), 9695-9701 (2008). [0126] 25. K. J. Lauersen, A. Brown, A. Middleton, P. L. Davies and V. K. Walker, Cryobiology 62 (3), 194-201 (2011). [0127] 26. A. J. Middleton, A. M. Brown, P. L. Davies and V. K. Walker, Febs Letters 583 (4), 815-819 (2009). [0128] 27. S. L. Roderick, Worldwide Protein Data Bank (1995). [0129] 28. K. Brown, F. Pompeo, S. Dixon, D. Mengin-Lecreulx, C. Cambillau and Y. Bourne, Worldwide Protein Data Bank (2000). [0130] 29. D. Kostrewa, A. D'Arcy and M. Kamber, Worldwide Protein Data Bank (2001). [0131] 30. L. R. Olsen and S. L. Roderick, Worldwide Protein Data Bank (2001). [0132] 31. B. I. Lee and S. W. Suh, Worldwide Protein Data Bank (2004). [0133] 32. V. E. Pye, A. P. Tingey, R. L. Robson and P. C. E. Moody, Worldwide Protein Data Bank (2004). [0134] 33. C. Kisker, H. Schindelin and D. C. Rees, Worldwide Protein Data Bank (1996). [0135] 34. T. W. Beaman, K. W. Vogel, D. G. Drueckhammer, J. S. Blanchard and S. L. Roderick, Worldwide Protein Data Bank (2002). [0136] 35. L. E. Kehoe, J. Snidwongse, P. Courvalin, J. B. Rafferty and I. A. Murray, Worldwide Protein Data Bank (2003). [0137] 36. L. R. Olsen, B. Huang, M. W. Vetting and S. L. Roderick, Worldwide Protein Data Bank (2004). [0138] 37. H. J. Lee, B. Rakic, M. Gilbert, W. W. Wakarchuk, S. G. Withers and N. C. J. Strynadka, Worldwide Protein Data Bank (2009). [0139] 38. S. M. Anderson, Z. Wawrzak, 0. Onopriyenko, S. N. Peterson, W. F. Anderson, A. Savchenko and D. Center for Structural Genomics of Infectious, Worldwide Protein Data Bank (2011). [0140] 39. X. G. Wang, L. R. Olsen and S. L. Roderick, Worldwide Protein Data Bank (2002). [0141] 40. C. M. Baffling and C. R. H. Raetz, Worldwide Protein Data Bank (2009). [0142] 41. S. Kumar and S. Gourinath, Worldwide Protein Data Bank (2011). [0143] 42. G. Kozlov, K. Gehring and I. Montreal-Kingston Bacterial Structural Genomics, Worldwide Protein Data Bank (2008). [0144] 43. Y. Kim, N. Maltseva, L. Papazisi, W. Anderson and A. Joachimiak, Worldwide Protein Data Bank (2009). [0145] 44. A. Dickmanns, S. Damerow, P. Neumann, E. C. Schulz, A. Lamerz, F. Routier and R. Ficner, Worldwide Protein Data Bank (2010). [0146] 45. M. J. Morrison and B. Imperiali, Worldwide Protein Data Bank (2013). [0147] 46. A. Masoudi, C. R. H. Raetz and C. W. Pemble, Worldwide Protein Data Bank (2013). [0148] 47. J. Benach, Chen, Y., Vorobiev, S. M., Seetharaman, J., Ho, C. K., Janjua, H., Conover, K., Ma, L-C., Xiao, R., et al. , (2014). [0149] 48. J. Benach, Y. Chen, S. M. Vorobiev, J. Seetharaman, C. K. Ho, H. Janjua, K. Conover, L. C. Ma, R. Xiao, T. B. Acton, G. T. Montelione, J. F. Hunt, L. Tong and C. Northeast Structural Genomics, Worldwide Protein Data Bank (2007). [0150] 49. S. S. Hegde, M. W. Vetting, S. L. Roderick, L. A. Mitchenall, A. Maxwell, H. E. Takiff and J. S. Blanchard, Worldwide Protein Data Bank (2005). [0151] 50. M. A. Kennedy, S. Ni, G. W. Buchko and H. Robinson, Worldwide Protein Data Bank (2006). [0152] 51. M. A. Kennedy, S. Ni and G. M. Sheldrick, Worldwide Protein Data Bank (2008). [0153] 52. T. W. Beaman, M. Sugantino and S. L. Roderick, Worldwide Protein Data Bank (1998). [0154] 53. G.-y. L. Daniel Cox, Michael Toney, Xi Chen, Josh Hihath, Gergely Zimanyi, Natha Robert Hayre,Ma ria Peralta U.S. Pat. No. 10,287,332B2 (2019). [0155] 54. T. F. Smith and M. S. Waterman, Journal of Molecular Biology 147 (1), 195-197 (1981). [0156] 55. S. B. Needleman and C. D. Wunsch, Journal of Molecular Biology 48 (3), 443-+(1970). [0157] 56. W. R. Pearson and D. J. Lipman, Proceedings of the National Academy of Sciences of the United States of America 85 (8), 2444-2448 (1988). [0158] 57. S. F. Altschul, W. Gish, W. Miller, E. W. Myers and D. J. Lipman, Journal of Molecular Biology 215 (3), 403-410 (1990). [0159] 58. S. F. Altschul, T. L. Madden, A. A. Schaffer, J. H. Zhang, Z. Zhang, W. Miller and D. J. Lipman, Nucleic Acids Research 25 (17), 3389-3402 (1997). [0160] 59. S. Henikoff and J. G. Henikoff, Proceedings of the National Academy of Sciences of the United States of America 89 (22), 10915-10919 (1992). [0161] 60. S. Karlin and S. F. Altschul, Proceedings of the National Academy of Sciences of the United States of America 90 (12), 5873-5877 (1993). [0162] 61. S. Karlin and S. F. Altschul, Proceedings of the National Academy of Sciences of the United States of America 87 (6), 2264-2268 (1990). [0163] 62. A. Basit, T. Ali and S. U. Rehman, Journal of Biomolecular Structure & Dynamics (2020). [0164] 63. E. Krieger and G. Vriend, Bioinformatics 30, 2981-2982 (2014). [0165] 64. M. D. R. Peralta, A. Karsai, A. Ngo, C. Sierra, K. T. Fong, N. R. Hayre, N. Mirzaee, K. M. Ravikumar, A. J. Kluber, X. Chen, G.-y. Liu, M. D. Toney, R. R. Singh and D. L. Cox, Acs Nano 9 (1), 449-463 (2015). [0166] 65. L. P. Heinz, K. M. Ravikumar and D. L. Cox, Nano Letters 15 (5), 3035-3040 (2015). [0167] 66. Z. Peng, A. S. Parker, M. D. R. Peralta, K. M. Ravikumar, D. L. Cox and M. D. Toney, Biophysical Journal 113 (9), 1945-1955 (2017). [0168] 67. Z. Peng, M. D. R. Peralta and M. D. Toney, Biochemistry 56 (45), 6041-6050 (2017). [0169] 68. A. S. Parker, K. M. Ravikumar and D. L. Cox, Soft Matter 13, 6218-6226 (2017). [0170] 69. R. A. Baarda, T. L. Marianchuk, M. D. Toney and D. L. Cox, Soft Matter 14 (40), 8095-8104 (2018). [0171] 70. Z. Peng, M. D. R. Peralta, D. L. Cox and M. D. Toney, PloS one 15 (2), e0229319-e0229319 (2020). [0172] 71. P. I. Kitov and D. R. Bundle, Journal of the American Chemical Society 125 (52), 16271-16284 (2003). [0173] 72. G. Vauquelin, G. Bricca and I. Van Liefde, Fundamental & Clinical Pharmacology 28 (5), 530-543 (2014). [0174] 73. G. Vauquelin and S. J. Charlton, British Journal of Pharmacology 168 (8), 1771-1785 (2013). [0175] 74. S. J. Charlton and G. Vauquelin, British Journal of Pharmacology 161 (6), 1250-1265 (2010). [0176] 75. G. Vauquelin and S. J. Charlton, British Journal of Pharmacology 161 (3), 488-508 (2010). [0177] 76. M. Schoof, B. Faust, R. A. Saunders, S. Sangwan, V. V. Rezelj, N. Hoppe, M. Boone, C. Billesboelle, M. Zimanyi and I. Deshpande, bioRxiv (2020). [0178] 77. G. Weng, E. Wang, Z. Wang et al., Nucleic Acids Research 47, Supplement W, W322-W330 (2019). [0179] 78. S. Klein, M. Cortese, S. L. Winter, M. Wachsmuth-Melm, C. J. Neufeldt, B. Cerikan, M. L. Stanifer, S. Boulant, R. Bartenschlager and P. Chlanda, BioRxiv (2020). [0180] 79. K. K. Chan, D. Dorosky, P. Sharma, S. A. Abbasi, J. M. Dye, D. M. Kranz, A. S. Herbert and E. Procko, Science (New York, N.Y.) (2020). [0181] 80. M. S. Dennis, H. Jin, D. Dugger, R. Yang, L. McFarland, A. Ogasawara, S. Williams, M. J. Cole, S. Ross and R. Schwall, Cancer Research 67 (1), 254-261 (2007). [0182] 81. M. S. Dennis, M. Zhang, Y. G. Meng, M. Kadkhodayan, D. Kirchhofer, D. Combs and L. A. Damico, Journal of Biological Chemistry 277 (38), 35035-35043 (2002). [0183] 82. M. S. Dennis, Patent No. US2010104588-A1. [0184] 83. M. S. Dennis, Patent No. US2007202045-A1.
[0185] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, one of skill in the art will appreciate that certain changes and modifications may be practiced within the scope of the appended claims. In addition, each reference provided herein is incorporated by reference in its entirety to the same extent as if each reference was individually incorporated by reference. Where a conflict exists between the instant application and a reference provided herein, the instant application shall dominate.