SARS-2 SPIKE PROTEIN DESIGNS, COMPOSITIONS AND METHODS FOR THEIR USE
20230310582 · 2023-10-05
Inventors
- Rory HENDERSON (Durham, NC, US)
- Kevin Saunders (Durham, NC, US)
- Barton F. Haynes (Durham, NC)
- Priyamvada Acharya (Durham, NC, US)
Cpc classification
A61K39/215
HUMAN NECESSITIES
C12N2770/18034
CHEMISTRY; METALLURGY
A61K2039/57
HUMAN NECESSITIES
C07K2319/30
CHEMISTRY; METALLURGY
A61K2039/55555
HUMAN NECESSITIES
C07K14/165
CHEMISTRY; METALLURGY
International classification
Abstract
The invention provides SARS-2 spike protein designs and uses thereof.
Claims
1. A modified SARS-2 spike protein comprising amino acid changes as described in
2. The modified SARS-2 spike protein of claim 1, wherein the protein comprises a recombinant protein comprising all the consecutive amino acids after the signal peptide of polypeptide sequences in
3. The modified SARS-2 spike protein of claim 1 comprising S383C D985C (RBD to S2 double mutant; rS2d) mutations.
4. A nucleic acid encoding the modified SARS-2 spike protein of claim 1.
5. The nucleic acid of claim 4, wherein the nucleic acid is a modified mRNA.
6. The nucleic acid of claim 5, wherein the mRNA is in a composition comprising lipid nanoparticles.
7. The nucleic acid of claim 4, wherein the nucleic acid is comprised in a vector and is operably linked to a promoter.
8. A composition comprising the modified SARS-2 spike protein of claim 2 or a nucleic acid encoding the modified SARS-2 spike protein, and a carrier.
9. (canceled)
10. A protein nanoparticle or virus-like particle (VLP), comprising the modified SARS-2 spike protein of claim 2.
11. (canceled)
12. A host cell comprising a nucleic acid molecule encoding the modified SARS-2 spike protein of claim 2.
13. An immunogenic composition comprising the modified SARS-2 spike protein of claim 1, a nucleic acid encoding the modified SARS-2 spike protein, a nanoparticle or VLP including the modified SARS-2 spike protein, and a pharmaceutically acceptable carrier.
14. A method for inducing an immune response to a SARS- 2 spike protein in a subject, comprising administering to the subject an effective amount of the modified SARS-2 spike protein of claim 1, a nucleic acid encoding the modified SARS spike protein, or a composition thereof.
15. A modified SARS-2 spike protein, comprising an amino acid sequence of an N165A variant or an N234A variant.
16. The modified SARS-2 spike protein of claim 15, comprising all the consecutive amino acids after the signal peptide of a modified SARS-2 spike protein comprising the amino acid sequence of the N165A variant or the N234A variant.
17-28. (canceled)
Description
BRIEF DESCRIPTION OF DRAWINGS
[0066] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080] Vaccine-induced neutralizing antibodies against pseudotyped (top panels) or live (bottom panels) SARS-CoV-2 viruses. ID50, inhibitory dilutions at which 50% viruses were neutralized. Each dot indicates one animal, and the bars show geometric means. Pseudovirus assays were performed in 293T/ACE2 cells, and live SARS-CoV-2 microneutralization assays were performed in Vero cells.
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
DETAILED DESCRIPTION
[0122] The invention provides proteins and nucleic acids, including modified mRNAs which are stable and can be used as immunogens. Provided also are nucleic acids optionally designed as vectors, for example for recombinant expression and/or stable integration, e.g. but not limited, full-length S protein DNA encoding trimer for stable expression, or VLP incorporation.
[0123] Detailed descriptions of one or more preferred embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in any appropriate manner.
[0124] The singular forms “a”, “an” and “the” include plural reference unless the context clearly dictates otherwise. The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.”
[0125] Wherever any of the phrases “for example,” “such as,” “including” and the like are used herein, the phrase “and without limitation” is understood to follow unless explicitly stated otherwise. Similarly, “an example,” “exemplary” and the like are understood to be nonlimiting.
[0126] The term “substantially” allows for deviations from the descriptor that do not negatively impact the intended purpose. Descriptive terms are understood to be modified by the term “substantially” even if the word “substantially” is not explicitly recited.
[0127] The terms “comprising” and “including” and “having” and “involving” (and similarly “comprises”, “includes,” “has,” and “involves”) and the like are used interchangeably and have the same meaning. Specifically, each of the terms is defined consistent with the common United States patent law definition of “comprising” and is therefore interpreted to be an open term meaning “at least the following,” and is also interpreted not to exclude additional features, limitations, aspects, etc. Thus, for example, “a process involving steps a, b, and c” means that the process includes at least steps a, b and c. Wherever the terms “a” or “an” are used, “one or more” is understood, unless such interpretation is nonsensical in context.
[0128] The term “about” is used herein to mean approximately, roughly, around, or in the region of. When the term “about” is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. In general, the term “about” is used herein to modify a numerical value above and below the stated value by a variance of 20 percent up or down (higher or lower).
[0129] SARS-2 Coronavirus S Protein Designs
[0130] The ongoing global pandemic of the new SARS-CoV-2 coronavirus presents an urgent need for the development of effective preventative and treatment therapies. The viral-host cell fusion (S) protein spike is a prime target for such therapies owing to its critical role in the virus lifecycle. The S protein is divided into two regions: the N-terminal S1 domain that caps the C-terminal S2 fusion domain. Binding to host receptor via the Receptor Binding Domain (RBD) in S1 is followed by proteolytic cleavage of the spike by host proteases. Large conformational changes in the S-protein result in S1 shedding and exposure of the fusion machinery in S2. Class I fusion proteins such as the coronavirus (CoV) S protein that undergo large conformational changes during the fusion process must, by necessity, be highly flexible and dynamic. Indeed, cryo-EM structures of the SARS-CoV-2 (SARS-2) spike protein reveal considerable flexibility and dynamics in the S1 domain.sup.1,2, especially around the RBD that exhibits two discrete conformational states—a “down” state that is shielded from receptor binding, and an “up” state that is receptor-accessible. We will use our robust, high-throughput computational and experimental pipeline to define the detailed trajectory of the “down” to “up” transition of the SARS-2 S protein, identify early metastable intermediates in the fusion pathway, and exploit their structures and dynamics for identifying drug and vaccine candidates that target SARS-2.
[0131] A wealth of structural information on CoV spike proteins, including recently determined cryo-EM structures of the SARS-2 spike.sup.1-11, provides a rich source of detailed data from which to begin precise examination of macromolecular transitions underlying triggering of this fusion machine. In certain aspects the invention provides that(a) analysis quantifying CoV S1 domain movements around which structurally conserved domains undergo rigid body motions, (b) in silico, prescreened panel of differentially domain position stabilizing mutations, and (c) integrated computational and experimental approach with unprecedented, dedicated access to >300 accelerated compute devices (GPUs), rapid and priority access to a K3 direct electron detector equipped Titan Krios electron microscope, and high-throughput structural determination pipeline. Together, this puts us in a unique position to provide atomically detailed mechanistic insight into the fusion mechanism of the SARS-2 virus. The scientific premise of this study is that understanding the structural dynamics and early transition kinetics of mobile regions of the SARS-2 spike will allow optimal control of vaccine and drug responses, and facilitate the development of new antiviral drugs and protective vaccines. The goal of this study is to define mechanistically-derived transition states of the pre-fusion SARS-2 spike that can be exploited for vaccine and drug design.
[0132] The invention is based on work to define domain motions in the pre-fusion SARS-2 spike. The idea is that while the RBD undergoes a dramatic “up” and “down” hinge motion, other subtle movements in the pre-fusion SARS-2 spike play an important role in defining antibody and ligand binding specificity. Analysis of CoV S protein structures revealed subtle shifts in S1 that make and break interactions with adjacent domains, resulting in multistate or disordered behavior of the RBD in its “down” position. Here, we identify a set of mutations that lock and stabilize the SARS-2 S protein with the RBD in discrete “down” positions, each with different but specific RBD positions rather than the usual multistate behavior observed in all CoV spike structures determined to date. Deploying rapid assays to assess protein expression, thermostability, and antigenicity, we will generate a set of stabilized SARS-2 spike variants with defined reactivity to patient-sera. We will determine high-resolution cryo-EM structures to define the metastable RBD “down” state orientations in these mutants, and use the combined experimental information from structures, biochemistry and biophysics to iterate the structure-guided computational design cycle.
[0133] The invention is based on work to define the trajectory of the transition between the “down” and “up” states of the SARS-2 S protein. The idea is that the SARS-2 S protein transitions through multiple metastable intermediate states between the known “down” and “up” states. Using an integrated approach, we will interrogate the mechanism by which the SARS-2 S protein transitions from its “down” state to the receptor-accessible “up” state. Our initial examination of the available CoV S protein structures quantifies specific rigid body domain movements within each state. Using a combination of path finding and adaptive sampling molecular dynamics (MD) simulation techniques, we will develop a theoretical model of this initial triggering event. Structural details from the putative path will be used to stabilize predicted intermediate states. Provided are experiments to study the biochemical and biophysical properties of these putative intermediates, and determine their structures using high-resolution cryo-EM. We will assess the reactivity of these structures to patient-sera and known SARS-2 spike ligands to define state antigenicity.
[0134] In certain aspects the invention provides methods to determined structures of multiple “down”, “up”, and intermediate states of the SARS-2 S protein. Given the current global health emergency we will prioritize rapid dissemination of results to the community. Importantly, we will make available coordinates from the experimentally refined transition ensemble determined via MD simulation to enable close examination of the presented transition by researchers in the fields of drug and vaccine design. Overall, these studies will provide atomically detailed structural and mechanistic information that can be exploited for vaccine and therapeutics design.
[0135] On Mar. 11, 2020, the World Health Organization (WHO) characterized the ongoing spread of COVID-19, a highly contagious respiratory disease caused by the new betacoronavirus SARS-CoV-2 (SARS-2), a pandemic. Originating in the Wuhan province of China, now spread to over 100 countries, the virus has infected >150,000 individuals and caused >8000 deaths world-wide. As the virus continues to spread, there is an urgent need to understand as much as possible, as rapidly as possible, about this new virus.
[0136] The transmembrane SARS-2 S protein spike trimer (
[0137] The SARS-2 S protein displays striking structural similarities with the S proteins of the previously identified SARS-CoV, MERS-CoV, and other human and murine CoV viruses. However, most S-targeting antibodies to SARS and MERS do not cross-react with SARS-2. Conformational evasion is among the many host immune evasion tools available to viruses. Dramatic shifts in the conformational ensemble of states for CoVs have in fact been demonstrated.sup.1,2. Therefore, a detailed understanding of structure and dynamics of the SARS-2 S protein in comparison to is orthologs will reveal how genetic drift can give rise to the large phenotypic differences that drive viral evolution and host immune evasion.
[0138] Thus, the urgent need to understand the SARS-2 virus that is responsible for the ongoing pandemic makes this study significant and relevant to public health.
[0139] Provided are studies that use an integrated structural biology approach to harness the latest innovations in high-throughput cryo-electron microscopy and computational methodologies to approach this urgent global healthcare problem. These studies include use of Titan Krios microscope fitted with a K3 camera for rapid determination of high-resolution structures, access to a Philips EM420 microscope, as well as to a Talos Arctica for cryo-screening and data collection at the National Institutes for Environmental Health Sciences (NIEHS), NIH.
[0140] Studies will be able to test immunogenicity in mice and rabbits of any promising SARS-2 spike variants generated in this study.
[0141] In non-limiting embodiments, aspects of the invention are based on the idea that protein dynamics impact its antigenic and immunogenic properties. Coronavirus designs are based on an integrated approach that closely couples structure and molecular dynamics-driven protein engineering with biophysical, biochemical, virological and immunological studies.
[0142] Conformationally distinct structural states of the CoV S-protein spike are well defined. The transmembrane CoV S protein spike trimer is composed of interleaved protomers that include an N-terminal receptor binding S1 domain and a C-terminal S2 domain that contains the fusion elements (
[0143] A detailed structural schematic defining the geometry and internal rearrangements of movable domains. An understanding of macromolecular structural dynamics requires a precise definition of structural states. Examination of the available SARS and MERS S-protein structures revealed: 1) the NTD, RBD, subdomains and internal S2 domains move as rigid bodies and 2) these domains display a remarkable array of relative shifts between the S1 region's domains and the S2 region's β-sheet motif and CD (
[0144] SARS-2 S Protein Production, Purification and Structural Characterization
[0145] The SARS-2 S protein ectodomain.sup.2 was expressed in 293F cells and purified using published methods to yield ˜4 mg/L purified spike (
[0146] Advanced molecular simulation results for the SARS ectodomain spike indicate rapid exchange between metastable states. In order to examine the breadth and time scales of the dynamics of CoV spike protein structure, we initiated an adaptive sampling simulation of the symmetric all ‘down’, closed state of the SARS CoV soluble S-protein (PDB ID 6ACC.sup.6). To overcome the sampling problem in MD, the adaptive scheme periodically monitors multiple simultaneous simulations and launches additional simulations in regions of the coordinate space along transition paths. In this way, difficult to observe slow processes become accessible. In total, we obtained 539 independent 50 ns simulations totaling ˜27 μs of simulation time. Monitoring contacts between each protomer's RBD and their adjacent RBD, NTD, and HR1 C-terminus, we further projected the data using the time-lagged independent component analysis (TICA) approach. TICA components point in the direction of the slowest processes in the simulation data which means that transitions along the so call TICs can correspond to transitions between metastable states (
[0147] In certain aspects the invention provides methods to define symmetric and asymmetric down state domain arrangements in the SARS-2 S protein. Our analysis of the available CoV S-protein structures reveals a wide breadth of conformational states. We therefore ask the following questions: 1) Is it possible to eliminate or markedly reduce S1 flexibility? 2) How does the stabilizing strategy affect distant domain arrangements? 3) The MERS spike domain arrangement is distinct from SARS and SARS- 2; is it possible to insert MERS residue substitutions in SARS-2 to induce this arrangement? 4) Does a change in domain arrangement impact ectodomain antigenicity? In order to answer these questions, we designed differentially stabilized the S-protein domains. To this end, we have screened in silico, using the Schrödinger software suite, a large panel of mutations designed to stabilize specific regions of the S-protein (
[0148] Approach and Methods:
[0149] Small-scale transfections of plasmids encoding the mutated S-protein (
[0150] Testing of cell-culture supernatants for binding to 1) Streptavidin, in a biolayer interferometry (BLI)-based screen similar to that performed in our recent HIV-1 stabilization manuscript.sup.12, to determine expression levels and 2) to the ACE-2 receptor, and other RBD-reactive ligands such as antibodies CR3022 and 47D11 .sup.13-16, that will report on the disposition of the RBD within the spike. Supernatants from untransfected cells will be used as control.
[0151] Constructs showing optimal expression and certainACE-2 binding phenotypes will be purified using the PureSpeed (Mettler Toledo) IMAC based high-throughput purification system.
[0152] Purified proteins will then be characterized using SDS-PAGE, western blotting, rapid fluorescence-based thermostability assays .sup.17-19 size exclusion chromatography (SEC) and NSEM.
[0153] Constructs with confirmed expression of the trimeric spike protein (SDS-PAGE, SEC and NSEM), and improved properties, e.g. but not limited to melting temperature at least 5° C. higher than the unmutated construct, will be selected as candidates for the next round of selection. For these constructs we will determine 1) ACE2 binding and affinity, 2) thermal stability via differential scanning calorimetry, and 3) high-resolution structures via cryo-EM. Collecting large cryo-EM datasets of at least 2 million particles for each mutated construct will allow heterogenous 3D classification. We will compare the structures with that of the unmutated construct, determining changes in residue-residue contacts, epitope shape and accessibility, shifts in the probability of finding the construct in any particular state, and measures of conformational shifts using our vector-based analysis.
[0154] Finally, we will test for differential changes in antigenicity of these constructs using sera from infected patient. This will provide selection criteria for subsequent studies.
[0155] Constructs will also be tested for immunogenicity in any suitable animal model, including without limitation mouse studies, NHP studies, and so forth.
[0156] Small-scale transfections can yield relatively small quantities of protein for some constructs. NSEM and fluorescence-based thermostability measurements require very small amount of protein (less than 10 μg), increasing the chances that most of the constructs will yield sufficient protein for the small-scale screens. For those constructs that do not, we will use a larger transfection volume. The ones that fail to express we will remove from our list. Structural determination by cryo-EM also requires very small amounts of protein thus ensuring for most constructs we will be able to obtain high resolution feedback on the designs. 2. Failed designs: Some of the designs may not show expected phenotypes, a risk inherent in this type approach. The large number of in silico designs we are starting with along with the high throughput assays, can allow us to quickly select the designs that show promise and rapidly iterate the experimental and design cycles. This approach has been successful in structure-guided vaccine design.sup.20,21. If a particular set fail to provide a stabilized construct, we will turn to the high-resolution cryo-EM structures determined in our heterogenous refinement of the unmutated construct to initiate additional design iterations.
[0157] At the successful conclusion of this study, without wishing to be bound by theory, we will provide a detailed, high-resolution mapping of conformational states occupied by the SAR-2 S-protein and shifts in conformational distribution with changes in domain interface interactions. Further, we will demonstrate the degree to which changes in the conformational distribution alter S-protein antigenicity. These results will provide a framework from which to consider how genetic drift in the SARS-2 can affect the spread of the disease and how containment by vaccination can be affected by the selection of conformationally varying mutants.
[0158] In certain aspects the invention provides methods to define, in atomic detail, the transition between the down and up states of the SARS-2 S protein spike. While the HIV-1 Env utilizes a complex network of allosteric machinery to signal receptor binding, the CoV S protein appears to use a kinetic strategy toward receptor recognition and triggering (
[0159] Approach and Methods:
[0160] An in-house developed projection method calculating the pairwise relative angles between the NTD, NTD′, RBD, subdomains 1 and 2, the S2-sheet motif, and the CD of each protomer will used in the adaptive scheme.
[0161] Converged Markov model transition intermediates will be used as “bait” to isolate minor populations of intermediates by heterogenous classification of cryo-EM data
[0162] These MD based particle sets will be unbiased via independent ab initio map reconstruction and subsequent high-resolution refinement for comparison against the MD state.
[0163] The equilibrium distribution of states determined by cryo-EM will be compared to the MD predicted equilibrium. Upon validation, we will analyze the MD transition kinetics, thermodynamics, and path(s).
[0164] All atom simulations will be carried out using HTMD.sup.24 and Amberz18.sup.25 for the adaptive sampling protocol. The Amber ff14SB.sup.26 and Glycam.sup.27 forcefields using a truncated octahedral TIP3P.sup.28 water box and a time-step of 4 fs using hydrogen mass repartitioning.sup.29 in the NVT ensemble will be used throughout for production runs. Simulations will be lengthened and the number of iterations increased if model validation demonstrate a need. Markov models will be built using the PyEMMA.sup.22 software package. Markov model convergence will be monitored based upon linearity in the implied timescale plots and the Chapman-Kolmogorov test and uncertainty will be determined via bootstrapping of the simulated data.
[0165] As blind sampling of the adaptive states can lead to significant simulation time spent in irrelevant states, we will use the FAST′ algorithm to focus sampling in the direction of the known open, ‘up’ state. Due to the size and complexity of the CoV structure, a divide and conquer approach toward simulating the opening process can be necessary. We will split the approach into several distinct modelling steps via a combination of proven approaches.sup.31-33 as needed. The coordinate projection method and the Markov model lag time must be optimized as well. We will test multiple projection methods and compare using the so-called VAMP scoring criteria.sup.34. Finally, if inconsistencies between the simulated and experimental results arise our path forward will involve a sequential shift toward relying upon the experimental data to drive the description of the transition. Even if the model rates and equilibrium values disagree with the cryo-EM data we will still be able to discern possible paths and identify mutations that affect the distribution. Determination of transition kinetics via ACE2 binding, thermal melting temperatures, and cryo-EM state distributions can instead be used to define the transition while still retaining the utility of the MD approach.
[0166] These studies can provide key details important for understanding the transition from the prefusion, closed to the post fusion open structures of the SARS-2 fusion protein. This will include a detailed description of metastable intermediate states, transition states, transition kinetics, and transition free energies. This mechanism will be supported by high-resolution structures. Together, this information will provide atomic details important for both drug and vaccine design as well as in the prediction of conformational evasion mutations in the evolving SARS-2 virus.
REFERENCES
[0167] 1 Walls, A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell, doi: doi.org/10.1016/j.cell.2020.02.058 (2020). [0168] 2 Wrapp, D. et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367, 1260-1263, doi:10.1126/science.abb2507 (2020). [0169] 3 Kirchdoerfer, R. N. et al. Pre-fusion structure of a human coronavirus spike protein. Nature 531, 118-121, doi:10.1038/nature17200 (2016). [0170] 4 Yuan, Y. et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nature Communications 8, 15092, doi:10.1038/ncomms15092 (2017). [0171] 5 Gui, M. et al. Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding. Cell Research 27, 119-129, doi:10.1038/cr.2016.152 (2017). [0172] 6 Song, W., Gui, M., Wang, X. & Xiang, Y. Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLOS Pathogens 14, e1007236, doi:10.1371/journal.ppat.1007236 (2018). [0173] 7 Kirchdoerfer, R. N. et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Scientific Reports 8, 15701, doi:10.1038/s41598-018-34171-7 (2018). [0174] 8 Walls, A. C. et al. Unexpected Receptor Functional Mimicry Elucidates Activation of Coronavirus Fusion. Cell 176, 1026-1039.e1015, doi:https://doi.org/10.1016/j.cell.2018.12.028 (2019). [0175] 9 Pallesen, J. et al. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proceedings of the National Academy of Sciences 114, E7348, doi:10.1073/pnas.1707304114 (2017). [0176] 10 Tortorici, M. A. et al. Structural basis for human coronavirus attachment to sialic acid receptors. Nature Structural & Molecular Biology 26, 481-489, doi:10.1038/s41594-019-0233-y (2019). [0177] 11 Walls, A. C. et al. Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer. Nature 531, 114-117, doi:10.1038/nature16988 (2016). [0178] 12 Henderson, R. et al. Disruption of the HIV-1 Envelope allosteric network blocks CD4-induced rearrangements. Nature Communications 11, 520, doi:10.1038/s41467-019-14196-w (2020). [0179] 13 Hodgson, J. The pandemic pipeline. Nature Biotechnology (2020). [0180] 14 Wang, C. e. a. A human monoclonal antibody blocking SARS-CoV-2 infection. bioRxiv doi: doi.org/10.1101/2020.03.11.987958 (2020). [0181] 15 ter Meulen, J. et al. Human monoclonal antibody combination against SARS coronavirus: synergy and coverage of escape mutants. PLoS Med 3, e237, doi:10.1371/journal.pmed.0030237 (2006). [0182] 16 Tian, X. et al. Potent binding of 2019 novel coronavirus spike protein by a SARS coronavirus-specific human monoclonal antibody. Emerg Microbes Infect 9, 382-385, doi:10.1080/22221751.2020.1729069 (2020). [0183] 17 Nilsen, J. et al. Human and mouse albumin bind their respective neonatal Fc receptors differently. Sci Rep 8, 14648, doi:10.1038/s41598-018-32817-0 (2018). [0184] 18 Hendus-Altenburger, R. et al. Molecular basis for the binding and selective dephosphorylation of Na(+)/H(+) exchanger 1 by calcineurin. Nat Commun 10, 3489, doi:10.1038/s41467-019-11391-7 (2019). [0185] 19 Nosrati, M. et al. Functionally critical residues in the aminoglycoside resistance-associated methyltransferase RmtC play distinct roles in 30S substrate recognition. J Biol Chem 294, 17642-17653, doi:10.1074/jbc.RA119.011181 (2019). [0186] 20 McLellan, J. S. et al. Structure-based design of a fusion glycoprotein vaccine for respiratory syncytial virus. Science 342, 592-598, doi:10.1126/science.1243283 (2013). [0187] 21 Kwon, Y. D. et al. Crystal structure, conformational fixation and entry-related interactions of mature ligand-free HIV-1 Env. Nat Struct Mol Biol 22, 522-531, doi:10.1038/nsmb.3051 (2015). [0188] 22 Scherer, M. K. et al. PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov Models. Journal of Chemical Theory and Computation 11, 5525-5542, doi:10.1021/acs.jctc.5b00743 (2015). [0189] 23 Chodera, J. D. & Noe, F. Markov state models of biomolecular conformational dynamics. Current Opinion in Structural Biology 25, 135-144, doi:doi.org/10.1016/j.sbi.2014.04.002 (2014). [0190] 24 Doerr, S., Harvey, M. J., Noe, F. & De Fabritiis, G. HTMD: High-Throughput Molecular Dynamics for Molecular Discovery. Journal of Chemical Theory and Computation 12, 1845-1852, doi:10.1021/acs.jctc.6b00049 (2016). [0191] 25 D. A. Case, D. S. C., T. E. Cheatham, I I I, T. A. Darden, R. E. Duke, T. J. Giese, H. Gohlke, A. W. Goetz, D. Greene, N. Homeyer, S. Izadi, A. Kovalenko, T. S. Lee, S. LeGrand, P. Li, C. Lin, J. Liu, T. Luchko, R. Luo, D. Mermelstein, K. M. Merz, G. Monard, H. Nguyen, I. Omelyan, A. Onufriev, F. Pan, R. Qi, D. R. Roe, A. Roitberg, C. Sagui, C. L. Simmerling, W. M. Botello-Smith, J. Swails, R. C. Walker, J. Wang, R. M. Wolf, X. Wu, L. Xiao, D. M. York and P. A. Kollman. (2017). [0192] 26 Maier, J. A. et al. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. Journal of Chemical Theory and Computation 11, 3696-3713, doi:10.1021/acs.jctc.5b00255 (2015). [0193] 27 Kirschner, K. N. et al. GLYCAM06: a generalizable biomolecular force field. Carbohydrates. Journal of computational chemistry 29, 622-655, doi:10.1002/jcc.20820 (2008). [0194] 28 Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. The Journal of Chemical Physics 79, 926-935, doi:10.1063/1.445869 (1983). [0195] 29 Hopkins, C. W., Le Grand, S., Walker, R. C. & Roitberg, A. E. Long-Time-Step Molecular Dynamics through Hydrogen Mass Repartitioning. Journal of Chemical Theory and Computation 11, 1864-1874, doi:10.1021/ct5010406 (2015). [0196] 30 Zimmerman, M. I. & Bowman, G. R. FAST Conformational Searches by Balancing Exploration/Exploitation Trade-Offs. Journal of Chemical Theory and Computation 11, 5747-5757, doi:10.1021/acs.jctc.5b00737 (2015). [0197] 31 Moradi, M. & Tajkhorshid, E. Computational Recipe for Efficient Description of Large-Scale Conformational Changes in Biomolecular Systems. Journal of Chemical Theory and Computation 10, 2866-2880, doi:10.1021/ct5002285 (2014). [0198] 32 Wang, W., Cao, S., Zhu, L. & Huang, X. Constructing Markov State Models to elucidate the functional conformational changes of complex biomolecules. WIREs Computational Molecular Science 8, e1343, doi:10.1002/wcms.1343 (2018). [0199] 33 Singharoy, A. & Chipot, C. Methodology for the Simulation of Molecular Motors at Different Scales. The Journal of Physical Chemistry B 121, 3502-3514, doi:10.1021/acs.jpcb.6b09350 (2017). [0200] 34 Mardt, A., Pasquali, L., Wu, H. & Noe, F. VAMPnets for deep learning of molecular kinetics. Nature Communications 9, 5, doi:10.1038/s41467-017-02388-1 (2018).
[0201] The SARS-2 S protein includes the receptor binding domain and is a target for neutralizing antibodies. We have designed recombinant DNA constructs that express SARS-2 coronavirus S protein (GenBank Accession number: YP_009724390.1, which is incorporated by reference) as the full-length, transmembrane S protein or a truncated version of the S protein that lacks the C-terminal transmembrane domain and cytoplasmic tail. The truncated S protein is secreted from expressing cells, whereas the full-length version of the plasmid is expressed on the cell surface. Additional SARS-2 S protein sequences from circulating viruses are found in the GISAID EpiFlu™ Database. These sequences can also be modified with any of the modifications described herein.
[0202] The S protein designs have several modifications from the wildtype reference sequence from GenBank. First, the SARS-2 protein sequence encodes furin cleavage sites and a cathepsin L cleavage site. The recombinant protein will be made with and without these protease cleavage sites to see if they affect protein quality, yield, and immunogenicity. Second, the natural signal peptide that directs intracellular trafficking of the S protein will be exchanged for the bovine prolactin signal peptide. The bovine prolactin signal peptide is a strong signal peptide that directs proteins into the secretory pathway. This signal peptide is predicted by the SignalP 5.0 program to be cleaved off of the mature S protein more efficiently than the natural virus signal peptide sequence. Third, the secreted S protein can trimerize in order to resemble the native, membrane-bound S protein on coronavirus virions. However, the truncated, secreted S protein lacks the transmembrane domain and thus may not form a stable trimeric protein. To facilitate trimerization, we added a trimerization domain to the C-terminus of some truncated S proteins. The trimerization domain can be a 29 amino acid sequence called foldon for T4 bacteriophage fibritin protein (Strelkov S V et al. Biochemistry. 1999; Frank S et al. J Mol Biol. 2001). Fourth, we have encoded de novo cysteines to the protein sequence to create new intramolecular and intermolecular disulfide bonds. The bonds prevent conformational changes within the S protein. Non-limiting examples are represented by Cluster modifications 1-11. Fifth, we have encoded two new prolines in between HR1 and the central helix in the S protein to stabilize the polypeptide turns in the S2 protein (Pallesen et al. PNAS. 2017). Sixth, we have added an AviTag to the truncated S proteins to facilitate functionalization by streptavidin binding.
[0203] For development as a vaccine immunogen, we have also created multimeric nanoparticles that display SARS-CoV-2 S protein on their surface. The rationale for creating such immunogens is that presenting multiple copies of the immunogen allows for a more avid interaction between the immunogen and naïve B cell receptors during the immune response. Thus, weak affinity interactions between the B cell receptor and immunogen are enhanced due to the multiple interactions that work in concert. This improved interaction with B cells can underlies the improved uptake of multimeric immunogens by B cells. The internalized immunogen is then presented to T cells in the context of MHC molecules. The T cells in turn provide the required costimulatory signals to the B cells to promote B cell maturation. Additionally, the SARS-CoV-2 S protein has 22 glycosylation sites, which can interact with lectins to facilitate trafficking to secondary lymphoid organs. Multimerization of viral spike glycoproteins improves their interaction with mannose binding lectin, thereby increasing antigen trafficking to sites with abundant immune cells.
[0204] The nanoparticle immunogens are composed of various fragments of SARS-CoV-2 S protein and self-assembling ferritin protein derived from Helicobacter pylori. Each nanoparticle displays 24 copies of the S protein on its surface. The S protein is displayed as a soluble spike trimer that has the transmembrane domain and cytoplasmic tail removed and a foldon trimerization domain added. To focus antibodies to neutralizing targets, the S protein will be truncated down to only the receptor binding domain (RBD), which is a known target for neutralizing antibodies. This construct has the potential to generate neutralizing antibodies, while not eliciting binding antibodies to other sites that mediate antibody-dependent enhancement of virus infectivity (Wang et al. Biochem Biophys Res Commun. 2014 Aug. 22; 451(2):208-14; Jaume et al. J Virol. 2011 October; 85(20):10582-97).
[0205] Nucleic Acid Sequences
[0206] In certain aspects, the invention provides nucleic acids comprising sequences encoding proteins of the invention. In certain embodiments, the nucleic acids are DNAs. In certain embodiments, the nucleic acids are mRNAs. In certain aspects, the invention provides expression vectors comprising the nucleic acids of the invention.
[0207] In certain aspects, the invention provides a pharmaceutical composition comprising mRNAs encoding the inventive antibodies. In certain embodiments, these are optionally formulated in lipid nanoparticles (LNPs) or liposomes. In certain embodiments, the mRNAs are modified. Modifications include without limitations modified ribonucleotides, poly-A tail, and/or 5′ cap.
[0208] In certain aspects the invention provides nucleic acids encoding the inventive protein designs. In non-limiting embodiments, the nucleic acids are mRNA, modified or unmodified, suitable for use any use, e.g but not limited to use as pharmaceutical compositions. In certain embodiments, the nucleic acids are formulated in lipid, such as but not limited to LNPs or liposomes.
[0209] In some embodiments the antibodies are administered as nucleic acids, including but not limited to mRNAs which can be modified and/or unmodified. See US Pub 20180028645A1, US Pub 20170369532, US Pub 20090286852, US Pub 20130111615, US Pub 20130197068, US Pub 20130261172, US Pub 20150038558, US Pub 20160032316, US Pub 20170043037, US Pub 20170327842, US Pub 20180344838A1 at least at paragraphs [0260]-[0281] for non-limiting embodiments of chemical modifications, wherein the content of each is incorporated by reference in its entirety.
[0210] mRNAs delivered in LNP formulations have advantages over non-LNPs formulations. See US Pub 20180028645A1.
[0211] In certain embodiments the nucleic acid encoding a protein is operably linked to a promoter inserted an expression vector. In certain aspects the compositions comprise a suitable carrier. In certain aspects the compositions comprise a suitable adjuvant.
[0212] In certain aspects the invention provides an expression vector comprising any of the nucleic acid sequences of the invention, wherein the nucleic acid is operably linked to a promoter. In certain aspects the invention provides an expression vector comprising a nucleic acid sequence encoding any of the polypeptides of the invention, wherein the nucleic acid is operably linked to a promoter. In certain embodiments, the nucleic acids are codon optimized for expression in a mammalian cell, in vivo or in vitro. In certain aspects the invention provides nucleic acids comprising any one of the nucleic acid sequences of invention. In certain aspects the invention provides nucleic acids consisting essentially of any one of the nucleic acid sequences of invention. In certain aspects the invention provides nucleic acids consisting of any one of the nucleic acid sequences of invention. In certain embodiments the nucleic acid of the invention, is operably linked to a promoter and is inserted in an expression vector. In certain aspects the invention provides an immunogenic composition comprising the expression vector.
[0213] In certain aspects the invention provides a composition comprising at least one of the nucleic acid sequences of the invention. In certain aspects the invention provides a composition comprising any one of the nucleic acid sequences of invention. In certain aspects the invention provides a composition comprising at least one nucleic acid sequence encoding any one of the polypeptides of the invention.
[0214] In one embodiment, the nucleic acid is an RNA molecule. In one embodiment, the RNA molecule is transcribed from a DNA sequence described herein. In some embodiments, the RNA molecule is encoded by one of the inventive sequences. In another embodiment, the nucleotide sequence comprises an RNA sequence transcribed by a DNA sequence encoding the polypeptide sequences described herein, or a variant thereof or a fragment thereof. Accordingly, in one embodiment, the invention provides an RNA molecule encoding one or more of inventive antibodies. The RNA can be plus-stranded. Accordingly, in some embodiments, the RNA molecule can be translated by cells without needing any intervening replication steps such as reverse transcription.
[0215] In some embodiments, an RNA molecule of the invention can have a 5′ cap (e.g. but not limited to a 7-methylguanosine, 7mG(5′)ppp(5′)NlmpNp). This cap can enhance in vivo translation of the RNA. The 5′ nucleotide of an RNA molecule useful with the invention can have a 5′ triphosphate group. In a capped RNA this can be linked to a 7-methylguanosine via a 5′-to-5′ bridge. An RNA molecule may have a 3′ poly-A tail. It can also include a poly-A polymerase recognition sequence (e.g. AAUAAA) near its 3′ end. In some embodiments, a RNA molecule useful with the invention can be single-stranded. In some embodiments, a RNA molecule useful with the invention can comprise synthetic RNA.
[0216] The recombinant nucleic acid sequence can be an optimized nucleic acid sequence. Such optimization can increase or alter the immunogenicity of the protein. Optimization can also improve transcription and/or translation. Optimization can include one or more of the following: low GC content leader sequence to increase transcription; mRNA stability and codon optimization; addition of a kozak sequence (e.g., GCC ACC) for increased translation; addition of an immunoglobulin (Ig) leader sequence encoding a signal peptide; and eliminating to the extent possible cis-acting sequence motifs (i.e., internal TATA boxes).
[0217] Methods for in vitro transfection of mRNA and detection of protein expression are known in the art.
[0218] Methods for expression and immunogenicity determination of nucleic acid encoded proteins are known in the art.
[0219] A non-limiting embodiment of a neutralization assay is described in Zhao, G., Du, L., Ma, C. et al. A safe and convenient pseudovirus-based inhibition assay to detect neutralizing antibodies and screen for viral entry inhibitors against the new human coronavirus MERS-CoV. Virol J 10, 266 (2013). doi.org/10.1186/1743-422X-10-266, which content is incorporated by reference in its entirety. This assay can be adapted for use for SARS CoV-2.
[0220] Non-limiting embodiments of determining antibody responses are described in the following publication: “SARS-CoV-2 specific antibody responses in COVID-19 patients” Okba et al. doi.org/10.1101/2020.03.18.20038059. See also US Patent Publication 20200061185 which is incorporated by reference in its entirety.
[0221] Non-limiting embodiments of various assays, reagents, and technologies for evaluating the immunogens of the invention are described in Muthumani et al. Science Translational Medicine 19 Aug. 2015: Vol. 7, Issue 301, pp. 301ra132, DOI: 10.1126/scitranslmed.aac7462. The assays, reagents, and techniques can be adapted for use for SARS CoV-2.
[0222] Recombinant protein production of coronavirus proteins is known. See e.g. in US Patent Pub 20200061185 which disclosure is incorporated by reference in its entirety.
[0223] In some embodiments the SARS-2 S proteins of the invention are in a trimeric configuration. In some embodiments the SARS-2 S proteins of the invention are expressed as protomers which form trimers. These designs can comprise any suitable trimerization domain.
[0224] Non-limiting examples of exogenous multimerization domains that promote stable trimers of soluble recombinant proteins include: the GCN4 leucine zipper (Harbury et al. 1993 Science 262:1401-1407), the trimerization motif from the lung surfactant protein (Hoppe et al. 1994 FEBS Lett 344:191-195), collagen (McAlinden et al. 2003 J Biol Chem 278:42200-42207), and the phage T4 fibritin Foldon (Miroshnikov et al. 1998 Protein Eng 11:329-414), any of which can be linked to a recombinant coronavirus (e.g. SARS-2) S protein ectodomain described herein (e.g., by linkage to the C-terminus of S2) to promote trimerization of the recombinant coronavirus (e.g. SARS-2) S protein ectodomain.
[0225] In some examples, the C-terminus of the S2 subunit of the SARS-2 S protein ectodomain can be linked to a T4 fibritin Foldon domain. In specific examples, the T4 fibritin Foldon domain can include the amino acid sequence GYIPEAPRDGQAYVRKDGEWVLLSTF (SEQ ID NO: 1), which adopts a .beta.-propeller conformation, and can fold and trimerize in an autonomous way (Tao et al. 1997 Structure 5:789-798). Optionally, the heterologous trimerization is connected to the recombinant coronavirus (e.g. SARS-2) S protein ectodomain via a peptide linker, such as an amino acid linker. Non-limiting examples of peptide linkers that can be used include glycine, serine, and glycine-serine linkers.
[0226] In some embodiments, the SARS-2 spike protein ectodomain trimer can be membrane anchored, for example, for embodiments where the coronavirus (e.g. SARS-2) S protein ectodomain trimer is expressed on an attenuated viral vaccine, or a virus like particle. In such embodiments, the protomers in the trimer can each comprise a C-terminal linkage to a transmembrane domain, such as the transmembrane domain (and optionally the cytosolic tail) of the corresponding coronavirus. For example, the protomers of a disclosed SARS-2 S protein ectodomain trimer can be linked to a SARS-2 S protein transmembrane and cytosolic tail. In some embodiments, one or more peptide linkers (such as a gly-ser linker, for example, a 10 amino acid glycine-serine peptide linker can be used to link the recombinant SARS-2 S protein ectodomain protomer to the transmembrane domain.
[0227] The protomers linked to the transmembrane domain can include any of the modifications provided herein (or combinations thereof) as long as the recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer formed from the protomers linked to the transmembrane domain retains certain properties (e.g., the coronavirus S protein prefusion conformation).
[0228] The inventive protein or fragments thereof can be produced using recombinant techniques, or chemically or enzymatically synthesized.
[0229] In some embodiments a protein nanoparticle is provided that includes one or more of the disclosed recombinant SARS-2 S proteins, including but not limited to SARS-2 S protein trimers. Non-limiting example of nanoparticles include ferritin nanoparticles, encapsulin nanoparticles, Sulfur Oxygenase Reductase (SOR) nanoparticles, and lumazine synthase nanoparticles, which are comprised of an assembly of monomeric subunits including ferritin proteins, encapsulin proteins, SOR proteins, and lumazine synthase, respectively. Additional protein nanoparticle structures are described by Heinze et al., J Phys Chem B., 120(26):5945-52, 2016; Hsia et al., Nature, 535(7610):136-9, 2016; and King et al., Nature, 510(7503):103-8, 2014; each of which is incorporated by reference herein. To construct such protein nanoparticles a protomer of the SARS-2 S protein ectodomain trimer can be linked to a subunit of the protein nanoparticle (such as a ferritin protein, an encapsulin protein, a SOR protein, or a lumazine synthase protein) and expressed in cells under appropriate conditions. The fusion protein self-assembles into a nanoparticle and can be purified.
[0230] In some embodiments, a protomer of a disclosed recombinant SARS-2 S protein ectodomain trimer can be linked to a ferritin subunit to construct a ferritin nanoparticle. Ferritin nanoparticles and their use for immunization purposes (e.g., for immunization against influenza antigens) have been disclosed in the art (see, e.g., Kanekiyo et al., Nature, 499:102-106, 2013, incorporated by reference herein in its entirety). Ferritin is a globular protein that is found in all animals, bacteria, and plants, and which acts primarily to control the rate and location of polynuclear Fe(III).sub.2O.sub.3 formation through the transportation of hydrated iron ions and protons to and from a mineralized core. The globular form of the ferritin nanoparticle is made up of monomeric subunits, which are polypeptides having a molecule weight of approximately 17-20 kDa. In certain embodiments, the modified coronavirus spike protein or the portion thereof is linked to form a protein multimerizing/nanoparticle subunit by a peptide linker in a sortase reaction, or is directly linked to the protein multimerizing/nanoparticle subunit. In certain embodiments, the protein nanoparticle subunit is a ferritin nanoparticle subunit.
[0231] In non-limiting embodiments the multimeric complexes comprising a ferritin sequence are designed and are assembled via sortase reaction. In non-limiting embodiments the multimeric complexes comprise encapsulin.
[0232] Following production, these monomeric subunit proteins self-assemble into the globular ferritin protein. Thus, the globular form of ferritin comprises 24 monomeric, subunit proteins, and has a capsid-like structure having 432 symmetry. Methods of constructing ferritin nanoparticles are known to the person of ordinary skill in the art and are further described herein (see, e.g., Zhang, Int. J. Mol. Sci., 12:5406-5421, 2011, which is incorporated herein by reference in its entirety).
[0233] In non-specific examples, the ferritin polypeptide is E. coli ferritin, Helicobacter pylori ferritin, human light chain ferritin, bullfrog ferritin or a hybrid thereof, such as E. coli-human hybrid ferritin, E. coli-bullfrog hybrid ferritin, or human-bullfrog hybrid ferritin. Exemplary amino acid sequences of ferritin polypeptides and nucleic acid sequences encoding ferritin polypeptides for use to make a ferritin nanoparticle including a recombinant SARS-2 S protein can be found in GENBANK, for example at accession numbers ZP_03085328, ZP_06990637, EJB64322.1, AAA35832, NP 000137 AAA49532, AAA49525, AAA49524 and AAA49523, which are specifically incorporated by reference herein in their entirety. In some embodiments, a recombinant protein of the invention can be linked to a ferritin subunit to form a nanoparticle.
[0234] Polynucleotides encoding a protomer of any of the disclosed recombinant proteins are also provided. These polynucleotides include DNA, cDNA and RNA sequences which encode the protomer, as well as vectors including the DNA, cDNA and RNA sequences, such as a DNA or RNA vector used for immunization. The genetic code to construct a variety of functionally equivalent nucleic acids, such as nucleic acids which differ in sequence but which encode the same protein sequence, or encode a conjugate or fusion protein including the nucleic acid sequence.
[0235] Another approach to multimerize expression constructs uses staphylococcus Sortase A transpeptidase ligation to conjugate inventive spike ectodomain trimers or spike subunits, for e.g. but not limited to cholesterol or self multimerizing protein. The trimers can be embedded into liposomes via the conjugated cholesterol.
[0236] To conjugate the trimer a C-terminal LPXTG tag (SEQ ID NO: 2) or a N-terminal pentaglycine repeat tag is added to the spike trimer gene, where X signifies any amino acid, such as Ala, Ser, Glu. Cholesterol is also synthesized with these two tags. Sortase A is then used to covalently bond the tagged spike subunit to the cholesterol. The sortase A-tagged spike trimer protein or portion thereof can also be used to conjugate the trimer to other peptides, proteins, or fluorescent labels. In non-limiting embodiments, the sortase A tagged trimers or spike portions are conjugated to ferritin to form nanoparticles.
[0237] In several embodiments, the nucleic acid molecule encodes a precursor of the protomer, that, when expressed in an appropriate cell, is processed into a recombinant SARS-2 S protein protomer that can self-assemble into the corresponding recombinant trimer. For example, the nucleic acid molecule can encode a recombinant SARS-2 S protein ectodomain including a N-terminal signal sequence for entry into the cellular secretory system that is proteolytically cleaved in the during processing of the recombinant protein in the cell. Recombinant proteins with different signal peptide sequences are embodied by the invention.
[0238] In certain embodiments, amino acid sequences of the invention described herein comprise a signal peptide. A skilled artisan can readily determine the signal peptide sequences. Signal peptide sequences can be removed during recombinant production of proteins. In non-limiting embodiments, provided are amino acid sequences of recombinant proteins which do not include amino acids of comprising a signal peptide.
[0239] In some embodiments, the nucleic acid molecule encodes a precursor SARS-2 S polypeptide that, when expressed in an appropriate cell, is processed into a recombinant SARS-2 S protomer including S1 and S2 polypeptides, wherein the recombinant protein includes any of the appropriate modifications described herein, and optionally can be linked to a trimerization domain, such as a T4 Fibritin trimerization domain.
[0240] Exemplary nucleic acids can be prepared by molecular and cloning techniques. A wide variety of cloning methods, host cells, and in vitro amplification methodologies are well known to persons of skill, and can be used to make the nucleic acids and proteins of the invention.
[0241] The polynucleotides encoding a disclosed recombinant protomer can include a recombinant DNA which is incorporated into a vector (such as an expression vector) into an autonomously replicating plasmid or virus or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (such as a cDNA) independent of other sequences. The nucleotides can be ribonucleotides, deoxyribonucleotides, or modified forms of either nucleotide. The term includes single and double forms of DNA.
[0242] Polynucleotide sequences encoding a disclosed recombinant protomer can be operatively linked to expression control sequences. An expression control sequence operatively linked to a coding sequence is ligated such that expression of the coding sequence is achieved under conditions compatible with the expression control sequences. The expression control sequences include, but are not limited to, appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons.
[0243] DNA sequences encoding the disclosed recombinant protomer can be expressed in vitro by DNA transfer into a suitable host cell. The cell can be prokaryotic or eukaryotic. The term also includes any progeny of the subject host cell. All progeny need not be identical to the parental cell since there can be mutations that occur during replication. Methods of stable transfer, meaning that the foreign DNA is continuously maintained in the host, are known in the art.
[0244] Host systems for recombinant production can include microbial, yeast, insect and mammalian organisms. Methods of expressing DNA sequences having eukaryotic or viral sequences in prokaryotes are well known in the art. Non-limiting examples of suitable host cells include bacteria, archea, insect, fungi (for example, yeast), plant, and animal cells (for example, mammalian cells, such as human). Exemplary cells of use include Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Salmonella typhimurium, SF9 cells, C129 cells, 293 cells, Neurospora, and immortalized mammalian myeloid and lymphoid cell lines. Techniques for the propagation of mammalian cells in culture are well-known (see, e.g., Helgason and Miller (Eds.), 2012, Basic Cell Culture Protocols (Methods in Molecular Biology), 4.sup.th Ed., Humana Press). Examples of mammalian host cell lines are VERO and HeLa cells, CHO cells, and WI38, BHK, and COS cell lines, although cell lines can be used, such as cells designed to provide higher expression, desirable glycosylation patterns, or other features. In some embodiments, the host cells include HEK293 cells or derivatives thereof, such as GnTI.sup.−/− cells, or HEK-293F cells.
[0245] In some embodiments, the disclosed recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer can be expressed in cells under conditions where the recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer can self-assemble into trimers which are secreted from the cells into the cell media. In such embodiments, each recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer contains a leader sequence (signal peptide) that causes the protein to enter the secretory system, where the signal peptide is cleaved and the protomers form a trimer, before being secreted in the cell media. The medium can be centrifuged and recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer can be purified from the supernatant.
[0246] A nucleic acid molecule encoding a protomer can be included in a viral vector, for example, for expression of the immunogen in a host cell, or for immunization of a subject as disclosed herein. In some embodiments, the viral vectors are administered to a subject as part of a prime-boost vaccination. In several embodiments, the viral vectors are included in a vaccine, such as a primer vaccine or a booster vaccine for use in a prime-boost vaccination.
[0247] In several examples, the viral vector can be replication-competent. For example, the viral vector can have a mutation in the viral genome that does not inhibit viral replication in host cells. The viral vector also can be conditionally replication-competent. In other examples, the viral vector is replication-deficient in host cells.
[0248] A number of viral vectors have been constructed, that can be used to express the disclosed antigens, including polyoma, i.e., SV40 (Madzak et al., 1992, J. Gen. Virol., 73:15331536), adenovirus (Berkner, 1992, Cur. Top. Microbiol. Immunol., 158:39-6; Berliner et al., 1988, Bio Techniques, 6:616-629; Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc. Natl. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143-155; Wilkinson et al., 1992, Nucl. Acids Res., 20:2233-2239; Stratford-Perricaudet et al., 1990, Hum. Gene Ther., 1:241-256), vaccinia virus (Mackett et al., 1992, Biotechnology, 24:495-499), adeno-associated virus (Muzyczka, 1992, Curr. Top. Microbiol. Immunol., 158:91-123; On et al., 1990, Gene, 89:279-282), herpes viruses including HSV and EBV (Margolskee, 1992, Curr. Top. Microbiol. Immunol., 158:67-90; Johnson et al., 1992, J. Virol., 66:29522965; Fink et al., 1992, Hum. Gene Ther. 3:11-19; Breakfield et al., 1987, Mol. Neurobiol., 1:337-371; Fresse et al., 1990, Biochem. Pharmacol., 40:2189-2199), Sindbis viruses (H. Herweijer et al., 1995, Human Gene Therapy 6:1161-1167; U.S. Pat. Nos. 5,091,309 and 5,2217,879), alphaviruses (S. Schlesinger, 1993, Trends Biotechnol. 11:18-22; I. Frolov et al., 1996, Proc. Natl. Acad. Sci. USA 93:11371-11377) and retroviruses of avian (Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Petropouplos et al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. Immunol., 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et al., 1984, Mol. Cell Biol., 4:1730-1737; Mann et al., 1985, J. Virol., 54:401-407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276; Buchschalcher et al., 1992, J. Virol., 66:2731-2739). Baculovirus (Autographa californica multinuclear polyhedrosis virus; AcMNPV) vectors are also known in the art, and can be obtained from commercial sources (such as PharMingen, San Diego, Calif.; Protein Sciences Corp., Meriden, Conn.; Stratagene, La Jolla, Calif).
[0249] In several embodiments, the viral vector can include an adenoviral vector that expresses a protomer of the invention. Adenovirus from various origins, subtypes, or mixture of subtypes can be used as the source of the viral genome for the adenoviral vector. Non-human adenovirus (e.g., simian, chimpanzee, gorilla, avian, canine, ovine, or bovine adenoviruses) can be used to generate the adenoviral vector. For example, a simian adenovirus can be used as the source of the viral genome of the adenoviral vector. A simian adenovirus can be of serotype 1, 3, 7, 11, 16, 18, 19, 20, 27, 33, 38, 39, 48, 49, 50, or any other simian adenoviral serotype. A simian adenovirus can be referred to by using any suitable abbreviation known in the art, such as, for example, SV, SAdV, SAV or sAV. In some examples, a simian adenoviral vector is a simian adenoviral vector of serotype 3, 7, 11, 16, 18, 19, 20, 27, 33, 38, or 39. In one example, a chimpanzee serotype C Ad3 vector is used (see, e.g., Peruzzi et al., Vaccine, 27:1293-1300, 2009). Human adenovirus can be used as the source of the viral genome for the adenoviral vector. Human adenovirus can be of various subgroups or serotypes. For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, and 31), subgroup B (e.g., serotypes 3, 7, 11, 14, 16, 21, 34, 35, and 50), subgroup C (e.g., serotypes 1, 2, 5, and 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 22, 23, 24, 25, 26, 27, 28, 29, 30, 32, 33, 36-39, and 42-48), subgroup E (e.g., serotype 4), subgroup F (e.g., serotypes 40 and 41), an unclassified serogroup (e.g., serotypes 49 and 51), or any other adenoviral serotype. The person of ordinary skill in the art is familiar with replication competent and deficient adenoviral vectors (including singly and multiply replication deficient adenoviral vectors). Examples of replication-deficient adenoviral vectors, including multiply replication-deficient adenoviral vectors, are disclosed in U.S. Pat. Nos. 5,837,511; 5,851,806; 5,994,106; 6,127,175; 6,482,616; and 7,195,896, and International Patent Application Nos. WO 94/28152, WO 95/02697, WO 95/16772, WO 95/34671, WO 96/22378, WO 97/12986, WO 97/21826, and WO 03/02231 1.
[0250] In some embodiments, a virus-like particle (VLP) is provided that comprises a recombinant protomer of the invention. In some embodiments, a virus-like particle (VLP) is provided that includes a recombinant trimer of the invention. Such VLPs can include a recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer that is membrane anchored by a C-terminal transmembrane domain, for example the recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomers in the trimer each can be linked to a transmembrane domain and cytosolic tail from the corresponding coronavirus. VLPs lack the viral components that are required for virus replication and thus represent a highly attenuated, replication-incompetent form of a virus. However, the VLP can display a polypeptide (e.g., a recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer) that is analogous to that expressed on infectious virus particles and can eliciting an immune response to the corresponding coronavirus (e.g. SARS-2) when administered to a subject. Virus like particles and methods of their production are known and familiar to the person of ordinary skill in the art, and viral proteins from several viruses are known to form VLPs, including human papillomavirus, HIV (Kang et al., Biol. Chem. 380: 353-64 (1999)), Semliki-Forest virus (Notka et al., Biol. Chem. 380: 341-52 (1999)), human polyomavirus (Goldmann et al., J. Virol. 73: 4465-9 (1999)), rotavirus (Jiang et al., Vaccine 17: 1005-13 (1999)), parvovirus (Casal, Biotechnology and Applied Biochemistry, Vol 29, Part 2, pp 141-150 (1999)), canine parvovirus (Hurtado et al., J. Virol. 70: 5422-9 (1996)), hepatitis E virus (Li et al., J. Virol. 71: 7207-13 (1997)), and Newcastle disease virus. The formation of such VLPs can be detected by any suitable technique. Examples of suitable techniques known in the art for detection of VLPs in a medium include, e.g., electron microscopy techniques, dynamic light scattering (DLS), selective chromatographic separation (e.g., ion exchange, hydrophobic interaction, and/or size exclusion chromatographic separation of the VLPs) and density gradient centrifugation.
[0251] The immunogens of the invention can be combined with any suitable adjuvant.
[0252] A skilled artisan can readily determine the dose and number of immunizations needed to induce immune response. Various assays are known and used in the art to measure to level, breadth and durability of the induced immune response. In non-limiting embodiments the methods comprise two immunizations. The interval between immunizations can be readily determined by a skilled artisan. In non-limiting embodiments, the first and second immunization are about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 weeks apart.
[0253] In certain embodiments the protein dose is in the range of 1-1000 micrograms. In certain embodiments the protein dose is in the range of 10-1000 micrograms. In certain embodiments the protein dose is in the range of 100-1000 micrograms. In certain embodiments the protein dose is in the range of 100-200 micrograms. In certain embodiments the protein dose is in the range of 100-300 micrograms. In certain embodiments the protein dose is in the range of 100-400 micrograms. In certain embodiments the protein dose is in the range of 100-500 micrograms. In certain embodiments the protein dose is in the range of 100-600 micrograms. In certain embodiments the protein dose is in the range of 50-100 micrograms. In certain embodiments the protein dose is in the range of 50-150 micrograms. In certain embodiments the protein dose is in the range of 50-200 micrograms. In certain embodiments the protein dose is in the range of 50-250 micrograms. In certain embodiments the protein dose is in the range of 50-300 micrograms. In certain embodiments the protein dose is in the range of 50-350 micrograms. In certain embodiments the protein dose is in the range of 50-400 micrograms. In certain embodiments the protein dose is in the range of 50-450 micrograms. In certain embodiments the protein dose is in the range of 50-500 micrograms. In certain embodiments the protein dose is in the range of 50-550 micrograms. In certain embodiments the protein dose is in the range of 50-600 micrograms. In certain embodiments the protein dose is in the range of 75-100 micrograms. In certain embodiments the protein dose is in the range of 75-125 micrograms. In certain embodiments the protein dose is in the range of 75-150 micrograms. In certain embodiments the protein dose is in the range of 75-175 micrograms. In certain embodiments the protein dose is in the range of 75-200 micrograms. In certain embodiments the protein dose is in the range of 75-225 micrograms. In certain embodiments the protein dose is in the range of 75-250 micrograms. In certain embodiments the protein dose is 10, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 700, 750, 800, 850, 900, 950 or 1000 micrograms.
[0254] In certain embodiments adjuvant dose is in the range of 1-200 micrograms. In certain embodiments adjuvant dose is in the range of 1-100 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-25 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-20 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-15 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-10 micrograms. In certain embodiments the adjuvant dose is 1-5 micrograms. In certain embodiments the adjuvant dose is 5-10 micrograms. In certain embodiments the adjuvant dose is 5-15 micrograms. In certain embodiments the adjuvant dose is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, or 45-50 micrograms. Non-limiting examples of evaluating the immunogenicity and effectiveness of the immunogens of the invention are shown in US Patent Pub 20200061185 which disclosure is incorporated by reference in its entirety.
TABLE-US-00001 TABLE 1 Cleaved and uncleaved unstabilized soluble Spike proteins that lack the foldon trimerization domain and lack 2 prolines to stabilize the trimer. FIG. 25A shows non-limiting embodiments of nucleic acids and FIG. 25I shows non-limiting embodiments of amino acid sequences. HV1301945v2 SARS-2 Cleaved soluble Spike_bPrlss_3C_6XHis HV1301946 SARS-2 Cleaved soluble Spike_3C_6XHis HV1301947v2 SARS-2 C-soluble Spike_bPrlss_3C_6XHis HV1301948 SARS-2 C-soluble Spike_3C_6XHis
TABLE-US-00002 TABLE 2 Cleaved and uncleaved unstabilized cell-surface Spike proteins that lack the foldon trimerization domain and lack 2 prolines to stabilize the trimer. FIG. 25B shows non-limiting embodiments of nucleic acids and FIG. 25J shows non-limiting embodiments of amino acid sequences. HV1301949 SARS-2 Cleaved membrane Spike HV1301950v2 SARS-2 Cleaved membrane Spike_bPrlss HV1301951 SARS-2 C-membrane Spike HV1301952v2 SARS-2 C-membrane Spike_bPrlss
TABLE-US-00003 TABLE 3 Cleaved and uncleaved soluble Spike proteins stabilized by the foldon trimerization domain but lacks 2 prolines to stabilize the trimer. FIG. 25C shows non-limiting embodiments of nucleic acids and FIG. 25K shows non-limiting embodiments of amino acid sequences. HV1301953v2 SARS-2 Cleaved soluble Spike_bPrlss_foldon_3C_6XHis HV1301954 SARS-2 Cleaved soluble Spike_foldon_3C_6XHis HV1301955v2 SARS-2 C-soluble Spike_bPrlss_foldon_3C_6XHis HV1301956 SARS-2 C-soluble Spike_foldon_3C_6XHis
TABLE-US-00004 TABLE 4 Cleaved and uncleaved soluble Spike proteins stabilized by the addition of 2 prolines. FIG. 25D shows non-limiting embodiments of nucleic acids and FIG. 25L shows non-limiting embodiments of amino acid sequences. HV1301964 SARS-2 Cleaved soluble Spike_bPrlss_K986P + V987P_3C_6XHis HV1301965 SARS-2 Cleaved soluble Spike_K986P + V987P_3C_6XHis HV1301966 SARS-2 C-soluble Spike_bPrlss_K986P + V987P_3C_6XHis HV1301967 SARS-2 C-soluble Spike_K986P + V987P_3C_6XHis
TABLE-US-00005 TABLE 5 Cleaved and uncleaved stabilized cell-surface Spike proteins that lack the foldon trimerization domain and are stabilized by the addition of 2 prolines. FIG. 25E shows non-limiting embodiments of nucleic acids and FIG. 25M shows non-limiting embodiments of amino acid sequences. HV1301968 SARS-2 Cleaved membrane Spike_K986P + V987P HV1301969 SARS-2 Cleaved membrane Spike_bPrlss_K986P + V987P HV1301970 SARS-2 C-membrane Spike_K986P + V987P HV1301971 SARS-2 C-membrane Spike_bPrlss_K986P + V987P
TABLE-US-00006 TABLE 6 Soluble Spike proteins stabilized by the foldon trimerization domain and the addition of 2 prolines. FIG. 25F shows non-limiting embodiments of nucleic acids and FIG. 25N shows non-limiting embodiments of amino acid sequences. HV130197 SARS-2 Cleaved soluble 2 Spike_bPrlss_foldon_K986P + V987P_3C_6XHis HV130197 SARS-2 Cleaved soluble 3 Spike_foldon_K986P + V987P_3C_6XHis HV130197 SARS-2 C-soluble 4 Spike_bPrlss_foldon_X986P + V987P_3C_6XHis HV130197 SARS-2 C-soluble Spike_foldon_K986P + V987P_3C 6XHis 5
TABLE-US-00007 TABLE 7 Soluble Spike proteins stabilized by the foldon trimerization domain, the addition of 2 prolines, and additional cysteine bonds. Non-limiting embodiments of sequences are shown in FIG. 8 and FIG. 25O. HV1301963_HV1301976 nCoV-1 nCoV-2P_S383C_D985C HV1301977 nCoV-1 nCoV-2P_S383C_A570C_ G669C_T866C_L966C_D985C HV1301978 nCoV-1 nCoV-2P_K41C_A520C HV1301979 nCoV-1 nCoV-2P_F43C_S383C_ G566C_G669C_T866C_D985C HV1301980 nCoV-1 nCoV-2P_K41C_A520C_ A570C_G669C_T866C_L966C
TABLE-US-00008 TABLE 8 Cell-surface Spike proteins stabilized by the addition of 2 prolines and additional cysteine bonds. FIG. 25G and 25P shows non-limiting embodiments of amino acid sequences. HV1301962 SARS CoV-2 membrane S protein_D985C + S383C_K986P + V987P
TABLE-US-00009 TABLE 9A Multimeric nanoparticle immunogens. FIG. 25H shows non-limiting embodiments of nucleic acids and FIG. 25Q shows non-limiting embodiments of amino acid sequences. HV1301985 RBDferritin_v1_3CHis HV1301986 RBDferritin_v2_3CHis HV1301987 SARS-2S-foldonferritin_v1_3CHis HV1301988 SARS-2S-foldonferritin_v2_3CHis HV1301989 SARS-2_RIS_ferritin_v1_3CHis HV1301990 SARS-2_RIS_ferritin_v2_3CHis HV1301991 SARS-2_RISx3_ferritin_v1_3CHis HV1301992 SARS-2_RISx3_ferritin_v2_3CHis
TABLE-US-00010 TABLE 9B Summary of sequences from FIG.s 10A-M A non-limiting embodiment of a sequence is shown in Name Figure. rS2d plus S2 stabilization: A non-limiting S730L + T778V embodiment of a T734I + Q1011L sequence is shown in T734I + Q1011L + Y1007F FIG. 10A-10H T8811 + Q901L + R905Y N907L + Q913I + E1092I N907L + Q913I + E1092F S730L + T778V + N907L + Q913I + E1092I T734I + Q1011L + N907L + Q913I + E1092I rS2d plus SD2 to S2: A non-limiting G669C + T866C embodiment of a T866C + G669C sequence is shown in FIG. 10I rS2d plus S2 stabilization and SD2 to S2: A non-limiting S730L + T778V + G669C + T866C embodiment of a T734I + Q1011L + T866C + G669C sequence is shown in S730L + T778V + N907L + Q913I + FIG. 10J-M E1092I + G669C + T866C T734I + Q1011L + N907L + Q913I + E1092I + T866C + G669C
TABLE-US-00011 TABLE 9C Summary of sequences of cluster mutations from FIG. 8. A non-limiting embodiment of a sequence is shown in Group Figure. Cluster 1 FIG. 8B- Cluster 2 FIG. 8C Cluster 3 FIG. 8D Cluster 4 FIG. 8E Cluster 5 FIG. 8F Cluster 6 FIG. 8G Cluster 7 FIG. 8H Cluster 8 FIG. 8I Cluster 9 FIG. 8J Cluster 10 FIG. 8K Cluster 11 FIG. 8L
EXAMPLES
Example 1A
[0255] Any of the SARS-2 designs, including without limitation as listed in
Example 1B
[0256] SARS-2 designs expressed as nucleic acids or proteins will be expressed, characterized and tested for antigenicity and immunogenicity. Immuonogenicity studies include animal challenge studies. A non-limiting embodiment of an animal study is outlined in Example 2.
Example 2
[0257] Animal study NHP #174: non-human primates (NHPs) are immunized with SARS-2 immunogen designs of the invention. Immune response was evaluated and animals were challenge with SARS-2 stock. The animal study design and immunogen are summarized in
[0258] Data from the animal study are summarized in
[0259] These results show that immunization with the disulfide-stabilized spike ectodomain mRNA-LNP in rhesus macaques elicited IgG antibodies against the receptor binding, N-terminal, and S2 domains of SARS-CoV-2 spike protein. The serum from disulfide-stabilized spike ectodomain mRNA-LNP-immunized macaques blocked ACE2 binding to the receptor domain of SARS-CoV-2 spike protein. Consistent with blocking the ACE2 receptor binding to SARS-CoV-2 spike, the serum neutralized both pseudotyped virus and replication-competent SARS-CoV-2. The vaccine-induced immunity suppressed SARS-CoV-2 replication in the lower respiratory tract and to a lesser extent in the upper respiratory tract. Additionally, inflammatory cytokine production in the lung was decreased in disulfide-stabilized spike ectodomain mRNA-LNP-immunized compared to macaques that received mRNA-LNP encoding an irrelevant protein. Thus, immunization with disulfide-stabilized spike ectodomain mRNA-LNP-immunized generated immunity that protected against SARS-CoV-2 infection.
[0260] Further analyses of the animal study include immunogenicity, levels of antibodies, types of antibodies—neutralizing or not, serum neutralization of pseudo-virus, diversity of epitopes targeted by the induced antibodies, protection after challenge with virus, and any other suitable assay.
Example 3A
[0261] Controlling the SARS-CoV-2 Spike Glycoprotein Conformation
[0262] Abstract
[0263] The coronavirus (CoV) viral host cell fusion spike (S) protein is the primary immunogenic target for virus neutralization and the current focus of many vaccine design efforts. The highly flexible S-protein, with its mobile domains, presents a moving target to the immune system. Here, to better understand S-protein mobility, we implemented a structure-based vector analysis of available β-CoV S-protein structures. We found that despite overall similarity in domain organization, different β-CoV strains display distinct S-protein configurations. Based on this analysis, we developed two soluble ectodomain constructs in which the highly immunogenic and mobile receptor binding domain (RBD) is locked in the all-RBDs ‘down’ position or is induced to display a previously unobserved in SARS-CoV-2 2-RBDs ‘up’ configuration. These results demonstrate that the conformation of the S-protein can be controlled via rational design and provide a framework for the development of engineered coronavirus spike proteins for vaccine applications.
INTRODUCTION
[0264] The ongoing global pandemic of the new SARS-CoV-2 (SARS-2) coronavirus presents an urgent need for the development of effective preventative and treatment therapies. The viral S-protein is a prime target for such therapies owing to its critical role in the virus lifecycle. The S-protein is divided into two regions: an N-terminal S1 domain that caps the C-terminal S2 fusion domain. Binding to host receptor via the Receptor Binding Domain (RBD) in S1 is followed by proteolytic cleavage of the spike by host proteases.sup.1. Large conformational changes in the S-protein result in S1 shedding and exposure of the fusion machinery in S2. Class I fusion proteins, such as the CoV-2 S-protein, undergo large conformational changes during the fusion process and must, by necessity, be highly flexible and dynamic. Indeed, cryo-electron microscopy (cryo-EM) structures of SARS-2 spike reveal considerable flexibility and dynamics in the S1 domain.sup.1,2, especially around the RBD that exhibits two discrete conformational states—a ‘down’ state that is shielded from receptor binding, and an ‘up’ state that is receptor-accessible.
[0265] The wealth of structural information for β-CoV spike proteins, including the recently determined cryo-EM structures of the SARS-2 spike.sup.1-11, has provided a rich source of detailed geometric information from which to begin precise examination of the macromolecular transitions underlying triggering of this fusion machine. The transmembrane CoV S-protein spike trimer is composed of interwoven protomers that include an N-terminal receptor binding S1 domain and a C-terminal S2 domain that contains the fusion elements (
[0266] In this study we have quantified the variability in the S1 and S2 geometric arrangements to reveal important regions of flexibility to consider and to target for structure-based immunogen design. Based on these analyses, we have designed mutations that alter the conformational distribution of the domains in the S-protein. We visualized the effect of our designs using a structural determination pipeline relying first on single particle analysis by negative stain electron microscopy (NSEM) for rapid and low-cost assessment of the spike ectodomains at low resolution, followed by cryo-EM for high-resolution information on the changes introduced by these mutations. Our results reveal a heterogeneous conformational landscape of the SARS-CoV-2 spike that is highly susceptible to modification by the introduction of mutations at sites of contact between the S1 and S2 domains. We also present data on modified SARS-2 ectodomain constructs stabilized in conformations that have not yet been seen in the current available structures, with great interest and direct application in vaccine design.
[0267] Results
[0268] Detailed Structural Schema Defining the Geometry and Internal Rearrangements of Movable Domains of the SARS-2 Spike.
[0269] To characterize the unique arrangement of distinct domains in the CoV spike, we first aimed to develop a precise quantitative definition of their relative positions. Examination of available SARS and MERS S-protein structures revealed: 1) the NTD and RBD subdomains and internal S2 domain move as rigid bodies, and 2) these domains display a remarkable array of relative shifts between the domains in the S1 region and the S2 region's β-sheet motif and connector domain (CD) (
[0270] Identification of Sites for Differential Stabilization of the SARS-2 Ectodomain Spike RBD Orientation.
[0271] Based on the observed variability in the geometric analysis of β-CoV spikes, we asked whether the propensity for the RBD to display the ‘down’ and ‘up’ states can be modified via mutations without altering exposed antigenic surfaces. To this end, we identified protomer to protomer interactive sites amenable to modification and down selected mutations at these sites using the Schrödinger Biologics suite. In an effort to eliminate exposure of the receptor binding site of the RBD, we examined the potential for disulfide linkages between the RBD and its contact with S2 near the C-terminus of HR1 to prevent RBD exposure. We identified a double cysteine mutant, S383C and D985C (RBD to S2 double mutant; rS2d;
[0272] NSEM Analysis of the SARS-2 Spike Ectodomain Proteins.
[0273] To assess the quality of the purified spike proteins and to obtain low resolution visualization of the structures, we performed NSEM analysis. The micrographs showed a reasonably uniform distribution of particles consistent with the size and shape of the SARS-2 spike ectodomain (
[0274] Cryo-EM Analysis of the SARS-2 Spike Ectodomain Proteins.
[0275] To visualize the mutations and their effect on the structure of the spike, we collected cryo-EM datasets for the rS2d and u1S2q constructs (
[0276] In contrast to the rS2d design, the u1S2q design displayed widespread rearrangement of the S1 domains (
TABLE-US-00012 TABLE 10 Cryo-EM Data Collection and Refinement Statistics SARS-2 spike u1S2q construct construct r2S2d 1-RBD 2-RBD Conformation ‘down’ ‘down’ ‘up’ ‘up’ Data Collection FEI Titan FEI Titan Microscope Krios Krios Voltage (kV) 300 300 Electron dose (e.sup.−/Å.sup.2) 65.18 66.82 Detector Gatan K3 Gatan K3 Pixel Size (Å) 1.06 1.058 Defocus Range (μm) 0.63-2.368 0.55-2.94 Magnification 81000 81000 Micrographs 6021 7232 Collected Reconstruction Software cryoSPARC cryoSPARC Particles 367,259 192,430 255,013 133,957 Symmetry C3 C3 C1 C1 Box size (pix) 300 300 300 300 Resolution (Å).sup.$ 2.7 3.2 3.3 3.6 Corrected Refinement (Phenix) .sup.# Protein residues 2916 2913 2875 2862 Resolution (FSC.sub.0.5) 2.9 3.3 3.7 3.8 EMRinger Score 3.11 3.02 1.33 2.69 R.m.s. deviations Bond lengths (Å) 0.009 0.005 0.013 0.011 Bond angles (º) 1.2 0.859 1.276 1.272 Validation Molprobity score 1.58 1.52 0.75 1.84 Clash score 3.93 4.57 0.41 6.6 Favored rotamers (%) 99.41 98.75 99.34 97.46 Ramachandran Favored regions (%) 94.23 95.88 97.5 92.37 Disallowed regions 0 0.04 0.07 0.11 (%) .sup.$Resolutions are reported according to the FSC 0.143 gold-standard criterion
DISCUSSION
[0277] Conformational plasticity is a hallmark of enveloped-virus fusion-protein structure, owing to the necessity of protecting the conserved viral fusion elements from host immune responses while retaining a sufficiently steep free-energy gradient to enable host cell fusion.sup.16. Exposed elements can be well conditioned to be permissive and responsive to mutations through genetic drift and host immune adaptation. Conformational plasticity, however, presents an important difficulty in the context of vaccine and drug design. Indeed, lessons learned in the continued effort to produce a broadly protective HIV-1 vaccine have demonstrated the importance of a detailed understanding and control of fusion protein dynamics.sup.17-28. The new SARS-CoV-2 is no exception in this regard and indeed the conformational plasticity of the SARS-2 S-protein appeared greater than that of the HIV-1 Env. We aimed to develop a quantitative understanding of β-CoV structural states between strains and within each RBD down and up state configuration. The wide breadth of domain arrangements along with the relatively small contact area between the S1 and S2 subunits observed here indicated that, despite a relatively low mutation rate, dramatic changes in S-protein structure can occur from few mutations. Indeed, recent evidence for a mutation in the SD2 to S2 contact region indicates a fitness gain for acquisition of such interfacial residues.sup.29. Based upon our results, this mutant, D614G, can indeed alter the conformational landscape of the SARS-CoV-2 S-protein.
[0278] From the perspective of immunogen development, the constructs developed here present an opportunity to examine the ability of differentially stabilized S-protein particles to induce two different, yet important antibody responses. First, without wishing to be bound by theory, the disulfide linked ‘down’ state locked double mutant (rS2d) can eliminate receptor binding site targeting antibodies which make up the majority of observed responses.sup.30,31. Indeed, a study of MERS responses indicate non-RBD responses (such as NTD and S2 epitopes) will play an important role in vaccine induced protection.sup.32. From a theoretical perspective, the wide control over the RBD ‘up’/‘down’ distribution available to the virus indicates that, by analogy to known difficult to neutralize HIV-1 strains, conformational blocking of antibody responses is not be unusual. Although this can result in a fitness cost to the virus, it does not necessarily make the virion non-infectious. Using the double mutant rS2d as an immunogen provides a platform from which to induce such non-RBD responses that can be needed to protect against such an evasion. The second area of interest comprises cryptic pocket targeting antibodies which have proven effective in the neutralization of SARS. These antibodies target an epitope presented only in the ‘up’ state RBDs and appear to require a two RBD ‘up’ configuration.sup.33. The current stabilized ectodomain construct in wide use in SARS-CoV-2 clinical trials was demonstrated previously, and recapitulated here by NSEM, to display only the ‘down’ and one RBD ‘up’ states. However, the u1S2q, SD1/S2 targeting design developed here display a prominent two RBD ‘up’ state distribution compatible with these cryptic-epitope targeting MAbs. This indicates it can induce such antibodies. While complicating factors, such as vaccine enhancement, can favor the use of truncated, single domain constructs which can display fewer weakly or non-neutralizing epitopes, these, along with the designs presented here will allow for a detailed characterization of not only vaccine immunogenicity but also antigenicity, paving the way for next generation vaccines for the new SARS-CoV-2 and the development of a broadly neutralizing β-CoV vaccine. Thus, while the previous generation of stabilizing mutations ensure well folded trimer, the rational design approach developed here provides a means by which precisely controlling the RBD orientation distribution, thus allowing exploratory efforts to understand the role of conformational dynamics from the perspective of vaccine and drug development.
[0279] Methods
[0280] Vector Based Analysis
[0281] Vector analysis was performed using available cryo-EM structures for SARS-2.sup.13,14, SARS.sup.4,5,7,8, MERS.sup.4,12, and other human.sup.2,10 and murine.sup.11 β-CoV spike proteins. Domains for the vector analysis were selected based upon visual inspection of alignments between SARS, MERS, and SARS-CoV-2 structures. Specifically, C.sub.α centroids for the S1 NTD, RBD, SD1, SD2 (SARS-CoV-2 residues, 27-43 and 54-271, 330-443 and 503-528, 323-329 and 529-590, 294-322 and 591-696, respectively; equivalent SARS/MERS/Murine/HKU1/OC43 residues selected based upon structural alignment with SARS-CoV-2) as well as a β-sheet motif in the NTD (residues 116-129 and 169-172) and a helix motif in the RBD (residues 403-410) were determined. The NTD was split into two regions with the SD1 contacting, SD2 adjacent portion referred to here as the NTD′ (residues 44-53 and 272-293). C.sub.α centroids in the S2 domain were obtained for a β-sheet motif (residues 717-727 and 1047-1071) and the CD domain (711-716 and 1072-1122). Vector magnitudes, angles, and dihedrals between these centroids were determined and used in the subsequent analysis. Vector analysis was performed using the VMD.sup.34 Tcl interface. Principal component analysis performed in R with the vector data centered and scaled.sup.35.
[0282] Rational, Structure-Based Design
[0283] Structures for SARS (PDB ID 5X58.sup.4), MERS (PDB ID 6Q04.sup.36), and SARS-CoV-2 (PDB ID 6VXX.sup.15) were prepared in Maestro.sup.37 using the protein preparation wizard.sup.38 followed by in silico mutagenesis using Schrödinger's cysteine mutation.sup.39 and residue scanning.sup.40 tools. Residue scanning was first performed for individual selected sites allowing mutations to Leu, Ile, Trp, Tyr, and Val followed by scanning of combinations for those which yielded a negative overall score. Scores and visual inspection were used in the selection of the prepared constructs.
[0284] Protein Expression and Purification
[0285] The SARS-CoV-2 ectodomain constructs were produced and purified as described previously. Briefly, a gene encoding residues 1-1208 of the SARS-CoV-2 S (GenBank: MN908947) with proline substitutions at residues 986 and 987, a “GSAS” (SEQ ID NO: 3) substitution at the furin cleavage site (residues 682-685), a C-terminal T4 fibritin trimerization motif, an HRV3C protease cleavage site, a TwinStrepTag and an 8XHisTag (SEQ ID NO: 4) was synthesized and cloned into the mammalian expression vector pαH. All mutants were introduced in this background. expression plasmids encoding the ectodomain sequence were used to transiently transfect FreeStyle293F cells using Turbo293 (SpeedBiosystems). Protein was purified on the sixth day post transfection from the filtered supernatant using StrepTactin resin (IBA).
[0286] Cryo-EM Sample Preparation and Data Collection
[0287] Purified SARS-CoV-2 spike preparations were diluted to a concentration of ˜1 mg/mL in 2 mM Tris pH 8.0, 200 mM NaCl and 0.02% NaN3. 2.5 uL of protein was deposited on a CF-1.2/1.3 grid that had been glow discharged for 30 seconds in a PELCO easiGlow™ Glow Discharge Cleaning System. After a 30 s incubation in >95% humidity, excess protein was blotted away for 2.5 seconds before being plunge frozen into liquid ethane using a Leica EM GP2 plunge freezer (Leica Microsystems). Frozen grids were imaged in a Titan Krios (Thermo Fisher) equipped with a K3 detector (Gatan). Data were acquired using the Leginon system.sup.41. The dose was fractionated over 50 raw frames and collected at 50 ms framerate. This dataset was energy-filtered with a slit width of 30 eV. Individual frames were aligned and dose-weighted. CTF estimation, particle picking, 2D classifications, ab initio model generation, heterogeneous refinements, and homogeneous 3D refinements were carried out in cryoSPARC.sup.42.
[0288] Cryo-EM Structure Fitting
[0289] Structures of the all ‘down’ state (PDB ID 6VXX) and single RBD ‘up’ state (PDB ID 6VYB) from the previously published SARS-CoV-2 ectodomain were used to fit the cryo-EM maps in Chimera.sup.43. The 2 RBD ‘up’ state was generated in PyMol using the single RBD ‘up’ state structure. Mutations were made in PyMol.sup.44. Coordinates were then fit manually in Coot.sup.45 following iterative refinement using Phenix.sup.46 real space refinement and subsequent manual coordinate fitting in Coot. Structure and map analysis was performed using PyMol and Chimera.
REFERENCES
[0290] 1 Hoffmann, M. et al. SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor. Cell 181, 271-280 e278, doi:10.1016/j.cell.2020.02.052 (2020). [0291] 2 Kirchdoerfer, R. N. et al. Pre-fusion structure of a human coronavirus spike protein. Nature 531, 118-121, doi:10.1038/nature17200 (2016). [0292] 3 Wrapp, D. et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367, 1260-1263, doi:10.1126/science.abb2507 (2020). [0293] 4 Yuan, Y. et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nature Communications 8, 15092, doi:10.1038/ncomms15092 (2017). [0294] 5 Gui, M. et al. Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding. Cell Research 27, 119-129, doi:10.1038/cr.2016.152 (2017). [0295] 6 Song, W., Gui, M., Wang, X. & Xiang, Y. Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLOS Pathogens 14, e1007236, doi:10.1371/journal.ppat.1007236 (2018). [0296] 7 Kirchdoerfer, R. N. et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Scientific Reports 8, 15701, doi:10.1038/s41598-018-34171-7 (2018). [0297] 8 Walls, A. C. et al. Unexpected Receptor Functional Mimicry Elucidates Activation of Coronavirus Fusion. Cell 176, 1026-1039.e1015, doi:https://doi.org/10.1016/j.cell.2018.12.028 (2019). [0298] 9 Pallesen, J. et al. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proceedings of the National Academy of Sciences 114, E7348, doi:10.1073/pnas.1707304114 (2017). [0299] 10 Tortorici, M. A. et al. Structural basis for human coronavirus attachment to sialic acid receptors. Nature Structural & Molecular Biology 26, 481-489, doi:10.1038/s41594-019-0233-y (2019). [0300] 11 Walls, A. C. et al. Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer. Nature 531, 114-117, doi:10.1038/nature16988 (2016). [0301] 12 Pallesen, J. et al. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proc Natl Acad Sci USA 114, E7348-E7357, doi:10.1073/pnas.1707304114 (2017). [0302] 13 Walls, A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell, doi:https://doi.org/10.1016/j.cell.2020.02.058 (2020). [0303] 14 Wrapp, D. et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367, 1260, doi:10.1126/science.abb2507 (2020). [0304] 15 Walls, A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181, 281-292 e286, doi:10.1016/j.cell.2020.02.058 (2020). [0305] 16 Rey, F. A. & Lok, S.-M. Common Features of Enveloped Viruses and Implications for Immunogen Design for Next-Generation Vaccines. Cell 172, 1319-1334, doi:10.1016/j.cell.2018.02.054 (2018). [0306] 17 de Taeye, Steven W. et al. Immunogenicity of Stabilized HIV-1 Envelope Trimers with Reduced Exposure of Non-neutralizing Epitopes. Cell 163, 1702-1715, doi:https://doi.org/10.1016/j.cell.2015.11.056 (2015). [0307] 18 He, L. et al. HIV-1 vaccine design through minimizing envelope metastability. Science advances 4, eaau6769-eaau6769, doi:10.1126/sciadv.aau6769 (2018). [0308] 19 Zhang, P. et al. Interdomain Stabilization Impairs CD4 Binding and Improves Immunogenicity of the HIV-1 Envelope Trimer. Cell Host & Microbe 23, 832-844.e836, doi:https://doi.org/10.1016/j.chom.2018.05.002 (2018). [0309] 20 Chuang, G.-Y. et al. Structure-Based Design of a Soluble Prefusion-Closed HIV-1 Env Trimer with Reduced CD4 Affinity and Improved Immunogenicity. Journal of Virology 91, doi:10.1128/JVI.02268-16 (2017). [0310] 21 Torrents de la Pella, A. et al. Improving the Immunogenicity of Native-like HIV-1 Envelope Trimers by Hyperstabilization. Cell reports 20, 1805-1817, doi:10.1016/j.celrep.2017.07.077 (2017). [0311] 22 Medina-Ramirez, M. et al. Design and crystal structure of a native-like HIV-1 envelope trimer that engages multiple broadly neutralizing antibody precursors in vivo. The Journal of Experimental Medicine 214, 2573, doi:10.1084/jem.20161160 (2017). [0312] 23 Steichen, J. M. et al. HIV Vaccine Design to Target Germline Precursors of Glycan-Dependent Broadly Neutralizing Antibodies. Immunity 45, 483-496, doi:10.1016/j.immuni.2016.08.016 (2016). [0313] 24 Kulp, D. W. et al. Structure-based design of native-like HIV-1 envelope trimers to silence non-neutralizing epitopes and eliminate CD4 binding. Nature Communications 8, 1655, doi:10.1038/s41467-017-01549-6 (2017). [0314] 25 Yang, L. et al. Structure-Guided Redesign Improves NFL HIV Env Trimer Integrity and Identifies an Inter-Protomer Disulfide Permitting Post-Expression Cleavage. Frontiers in Immunology 9, 1631 (2018). [0315] 26 Sharma, S. K. et al. Cleavage-independent HIV-1 Env trimers engineered as soluble native spike mimetics for vaccine design. Cell reports 11, 539-550, doi:10.1016/j.celrep.2015.03.047 (2015). [0316] 27 Guenaga, J. et al. Structure-Guided Redesign Increases the Propensity of HIV Env To Generate Highly Stable Soluble Trimers. Journal of Virology 90, 2806, doi:10.1128/JVI.02652-15 (2016). [0317] 28 Sliepen, K. et al. Structure and immunogenicity of a stabilized HIV-1 envelope trimer based on a group-M consensus sequence. Nature communications 10, 2355-2355, doi:10.1038/s41467-019-10262-5 (2019). [0318] 29 Korber, B. et al. Spike mutation pipeline reveals the emergence of a more transmissible form of SARS-CoV-2. bioRxiv, 2020.2004.2029.069054, doi:10.1101/2020.04.29.069054 (2020). [0319] 30 Zost, S. J. et al. Rapid isolation and profiling of a diverse panel of human monoclonal antibodies targeting the SARS-CoV-2 spike protein. bioRxiv, 2020.2005.2012.091462, doi:10.1101/2020.05.12.091462 (2020). [0320] 31 Brouwer, P. J. M. et al. Potent neutralizing antibodies from COVID-19 patients define multiple targets of vulnerability. bioRxiv, 2020.2005.2012.088716, doi:10.1101/2020.05.12.088716 (2020). [0321] 32 Wang, L. et al. Importance of Neutralizing Monoclonal Antibodies Targeting Multiple Antigenic Sites on the Middle East Respiratory Syndrome Coronavirus Spike Glycoprotein To Avoid Neutralization Escape. Journal of virology 92, e02002-02017, doi:10.1128/JVI.02002-17 (2018). [0322] 33 Yuan, M. et al. A highly conserved cryptic epitope in the receptor binding domains of SARS-CoV-2 and SARS-CoV. Science 368, 630, doi:10.1126/science.abb7269 (2020). [0323] 34 Humphrey, W., Dalke, A. & Schulten, K. VMD: Visual molecular dynamics. Journal of Molecular Graphics 14, 33-38, doi:https://doi.org/10.1016/0263-7855(96)00018-5 (1996). [0324] 35 Team, R. C. R: A Language and Environment for Statistical Computing. (2017). [0325] 36 Park, Y.-J. et al. Structures of MERS-CoV spike glycoprotein in complex with sialoside attachment receptors. Nature Structural & Molecular Biology 26, 1151-1157, doi:10.1038/s41594-019-0334-7 (2019). [0326] 37 Schrödinger Release 2020-1: Maestro (Schrödinger, LLC, New York, NY, 2020). [0327] 38 Madhavi Sastry, G., Adzhigirey, M., Day, T., Annabhimoju, R. & Sherman, W. Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments. Journal of Computer-Aided Molecular Design 27, 221-234, doi:10.1007/s10822-013-9644-8 (2013). [0328] 39 Salam, N. K., Adzhigirey, M., Sherman, W. & Pearlman, D. A. Structure-based approach to the prediction of disulfide bonds in proteins. Protein Engineering, Design and Selection 27, 365-374, doi:10.1093/protein/gzu017 (2014). [0329] 40 Beard, H., Cholleti, A., Pearlman, D., Sherman, W. & Loving, K. A. Applying Physics-Based Scoring to Calculate Free Energies of Binding for Single Amino Acid Mutations in Protein-Protein Complexes. PLOS ONE 8, e82849, doi:10.1371/journal.pone.0082849 (2013). [0330] 41 Suloway, C. et al. Automated molecular microscopy: the new Leginon system. J Struct Biol 151, 41-60, doi:10.1016/j.jsb.2005.03.010 (2005). [0331] 42 Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat Methods 14, 290-296, doi:10.1038/nmeth.4169 (2017). [0332] 43 Pettersen, E. F. et al. UCSF Chimera—A visualization system for exploratory research and analysis. Journal of Computational Chemistry 25, 1605-1612, doi:10.1002/jcc.20084 (2004). [0333] 44 Schrodinger, L. The PyMOL Molecular Graphics System. (2015). [0334] 45 —Features and development of Coot. —Acta crystallographica. Section D, Biological crystallography -66, -486-501, doi:-(2010). [0335] 46 Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallographica Section D 74, 531-544, doi:10.1107/S2059798318006551 (2018).
Example 3B
[0336] Without being bound by theory, the results in Example 2 indicate that the rs2d can require further modifications.
[0337] Rs2d designs comprising hexapro mutations are evaluated and discussed in Edwards et al. Nature Structural & Molecular Biology volume 28, pages128-131(2021).
[0338]
[0339] Any of the SARS-2 designs will be expressed as nucleic acids or proteins will be expressed, characterized and tested for antigenicity and immunogenicity. Immuonogenicity studies include animal challenge studies.
Example 4
[0340] Glycans on the SARS-CoV-2 Spike Control the Receptor Binding Domain
[0341] Conformation
[0342] Abstract
[0343] The glycan shield of the beta-coronavirus (β-CoV) Spike (S) glycoprotein provides protection from host immune responses, acting as a steric block to potentially neutralizing antibody responses. The conformationally dynamic S-protein is the primary immunogenic target of vaccine design owing to its role in host-cell fusion, displaying multiple receptor binding domain (RBD) ‘up’ and ‘down’ state configurations. Here, we investigated the potential for RBD adjacent, N-terminal domain (NTD) glycans to influence the conformational equilibrium of these RBD states. Using a combination of antigenic screens and high-resolution cryo-EM structure determination, we show that an N-glycan deletion at position 234 results in a dramatically reduced population of the ‘up’ state RBD position. Conversely, glycan deletion at position N165 results in a discernable increase in ‘up’ state RBDs. This indicates the glycan shield acts not only as a passive hinderance to antibody meditated immunity but also as a conformational control element. Together, our results demonstrate this highly dynamic conformational machine is responsive to glycan modification with implications in viral escape and vaccine design.
[0344] Introduction
[0345] The ongoing SARS-CoV-2 (SARS-2) pandemic presents an urgent need for the development of a protective vaccine. The primary immunogenic target for the vaccines in development is the viral transmembrane S-protein trimer. Each protomer of the trimer is split into an N-terminal receptor binding S1 subunit and a C-terminal fusion element containing S2 subunit, demarcated by the presence of a host protease cleavage site. The S1 subunit is further split into an N-terminal domain (NTD), two subdomains (SD1 and SD2) as well as the receptor binding domain (RBD) that together cap the conserved elements of the S2 subunit. The fusion event is marked by the shedding of the S1 subunit and large conformational transitions in the S2 subunit. The necessity to maintain a large free energy gradient between the prefusion, immune protective state of the molecule and the post-fusion state results in a highly dynamic macromolecular structure. The S1 subunit is dynamic, presenting the RBD in two distinct states: a receptor binding site occluded ‘down’ state in which the RBDs rest against their adjacent protomer's NTD, and a receptor binding site exposed ‘up’ state. It is this RBD ‘up’ state to which the majority of neutralizing responses are observed in convalescent SARS-2 infected individuals′. As conformational evasion is a well-known virus escape mechanism, it is critical to understand the mechanism by which the dynamics are controlled.
[0346] Structural studies of the β-CoV S-protein have focused primarily on a soluble, ectodomain construct with and without stabilizing proline mutations (2P). This includes structures for SARS-2.sup.1,2, SARS.sup.3-7, MERS.sup.3,8, and other human.sup.9,10 and murine.sup.11 β-CoV ectodomains. Structures for the SARS and MERS ectodomains revealed the presence of one and two RBD ‘up’ states with a three RBD ‘up’ state observed in the MERS ectodomain demonstrating the breadth of RBD configurations available to the spike. Interestingly, these states were not observed in the human β-CoVs HKU1 and OC43 nor in a Murine β-CoV, indicating mutations in the spike protein can confer dramatic differences in the propensity of the RBD to sample its available conformational space.
[0347] Our quantitative examination of the available β-CoV S-protein structures recently revealed the S1 and S2 subunit domains of different β-CoV viruses occupy a diverse array of configurations.sup.12. Based upon this analysis we predicted the S-protein conformation was sensitive to mutations at the interfaces between domains and subunits. Indeed, mutations at these sites had major impacts on the configuration of the protein, especially on the RBD ‘up’/‘down’ distribution.sup.12. While these and other studies.sup.13,14,15 have demonstrated the role of protein-protein contacts in determining the conformation of the S-protein, the influence on RBD configuration of glycosylation at or near interfacial domain regions is poorly understood.
[0348] Like other class I viral fusion proteins, the β-CoV S-proteins are heavily glycosylated, obscuring the spike surface and limiting the targetable area for immune responses. A recent site-specific analysis of the glycosylation patterns of the SARS-2 S-protein revealed variation in the glycan type, indicating marked differences in processing enzyme accessibility at each site.sup.16. Together, the wide variation in spike conformation coupled with the presence of glycans adjacent to the RBD indicates among the many factors affecting the RBD position, glycosylation patterns can provide a means by which to control its conformational equilibrium.
[0349] In this study we have investigated the potential for two SARS-2 NTD glycans in close proximity to the RBD to influence the conformational distribution of the RBD ‘up’ and ‘down’ states. Analysis of the available SARS-2 ‘up’ state structures indicated N165 and N234 glycans can interact with the ‘up’ state RBD acting as both direct stabilizers of the ‘up’ state and as steric blocks to transitions to the ‘down’ state. We combined binding studies by surface plasmon resonance, with structural studies using negative stain electron microscopy (NSEM) and single-particle cryo-electron microscopy (cryo-EM) to define shifts in the ‘up’/‘down’ state equilibrium in glycan-deleted mutants of the SARS-2 spike ectodomain. Together, our results demonstrate that RBD proximal glycans can influence the propensity of the S-protein adopt multiple configurations indicating a means for viral escape and therefore the need to consider non-RBD neutralizing responses in vaccine design.
[0350] Results
[0351] Structure Analysis Identifies Glycans with the Potential to Modify the S-Protein Conformation
[0352] In order to establish whether glycans can indeed alter the RBD orientation, we first examined the SARS-2 glycan density at positions 165 and 234 in the cryo-EM maps from three previously published SARS-2 structures. In the ‘down’ state, the N234 glycan resides in a cleft formed by the NTD and RBD (
[0353] RBD Conformation and Antigenicity of the N-Glycan Deleted S-Proteins Reveals Differential Stabilization of RBD ‘Up’ and ‘Down’ States
[0354] In order to examine the extent to which the N234 and N165 glycans influence the conformational distribution of the S-protein, we produced di-proline (2P) stabilized.sup.8 S-protein ectodomain.sup.2 N234A and N165A mutants.
[0355] The parent nCoV sequence (“nCoV-1 nCoV-2P”) is shown in
[0356] The N165A mutant sequence is shown in
[0357] The N234A mutant sequence is shown in
[0358] The protein yields after StrepTactin purification were 2.0 mg and 0.8 mg per 1 L culture supernatant, respectively for the N234A and the N165A mutant. (
[0359] We next examined the ‘up’/‘down’ state distribution of both mutants via negative stain electron microscopy (NSEM). Heterogenous classification of the N234A mutant particles revealed a dramatic shift from a ˜1:1 ‘up’ v. ‘down’ state distribution in the unmutated 2P.sup.2,12,17 to a ratio of ˜1:4 in the down state (
[0360] High-Resolution Cryo-EM Structures of the N-Glycan Deleted Constructs Indicates Modest Perturbation to S-Protein Configuration
[0361] We next turned to cryo-EM for high resolution structure determination to visualize the impact of the glycan deletions on the local and global configuration of the S-protein domains. We collected and processed 7,269 and 8,068 images for the N165A and N234A mutant, respectively, to yield particle stacks cleaned up by 2D classification, that were then subjected to multiple rounds of ab initio classification and heterogenous refinement in cryoSPARC.sup.18 using 20 Å low pass filtered ‘up’ state and ‘down’ state maps generated from available SARS-2 structures. Initial maps for high resolution refinement were generated from sorted particles via ab initio reconstruction (
[0362] We next examined the high-resolution details of the cryo-EM maps. Refinement of the N234A mutant ‘down’ state using C3 symmetry resulted in a 3.0 Å map with coordinates fit to this map yielding a structure aligning to the unmutated 2P structure (PDB ID 6VXX) with a ˜0.6 Å RMSD. Alignment of the S2 subunit revealed the structures to be nearly identical in these regions (RMSD ˜0.4 Å). Examination of the NTD to RBD interface using this alignment revealed a shift of the NTD toward the RBD (
[0363] Refinement of the N165A ‘up’ and ‘down’ states resulted in maps with resolutions of 3.6 Å using C1 symmetry and 3.3 Å using C3 symmetry, respectively. Similar to the N234A mutant, the N165A mutant structures showed an overall similar arrangement of the various domains. Alignment of the ‘down’ state structure of the N165A mutant with that of the unmutated spike yielded an RMSD of 0.81 Å with an S2 subunit alignment RMSD of 0.36 Å. Unlike the N234A mutant, the N165A mutant NTD is shifted away from the adjacent RBD (
[0364] Discussion
[0365] Viral fusion proteins are often heavily glycosylated with the SARS-2 S-protein being no exception. Though decorated with fewer glycans than the HIV-1 Envelope protein, with 22 glycans per protomer.sup.16, the SARS-2 spike is well shielded from immune surveillance. The SARS-2 spike protein has proven remarkably sensitive to domain-domain interfacial mutations.sup.12-15,19 which led us to ask whether glycans near the NTD-RBD interface can also impact the configuration of the spike. Here we have investigated the role of two NTD glycans at positions 234 and 165 in modulating S protein conformational dynamics by tracking the shift of RBD disposition in glycan-deleted mutants using binding to ACE-2 receptor, NSEM and cryo-EM analysis. While the specific magnitudes of differences vary between the different analysis methods, all the results track in the same direction to show that deletion of glycan 234 shifts the RBD dynamics more toward the “down” state, whereas deletion of glycan 165, retains or slightly enhances the distribution toward more “up” states. The 2-RBD “up” state observed in the NSEM analysis was not found in the cryo-EM data, indicating that the RBD up/down configuration in this construct can be sensitive to its environment. The shift in the position of the NTD toward the RBD in the ‘down’ state N234A mutant indicates the N234 glycan plays a direct role in destabilizing the ‘down’ state RBD position such that removal allows tighter packing of the RBD to the NTD. Additionally, the observed shift in the position of the ‘up’ state RBD indicates a role for the N234 glycan in modulating RBD stability. This is consistent with a recently released theoretical study investigating ‘up’ state RBD sensitivity to the presence of N165/N234 glycans via molecular simulation.sup.20. This investigation found that the absence of these glycans resulted in a comparatively unstable ‘up’ state RBD. The results here confirm the prediction from these simulations that loss of the N234 glycan results in an increased prevalence of the ‘down’ state. Deletion of the glycan at position 165 here indicates an opposite effect on the conformation of the spike relative to the N234A mutant, with the NTD shifting away from the adjacent RBD. Though this appears to relieve strain caused by the restriction imposed by the N234 glycan, the resultant lack of packing between the RBD and NTD can be sufficient to favor transitions to the ‘up’ state. Further, this shift indicates the N165 glycan interacts directly with the RBD. Though direct interactions are not observed in the cryo-EM densities here or in previously published SARS-2 structures, the presence of ‘down’ state conformational heterogeneity evinced by the poor resolution of the RBD and NTD elements of the spike is consistent with the possibility of such an interaction. A more detailed examination of this heterogeneity and the influence of these glycans on the various states of the spike will require large datasets with improved orientational sampling to better resolve these apical regions. Nevertheless, the results here demonstrate that the conformational ensemble of the SARS-2 spike and β-CoV spikes are sensitive to glycosylation patterns, especially near the NTD-RBD interface.
[0366] Our results from this study lend insights into two key questions—what role do the glycans at positions 165 and 234 play in modulating RBD dynamics and the biology of the native SARS-2 spike and how do these findings impact vaccine design? Toward the first question, we recognize that the results we describe are in the context of a stabilized, ectodomain construct and differences between these and what occurs on the spike in its native context can be determined. Indeed, a recent report for a detergent solubilized, full-length SARS-2 spike indicated greater stability in the ‘down’ state RBD.sup.21. Yet our experimental results revealing the role of the N165 and N234 glycans in modulating the conformational landscape of the S protein, taken together with the findings from the computational analysis performed in the context of the full-length spike.sup.20, and our analysis of the RBD-proximal NTD glycans of diverse β-CoVs (
[0367] Studies have shown that the NTD and RBD are quite mobile. We therefore asked whether the observed shifts in the NTD of the N165A and N234A mutants in the ‘down’ state are related to changes in the propensity of the domain to occupy positions or due to access to new states. We first classified the ‘down’ state 2P, N165A, and N234A particles using C1 symmetry yielding 4, 4, and 3 states, respectively. In order to quantify differences in the positions of the S1 domains, we generated a set of vectors between protomer RBD's and SD1's centroids and their adjacent NTD's centroids (
[0368] Vector magnitudes and relevant angles and dihedrals were determined for each of the three RBD-NTD pairings. Examination of the distance between adjacent RBDs and NTDs revealed markedly shifted positions between the three constructs (
[0369] The results indicate the 2P and N165A constructs display similar angles with geometric means of 52.9 And 53.0 Å, respectively (
[0370] A previous molecular dynamics-based study of the one ‘up’ state RBD indicated the N165 glycan “props up” the RBD. We therefore classified the 2P and N165A construct ‘up’ states in order to determine the extent to which the RBD positions. Each classified into four states with some overlap in the relative position of the RBDs. However, the 2P construct displayed an RBD more distant from the primary trimer axis as compared to those of the N165A construct while the N165A construct displayed a state much closer to the primary axis (
[0371] Methods
[0372] Vector Based Analysis
[0373] Vector analysis was performed as previously described. Specifically, Cα centroids for the S1 NTD, RBD, S131, SD2 (SARS-CoV-2 residues, 27-43 and 54-271, 330-443 and 503-528, 323-329 and 529-590, 294-322 and 591-696, respectively) as well as a β-sheet motif in the NTD (residues 116-129 and 169-172) and a helix motif in the RBD (residues 403-410) were determined. The NTD was split into two regions with the SD1 contacting, SD2 adjacent portion referred to here as the NTD′ (residues 44-53 and 272-293). Cα centroids in the S2 subunit were obtained for a β-sheet motif (residues 717-727 and 1047-1071) and the CD domain (711-716 and 1072-1122). Vector magnitudes, angles, and dihedrals between these centroids were determined and used in the subsequent analysis. Vector analysis was performed using the VMD.sup.22 Tcl interface.
[0374] Protein Expression and Purification
[0375] The SARS-CoV-2 ectodomain constructs were produced and purified as described previously.sup.2. Briefly, a gene encoding residues 1-1208 of the SARS-CoV-2 S (GenBank: MN908947) with proline substitutions at residues 986 and 987, a “GSAS” substitution at the furin cleavage site (residues 682-685), a C-terminal T4 fibritin trimerization motif, an HRV3C protease cleavage site, a TwinStrepTag and an 8XHisTag (SEQ ID NO: 4) was synthesized and cloned into the mammalian expression vector pall. All mutants were introduced in this background. Expression plasmids encoding the ectodomain sequence were used to transiently transfect FreeStyle293F cells using Turbo293 (SpeedBiosystems). Protein was purified on the sixth day post-transfection from the filtered supernatant using StrepTactin resin (IBA).
[0376] The ACE-2 gene was cloned as a fusion protein with a mouse Fc region attached to its C-terminal end. A 6× His-tag (SEQ ID NO: 5) was added to the C-terminal end of the Fc domain. ACE-2 with mouse FC tag was purified by Ni-NTA chromatography.
[0377] Thermal Shift Assay
[0378] The thermal shift assay was performed using Tycho NT. 6 (NanoTemper Technologies). Spike variants were diluted (0.15 mg/ml) in nCoV buffer (2 mM Tris, pH 8.0, 200 mM NaCl, 0.02% sodium azide) and run in duplicates in capillary tubes. Intrinsic fluorescence was recorded at 330 nm and 350 nm while heating the sample from 35-95° C. at a rate of 3° C./min. The ratio of fluorescence (350/330 nm) and the Ti were calculated by Tycho NT. 6.
[0379] Cryo-EM Sample Preparation, Data Collection and Processing
[0380] Purified SARS-CoV-2 spike preparations were diluted to a concentration of ˜1 mg/mL in 2 mM Tris pH 8.0, 200 mM NaCl and 0.02% NaN3. 2.5 μL of protein was deposited on a CF-1.2/1.3 grid that had been glow discharged for 30 seconds in a PELCO easiGlow™ Glow Discharge Cleaning System. After a 30 s incubation in >95% humidity, excess protein was blotted away for 2.5 seconds before being plunge frozen into liquid ethane using a Leica EM GP2 plunge freezer (Leica Microsystems). Frozen grids were imaged in a Titan Krios (Thermo Fisher) equipped with a K3 detector (Gatan). Data were acquired using the Leginon system.sup.23. The dose was fractionated over 50 raw frames and collected at 50 ms framerate. This dataset was energy-filtered with a slit width of 30 eV. Individual frames were aligned and dose-weighted′. CTF estimation, particle picking, 2D classifications, ab initio model generation, heterogeneous refinements, homogeneous 3D refinements and local resolution calculations were carried out in cryoSPARC.sup.25.
[0381] Cryo-EM Structure Fitting and Analysis
[0382] Structures of the all ‘down’ state (PDB ID 6VXX) and single RBD ‘up’ state (PDB ID 6VYB) from the previously published SARS-CoV-2 ectodomain were used to fit the cryo-EM maps in Chimera.sup.26. Mutations were made in PyMol.sup.27. Coordinates were fit to the maps first using ISOLDE.sup.28 followed by iterative refinement using Phenix.sup.29 real space refinement and subsequent manual coordinate fitting in Coot as needed. Structure and map analysis were performed using PyMol, Chimera.sup.26 and ChimeraX.sup.30.
[0383] Surface Plasmon Resonance
[0384] The binding of ACE-2 to the SARS-2 spike constructs was assessed by surface plasmon resonance on Biacore T-200 (GE-Healthcare) at 25° C. with HBS-EP+ (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 mM EDTA, and 0.05% surfactant P-20) as the running buffer. ACE-2 tagged at its C-terminal end to a mouse Fc region was captured on an anti-Fc surface. Binding was assessed by flowing over different concentrations of the spike constructs over the ACE-2 surface. The surface was regenerated between injections by flowing over 3M MgCl2 solution for 10s with flow rate of 100 μl/min. Blank sensorgrams were obtained by injection of the same volume of HBS-EP+ buffer in place of IgGs and Fab solutions. Sensorgrams were corrected with corresponding blank curves. Sensorgram data were analyzed using the BiaEvaluation software (GE Healthcare).
REFERENCES
[0385] 1 Barnes, C. O. et al. Structures of human antibodies bound to SARS-CoV-2 spike reveal common epitopes and recurrent features of antibodies. Cell, doi:https://doi.org/10.1016/j.cell.2020.06.025 (2020). [0386] 2 Wrapp, D. et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367, 1260-1263, doi:10.1126/science.abb2507 (2020). [0387] 3 Yuan, Y. et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nature Communications 8, 15092, doi:10.1038/ncomms15092 (2017). [0388] 4 Gui, M. et al. Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding. Cell Research 27, 119-129, doi:10.1038/cr.2016.152 (2017). [0389] 5 Song, W., Gui, M., Wang, X. & Xiang, Y. Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLOS Pathogens 14, e1007236, doi:10.1371/journal.ppat.1007236 (2018). [0390] 6 Kirchdoerfer, R. N. et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Scientific Reports 8, 15701, doi:10.1038/s41598-018-34171-7 (2018). [0391] 7 Walls, A. C. et al. Unexpected Receptor Functional Mimicry Elucidates Activation of Coronavirus Fusion. Cell 176, 1026-1039.e1015, doi:https://doi.org/10.1016/j.cell.2018.12.028 (2019). [0392] 8 Pallesen, J. et al. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proceedings of the National Academy of Sciences 114, E7348, doi:10.1073/pnas.1707304114 (2017). [0393] 9 Kirchdoerfer, R. N. et al. Pre-fusion structure of a human coronavirus spike protein. Nature 531, 118-121, doi:10.1038/nature17200 (2016). [0394] 10 Tortorici, M. A. et al. Structural basis for human coronavirus attachment to sialic acid receptors. Nature Structural & Molecular Biology 26, 481-489, doi:10.1038/s41594-019-0233-y (2019). [0395] 11 Walls, A. C. et al. Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer. Nature 531, 114-117, doi:10.1038/nature16988 (2016). [0396] 12 Henderson, R. et al. Controlling the SARS-CoV-2 Spike Glycoprotein Conformation. bioRxiv, 2020.2005.2018.102087, doi:10.1101/2020.05.18.102087 (2020). [0397] 13 Hsieh, C.-L. et al. Structure-based Design of Prefusion-stabilized SARS-CoV-2 Spikes. bioRxiv, 2020.2005.2030.125484, doi:10.1101/2020.05.30.125484 (2020). [0398] 14 McCallum, M., Walls, A. C., Corti, D. & Veesler, D. Closing coronavirus spike glycoproteins by structure-guided design. bioRxiv, 2020.2006.2003.129817, doi:10.1101/2020.06.03.129817 (2020). [0399] 15 Xiong, X. et al. A thermostable, closed, SARS-CoV-2 spike protein trimer. bioRxiv, 2020.2006.2015.152835, doi:10.1101/2020.06.15.152835 (2020). [0400] 16 Watanabe, Y., Allen, J. D., Wrapp, D., McLellan, J. S. & Crispin, M. Site-specific glycan analysis of the SARS-CoV-2 spike. Science, eabb9983, doi:10.1126/science.abb9983 (2020). [0401] 17 Walls, A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell, doi:https://doi.org/10.1016/j.cell.2020.02.058 (2020). [0402] 18 Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nature Methods 14, 290, doi:10.1038/nmeth.4169 https://www.nature.com/articles/nmeth.4169 #supplementary-information (2017). [0403] 19 Zhang, L. et al. The D614G mutation in the SARS-CoV-2 spike protein reduces S1 shedding and increases infectivity. bioRxiv, 2020.2006.2012.148726, doi:10.1101/2020.06.12.148726 (2020). [0404] 20 Casalino, L. et al. Shielding and Beyond: The Roles of Glycans in SARS-CoV-2 Spike Protein. bioRxiv, 2020.2006.2011.146522, doi:10.1101/2020.06.11.146522 (2020). [0405] 21 Cai, Y. et al. Distinct conformational states of SARS-CoV-2 spike protein. bioRxiv, 2020.2005.2016.099317, doi:10.1101/2020.05.16.099317 (2020). [0406] 22 Humphrey, W., Dalke, A. & Schulten, K. VMD: Visual molecular dynamics. Journal of Molecular Graphics 14, 33-38, doi:https://doi.org/10.1016/0263-7855(96)00018-5 (1996). [0407] 23 Suloway, C. et al. Automated molecular microscopy: the new Leginon system. J Struct Biol 151, 41-60, doi:10.1016/j.jsb.2005.03.010 (2005). [0408] 24 Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat Methods 14, 331-332, doi:10.1038/nmeth.4193 (2017). [0409] 25 Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat Methods 14, 290-296, doi:10.1038/nmeth.4169 (2017). [0410] 26 Pettersen, E. F. et al. UCSF Chimera—A visualization system for exploratory research and analysis. Journal of Computational Chemistry 25, 1605-1612, doi:10.1002/jcc.20084 (2004). [0411] 27 Schrodinger, L. The PyMOL Molecular Graphics System. (2015). [0412] 28 Croll, T. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallographica Section D 74, 519-530, doi:10.1107/S2059798318002425 (2018). [0413] 29 Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallographica Section D 74, 531-544, doi:10.1107/S2059798318006551 (2018). [0414] 30 Goddard, T. D. et al. UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Protein Sci 27, 14-25, doi:10.1002/pro.3235 (2018).