IMMUNOGENIC COMPOSITION
20220054625 · 2022-02-24
Inventors
Cpc classification
A61K39/39
HUMAN NECESSITIES
International classification
A61K39/39
HUMAN NECESSITIES
Abstract
The present invention relates to an immunogenic composition comprising two or more polypeptides. The invention also provides nucleic acid molecules and vectors encoding the polypeptides, and methods of using the compositions, nucleic acid molecules and vectors for the prevention or treatment of influenza.
Claims
1. An immunogenic composition comprising at least five different virus-like particles (VLPs), wherein each VLP independently comprises one or more homotrimers comprised of polypeptides of SEQ ID NOs: 13-17, optionally together with one or more pharmaceutically-acceptable carriers, adjuvants, excipients or diluents, as a combined preparation in a form suitable for simultaneous, separate or sequential use for treating or preventing influenza A infection.
2. An immunogenic composition comprising two or more polypeptides, wherein each polypeptide independently comprises a first region of contiguous amino acids, wherein: (a) the amino acid sequence of the first region has at least 80% sequence identity to an influenza A haemagglutinin head domain; and (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T, N, I or A position 147 is a positively charged amino acid, I or is absent position 148 is G position 149 is V position 151 is A position 154 is S or P position 155 is H position 156 is a positively charged amino acid or A or G or N or E position 157 is a positively charged amino acid or A or G position 158 is a positively charged amino acid or A or S or N or C or E position 159 is K or A or S or N or C position 163 is a positively charged amino acid, wherein the amino acid sequences of the two or more polypeptides are different, and wherein the composition is capable of inducing antibodies in a subject against an influenza A virus, optionally together with one or more pharmaceutically-acceptable carriers, adjuvants, excipients or diluents.
3. A composition comprising a polypeptide, wherein the amino acid sequence of the polypeptide comprises a first region, wherein: (a) the amino acid sequence of the first region has at least 80% sequence identity to an influenza A subtype H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, H17 or H18 haemagglutinin head domain, preferably an influenza A subtype H1, H5, H6, H9 or H11 haemagglutinin head domain; and (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T, N, I or A position 147 is a positively charged amino acid, I or is absent position 148 is G position 149 is V position 151 is A position 154 is S or P position 155 is H position 156 is a positively charged amino acid or A or G or N or E position 157 is a positively charged amino acid or A or G position 158 is a positively charged amino acid or A or S or N or C or E position 159 is K or A or S or N or C position 163 is a positively charged amino acid.
4. A composition as claimed in claim 2 or claim 3, wherein the amino acid sequence of the influenza A haemagglutinin head is selected from the group consisting of SEQ ID NOs: 1-4.
5. A composition as claimed in claim 3, the amino acid sequence of the first region has at least 85%, 90% or 95% sequence identity to the influenza A haemagglutinin head domain.
6. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E or D position 146 is T or N position 147 is R or K or I or is absent position 148 is G position 149 is V position 151 is A position 154 is S or P position 155 is H position 156 is A or G or K or N or E position 157 is A or G position 158 is A or K or E position 159 is A, K, C, N or S position 163 is K or R.
7. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T, N, I or A position 147 is a positively charged amino acid, position 148 is G position 149 is V position 151 is A position 154 is S or P position 155 is H position 156 is A position 157 is G position 158 is K or A or S or N or C position 159 is K or A or S or N or C position 163 is a positively charged amino acid.
8. A composition as claimed in claim 7, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E position 146 is N position 147 is R position 148 is G position 149 is V position 151 is A position 154 is P position 155 is H position 156 is A position 157 is G position 158 is A position 159 is K position 163 is K, preferably, all of the above amino acid substitutions.
9. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is N position 147 is I position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is K or A position 157 is G position 158 is A or K position 159 is K or S position 163 is a positively charged amino acid.
10. A composition as claimed in claim 9, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E position 146 is N position 147 is I position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is A position 157 is G position 158 is K position 159 is S position 163 is K, preferably, all of the above amino acid substitutions.
11. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T position 147 is a positively charged amino acid position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is a positively charged amino acid or A or G position 157 is a positively charged amino acid or A or G position 158 is K position 159 is S or C position 163 is a positively charged amino acid.
12. A composition as claimed in claim 11, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E position 146 is T position 147 is R position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is K position 157 is G position 158 is K position 159 is S position 163 is K, preferably, all of the above amino acid substitutions.
13. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T position 147 is a positively charged amino acid position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is N position 157 is G position 158 is a positively charged amino acid position 159 is S position 163 is a positively charged amino acid.
14. A composition as claimed in claim 13, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E position 146 is T position 147 is K position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is N position 157 is G position 158 is K position 159 is S position 163 is R, preferably, all of the above amino acid substitutions.
15. A composition as claimed in claim 3, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is a negatively charged amino acid position 146 is T or N position 147 is absent position 148 is G position 149 is V position 151 is A position 154 is S or P position 155 is H position 156 is N position 157 is G position 158 is K position 159 is S position 163 is a positively charged amino acid.
16. A composition as claimed in claim 15, wherein: (b) the first region has one or more amino acid substitutions at positions which correspond to the following positions in SEQ ID NO: 9: position 83 is E position 85 is E position 146 is T position 147 is absent position 148 is G position 149 is V position 151 is A position 154 is S position 155 is H position 156 is N position 157 is G position 158 is K position 159 is S position 163 is R, preferably, all of the above amino acid substitutions.
17-19. (canceled)
20. A composition as claimed in claim 3, wherein the composition comprises 2, 3, 4 or 5 of said polypeptides, all of which are different.
21. A composition as claimed in claim 3, wherein the composition comprises one or more hetero-trimers of three different polypeptides or homotrimers of the same polypeptides.
22-25. (canceled)
26. A virus-like particle comprising two or more polypeptides as defined in claim 3.
27-32. (canceled)
33. A method of: (i) preventing or treating influenza infection in a subject; or (ii) inducing a T-cell or B-cell response to an influenza antigen in a subject, the method comprising administering an effective amount of a composition as claimed in claim 3 to a subject in need thereof.
34-43. (canceled)
44. A composition as claimed in claim 20, wherein the first regions of the two or more polypeptides independently comprise amino acid sequences selected from the group consisting of SEQ ID NOs: 13-17.
45. A composition as claimed in claim 20, wherein the polypeptides are all 280-300 amino acids in length.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0484]
[0485]
[0486]
[0487]
[0488]
[0489] (
[0490] (
[0491] (
[0492]
[0493] (
[0494] (
[0495] (
[0496]
[0497] (
[0498] (
[0499] (
[0500]
[0501]
[0502]
EXAMPLES
[0503] The present invention is further illustrated by the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Thus, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
Example 1: Antigenic Thrift Model
[0504] The existence of protective epitopes of low variability is consistent with the population dynamics of influenza A under the “antigenic thrift model”. This model is based on a multi-locus representation of the virus with each locus corresponding to an epitope region and presents an alternative to the more widely accepted “antigenic drift” model in having the potential to contain protective epitopes of limited variability as well as those of high variability.
[0505] Carter et al. (2013) provides evidence for the antigenic thrift model. Ferrets were infected with one of several historical influenza viruses. Serum antibodies were measured at day 14 and 81 by haemagglutin inhibition (HAI) assays (
[0506] The observed periodic cross-reactivity in Carter et al. (2013) is predicted, in part, by the current structural bioinformatics analysis. For example, infection of ferrets with the 1957 strain is predicted to induce cross-reactive antibodies to the A/R/8/1934, A/Den/1/1957, A/NC/20/1999 and A/Bris/59/2007 strains. Cross-reactivity is observed between the A/R/8/1934, A/Den/1/1957 and A/NC/20/1999 strains on infection, but not the A/Bris/59/2007 strain.
Example 2: Cyclical Cross-Reactivity of Infant Plasma Against Chronologically-Dispersed H1 Influenza Strains
[0507] Standardised enzyme-linked immunosorbant assays (ELISAs) were performed using plasma from children aged 12 to 17 months, collected in 2012. The HA1 domains from influenza strains A/California/4/2009, A/USSR/90/1977, A/Brevig Mission/1/1918, A/Solomon Islands/3/2006, A/New Caledonia/20/1999, A/Puerto Rico/8/34 and A/WSN/33 were bought from Sino Biological. Standards used sera from adults based on their date of birth. Two negative controls were run on each plate consisting of a caesin only control and a non-reactive human plasma or sera control.
[0508] The results are shown in
[0509] The fact that this plasma reacted with a panel of historical H1N1 strains in a cyclical manner lead us to infer that epitopes of limited variability are present in the head domain of H1 HA and that they cycle through a limited number of conformations as host population immunity changes.
Example 3: Identification of Epitopes of Limited Variability
[0510] To identify epitopes of limited variability, antibody binding sites were mapped to the A/Puerto Rico/8/1934 crystal structure and the variability within those site determined by referring to an alignment of 2,756 H1 sequences (
[0511] In silico analysis was used to determine how the accessibility and binding site area contributed to the variability of hypothetical antibody binding sites. An antibody binding site of 800 A.sup.2 was used to determining the variability for three accessibility parameters: amino acids with >30%, >10% or >1% accessibility. A dataset of positions with >10% accessibility was used to determine the variability for three binding site sizes: 600 A.sup.2, 800 A.sup.2 or 1000 A.sup.2. Both approaches identified the same regions of limited variability within the head of H1 HA (
[0512] Analysis of the sites of limited variability predicted to exist from the in silico analysis was performed by mapping the predicted epitopes to the A/Puerto Rico/8/1934, A/Brevig Mission/1/1918 and A/California/04/2009 crystal structures using Swiss-pdb viewer. By mapping the predicted sites to the crystal structures, sites that were likely to be epitopes could be identified. One site close to the receptor binding site (RBS), in a region which is known to be under strong immune selection but thought to be highly variable, centred on a 800 A.sup.2 region surrounding this positions 156/158 (
Example 4: Cycling of Epitopes
[0513] Yearly consensus sequences were generated by dividing the 12,480 curated H1 HA sequences into separate fasta files based on the year that the sequence was collected. The R package ‘seqinr’ was then used to generate consensus sequences.
[0514] Analysis of the predicted binding site surrounding positions 156/158 indicated that there were a number of positions in which charged residues could be found, in addition to positions with non-charged residues, which were either conserved or changed between similar residue types. It is generally accepted that antibodies preferentially bind to charged residues and so the possible epitope permutations were defined based on the cycling of charged amino acids in positions 147, 156, 157, 158 and 159 (Kringelum et al. 2013).
[0515] At position 147, the amino acid alternated between a positively charged amino acid, lysine or arginine, a neutral amino acid, isoleucine, and no amino acid. Consequently the site was divided up based on this pattern into three groups.
[0516] Phylogenetic analysis of position 147 also reveals that strains were identified in which no amino acid was present at position 147 five times during the evolution of H1 influenza in humans between 1918-1957 and 1977-2015 (
[0517] The 147 positive group was then further divided based on the presence of a positively charged amino acid at position 158 or 159, 158 and either a positive charged amino acid at position 156 or 157.
[0518] The space filling capacity of non-charged amino acid was then considered, which allowed an addition group to be produced from 147-positive/158 or 157 positive group based on whether alanine or asparagine was present at position 156 (
Example 5: Loss of Neutralisation Upon Site-Directed Mutagenesis
[0519] Sera from children aged 6 to 11 years, taken in late 2006/early 2007, cross-reacted extensively with HA1 domains from historical influenza strains (
[0520] Using a microneutralisation assay (
[0521] This data emphasises the importance of amino acid position 147. It should be noted that in the A/Solomon Islands/3/2006, A/PR/8/1934 and A/WSN/1933 strains no amino acid is contained at position 147. Instead monomers of these viruses consist of 565 instead of 566 amino acids.
Example 6: Synthesis of Polypeptides
[0522] Invitrogen® GeneArt Strings were used to synthesise the chimeric HA molecules consisting of the epitope of limited variability substituted into the HA1 domain of H5, H6, or H11. Three conformations of the site were initially used.
[0523] The chimeric HA1 domain sequences were then cloned into DNA expression constructs and lentiviral glycoprotein expression vectors. The DNA expression were grown up in E. coli and purified using a Qiagen Giga Prep Kit. Lentiviruses were produced displaying the chimeric HAs via the protocol outlined in Carnell et al., (2015) before being purified by sucrose cushion centrifugation. The conformations substituted into the H6, H5 and H11 HA1 domains are provided below (amino acid position is denoted in brackets):
TABLE-US-00001 Blue: N(146), K(147), G(148), V(149), A(151), P(154), H(155), A(156), G(157), A(158), K(159), K(163) AAC(146) AAG(147) GGC(148) GTG(149) GCC(151) CCC(154) CAC(155) GCC(156) GGC(157) GCC(158) AAG(159) AAG(163) Hazel: N(146), I(147), G(148), V(149), A(151), S(154), H(155), A(156), G(157), K(158), S(159), K(163) AAC(146) ATC(147) GGC(148) GTG(149) GCC(151) AGC(154) CAC(155) GCC(156) GGC(157) AAG(158) AGC(159) AAG(163) Green: T(146), R(147), G(148), V(149), A(151), S(154), H(155), K(156), G(157), K(158), S(159), R(163) ACC(146) AGG(147) GGC(148) GTG(149) GCC(151) AGC(154) CAC(155) AAG(156) GGC(157) AAG(158) AGC(159) AGG(163) Orange: T(146), K(147), G(148), V(149), A(151), S(154), H(155), N(156), G(157), K(158), S(159), R(163) ACC(146) AAG(147) GGC(148) GTG(149) GCC(151) AGC(154) CAC(155) AAC(156) GGC(157) AAG(158) AGC(159) AGG(163) Red: T(146), Absent(147),), G(148), V(149), A(151), S(154), H(155), N(156), G(157), K(158), S(159), R(163) ACC(146) Absent(147) GGC(148) GTG(149) GCC(151) AGC(154) CAC(155) AAC(156) GGC(157) AAG(158) AGC(159) AGG(163)
[0524] These sequences correspond to those cloned into the vaccine constructs and so any cross-reactivity can be directly attributed to them.
Example 7: Mouse Challenges
[0525] Mouse influenza challenges are performed with influenza strains:
(i) A/California/4/2009 at a concentration of 1*10.sup.5 Pfu and (ii) A/PR/8/1934 at a concentration of 1*10.sup.3Pfu. Weight changes were monitored on a daily basis. The optimisation of challenge experiments enables the protection induced by the vaccine in mice to be quantified in the vaccination studies.
[0526] The basic vaccination protocol is shown in
[0527] Mice were sequentially vaccinated with the sequences outlined below:
TABLE-US-00002 Position Name 146 147 142 149 151 154 155 156 157 152 159 163 Blue N K G V A P H A G A K K Hazel N I G V A S H A G K S K Green T R G V A S H K G K S R Orange T K G V A S H N G K S R Red T Absent G V A S H N G K S R
[0528] Five groups of six mice (named blue, red, hazel, orange and green) were vaccinated via intramuscular injection of 100 μg of DNA each with a different conformation of the epitope substituted into H6 HA at 10 weeks. At 13 weeks of age, the same groups were vaccinated via intramuscular injection of 100 μg of DNA with the same conformation substituted into H5 HA. At 18 weeks of age, the same groups were vaccinated via intramuscular with the same conformation substituted into H11 HA displayed on a lentivirus and mixed with Alum adjuvant (Alhydrogel, Invivogen). Two control groups (purple and grey) were vaccinated in the same manner as the aforementioned mice with the HAs without the epitope conformations substituted into them. Finally, two further control groups (black and white) were mock vaccinated at 18 weeks with PBS and Alum (Alhydrogel, Invivogen). At 11 weeks, 14 weeks, 20 weeks and 21 weeks all groups were bled. At 22 weeks, the blue, orange, hazel and purple groups were challenged with mouse adapted A/California/4/2009 virus and weighed daily. At 22 weeks, the red, green, grey and white groups were challenged with mouse adapted A/PR/8/1934 virus and weighed daily. The results are shown in
Example 8: Vaccination Against the H3 Influenza Subtype
[0529] The method of identifying sites of limited variability and subsequent epitopes of limited variability is applied to the H3 subtype of influenza A. As H3 subtype influenza A virus evolves in a similar way to the H1 subtype influenza A virus, this approach to identifying epitopes is equally applicable to H3 subtype influenza A. Consequently epitopes can be identified by mapping the variability of H3 strains to the head of H3 influenza, identifying regions of limited variability, mapping said regions to H3 structures to identify potential epitopes and then analysing consensus sequence data to identify epitopes behaving in a cyclical manner predicted by antigenic thrift model
[0530] Epitope conformations of this type are placed in the HA head domains of H4, H7H10, H14 and H15 and expressed using VLPs or viral vectors. The vaccine combination is administered as a prime-boost-boost.
Example 9: Cross-Reactivity Produced by the Vaccines
[0531] ELISA assays were performed against the HA1 domain of A/PR/8/1934, A/Bel/1942, A/Albany/14/1951 and A/Memphis/3/1987. Relative ELISA units (REU) were calculated based on a known positive sample reaching an OD of 1.0 in each assay.
TABLE-US-00003 HA1 domains (REU) Vaccine groups A/PR/8/ A/Bel/ A/Albany/ A/Memphis/ (pooled sera samples) 1934 1942 14/1951 3/1987 Blue — — — — Hazel — — — 329 Green 205 — — 569 Orange 231 224 317 735 Red 770 — — 307 H5 + H6 + H11 control — — — — Unvaccinated control — — — —
REFERENCES
[0532] Belongia, E. A. et al., 2009. Effectiveness of Inactivated Influenza Vaccines Varied Substantially with Antigenic Match from the 2004-2005 Season to the 2006-2007 Season Linked references are available on JSTOR for this article: Effectiveness of Inactivated Influenza Vaccines Varied. The Journal of Infectious Disease, 199(2), pp. 159-167. [0533] Carnell et al., (2015) Pseudotype-based neutralization assays for influenza: a systematic analysis. Front Immunol. 2015 Apr. 29; 6:161. doi: 10.3389/fimmu.2015.00161. eCollection 2015. [0534] Carter et al., (2013) Sequential seasonal H1N1 influenza virus infections protect ferrets against novel 2009 H1N1 influenza virus. J Virol. 2013 February; 87(3):1400-10. [0535] Caton et al., 1982. The antigenic structure of the influenza virus A/PR/8/34 hemagglutinin (H1 subtype). Cell, 31(2 Pt 1), pp. 417-427. [0536] Gupta S. 2016 Immune Driven Pathogen Evolution, Encyclopaedia of Immunology (Ed. Kaye, P.) Elsevier. [0537] Krammer, F. et al., 2013. Broadly Protective Stalk-Specific Antibodies., 87(12), pp. 6542-6550. [0538] Li, Y. et al., 2013. Immune history shapes specificity of pandemic H1N1 influenza antibody responses. 210(8), pp. 1493-1500. [0539] Lozano, R. et al., 2012. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet, 380, pp. 2095-2128. [0540] Manicassamy, B. et al., 2010. Protection of mice against lethal challenge with 2009 H1N1 influenza A virus by 1918-like and classical swine H1N1 based vaccines. PLoS Pathogens, 6(1). [0541] Matsuzaki, Y. et al., 2014. Epitope Mapping of the Hemagglutinin Molecule of A/(H1N1) pdm09 Influenza Virus by Using Monoclonal Antibody Escape Mutants. Journal of Virology, 88(21), pp. 12364-12373. [0542] Mertz, D., Hyong, T. & Johnstone, J., 2013. Populations at risk for severe or complicated influenza illness: systematic review and meta-analysis. British Medical Journal, 5061(August), pp. 1-15. [0543] Miura et al. 2008 Vaccine 26:193. [0544] Presanis, A. M. et al., 2011. Changes in severity of 2009 pandemic A/H1N1 influenza in England: a Bayesian evidence synthesis. British Medical Journal, (343), pp. 1-14. [0545] Recker, M. et al., 2007. The generation of influenza outbreaks by a network of host immune responses against a limited set of antigenic types. PNAS 104:7711 [0546] Taubenberger, J. K. & Morens, D. M., 2006. 1918 Influenza: the Mother of All Pandemics. Lancet, 12(1), pp. 15-22. [0547] Treanor, J. J. et al., 2012. Effectiveness of Seasonal Influenza Vaccines in the United States During a Season With Circulation of All Three Vaccine Strains., pp. 1-9. [0548] WHO 2016. Recommended composition of influenza virus vaccines for use in the 2016-2017 northern hemisphere influenza season. [0549] Wikramaratna, P. S. et al., 2013. The antigenic evolution of influenza: drift or thrift? Philosophical transactions of the Royal Society of London. Series B, Biological sciences, 368(1614), p. 20120200.
TABLE-US-00004 SEQUENCES H1 head domain—amino acid SEQ ID NO: 1 CKLRGVAPLHLGKCNIAGWILGNPECESLSTASSWSYIVETSSSDNGTCYPGDFIDYEE LREQLSSVSSFERFEIFPKTSSWPNHDSNKGVTAACPHAGAKSFYKNLIWLVKKGNSY PKLSKSYINDKGKEVLVLWGIHHPSTSADQQSLYQNADAYVFVGTSRYSKKFKPEIAIR PKVRDQEGRMNYYWTLVEPGDKITFEATGNLVVPRYAFAMERNAGSGIIISDTPVHDC H6 head domain—amino acid SEQ ID NO: 2 CKILNKAPLDLRGCTIEGWILGNPQCDLLLGDQSWSYIVERPTAQNGICYPGTLNEVEE LKALIGSGERVERFEMFPKSTWAGVDTNSGVTSACPYNSGSSFYRNLLWIIKTKSAAY PVIKGTYNNTGNQPILYFWGVHHPPDTNEQNTLYGSGDRYVRMGTESMNFAKSPEIA ARPAVNGQRGRIDYYWSVLKPGETLNVESNGNLIAPWYAYKFVSTNNKGAVFKSNLPI ENC H5 head domain—amino acid SEQ ID NO: 3