CANCER NEOANTIGENS AND THEIR UTILITIES IN CANCER VACCINES AND TCR-BASED CANCER IMMUNOTHERAPY
20220307087 · 2022-09-29
Inventors
- Rongfu WANG (Houston, TX, US)
- Yicheng WANG (Houston, TX, US)
- Lili ZHANG (Shanghai, CN)
- Chen Qian (Houston, TX, US)
Cpc classification
G01N33/57484
PHYSICS
C12N2501/999
CHEMISTRY; METALLURGY
C12N5/0638
CHEMISTRY; METALLURGY
A61P35/00
HUMAN NECESSITIES
International classification
Abstract
Disclosed are compositions and methods for identifying neoantigens and using neoantigens in the use of treating cancer, as well as autoimmune diseases, where antigens causing tissue destruction.
Claims
1. A method of identifying neoantigens from a cancer in a human subject comprising: a. performing whole exome sequencing on a nucleic acid sample from a cancer cell; b. mapping the sequence to a reference genome sequence; c. filtering sequence variations to remove common variations in tumors and normal cells; d. creating one or more single mutation peptide constructs comprising at least one uncommon amino acid variation and one or more flanking amino acids; e. synthesizing one or more minigenes encoding one or more single mutation peptide constructs of step d; f. transfecting one or more minigenes into one or more cells or cell lines expressing MHC class I or MHC class II molecules; g. co-culturing one or more T cells or T cells lines with the transfected cells of step f); h. measuring T cell activity of the co-cultured T cells or T cell lines and identifying mutation peptides encoded by minigenes in transfected cells that induced T cell activity; and i. assaying the identified mutation peptide from each minigene in transfected cells that induced T cell activity, for the ability to induce T cell activity alone.
2-12. (canceled)
13. A method of stimulating an immunological response against a cancer or treating, inhibiting, and/or preventing a cancer, the method comprising administering to a subject a composition comprising a pharmaceutically acceptable carrier and a therapeutically effective amount of one or more neoantigen peptides identified by the method of claim 1.
14. The method of claim 13, wherein the neoantigen is an immunodominant neoantigen.
15. A composition for treating a cancer, wherein the composition comprises a pharmaceutically acceptable carrier and a therapeutically effective amount of one or more neoantigen peptides
16. (canceled)
17. A composition for treating a cancer comprising a therapeutically effective amount of one or more chimeric antigen receptor (CAR) T cells T cell receptor (TCR) T-cells, and/or tumor infiltrating lymphocytes (TILs); wherein the CAR T cell, TCR T cell, and/or TIL has been engineered to express a receptor for a cancer neoantigen.
18. The composition of any one of claim 15 or 17, wherein the neoantigen is selected from the group of neoantigens consisting of the amino acids as set forth in SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163.
19. A method of treating a cancer in a subject comprising administering to the subject the composition of any one of claim 15, 17 or 18.
20-44. (canceled)
45. An isolated nucleic acid encoding a neoantigen of claim 18.
46. The method of claim 13 wherein the cancer is selected from the group consisting of B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas.
47. The method of claim 13, wherein the neoantigen is selected from the group of neoantigens consisting of the amino acids as set forth in SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163.
48. A composition comprising a therapeutically effective amount of one or more TCR-T cells or CAR-T cells, wherein the cells have been engineered to express one or more nucleic acid sequences encoding a shRNA targeting a gene encoding a negative signaling molecule or an epigenetic protein, whereby knock down or knockout of the negative signaling molecule or epigenetic protein enhances T cell persistence and antitumor activity of the TCR-T or CAR-T cells in vivo.
49. The composition of claim 48, wherein the TCR-T and CAR-T cells are further engineered to express a T cell receptor or chimeric antigen receptor specific for cancer neoantigens.
50. The composition of claim 49, wherein the cancer neoantigen is a cancer testis neoantigen.
51. The composition of claim 48, wherein the negative signaling molecule is selected from a master regulator of T cell development, an immune checkpoint protein and an immune suppressor protein.
52. The composition of claim 48, wherein the epigenetic protein is a lysine demethylase.
53. A composition comprising a therapeutically effective amount of one or more TCR-T cells or CAR-T cells and a LSD1 inhibitor that enhances the anti-tumor activity of the T cells or CAR-T cells in vivo.
54. The composition of claim 52, wherein the LSD1 inhibitor reprograms stem cell-like T cells to prolong survival of the T cells.
Description
III. BRIEF DESCRIPTION OF THE DRAWINGS
[0016] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments and together with the description illustrate the disclosed compositions and methods.
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
IV. DETAILED DESCRIPTION
[0037] Before the present compounds, compositions, articles, devices, and/or methods are disclosed and described, it is to be understood that they are not limited to specific synthetic methods or specific recombinant biotechnology methods unless otherwise specified, or to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.
A. Definitions
[0038] As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a pharmaceutical carrier” includes mixtures of two or more such carriers, and the like.
[0039] Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that when a value is disclosed that “less than or equal to” the value, “greater than or equal to the value” and possible ranges between values are also disclosed, as appropriately understood by the skilled artisan. For example, if the value “10” is disclosed the “less than or equal to 10” as well as “greater than or equal to 10” is also disclosed. It is also understood that the throughout the application, data is provided in a number of different formats, and that this data, represents endpoints and starting points, and ranges for any combination of the data points. For example, if a particular data point “10” and a particular data point 15 are disclosed, it is understood that greater than, greater than or equal to, less than, less than or equal to, and equal to 10 and 15 are considered disclosed as well as between 10 and 15. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.
[0040] In this specification and in the claims which follow, reference will be made to a number of terms which shall be defined to have the following meanings:
[0041] “Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
[0042] The term “subject” refers to any individual who is the target of administration or treatment. The subject can be a vertebrate, for example, a mammal. In one aspect, the subject can be human, non-human primate, bovine, equine, porcine, canine, or feline. The subject can also be a guinea pig, rat, hamster, rabbit, mouse, or mole. Thus, the subject can be a human or veterinary patient. The term “patient” refers to a subject under the treatment of a clinician, e.g., physician.
[0043] Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this pertains. The references disclosed are also individually and specifically incorporated by reference herein for the material contained in them that is discussed in the sentence in which the reference is relied upon.
B. Methods of Identifying Neoantigens
[0044] Although in vitro stimulation of T cells from normal healthy donors with neoantigen peptides could increase the number of neoantigens capable of activating T cells, prior to the present disclosure it remained unknown about the number and recognition pattern of neoantigens by T cells in cancer patients. Furthermore, it was largely unknown whether a single T cell receptor (TCR) can recognize two or multiple neoantigens in cancer patients. Despite recent identification of many HLA class I-restricted neoantigens using exome sequencing and computer-assisted prediction programs, increasing evidence demonstrates that neoantigen-specific CD4+ T cells are more dominant than neoantigen-specific CD8+ T cells, and play a critical role in tumor regression. However, the current prediction programs in use in the art for MHC II-restricted neoantigens are not accurate and much poorer than the prediction programs of MHC I-restricted neoantigens due to the open pockets of MHC II molecules, thus limiting the ability to identify neoantigens and develop neoantigen-based immunotherapy.
[0045] Tumor regression induced by checkpoint blockade therapy and T cell-based immunotherapy relies on T-cell recognition of tumor antigens, particularly mutation-derived neoantigens, expressed by cancer cells. However, despite the larger numbers of somatic mutations identified in cancer tissues that could generate neoantigens, only a few immunogenic peptide epitopes (per cancer patient) for T-cell recognition have been reported. The underlying mechanisms of these observations remain poorly understood. Herein identification of four MHC class II-restricted neoantigens and two MHC class I-restricted neoantigens is reported. Further analysis of neoantigens for recognition by CD4+ T-cell clones revealed the dominance of one neoantigen, which was recognized by 87% of the T-cell clones, while several low-frequency T-cell clones recognized 3 additional neoantigens. Overall, 98% of the CD4+ T cells recognized neoantigens, compared with only 34% of the CD8+ T cell clones. Additionally, 2 T-cell clones did not recognize mutation-derived neoantigen, but instead recognized neoepitopes derived from gene products translated from the 3′ untranslated region of the ADIPOR2 gene or from an alternative open reading frame of LAGE1b. Finally, it was demonstrated herein that a single TCR can recognize two neoantigens, but not other antigens in normal cells. Also demonstrated was the utility and power of neoantigen-specific T cells in inhibiting tumor growth. The findings have identified immune regulatory mechanisms that restrain T-cell response to a limited number of neoantigens with one as dominant, and should have important implications in personalized immunotherapy that targets dominant neoantigens and TCRs with multiple neoantigen specificity.
[0046] In one aspect, disclosed herein are methods of identifying neoantigens from a cancer in a human subject comprising a) exome sequencing on a nucleic acid sample from a cancer cell (in some instances the method can further comprise isolating DNA from cancer cells, capturing DNA fragments, and constructing an exome library); b) mapping the sequencing result to a reference genome sequence data (for example human genome GRCh38/hg38); c) filtering sequence variations to remove common variations in tumors and normal cells; d) creating one or more single mutation peptide constructs comprising at least one uncommon amino acid variation and one or more flanking amino acids; e) synthesizing one or more minigenes encoding one or more single mutation peptide constructs of step d; f) transfecting one or more minigenes into one or more cells or cell lines expressing MHC class I or MHC class II molecules; g) co-culturing one or more T cells or T cells lines with the transfected cells of step f); h) measuring T cell activity of the co-cultured T cells or T cell lines; i) and assaying single mutation peptides from each minigene in transfected cell lines that induced T cell activity for the ability to induce T cell activity alone.
[0047] In one aspect, also disclosed herein are methods of identifying any neoantigendisclosed herein, further comprising mapping the mutation to a specific peptide epitope by serial truncation and/or further comprising assaying the HLA restriction of the peptide epitope by HLA antibody blocking.
[0048] As noted herein, the single mutation constructs created in the disclosed methods can have one or more amino acids on either or both sides of the mutations. Thus, for example, disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the single mutation constructs comprise at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acids on either side of the mutation.
[0049] It is understood and herein contemplated that the minigenes created in the disclosed methods can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or more single mutation constructs. Thus, in one aspect, disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the minigene comprises at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 single mutation constructs.
[0050] In one aspect, it is understood and herein contemplated that the disclosed also disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the one or more cell or cell lines (such as, for example HEK293 cells, HEK293T cells, Cos-7, MA104 cells, CHO cells, Fibroblasts, B cells, VERO cells, Madin-Darby Canine Kidney (MDCK) cells, HEp-2 cells, HeLa cells, MRC-5 cells, WI-38 cells, EB66, or PER C6 cells) expressing MHC class I or class II molecules, are engineered to express an MHC molecule.
[0051] It is understood and herein contemplated that there are numerous methods known in the art for measuring T cell activity that could be used in the disclosed methods of identifying neoantigens, including, but not limited to chromium release assays, ELISA, ELISpot, intracellular cytokine staining, flow cytometry, MHC class I tetramer staining, MHC class II tetramer staining, radioimmunoassay, or other immunoassay. The immunoassays can measure the release of any number of effector cytokines (such as, for example, release of cytokines including, but not limited to IFN-γ, TGF-β, lymphotoxin-α, IL-2, IL-4, IL-10, IL-17, or IL-25) or expression of cell surface markers associated with T cell activation (CD69, CD62L, CD44, CD45 RA, CD45RO, and/or CCR7).
[0052] 1. Immunoassays and Fluorochromes 52. The steps of various useful immunodetection methods have been described in the scientific literature, such as, e.g., Maggio et al., Enzyme-Immunoassay, (1987) and Nakamura, et al., Enzyme Immunoassays: Heterogeneous and Homogeneous Systems, Handbook of Experimental Immunology, Vol. 1: Immunochemistry, 27.1-27.20 (1986), each of which is incorporated herein by reference in its entirety and specifically for its teaching regarding immunodetection methods. Immunoassays, in their most simple and direct sense, are binding assays involving binding between antibodies and antigen. Many types and formats of immunoassays are known and all are suitable for detecting the disclosed biomarkers. Examples of immunoassays are enzyme linked immunosorbent assays (ELISAs), radioimmunoassays (RIA), radioimmune precipitation assays (RIPA), immunobead capture assays, Western blotting, dot blotting, gel-shift assays, Flow cytometry, protein arrays, multiplexed bead arrays, magnetic capture, in vivo imaging, fluorescence resonance energy transfer (FRET), and fluorescence recovery/localization after photobleaching (FRAP/FLAP).
[0053] In general, immunoassays involve contacting a sample suspected of containing a molecule of interest (such as the disclosed biomarkers) with an antibody to the molecule of interest or contacting an antibody to a molecule of interest (such as antibodies to the disclosed biomarkers) with a molecule that can be bound by the antibody, as the case may be, under conditions effective to allow the formation of immunocomplexes. Contacting a sample with the antibody to the molecule of interest or with the molecule that can be bound by an antibody to the molecule of interest under conditions effective and for a period of time sufficient to allow the formation of immune complexes (primary immune complexes) is generally a matter of simply bringing into contact the molecule or antibody and the sample and incubating the mixture for a period of time long enough for the antibodies to form immune complexes with, i.e., to bind to, any molecules (e.g., antigens) present to which the antibodies can bind. In many forms of immunoassay, the sample-antibody composition, such as a tissue section, ELISA plate, dot blot or Western blot, can then be washed to remove any non-specifically bound antibody species, allowing only those antibodies specifically bound within the primary immune complexes to be detected.
[0054] Immunoassays can include methods for detecting or quantifying the amount of a molecule of interest (such as the disclosed biomarkers or their antibodies) in a sample, which methods generally involve the detection or quantitation of any immune complexes formed during the binding process. In general, the detection of immunocomplex formation is well known in the art and can be achieved through the application of numerous approaches. These methods are generally based upon the detection of a label or marker, such as any radioactive, fluorescent, biological or enzymatic tags or any other known label.
[0055] As used herein, a label can include a fluorescent dye, a member of a binding pair, such as biotin/streptavidin, a metal (e.g., gold), or an epitope tag that can specifically interact with a molecule that can be detected, such as by producing a colored substrate or fluorescence. Substances suitable for detectably labeling proteins include fluorescent dyes (also known herein as fluorochromes and fluorophores) and enzymes that react with colorometric substrates (e.g., horseradish peroxidase). The use of fluorescent dyes is generally preferred in the practice of the invention as they can be detected at very low amounts. Furthermore, in the case where multiple antigens are reacted with a single array, each antigen can be labeled with a distinct fluorescent compound for simultaneous detection. Labeled spots on the array are detected using a fluorimeter, the presence of a signal indicating an antigen bound to a specific antibody.
[0056] Fluorophores are compounds or molecules that luminesce. Typically fluorophores absorb electromagnetic energy at one wavelength and emit electromagnetic energy at a second wavelength. Representative fluorophores include, but are not limited to, 1,5 IAEDANS; 1,8-ANS; 4-Methylumbelliferone; 5-carboxy-2,7-dichlorofluorescein; 5-Carboxyfluorescein (5-FAM); 5-Carboxynapthofluorescein; 5-Carboxytetramethylrhodamine (5-TAMRA); 5-Hydroxy Tryptamine (5-HAT); 5-ROX (carboxy-X-rhodamine); 6-Carboxyrhodamine 6G; 6-CR 6G; 6-JOE; 7-Amino-4-methylcoumarin; 7-Aminoactinomycin D (7-AAD); 7-Hydroxy-4-I methylcoumarin; 9-Amino-6-chloro-2-methoxyacridine (ACMA); ABQ; Acid Fuchsin; Acridine Orange; Acridine Red; Acridine Yellow; Acriflavin; Acriflavin Feulgen SITSA; Aequorin (Photoprotein); AFPs—AutoFluorescent Protein—(Quantum Biotechnologies) see sgGFP, sgBFP; Alexa Fluor 350™; Alexa Fluor 430™; Alexa Fluor 488™; Alexa Fluor 532™; Alexa Fluor 546™; Alexa Fluor 568™; Alexa Fluor 594™; Alexa Fluor 633™; Alexa Fluor 647™; Alexa Fluor 660™; Alexa Fluor 680™; Alizarin Complexon; Alizarin Red; Allophycocyanin (APC); AMC, AMCA-S; Aminomethylcoumarin (AMCA); AMCA-X; Aminoactinomycin D; Aminocoumarin; Anilin Blue; Anthrocyl stearate; APC-Cy7; APTRA-BTC; APTS; Astrazon Brilliant Red 4G; Astrazon Orange R; Astrazon Red 6B; Astrazon Yellow 7 GLL; Atabrine; ATTO-TAG™ CBQCA; ATTO-TAG™ FQ; Auramine; Aurophosphine G; Aurophosphine; BAO 9 (Bisaminophenyloxadiazole); BCECF (high pH); BCECF (low pH); Berberine Sulphate; Beta Lactamase; BFP blue shifted GFP (Y66H); Blue Fluorescent Protein; BFP/GFP FRET; Bimane; Bisbenzemide; Bisbenzimide (Hoechst); bis-BTC; Blancophor FFG; Blancophor SV; BOBO™-1; BOBO™-3; Bodipy 492/515; Bodipy 493/503; Bodipy 500/510; Bodipy; 505/515; Bodipy 530/550; Bodipy 542/563; Bodipy 558/568; Bodipy 564/570; Bodipy 576/589; Bodipy 581/591; Bodipy 630/650-X; Bodipy 650/665-X; Bodipy 665/676; Bodipy Fl; Bodipy FL ATP; Bodipy Fl-Ceramide; Bodipy R6G SE; Bodipy TMR; Bodipy TMR-X conjugate; Bodipy TMR-X, SE; Bodipy TR; Bodipy TR ATP; Bodipy TR-X SE; BO-PRO™-1; BO-PRO™-3; Brilliant Sulphoflavin FF; BTC; BTC-5N; Calcein; Calcein Blue; Calcium Crimson—; Calcium Green; Calcium Green-1 Ca.sup.2+ Dye; Calcium Green-2 Ca.sup.2+; Calcium Green-5N Ca.sup.2+; Calcium Green-C18 Ca.sup.2+; Calcium Orange; Calcofluor White; Carboxy-X-rhodamine (5-ROX); Cascade Blue™; Cascade Yellow; Catecholamine; CCF2 (GeneBlazer); CFDA; CFP (Cyan Fluorescent Protein); CFP/YFP FRET; Chlorophyll; Chromomycin A; Chromomycin A; CL-NERF; CMFDA; Coelenterazine; Coelenterazine cp; Coelenterazine f; Coelenterazine fcp; Coelenterazine h; Coelenterazine hcp; Coelenterazine ip; Coelenterazine n; Coelenterazine O; Coumarin Phalloidin; C-phycocyanine; CPM I Methylcoumarin; CTC; CTC Formazan; Cy2™; Cy3.1 8; Cy3.5™; Cy3™; Cy5.1 8; Cy5.5™; Cy5™; Cy7™; Cyan GFP; cyclic AMP Fluorosensor (FiCRhR); Dabcyl; Dansyl; Dansyl Amine; Dansyl Cadaverine; Dansyl Chloride; Dansyl DHPE; Dansyl fluoride; DAPI; Dapoxyl; Dapoxyl 2; Dapoxyl 3′DCFDA; DCFH (Dichlorodihydrofluorescein Diacetate); DDAO; DHR (Dihydorhodamine 123); Di-4-ANEPPS; Di-8-ANEPPS (non-ratio); DiA (4-Di 16-ASP); Dichlorodihydrofluorescein Diacetate (DCFH); DiD-Lipophilic Tracer; DiD (Di1C18(5)); DIDS; Dihydorhodamine 123 (DHR); Dil (Di1C18(3)); I Dinitrophenol; DiO (DiOC18(3)); DiR; DiR (Di1C18(7)); DM-NERF (high pH); DNP; Dopamine; DsRed; DTAF; DY-630-NHS; DY-635-NHS; EBFP; ECFP; EGFP; ELF 97; Eosin; Erythrosin; Erythrosin ITC; Ethidium Bromide; Ethidium homodimer-1 (EthD-1); Euchrysin; EukoLight; Europium (111) chloride; EYFP; Fast Blue; FDA; Feulgen (Pararosaniline); FIF (Formaldehyd Induced Fluorescence); FITC; Flazo Orange; Fluo-3; Fluo-4; Fluorescein (FITC); Fluorescein Diacetate; Fluoro-Emerald; Fluoro-Gold (Hydroxystilbamidine); Fluor-Ruby; FluorX; FM 1-43™; FM 4-46; Fura Red™ (high pH); Fura Red™/Fluo-3; Fura-2; Fura-2/BCECF; Genacryl Brilliant Red B; Genacryl Brilliant Yellow 10GF; Genacryl Pink 3G; Genacryl Yellow SGF; GeneBlazer; (CCF2); GFP (S65T); GFP red shifted (rsGFP); GFP wild type′ non-UV excitation (wtGFP); GFP wild type, UV excitation (wtGFP); GFPuv; Gloxalic Acid; Granular blue; Haematoporphyrin; Hoechst 33258; Hoechst 33342; Hoechst 34580; HPTS; Hydroxycoumarin; Hydroxystilbamidine (FluoroGold); Hydroxytryptamine; Indo-1, high calcium; Indo-1 low calcium; Indodicarbocyanine (DiD); Indotricarbocyanine (DiR); Intrawhite Cf; JC-1; JO JO-1; JO-PRO-1; LaserPro; Laurodan; LDS 751 (DNA); LDS 751 (RNA); Leucophor PAF; Leucophor SF; Leucophor WS; Lissamine Rhodamine; Lissamine Rhodamine B; Calcein/Ethidium homodimer; LOLO-1; LO-PRO-1; Lucifer Yellow; Lyso Tracker Blue; Lyso Tracker Blue-White; Lyso Tracker Green; Lyso Tracker Red; Lyso Tracker Yellow; LysoSensor Blue; LysoSensor Green; LysoSensor Yellow/Blue; Mag Green; Magdala Red (Phloxin B); Mag-Fura Red; Mag-Fura-2; Mag-Fura-5; Mag-lndo-1; Magnesium Green; Magnesium Orange; Malachite Green; Marina Blue; I Maxilon Brilliant Flavin 10 GFF; Maxilon Brilliant Flavin 8 GFF; Merocyanin; Methoxycoumarin; Mitotracker Green FM; Mitotracker Orange; Mitotracker Red; Mitramycin; Monobromobimane; Monobromobimane (mBBr-GSH); Monochlorobimane; MPS (Methyl Green Pyronine Stilbene); NBD; NBD Amine; Nile Red; Nitrobenzoxedidole; Noradrenaline; Nuclear Fast Red; i Nuclear Yellow; Nylosan Brilliant lavin EBG; Oregon Green™; Oregon Green™ 488; Oregon Green™ 500; Oregon Green™ 514; Pacific Blue; Pararosaniline (Feulgen); PBFI; PE-Cy5; PE-Cy7; PerCP; PerCP-Cy5.5; PE-TexasRed (Red 613); Phloxin B (Magdala Red); Phorwite AR; Phorwite BKL; Phorwite Rev; Phorwite RPA; Phosphine 3R; PhotoResist; Phycoerythrin B [PE]; Phycoerythrin R [PE]; PKH26 (Sigma); PKH67; PMIA; Pontochrome Blue Black; POPO-1; POPO-3; PO-PRO-1; PO-I PRO-3; Primuline; Procion Yellow; Propidium lodid (P1); PyMPO; Pyrene; Pyronine; Pyronine B; Pyrozal Brilliant Flavin 7GF; QSY 7; Quinacrine Mustard; Resorufin; RH 414; Rhod-2; Rhodamine; Rhodamine 110; Rhodamine 123; Rhodamine 5 GLD; Rhodamine 6G; Rhodamine B; Rhodamine B 200; Rhodamine B extra; Rhodamine BB; Rhodamine BG; Rhodamine Green; Rhodamine Phallicidine; Rhodamine: Phalloidine; Rhodamine Red; Rhodamine WT; Rose Bengal; R-phycocyanine; R-phycoerythrin (PE); rsGFP; S65A; S65C; S65L; S65T; Sapphire GFP; SBFI; Serotonin; Sevron Brilliant Red 2B; Sevron Brilliant Red 4G; Sevron I Brilliant Red B; Sevron Orange; Sevron Yellow L; sgBFP™ (super glow BFP); sgGFP™ (super glow GFP); SITS (Primuline; Stilbene Isothiosulphonic Acid); SNAFL calcein; SNAFL-1; SNAFL-2; SNARF calcein; SNARF1; Sodium Green; SpectrumAqua; SpectrumGreen; SpectrumOrange; Spectrum Red; SPQ (6-methoxy-N-(3 sulfopropyl) quinolinium); Stilbene; Sulphorhodamine B and C; Sulphorhodamine Extra; SYTO 11; SYTO 12; SYTO 13; SYTO 14; SYTO 15; SYTO 16; SYTO 17; SYTO 18; SYTO 20; SYTO 21; SYTO 22; SYTO 23; SYTO 24; SYTO 25; SYTO 40; SYTO 41; SYTO 42; SYTO 43; SYTO 44; SYTO 45; SYTO 59; SYTO 60; SYTO 61; SYTO 62; SYTO 63; SYTO 64; SYTO 80; SYTO 81; SYTO 82; SYTO 83; SYTO 84; SYTO 85; SYTOX Blue; SYTOX Green; SYTOX Orange; Tetracycline; Tetramethylrhodamine (TRITC); Texas Red™; Texas Red-X™ conjugate; Thiadicarbocyanine (DiSC3); Thiazine Red R; Thiazole Orange; Thioflavin 5; Thioflavin S; Thioflavin TON; Thiolyte; Thiozole Orange; Tinopol CBS (Calcofluor White); TIER; TO-PRO-1; TO-PRO-3; TO-PRO-5; TOTO-1; TOTO-3; TriColor (PE-Cy5); TRITC TetramethylRodaminelsoThioCyanate; True Blue; Tru Red; Ultralite; Uranine B; Uvitex SFC; wt GFP; WW 781; X-Rhodamine; XRITC; Xylene Orange; Y66F; Y66H; Y66W; Yellow GFP; YFP; YO-PRO-1; YO-PRO 3; YOYO-1; YOYO-3; Sybr Green; Thiazole orange (interchelating dyes); semiconductor nanoparticles such as quantum dots; or caged fluorophore (which can be activated with light or other electromagnetic energy source), or a combination thereof.
[0057] A modifier unit such as a radionuclide can be incorporated into or attached directly to any of the compounds described herein by halogenation. Examples of radionuclides useful in this embodiment include, but are not limited to, tritium, iodine-125, iodine-131, iodine-123, iodine-124, astatine-210, carbon-11, carbon-14, nitrogen-13, fluorine-18. In another aspect, the radionuclide can be attached to a linking group or bound by a chelating group, which is then attached to the compound directly or by means of a linker. Examples of radionuclides useful in the apset include, but are not limited to, Tc-99m, Re-186, Ga-68, Re-188, Y-90, Sm-153, Bi-212, Cu-67, Cu-64, and Cu-62. Radiolabeling techniques such as these are routinely used in the radiopharmaceutical industry.
[0058] The radiolabeled compounds are useful as imaging agents to diagnose neurological disease (e.g., a neurodegenerative disease) or a mental condition or to follow the progression or treatment of such a disease or condition in a mammal (e.g., a human). The radiolabeled compounds described herein can be conveniently used in conjunction with imaging techniques such as positron emission tomography (PET) or single photon emission computerized tomography (SPECT).
[0059] Labeling can be either direct or indirect. In direct labeling, the detecting antibody (the antibody for the molecule of interest) or detecting molecule (the molecule that can be bound by an antibody to the molecule of interest) include a label. Detection of the label indicates the presence of the detecting antibody or detecting molecule, which in turn indicates the presence of the molecule of interest or of an antibody to the molecule of interest, respectively. In indirect labeling, an additional molecule or moiety is brought into contact with, or generated at the site of, the immunocomplex. For example, a signal-generating molecule or moiety such as an enzyme can be attached to or associated with the detecting antibody or detecting molecule. The signal-generating molecule can then generate a detectable signal at the site of the immunocomplex. For example, an enzyme, when supplied with suitable substrate, can produce a visible or detectable product at the site of the immunocomplex. ELISAs use this type of indirect labeling.
[0060] As another example of indirect labeling, an additional molecule (which can be referred to as a binding agent) that can bind to either the molecule of interest or to the antibody (primary antibody) to the molecule of interest, such as a second antibody to the primary antibody, can be contacted with the immunocomplex. The additional molecule can have a label or signal-generating molecule or moiety. The additional molecule can be an antibody, which can thus be termed a secondary antibody. Binding of a secondary antibody to the primary antibody can form a so-called sandwich with the first (or primary) antibody and the molecule of interest. The immune complexes can be contacted with the labeled, secondary antibody under conditions effective and for a period of time sufficient to allow the formation of secondary immune complexes. The secondary immune complexes can then be generally washed to remove any non-specifically bound labeled secondary antibodies, and the remaining label in the secondary immune complexes can then be detected. The additional molecule can also be or include one of a pair of molecules or moieties that can bind to each other, such as the biotin/avidin pair. In this mode, the detecting antibody or detecting molecule should include the other member of the pair.
[0061] Other modes of indirect labeling include the detection of primary immune complexes by a two-step approach. For example, a molecule (which can be referred to as a first binding agent), such as an antibody, that has binding affinity for the molecule of interest or corresponding antibody can be used to form secondary immune complexes, as described above. After washing, the secondary immune complexes can be contacted with another molecule (which can be referred to as a second binding agent) that has binding affinity for the first binding agent, again under conditions effective and for a period of time sufficient to allow the formation of immune complexes (thus forming tertiary immune complexes). The second binding agent can be linked to a detectable label or signal-genarating molecule or moiety, allowing detection of the tertiary immune complexes thus formed. This system can provide for signal amplification.
[0062] Immunoassays that involve the detection of as substance, such as a protein or an antibody to a specific protein, include label-free assays, protein separation methods (i.e., electrophoresis), solid support capture assays, or in vivo detection. Label-free assays are generally diagnostic means of determining the presence or absence of a specific protein, or an antibody to a specific protein, in a sample. Protein separation methods are additionally useful for evaluating physical properties of the protein, such as size or net charge. Capture assays are generally more useful for quantitatively evaluating the concentration of a specific protein, or antibody to a specific protein, in a sample. Finally, in vivo detection is useful for evaluating the spatial expression patterns of the substance, i.e., where the substance can be found in a subject, tissue or cell.
[0063] Provided that the concentrations are sufficient, the molecular complexes ([Ab-Ag]n) generated by antibody-antigen interaction are visible to the naked eye, but smaller amounts may also be detected and measured due to their ability to scatter a beam of light. The formation of complexes indicates that both reactants are present, and in immunoprecipitation assays a constant concentration of a reagent antibody is used to measure specific antigen ([Ab-Ag]n), and reagent antigens are used to detect specific antibody ([Ab-Ag]n). If the reagent species is previously coated onto cells (as in hemagglutination assay) or very small particles (as in latex agglutination assay), “clumping” of the coated particles is visible at much lower concentrations. A variety of assays based on these elementary principles are in common use, including Ouchterlony immunodiffusion assay, rocket immunoelectrophoresis, and immunoturbidometric and nephelometric assays. The main limitations of such assays are restricted sensitivity (lower detection limits) in comparison to assays employing labels and, in some cases, the fact that very high concentrations of analyte can actually inhibit complex formation, necessitating safeguards that make the procedures more complex. Some of these Group 1 assays date right back to the discovery of antibodies and none of them have an actual “label” (e.g. Ag-enz). Other kinds of immunoassays that are label free depend on immunosensors, and a variety of instruments that can directly detect antibody-antigen interactions are now commercially available. Most depend on generating an evanescent wave on a sensor surface with immobilized ligand, which allows continuous monitoring of binding to the ligand. Immunosensors allow the easy investigation of kinetic interactions and, with the advent of lower-cost specialized instruments, may in the future find wide application in immunoanalysis.
[0064] The use of immunoassays to detect a specific protein can involve the separation of the proteins by electrophoresis. Electrophoresis is the migration of charged molecules in solution in response to an electric field. Their rate of migration depends on the strength of the field; on the net charge, size and shape of the molecules and also on the ionic strength, viscosity and temperature of the medium in which the molecules are moving. As an analytical tool, electrophoresis is simple, rapid and highly sensitive. It is used analytically to study the properties of a single charged species, and as a separation technique.
[0065] Generally the sample is run in a support matrix such as paper, cellulose acetate, starch gel, agarose or polyacrylamide gel. The matrix inhibits convective mixing caused by heating and provides a record of the electrophoretic run: at the end of the run, the matrix can be stained and used for scanning, autoradiography or storage. In addition, the most commonly used support matrices—agarose and polyacrylamide—provide a means of separating molecules by size, in that they are porous gels. A porous gel may act as a sieve by retarding, or in some cases completely obstructing, the movement of large macromolecules while allowing smaller molecules to migrate freely. Because dilute agarose gels are generally more rigid and easy to handle than polyacrylamide of the same concentration, agarose is used to separate larger macromolecules such as nucleic acids, large proteins and protein complexes. Polyacrylamide, which is easy to handle and to make at higher concentrations, is used to separate most proteins and small oligonucleotides that require a small gel pore size for retardation.
[0066] Proteins are amphoteric compounds; their net charge therefore is determined by the pH of the medium in which they are suspended. In a solution with a pH above its isoelectric point, a protein has a net negative charge and migrates towards the anode in an electrical field. Below its isoelectric point, the protein is positively charged and migrates towards the cathode. The net charge carried by a protein is in addition independent of its size—i.e., the charge carried per unit mass (or length, given proteins and nucleic acids are linear macromolecules) of molecule differs from protein to protein. At a given pH therefore, and under non-denaturing conditions, the electrophoretic separation of proteins is determined by both size and charge of the molecules.
[0067] Sodium dodecyl sulphate (SDS) is an anionic detergent which denatures proteins by “wrapping around” the polypeptide backbone—and SDS binds to proteins fairly specifically in a mass ratio of 1.4:1. In so doing, SDS confers a negative charge to the polypeptide in proportion to its length. Further, it is usually necessary to reduce disulfide bridges in proteins (denature) before they adopt the random-coil configuration necessary for separation by size; this is done with 2-mercaptoethanol or dithiothreitol (DTT). In denaturing SDS-PAGE separations therefore, migration is determined not by intrinsic electrical charge of the polypeptide, but by molecular weight.
[0068] Determination of molecular weight is done by SDS-PAGE of proteins of known molecular weight along with the protein to be characterized. A linear relationship exists between the logarithm of the molecular weight of an SDS-denatured polypeptide, or native nucleic acid, and its Rf. The Rf is calculated as the ratio of the distance migrated by the molecule to that migrated by a marker dye-front. A simple way of determining relative molecular weight by electrophoresis (Mr) is to plot a standard curve of distance migrated vs. log 10 MW for known samples, and read off the log Mr of the sample after measuring distance migrated on the same gel.
[0069] In two-dimensional electrophoresis, proteins are fractionated first on the basis of one physical property, and, in a second step, on the basis of another. For example, isoelectric focusing can be used for the first dimension, conveniently carried out in a tube gel, and SDS electrophoresis in a slab gel can be used for the second dimension. One example of a procedure is that of O'Farrell, P. H., High Resolution Two-dimensional Electrophoresis of Proteins, J. Biol. Chem. 250:4007-4021 (1975), herein incorporated by reference in its entirety for its teaching regarding two-dimensional electrophoresis methods. Other examples include but are not limited to, those found in Anderson, L and Anderson, NG, High resolution two-dimensional electrophoresis of human plasma proteins, Proc. Natl. Acad. Sci. 74:5421-5425 (1977), Ornstein, L., Disc electrophoresis, L. Ann. N.Y. Acad. Sci. 121:321349 (1964), each of which is herein incorporated by reference in its entirety for teachings regarding electrophoresis methods. Laemmli, U.K., Cleavage of structural proteins during the assembly of the head of bacteriophage T4, Nature 227:680 (1970), which is herein incorporated by reference in its entirety for teachings regarding electrophoresis methods, discloses a discontinuous system for resolving proteins denatured with SDS. The leading ion in the Laemmli buffer system is chloride, and the trailing ion is glycine. Accordingly, the resolving gel and the stacking gel are made up in Tris-HCl buffers (of different concentration and pH), while the tank buffer is Tris-glycine. All buffers contain 0.1% SDS.
[0070] One example of an immunoassay that uses electrophoresis that is contemplated in the current methods is Western blot analysis. Western blotting or immunoblotting allows the determination of the molecular mass of a protein and the measurement of relative amounts of the protein present in different samples. Detection methods include chemiluminescence and chromogenic detection. Standard methods for Western blot analysis can be found in, for example, D. M. Bollag et al., Protein Methods (2d edition 1996) and E. Harlow & D. Lane, Antibodies, a Laboratory Manual (1988), U.S. Pat. No. 4,452,901, each of which is herein incorporated by reference in their entirety for teachings regarding Western blot methods. Generally, proteins are separated by gel electrophoresis, usually SDS-PAGE. The proteins are transferred to a sheet of special blotting paper, e.g., nitrocellulose, though other types of paper, or membranes, can be used. The proteins retain the same pattern of separation they had on the gel. The blot is incubated with a generic protein (such as milk proteins) to bind to any remaining sticky places on the nitrocellulose. An antibody is then added to the solution which is able to bind to its specific protein.
[0071] The attachment of specific antibodies to specific immobilized antigens can be readily visualized by indirect enzyme immunoassay techniques, usually using a chromogenic substrate (e.g. alkaline phosphatase or horseradish peroxidase) or chemiluminescent substrates. Other possibilities for probing include the use of fluorescent or radioisotope labels (e.g., fluorescein, .sup.125I). Probes for the detection of antibody binding can be conjugated anti-immunoglobulins, conjugated staphylococcal Protein A (binds IgG), or probes to biotinylated primary antibodies (e.g., conjugated avidin/streptavidin).
[0072] The power of the technique lies in the simultaneous detection of a specific protein by means of its antigenicity, and its molecular mass. Proteins are first separated by mass in the SDS-PAGE, then specifically detected in the immunoassay step. Thus, protein standards (ladders) can be run simultaneously in order to approximate molecular mass of the protein of interest in a heterogeneous sample.
[0073] The gel shift assay or electrophoretic mobility shift assay (EMSA) can be used to detect the interactions between DNA binding proteins and their cognate DNA recognition sequences, in both a qualitative and quantitative manner. Exemplary techniques are described in Ornstein L., Disc electrophoresis—I: Background and theory, Ann. NY Acad. Sci. 121:321-349 (1964), and Matsudiara, PT and DR Burgess, SDS microslab linear gradient polyacrylamide gel electrophoresis, Anal. Biochem. 87:386-396 (1987), each of which is herein incorporated by reference in its entirety for teachings regarding gel-shift assays.
[0074] In a general gel-shift assay, purified proteins or crude cell extracts can be incubated with a labeled (e.g., .sup.32P-radiolabeled) DNA or RNA probe, followed by separation of the complexes from the free probe through a nondenaturing polyacrylamide gel. The complexes migrate more slowly through the gel than unbound probe. Depending on the activity of the binding protein, a labeled probe can be either double-stranded or single-stranded. For the detection of DNA binding proteins such as transcription factors, either purified or partially purified proteins, or nuclear cell extracts can be used. For detection of RNA binding proteins, either purified or partially purified proteins, or nuclear or cytoplasmic cell extracts can be used. The specificity of the DNA or RNA binding protein for the putative binding site is established by competition experiments using DNA or RNA fragments or oligonucleotides containing a binding site for the protein of interest, or other unrelated sequence. The differences in the nature and intensity of the complex formed in the presence of specific and nonspecific competitor allows identification of specific interactions.
[0075] Gel shift methods can include using, for example, colloidal forms of COOMASSIE (Imperial Chemicals Industries, Ltd) blue stain to detect proteins in gels such as polyacrylamide electrophoresis gels. Such methods are described, for example, in Neuhoff et al., Electrophoresis 6:427-448 (1985), and Neuhoff et al., Electrophoresis 9:255-262 (1988), each of which is herein incorporated by reference in its entirety for teachings regarding gel shift methods. In addition to the conventional protein assay methods referenced above, a combination cleaning and protein staining composition is described in U.S. Pat. No. 5,424,000, herein incorporated by reference in its entirety for its teaching regarding gel shift methods. The solutions can include phosphoric, sulfuric, and nitric acids, and Acid Violet dye.
[0076] Radioimmune Precipitation Assay (RIPA) is a sensitive assay using radiolabeled antigens to detect specific antibodies in serum. The antigens are allowed to react with the serum and then precipitated using a special reagent such as, for example, protein A sepharose beads. The bound radiolabeled immunoprecipitate is then commonly analyzed by gel electrophoresis. Radioimmunoprecipitation assay (RIPA) is often used as a confirmatory test for diagnosing the presence of HIV antibodies. RIPA is also referred to in the art as Farr Assay, Precipitin Assay, Radioimmune Precipitin Assay; Radioimmunoprecipitation Analysis; Radioimmunoprecipitation Analysis, and Radioimmunoprecipitation Analysis.
[0077] While the above immunoassays that utilize electrophoresis to separate and detect the specific proteins of interest allow for evaluation of protein size, they are not very sensitive for evaluating protein concentration. However, also contemplated are immunoassays wherein the protein or antibody specific for the protein is bound to a solid support (e.g., tube, well, bead, or cell) to capture the antibody or protein of interest, respectively, from a sample, combined with a method of detecting the protein or antibody specific for the protein on the support. Examples of such immunoassays include Radioimmunoassay (RIA), Enzyme-Linked Immunosorbent Assay (ELISA), Flow cytometry, protein array, multiplexed bead assay, and magnetic capture.
[0078] Radioimmunoassay (RIA) is a classic quantitative assay for detection of antigen-antibody reactions using a radioactively labeled substance (radioligand), either directly or indirectly, to measure the binding of the unlabeled substance to a specific antibody or other receptor system. Radioimmunoassay is used, for example, to test hormone levels in the blood without the need to use a bioassay. Non-immunogenic substances (e.g., haptens) can also be measured if coupled to larger carrier proteins (e.g., bovine gamma-globulin or human serum albumin) capable of inducing antibody formation. RIA involves mixing a radioactive antigen (because of the ease with which iodine atoms can be introduced into tyrosine residues in a protein, the radioactive isotopes .sup.125I or .sup.131I are often used) with antibody to that antigen. The antibody is generally linked to a solid support, such as a tube or beads. Unlabeled or “cold” antigen is then adding in known quantities and measuring the amount of labeled antigen displaced. Initially, the radioactive antigen is bound to the antibodies. When cold antigen is added, the two compete for antibody binding sites—and at higher concentrations of cold antigen, more binds to the antibody, displacing the radioactive variant. The bound antigens are separated from the unbound ones in solution and the radioactivity of each used to plot a binding curve. The technique is both extremely sensitive, and specific.
[0079] Enzyme-Linked Immunosorbent Assay (ELISA), or more generically termed EIA (Enzyme ImmunoAssay), is an immunoassay that can detect an antibody specific for a protein. In such an assay, a detectable label bound to either an antibody-binding or antigen-binding reagent is an enzyme. When exposed to its substrate, this enzyme reacts in such a manner as to produce a chemical moiety which can be detected, for example, by spectrophotometric, fluorometric or visual means. Enzymes which can be used to detectably label reagents useful for detection include, but are not limited to, horseradish peroxidase, alkaline phosphatase, glucose oxidase, β-galactosidase, ribonuclease, urease, catalase, malate dehydrogenase, staphylococcal nuclease, asparaginase, yeast alcohol dehydrogenase, alpha.-glycerophosphate dehydrogenase, triose phosphate isomerase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase.
[0080] Variations of ELISA techniques are known to those of skill in the art. In one variation, antibodies that can bind to proteins can be immobilized onto a selected surface exhibiting protein affinity, such as a well in a polystyrene microtiter plate. Then, a test composition suspected of containing a marker antigen can be added to the wells. After binding and washing to remove non-specifically bound immunocomplexes, the bound antigen can be detected. Detection can be achieved by the addition of a second antibody specific for the target protein, which is linked to a detectable label. This type of ELISA is a simple “sandwich ELISA.” Detection also can be achieved by the addition of a second antibody, followed by the addition of a third antibody that has binding affinity for the second antibody, with the third antibody being linked to a detectable label.
[0081] Another variation is a competition ELISA. In competition ELISA's, test samples compete for binding with known amounts of labeled antigens or antibodies. The amount of reactive species in the sample can be determined by mixing the sample with the known labeled species before or during incubation with coated wells. The presence of reactive species in the sample acts to reduce the amount of labeled species available for binding to the well and thus reduces the ultimate signal.
[0082] Regardless of the format employed, ELISAs have certain features in common, such as coating, incubating or binding, washing to remove non-specifically bound species, and detecting the bound immunecomplexes. Antigen or antibodies can be linked to a solid support, such as in the form of plate, beads, dipstick, membrane or column matrix, and the sample to be analyzed applied to the immobilized antigen or antibody. In coating a plate with either antigen or antibody, one will generally incubate the wells of the plate with a solution of the antigen or antibody, either overnight or for a specified period of hours. The wells of the plate can then be washed to remove incompletely adsorbed material. Any remaining available surfaces of the wells can then be “coated” with a nonspecific protein that is antigenically neutral with regard to the test antisera. These include bovine serum albumin (BSA), casein and solutions of milk powder. The coating allows for blocking of nonspecific adsorption sites on the immobilizing surface and thus reduces the background caused by nonspecific binding of antisera onto the surface.
[0083] In ELISAs, a secondary or tertiary detection means rather than a direct procedure can also be used. Thus, after binding of a protein or antibody to the well, coating with a non-reactive material to reduce background, and washing to remove unbound material, the immobilizing surface is contacted with the control clinical or biological sample to be tested under conditions effective to allow immunecomplex (antigen/antibody) formation. Detection of the immunecomplex then requires a labeled secondary binding agent or a secondary binding agent in conjunction with a labeled third binding agent.
[0084] Enzyme-Linked Immunospot Assay (ELISpot) is an immunoassay that can detect an antibody specific for a protein or antigen. In such an assay, a detectable label bound to either an antibody-binding or antigen-binding reagent is an enzyme. When exposed to its substrate, this enzyme reacts in such a manner as to produce a chemical moiety which can be detected, for example, by spectrophotometric, fluorometric or visual means. Enzymes which can be used to detectably label reagents useful for detection include, but are not limited to, horseradish peroxidase, alkaline phosphatase, glucose oxidase, β-galactosidase, ribonuclease, urease, catalase, malate dehydrogenase, staphylococcal nuclease, asparaginase, yeast alcohol dehydrogenase, alpha.-glycerophosphate dehydrogenase, triose phosphate isomerase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. In this assay a nitrocellulose microtiter plate is coated with antigen. The test sample is exposed to the antigen and then reacted similarly to an ELISA assay. Detection differs from a traditional ELISA in that detection is determined by the enumeration of spots on the nitrocellulose plate. The presence of a spot indicates that the sample reacted to the antigen. The spots can be counted and the number of cells in the sample specific for the antigen determined.
[0085] “Under conditions effective to allow immunecomplex (antigen/antibody) formation” means that the conditions include diluting the antigens and antibodies with solutions such as BSA, bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween so as to reduce non-specific binding and to promote a reasonable signal to noise ratio.
[0086] The suitable conditions also mean that the incubation is at a temperature and for a period of time sufficient to allow effective binding. Incubation steps can typically be from about 1 minute to twelve hours, at temperatures of about 20° to 30° C., or can be incubated overnight at about 0° C. to about 10° C.
[0087] Following all incubation steps in an ELISA, the contacted surface can be washed so as to remove non-complexed material. A washing procedure can include washing with a solution such as PBS/Tween or borate buffer. Following the formation of specific immunecomplexes between the test sample and the originally bound material, and subsequent washing, the occurrence of even minute amounts of immunecomplexes can be determined.
[0088] To provide a detecting means, the second or third antibody can have an associated label to allow detection, as described above. This can be an enzyme that can generate color development upon incubating with an appropriate chromogenic substrate. Thus, for example, one can contact and incubate the first or second immunecomplex with a labeled antibody for a period of time and under conditions that favor the development of further immunecomplex formation (e.g., incubation for 2 hours at room temperature in a PBS-containing solution such as PBS-Tween).
[0089] After incubation with the labeled antibody, and subsequent to washing to remove unbound material, the amount of label can be quantified, e.g., by incubation with a chromogenic substrate such as urea and bromocresol purple or 2,2′-azido-di-(3-ethyl-benzthiazoline-6-sulfonic acid [ABTS] and H.sub.2O.sub.2, in the case of peroxidase as the enzyme label. Quantitation can then be achieved by measuring the degree of color generation, e.g., using a visible spectra spectrophotometer.
[0090] Protein arrays are solid-phase ligand binding assay systems using immobilized proteins on surfaces which include glass, membranes, microtiter wells, mass spectrometer plates, and beads or other particles. The assays are highly parallel (multiplexed) and often miniaturized (microarrays, protein chips). Their advantages include being rapid and automatable, capable of high sensitivity, economical on reagents, and giving an abundance of data for a single experiment. Bioinformatics support is important; the data handling demands sophisticated software and data comparison analysis. However, the software can be adapted from that used for DNA arrays, as can much of the hardware and detection systems.
[0091] One of the chief formats is the capture array, in which ligand-binding reagents, which are usually antibodies but can also be alternative protein scaffolds, peptides or nucleic acid aptamers, are used to detect target molecules in mixtures such as plasma or tissue extracts. In diagnostics, capture arrays can be used to carry out multiple immunoassays in parallel, both testing for several analytes in individual sera for example and testing many serum samples simultaneously. In proteomics, capture arrays are used to quantitate and compare the levels of proteins in different samples in health and disease, i.e. protein expression profiling. Proteins other than specific ligand binders are used in the array format for in vitro functional interaction screens such as protein-protein, protein-DNA, protein-drug, receptor-ligand, enzyme-substrate, etc. The capture reagents themselves are selected and screened against many proteins, which can also be done in a multiplex array format against multiple protein targets.
[0092] For construction of arrays, sources of proteins include cell-based expression systems for recombinant proteins, purification from natural sources, production in vitro by cell-free translation systems, and synthetic methods for peptides. Many of these methods can be automated for high throughput production. For capture arrays and protein function analysis, it is important that proteins should be correctly folded and functional; this is not always the case, e.g. where recombinant proteins are extracted from bacteria under denaturing conditions. Nevertheless, arrays of denatured proteins are useful in screening antibodies for cross-reactivity, identifying autoantibodies and selecting ligand binding proteins.
[0093] Protein arrays have been designed as a miniaturization of familiar immunoassay methods such as ELISA and dot blotting, often utilizing fluorescent readout, and facilitated by robotics and high throughput detection systems to enable multiple assays to be carried out in parallel. Commonly used physical supports include glass slides, silicon, microwells, nitrocellulose or PVDF membranes, and magnetic and other microbeads. While microdrops of protein delivered onto planar surfaces are the most familiar format, alternative architectures include CD centrifugation devices based on developments in microfluidics (Gyros, Monmouth Junction, N.J.) and specialised chip designs, such as engineered microchannels in a plate (e.g., The Living Chip™, Biotrove, Woburn, Mass.) and tiny 3D posts on a silicon surface (Zyomyx, Hayward Calif.). Particles in suspension can also be used as the basis of arrays, providing they are coded for identification; systems include colour coding for microbeads (Luminex, Austin, Tex.; Bio-Rad Laboratories) and semiconductor nanocrystals (e.g., QDots™, Quantum Dot, Hayward, Calif.), and barcoding for beads (UltraPlex™, SmartBead Technologies Ltd, Babraham, Cambridge, UK) and multimetal microrods (e.g., Nanobarcodes™ particles, Nanoplex Technologies, Mountain View, Calif.). Beads can also be assembled into planar arrays on semiconductor chips (LEAPS technology, BioArray Solutions, Warren, N.J.).
[0094] Immobilization of proteins involves both the coupling reagent and the nature of the surface being coupled to. A good protein array support surface is chemically stable before and after the coupling procedures, allows good spot morphology, displays minimal nonspecific binding, does not contribute a background in detection systems, and is compatible with different detection systems. The immobilization method used are reproducible, applicable to proteins of different properties (size, hydrophilic, hydrophobic), amenable to high throughput and automation, and compatible with retention of fully functional protein activity. Orientation of the surface-bound protein is recognized as an important factor in presenting it to ligand or substrate in an active state; for capture arrays the most efficient binding results are obtained with orientated capture reagents, which generally require site-specific labeling of the protein.
[0095] Both covalent and noncovalent methods of protein immobilization are used and have various pros and cons. Passive adsorption to surfaces is methodologically simple, but allows little quantitative or orientational control; it may or may not alter the functional properties of the protein, and reproducibility and efficiency are variable. Covalent coupling methods provide a stable linkage, can be applied to a range of proteins and have good reproducibility; however, orientation may be variable, chemical derivatization may alter the function of the protein and requires a stable interactive surface. Biological capture methods utilizing a tag on the protein provide a stable linkage and bind the protein specifically and in reproducible orientation, but the biological reagent must first be immobilized adequately and the array may require special handling and have variable stability.
[0096] Several immobilization chemistries and tags have been described for fabrication of protein arrays. Substrates for covalent attachment include glass slides coated with amino- or aldehyde-containing silane reagents. In the Versalinx™ system (Prolinx, Bothell, Wash.) reversible covalent coupling is achieved by interaction between the protein derivatised with phenyldiboronic acid, and salicylhydroxamic acid immobilized on the support surface. This also has low background binding and low intrinsic fluorescence and allows the immobilized proteins to retain function. Noncovalent binding of unmodified protein occurs within porous structures such as HydroGel™ (PerkinElmer, Wellesley, Mass.), based on a 3-dimensional polyacrylamide gel; this substrate is reported to give a particularly low background on glass microarrays, with a high capacity and retention of protein function. Widely used biological coupling methods are through biotin/streptavidin or hexahistidine/Ni interactions, having modified the protein appropriately. Biotin may be conjugated to a poly-lysine backbone immobilized on a surface such as titanium dioxide (Zyomyx) or tantalum pentoxide (Zeptosens, Witterswil, Switzerland).
[0097] Array fabrication methods include robotic contact printing, ink-jetting, piezoelectric spotting and photolithography. A number of commercial arrayers are available [e.g. Packard Biosciences] as well as manual equipment [V & P Scientific]. Bacterial colonies can be robotically gridded onto PVDF membranes for induction of protein expression in situ.
[0098] At the limit of spot size and density are nanoarrays, with spots on the nanometer spatial scale, enabling thousands of reactions to be performed on a single chip less than 1 mm square. BioForce Laboratories have developed nanoarrays with 1521 protein spots in 85 sq microns, equivalent to 25 million spots per sq cm, at the limit for optical detection; their readout methods are fluorescence and atomic force microscopy (AFM).
[0099] Fluorescence labeling and detection methods are widely used. The same instrumentation as used for reading DNA microarrays is applicable to protein arrays. For differential display, capture (e.g., antibody) arrays can be probed with fluorescently labeled proteins from two different cell states, in which cell lysates are directly conjugated with different fluorophores (e.g. Cy-3, Cy-5) and mixed, such that the color acts as a readout for changes in target abundance. Fluorescent readout sensitivity can be amplified 10-100 fold by tyramide signal amplification (TSA) (PerkinElmer Lifesciences). Planar waveguide technology (Zeptosens) enables ultrasensitive fluorescence detection, with the additional advantage of no intervening washing procedures. High sensitivity can also be achieved with suspension beads and particles, using phycoerythrin as label (Luminex) or the properties of semiconductor nanocrystals (Quantum Dot). A number of novel alternative readouts have been developed, especially in the commercial biotech arena. These include adaptations of surface plasmon resonance (HTS Biosystems, Intrinsic Bioprobes, Tempe, Ariz.), rolling circle DNA amplification (Molecular Staging, New Haven Conn.), mass spectrometry (Intrinsic Bioprobes; Ciphergen, Fremont, Calif.), resonance light scattering (Genicon Sciences, San Diego, Calif.) and atomic force microscopy [BioForce Laboratories].
[0100] Capture arrays form the basis of diagnostic chips and arrays for expression profiling. They employ high affinity capture reagents, such as conventional antibodies, single domains, engineered scaffolds, peptides or nucleic acid aptamers, to bind and detect specific target ligands in high throughput manner.
[0101] Antibody arrays have the required properties of specificity and acceptable background, and some are available commercially (BD Biosciences, San Jose, Calif.; Clontech, Mountain View, Calif.; BioRad; Sigma, St. Louis, Mo.). Antibodies for capture arrays are made either by conventional immunization (polyclonal sera and hybridomas), or as recombinant fragments, usually expressed in E. coli, after selection from phage or ribosome display libraries (Cambridge Antibody Technology, Cambridge, UK; Biolnvent, Lund, Sweden; Affitech, Walnut Creek, Calif.; Biosite, San Diego, Calif.). In addition to the conventional antibodies, Fab and scFv fragments, single V-domains from camelids or engineered human equivalents (Domantis, Waltham, Mass.) may also be useful in arrays.
[0102] The term “scaffold” refers to ligand-binding domains of proteins, which are engineered into multiple variants capable of binding diverse target molecules with antibody-like properties of specificity and affinity. The variants can be produced in a genetic library format and selected against individual targets by phage, bacterial or ribosome display. Such ligand-binding scaffolds or frameworks include ‘Affibodies’ based on Staph. aureus protein A (Affibody, Bromma, Sweden), ‘Trinectins’ based on fibronectins (Phylos, Lexington, Mass.) and ‘Anticalins’ based on the lipocalin structure (Pieris Proteolab, Freising-Weihenstephan, Germany). These can be used on capture arrays in a similar fashion to antibodies and may have advantages of robustness and ease of production.
[0103] Nonprotein capture molecules, notably the single-stranded nucleic acid aptamers which bind protein ligands with high specificity and affinity, are also used in arrays (SomaLogic, Boulder, Colo.). Aptamers are selected from libraries of oligonucleotides by the Selex™ procedure and their interaction with protein can be enhanced by covalent attachment, through incorporation of brominated deoxyuridine and UV-activated crosslinking (photoaptamers). Photocrosslinking to ligand reduces the crossreactivity of aptamers due to the specific steric requirements. Aptamers have the advantages of ease of production by automated oligonucleotide synthesis and the stability and robustness of DNA; on photoaptamer arrays, universal fluorescent protein stains can be used to detect binding.
[0104] Protein analytes binding to antibody arrays may be detected directly or via a secondary antibody in a sandwich assay. Direct labelling is used for comparison of different samples with different colors. Where pairs of antibodies directed at the same protein ligand are available, sandwich immunoassays provide high specificity and sensitivity and are therefore the method of choice for low abundance proteins such as cytokines; they also give the possibility of detection of protein modifications. Label-free detection methods, including mass spectrometry, surface plasmon resonance and atomic force microscopy, avoid alteration of ligand. What is required from any method is optimal sensitivity and specificity, with low background to give high signal to noise. Since analyte concentrations cover a wide range, sensitivity has to be tailored appropriately; serial dilution of the sample or use of antibodies of different affinities are solutions to this problem. Proteins of interest are frequently those in low concentration in body fluids and extracts, requiring detection in the pg range or lower, such as cytokines or the low expression products in cells.
[0105] An alternative to an array of capture molecules is one made through ‘molecular imprinting’ technology, in which peptides (e.g., from the C-terminal regions of proteins) are used as templates to generate structurally complementary, sequence-specific cavities in a polymerizable matrix; the cavities can then specifically capture (denatured) proteins that have the appropriate primary amino acid sequence (ProteinPrint™, Aspira Biosystems, Burlingame, Calif.).
[0106] Another methodology which can be used diagnostically and in expression profiling is the ProteinChip® array (Ciphergen, Fremont, Calif.), in which solid phase chromatographic surfaces bind proteins with similar characteristics of charge or hydrophobicity from mixtures such as plasma or tumour extracts, and SELDI-TOF mass spectrometry is used to detection the retained proteins.
[0107] Large-scale functional chips have been constructed by immobilizing large numbers of purified proteins and used to assay a wide range of biochemical functions, such as protein interactions with other proteins, drug-target interactions, enzyme-substrates, etc. Generally they require an expression library, cloned into E. coli, yeast or similar from which the expressed proteins are then purified, e.g. via a His tag, and immobilized. Cell free protein transcription/translation is a viable alternative for synthesis of proteins which do not express well in bacterial or other in vivo systems.
[0108] For detecting protein-protein interactions, protein arrays can be in vitro alternatives to the cell-based yeast two-hybrid system and may be useful where the latter is deficient, such as interactions involving secreted proteins or proteins with disulphide bridges. High-throughput analysis of biochemical activities on arrays has been described for yeast protein kinases and for various functions (protein-protein and protein-lipid interactions) of the yeast proteome, where a large proportion of all yeast open-reading frames was expressed and immobilised on a microarray. Large-scale ‘proteome chips’ promise to be very useful in identification of functional interactions, drug screening, etc. (Proteometrix, Branford, Conn.).
[0109] As a two-dimensional display of individual elements, a protein array can be used to screen phage or ribosome display libraries, in order to select specific binding partners, including antibodies, synthetic scaffolds, peptides and aptamers. In this way, ‘library against library’ screening can be carried out. Screening of drug candidates in combinatorial chemical libraries against an array of protein targets identified from genome projects is another application of the approach.
[0110] A multiplexed bead assay, such as, for example, the BD™ Cytometric Bead Array, is a series of spectrally discrete particles that can be used to capture and quantitate soluble analytes. The analyte is then measured by detection of a fluorescence-based emission and flow cytometric analysis. Multiplexed bead assay generates data that is comparable to ELISA based assays, but in a “multiplexed” or simultaneous fashion. Concentration of unknowns is calculated for the cytometric bead array as with any sandwich format assay, i.e. through the use of known standards and plotting unknowns against a standard curve. Further, multiplexed bead assay allows quantification of soluble analytes in samples never previously considered due to sample volume limitations. In addition to the quantitative data, powerful visual images can be generated revealing unique profiles or signatures that provide the user with additional information at a glance.
[0111] Accordingly, in one aspect, disclosed herein are methods of identifying neoantigens any neoantigen disclosed herein, wherein the T cell activity (such as, for example, release of cytokines including, but not limited to IFN-γ, TGF-β, lymphotoxin-α, IL-2, IL-4, IL-10, IL-17, or IL-25) is measured by ELISA, ELISpot, Intracellular cytokine staining, or Chromium Release.
[0112] It is understood and herein contemplated that the disclosed methods of identifying neoantigens can be used to identify neoantigens in any cancer, including, but not limited to B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas. Therefore, in one aspect, also disclosed herein are methods of identifying any neoantigen disclosed herein, wherein the cancer is selected from the group consisting of B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers, small cell lung cancer, non-small cell lung cancer, neuroblastoma, glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancers, melanoma, basal cell carcinoma, squamous cell carcinoma, liver cancer, squamous cell carcinomas of the mouth, throat, larynx, and lung, cervical cancer, cervical carcinoma, breast cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, AIDS-related lymphomas, or AIDS-related sarcomas.
C. Compositions
[0113] Disclosed are the components to be used to prepare the disclosed compositions as well as the compositions themselves to be used within the methods disclosed herein. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutation of these compounds may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular neoantigen is disclosed and discussed and a number of modifications that can be made to a number of molecules including the neoantigen are discussed, specifically contemplated is each and every combination and permutation of neoantigen and the modifications that are possible unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is not individually recited each is individually and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods.
[0114] The methods disclosed herein identify neoantigens that can be used in compositions for the treatment of cancer (either as a therapeutic treatment or prophylactic treatment) as well as for the creating of CAR T cells, TCR T cells, and/or TILs that can be used to treat cancer. Thus, in one aspect, also disclosed herein are neoantigens identified by the methods of identifying any neoantigen disclosed herein. For example, disclosed herein are polypeptides comprising the amino acid sequence AEPKRKSSLFWHAFNRLTPFRK (SEQ ID NO: 2) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the sequence at least comprises LFWHAFNRL (SEQ ID NO: 7), for example, SSLFWHAFNRLTP (SEQ ID NO: 4), RKSSLFWHAFNRL (SEQ ID NO: 3), RKSSLFWHAFNRLTPFR (SEQ ID NO: 6), LFWHAFNRLTPFR (SEQ ID NO: 5), SLFWHAFNRL (SEQ ID NO: 88), SSLFWHAFNRL (SEQ ID NO: 89), SSLFWHAFNRLT (SEQ ID NO: 90), LFWHAFNRLT (SEQ ID NO: 91), LFWHAFNRLTP (SEQ ID NO: 92), SLFWHAFNRLT (SEQ ID NO: 93), LFWHAFNRLTPF (SEQ ID NO: 94); a polypeptide comprising the amino acid sequence FQLLLEKPFQIFCAELWVRDINDHA (SEQ ID NO: 9) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the sequence at least comprises LEKPFQIFCAELW (SEQ ID NO: 12), for example, LEKPFQIFCAELWV (SEQ ID NO: 95) and LEKPFQIFCAELWVR (SEQ ID NO: 96); a polypeptide comprising the amino acid sequence ENSPLGTEFPLNYALDLDVGSNNVQ (SEQ ID NO: 14) or a fragment thereof comprising at least 13 amino acids, wherein any fragment of the sequence at least comprises LGTEFPLNYALDL (SEQ ID NO: 17), for example LGTEFPLNYALDLD (SEQ ID NO: 97), LGTEFPLNYALDLDV (SEQ ID NO: 98), LGTEFPLNYALDLDVG (SEQ ID NO: 99), LGTEFPLNYALDLDVGS (SEQ ID NO: 100), LGTEFPLNYALDLDVGSN (SEQ ID NO: 101), LGTEFPLNYALDLDVGSNN (SEQ ID NO: 102), LGTEFPLNYALDLDVGSNNV (SEQ ID NO: 103), LGTEFPLNYALDLDVGSNNVQ (SEQ ID NO: 104), ENSPLGTEFPLNYALDL (SEQ ID NO: 105), NSPLGTEFPLNYALDL (SEQ ID NO: 106), SPLGTEFPLNYALDL (SEQ ID NO: 107), PLGTEFPLNYALDL (SEQ ID NO: 108), PLGTEFPLNYALDLD (SEQ ID NO: 109), PLGTEFPLNYALDLDV (SEQ ID NO: 110), SPLGTEFPLNYALDLD (SEQ ID NO: 111), SPLGTEFPLNYALDLDV (SEQ ID NO: 112), NSPLGTEFPLNYALDLDV (SEQ ID NO: 113), and ENSPLGTEFPLNYALDLDV (SEQ ID NO: 114); a polypeptide comprising the amino acid sequence MTDDKDVLRNVWFGRIPTCFT (SEQ ID NO: 19) or a fragment thereof comprising at least 11 amino acids, wherein any fragment of the sequence at least comprises KDVLRNVWFGR (SEQ ID NO: 23), for example, DDKDVLRNVWFGR (SEQ ID NO: 22), KDVLRNVWFGRIP (SEQ ID NO: 21), DDKDVLRNVWFGRIP (SEQ ID NO: 24), DKDVLRNVWFGR (SEQ ID NO: 115), KDVLRNVWFGRI (SEQ ID NO: 116), DKDVLRNVWFGRI (SEQ ID NO: 117), DKDVLRNVWFGRIP (SEQ ID NO: 118), DDKDVLRNVWFGRI (SEQ ID NO: 119), DDKDVLRNVWFGRIPT (SEQ ID NO: 120), DDKDVLRNVWFGRIPTC (SEQ ID NO: 121), and TDDKDVLRNVWFGRIPT (SEQ ID NO: 122); a polypeptide comprising the amino acid sequence RLKASLDRPFTNSESAFYSIVGLSS (SEQ ID NO: 26) or a fragment thereof comprising at least 10 amino acids, wherein any fragment of the sequence at least comprises PFTNSESAFY (SEQ ID NO: 30), for example, DRPFTNSESAFYS (SEQ ID NO: 28), PFTNSESAFYS (SEQ ID NO: 123), PFTNSESAFYSI (SEQ ID NO: 124), PFTNSESAFYSIV (SEQ ID NO: 29), RPFTNSESAFY (SEQ ID NO: 125), RPFTNSESAFYS (SEQ ID NO: 126), RPFTNSESAFYSI (SEQ ID NO: 127), RPFTNSESAFYSIV (SEQ ID NO: 128), DRPFTNSESAFY (SEQ ID NO: 129), DRPFTNSESAFYSI (SEQ ID NO: 130), and DRPFTNSESAFYSIV (SEQ ID NO: 131); a polypeptide comprising the amino acid sequence GSGEKVAGRVIVKVCEVTRVKAVRI (SEQ ID NO: 32) or a fragment thereof comprising at least 9 amino acids, wherein any fragment of the sequence at least comprises RVIVKVCEV (SEQ ID NO: 36), for example, KVAGRVIVKVCEV (SEQ ID NO: 33), AGRVIVKVCEVTR (SEQ ID NO: 34), KVAGRVIVKVCEVTRVK (SEQ ID NO: 32), RVIVKVCEVTRVK (SEQ ID NO: 35), and KVAGRVIVKVCEVTRVK (SEQ ID NO: 37); a polypeptide comprising the amino acid sequence YGMYFCMNISSQEDGACVLLRALEP (SEQ ID NO: 39) or a fragment thereof comprising at least 10 amino acids, wherein any fragment of the sequence at least comprises ISSQEDGACV (SEQ ID NO: 43), for example, MNISSQEDGACVL (SEQ ID NO: 41), ISSQEDGACVLLR (SEQ ID NO: 42), MNISSQEDGACVLLR (SEQ ID NO: 44), ISSQEDGACVL (SEQ ID NO: 132), ISSQEDGACVLL (SEQ ID NO: 133), NISSQEDGACV (SEQ ID NO: 134), NISSQEDGACVL (SEQ ID NO: 135), NISSQEDGACVLL, (SEQ ID NO: 136), NISSQEDGACVLLR (SEQ ID NO: 137), and MNISSQEDGACVLL (SEQ ID NO: 138); a polypeptide comprising the amino acid sequence MKLTSLMCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 46) or a fragment thereof comprising at least 26 amino acids, wherein any fragment of the sequence at least comprises VSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 60), for example, MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 49), KSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 50), GCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 51), GHVPISMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 52), SMCVSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 53), VSTCSSLPTASCALDLTVLAENSHQVGA (SEQ ID NO: 54), MCNPVKSPFFGCVCGHVPISMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 57), VSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 139), CVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 140), MCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 141), MCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 142), SMCVSTCSSLPTASCALDLTVLAENSHQV (SEQ ID NO: 143), SMCVSTCSSLPTASCALDLTVLAENSHQVG (SEQ ID NO: 144); a polypeptide comprising the amino acid sequence MLMAQEALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRR MEGAPAGPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 67) or a fragment thereof comprising at least 61 amino acids, wherein any fragment of the sequence at least comprises VPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSRR (SEQ ID NO: 73), for example, QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKR (SEQ ID NO: 74), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR R (SEQ ID NO: 145), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RR (SEQ ID NO: 146), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRR (SEQ ID NO: 147), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRR (SEQ ID NO: 148), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RP (SEQ ID NO: 149), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRP (SEQ ID NO: 150), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRP (SEQ ID NO: 151), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRP (SEQ ID NO: 152), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPW (SEQ ID NO: 153), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPW (SEQ ID NO: 154), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPW (SEQ ID NO: 155), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPW (SEQ ID NO: 156), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWK (SEQ ID NO: 157), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWK (SEQ ID NO: 158), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWK (SEQ ID NO: 159), QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWK (SEQ ID NO: 160), RVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLSR RPWKR (SEQ ID NO: 161), RRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCLS RRPWKR (SEQ ID NO: 162), ERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRCL SRRPWKR (SEQ ID NO: 163), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSP (SEQ ID NO: 61), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAGSCPGM (SEQ ID NO: 62), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKRSWSAG (SEQ ID NO: 63), EALAFLMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPA GPGGRTAACLSCTSRCLSRRPWKR (SEQ ID NO: 64), LMAQGAMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGG RTAACLSCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 68), AMLAAQERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACL SCTSRCLSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 69), and QERRVPRAAEVPGAQGQQGPRGREEAPRGVRMAVPLLRRMEGAPAGPGGRTAACLSCTSRC LSRRPWKRSWSAGSCPGMPHRSPDQGRF (SEQ ID NO: 70); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 76 (KDAARPAYWVPDYEILHCHNCRKEF); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 78 (SMPLWDFQGSTMRTSQYVRLTPDER); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 80 (YPGIKFEELFPDCIFPSESERDKIK); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 82 (LLKTRRILKCSYLYGFFLEPKSTKK); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 84 (PFYLGHTIKSGDFEYVGMEGGIVLS); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 86 (LLLAGYLAQQYLLLPTPKVIGIDLG); a polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 166 (TSMLILSNLVFLEGNEVGKTYWNRI); and/or variant of any of the preceding polypeptides or polypeptide fragments comprising a conservative amino acid substitution. Also disclosed are CAR T cells engineered to express a receptor (such as, for example, a T cell receptor) that can recognize one or more of the neoantigens disclosed herein. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein can be further engineered to knockout or knockdown Zinc finger and BTB domain-containing protein 7B (ThPOK), Lysine-specific histone demethylase 1 (LSD1), programmed cell death protein (PD1), and/or protein phosphatase (PP2A) to enhance their function such as cytotoxic activity and persistence or survival in vivo after adoptive transfer to a cancer patient. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein is treated with a small molecular inhibitors such 2-PAPC to enhance T cell cytotoxic activity or the ability to proliferate and survive in vivo.
[0115] It is understood and herein contemplated that the disclosed that once the sequence of the neoantigens is identified, the skilled artisan would have full knowledge of the nucleic acids that would encode said amino acid neoantigens and it would be well within the skill set of the skilled artisan to make said nucleic acid constructs. Thus, in one aspect, also disclosed herein are nucleic acids encoding a polypeptide for any neoantigen disclosed herein.
[0116] 1. Homology
[0117] It is understood that one way to define any known variants and derivatives or those that might arise, of the disclosed genes and proteins herein is through defining the variants and derivatives in terms of homology and/or identify to specific known sequences. For example SEQ ID NO: 1 sets forth a particular sequence of an MHC Class II epitope peptide. Specifically disclosed are variants of these and other genes and proteins herein disclosed which have at least, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 percent homology to the stated sequence. Those of skill in the art readily understand how to determine the homology of two proteins or nucleic acids, such as genes. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.
[0118] Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.
[0119] The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment.
[0120] Nucleic Acids
[0121] There are a variety of molecules disclosed herein that are nucleic acid based, including for example the nucleic acids that encode, for example SEQ ID NO: 1, or any of the nucleic acids disclosed herein or fragments thereof, as well as various functional nucleic acids. The disclosed nucleic acids are made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if, for example, an antisense molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the antisense molecule be made up of nucleotide analogs that reduce the degradation of the antisense molecule in the cellular environment.
[0122] a) Nucleotides and Related Molecules
[0123] A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), cytosin-1-yl (C), guanin-9-yl (G), uracil-1-yl (U), and thymin-1-yl (T). The sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide is pentavalent phosphate. An non-limiting example of a nucleotide would be 3′-AMP (3′-adenosine monophosphate) or 5′-GMP (5′-guanosine monophosphate). There are many varieties of these types of molecules available in the art and available herein.
[0124] A nucleotide analog is a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties. Modifications to nucleotides are well known in the art and would include for example, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, and 2-aminoadenine as well as modifications at the sugar or phosphate moieties. There are many varieties of these types of molecules available in the art and available herein.
[0125] Nucleotide substitutes are molecules having similar functional properties to nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson-Crick or Hoogsteen manner, but which are linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to conform to a double helix type structure when interacting with the appropriate target nucleic acid. There are many varieties of these types of molecules available in the art and available herein.
[0126] It is also possible to link other types of molecules (conjugates) to nucleotides or nucleotide analogs to enhance for example, cellular uptake. Conjugates can be chemically linked to the nucleotide or nucleotide analogs. Such conjugates include but are not limited to lipid moieties such as a cholesterol moiety. (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556). There are many varieties of these types of molecules available in the art and available herein.
[0127] A Watson-Crick interaction is at least one interaction with the Watson-Crick face of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a nucleotide, nucleotide analog, or nucleotide substitute includes the C2, N1, and C6 positions of a purine based nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 positions of a pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute.
[0128] A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The Hoogsteen face includes the N7 position and reactive groups (NH2 or 0) at the C6 position of purine nucleotides.
[0129] b) Primers and Probes
[0130] Disclosed are compositions including primers and probes, which are capable of interacting with the disclosed nucleic acids, such as the neoantigens disclosed herein. In certain embodiments the primers are used to support DNA amplification reactions. Typically the primers will be capable of being extended in a sequence specific manner. Extension of a primer in a sequence specific manner includes any methods wherein the sequence and/or composition of the nucleic acid molecule to which the primer is hybridized or otherwise associated directs or influences the composition or sequence of the product produced by the extension of the primer. Extension of the primer in a sequence specific manner therefore includes, but is not limited to, PCR, DNA sequencing, DNA extension, DNA polymerization, RNA transcription, or reverse transcription. Techniques and conditions that amplify the primer in a sequence specific manner are preferred. In certain embodiments the primers are used for the DNA amplification reactions, such as PCR or direct sequencing. It is understood that in certain embodiments the primers can also be extended using non-enzymatic techniques, where for example, the nucleotides or oligonucleotides used to extend the primer are modified such that they will chemically react to extend the primer in a sequence specific manner. Typically the disclosed primers hybridize with the disclosed nucleic acids or region of the nucleic acids or they hybridize with the complement of the nucleic acids or complement of a region of the nucleic acids.
[0131] The size of the primers or probes for interaction with the nucleic acids in certain embodiments can be any size that supports the desired enzymatic manipulation of the primer, such as DNA amplification or the simple hybridization of the probe or primer. A typical primer or probe would be at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.
[0132] In other embodiments a primer or probe can be less than or equal to 6, 7, 8, 9, 10, 11, 12 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.
[0133] In certain embodiments this product is at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.
[0134] In other embodiments the product is less than or equal to 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.
[0135] Peptides
[0136] a) Protein Variants
[0137] As discussed herein there are numerous variants of the neoantigens that are known and herein contemplated. Protein variants and derivatives are well understood to those of skill in the art and in can involve amino acid sequence modifications. For example, amino acid sequence modifications typically fall into one or more of three classes: substitutional, insertional or deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of one to four residues. Immunogenic fusion protein derivatives, such as those described in the examples, are made by fusing a polypeptide sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by recombinant cell culture transformed with DNA encoding the fusion. Deletions are characterized by the removal of one or more amino acid residues from the protein sequence. Typically, no more than about from 2 to 6 residues are deleted at any one site within the protein molecule. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture. Techniques for making substitution mutations at predetermined sites in DNA having a known sequence are well known, for example M13 primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single residues, but can occur at a number of different locations at once; insertions usually will be on the order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final construct. The mutations must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure. Substitutional variants are those in which at least one residue has been removed and a different residue inserted in its place. Such substitutions generally are made in accordance with the following Tables 1 and 2 and are referred to as conservative substitutions.
TABLE-US-00001 TABLE 1 Amino Acid Abbreviations Amino Acid Abbreviations Alanine Ala A allosoleucine AIle Arginine Arg R asparagine Asn N aspartic acid Asp D Cysteine Cys C glutamic acid Glu E Glutamine Gln Q Glycine Gly G Histidine His H Isolelucine Ile I Leucine Leu L Lysine Lys K phenylalanine Phe F proline Pro P pyroglutamic acid pGlu Serine Ser S Threonine Thr T Tyrosine Tyr Y Tryptophan Trp W Valine Val V
TABLE-US-00002 TABLE 2 Amino Acid Substitutions Original Residue Exemplary Conservative Substitutions, others are known in the art. Ala Ser Arg Lys; Gln Asn Gln; His Asp Glu Cys Ser Gln Asn, Lys Glu Asp Gly Pro His Asn; Gln Ile Leu; Val Leu Ile; Val Lys Arg; Gln Met Leu; Ile Phe Met; Leu; Tyr Ser Thr Thr Ser Trp Tyr Tyr Trp; Phe Val Ile; Leu
[0138] Substantial changes in function or immunological identity are made by selecting substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions which in general are expected to produce the greatest changes in the protein properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for sulfation and/or glycosylation.
[0139] For example, the replacement of one amino acid residue with another that is biologically and/or chemically similar is known to those skilled in the art as a conservative substitution. For example, a conservative substitution would be replacing one hydrophobic residue for another, or one polar residue for another. The substitutions include combinations such as, for example, Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are included within the mosaic polypeptides provided herein.
[0140] Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one by glutaminyl or histidyl residues.
[0141] Certain post-translational derivatizations are the result of the action of recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and histidine side chains (T. E. Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of the C-terminal carboxyl.
[0142] It is understood that one way to define the variants and derivatives of the disclosed proteins herein is through defining the variants and derivatives in terms of homology/identity to specific known sequences. Specifically disclosed are variants of these and other proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% or 95% homology to the stated sequence. Those of skill in the art readily understand how to determine the homology of two proteins. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.
[0143] Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.
[0144] The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989.
[0145] It is understood that the description of conservative mutations and homology can be combined together in any combination, such as embodiments that have at least 70% homology to a particular sequence wherein the variants are conservative mutations.
[0146] As this specification discusses various proteins and protein sequences it is understood that the nucleic acids that can encode those protein sequences are also disclosed. This would include all degenerate sequences related to a specific protein sequence, i.e. all nucleic acids having a sequence that encodes one particular protein sequence as well as all nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and derivatives of the protein sequences. Thus, while each particular nucleic acid sequence may not be written out herein, it is understood that each and every sequence is in fact disclosed and described herein through the disclosed protein sequence.
[0147] It is understood that there are numerous amino acid and peptide analogs which can be incorporated into the disclosed compositions. For example, there are numerous D amino acids or amino acids which have a different functional substituent then the amino acids shown in Table 1 and Table 2. The opposite stereo isomers of naturally occurring peptides are disclosed, as well as the stereo isomers of peptide analogs. These amino acids can readily be incorporated into polypeptide chains by charging tRNA molecules with the amino acid of choice and engineering genetic constructs that utilize, for example, amber codons, to insert the analog amino acid into a peptide chain in a site specific way.
[0148] Molecules can be produced that resemble peptides, but which are not connected via a natural peptide linkage. For example, linkages for amino acids or amino acid analogs can include CH.sub.2NH—, —CH.sub.2S—, —CH.sub.2—CH.sub.2—, —CH═CH—(cis and trans), —COCH.sub.2—, —CH(OH)CH.sub.2—, and —CHH.sub.2SO—(These and others can be found in Spatola, A. F. in Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., Marcel Dekker, New York, p. 267 (1983); Spatola, A. F., Vega Data (March 1983), Vol. 1, Issue 3, Peptide Backbone Modifications (general review); Morley, Trends Pharm Sci (1980) pp. 463-468; Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (—CH.sub.2NH—, CH.sub.2CH.sub.2—); Spatola et al. Life Sci 38:1243-1249 (1986) (—CH H.sub.2—S); Hann J. Chem. Soc Perkin Trans. I 307-314 (1982) (—CH—CH—, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 (1980) (—COCH.sub.2—); Jennings-White et al. Tetrahedron Lett 23:2533 (1982) (—COCH.sub.2—); Szelke et al. European Appln, EP 45665 CA (1982): 97:39405 (1982) (—CH(OH)CH.sub.2—); Holladay et al. Tetrahedron. Lett 24:4401-4404 (1983) (—C(OH)CH.sub.2—); and Hruby Life Sci 31:189-199 (1982) (—CH.sub.2—S—); each of which is incorporated herein by reference. A particularly preferred non-peptide linkage is —CH.sub.2NH—. It is understood that peptide analogs can have more than one atom between the bond atoms, such as b-alanine, g-aminobutyric acid, and the like.
[0149] Amino acid analogs and analogs and peptide analogs often have enhanced or desirable properties, such as, more economical production, greater chemical stability, enhanced pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity (e.g., a broad-spectrum of biological activities), reduced antigenicity, and others.
[0150] D-amino acids can be used to generate more stable peptides, because D amino acids are not recognized by peptidases and such. Systematic substitution of one or more amino acids of a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of L-lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize or attach two or more peptides together. This can be beneficial to constrain peptides into particular conformations.
[0151] 4. Pharmaceutical carriers/Delivery of pharmaceutical products
[0152] In one aspect, it is understood and herein contemplated that the disclosed neoantigens can be administered as compositions to a subject with a cancer or likely to develop a cancer. Accordingly, disclosed herein are compositions comprising a therapeutically effective amount of one or more of the neoantigens disclosed herein (including, but not limited to peptides, polypeptides, and proteins of the neoantigens). Additionally, disclosed herein are compositions comprising a therapeutically effective amount of one or more CAR T cells, TCR T-cells and/or TILs; wherein the CAR T cell, TCR T cell, and/or TIL has been engineered to express a receptor for one or more of the neoantigens disclosed herein. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein can be further engineered to knockout or knockdown Zinc finger and BTB domain-containing protein 7B (ThPOK), Lysine-specific histone demethylase 1 (LSD1), programmed cell death protein (PD1), and/or Protein phosphatase 2 (PP2A) to enhance their function such as cytotoxic activity and persistence or survival in vivo after adoptive transfer to a cancer patient. In one aspect, the TCR T cell, CAR T cell, or TIL specific for one or more of the neoantigens disclosed herein is treated with a small molecular inhibitors such 2-PAPC to enhance T cell cytotoxic activity or the ability to proliferate and survive in vivo.
[0153] As described above, the compositions can also be administered in vivo in a pharmaceutically acceptable carrier. By “pharmaceutically acceptable” is meant a material that is not biologically or otherwise undesirable, i.e., the material may be administered to a subject, along with the nucleic acid or vector, without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition in which it is contained. The carrier would naturally be selected to minimize any degradation of the active ingredient and to minimize any adverse side effects in the subject, as would be well known to one of skill in the art.
[0154] The compositions may be administered orally, parenterally (e.g., intravenously), by intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, topically or the like, including topical intranasal administration or administration by inhalant. As used herein, “topical intranasal administration” means delivery of the compositions into the nose and nasal passages through one or both of the nares and can comprise delivery by a spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or vector. Administration of the compositions by inhalant can be through the nose or mouth via delivery by a spraying or droplet mechanism. Delivery can also be directly to any area of the respiratory system (e.g., lungs) via intubation. The exact amount of the compositions required will vary from subject to subject, depending on the species, age, weight and general condition of the subject, the severity of the allergic disorder being treated, the particular nucleic acid or vector used, its mode of administration and the like. Thus, it is not possible to specify an exact amount for every composition. However, an appropriate amount can be determined by one of ordinary skill in the art using only routine experimentation given the teachings herein.
[0155] Parenteral administration of the composition, if used, is generally characterized by injection. Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as emulsions. A more recently revised approach for parenteral administration involves use of a slow release or sustained release system such that a constant dosage is maintained. See, e.g., U.S. Pat. No. 3,610,795, which is incorporated by reference herein.
[0156] The materials may be in solution, suspension (for example, incorporated into microparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies, receptors, or receptor ligands. The following references are examples of the use of this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem., 2:447-451, (1991); Bagshawe, K. D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog. Reviews, 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol, 42:2062-2065, (1991)). Vehicles such as “stealth” and other antibody conjugated liposomes (including lipid mediated drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma cells in vivo. The following references are examples of the use of this technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophysica Acta, 1104:179-187, (1992)). In general, receptors are involved in pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which the receptors are sorted, and then either recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The internalization pathways serve a variety of functions, such as nutrient uptake, removal of activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow more than one intracellular pathway, depending on the cell type, receptor concentration, type of ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 10:6, 399-409 (1991)).
[0157] a) Pharmaceutically Acceptable Carriers
[0158] The compositions, including antibodies, can be used therapeutically in combination with a pharmaceutically acceptable carrier.
[0159] Suitable carriers and their formulations are described in Remington: The Science and Practice of Pharmacy (19th ed.) ed. A. R. Gennaro, Mack Publishing Company, Easton, Pa. 1995. Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the formulation to render the formulation isotonic. Examples of the pharmaceutically-acceptable carrier include, but are not limited to, saline, Ringer's solution and dextrose solution. The pH of the solution is preferably from about 5 to about 8, and more preferably from about 7 to about 7.5. Further carriers include sustained release preparations such as semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in the art that certain carriers may be more preferable depending upon, for instance, the route of administration and concentration of composition being administered.
[0160] Pharmaceutical carriers are known to those skilled in the art. These most typically would be standard carriers for administration of drugs to humans, including solutions such as sterile water, saline, and buffered solutions at physiological pH. The compositions can be administered intramuscularly or subcutaneously. Other compounds will be administered according to standard procedures used by those skilled in the art.
[0161] Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, preservatives, surface active agents and the like in addition to the molecule of choice. Pharmaceutical compositions may also include one or more active ingredients such as antimicrobial agents, antiinflammatory agents, anesthetics, and the like.
[0162] The pharmaceutical composition may be administered in a number of ways depending on whether local or systemic treatment is desired, and on the area to be treated. Administration may be topically (including ophthalmically, vaginally, rectally, intranasally), orally, by inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or intramuscular injection. The disclosed antibodies can be administered intravenously, intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdermally.
[0163] Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like.
[0164] Formulations for topical administration may include ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.
[0165] Compositions for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, diluents, emulsifiers, dispersing aids or binders may be desirable.
[0166] Some of the compositions may potentially be administered as a pharmaceutically acceptable acid- or base-addition salt, formed by reaction with inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and substituted ethanolamines.
[0167] b) Therapeutic Uses
[0168] Effective dosages and schedules for administering the compositions may be determined empirically, and making such determinations is within the skill in the art. The dosage ranges for the administration of the compositions are those large enough to produce the desired effect in which the symptoms of the disorder are effected. The dosage should not be so large as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age, condition, sex and extent of the disease in the patient, route of administration, or whether other drugs are included in the regimen, and can be determined by one of skill in the art. The dosage can be adjusted by the individual physician in the event of any counterindications. Dosage can vary, and can be administered in one or more dose administrations daily, for one or several days. Guidance can be found in the literature for appropriate dosages for given classes of pharmaceutical products. For example, guidance in selecting appropriate doses for antibodies can be found in the literature on therapeutic uses of antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noges Publications, Park Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis and Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A typical daily dosage of the antibody used alone might range from about 1 μg/kg to up to 100 mg/kg of body weight or more per day, depending on the factors mentioned above.
D. Methods of Using the Compositions
[0169] 1. Method of Treating Cancer
[0170] The disclosed compositions can be used to treat any disease where uncontrolled cellular proliferation occurs such as cancers. Accordingly, in one aspect, disclosed herein are methods of stimulating an immunological response against a cancer or treating, inhibiting, and/or preventing a cancer comprising administering to a subject a composition comprising a therapeutically effective amount of any of the neoantigens disclosed herein (for example SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163) and/or any neoantigens identified by the method of identifying a neoantigen disclosed herein. In one aspect, it is understood and herein contemplated that the disclosed neoantigens include peptide, polypeptides, and/or proteins of any disclosed neoantigen.
[0171] The term “therapeutically effective” refers to the amount of the composition used is of sufficient quantity to ameliorate one or more causes or symptoms of a disease or disorder. Such amelioration only requires a reduction or alteration, not necessarily elimination.
[0172] The term “treatment” refers to the medical management of a patient with the intent to cure, ameliorate, stabilize, or prevent a disease, pathological condition, or disorder. This term includes active treatment, that is, treatment directed specifically toward the improvement of a disease, pathological condition, or disorder, and also includes causal treatment, that is, treatment directed toward removal of the cause of the associated disease, pathological condition, or disorder. In addition, this term includes palliative treatment, that is, treatment designed for the relief of symptoms rather than the curing of the disease, pathological condition, or disorder; preventative treatment, that is, treatment directed to minimizing or partially or completely inhibiting the development of the associated disease, pathological condition, or disorder; and supportive treatment, that is, treatment employed to supplement another specific therapy directed toward the improvement of the associated disease, pathological condition, or disorder.
[0173] A non-limiting list of different types of cancers that can be treated by the disclosed methods is as follows: lymphomas (Hodgkins and non-Hodgkins), leukemias, carcinomas, carcinomas of solid tissues, squamous cell carcinomas, adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, neuroblastomas, plasmacytomas, histiocytomas, melanomas, adenomas, hypoxic tumors, myelomas, AIDS-related lymphomas or sarcomas, metastatic cancers, or cancers in general.
[0174] A representative but non-limiting list of cancers that the disclosed compositions can be used to treat is the following: lymphoma, B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, lung cancers such as small cell lung cancer and non-small cell lung cancer, neuroblastoma/glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancer, liver cancer, melanoma, squamous cell carcinomas of the mouth, throat, larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, breast cancer, and epithelial cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon cancer, and/or rectal cancers.
[0175] 2. Methods of Treating Autoimmune Diseases
[0176] In one aspect, it is understood and herein contemplated that the disclosed neoantigens can be used in the treatment of autoimmune diseases. As used herein, “autoimmune disease” refers to a set of diseases, disorders, or conditions resulting from an adaptive immune response (T cell and/or B cell response) against the host organism. In such conditions, either by way of mutation or other underlying cause, the host T cells and/or B cells and/or antibodies are no longer able to distinguish host cells from non-self-antigens and attack host cells baring an antigen for which they are specific. Examples of autoimmune diseases that can cause an inflammatory skin disorder include, but are not limited to Achalasia, Acute disseminated encephalomyelitis, Acute motor axonal neuropathy, Addison's disease, Adiposis dolorosa, Adult Still's disease, Agammaglobulinemia, Alopecia areata, Alzheimer's disease, Amyloidosis, Ankylosing spondylitis, Anti-GBM/Anti-TBM nephritis, Antiphospholipid syndrome, Aplastic anemia, Autoimmune angioedema, Autoimmune dysautonomia, Autoimmune encephalomyelitis, Autoimmune enteropathy, Autoimmune hemolytic anemia, Autoimmune hepatitis, Autoimmune inner ear disease (AIED), Autoimmune myocarditis, Autoimmune oophoritis, Autoimmune orchitis, Autoimmune pancreatitis, Autoimmune polyendocrine syndrome, Autoimmune retinopathy, Autoimmune urticaria, Axonal & neuronal neuropathy (AMAN), Baló disease, Behcet's disease, Benign mucosal emphigoid, Bickerstaffs encephalitis, Bullous pemphigoid, Castleman disease (CD), Celiac disease, Chagas disease, Chronic fatigue syndrome, Chronic inflammatory demyelinating polyneuropathy (CIDP), Chronic recurrent multifocal osteomyelitis (CRMO), Churg-Strauss Syndrome (CSS), Eosinophilic Granulomatosis (EGPA), Cicatricial pemphigoid, Cogan's syndrome, Cold agglutinin disease, Congenital heart block, Coxsackie myocarditis, CREST syndrome, Crohn's disease, Dermatitis herpetiformis, Dermatomyositis, Devic's disease (neuromyelitis optica), Diabetes mellitus type 1, Discoid lupus, Dressler's syndrome, Endometriosis, Enthesitis, Eosinophilic esophagitis (EoE), Eosinophilic fasciitis, Erythema nodosum, Essential mixed cryoglobulinemia, Evans syndrome, Felty syndrome, Fibromyalgia, Fibrosing alveolitis, Giant cell arteritis (temporal arteritis), Giant cell myocarditis, Glomerulonephritis, Goodpasture's syndrome, Granulomatosis with Polyangiitis, Graves' disease, Guillain-Barre syndrome, Hashimoto's encephalopathy, Hashimoto's thyroiditis, Hemolytic anemia, Henoch-Schonlein purpura (HSP), Herpes gestationis or pemphigoid gestationis (PG), Hidradenitis Suppurativa (HS) (Acne Inversa), Hypogammalglobulinemia, IgA Nephropathy, IgG4-related sclerosing disease, Immune thrombocytopenic purpura (ITP), Inclusion body myositis (IBM), Interstitial cystitis (IC), Inflamatory Bowel Disease (IBD), Juvenile arthritis, Juvenile diabetes (Type 1 diabetes), Juvenile myositis (JM), Kawasaki disease, Lambert-Eaton syndrome, Leukocytoclastic vasculitis, Lichen planus, Lichen sclerosus, Ligneous conjunctivitis, Linear IgA disease (LAD), Lupus nephritis, Lupus vasculitis, Lyme disease chronic, Meniere's disease, Microscopic polyangiitis (MPA), Mixed connective tissue disease (MCTD), Mooren's ulcer, Mucha-Habermann disease, Multifocal Motor Neuropathy (MMN) or MMNCB, Multiple sclerosis, Myasthenia gravis, Myositis, Narcolepsy, Neonatal Lupus, Neuromyelitis optica, Neutropenia, Ocular cicatricial pemphigoid, Optic neuritis, Ord's thyroiditis, Palindromic rheumatism (PR), PANDAS, Paraneoplastic cerebellar degeneration (PCD), Paroxysmal nocturnal hemoglobinuria (PNH), Parry Romberg syndrome, Pars planitis (peripheral uveitis), Parsonnage-Turner syndrome, Pemphigus, Peripheral neuropathy, Perivenous encephalomyelitis, Pernicious anemia (PA), POEMS syndrome, Polyarteritis nodosa, Polyglandular syndromes type I, II, III, Polymyalgia rheumatica, Polymyositis, Postmyocardial infarction syndrome, Postpericardiotomy syndrome, Primary biliary cirrhosis, Primary sclerosing cholangitis, Progesterone dermatitis, Psoriasis, Psoriatic arthritis, Pure red cell aplasia (PRCA), Pyoderma gangrenosum, Raynaud's phenomenon, Reactive Arthritis, Reflex sympathetic dystrophy, Relapsing polychondritis, Restless legs syndrome (RLS), Retroperitoneal fibrosis, Rheumatic fever, Rheumatoid arthritis, Rheumatoid vasculitis, Sarcoidosis, Schmidt syndrome, Schnitzler syndrome, Scleritis, Scleroderma, Sjögren's syndrome, Sperm & testicular autoimmunity, Stiff person syndrome (SPS), Subacute bacterial endocarditis (SBE), Susac's syndrome, Sydenham chorea, Sympathetic ophthalmia (SO), Systemic Lupus Erythematosus, Systemic scleroderma, Takayasu's arteritis, Temporal arteritis/Giant cell arteritis, Thrombocytopenic purpura (TTP), Tolosa-Hunt syndrome (THS), Transverse myelitis, Type 1 diabetes, Ulcerative colitis (UC), Undifferentiated connective tissue disease (UCTD), Urticaria, Urticarial vasculitis, Uveitis, Vasculitis, Vitiligo, Vogt-Koyanagi-Harada Disease, and Wegener's granulomatosis (or Granulomatosis with Polyangiitis (GPA)). Accordingly, in one aspect, disclosed herein are methods of stimulating an immunological response against an autoimmune disease or treating, inhibiting, and/or preventing an autoimmune disease comprising administering to a subject a composition comprising a therapeutically effective amount of any of the neoantigens disclosed herein (for example SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 46, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 57, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 76, SEQ ID NO: 78, SEQ ID NO: 80, SEQ ID NO: 82, SEQ ID NO: 84, SEQ ID NO: 86, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 90, SEQ ID NO: 91, SEQ ID NO: 92, SEQ ID NO: 93, SEQ ID NO: 94, SEQ ID NO: 95, SEQ ID NO: 96, SEQ ID NO: 97, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID NO: 118, SEQ ID NO: 119, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, and/or SEQ ID NO: 163) and/or any neoantigens identified by the method of identifying a neoantigen disclosed herein. In one aspect, it is understood and herein contemplated that the disclosed neoantigens include peptide, polypeptides, and/or proteins of any disclosed neoantigen.
E. EXAMPLES
[0177] The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary and are not intended to limit the disclosure. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations have been accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric.
Example 1: Dominant Neoantigen-Specific T Cell Response and Multiple-Antigen Specificity of Single T Cell Receptors
[0178] Tumor regression induced by checkpoint blockade therapy and T-cell-based immunotherapy depends on T-cell recognition of tumor antigens, particularly mutation-derived neoantigens. Increasing evidence indicates that tumor mutation loads are positively correlated with clinical response to checkpoint blockade therapy. However, despite the relatively larger numbers of somatic mutations in cancer tissues, the number of neoantigens identified from human cancer patients using tumor-reactive T cells has been unexpectedly low. The molecular mechanisms underlying the low number of neoantigens for T cell recognition remain poorly understood.
[0179] The mutated proteins must be naturally processed to short peptides [generally 9 amino acids for MHC class I molecules and variable length (13-15 amino acids) for MHC-II molecules] with specific MHC-I or -II binding motifs, and then presented on the cell surface for T cell stimulation. Furthermore, the binding affinity of neoantigens to MHC-I or -II molecules is a critical determinant of its immunogenicity and dependent on MHC-I (HLA-A, -B and -C) or MHC-II (HLA-DR, -DQ and DP) molecules. More importantly, there is accumulating evidence supporting that the mutated amino acid of neoantigenic peptides either increases the binding affinity of peptide-MHC complexes or triggers T cell receptor (TCR) to elicit T cell response through MHC/peptide-TCR contact. Therefore, antigen processing, HLA genotypes and TCRs can influence the number and quality of neoantigens that elicit T cell response against cancer cells. Recent studies show that tumor cells may develop multiple strategies to invade immune destruction by regulating antigen processing and specific loss of HLA alleles, thus rendering them susceptible or resistant to immunotherapy. Despite recent identification of many HLA class I-restricted neoantigens and their importance in cancer immunotherapy, growing evidence indicates that neoantigen-specific CD4.sup.+ T cells and their cognate neoantigens play a critical role in tumor regression. However, the current prediction programs for neoantigens are not accurate, particularly for MHC II-restricted neoantigens owing to the open pockets of MHC II molecules, thus limiting our ability to identify the full-spectrum of immunogenic neoantigens.
[0180] Because of the importance of mutation-derived neoantigens in tumor regression induced by checkpoint blockade therapy, a systemic approach may be required for identifying mutation-derived neoantigens for better understanding of T-cell recognition repertoire and potential mechanisms that control the number and quality of neoantigens for T cell recognition. Furthermore, whether a single T-cell receptor (TCR) can recognize two or more neoantigens in cancer patients remains largely unknown. In this study, is described a systemic approach to identify all possible mutation-derived neoantigens recognized by CD4.sup.+ and CD8.sup.+ T cells, and discover the immunodominance of neoantigen recognition by T cell lines and clones derived from cancer patients. The immunodominant neoantigen recognition by T cells is further supported by T cell clone analysis, intracellular staining and TCR profiling of primary T cells. Interestingly, one single TCR has the ability to recognize two neoantigen epitopes in an MHC II restricted fashion, while another TCR recognizes a MHC-I restricted neoantigen epitope when it is expressed in CD8.sup.+ T cells, but recognizes a MHC II-restricted neoantigen epitope when it is expressed in CD4.sup.+ T cells.
[0181] a) Results
(1) Systemic Identification of Somatic Mutation-Derived Neoantigens in 135 Mel
[0182] To systemically identify the maximal number of neoantigens recognized by T cells, primary tumor-infiltrating lymphocyte (TIL) lines were generated from fresh cancer tissues that had been surgically removed from two melanoma patients (#135 and #136). The resultant two TIL cell lines consisted of 54.5% CD4.sup.+, 41.9% CD8.sup.+ T cells (#135) and 82.2% CD4.sup.+, 17.2% CD8.sup.+ T cells (#136) and elicited tumor-specific responses against 135 mel and 136 mel tumor cells, respectively (
[0183] Whole-exome sequencing was performed on libraries generated from genomic DNAs of 135 mel and 135TIL cells. Based on the next-generation sequencing results, around 40,000 single nucleotide variations (SNVs) were distinguished in 135 mel and 135TIL respectively after alignment with the Human Genome Version 19 (hg19). Then a multiple-step filtering pipeline was applied to determine the lower bound of SNV frequency, remove common variations in tumor and normal T cells and limit the SNV type. Finally, 232 somatic missense mutations unique to 135 mel tumor cells (
[0184] To identify true neoantigens among the TMGs recognized by tumor-reactive T cells, HEK293 cells engineered to express MHC-I (HLA-A1, HLA-B8 or HLA-C7) and/or MHC-II molecules (HLA-DR1, HLA-DR3, HLA-DR4, HLA-DR11 and/or HLA-DP4) were used as artificial antigen-presenting cells (APCs). Based on HLA typing of patient #135, 293T-A1 and 293T-B8 (which naturally express HLA-C7) were generated for MHC-I antigen presentation, and 293IMDR3/DP4 and 293IMDR4/DP4 (expressing various HLA-DR molecules and HLA-DP4) for MHC-II antigen presentation. Neoantigens were screened for T cell recognition by transfection of 24 TMG constructs into 2931IMDR3/DP4 and 293IMDR4/DP4 cells, respectively, followed by adding CD4.sup.+135TILs for co-culture overnight. Significant IFN-γ release was found from T cells that were co-cultured with 293IMDR3/DP4 or 293IMDR4/DP4 cells transfected with TMG6, TMG17 or TMG18 (
[0185] To identify MHC-I restricted neoantigens, 293T-A1 or 293T-B8 cells were co-transfected with 24 TMGs, followed by co-culture with CD8.sup.+135TIL cells, and found that CD8.sup.+ 135TIL cells recognized TMG1 presented by 293T-B8 cells, as well as TMG14 presented by 293T-A1 cells (
[0186] Because the 293-engineered APCs do not express all HLA types, immortalized autologous APCs (135EBV-B) were used to serve as APCs for identification of all possible neoantigens presented different MHC-I and II molecules. Due to low transfection efficiency of EBV-B cells, in vitro transcribed (IVT) RNA of 24 TMGs were generated and then electroporated them into 135EBV-B cells, followed by adding 135TIL cells for antigen recognition based on cytokine release assay. All neoantigen-containing TMGs identified above positively responded for T cell recognition, but one new construct (TMG7) presented by EBV-B cells was identified that was strongly recognized by 135TIL, indicating more neoantigens presented by HLA molecules that the artificial APCs did not cover (
(2) Identification of Two Non-Mutated Antigens Derived from Alternative Open Reading Frames and 3′ UTR Region in 135Mel Tumor Cells
[0187] It is shown herein that tumor-reactive T cells are capable of recognizing T cell epitopes derived alternative open reading frames (ORF) and long-noncoding genes. Thus, immunogenic antigens can be generated from various sources or mechanisms [such as alternative ORFs, noncoding regions (3′ and 5′ UTRs, introns, noncoding RNA)]. To explore the possibility that some tumor-reactive T cells can recognize the cryptic T cell epitope, rather than just in the coding region based on exome sequence, the established CD4.sup.+ T-cell clones were tested for their ability recognize four newly identified MHC-II neoantigens, and CD8+ T-cell clones for their ability to recognize three MHC-1 neoantigens in 135 mel. Among all T cell clones, two T cell clones were identified that recognized 135 mel tumor cells, but did not respond to any neoantigen identified. A representative data are shown in
(3) Systemic Identification of Neoantigens from the Second Patient Tumor by Tumor-Reactive T Cells
[0188] Using several different approaches (293-engineered APCs and B cells, as well as different libraries) and tumor-reactive T cells (TIL lines and clones), 6 MHC-II antigens (4 mutation-derived neoantigens and 2 derived from alternative ORF and 3′ UTR region) were identified and 3 MHC-I neoantigens in 135 mel tumor cells. It appears that the number of neoantigens recognized by T cells is rather limited, even though there are 232 somatic mutations (Table 3). To extend the findings, exome sequencing of genomic DNAs isolated from the second patient tumor (136 mel) and 136TIL cells (as a control) was performed and identified 348 somatic mutations that were unique to 136 tumor cells (
TABLE-US-00003 TABLE 3 Amino acid sequences of somatic mutations (patient #135) for screening in each TMG TMG No. Gene name and locus Mutant AA sequence TMG1 AGRN location:chr1:987171 the VDTLAFDGRTFVGYLNAVTESEKAL 33 -th exon(s) of (SEQ ID NO: 167) ENST00000379370 MORN1 location:chr1:2267954 KKAGGRSRGGLHSRGTPPTAQEPPG the 12 -th exon(s) of (SEQ ID NO: 168) ENST00000378531 CLCN6 location:chr1:11894611 GIYDIHVGLRGVLLLEWETEVEMDK the 16 -th exon(s) of (SEQ ID NO: 169) ENST00000312413 ATP13A2 location:chr1:17320224 PLVPEPRRLPVGLLLRALATCHALS the 16 -th exon(s) of (SEQ ID NO: 170) ENST00000452699 KIF17 location:chr1:20998671 the EKMQRKLRAAEVKIKDLQSEFQLEK 12 -th exon(s) of (SEQ ID NO: 171) EN5T00000247986 CELA3B location:chr1:22310198 WNRSCVACGNDIVLIKLSRSAQLGD the 5 -th exon(s) of (SEQ ID NO: 172) ENST00000337107 EPHA10 location:chr1:38197088 QSFNPSIEVQTLEEAASGSRDQSPA the 7 -th exon(s) of (SEQ ID NO: 173) ENST00000373048 TIE1 location:chr1:43773506 the CAPGHFGADCRLRCQCQNGGTCDRF 7 -th exon(s) of EN5T00000372476 (SEQ ID NO: 174) AGBL4 location:chrl:48999906 KHPLLRGPASNYLNSKGDKKSSVNH the 14 -th exon(s) of (SEQ ID NO: 175) ENST00000371839 TXNIP location:chr1:145438902 GSGEKVAGRVIVKVCEVTRVKAVRI the 1 -th exon(s) of (SEQ ID NO: 176) ENST00000582401 TMG2 POGZ location:chr1:151380993 PRTVPVSSNDTPLSALQEAAPLTSS the 14 -th exon(s) of (SEQ ID NO: 177) ENST00000271715 RORC location:chr1:151789697 KICGDKSSGIHYRVITCEGCKGFFR the 2 -th exon(s) of (SEQ ID NO: 178) ENST00000356728 FLG2 location:chr1:152326075 the VHERHETTYGQTEEATGHGHSGHGQ 3 -th exon(s) of (SEQ ID NO: 179) ENST00000388718 IVL location:chr1:152883669 the EQQVGQPKNLEQKEKQLELPEQQEG 2 -th exon(s) of ENST00000368764 (SEQ ID NO: 180) FCRL1 location:chr1:157767607 TPGQLQPIYENELREQSVAVHGRQQ the 8 -th exon(s) of (SEQ ID NO: 181) ENST00000358292 LY9 location:chr1:160783603 the SESNGGSILTVSQTPCDPDLPYICT 3 -th exon(s) of (SEQ ID NO: 182) ENST00000263285 TBX19 location:chr1:168278072 WTSLSSTPHASIMSVPHTNGPINPG the 7 -th exon(s) of (SEQ ID NO: 183) ENST00000367821 RGS16 location:chr1:182572416 LGIFLHKSELGCNTGSTGKFEWGSK the 2 -th exon(s) of (SEQ ID NO: 184) ENST00000367558 FAM129A LEEAYTLVQHQVPEGLSALKEECRA location:chr1:184792424 the 8 -th (SEQ ID NO: 185) exon(s) of ENST00000367511 BRINP3 location:chr1:190129856 QLFLKAQKIVHKFFSLSKRCHKQPL the 7 -th exon(s) of (SEQ ID NO: 186) ENST00000367462 TMG3 CRB1 location:chr1:197403848 QPVLQGFECIANVVFNGQSGQILFR the 7 -th exon(s) of (SEQ ID NO: 187) ENST00000367399 NENF location:chr1:212619328 NLDFKPEDQPHFYIKDEF the 4 -th exon(s) of (SEQ ID NO: 188) ENST00000366988 USH2A location:chr1:216074197 LPPRLSSATPTSIQVVWSTPARNNA the 39 -th exon(s) of (SEQ ID NO: 189) ENST00000307340 CCDC185 HAVEGQKKVQDTSLSSLINYQARKV location:chr1:223568085 the 1 -th (SEQ ID NO: 190) exon(s) of ENST00000366875 ABCB10 location:chrl :229654072 LMDGRTVLVIAHCLSTIKNANMVAV the 13 -th exon(s) of (SEQ ID NO: 191) ENST00000344517 RYR2 location:chr1:237777461 ALGNHRVAHALCNHVDEPQLLYAIE the 37 -th exon(s) of (SEQ ID NO: 192) ENST00000366574 OR2L3 location:chr1:248224771 YAPFVYTYLRPRFLRSPTEDKVLAV the 1 -th exon(s) of (SEQ ID NO: 193) ENST00000359959 CUBN location:chr10:16893368 NNTFASPDSDSNRMYDKNLNCVWII the 60 -th exon(s) of (SEQ ID NO: 194) ENST00000377833 SH2D4B location:chr10:82363415 QRARDEYRHHSLCAIQKGTVAGLSS the 5 -th exon(s) of (SEQ ID NO: 195) ENST00000339284 ATRNL1 ANICHLHTGKCFRTTKGIKGDQCQL location:chr10:117154225 the 20- (SEQ ID NO: 196) th exon(s) of ENST00000355044 TMG4 MUC6 location:chr11:1031883 the VQLRRGPDGSISWIIVELGASVVTV 3 -th exon(s) of (SEQ ID NO: 197) ENST00000421673 OR51A4 location:chr11:4967678 LCLMVDFILIAVFYTLILKTVLGIA the 1 -th exon(s) of (SEQ ID NO: 198) ENST00000380373 HBD location:chr11:5255429 the VGGEALGRLLVVFPWTQRFFESFGD 2 -th exon(s) of ENST00000380299 (SEQ ID NO: 199) OR52E4 location:chr11:5906376 IHILLANLYVVVLPALNPVIYGVRT the 1 -th exon(s) of (SEQ ID NO: 200) ENST00000316987 NCR3LG1 IGVGLVLLIVLISWKKICNKSSSAY location:chr11:17390500 the 4 -th (SEQ ID NO: 201) exon(s) of ENST00000338965 E2F8 location:chr11:19247118 the PVTSSELTAVNFSSFHVTPLKLMVS 202 12 -th exon(s) of ENST00000250024 OR5D16 location:chr11:55607024 HGTILFLYCVPNFKNSRHTVKVASV 203 the 1 -th exon(s) of ENST00000378396 TRIM49 location:chr11:89537525 VTIDCGHSFCRPSFYLNWQDIPFLV 204 the 3 -th exon(s) of ENST00000329758 TMPRSS5 LRCSECGARPLAFRIVGGQSVAPGR 205 location:chr11:113565338 the 8 -th exon(s) of ENST00000299882 CEP164 location:chr11:117258122 EEILRLHQQKEQFLSSLRERLQKAI 206 the 15 -th exon(s) of ENST00000278935 TMG5 PLEKHG6 PLLLHAVLKRSPKARAQEALNAMIE 207 location:chr12:6426830 the 9 -th exon(s) of ENST00000396988 C1S location:chr12:7169847 the VYAEPTMYGEILFPNYPQAYPSEVE 208 3 -th exon(s) of ENST00000328916 GDF3 location:chr12:7843214 the KLLYFNLSAIKEGEQLTLAQLGLDL 209 2 -th exon(s) of ENST00000329913 KRT83 location:chr12:52709846 AAVAQSEQQGEATLSDARCKLAELE 210 the 7 -th exon(s) of ENST00000293670 LRP1 location:chr12:57589739 the SRQWECDGENDCLDQSDEAPKNPHC 211 54 -th exon(s) of ENST00000243077 KIF5A location:chr12:57965854 RLQEVSGHQRKRIAEVLNGLMKDLS 212 the 14 -th exon(s) of ENST00000286452 NR2C1 location:chr12:95442978 SRAFDTLAKALNAGESTACQSSVAG 213 the 9 -th exon(s) of ENST00000333003 CUX2 location:chr12:111757993 EVAPRGRSVPPSLPERPSLATASQN 214 the 17 -th exon(s) of ENST00000261726 PTPN11 LKYDVGGGERFDALTDLVEHYKKNP 215 location:chr12:112892407 the 5 -th exon(s) of ENST00000392597 TBX3 location:chr12:115117732 FPSDHATWQGNYGFGTQTILNSMHK 216 the 3 -th exon(s) of ENST00000257566 TMG6 NCOR2 location:chr12:124810031 PPPPGLPAGSGPFAGPHHAWDEEPK 217 the 48 -th exon(s) of ENST00000429285 SPATA13 AEPKRKSSLFWHAFNRLTPFRK 218 location:chr13:24876879 the 13 -th exon(s) of ENST00000382108 CCDC168 PSQPKLPISSGAEKSRLANSNEGIS 219 location:chr13:103388489 the 4 -th exon(s) of ENST00000322527 COL4A1 TPGPTGPAGQKGKPGSDGIPGSAGE 220 location:chr13:110826298 the 40 -th exon(s) of ENST00000375820 ARHGAP5 RHQREIVEKAKEKFQEMLFEHSELF 221 location:chr14:32561340 the 2 -th exon(s) of ENST00000345122 MIPOL1 location:chr14:37838743 AALSKCKRLEQEFHHVKEQNQTSAN 222 the 11 -th exon(s) of ENST00000327441 SPTB location:chr14:65263382 the EEAEYRRELALRHELIRQEKLEQLA 223 10 -th exon(s) of ENST00000389722 NPAP1 location:chr15:24924213 GQAAWDPTGHSMTAAPQGASNIPVF 224 the 1 -th exon(s) of ENST00000329468 RYR3 location:chr15:34065794 YHILCSLYSLGTRKNIYVERQRPAL 225 the 64 -th exon(s) of ENST00000389232 RASGRP1 LLFDHLEPEELSKHLTYLEFKSFRR 226 location:chr15:38808442 the 6 -th exon(s) of ENST00000310803 TMG7 RASGRP1 MGTLGKEREAPRKPSHGCR 227 location:chr15:38856810 the 1 -th exon(s) of ENST00000310803 TRIM69 location:chr15:45047438 KKLPLLKGHPQCQEHGENLKLFSKP 228 the 2 -th exon(s) of ENST00000329464 CYP11A1 FGERQGMLEEVVSPEAQRFIDAIYQ 229 location:chr15:74636279 the 4 -th exon(s) of ENST00000268053 CHSY1 location:chr15:101775370 RRMVPHIGKCLRKMYTTHEDVEVGR 230 the 2 -th exon(s) of ENST00000254190 MPG location:chr16:135400 the YGMYFCMNISSQEDGACVLLRALEP 231 4 -th exon(s) of ENST00000397817 CLEC16A VARSAAVETASLFPSLVPARQPTIS 232 location:chr16:11272324 the 23 -th exon(s) of EN5T00000409790 TXNDC11 VIIPAKPPVSFFFLRSPVLDLFQGQ 233 location:chr16:11830039 the 2 -th exon(s) of ENST00000283033 ABCC1 location:chr16:16139715 PQILKLLIKFVNNTKAPDWQGYFYT 234 the 9 -th exon(s) of ENST00000399410 Cl6orf58 location:chr16:31504800 PAPSLSLGVPLHLLVSSVFELQQLV 235 the 9 -th exon(s) of EN5T00000327237 ADCY7 location:chr16:50324484 YVECLLRRWLRAFALLTWACLVALG 236 the 2 -th exon(s) of ENST00000254235 TMG8 ADCY7 location:chr16:50324485 VECLLRRWLRALTLLTWACLVALGY 237 the 2 -th exon(s) of ENST00000254235 DYNC1LI2 KLQGAEHGKKGRDLEYLYLSVHDED 238 location:chr16:66783144 the 3 -th exon(s) of ENST00000258198 FBXL8 location:chr16:67196929 RAPGLRGLRLECCGEKPLFDAGRDV 239 the 3 -th exon(s) of ENST00000258200 SLC12A4 IRPKVSSLLGKLISYTNLTQGAKEH 240 location:chr16:67995561 the 3 -th exon(s) of ENST00000316341 PMFBP1 location:chr16:72174425 DHSKVRIYTSPCIIQEHQETQKRLS 241 the 6 -th exon(s) of ENST00000237353 MYH4 location:chr17:10356983 DDLELTLAKVEKKKHATENKVKNLT 242 the 23 -th exon(s) of ENST00000255381 DNAH9 location:chr17:11833330 SPEGHIIPQGILQNSIKITNEPPTG 243 the 9 -th exon(s) of ENST00000608377 FLCN location:chr17:17117084 EDTQKLLSILGAFEEDNVKLLKFWM 244 the 14 -th exon(s) of ENST00000285071 FLII location:chr17:18160249 the LKLNRTGLCYLPKELAALQKLEHLS 245 2 -th exon(s) of ENST00000545457 SHMT1 location:chr17:18232625 TPALTSRGLLEKNFQKVAHFIHRGI 246 the 10 -th exon(s) of ENST00000352886 TMG9 FBXW10 location:chr17:18682399 RIYTALDPFRVNAEFVLLTVKEEKE 247 the 13 -th exon(s) of ENST00000301938 SLFN14 location:chr17:33884597 LREKGFRAQRGRLRVKKLHPQQVLN 248 the 1 -th exon(s) of ENST00000415846 KRT24 location:chr17:38859509 GLFSGGEKQTMQSLNDRLANYLDKV 249 the 1 -th exon(s) of ENST00000264651 KRT31 location:chr17:39551303 LQAQHNLRDSLEKTLTESEARYSSQ 250 the 6 -th exon(s) of ENST00000251645 CDC27 location:chr17:45234359 PDTVPLGTGTSIFSKQVQNKPKTGR 251 the 7 -th exon(s) of ENST00000066544 CDC27 location:chr17:45234397 SSVSYIDSAVISSDTVPLGTGTSIL 252 the 7 -th exon(s) of ENST00000066544 BZRAP1 location:chr17:56395792 GSGPKDLDLPPGFPGRCTPKSSEPA 253 the 13 -th exon(s) of ENST00000268893 ABCA8 location:chr17:66871534 VAIMVSGRLRCICSIQHLKSKFGKD 254 the 36 -th exon(s) of ENST00000430352 Cl7orf62 location:chr17:80402458 TLFRAGHDQVVVQLHDVRDVSVEEE 255 the 5 -th exon(s) of ENST00000434650 FOXK2 location:chr17:80543800 AQSAPGSPLSSQSVLITVQRQLPQA 256 the 7 -th exon(s) of ENST00000335255 TMG10 L3MBTL4 location:chr18:6244527 NGFQIGMRLEGIHPRHPSVFCVLSV 257 the 6 -th exon(s) of ENST00000400105 DSC1 location:chr18:28720131 MCTTTVTVKIIDNDEGPECHPPVKV 258 the 10 -th exon(s) of ENST00000257197 CCDC178 GTLFHLTKHKTDKMEDKIAEVRRKF 259 location:chr18:30825250 the 14 -th exon(s) of ENST00000403303 TCEB3B location:chr18:44560939 VSHSKGHKSSRQKKRPLCAQGDWHS 260 the 1 -th exon(s) of ENST00000332567 CPLX4 location:chr18:56964118 EMDENQIQMAGDNVDLPEDLRKMVD 261 the 3 -th exon(s) of ENST00000299721 MUC16 location:chr19:9046760 VDTRSGVPTTTILPSIPGVVTSQVT 262 the 5 -th exon(s) of ENST00000397910 ZNF560 location:chr19:9577898 LRTHAGEKPYECIKCGKAFTERSYL 263 the 10 -th exon(s) of ENST00000301480 ATG4D location:chr19:10662790 CQPTVDVSQADFLLESFHCTSPRKM 264 the 8 -th exon(s) of ENST00000309469 ZNF91 location:chr19:23544724 KRIHTGEKPYKCEECGKAFSNSSTL 265 the 4 -th exon(s) of ENST00000300619 KIRREL2 LVPPEAPQVLGGLSVSLVAGVPANL 266 location:chr19:36349639 the 4 -th exon(s) of ENST00000360202 TMG11 ZNF607 location:chr19:38189938 TAPHTFESVEKPFKCEECGKAFSVH 267 the 5 -th exon(s) of ENST00000355202 CYP2A13 ATFDWLFKGYGVVFSNGERAKQLRR 268 location:chr19:41595958 the 3 -th exon(s) of ENST00000330436 PRKD2 location:chr19:47177874 RYITHESDDARWEQFAAEHPLPGSG 269 the 19 -th exon(s) of ENST00000595515 PRKD2 location:chr19:47177874 RWEQFAAEHPLPESGLPTDRDLGGA 270 the 17 -th exon(s) of ENST00000600194 LRRC4B location:chr19:51021783 KCRTGTSMTSVNLLTPNGTLMTHGS 271 the 3 -th exon(s) of ENST00000389201 SIGLEC10 STQAPESQESQEKLHYATLNFPGVR 272 location:chr19:51914457 the 10 -th exon(s) of ENST00000353836 KIR2DL1 FPLGPATHGGTYKCFGSFHDSPYEW 273 location:chr19:55286836 the 4 -th exon(s) of ENST00000336077 NLRP2 location:chr19:55494015 CGDWEKKKPVPVILGSLLNRVMLPK 274 the 5 -th exon(s) of ENST00000537859 NLRP4 location:chr19:56379171 NNKKLTYLNVSCKQLDTGVPLLCEA 275 the 6 -th exon(s) of ENST00000301295 OTOF location:chr2:26684954 the FFTGEKSSDIFVKGWLKGQQEDKQD 276 25 -th exon(s) of ENST00000338581 TMG12 SOS1 location:chr2:39213258 the VFSSSPLHLQPPSLGKKSDHGNAFF 277 23 -th exon(s) of ENST00000402219 EGR4 location:chr2:73519196 the APGDLGEGAEGLSGLLTPPSGEGGS 278 2 -th exon(s) of ENST00000545030 DNAH6 location:chr2:84949894 MYFVIASLSEIDLMYQYSLKYFKQL 279 the 60 -th exon(s) of ENST00000389394 TBC1D8 location:chr2:101654034 KSPLMHPDALVTVFQQSGSQSPDSR 280 the 8 -th exon(s) of ENST00000376840 RNF149 location:chr2:101898473 GDVQEMPAPESPLGRDPAANLSLAL 281 the 6 -th exon(s) of ENST00000295317 ZC3H6 location:chr2:113089764 GGLKSSDKTEPSLGEAILPQKPSPN 282 the 12 -th exon(s) of ENST00000409871 PTPN4 location:chr2:120734607 RFVCEAILKVYEDGFVKPLTTSTNK 283 the 27 -th exon(s) of ENST00000263708 PTPN4 location:chr2:120734627 LKVYEEGFVKPLKTSTNK 284 the 27 -th exon(s) of ENST00000263708 BIN1 location:chr2:127821176 the FYVNTFQSIAGLQENFHKEMSKLNQ 285 9 -th exon(s) of ENST00000351659 NEB location:chr2:152380916 the QAAKQASEVEYRVKHRKEGSHGLSM 286 153 -th exon(s) of ENST00000427231 TMG13 NEB location:chr2:152380916 the TVADRPDIKKATLAAKQASEVEYRA 287 125 -th exon(s) of ENST00000409198 NEB location:chr2:152527587 the QHPDTVKFTSVPNSMGMVLAQHNTK 288 38 -th exon(s) of ENST00000427231 PMS1 location:chr2:190656594 TVRLLSSSQIITLVVSVVKELIENS 289 the 2 -th exon(s) of ENST00000441310 ABCA12 location:chr2:215917230 GTYTFNGSQVLAQILGLEKLLKQNS 290 the 5 -th exon(s) of ENST00000272895 ZNF142 location:chr2:219521044 LLIPPPLSNRGIMGPVQSPCPSRDP 291 the 4 -th exon(s) of ENST00000449707 SLC23A3 PLPEDPGDEEGGFSEPEEMADLLPG 292 location:chr2:220026722 the 11 -th exon(s) of ENST00000295738 TRPM8 location:chr2:234888922 LGLFYFIAGIVFWLHSSNKSSLYSG 293 the 18 -th exon(s) of ENST00000324695 COL6A3 location:chr2:238274484 VSVVANTPSGPVKAFDFDEYQPEML 294 the 11 -th exon(s) of ENST00000353578 UBE2F location:chr2:238925240 PDEGYYQGGKFQLETEVPDAYNMVP 295 the 5 -th exon(s) of ENST00000272930 EBF4 location:chr20:2690234 the LTHEIMCSRCCDQKSCGNRNETPSD 296 7 -th exon(s) of ENST00000380648 TMG14 PLCB1 location:chr20:8862399 ISEDSNHGSAPLFLSSDPGKVNHKT 297 the 32 -th exon(s) of ENST00000338037 PAK7 location:chr20:9561130 the LSKPSEYSDLKWKYQRASSSSPLDY 298 4 -th exon(s) of ENST00000353224 FLRT3 location:chr20:14306307 ISKEEFVIHTIFSPNGMNLYKNNHS 299 the 3 -th exon(s) of ENST00000341420 RPN2 location:chr20:35812712 RLKASLDRPFTNSESAFYSIVGLSS 300 the 2 -th exon(s) of ENST00000373622 ZNFX1 location:chr20:47887948 DNFQQWRTPHQKLTEQPQQAKKLGY 301 the 3 -th exon(s) of ENST00000396105 RNF114 location:chr20:48561937 HGCRKNFFLSKIWSHVATCSKYQNY 302 the 3 -th exon(s) of ENST00000244061 ATP9A location:chr20:50290746 LVMVALQHFAGRSYLQIIRFLLLFS 303 the 11 -th exon(s) of ENST00000338821 CTCFL location:chr20:56093816 PFKCSMCKYASVKVKPFLDLKLHGI 304 the 4 -th exon(s) of ENST00000608158 CTCFL location:chr20:56093816 PFKCSMCKYASVKERHMTAHIRTHT 305 the 4 -th exon(s) of ENST00000432255 CLIC6 location:chr21:36079611 KAGYDGESIGNCSFSQRLFMILWLK 306 the 2 -th exon(s) of ENST00000349499 TMG15 CECR1 location:chr22:17662383 WKKRWDKFIADVDTK 307 the 10 -th exon(s) of ENST00000399837 MYO18B QVAQMRIEYLEQFTVDRAIVSRQEA 308 location:chr22:26348292 the 38 -th exon(s) of EN5T00000335473 MYO18B LAAVRQTLQTDLKTSIRRIADLQAA 309 location:chr22:26388383 the 40 -th exon(s) of ENST00000335473 SEZ6L location:chr22:26709798 GVVLSPNWPEPYLEGEDCIWKIHVG 310 the 9 -th exon(s) of ENST00000360929 ELFN2 location:chr22:37770923 VTKNYDRLQCESSREFAGYPLLVPR 311 the 3 -th exon(s) of ENST00000402918 GTPBP1 location:chr22:39122341 IKLNDTLLLGPDSLGNFLSIAVKSI 312 the 8 -th exon(s) of ENST00000216044 EFCAB6 location:chr22:44064813 GPPTVSPVLVPKNQLLSEHLQKDEQ 313 the 16 -th exon(s) of ENST00000262726 PNPLA3 location:chr22:44323024 VSDGENVLVSDFWSKDEVVDALVCS 314 the 2 -th exon(s) of ENST00000216180 GRAMD4 DRLNEIKGHLEIGLLEKHFLQEELR 315 location:chr22:47033834 the 2 -th exon(s) of ENST00000361034 NUP210 location:chr3:13383354 TLVALDEALDNYIITFLIRGVAIGQ 316 the 23 -th exon(s) of ENST00000254508 TMG16 SEMA3G location:chr3:52469912 IVASQLDNLFPPKPKPEEPPARGGL 317 the 16 -th exon(s) of ENST00000231721 FOXP1 location:chr3:71026116 EIYNWFTRMFAYLRRNAATWKGAIR 318 the 17 -th exon(s) of ENST00000615603 ABI3BP location:chr3:100513845 KPYPEVSQSEPASLETRGIPFIPMI 319 the 22 -th exon(s) of ENST00000284322 RBPJ location:chr4:26426018 the TKVALFNRLRSQRVSTRYLHVEGGN 320 7 -th exon(s) of ENST00000355476 MMRN1 location:chr4:90856108 SSLSEDLESTRQKIQKVNESVVSIA 321 the 6 -th exon(s) of ENST00000264790 BANK1 location:chr4:102965052 RRQSDDDKFCGLSKKQDRARIESPA 322 the 11 -th exon(s) of ENST00000444316 NPY1R location:chr4:164247357 HWVFGEAMCKLNLFVQCVSITVSIF 323 the 2 -th exon(s) of ENST00000296533 TLL1 location:chr4:166915559 the IGFGLEQNNTVKRKVPLQFSGQNEK 324 4 -th exon(s) of ENST00000061240 TLL1 location:chr4:166924631 the GKNCDKFGIVVHKLGHVIGFWHEHT 325 6 -th exon(s) of ENST00000061240 FRG1 location:chr4:190878626 IRCNEAGDIEAKNKTAGEEEMIKIR 326 the 6 -th exon(s) of ENST00000226798 TMG17 DNAH5 location:chr5:13901576 DMISKLYTKQKYNPPLARNQPPIAG 327 the 14 -th exon(s) of ENST00000265104 IL7R location:chr5:35873708 the DHYFKGFWSEWSSSYYFRTPEINNS 328 5 -th exon(s) of ENST00000303115 C7 location:chr5:40979942 the HCQGRNYTLTGRNSCTLPASAEKAC 329 17 -th exon(s) of ENST00000313164 FBN2 location:chr5:127640712 NVCSHGLCVDLQRSYQCICHNGFKA 330 the 45 -th exon(s) of ENST00000262464 PCDHA9 location:chr5:140230541 IIFFLERYYRLLSGAVQIVLFIFLE 331 the 1 -th exon(s) of ENST00000378122 PCDHB7 location:chr5:140552768 FQLLLEKPFQIFCAELWVRDINDHA 332 the 1 -th exon(s) of ENST00000231137 PCDHB16 ENSPLGTEFPLNYALDLDVGSNNVQ 333 location:chr5:140562600 the 1 -th exon(s) of ENST00000609684 GRIA1 location:chr5:153085554 EGRDQTTSDQSNKFGIFNSLWFSLG 334 the 11 -th exon(s) of ENST00000518783 PWWP2A RKDNGLLVRQEACISWFGSPTTSFL 335 location:chr5:159519583 the 2 -th exon(s) of ENST00000307063 GABRP location:chr5:170235679 RNVLYFILETYVLSTFLVVLSWVSF 336 the 8 -th exon(s) of ENST00000265294 TMG18 F13A1 location:chr6:6318782 the LPTVELQGVVPRDVNLQEFLNVTSV 337 2 -th exon(s) of ENST00000264870 ZSCAN31 location:chr6:28294281 NEHRRSHTGEKPHQCKECGKAFSAS 338 the 3 -th exon(s) of ENST00000446474 GLTSCR1L SQIILKGSGQQASSNVSGGLLVHRQ 339 location:chr6:42796807 the 5 -th exon(s) of ENST00000394168 ADGRF5 location:chr6:46834827 CIFRYKNSYSIAAKDVIVHPLPLKL 340 the 13 -th exon(s) of ENST00000265417 PKHD1 location:chr6:51893144 TLSRNISNIAGGKTLVIGVARLMNY 341 the 30 -th exon(s) of ENST00000340994 GFRAL location:chr6:55264062 KHANKITLTGFHFPFNGEVIYAAMC 342 the 7 -th exon(s) of ENST00000340465 EYS location:chr6:66044988 the CSCLSEEDSQEYWYLCFLRWAGNMY 343 11 -th exon(s) of ENST00000503581 TBX18 location:chr6:85447046 VPATHPHLLSGSFCSSPAFHLGPNT 344 the 8 -th exon(s) of ENST00000369663 MAP3K7 location:chr6:91296538 SSSSSAGEMIEALSQVLNFEEIDYK 345 the 1 -th exon(s) of ENST00000369325 ATG5 location:chr6:106764056 MTDDKDVLRNVWFGRIPTCFT 346 the 2 -th exon(s) of ENST00000343245 TMG19 REV3L location:chr6:111678229 AADEKALFHEIASIIKRYDPDILLG 347 the 19 -th exon(s) of ENST00000358835 NHSL1 location:chr6:138753933 APANRENGSQAMSYNCRNNLAFPAH 348 the 6 -th exon(s) of ENST00000343505 DGKB location:chr7:14733729 the LEEWIQGGMTTISLLVLLGLENNVK 349 8 -th exon(s) of ENST00000399322 SP4 location:chr7:21468406 the PENNNKKPKTSGFQDSQPSPLALLA 350 2 -th exon(s) of ENST00000222584 GHRHR location:chr7:31015410 IVLSVGVNFGLFFNIIRILVRKLEP 351 the 10 -th exon(s) of ENST00000326139 SEPT14 location:chr7:55910709 LFEYHDSRVHVCFYFISPTGHSLKS 352 the 5 -th exon(s) of ENST00000388975 PCLO location:chr7:82785328 the KEQGKPEGIIKPLLQQQPPKPIPKQ 353 2 -th exon(s) of ENST00000333891 LRRD1 location:chr7:91793994 IKYVKYLYLDKNKIKTFQGADSGDL 354 the 1 -th exon(s) of ENST00000430130 TSPAN12 SVELLKPSLSRILEHTSMANSFNTH 355 location:chr7:120428705 the 8 -th exon(s) of EN5T00000222747 ZNF783 location:chr7:148963799 RNRNFWILRLPPDSKGEAPKVPVTF 356 the 2 -th exon(s) of ENST00000434415 TMG20 TMEM176A VAVLAGAAAFIYKKRGGTYWALLRT 357 location:chr7:150500521 the 4 -th exon(s) of ENST00000004103 MSRA location:chr8:10102709 the TQVGFAGGYTSNSTYKEVCSEKTGH 358 3 -th exon(s) of ENST00000317173 TEX15 location:chr8:30701132 QVNECEAIMEHCFDCFDFSLSVPFT 359 the 1 -th exon(s) of ENST00000256246 FGFR1 location:chr8:38287286 TRITGEEVEVQDFVPADSGLYACVT 360 the 3 -th exon(s) of ENST00000397091 PCMTD1 location:chr8:52732961 PQNLLREKIMKLSLPESLKAYLTYF 361 the 4 -th exon(s) of ENST00000544451 TOX location:chr8:59728090 the MHPSLPRNIAPKLNNQMPVTVSIAN 362 7 -th exon(s) of EN5T00000361421 COPS5 location:chr8:67958099 AQLGRGSFMLGLQTHDRKSEDKLAK 363 the 7 -th exon(s) of ENST00000357849 TG location:chr8:133925498 the FYQVLTSEASQDRLGCVKCPEGSYS 364 20 -th exon(s) of ENST00000220616 GSDMD location:chr8:144642058 APGQAKIAGGAAESDSSSTSMNVYS 365 the 3 -th exon(s) of ENST00000262580 VPS13A location:chr9:79986036 GAVARPTGGIIDIASSTFQGIKRAT 366 the 67 -th exon(s) of ENST00000357409 TMG21 DIRAS2 location:chr9:93375725 MLVGNKCDESPSCEVQSSEAEALAR 367 the 2 -th exon(s) of ENST00000375765 CDK5RAP2 EERIQALEEDLRKKEREIATEKKNS 368 location:chr9:123290182 the 10 -th exon(s) of ENST00000360190 FBXW2 location:chr9:123550377 RHLSNNLETLLKWDFLKLLPLELSF 369 the 3 -th exon(s) of ENST00000608872 ST6GALNAC6 LWDLRRVRGEAAFAQPLGQGPSSGQ 370 location:chr9:130648830 the 7 -th exon(s) of ENST00000373142 TLR7 location:chrX:12904948 the SVNKISPSGDSSKVGFCSNARTSVE 371 3 -th exon(s) of ENST00000380659 MAGEB1 location:chrX:30269305 IWKFMNVLGAYDEEEHLIYGEPRKF 372 the 4 -th exon(s) of ENST00000378981 FAM47C location:chrX:37027628 PPEAGVSHLCPELPKTRVPPLRPET 373 the 1 -th exon(s) of ENST00000358047 GPR34 location:chrX:41555747 FTICFVPYHAFRLIYISSQLNVSSC 374 the 3 -th exon(s) of ENST00000378142 ZNF81 location:chrX:47775337 CGKAFTQKSTLRTHQRIHTGERSYI 375 the 5 -th exon(s) of ENST00000338637 HDAC6 location:chrX:48673271 LEFQPQLVLVAARFDALQGDPKGEM 376 the 13 -th exon(s) of ENST00000334136 TMG22 XAGE3 location:chrX:52896139 MIWRGRSTCRPRPRRSV 377 the 2 -th exon(s) of ENST00000346279 PHF8 location:chrX:54040917 the SWVENLWPEECVVERPNVQKYCLMS 378 7 -th exon(s) of ENST00000322659 TEX11 location:chrX:69828949 SLESRANEAQWFQKTAWNLAVQCDK 379 the 23 -th exon(s) of ENST00000395889 ACRC location:chrX:70823968 SDDSEAPDDSSDNSEASDDSSDDSE 380 the 8 -th exon(s) of ENST00000373696 PCDH19 location:chrX:99551483 PLHLKSSLPTKPFVSYTIALAPPAR 381 the 5 -th exon(s) of ENST00000420881 NRK location:chrX:105167164 the VGKISPPVYLTNKWVGYNALSEIFR 382 18 -th exon(s) of ENST00000243300 COL4A6 TGPQGFTGSTGLLGLKGERGFPGLL 383 location:chrX:107464534 the 4 -th exon(s) of ENST00000334504 OR13H1 location:chrX:130678638 KLTCSDTSLNEFIILITSIFTLLLP 384 the 1 -th exon(s) of ENST00000338616 MAGEC1 EDSLSPLHFPQFSPQGEDFQSSLQS 385 location:chrX:140995378 the 4 -th exon(s) of ENST00000285879 MAMLD1 KSMLPHFAMVGNCQEPRKLQESGTV 386 location:chrX:149613849 the 1 -th exon(s) of ENST00000426613 TMG23 PDZD4 location:chrX:153073863 GTQTDITFEHIMVLGKLRPPTPPMV 387 the 2 -th exon(s) of ENST00000164640 FLNA location:chrX:153593209 VGKSADFVVEAIEDDVGTLGFSVEG 388 the 12 -th exon(s) of ENST00000369850 F8 location:chrX:154157374 the IKWNEANRPGKVLFLRVATESSAKT 389 14 -th exon(s) of ENST00000360256 ATP13A2 location:chr1:17320225 PLVPEPRRLPVGSLLRALATCHALS 390 the 16 -th exon(s) of ENST00000452699 UBE2F location:chr2:238925240 KDEGYYQGGKFQLETEVPDAYNMVP 391 the 3 -th exon(s) of ENST00000414443 ZNFX1 location:chr20:47887949 DNFQQWRTPHQKSTEQPQQAKKLGY 392 the 3 -th exon(s) of ENST00000396105 TMG24 CTCFL location:chr20:56093816 PFKCSMCKYASVKASKLKRHVRSHT 393 the 5 -th exon(s) of ENST00000371196 FOXP1 location:chr3:71026116 EIYNWFTRMFAYLRRNAATWKNAVR 394 the 11 -th exon(s) of ENST00000614176 F13A1 location:chr6:6318783 the LPTVELQGVVPRSVNLQEFLNVTSV 395 2 -th exon(s) of ENST00000264870 TG location:chr8:133925499 the FYQVLTSEASQDELGCVKCPEGSYS 396 20 -th exon(s) of ENST00000220616 HDAC6 location:chrX:48673272 LEFQPQLVLVAAEFDALQGDPKGEM 397 the 13 -th exon(s) of ENST00000334136 MAMLD1 KSMLPHFAMVGNCQEPRKLQESGKK 398 location:chrX:149613849 the 1 -th exon(s) of ENST00000262858
TABLE-US-00004 TABLE 4 Amino acid sequences of somatic mutations (patient #136) for screening in each TMG TMG No. Gene name and locus Mutant AA sequence TMG1 PRDM16:NM_022114:exon14: APVSQHPGVLTNYLGTSASSPTSES c.C3139T:p.H1047Y, 399 C1QC:NM_172369:exon3:c.G446A: LIRFNAVLTNPQEDYDTSTGKFTCK p.G149E, 400 PIGV:NM_017837:exon3:c.G1111A: QRSKNNKTLEKPNLGFLSPQVFVYV p.D371N, 401 HPDL:NM_032756:exon1:c.G377C: GAATYAVVSSPAAILSLTLLERAGY p.G126A, 402 MROH7:NM 001039464:exon12: FLGPQQIKDLLLGALEGLKGSSEAP c.C2099G:p.A700G, 403 LRRIQ3:NM_001105659:exon7: NAVLREKKQHFFSAYPQPIYTTHPK c.C1135T:p.P379S, 404 FAM19A3:NM_182759:exon4: WCQMEPCLPGEEYKVLPDLSGWSCS c.G326A:p.C109Y, 405 FCRL5:NM_001195388:exon12: GPFATGVAGGLLTIAGLAAGALLLY c.G2573C:p.S858T, 406 FCRL1:NM_001159398:exon2: MLPRLLLLICALLCEPAELFLIAS c.C35T:p.P12L, 407 OR6K2:NM_001005279:exon1: DHYLAICSPLHYSSIMTPKLCTQLT c.C394T:p.P132S, 408 TMG2 VSIG8:NM_001013661:exon3: SINLMNLQVSDTVTYECRVKKTTMA c.C365T:p.A122V, 409 RASAL2:NM_170692:exon14: QNGSRSRQQSSSFRESPVPKVRAIQ c.C3197T:p.S1066F, 410 HHAT:NM_001170588:exon10: TSMLILSNLVFLEGNEVGKTYWNRI c.G1148A:p.G383E, 411 KCNK1:NM_002245:exon2:c.G727A: YVPGEGYNQKFRKLYKIGITCYLLL p.E243K, 412 OR2G3:NM_001001914:exon1: TECILLADMALDWYIAVCKPLHYVV c.C364T:p.R122W, 413 OSBPL5:NM_145638:exon12: TDSRTFYIAEQVFHHPPVSAFHVSN c.C1226T:p.S409F, 414 MRGPRX1:NM_147199:exon1: TISKILYPVMMFFYFAGLSFLSAVS c.C314T:p.S105F, 415 AGBL2:NM_024783:exon10:c.C955T: FRVQNTRKDATYCFTIVNLLKPKSL p.R319C, 416 OR4A16:NM_001005274:exon1: DAIYSTAMSPKLIIDLLCDKIAISL c.G240T:p.M80I, 417 OR4C15:NM_001001920:exon1: RLYMIPVGAFIFFLGNMQNQSFVTE c.C152T:p.S51F, 418 TMG3 OR4C15:NM_001001920:exon1: FLGFLSFLDACFASVITPKMIVDSL c.T379G:p.S127A, 419 OR5F1:NM_003697:exon1:c.G886A: MLNPLIYSLRSKKVKKALANVISRK p.E296K, 420 OR8K3:NM_001005202:exon1: MIYVISVMGNLGIIVLTKLDSRLQT c.G135A:p.M45I, 421 SLC22A10:NM_001039752:exon5: FFVFFLLSRWLVKSARWLIITNKLD c.G841A:p.E281K, 422 PPME1:NM_016147:exon6:c.G476C: PPPIMLIGHSMGAAIAVHTASSNLV p.G159A, 423 MMP10:NM_002425:exon8:c.G1135A: VQAGYPRGIHTLSFPPTIRKIDAAV p.G379S, 424 BUD13:NM_032725:exon3:c.G271T: VVAEFVDERPEELKQMEAFRSSAKW p.V91L, 425 ADAMTS15:NM_139055:exon8: TVEVLSVGKMTPLRVRYSFYLPKEP c.C2375T:p.P792L, 426 SLC6A12:NM_001122848:exon11: QGVPISEVAESGLGLAFIAFPKAVT c.C1079T:p.P360L, 427 CD163L1:NM_001297650:exon7: AGVICSDKADLDIRLVGAHSPCYGR c.C1456A:p.L486I, 428 TMG4 OR6C75:NM_001005497:exon1: VTEFYLLAAMSYNRCMAICKPLHYT c.G355A:p.D119N, 429 SLC5A8:NM_145913:exon10: LSWISQGMSVVYEALCIGMAALASL c.G1178A:p.G393E, 430 BTBD11:NM_001018072:exon2: MFSQSELRTIEQFLLATRVGSIAEL c.C1247T:p.S416F, 431 RASAL1:NM_001193520:exon17: KKRYVWLSGETLFFSKSPEWQMCHS c.C1808T:p.S603F, 432 OR4K5:NM_001005483:exon1: MSRRTCTVLVMIFWAVSLVHTLSQL c.C443T:p.S148F, 433 STON2:NM_033104:exon6:c.C2686T: PFVHPTTLPLLFFLAMLTMFAW p.L896F, 434 CHD2:NM_001271:exon17:c.G2096A: QSLHKVLEPFLLQRVKKDVEKSLPA p.R699Q, 435 CHD2:NM_001271:exon20:c.C2519T: TIKHYPFQRLDGFIKGEIRKQALDH p.S840F, 436 TSC2:NM_000548:exon31:c.C3680T: PLSPFSSDINNMLLQELSNALMAAE p.P1227L, 437 CACNG3:NM_006539:exon3:c.G436A: VILSAGIFFVSARLSNIIGIIVYIS p.G146R, 438 TMG5 LPCAT2:NM_017839:exon1:c.C127T: RQASFFPPPVPNSFVQQTQIGSARR p.P43S, 439 LPCAT2:NM_017839:exon1:c.C128T: RQASFFPPPVPNLFVQQTQIGSARR p.P43L, 440 OR1G1:NM_003555:exon1:c.A125G: SFLFMYLVTVAGSLLIILVIITDTQ p.N42S, 441 DNAH9:NM_001372:exon45: TLRKGYQIQDFKIDLASLCLKAGVK c.G8586A:p.M2862I, 442 WSB1:NM_134265:exon5:c.C463T: DILMEFGHLFPPSTPIFAGGANDRW p.P155S, 443 GPATCH8:NM_001304943:exon5: AQVHHIPQPHLTLISLSHLTHSIIP c.C4010T:p.P1337L, 444 ANKRD29:NM_173505:exon5:c.C415T: VVETLLKHGANIYDQLYDGATALFL p.H139Y, 445 LRG1:NM_052972:exon2:c.A808T: GLRQLDMLDLSNYSLASVPEGLWAS p.N270Y, 446 ZNF559-ZNF177:NM_001172650: PYECSDCGKAFISQSSLKKHMRSHT exon12:c.T404C:p.F135S, 447 ANKLE1:NM_001278445:exon8: CFQHVVAVEAYTLEACIVEALGIQT c.G1637T:p.R546L, 448 TMG6 ZNF585B:NM_152279:exon5:c.C764T: ECTDCGKAFTQKFTLKIHQKIHTGE p.S255F, 449 RYR1:NM_001042723:exon33: CQEPLTMMALHISEENRCMDILELS c.C4921T:p.P1641S, 450 RYR1:NM_001042723:exon33: CQEPLTMMALHILEENRCMDILELS c.C4922T:p.P1641L, 451 FCGBP:NM_003890:exon28: CMGGGDRDILCKDLASYVAACQAAG c.C12971A:p.A4324D, 452 C19orf47:NM_001256440:exon5: VPCSPSPLAGEICRGTSAASRMITN c.C397T:p.R133C, 453 FOXA3:NM_004497:exon2:c.C575T: RSPDKPGKGSYWVLHPSSGNMFENG p.A192V, 454 TULP2:NM_003323:exon8:c.C908T: TRDKHGVDKGLFLLYYLYLETSDSL p.P303L, 455 FPR2:NM_001005738:exon2:c.C866T: YKIIDILVNPTSFLAFFNSCLNPML p.S289F, 456 USP29:NM_020903:exon4:c.A875G: SQQLQQGFPNLGSTCYMNAVLQSLF p.N292S, 457 ZNF418:NM_001317029:exon5: GERPYECSECGKLFHRSSSLLRHQR c.C1964T:p.S655L, 458 TMG7 APOB:NM_000384:exon9:c.G1081A: NKLVTELRGLSDKAVTSLLPQLIEV p.E361K, 459 MAP4K3:NM_003618:exon9:c.A651T: IELAELQPPMFDFHPMRALFLMTKS p.L217F, 460 ANTXR1:NM_053034:exon3:c.A271G: RMSFIVFSTRGTALMKLTEDREQIR p.T91A, 461 STEAP3:NM_182915:exon5:c.G1121A: EEVWRMEIYLSLEVLALGTLSLLAV p.G374E, 462 GALNT13:NM_052917:exon5:c.C370T: DELPNTSVVIVFYNEAWSTLLRTVY p.H124Y, 463 UBR3:NM_172070:exon8:c.T1238C: FLLNMLPDQEYKAAFTKTFVQHYAF p.V413A, 464 TTN:NM_001267550:exon326: FEVTGLVEDHRYKFRVIARNAAGVF c.G75232A:p.E25078K, 465 DNAH7:NM_018897:exon49:c.C9271T: FYITTKLRNPHYFPETSVKVTLLNF p.L3091F, 466 HECW2:NM_020760:exon4:c.C464T: YHGISGALRATTLCITVKNPAVMMG p.P155L, 467 NPBWR2:NM_005286:exon1:c.C423A: FSSIYFLAVMSVERYLVVLATVRSR p.D141E, 468 TMG8 HSPA13:NM_006948:exon2:c.C80T: LLLAGYLAQQYLLLPTPKVIGIDLG p.P27L, 469 SLC5A3:NM_006933:exon2:c.T953G: KLLPMFIIVVPGRISRILFTDDIAC p.M318R, 470 DSCAM:NM_001271534:exon15: DSWDSAQRTKDVFPQLNSATIIDIH c.C2816T:p.S939F, 471 APOL3:NM_145642:exon4:c.C322T: RAIRQARARARLSVTTWRISAGSGG p.P108S, 472 SLC4A7:NM_001258379:exon16: CVILFFTTFFLSLFLKQFKTKRYFP c.C2162T:p.S721L, 473 GADL1:NM_207359:exon14:c.A1381G: LREMEEGPEFWAELNLVAPAIKERM p.K461E, 474 SCN5A:NM_001099405:exon23: TKVKVNFDNVGAEYLALLQVYEEQP c.G4223A:p.G1408E, 475 COL7A1:NM_000094:exon5:c.G638A: FSILRTLLPLVSQRVCTTAGGVPVT p.R213Q, 476 MON1A:NM_032355:exon4:c.C1388T: SSSSFREGEAWTLVCLPKFNAAGFF p.P463L, 477 SLC38A3:NM_006841:exon8:c.T553C: QTFLNLEEKTSDRYMNGNYLVILVS p.W185R, 478 TMG9 POLQ:NM_199420:exon29:c.C7615T: CPIRGGFFILQLYDELLYEVAEEDV p.H2539Y, 479 SAMD7:NM_182610:exon6:c.G484A: LHFHRSTLRNLQRNPMLAATAPHFE p.G162R, 480 SPATA16:NM_031955:exon2:c.C425T: IDEMGVRYEFVEFFMSTGSQPTCQA p.S142F, 481 ATP13A5:NM_198505:exon26: AFLYVKQQPWYCKVYQYSECFLANQ c.G3007A:p.E1003K, 482 MUC4:NM_138297:exon13:c.C1751T: GSPEEMLFHFGMIWQINGTGLLGKR p.T584I, 483 MUC4:NM_018406:exon2:c.G8482A: SLPVTDASSVFTSHATSLPVTIPSS p.G2828S, 484 GABRA4:NM_001204266:exon7: IQTYIPCIMTVIPSQVSFWINKESV c.T761C:p.L254P, 485 PDGFC:NM_016205:exon6:c.C968T: LRPKTGVRGLHKLLTDVALEHHEEC p.S323L, 486 SLC6A18:NM_182632:exon9: LDVGVLPRWVPKKALTGLVCLVCFL c.G1324A:p.E442K, 487 NIPBL:NM_133433:exon23:c.C4753T: VNKPEWPAAELLFSLLGRLLVHQFS p.L1585F, 488 TMG10 C6:NM_000065:exon7:c.G769A: TAEDDLKTDFYKNLTSLGHNENQQG p.D257N, 489 FAM170A:NM_182761:exon3:c.G805A: CHVFHLTMAQLTSNMESESTQDEQE p.G269S, 490 PCDHB8:NM_019120:exon1:c.C1709T: NSPFVLYPLQNGFAPCTELVPRAAE p.S570F, 491 NDST1:NM_001543:exon7:c.C1444T: RRGFIHNGIMVLSRQTCGLFTHTIF p.P482S, 492 SLIT3:NM_001271946:exon1:c.G44C: PGWAGVGAAVRAPLALALALASVLS p.R15P, 493 BMP5:NM_021073:exon1:c.G160A: RLRNHERREIQRKILSILGLPHRPR p.E54K, 494 SOGA3:NM_001012279:exon6: VRNIRCLTPTRSLYPAPGPWPKSFS c.T2392C:p.F798L, 495 SLC22A2:NM_003058:exon7: KHTMILMYNWFTNSVLYQGLIMHMG c.G1073A:p.S358N, 496 MAD1L1:NM_003550:exon19: DCLIFKATSPSGFKMQLLETEFSHT c.C2018T:p.S673F, 497 AMZ1:NM_001284355:exon2:c.C259T: EDFQTFHASLQHWKPRLARKHIYLQ p.R87W, 498 TMG11 COL1A2:NM_000089:exon44: HKGERGYPGNIGLVGAAGAPGPHGP c.C2867T:p.P956L, 499 PLXNA4:NM_020911:exon2:c.G721A: IPSDTFTIIPDFNIYYVYGFSSGNF p.D241N, 500 PTPRN2:NM_001308268:exon6: QYLLRAPSRMPRSLLAPAAPQKWPS c.C901T:p.P301S, 501 CLVS1:NM_173519:exon2:c.C83T: WNGDLAKMTHLQVGLSPETIEKARL p.A28V, 502 DNAJC5B:NM_033105:exon4:c.G250A: TDISKRSIYDKYRSLGLYVAEQFGD p.G84R, 503 ANGPT1:NM_001146:exon2:c.G336A: LENYIVENMKSEIAQIQQNAVQNHT p.M112I, 504 CYHR1:NM_001129888:exon2:c.G19A: MAPKPGTEWSTALSHLVLG p.A7T, 505 CNTNAP3:NM_033655:exon13: APSGHPRSAVSFVYAAGAGQLRSAV c.C1973T:p.A658V, 506 NTRK2:NM_001018065:exon14: SSSEGGPDAVIIRMTKIPVIENPQY c.G1558A:p.G520R, 507 OR13C3:NM_001001961:exon1: ILAVLKLACADIFLNIITMVISNMA c.C668T:p.S223F, 508 TMG12 OR13C8:NM_001004483:exon1: PKLQTVFFVLILLMYLMILLGNGVL c.G98T:p.W33L, 509 OR13C8:NM_001004483:exon1: VLILWMYLMILLENGVLISVIIFDS c.G122A:p.G41E, 510 SLC2A6:NM_001145099:exon6: IQDNVRRQSSRVLWAEARAPHVCRP c.C788T:p.S263L, 511 TLR8:NM_138636:exon2:c.A1422C: DPHSNFYHFTRPFIKPQCAAYGKAL p.L474F, 512 SMS:NM_004595:exon5:c.G476A: YQNIKILHSKQFENILILSGDVNLA p.G159E, 513 CASK:NM_001126054:exon18: NQTVEQLQKMLRKMRGSITFKIVPS c.G1672A:p.E558K, 514 CCNB3:NM_033031:exon5:c.G2092A: SGSLFQEALVLQKKTDAEEDSLKNL p.E698K, 515 FAM120C:NM_001300788:exon3: AAFHWSLLGPEHSLASLKVRAHQLV c.C1012T:p.P338S, 516 SRPX2:NM_014467:exon5:c.C401T: CHALPFITSGTYICTNGVLLDSRCD p.T134I, 517 XKRX:NM_212559:exon2:c.C593T: YQLYVSLISAEVLLGRVVLMVFSLV p.P198L, 518 TMG13 C1GALT1C1:NM_001011551:exon2: PFYLGHTIKSGDFEYVGMEGGIVLS c.C532T:p.L178F, 519 USP26:NM_031907:exon1:c.C208T: NVVLKSYRGNQNYLHLTLQNNNGLF p.H70Y, 520 MAGEC1:NM_005462:exon4:c.C3196T: LEYREVPNSSPPCYEFLWGPRAHSE p.R1066C, 521 JAKM1P2:NM_014790:exon21: IKDLEEKSNRKHE c.G2429A:p.G810E, 522 IRF6:NM_006147:exon9:c.C1397T: PTPSMQLPPALPLQ p.P466L,IRF6:NM_001206696: 523 exon7:c.C1112T:p.P371L, AFF2:NM_001169123:exon1:c.G4A: MNLFDFFRDWDLEQ p.D2N, 524 TMG14 IFNLR1:NM_173064:exon7:c.C1039T: GVDSGRPRAPLVSSEGSSAWDSSDR p.P347S, 525 HMGB4:NM_145205:exon2:c.G220A: ALAKLDKARYQEKMMNYVGKRKKRR p.E74K, 526 RIMS3:NM_014747:exon3:c.G101A: SISGEICGSQQAEGGAGTTTAKKRR p.G34E, 527 KANK4:NM_181712:exon3:c.C59T: DQSSQGDEEKDPLKSHPYSVETPYG p.P20L, 528 SLC44A5:NM_152697:exon4:c.G70A: SEEEDFGDPRTYNPDFKGPVANRSC p.D24N, 529 FCGR1A:NM_000566:exon6:c.G899A: PVWFHVLFYLAVEIMFLVNTVLWVT p.G300E, 530 SV2A:NM_014849:exon2:c.C305T: DDEIYEGEYQGILRAESGGKGERMA p.P102L, 531 TCHHL1:NM_001008536:exon3: FGEQEGNLATQSLPPKEATQRPCED c.C788T:p.S263L, 532 HRNR:NM_001009931:exon3: SSSSYGQHGSGSHQSLGHGQHGSGS c.G5792A:p.R1931H, 533 FLG:NM_002016:exon3:c.G10285A: QARDSSRHSASQKGQDTIRGHPGSS p.E3429K, 534 TMG15 UBAP2L:NM_001287816:exon7: GSGRRGGRFSAQEMGTFNPADYAEP c.G617A:p.G206E, 535 CD1E:NM_030893:exon3:c.C520T: GIRAQNICKVLNCYLDIKEILQSLL p.R174C, 536 OR10J5:NM_001004469:exon1: IIVTIICIDHHLYTPMYFFLSMLAS c.C166T:p.H56Y, 537 KCNT2:NM_001287819:exon9: IFTCICGIQHLEQIGKKLNLFDSLY c.G665A:p.R222Q, 538 CFHR2:NM_001312672:exon3: GDIVEFVCKSGYQPTKSHSFRAMCQ c.T366G:p.H122Q, 539 NFASC:NM_015090:exon6:c.C487T: PLTLQCNPPPGLSSPVIFWMSSSME p.P163S, 540 TUBB8:NM_177987:exon4:c.A1241G: GMDEMEFTEAESSMNDLVSEYQQYQ p.N414S, 541 FAM208B:NM_017782:exon12: GAEVLTAQFVQKIKLDRKNQEAPIS c.C1241T:p.T414I, 542 ARID5B:NM_001244638:exon7: EIQEGKDKLLEKKALPHSHMPSFLA c.G1721A:p.R574K, 543 SFTPA2:NM_001098668:exon3: GRDGVKGDPGPPSPMGPPGETPCPP c.G172A:p.G58S, 544 TMG16 SFTPA1:NM_001164644:exon3: GRDGLKGDPGPPSPMGPPGEMPCPP c.G172A:p.G58S, 545 DNMBP:NM_015221:exon5:c.C2401T: IEELLQTERDYIWDLEMCIERIMVP p.R801W, 546 TECTB:NM_058222:exon7:c.G682A: LQWQLINKGCPTNETVLVHENGRDH p.D228N, 547 MUC5B:NM_002458:exon31:c.G1229 TPGHTTATSRTTTTATPSKTRTSTL 2A:p.A4098T, 548 TRPM5:NM_014555:exon12:c.C1754T: EAKYERLALDLFFECYSNSEARAFA p.S585F, 549 OR51G1:NM_001005237:exon1: QGLEGLHGWISISFCFIYLTVILGN c.C94T:p.P32S, 550 TPP1:NM_000391:exon11:c.C1424T: SNRVPIPWVSGTLASTPVFGGILSL p.S475L, 551 OR10A4:NM_207186:exon1:c.T452G: HISCAQLAAASWCSGFSVATVQTTW p.F151C, 552 AMPD3:NM_001172431:exon10: VDDESKHSDHMFFDKSPNPDVWTSE c.C1136T:p.S379F, 553 LUZP2:NM_001009909:exon11: QDEGRPCSMKHKKSPPSNATAETEP c.G889A:p.E297K, 554 TMG17 BBOX1:NM_003986:exon6:c.G589T: DKIDANNVAYTTWKLSFHTDYPALH p.G197W, 555 KCNA4:NM_002233:exon2:c.C1148T: FFIVETVCIVWFFFEFVVRCFACPS p.S383F, 556 PAMR1:NM_001282675:exon8: CLAGYTGQRCENFLEERNCSDPGGP c.C700T:p.L234F, 557 NARS2:NM_024678:exon2:c.C164T: RIKIQGWIRSVRFQKEVLFLHVNDG p.S55F, 558 CRACR2A:NM_001144958:exon16: VDNSQVALQLWDMAGQERYRCITQQ c.C1802T:p.T601M, 559 CCND2:NM_001759:exon5:c.G803A: NSLQQYRQDQRDESKSEDELDQAST p.G268E, 560 TAS2R46:NM_176887:exon1:c.G198C: SRVGLLWVLVLNCYATELNPAFNSI p.W66C, 561 SLC38A4:NM_018018:exon17: GSMALIIIDWIYNPPNSKHH c.G1618A:p.D540N, 562 KMT2D:NM_003482:exon10:c.C2584T: HLSPELEKPPLSSRPEKPPEEPGQC p.P862S, 563 LUM:NM_002345:exon2:c.G743A: LSHNELADSGIPENSFNVSSLVELD p.G248E, 564 TMG18 CFAP54:NM_001306084:exon60: VSEAVLAINLLIEKKNTRMHKVNQV c.G8177A:p.G2726E, 565 HECTD4:NM_001109662:exon63: TPAKPIRVSDIYFSKEQINSQTPGN c.C10705T:p.L3569F, 566 PARP4:NM_006437:exon13: FSDSLSTSIKYSQPGETDGTRLLLI c.C1470A:p.H490Q, 567 FREM2:NM_207361:exon1:c.C3661T: DTPILNAADADVSLDDLTFTITQFP p.P1221S, 568 PROSER1:NM_025138:exon11: PVMIKTEPTSPTSSAFKGPSHSGNP c.C1846T:p.P616S, 569 SLITRK6:NM_032229:exon2: PATTTNTADTILQSLTDAVPLSVLI c.G1796A:p.R599Q, 570 METTL21C:NM_001010977:exon2: YASYTQEHYRFAEKEIVIQESIESY c.G224A:p.G75E, 571 CCDC168:NM_001146197:exon4: PLSPKRKDQKGRITIRDLKRELSTK c.T18244A:p.L6082I, 572 TRIM9:NM_015163:exon8:c.G1871A: DDKAWAMYVDNNQSWFMHNNSHTNR p.R624Q, 573 SMOC1:NM_001034852:exon8: EAQQNPREGIVISECAPGGLYKPVQ c.C745T:p.P249S, 574 TMG19 ZFYVE1:NM_021260:exon12: KDAARPAYWVPDYEILHCHNCRKEF c.C2146T:p.H716Y, 575 SLC12A1:NM_001184832:exon11: VAICVGACVVRDTTGNMNDTIISGM c.G1321A:p.A441T, 576 UNC13C:NM_001080534:exon1: STHNLSTEEDEAGKEFSLSPTFSYR c.A247G:p.S83G, 577 CGNLI:NM_001252335:exon3: PVLDGARSRRSSLSSTTPTSANSLY c.C893T:p.S298L, 578 ITGA11:NM_001004439:exon14: LNQDSYNDVVVGTPLEDNHAGAIYI c.G1690A:p.A564T, 579 GOLGA6C:NM_001164404:exon17: AQNPADEPTPGASAPQELGAAGEQD c.C1915T:p.P639S, 580 NTRK3:NM_001007156:exon10: LEEPELRLEHCIKFVVRGNPPPTLH c.G964A:p.E322K, 581 RGS11:NM_003834:exon16: LAEAGIPLEMKRHVFPFTWRPRHSS c.G1226A:p.R409H, 582 SMIM122:NM_001253793:exon4: PRRESPRKERPKEVDNLALEP c.G224A:p.G75E, 583 DCUN1D3:NM_173475:exon3: LSNYSEDEAWPSFFDTFVEWEMERR c.C805T:p.L269F, 584 TMG20 MVP:NM_005115:exon3:c.C244T: QGLVLFDVTGQVWLRHADLEIRLAQ p.R82W, 585 GPT2:NM_001142466:exon10: AKLTEDLFNQVPEIHCNPLQGAMYA c.G986A:p.G329E, 586 AMFR:NM_001144:exon11:c.C1507T: DNILEGRIQVPFSTQRSDSIRPALN p.P503S, 587 NFATC3:NM_004555:exon9:c.C2797T: GSATTASPAASHSLASSPLSGPPSP p.P933S, 588 ZNF469:NM_001127464:exon2: QLPASPSCRDPPCPQQLLACSPAWA c.G6397T:p.G2133C, 589 CDH15:NM_004933:exon5:c.G553A: RAEATDADDPETNNAALRFSILQQG p.D185N, 590 RPH3AL:NM_001190413:exon7: GETGTGSADPPGRPRPGLTRRAPVK c.G757A:p.G253R, 591 ZNF594:NM_032530:exon2:c.G451A: SSNLIIHQRIHTRNKPYVCNECGKD p.G151R, 592 NEURL4:NM_001005408:exon1: GGPGPGPGGGGGLSGSGSGPGSNGG c.C74T:p.P25L, 593 NEURL4:NM_001005408:exon1: GGPGPGPGGGGGSSGSGSGPGSNGG c.C73T:p.P25S, 594 TMG21 MYH2:NM_017534:exon12:c.G1039A: TDSAIDILGFTNKEKVSIYKLTGAV p.E347K, 595 MYOCD:NM_153604:exon10:c.G1661A: VEELRMQLQKQKKNNCSEKKPLPFL p.R554K, 596 PLD6:NM_178836:exon2:c.G548A: SLNWTTQAIQNNKENVLITEDDEYV p.R183K, 597 MRC2:NM_006039:exon6:c.C1114T: KKKPNATAEPTPSDRWANVKVECEP p.P372S, 598 ACE:NM_000789:exon16:c.G2383A: LLWAWEGWRDKARRAILQFYPKYVE p.G795R, 599 CD300LF:NM_001289083:exon3: VQVTIDPAPVTQAETSSSPTLTGHH c.A398C:p.E133A, 600 RNF213:NM_001256071:exon29: HPLLEDGCIEDDLAPHKKVGFVGIS c.C8618T:p.P2873L, 601 DSG4:NM_001134453:exon4:c.G322A: PYGVFTINPRTGKINITSVVDREIT p.E108K, 602 DSG4:NM_001134453:exon8:c.G901A: RLQAIDLDEEGTNNWLAQYLILSGN p.D301N, 603 RNF165:NM_152470:exon2:c.C286T: PLPTLQFQDVTGSSFLPQALHQQYL p.P96S, 604 TMG22 PLIN4:NM_001080400:exon3: ASVVDVAKGVVQRGLDTTRSALTGT c.G325A:p.G109R, 605 RFX2:NM_000635:exon8:c.C811T: TRGNSKYHYYGICLKPDSPLNRLQE p.R271C, 606 DENND1C:NM_001290331:exon20: LSMGAKSAGSLRSSQSLDCCHRGDL c.C1603T:p.P535S, 607 MUC16:NM_024690:exon5:c.T33443A: EASSAVLTVSPEEPGMVTSLVTSSR p.V11148E, 608 C19orf43:NM_024038:exon3: YKAHQCGDDDKTWPLVK c.C514T:p.R172W, 609 KMT2B:NM_014727:exon3:c.C1919T: PPAPSPPPAPATFSRRPLLLRAPQF p.S640F, 610 ZNF585A:NM_199126:exon6:c.C625T: TQKSTLKMHQKIYTGERSYICIECG p.H209Y, 611 SIPA1L3:NM_015073:exon3: GECERNVSFSRAFVGSPSSGEGHLA c.C1343T:p.5448F, 612 HNRNPL:NM_001533:exon8:c.C1004T: GYHSHYHDEGYGLPPPHYEGRRMGP p.P335L, 613 ZNF574:NM_022752:exon2:c.A74G: EHRYVCSECNQLCGSLEEVLMHQNS p.Y25C, 614 TMG23 LILRA4:NM_012276:exon6:c.G1172A: EFPMSPVTSAHAETYRCYGSRSSNP p.G391E, 615 NLRP11:NM_145007:exon8:c.C2336T: CDALLHPNCTLILLVLVFCCLTENC p.5779L, 616 NLRP5:NM_153447:exon7:c.G866A: FRPRTVVLHGKSEIGKSALARRIVL p.G289E, 617 ZNF256:NM_005773:exon3:c.G1383T: DLIVHERVHTGESPYECSECGKSFT p.R461S, 618 GCKR:NM_001486:exon12:c.C1055T: IIAIMDGVECIHIFGADFRDVRGFL p.T352I, 619 BIRC6:NM_016252:exon68: ATTSLRQANQEKILGEYSKKAAMKP c.A13508T:p.K4503I, 620 DYSF:NM_001130979:exon28: DAGHLSFVEEVFKNQTRLPGGQWIY c.G2956A:p.E986K, 621 DNAH6:NM_001370:exon51:c.C8413T: ADISEIRVFTKPSDLVMTVMEAISI p.P2805S, 622 MARCO:NM_006770:exon5:c.G533A: GAPGPPGPPAEKEAKGAMGRDGATG p.G178E, 623 BAZ2B:NM_001289975:exon28: LTSNVASSKSESSVPQNEKATSAQP c.C4933T:p.P1645S, 624 TMG24 SCN2A:NM_001040142:exon17: ESDFENLNTEEFNSESDMEESKEKL c.G3368A:p.S1123N, 625 SCN1A:NM_001165963:exon24: KLGSKKPQKPIPQPGNKFQGMVFDF c.G4574A:p.R1525Q, 626 TTN:NM_001267550:exon28: ELKSRKKDESYEKLLRKTKDELLHW c.G6085A:p.E2029K, 627 CERKL:NM_001030311:exon8: FSAMFGFGGRTLVLAEKYRWMSPNQ c.C1073T:p.A358V, 628 DNAH7:NM_018897:exon5:c.A277G: QSHAEYMERFGKEGKLPHQVDDSYV p.K93E, 629 SPAG16:NM_024532:exon7:c.C734T: KHHTLLKEKMLTFLERDKVVGQISG p.S245F, 630 PRKAG3:NM_017431:exon4:c.C329T: PLAQADPAGVGTLPTGWDCLPSDCT p.P110L, 631 DES:NM_001927:exon6:c.G1232A: DVEIATYRKLLEEEESRINLPIQTY p.G411E, 632 COL4A3:NM_000091:exon28: IPGSLGKCGDPGFPGPDGEPGIPGI c.C2059T:p.L687F, 633 SPHKAP:NM_030623:exon7:c.C2486T: SHRVPDSSTATTFSKEIYLKGIAGE p.S829F, 634 TMG25 FAM209B:NM_001013646:exon2: SLYKDCVFNTLNKLEVELLKFVSEV c.G352A:p.E118K, 635 KCNJ15:NM_001276437:exon4: QLSGKLLQTHVTQEGERILLNQATV c.A667C:p.K223Q, 636 UMODL1:NM_001199527:exon11: VNPSQGSPSQGSFRQESTSQASPSQ c.C1771T:p.L591F, 637 SEZ6L:NM_001184773:exon4: YFRTFQDDGLGTVQLHYQAFMLSCN c.T1144G:p.F382V, 638 MKL1:NM_001282662:exon15: GEAIPEDSLWVPTGSTAITFC c.C2368A:p.P790T, 639 DLEC1:NM_007337:exon3:c.C658T: SPEDYYTDTVPFYSAPKGISLPGCS p.H220Y, 640 SEMA3F:NM_004186:exon8:c.G646A: DPKLDTASALINKELYAGVYIDFMG p.E216K, 641 STAB1:NM_015136:exon47:c.G4819A: ASFFSLRLLEYKKLKGDGPFTIFVP p.E1607K, 642 LRTM1:NM020678:exon3:c.G1016A: QTNDPGKVEEKEQFDSSPA p.R339Q, 643 LRTM1:NM_020678:exon3:c.G1006A: PLAQTNDPGKVEKKERFDSSPA p.E336K, 644 TMG26 LRTM1:NM_020678:exon2:c.G158A: LAEIPSHLPPQTQTLHLQDNQIHHL p.R53Q, 645 ERC2:NM_015576:exon2:c.G89A: SPRLPRSPRLGHQRTSSGGGGGTGK p.R30Q, 646 COL8A1:NM_020351:exon4:c.G440A: PGLPGHGIPGIKEKPGPQGYPGVGK p.G147E, 647 USF3:NM_001009899:exon7: NSQGSIEATMERSLEKPSCSLGIKT c.C3625T:p.P1209S, 648 TIMMDC1:NM_016589:exon2:c.A338T: KQQYIEQSQAEIFHNRFDAVQSAHR p.Y113F, 649 CCDC14:NM_001308317:exon12: DWSISSFSTFTSHDEQDFRNGLAAL c.G2168A:p.R723H, 650 ACAD9:NM_014049:exon11:c.C1117T: YLTAGMLDQPGFSDCSIEAAMVKVF p.P373S, 651 FNDC3B:NM_001135095:exon5: YISQVIEDSTGVCRVVVTPQSPECY c.C289T:p.R97C, 652 BCL6:NM_001706:exon5:c.A496C: IMAYRGREVVENHLPLRSAPGCESR p.N166H, 653 UBE2K:NM_001312646:exon3: LEIKIPETYPFNSPKYKQNPEMFKQ c.C208T:p.P70S, 654 TMG27 BEND4:NM_001159547:exon3: GHPSSSTLPEEEKEEDEEGYCPRCQ c.G943A:p.E315K, 655 GABRB1:NM_000812:exon9:c.G1129A: GNILLSTLEIRNKTSGSEVLTSVSD p.E377K, 656 NOA1:NM_032313:exon6:c.C1807T: MGGKERMAGFPPFVAEDIMLKEGLG p.L603F, 657 ADGRL3:NM_015236:exon7:c.C1369T: RTTTLSPGRSTTSSVSGRRNRSTST p.P457S, 658 ANKRD17:NM_198889:exon27: KKVSVPSTVISRGIGRGGCNINAIR c.T4466G:p.V1489G, 659 FRAS1:NM_025074:exon63:c.C9566T: HVEEVTKEGVKKFPSPGYPLVCVTP p.S3189F, 660 ADH1A:NM_000667:exon7:c.C890T: EACGTSVIVGVPLDSQNLSMNPMLL p.P297L, 661 BANK1:NM_001127507:exon8: ADGAEANEMEGEEKQNGSGMETKHS c.G977A:p.G326E, 662 ENPEP:NM_001977:exon16:c.C2237T: NDAGDHVTKLLRFSVLGFACKMGDR p.S746F, 663 KIAA1109:NM_015312:exon71: SGLGSPLGRSRHTSSQSDLTSSSSS c.G12395C:p.S4132T, 664 TMG28 INTU:NM_015693:exon12:c.A2110G: CRRTLFGDYSLKARKPSPSCSSGGS p.T704A, 665 LARP1B:NM_018078:exon10: MSRGLSTSLPDLGSEPWIEVKKRHQ c.A1103G:p.D368G, 666 NPY5R:NM_006174:exon4:c.T998A: SQLSSSSKFIPGDPTCFEIKPEENS p.V333D, 667 GLRA3:NM_001042543:exon6: LPQFLLKEEKDLQYCTKHYNTGKFT c.G686A:p.R229Q, 668 TENM3:NM_001080477:exon27: IHDVKDYITDVNNWLVTFGFHLHNA c.G7223A:p.S2408N, 669 STOX2:NM_020225:exon3:c.G1091A: HHSGRSKKSRTHQKSHGKSRSHSKT p.R364Q, 670 PLEKHG4B:NM_052909:exon3: QSGVVTLPGTRDGHGRAVVQVRTRS c.C631G:p.R211G, 671 DNAH5:NM_001369:exon73: FGALGWNIPYEFSQADFNATVQFIQ c.A12638G:p.N4213S, 672 DNAH5:NM_001369:exon14:c.G1837A: DMISKLYTKQKYNPPLARNQPPIAG p.D613N, 673 DNAH5:NM_001369:exon4:c.G364T: KPKVFVTEGNDVSLTGVCVFFIRTD p.A122S, 674 TMG29 PRLR:NM_000949:exon10:c.A1297T: PSQHNPRSSYHNFTDVCELAVGPAG p.I433F, 675 RICTOR:NM_001285439:exon1: AAIGRGRSLKNLQVRGRNDSGEENV c.G41A:p.R14Q, 676 MROH2B:NM_173489:exon30: FIPNEEILMFLEKMLDGLESLNPTC c.G3031A:p.E1011K, 677 CMYA5:NM_153610:exon2:c.C9670T: SEGDSVNSEASFSSRNSDTDDGTGI p.P3224S, 678 VCAN:NM_001164098:exon7: PFPSQHRTEIELLPYSGDKILVEGI c.T1966C:p.F656L, 679 VCAN:NM_001164098:exon7: LSTPQPPASTKFYPDINVYIIEVRE c.C3955T:p.H1319Y, 680 VCAN:NM_001164097:exon7: SPHVETTFSTEPIGLVLSTVMDRVV c.C2687T:p.T896I, 681 ADGRV1:NM_032119:exon20: FTGLEGAFGTVNSKYHPSRNNTIAN c.C4024T:p.P1342S, 682 NR2F1:NM_005654:exon2:c.G476A: LKVGMRREAVQRERMPPTQPNPGQY p.G159E, 683 LIX1:NM_153234:exon4:c.C412T: ASTSGTLDDADDSSTSVGAYHYMLE Pp.138S, 684 TMG30 EPB41L4A:NM_022140:exon6: TAAQLGAYAIQSKLGDYDPYKHTAG c.G436A:p.E146K, 685 KDM3B:NM_016604:exon14: DIRSEEPLKTDSWASNSNSELKAIR c.C3590G:p.S1197W, 686 GRXCR2:NM_001080516:exon1: PEKKLNQKSDGKLRKVRFKISSSYS c.C47T:p.P16L, 687 LARS:NM_020117:exon29:c.A3033C: NLEKMGPRILDLHLEFDEKAVLMEN p.Q1011H, 688 GRIA1:NM_000827:exon11:c.G1496T: DVAVAPLTITLVLEEVIDFSKPFMS p.R499L, 689 FAXDC2:NM_032385:exon9:c.C985T: GFTPLSESIPDSSKRME p.P329S, 690 THVID4:NM_001146726:exon2: SSWSHNSNSMCWEKDQCPYSGCKEA c.G164A:p.G55E, 691 NIPAL4:NM_001099287:exon1: SRPPAPELGSPGTVRPRVGSCAPGP c.G148A:p.A50T, 692 GABRG2:NM_198904:exon6:c.C701T: WKRSSVEVGDTRFWRLYQFSFVGLR p.S234F, 693 STC2:NM_003714:exon4:c.G871A: LGAQGPSGSSEWKDEQSEYSDIRR p.E291K, 694 TMG31 FGFR4:NM_213647:exon13: LRARRPPGPDLSSDGPRSSEGPLSF c.C1720T.p.P574S, 695 LMAN2:NM_006816:exon2:c.T248G: SMPLWDFQGSTMRTSQYVRLTPDER p.L83R, 696 MAPK9:NM_002752:exon8:c.G834C: YPGIKFEELFPDCIFPSESERDKIK p.W278C, 697 DEK:NM_001134709:exon2:c.G123C: REESEEEEDEDDDEEEEEEKGKGQK p.E41D, 698 LRRC16A:NM_017640:exon33: LIKSRSKSERPPKILMTEEPSSPKG c.C3254A:p.T1085K, 699 RNF8:NM_183078:exon3:c.C664T: LEPSKTTGAPIYSGFPKVTEVHHEQ p.P222S, 700 RNF8:NM_183078:exon3:c.C665T: LEPSKTTGAPIYLGFPKVTEVHHEQ p.P222L, 701 KCNK16:NM_001135106:exon5: SDPSGLPRPQKISISA c.C871T:p.P291S, 702 KLHDC3:NM_057161:exon2:c.C37T: MLRWTVHLEGGPCRVNHAAVAVGHR p.R13C, 703 HSP90AB1:NM_001271969:exon10: LCKLMKEILDKKAEKVTISNRLVSS c.T1724C:p.V575A, 704 TMG32 OOEP:NM_001080507:exon1:c.C50T: AGAAESQRGKQTLAHSLEQLRRLPL p.P17L, 705 IMPG1:NM_001282368:exon2: CQQETFCLFDIGNNFSNSQEHLDLL c.A192T:p.K64N, 706 MANEA:NM_024641:exon5:c.G1253A: HEGTQIEKAVPKKTSNTVYLDYRPH p.R418K, 707 AK9:NM_001145128:exon23: TVVLPEFPEDSYSDVPEMEPFKEKI c.C2467T:p.P823S, 708 BCLAF1:NM_014739:exon5:c.G1111A: EASKEKGSEKGRTEGEWEDQEALDY p.A371T, 709 ADCYAP1R1:NM_001118:exon7: FGDSNSLDLSDMRVVSRNCTEDGWS c.G334A:p.G112R, 710 PDE1C:NM_001191058:exon16: EKARLAAEEQQKKMEAKSQAEEGAS c.G1825A:p.E609K, 711 ANK1B1:NM_019004:exon15: LLKTRRILKCSYLYGFFLEPKSTKK c.C1994T:p.P665L, 712 BRAF:NM_004333:exon15:c.T1799A: LTVKIGDFGLATEKSRWSGSHQFEQ p.V600E, 713 ZNF398:NM_170686:exon6:c.C1277T: TYHLRVHNSTERLFPCPDCPKRFAD p.P426L, 714 TMG33 ABCB8:NM_001282293:exon15: KGGLYAELIRRQTLDAPRTAAPPPK c.G1816A:p.A606T, 715 CSMD1:NM_033225:exon66: REVNETVTKTPVSSDVFFVNSLWKG c.C10042T:p.P3348S, 716 FGF20:NM_019851:exon3:c.C601T: HFLPRPVDPERVSELYKDLLMYT p.P201S, 717 C8orf34:NM_052958:exon9: TLNICSRCARLQEDNLEERTEESLP c.G1250A:p.G417E, 718 ZFHX4:NM_024721:exon10:c.A7093G: MDATDQVVYKHCAVSGQTDAAKNAA p.T2365A, 719 ZC2HC1A:NM_016010:exon8:c.T741G: GANVKPRNSTPPRLARNPAPGVLTN p.S247R, 720 ATP6V0D2:NM_152565:exon2: ANHTNPLTVSKIYTEMRKRLCGEFE c.G211T:p.D71Y, 721 MMP16:NM_005941:exon10:c.C1564T: FNNQILKVEPGYSRSILKDFMGCDG p.P522S, 722 PKHD1L1:NM_177531:exon16: VQVITLENWETTDAINEVQKIKVTS c.A1564G:p.N522D, 723 GML:NM_002066:exon3:c.C181T: PYHIRRCMTISICINSRELLVYKNC p.R61C, 724 TMG34 CEP78:NM_032171:exon11:c.T1307C: VTVTVESPSSSEAEEVDDSSESVHE p.V436A, 725 NAA35:NM_024635:exon17:c.C1409T: ATLQDEAEKVDAVLHTMLLKQEPQR p.A470V, 726 GRIN3A:NM_133445:exon1:c.C329T: LGSTLHGRGPPGFRKPGEGARAEAL p.S110F, 727 FKTN:NM_006731:exon10:c.G1339A: DWKRSPPNVQPNRIWPISEWDEVIQ p.G447R, 728 OR1J4:NM_001004452:exon1: IFFTDLDNFLLTTMAYDRYVAICHP c.T349A:p.S117T, 729 IER5L:NM_203434:exon1:c.G949A: YPGQEEEEDDEENAGGLGAEPPGGA p.D317N, 730 VCX3B:NM_001001888:exon3: SQESQVEEPLSQKSEMEEPLSQESE c.G679A:p.E227K, 731 ARHGAP6:NM_006125:exon4:c.C995T: NKRQNKELSSSNLSLSSTSETPNES p.S332L, 732 FRMPD4:NM_014728:exon2:c.C68T: KLSSHRTKSSGWLPPSGTWGLSQVP p.P23L, 733 BMX:NM_001721:exon15:c.G1567A: SQLLEMCYDVCESMAFLESHQFIHR p.G523S, 734 TMG35 PHEX:NM_000444:exon4:c.C386T: ELLEKSISRRRDIEAIQKAKILYSS p.T129I, 735 DMD:NM_004009:exon10:c.G1000A: EDKSFGSSLMESKVNLDRYQTALEE p.E334K, 736 FAM47C:NM_001013736:exon1: CPEPPETRVSPLCQLPPEAGVSHLC c.C1099T:p.R367C, 737 SHROOM4:NM_020717:exon4: PKECSRMGGRRSRGTRGRSIQNRRK c.G1714A:p.G572R, 738 IQSEC2:NM_001111125:exon7: LDDALRKFQSHIQVQGEAQKVERLI c.G2534A:p.R845Q, 739 PCDH11X:NM_032968:exon1:c.C277T: TGEIFTTGARIDCEKLCAGIPRDEH p.R93C, 740 PCDH19:NM_001105243:exon4: TSVTSMGSQMPDYDQNEGFHCREEC c.C2704T:p.H902Y, 741 TENM1:NM_014253:exon10:c.C1774T: CRHGWKGPECDVSEEQCIDPTCFGH p.P592S, 742 ZNF75D:NM_001185063:exon4: MKIAQKTMGRENSGDTHSVQKWHRA c.C673T:p.P225S, 743 MAGEC1:NM_005462:exon4:c.G367A: PLEISQSPPEGENVQSPLQNPASSF p.D123N, 744 IDS:NM_006123:exon6:c.G820A: VAYNPWMD1RQRKDVQALNISVPYG p.E274K, 745 ABCD1:NM_000033:exon1:c.C22G: MPVLSRPGPWRGNTLKRTAV p.R8G, 746
[0189] To identify MHC-I neoantigens, MHC-I expressing Cos7 cells were transfected (Cos7-A1, Cos7-B8 and Cos7-C7) cells with 35 TMGs, respectively, followed by adding CD8.sup.+ T cells isolated from 136 TILs. ELISA analysis revealed that one TMG (i.e. TMG2) was positive for activating T cell response in Cos7-A1, but not in Cos7-B8 or Cos7-C7 cells (
[0190] As shown in 135 mel, ADIPOR2 and LAGE1b were identified as tumor antigens recognized by T cell clones that were in relatively low abundance in the bulk TIL population. Thus, T cell clones generated from 136TILs were tested for their ability to recognize the newly identified as well as unknown neoantigens. 293IMDR1/DP4 and 293IMDR11/DP4 cells were transfected with 35 TMGs, respectively, followed by co-culturing with T cells clones for overnight. It was found that CD4.sup.+136-C13 and 136-C22 T cell clones were capable of recognizing TMG13 and TMG8 presented by 293IMDR1/DP4 cells (
(4) Discovery of Immunodominance for Neoantigen Recognition Patterns by Tumor-Reactive T Cells
[0191] Using systemic and multiple approaches, 8 cancer neoantigens were identified in 135 mel and 7 neoantigens in 136 mel. To understand the potential mechanisms that restrain the number of neoantigens that could induce T cell response, it was asked whether these neoantigens were equally recognized by T cells. To test this possibility, 78 CD4.sup.+ tumor-reactive T-cell clones established from 135TILs were tested for their ability to recognize neoantigens, and unexpectedly found that 69 of 78 T cell clones (88%) were capable of recognizing the PCDHB16 neoantigen in 293IMDR4/DP4 cells, while 3 (3.8%) recognized SPATA13, 3 (3.8%) recognized PCDHB7, and 1 (1.3%) recognized ATG5 (
[0192] To exclude the possibility that T cell growth selection during T cell cloning may contribute to the observed immunodominance, primary neoantigen-specific T cell population (without long-time culture and expansion) were freshly generated and performed intracellular cytokine staining after specific neoantigen peptide stimulation, and found that tumor-reactive CD4.sup.+ T cell populations accounted for 32.7% in the total T cell population when stimulated with 135 mel tumor cells. Importantly, PCDHB16 neoantigen-specific CD4.sup.+ T cell population accounted for 30.2% in the total T cell population (
[0193] Interestingly a similar recognition pattern was observed in the second patient-derived T cells (136TIL) for their neoantigen recognition. Intracellular staining of the freshly generated 136TIL population after stimulation with fibroblasts loaded with neoantigen peptide pools or individual one showed that tumor-reactive of CD4.sup.+ or CD8.sup.+ T cells accounted for 24.8% and 14.6% in the total T cell population, respectively (
[0194] Although T cell clones and intracellular staining analysis demonstrate immunodominant neoantigen recognition patterns by T cells, it is not clear whether immunodominant neoantigens are recognized by T cells with the same or different TCR α and β chains. To address this issue, PCDHB16 neoantigen-specific CD4.sup.+ T cells were isolated by FACS sorting of PCDHB16-neoantigen T cells from freshly generated bulk T cells after stimulated PCDHB16 neoepitope peptide-pulsed 135EBV-B cells. Using the similar approach, MPG-specific CD8+ T cells were also purified from freshly generated bulk T cells after stimulated MPG neoepitope peptide-pulsed 135EBV-B cells. Genomic DNAs were isolated from the two purified neoantigen-specific T cell populations, as well as the parental bulk T cells, and used as templates for TCR β-chain CDR3 repertoire amplification. Since there is only one copy of genomic DNA in a single T cell, specific TCR β-chain sequence abundance can represent the frequency of neoantigen-specific T cells in the total T cell population. TCRβ CDR3 were then amplified with a set of primers targeting mature V-D-J rearranged T cells. After the amplicons (from PCDHB16-, MPG-specific T cells and bulk 135TIL) were subjected to deep-sequenced via next-generation sequencing (reads>100,000 per sample), followed by bioinformatic analysis and TCR repertoire alignment. Dysfunctional TCR sequences and redundant sequences in all three groups were removed. PCDHB16 and MPG neoantigen-specific TCR sequences (CDR3β VDJ) with top frequencies (>0.001%) were identified in the sequencing result and used to determine the relative frequencies of neoantigen-specific TCRs in the total T cell population (
(5) Differential Recognition of Tumor Cells In Vitro and In Vivo by Neoantigen-Specific T Cells
[0195] To further understand the cause and impact of immunodominant neoantigens on immunotherapy, it was hypothesized that immunodominant neoantigens may be highly expressed and presented by all tumor cells, while cryptic (subdominant) neoantigens are expressed and presented by some, but not all tumor cells. To test this possibility, single tumor-cell clones of 135 mel were generated by limited dilution (0.3 cells/well) in 96-well plates. Each clone was expanded from one single 135 mel cell and then tested for their recognition by different neoantigen-specific T-cell clones. As expected, the PCDHB16-specific T-cell clone 1H.sub.1 recognized every single tumor cell clone that was generated, but other subdominant neoantigen-specific T-cell clones (such as 2C10 specific for PCDHB7 and JF6 specific for SPATA13) recognized only a few of single tumor cell clones (
[0196] To further demonstrate in vivo recognition of single tumor cell clones by different neoantigen-specific T cells, 135 mel cells were injected in NSG mice at day 0 and followed by treatment with adoptive transfer of different neoantigen-specific T cell clones (1H.sub.1 specific for PCDHB16 and 2C10 specific for PCDHB7) on day 15. Exogenous interleukin-2 was given on day 15 for three consecutive days to maintain T cell proliferation and survival in vivo (
(6) Dual Recognition of Two Neoantigens by a Single TCR
[0197] Because of large TCR sequence diversity for the same neoantigen, it was noticed that a few tumor-reactive T-cell clones for the dominant neoantigen PCGHB16 had cross-reactivity against other neoantigens (ADIPOR2) (
[0198] The CD4.sup.+ T-cell clone 135-C76 was analyzed for dual recognition of the PCDHB16 neoantigen presented by HLA-DP4 molecules and RPN2 neoantigens presented by HLA-A1 molecules (
(7) Knockdown of ThPOK in CD4+ 135TIL and Other CD4+ T Cells Increases their Cytolytic Activity In Vitro and In Vivo
[0199] CD8.sup.+ and CD4.sup.+ T cells are the major components of T cell-based antitumor immunity. CD8.sup.+ T cells, also known as cytotoxic T lymphocytes (CTL), are capable of specific antigen recognition presented by MHC-I molecules and have high cytotoxicity to kill target cells. CD4.sup.+ T cells also show specific antigen recognition presented by MHC-II molecules but much lower killing ability than CD8.sup.+ T cells. Although the data of T-cell clones of 135TIL show rather high cytolytic activity when targeting PCDHB16 neoantigen, in most cases CD4.sup.+ antigen-specific T cells have rare killing ability when they recognize target cells. Thus it will be a new challenge to reprogram CD4.sup.+ T cells and improve their cytolytic activity. The transcription factor T-helper-inducing POZ/Krueppel-like factor (ThPOK) and is a master regulator of CD4.sup.+ T cell development and negatively regulates the cytolytic ability of CD4.sup.+ T cells. After immigration from the thymus, naïve CD4.sup.+ T cells maintain the expression of ThPOK, indicating that ThPOK can still play important roles for the maintenance of CD4.sup.+ T cells phenotype and function.
[0200] To address this problem, single T cell clones were generated from 135 mel cancer cell-reactive CD4.sup.+ TIL via limited dilution. Cytotoxicity analysis of those T-cell clones were performed with expanded T-cell clones, showing different cytotoxic ability in each T cell clone (
[0201] To abolish the expression of ThPOK in antigen-specific CD4.sup.+ human T cells, lentiviral shRNA plasmid from the human shRNA library was applied to degrade ThPOK mRNA generation. Western blotting analysis confirmed ThPOK was effectively knockdown 72 h after lentiviral transduction in CD4.sup.+135TILs (
[0202] Based on the results in vitro, the 135 mel mouse model was applied to test knockdown of ThPOK in vivo. 135 mel tumor cells were inoculated into the NSG mice. CD4.sup.+135TIL modified by ThPOK shRNA lentiviruses were adoptively transferred on the 5.sup.th day after the tumor inoculation, together with unmodified CD4+135TIL and non-specific CD4.sup.+ T cells. From day 5 to day25, 135 mel growth in each group was observed. Tumor growth treated by ThPOK-knockdown CD4.sup.+135TIL was inhibited significantly, leaving the smallest tumor size, compared to unmodified CD4.sup.+135TIL and non-specific CD4+ T cells (
(8) ThPOK Inhibits CD4+ Tumor Specific T Cell Cytolytic Activity by Interacting with LSD1
[0203] It was hypothesized that the improvement killing ability of CD4.sup.+ T cells due to ThPOK knockdown is procedure of T-cell reprogramming via epigenetic regulation. To distinguish how ThPOK reprogrammed CD4.sup.+ T cells to cytolytic CD4.sup.+ T cells, a screening of A2ESO TCR-transduced CD4.sup.+ T cells treated by various epigenetic inhibitors was performed. After measuring the change of cytotoxicity of CD4.sup.+ TCR-T cells against tumor, it was found that 2-PCPA, the inhibitor of Lysine-specific histone demethylase 1A (LSD1) was the only one inhibitor which dramatically enhanced the cytolytic activity of CD4.sup.+ TCR-T cells, that was similar as depletion of ThPOK (FIG. 18A). To further investigate the correlation between ThPOK and LSD1, immunoprecipitation screening was performed by overexpression tagged proteins in HEK293T host cells with various epigenetic regulators and ThPOK. Suprisingly, LSD1 showed the strongest binding with ThPOK among all epigenetic regulators (
(9) Inhibition of LSD1 by 2-PCPA Positively Regulates of the Generation of Stem Cell-Like Memory T Cells
[0204] To identify key regulators of the generation and maintenance of stem cell-like memory T cells (Tscm), an in vitro small molecules screening platform was established that works directly on ex vivo primary human T cells. Briefly, T cells were activated with plate-bound OKT3 and treated them with small molecules including different metabolites and inhibitors. Half volume of culture medium was changed, and metabolites or inhibitors were added every 2 or 3 days. The memory phenotypes of both CD4.sup.+ and CD8.sup.+ T cell populations were detected at day 12 through staining with key memory markers CD45RO, CCR7 and CD62L. Tscm subset in this platform was identified as CD45RO.sup.−, CCR7.sup.+ and CD62L.sup.+ population by flow cytometry. In activated T cells that don't contain naïve T cells, these three markers were able to identify three memory T cells subsets, Tscm, Tcm and Tem (Lugli et al., 2013). Small molecules that were extremely toxic and dramatically suppressed T cells growth were excluded from this platform. With this method of T cells culture, T cells differentiated significantly and very limited percentage of Tscm cells in both CD4.sup.+ and CD8.sup.+ T cells can be obtained. 2-PCPA, an inhibitor of LSD1, was identified as the only one positive regulator of Tscm generation that increased the population of Tscm in both CD4.sup.+ and CD8.sup.+ T cells (
(10) Inhibition of LSD1 by 2-PCPA in TCR-T and CAR-T Cells Increases T-Cell Antitumor Activity In Vitro and In Vivo
[0205] To test how the inhibition of LSD1 by 2-PCPA treatment affects TCR-T cells, 2-PCPA (8 uM) was directly administered at the beginning of TCR-T cell generation. Activated human T cells were transduced on day 2 and day 3 with A2ESO TCR viruses that were packaged by PG13-A2ESO TCR clone. Results showed that 2-PCPA treatment increased both Tscm and Tcm percentages in A2ESO TCR-T cells, indicating a less-differentiated phenotype (
[0206] The results of 2-PCPA-treated A2ESO TCR-T cells in vitro showed less-differentiated memory phenotype and increased function and proliferation, an attempt was made to test to in vivo effect of 2-PCPA-treated A2ESO TCR-T cells. A breast cancer model was established using MDA-MB-231 cells expressing HLA-A2 and NY-ESO-1 antigen in 8 weeks old NOD-scid IL2Rγnull (NSG) mice. The in vivo anti-tumor function of A2ESO TCR-T cells was significantly enhanced by 2-PCPA treatment (
[0207] To determine whether 2-PCPA treatment was also effective on CAR-T cells as well as TCR-T cells, 2-PCPA treatment was applied when generating CAR-T cells. Activated T cells were transduced with CD19-specific chimeric antigen receptor (CAR) possessing CD28 co-stimulatory domain virus packaged by PG13 clone. Memory phenotype analysis showed that 2-PCPA treatment similarly increased the frequencies of less-differentiated memory T cells subsets (
[0208] b) DISCUSSION
[0209] The findings presented here identify a distinct pattern of T-cell reactivity against mutation-derived neoantigen; that is, a dominant neoantigen was recognized by the majority of tumor-reactive CD4.sup.+ or CD8.sup.+ T cells, while several other neoantigens were infrequently recognized by T cells. Interestingly, in the case of patient #135 it was shown that large majority of the tumor-reactive CD4.sup.+ or CD8.sup.+ TIL population/T-cell clones recognized neoantigens derived from somatic missense mutations respectively which was demonstrated via FACS and TCR sequencing analysis. Importantly, dominant neoantigen-specific T cells show better recognition and killing ability of single tumor cell clones, compared with subdominant/cryptic neoantigen-specific T-cell clones, implying that dominant neoantigen-specific T cells can play a critical role in tumor regression in checkpoint immunotherapy. Although two tumor-reactive CD4.sup.+ T-cell clones did not recognize mutation-derived neoantigens, they did respond to neoepitopes derived from the gene product of the 3′UTR region of ADIPOR2 mRNA or translated from an alternative ORF of the LAGE1b gene. Thus, gene products derived from aberrant translation of an mRNA can serve as neoantigens in cancer cells. By contrast, it was found that all tumor-reactive CD8.sup.+ T-cell clones recognized neoantigens derived from somatic missense mutations. Further investigation of neoantigens in another melanoma patient (patient #136) revealed the similar pattern of neoantigen recognition as seen with melanoma patient #135, indicating that although neoantigens capable of eliciting a specific T-cell response can be different among cancer patients, the regulation and control of neoantigen-specific response and recognition pattern appear consistent among patients within the same type of cancer.
[0210] Immunodominant and cryptic antigens have been reported in autoimmunity and infectious diseases, the study indicates that similar mechanisms can operate in response to cancer neoantigens. In this study, tumor clones growing from single cells were generated and checked the specific neoantigen presentation on tumor clones by recognition of T-cell clones. The data confirms that the presentation level of neoantigens varies in one tumor tissue and there are normally a few neoantigens presented by most of tumor cells, inducing the expansion of their specific T cells to the largest population in primary tumor-reactive T cells. These dominant neoantigens are ideal targets for T-cell based immunotherapy and contribute more in tumor suppression and elimination rather than those subdominant neoantigens, which has been demonstrated by the in vitro and in vivo study.
[0211] A single TCR has been reported to recognize multiple antigens in infectious and autoimmune diseases, but it has not demonstrated that a single TCR can recognize dual cancer neoantigens. By taking advantage of tumor-reactive single T-cell clones, provided herein is clear evidence that a single TCR can recognize two neoantigens presented by the same MHC II molecules, without amino acid sequence similarity. Surprisingly, a single TCR from 135-C76 T cells recognized the PCDHB16 neoantigen presented by HLA-DR4 molecules when transduced into CD4.sup.+ T cells, while the same TCR responded only to the RPN2 neoantigens presented by HLA-A1 molecules when expressed on CD8.sup.+ T cells. Hence, the use of TCRs with dual neoantigen specificity can enhance the potency of T cells used in TCR-based cancer immunotherapy. [0212] c) Methods [0213] (1) Human Samples
[0214] Fresh tumor tissues were obtained from melanoma patients (#135 and #136) under an approved Institutional Review Board (IRB) protocol of Houston Methodist. Both patients were HLA typed (#135: HLA-A1, B8, C7, DR3, DR4, DP4; #136: HLA-A1, B8, C7, DR1, DR11, DP4).
[0215] (2) Primary and Cell Lines
[0216] All cells are grown at 37° C. with 5% CO.sub.2 unless otherwise stated. Patients' melanoma cell lines (135 mel and 136 mel) were generated and expanded in RPMI 1640 medium (Fisher scientific) with 10% (v/v) fetal bovine serum (Valley Biomedical Inc.) supplemented with 1% (v/v) penicillin-streptomycin (Thermo Fisher Scientific).
[0217] All types of human T-cell containing cells, including peripheral blood mononuclear cell (PBMCs), tumor-infiltrated lymphocytes (135TIL and 136TIL), T-cell clones etc. were established and maintained in T-cell culture medium (TCM) containing RPMI 1640 medium with 10% (v/v) human AB serum (Corning) supplemented with 1% (v/v) HEPES (Thermo Fisher Scientific), 1% (v/v) GlutaMAX (Thermo Fisher Scientific) and 0.1% (v/v) 2-mercaptoethanol (Thermo Fisher Scientific)) supplemented with recombinant human interleukin-2 (IL-2, 300 IU/ml) (Peprotech).
[0218] Epstein-Barr virus-transformed B (EBV-B) cells derived from patient #135 were cultured in RPMI 1640 medium with 10% (v/v) fetal bovine serum supplemented with 1% (v/v) penicillin-streptomycin.
[0219] Fibroblasts derived from patient #136 were cultured in Dulbecco's Modified Eagle Medium (DMEM) (Fisher Scientific) with 10% (v/v) fetal bovine serum (FBS) containing 1% penicillin-streptomycin.
[0220] HEK293-based antigen presenting cells (APCs), including 293IMDR3/DP4, 293IMDR4/DP4, 293IMDR1/DP4, 293IMDR11/DP4 were generated, and maintained in Dulbecco's DMEM with 10% (v/v) FBS containing 1% (v/v) penicillin-streptomycin. HEK293T cells, 239T-HLA-A1 and 293T-HLA-B8 cells were maintained in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.
[0221] Cos-7 cells were cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.
[0222] Phoenix-AMPHO cells were cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% (v/v) penicillin-streptomycin.
[0223] (3) T-Cell Cloning and T-Cell Expansion
[0224] T-cell cloning and expansion were performed. Briefly, T-cell clones from 135TILs were generated by a limited dilution (0.3 cell/well) in 96-well plates. Allogeneic PBMCs from normal donors were irradiated (60 Gy) and then seeded as feeder cells (7×10.sup.4/well) in TCM containing 35 ng/ml OKT3 (anti-hCD3 antibody) (R&D Systems) and 35 ng/ml anti-hCD28 antibody (R&D Systems). On day 1, recombinant IL-2 was added to the final concentration of 300 IU/ml. On day 5, half of the culture medium was replaced with fresh TCM containing recombinant IL-2, and then the medium was changed every other day or when the medium color turned yellow. On day 14, T-cell clones were harvested, counted, tested for tumor reactivity. Tumor reactive T-cell clones were further expanded.
[0225] For T-cell expansion, 2.5×10.sup.5 T cells were transferred into 25 ml flasks with 20 ml of TCM. 2×10.sup.7 irradiated PBMCs (60 Gy) and 5×10.sup.6 irradiated EBV-B cells (210 Gy) were added as feeder cells. OKT3 and anti-hCD28 antibody were added to the final concentration of 35 ng/ml. On day 1, recombinant interleukin-2 was added to the final concentration of 300 IU/ml. Culture medium was changed, as described above, starting on day 5. On day 14, expanded T cells were harvested, counted, and restored in liquid nitrogen until use.
[0226] (4) T-Cell Reactivity Assay 96-well tissue culture plates were coated with phosphate-buffered saline (PBS, GenDEPOT) containing 50 μg/ml poly-L-lysine (Millipore Sigma) at 37° C. 10 min later, the plates were rinsed twice with PBS. Tumor cells or 293-APCs were seeded at 5×10.sup.4/well in the 96-well plate and cultured at 37° C., 5% CO.sub.2 for at least 4 h. For plasmid transfection, 250 ng of plasmid DNA were transfected into 293-APCs per well with 0.5 μl of Lipofectamine 2000 (Thermo Fisher Scientific) according to the manufacturer's instructions and incubated at 37° C., in 5% CO.sub.2 overnight. For peptide presentation, synthesized peptides (GeneScript) were added to each well at different final concentrations from 10.sup.2 to 10.sup.−4 μM and incubated at 37° C., 5% CO.sub.2. The cells transfected with plasmids or pulsed with peptides were washed with TCM twice, and then cocultured with T cells (1×105/well). After overnight culture, 50 μl of supernatant was collected for measuring cytokine release with an ELISA assay.
[0227] 96-well microplate (Greiner Bio-one) was coated with 50 μl of PBS containing anti-human IFN-γ (Thermo Fisher Scientific, primary antibody, 1 μg/ml) per well and incubated at 4° C. overnight. After three washes with PBS, each well was blocked by 200 μl of 1% (w/v) bovine serum albumin (BSA) in PBS and incubated at room temperature for at least 1 h. After three washes with PBS, 50 μl of culture supernatant or human IFN-γ standard (Peprotech) was added to each well at different concentrations in PBS and incubated at room temperature for 1 h. After three washes, 50 μl of biotin-labeled anti-human IFN-γ antibody (Thermo Fisher Scientific, secondary antibody, 1:1000 dilution) in 2% (v/v) FBS-containing PBS buffer was added to each well, and then incubated at room temperature for 1 h. The 96-well plate was washed six times with PBS containing 1% (v/v) Tween-20. 50 μl of 1:5000 diluted poly-HRP Streptavidin (Thermo Fisher Scientific) in PBS containing 1% (w/v) BSA was then added per well, and incubated in the dark at room temperature for 30 min. After six washes with PBS containing 1% (v/v) Tween-20, 100 μl of 3,3′,5,5′-tetramethylbenzidine substrate solution (TMB, Millipore Sigma) was added per well and incubated at room temperature for 10 min. The color reaction was terminated by adding 50 μl of 2.5 M H.sub.2504 The plate was read at OD450 nm. The concentration of IFN-γ was calculated by standard reading.
[0228] (5) Whole-Exome Sequencing of Tumor and Control Cells
[0229] Whole-exome sequencing was performed by a standard protocol. Briefly, genomic DNA from 135 mel and 135TILs were extracted with the Quick-DNA™ Miniprep Kit (Zymo Research). DNA library construction, exome capture, and sequencing were performed by the standard procedure of Illumina HiSeq 3000. The average reads were around 150× of the coverage on each base. Over 8 billion bases of sequence data were obtained from each sequenced sample. Output from Illumina was processed to yield BAM files, which contained aligned reads to the NCBI Human Reference Genome Build hg19 with well-calibrated quality scores. Somatic mutations were detected referred to previous reports and determined after filtering with the following parameters: filtering out variants in TIL sequencing; read depth >10; 1000 human genome frequency <0.01; missense mutation. Finally, somatic mutations were identified for the tandem minigene construction.
(6) Construction of Tandem Minigenes (TMGs) and Screening with T Cells
[0230] The tandem minigene construct was designed to encode 10 mutations-containing peptides with 12 amino acids flanked on both sides. The codons for double-strained nucleotide synthesis were optimized to avoid EcoRI and XhoI sites for cloning. The synthesized tandem minigenes (Integrated DNA Technologies) were then cloned into pTSX expression vector with in-frame fusion with the Ii80 open reading frame.
[0231] The screening of antigen-specific T cells against TMGs were performed as described in T-cell reactivity assay above. Briefly, TMGs were transfected with Lipofectamine 2000 in the artificial APCs according to the patient's HLA typing respectively. The transfected cells were cocultured with tumor-reactive T cells overnight and the IFN-γ release in the supernatant in each well was determined by ELISA on the second day.
(7) In Vitro Transcription of TMGs
[0232] In vitro transcription (IVT) of TMGs was performed with a standard protocol of HiScribe™ T7 ARCA mRNA Kit (New England Biolabs). Briefly, each TMG construct was linearized by cutting of XhoI (New England Biolabs) and gel-purified with DNA Recovery Kit (Zymo Research). The linearized DNA (1 μg) were used as template for RNA synthesis respectively and mixed with T7 RNA polymerase and reaction buffer in a 20 μl mixture as the kit instruction. The reaction was incubated at 37° C. for at least 30 min. Then the DNA templates were removed by adding 2 μl of DNase I and incubating at 37° C. for 15 min. Next the poly (A) tailing reaction (50 μl) was completed by mixing the IVT reaction with Poly(A) polymerase and reaction buffer as the kit instruction and incubating at 37° C. for 30 min.
[0233] Each 5′-capped and 3′-tailed WT-RNAs were purified by adding 1/2 volume of LiCl to the reaction as the kit instruction. After the incubation at −20° C. for 30 min, the RNAs were spun down by at 4° C. for 15 min at top speed and then the pellets were washed with 500 μl of cold 70% ethanol and air-dried. The pellets were finally resuspended and dissolved in suitable RNA storage solution.
(8) Electroporation of EBV-B Cells and Screening with T Cells
[0234] The electroporation was performed as with a modified protocol of Amaxa® Cell Line Nucleofector® Kit V (Lonza). Briefly 1×10.sup.5 EBV-B cells were spun down and resuspended in 50 μl of electroporation solution (Nucleofector® Solution:Supplement=9:2) and transferred into a kit-attached 100 μl cuvette immediately. 500 ng of IVT-RNA of TMGs were added to the cuvette respectively. The electroporation was completed by placing the cuvette in the Nucleofector™ 2b device and running a pre-set program (T-020).
[0235] The electroporated EBV-B cells were cultured in grow medium at 37° C. overnight and then cocultured with tumor-reactive T cells as described above. The IFN-γ release in the supernatant in each well was determined by ELISA on the second day.
(9) Ii80-cDNA Library Construction and Screening with T Cells
[0236] For cDNA library construction and screening, total RNAs of 135 mel tumor cells were extracted with TRIzol reagent (Thermo Fisher Scientific) according to the manufacturer's instructions. mRNA was further purified from total RNAs, and converted into cDNA using a cDNA construction kit (Thermo Fisher Scientific) with an oligo-dT primer. After ligation with adaptors, cDNAs were cloned into a pTSX vector containing an Ii80 fragment upstream of the cloning sites, and then transformed into Stbl3™ Escherichia coli bacteria (Thermo Fisher Scientific). The cDNA bacterial library was divided into pools with each consisted of approximately 100 cDNA clones. Each plasmid pool was isolated from bacteria and transfected into 293-based APCs per well with 0.5 μl of Lipofectamine 2000 and incubated at 37° C., 5% CO.sub.2 overnight. After gentle washes, transfected cells were cocultured with T-cell clone (1×10.sup.5/well) overnight. The positive pools were identified based on cytokine release from T cells by ELISA. Bacterial colonies transformed with the positive pool plasmid DNA were picked and prepared for DNA isolation. Several hundreds of individual colonies were repeated for screening with T cells. The positive clones were sequenced to identify target antigens recognized by T cells.
(10) Intracellular Staining of IFN-γ
[0237] T cells were stimulated by various in vitro methods, such as tumor cells, peptides or DNA encoding neoantigens, etc. and induced to release cytokines. Stimulated T cells were resuspended in 100 μl of Cytofix/Cytoperm solution (BD Biosciences) per well for microwell plates (or 250 μl for tubes) and incubated for 20 min at 4° C. for cell fixation. Fixed cells were permeabilized by washing twice in 1× Perm/Wash buffer (BD Biosciences). Then the fixed/permeabilized cells were thoroughly resuspended in 50 μl of 1× Perm/Wash buffer containing 1 μl of phycoerythrin (PE)-conjugated IFN-γ antibody (Thermo Fisher Scientific) and incubate at 4° C. for 30 min in the dark. After that the cells were washed twice with 1× Perm/Wash buffer and ready for flow cytometric analysis.
(11) Live-Cell IFN-γ Staining
[0238] Cytokine Secretion Assay-Detection Kit (MACS Miltenyi Biotec) was used in cell-surface IFN-γ staining based on a commercial protocol. Briefly 1×10.sup.6 T cells were washed with growth medium and spun down at 300×g for 10 min and then resuspended in 100 μl of grow medium containing 5% human serum. Washed cells were transferred in one well of a 96-well plate and added with antigenic peptides to the concentration of 1-10m/ml and incubated for 3-6 h at 37° C. in 5% CO.sub.2. After in vitro stimulation of T cells, they were transferred to 15 ml closable tube and washed with 1 ml of cold buffer (cold PBS containing 2% FBS (v/v) and 5 mM EDTA) and spun down. The cell pellets were resuspended in 90 μl of cold growth medium and added with 10 μl of IFN-γ Catch Reagent and incubated for 5 min on ice. After that, 1 ml of warm growth medium (37° C.) was added in the tube and the cells were incubated in closed tube for 45 min at 37° C. under slow continuous rotation. The tube was transferred on ice after incubation and the cells were washed with cold buffer and spun down. The cell pellets were resuspended in 90 μl of cold buffer and then added with 10 μl of IFN-γ Detection Antibody and incubated for 10 min on ice. Finally cells were washed with cold buffer and ready for flow cytometric analysis.
(12) Tumor Cell Cloning
[0239] 135 mel tumor cells were suspended in 1× Keratinocyte-SFM and supplements (Thermo Fisher Scientific) with additional 10% (v/v) fetal bovine serum and 1% (v/v) penicillin-streptomycin at the density of 1.5 cell/ml after trypsinization, then 200 μl of mixture was added per well to 96-well plates and incubated at 37° C., 5% CO.sub.2. On day 5, wells containing a single colony were marked after observation under the microscope. On day 10, the medium was changed with fresh culture medium. The plates were incubated until day 20. The tumor clones in marked wells were trypsinized, and then expanded in larger flasks separately.
(13) Cytotoxicity Assay
[0240] CytoTox 96® Non-Radioactive Cytotoxicity Assay Kit (Promega) was applied to measure T cell —tumor cytotoxicity. Briefly, T cells (effectors) and tumor cells (targets) were mixed in TCM at the E/T ratio from 2:1 to 20:1, then cocultured at 100 μl/well in a 96-well plate at 37° C., 5% CO.sub.2 for at least 4 h. After the incubation, lactate dehydrogenase (LDH) concentration in the supernatant of each well was measured with the Kit following the manufacturer's instructions and the cytotoxicity percentage was calculated.
(14) In Vivo Adoptive T Cell-Based Antitumor Assay
[0241] NSG mice (female) were 6-8 weeks of age, which were obtained from Jackson Laboratories. All NSG mice-related experiments were performed in animal housing facilities under specific pathogen-free conditions at Houston Methodist. All animal studies were performed as instructed by the NIH guidelines and approved by the Animal Care and Use Committee of Houston Methodist.
[0242] Subcutaneous injection of 135 mel tumor lines in NSG mice was performed to construct an animal model for adoptive T cell-based therapy (4 mice in each group). Briefly, 3×10.sup.6 135 mel cells were injected on day 0 per mouse. On day 15, different neoantigen-specific T cell clones were intravenously injected at 10.sup.7 T cells per mouse, followed by the intraperitoneal injection of 5×10.sup.4 unit of interleukin-2 per mouse on day 15, 16 and 17. The serum level of IFN-γ in each tumor-bearing mouse was collected on day 19 and measured via ELISA. The growth of 135 mel in each mouse was tracked until day 40.
(15) TCR β CDR3 Repertoire Analysis
[0243] The TCR β CDR3 analysis were referred to a previous description (Robins et al., 2009). Neoantigen-specific T cells were isolated after the flow cytometric sorting of peptide-stimulated TIL bulk by live-cell IFN-γ staining. Genomic DNAs were isolated from sorted T cells with Quick-DNA™ Miniprep Kit (Zymo Research) according to a commercial protocol. The isolated genomic DNAs were used as templates for PCR amplification of rearranged TCR β CDR3 in mature T cells. A 50 μl PCR mixture was set up with 16 ng/μl genomic DNA templates, 1 μM of forward BV primer sets and 1 μM of reverse BJ primer sets (Table 5), 10× reaction buffer and 0.2 μl of AccuPrime Taq DNA Polymerase HF (Thermo Fisher Scientific). The PCR was performed for 35 cycles (95° C. for 30 s, 59° C. for 30 s and 68° C. for 1 min). The PCR amplicons were gel-purified by DNA Recovery Kit (Zymo Research) and sent for amplicon next-generation sequencing (Genewiz). The sequencing result was analyzed via a laboratory pipeline for data filtering and TCR repertoire alignment.
TABLE-US-00005 TABLE 5 Primers for TCR β CDR3 amplification PRIMER PRIMER SETS NAME SEQUENCE (5′.fwdarw.3′) BV-F TRBV2 TCAAATTTCACTCTGAAGATCCGGTCCACAA (SEQ ID NO: 747) TRBV3-1 GCTCACTTAAATCTTCACATCAATTCCCTGG 748 TRBV4-1 CTTAAACCTTCACCTACACGCCCTGC 749 TRBV4-2, 4-3 CTTATTCCTTCACCTACACACCCTGC 750 TRBV5-1 GCTCTGAGATGAATGTGAGCACCTTG 751 TRBV5-3 GCTCTGAGATGAATGTGAGTGCCTTG 752 TRBV5-4, 5-5, 5-6, GCTCTGAGCTGAATGTGAACGCCTTG 753 5-7, 5-8 TRBV6-1 TCGCTCAGGCTGGAGTCGGCTG 754 TRBV6-2, 6-3 GCTGGGGTTGGAGTCGGCTG 755 TRBV6-4 CCCTCACGTTGGCGTCTGCTG 756 TRBV6-5 GCTCAGGCTGCTGTCGGCTG 757 TRBV6-6 CGCTCAGGCTGGAGTTGGCTG 758 TRBV6-7 CCCCTCAAGCTGGAGTCAGCTG 759 TRBV6-8 CACTCAGGCTGGTGTCGGCTG 760 TRBV6-9 CGCTCAGGCTGGAGTCAGCTG 761 TRBV7-1 CCACTCTGAAGTTCCAGCGCACAC 762 TRBV7-2 CACTCTGACGATCCAGCGCACAC 763 TRBV7-3 CTCTACTCTGAAGATCCAGCGCACAG 764 TRBV7-4 CCACTCTGAAGATCCAGCGCACAG 765 TRBV7-6 CACTCTGACGATCCAGCGCACAG 766 TRBV7-7 CCACTCTGACGATTCAGCGCACAG 767 TRBV7-8 CCACTCTGAAGATCCAGCGCACAC 768 TRBV7-9 CACCTTGGAGATCCAGCGCACAG 769 TRBV9 GCACTCTGAACTAAACCTGAGCTCTCTG 770 TRBV10-1 CCCCTCACTCTGGAGTCTGCTG 771 TRBV10-2 CCCCCTCACTCTGGAGTCAGCTA 772 TRBV10-3 CCTCCTCACTCTGGAGTCCGCTA 773 TRBV11-1, 11-3 CCACTCTCAAGATCCAGCCTGCAG 774 TRBV11-2 CTCCACTCTCAAGATCCAGCCTGCAA 775 TRBV12-3, 12-4, CCACTCTGAAGATCCAGCCCTCAG 776 12-5 TRBV13 CATTCTGAACTGAACATGAGCTCCTTGG 777 TRBV14 CTACTCTGAAGGTGCAGCCTGCAG 778 TRBV15 GATAACTTCCAATCCAGGAGGCCGAACA 779 TRBV16 CTGTAGCCTTGAGATCCAGGCTACGA 780 TRBV17 CTTCCACGCTGAAGATCCATCCCG 781 TRBV18 GCATCCTGAGGATCCAGCAGGTAG 782 TRBV19 CCTCTCACTGTGACATCGGCCC 783 TRBV20-1 CTTGTCCACTCTGACAGTGACCAGTG 784 TRBV23-1 CAGCCTGGCAATCCTGTCCTCAG 785 TRBV24-1 CTCCCTGTCCCTAGAGTCTGCCAT 786 TRBV25-1 CCCTGACCCTGGAGTCTGCCA 787 TRBV27 CCCTGATCCTGGAGTCGCCCA 788 TRBV28 CTCCCTGATTCTGGAGTCCGCCA 789 TRBV29-1 CTAACATTCTCAACTCTGACTGTGAGCAACA 790 TRBV30 CGGCAGTTCATCCTGAGTTCTAAGAAGC 791 BJ-R TRBJ1-1 TCACGAAGGTCCCCAGTATCCAACTTTACCTACAACT GTGAGTCTGGTGCCTTGTCCAAA 792 TRBJ1-2 TCACGAAGGTCCCCAGTATCCAACTACCTACAACGGT TAACCTGGTCCCCGAACCGAA 793 TRBJ1-3 TCACGAAGGTCCCCAGTATCCAACTACCTACAACAGT GAGCCAACTTCCCTCTCCAAA 794 TRBJ1-4 TCACGAAGGTCCCCAGTATCCAACTCCAAGACAGAG AGCTGGGTTCCACTGCCAAA 795 TRBJ1-6 TCACGAAGGTCCCCAGTATCCAACTCTGTCACAGTGA GCCTGGTCCCGTTCCCAAA 796 TRBJ2-1 TCACGAAGGTCCCCAGTATCCAACTCGGTGAGCCGTG TCCCTGGCCCGAA 797 TRBJ2-2 TCACGAAGGTCCCCAGTATCCAACTCCAGTACGGTCA GCCTAGAGCCTTCTCCAAA 798 TRBJ2-3 TCACGAAGGTCCCCAGTATCCAACTACTGTCAGCCGG GTGCCTGGGCCAAA 799 TRBJ2-4 TCACGAAGGTCCCCAGTATCCAACTAGAGCCGGGTCC CGGCGCCGAA 800 TRBJ2-5 TCACGAAGGTCCCCAGTATCCAACTGGAGCCGCGTGC CTGGCCCGAA 801 TRBJ2-6 TCACGAAGGTCCCCAGTATCCAACTGTCAGCCTGCTG CCGGCCCCGAA 802 TRBJ2-7 TCACGAAGGTCCCCAGTATCCAACTGTGAGCCTGGTG CCCGGCCCGAA 803
TABLE-US-00006 TABLE 6 Amino acid sequences of V-J joint in TCR β CDR3 of dominant neoantigen-specific T cells in 135TILs TRBV Paired V-D-J sequence TRBJ REPERTOIRE V segment Joint (-NDN-) J segment repertoire Amino acid sequences (CDR3 β) of potentially functional PCDHB16-reactive TCRs TRBV6-2 TSVYFCAS SYDTGSGDYT DTQYFGPGTR TRBJ2-3 SAVYLCAS SLGAW GYTFGSGTR TRBJ1-2 TRBV7-2 SAVYLCAS IGVIGG NEQFFGPGTR TRBJ2-1 SAVYLCAS SLGP NEQFFGPGTR TRBJ2-1 TRBV7-6 SAMYRCAS SLHRAD GANVLTFGPGTR TRBJ2-6 TRBV7-9 SAMYLCAS SQRGTE NEKLFFWQWNP TRBJ1-4 TRBV10-2 TSVYFCAS SGTLGYN SPLHFGNGTR TRBJ1-6 TRBV11-2 SAVYLCAS SPTSGP DTQYFGPGTR TRBJ2-3 TRBV12-3 SAVYFCAS SFPLA GELFFGEGSR TRBJ2-2 TRBV19 TAFYLCAS TTTSGTY NEQFFGPGTR TRBJ2-1 TAFYLCAS SSWVY GYTFGSGTR TRBJ1-2 TRBV25-1 TSQYLCAS PKARTGD YEQYFGPGTR TRBJ2-7 TSLYFCAS TPVGQGIW SPLHFGNGTR TRBJ1-6 TRBV27 TSLYFCAS SSISTGYWT EAFFGQGTR TRBJ1-1 TSLYFCAS SFRTGGGGSY GYTFGSGTR TRBJ1-2 Amino acid sequences (CDR3 β) of potentially functional MPG-reactive TCRs TRBV5-4 SALYLCAS STRLAGYT DTQYFGPGTR TRBJ2-3 TRBV5-8 SALYLCAS SPREDGT DTQYFGPGTR TRBJ2-3 TRBV6-1 TSVYFCAS SDRGSG NTIYFGEGSW TRBJ1-3 TRBV6-3 TSVYFCAS SYDTGSGDYT DTQYFGPGTR TRBJ2-3 TRBV6-4 TSVYFCAS RNRDRDT EAFFGQGTR TRBJ1-1 TSVYFCAS RNRDWDT EAFFGQGTR TRBJ1-1 TRBV7-9 SAMYLCAS SPRQGDQM NTQYFGPGTR TRBJ2-3 TRBV19 TAFYLCAS RDRQGDT EAFFGQGTR TRBJ1-1 TRBV25-1 TSQYLCAS SENPNSG NTIYFGEGSW TRBJ1-3 TRBV28 TSMYLCAS RSAPL EAFFGQGTR TRBJ1-1 V segment, Joint (-NDN-) and J segment are sections of a single contiguous sequence. Each sequence is depicted here to emphasize the different segments.
(16) TCRα/β Chain Variable Regions and Full-Length TCRα/β Chain Cloning
[0244] TCRα/β chain variable regions (CDR3) were amplified from a single T cell by three rounds of nested PCR. The first one-step RT-PCR was performed with the OneStep RT-PCR kit (Qiagen) in a 25 μl reaction and the following primers (Table 7): mixα1 (38 primers mixed, 10 μM each) and pCal (TCRα), mixβ1 (36 primers mixed, 10 μM each) and pCβ1 (TCRβ). 2 μl of PBS containing at least one T cell was added as the DNA template. The one-step RT-PCR was performed by reverse transcription (50° C., 30 min) followed by PCR for 25 cycles (94° C. for 30 s, 62° C. for 1 min, and 72° C. for 1 min), according to the manufacturer's instructions. The second and third PCR reactions were performed with KOD Hot Start DNA polymerase (Millipore Sigma) following the manufacturer's instructions. One μl of the first-round PCR product was added as a template for the second-round PCR in a 25 μl reaction. The second-round PCR was performed with the following primers (Table 7): mixα2 (36 primers mixed, 10 μM each) and pCα2 (TCRα), mixβ2 (36 primers mixed, 10 μM each) and pCβ2 (TCRβ) for 25 cycles (94° C. for 30 s, 64° C. for 1 min, and 70° C. for 1 min). One μl of the second-round PCR product was added as a template for the third-round PCR in a 25 μl reaction. The third-round PCR was performed with the following primers (Table 7): pF3 and pCα3 (TCRα), pF3 and pCβ3 (TCRβ), for 35 cycles (94° C. for 30 s, 64° C. for 1 min, and 70° C. for 1 min). The final PCR amplicons were gel-purified (Zymo Research) and sequenced (Genewiz). The genotypes of TCRs were determined by search with the online tools V-QUEST (IMGT) and Igblast (NCBI).
TABLE-US-00007 TABLE 7 Primers for TCRα/β chain variable regions (CDR3) amplification Primer Name Sequence (5′.fwdarw.3′) 1st round of mixα1 TRAV1 CTG CAC GTA CCA GAC ATC TGG GTT nested PCR 804 TRAV2 GGC TCA AAG CCT TCT CAG CAG G 805 TRAV3 GGA TAA CCT GGT TAA AGG CAG CTA 806 TRAV4 GGA TAC AAG ACA AAA GTT ACA AAC GA 807 TRAV5 GCT GAC GTA TAT TTT TTC AAA TAT GGA 808 TRAV6 GGA AGA GGC CCT GTT TTC TTG CT 809 TRAV7 GCT GGA TAT GAG AAG CAG AAA GGA 810 TRAV8 AGG ACT CCA GCT TCT CCT GAA GTA 811 TRAV9 GTA TGT CCA ATA TCC TGG AGA AGG T 812 TRAV10 CAG TCA GAA CAC AAA GTC GAA CGG 813 TRAV12.1 CCT AAG TTG CTG ATG TCC GTA TAC 814 TRAV12.2 GGG AAA AGC CCT GAG TTG ATA ATG T 815 TRAV12.3 GCT GAT GTA CAC ATA CTC CAG TGG 816 TRAV13.1 CCC TTG GTA TAA GCA AGA ACT TGG 817 TRAV13.2 CCT CAA TTC ATT ATA GAC ATT CGT TC 818 TRAV14 GCA AAA TGC AAC AGA AGG TCG CTA 819 TRAV16 TAG AGA GAG CAT CAA AGG CTT CAC 820 TRAV17 CGT TCA AAT GAA AGA GAG AAA CAC AG 821 TRAV18 CCT GAA AAG TTC AGA AAA CCA GGA G 822 TRAV19 GGT CGG TAT TCT TGG AAC TTC CAG 823 TRAV20 GCT GGG GAA GAA AAG GAG AAA GAA A 824 TRAV21 GTC AGA GAG AGC AAA CAA GTG GAA 825 TRAV22 GGA CAA AAC AGA ATG GAA GAT TAA GC 826 TRAV23 CCA GAT GTG AGT GAA AAG AAA GAA G 827 TRAV24 GAC TTT AAA TGG GGA TGA AAA GAA GA 828 TRAV25 GGA GAA GTG AAG AAG CAG AAA AGA C 829 TRAV26.1 CCA ATG AAA TGG CCT CTC TGA TCA 830 TRAV26.2 GCA ATG TGA ACA ACA GAA TGG CCT 831 TRAV27 GGT GGA GAA GTG AAG AAG CTC AAG 832 TRAV29 GGA TAA AAA TGA AGA TGG AAG ATT CAC 833 TRAV30 CCT GAT GAT ATT ACT GAA GGG TGG A 834 TRAV34 GGT GGG GAA GAG AAA AGT CAT GAA 835 TRAV35 GGT GAA TTG ACC TCA AAT GGA AGA C 836 TRAV36 GCT AAC TTC AAG TGG AAT TGA AAA GA 837 TRAV38 GAA GCT TAT AAG CAA CAG AAT GCA AC 838 TRAV39 GGA GCA GTG AAG CAG GAG GGA C 839 TRAV40 GAG AGA CAA TGG AAA ACA GCA AAA AC 840 TRAV41 GCT GAG CTC AGG GAA GAA GAA GC 841 mixβ1 TRBV2 CTG AAA TAT TCG ATG ATC AAT TCT CAG 842 TRBV3-1 TCA TTA TAA ATG AAA CAG TTC CAA ATC G 843 TRBV4 AGT GTG CCA AGT CGC TTC TCA C 844 TRBV5-4,8 CAG AGG AAA CTT CCC TCC TAG ATT 845 TRBV5-1 GAG ACA CAG AGA AAC AAA GGA AAC TTC 846 TRBV6-1 GGT ACC ACT GAC AAA GGA GAA GTC C 847 TRBV6-2,3 GAG GGT ACA ACT GCC AAA GGA 848 TRBV6-4 GGC AAA GGA GAA GTC CCT GAT GGT T 849 TRBV6-5,6 AAG GAG AAG TCC CCA ATG GCT ACA A 850 TRBV6-8 CTG ACA AAG AAG TCC CCA ATG GCT AC 851 TRBV6-9 CAC TGA CAA AGG AGA AGT CCC CGA T 852 TRBV7-2 AGA CAA ATC AGG GCT GCC CAG TGA 853 TRBV7-3 GAC TCA GGG CTG CCC AAC GAT 854 TRBV7-8 CCA GAA TGA AGC TCA ACT AGA CAA 855 TRBV7-4,6 GGT TCT CTG CAG AGA GGC CTG AG 856 TRBV7-7 GGC TGC CCA GTG ATC GGT TCT C 857 TRBV7-9 GAC TTA CTT CCA GAA TGA AGC TCA ACT 858 TRBV9 GAG CAA AAG GAA ACA TTC TTG AAC GAT T 859 TRBV10-1,3 GGC TGA TCC ATT ACT CAT ATG GTG TT 860 TRBV10-2 GAT AAA GGA GAA GTC CCC GAT GGC T 861 TRBV11 GAT TCA CAG TTG CCT AAG GAT CGA T 862 TRBV12-3,4 GAT TCA GGG ATG CCC GAG GAT CG 863 TRBV12-5 GAT TCG GGG ATG CCG AAG GAT CG 864 TRBV13 GCA GAG CGA TAA AGG AAG CAT CCC T 865 TRBV14 TCC GGT ATG CCC AAC AAT CGA TTC T 866 TRBV15 GAT TTT AAC AAT GAA GCA GAC ACC CCT 867 TRBV16 GAT GAA ACA GGT ATG CCC AAG GAA AG 868 TRBV18 TAT CAT AGA TGA GTC AGG AAT GCC AAA G 869 TRBV19 GAC TTT CAG AAA GGA GAT ATA GCT GAA 870 TRBV20-1 CAA GGC CAC ATA CGA GCA AGG CGT C 871 TRBV24-1 CAA AGA TAT AAA CAA AGG AGA GAT CTC T 872 TRBV25-1 AGA GAA GGG AGA TCT TTC CTC TGA GT 873 TRBV27-1 GAC TGA TAA GGG AGA TGT TCC TGA AG 874 TRBV28 GGC TGA TCT ATT TCT CAT ATG ATG TTA A 875 TRBV29 GCC ACA TAT GAG AGT GGA TTT GTC ATT 876 TRBV30 GGT GCC CCA GAA TCT CTC AGC CT 877 pCα1 CGG TGA ATA GGC AGA CAG ACT TGT 878 pCβ1 ACC AGT GTG GCC TTT TGG GTG TG 879 2nd round of mixα2 2TRAV1 CCA GGG TTT TCC CAG TCA CGA CAG nested PCR GTC GTT TTT CTT CAT TCC TTA GTC 880 2TRAV2 CCA GGG TTT TCC CAG TCA CGA CAC GAT ACA ACA TGA CCT ATG AAC GG 881 2TRAV3.1 CCA GGG TTT TCC CAG TCA CGA CCT TTG AAG CTG AAT TTA ACA AGA GCC 882 2TRAV4.1 CCA GGG TTT TCC CAG TCA CGA CCT CCC TGT TTA TCC CTG CCG AC 883 2TRAV5.1 CCA GGG TTT TCC CAG TCA CGA CAA ACA AGA CCA AAG ACT CAC TGT TC 884 2TRAV6 CCA GGG TTT TCC CAG TCA CGA CAA GAC TGA AGG TCA CCT TTG ATA CC 885 2TRAV7 CCA GGG TTT TCC CAG TCACGA CAC TAA ATG CTA CAT TAC TGA AGA ATG G 886 2TRAV8 CCA GGG TTT TCC CAG TCACGA CGC ATC AAC GGT TTT GAG GCT GAA TTT AA 887 2TRAV9 CCA GGG TTT TCC CAG TCA CGA CGA AAC CAC TTC TTT CCA CTT GGA GAA 888 2TRAV10 CCA GGG TTT TCC CAG TCA CGA CTA CAG CAA CTC TGG ATG CAG ACA C 889 2TRAV12 CCA GGG TTT TCC CAG TCA CGA CGA AGA TGG AAG GTT TAC AGC ACA 890 2TRAV13.1 CCA GGG TTT TCC CAG TCA CGA CGA CAT TCG TTC AAA TGT GGG CGA A 891 2TRAV13.2 CCA GGG TTT TCC CAG TCA CGA CGG CAA GGC CAA AGA GTC ACC GT 892 2TRAV14 CCA GGG TTT TCC CAG TCA CGA CTC CAG AAG GCA AGA AAA TCC GCC A 893 2TRAV16 CCA GGG TTT TCC CAG TCA CGA CGC TGA CCT TAA CAA AGG CGA GAC A 894 2TRAV17 CCA GGG TTT TCC CAG TCA CGA CTT AAG AGT CAC GCT TGA CAC TTC CA 895 2TRAV18 CCA GGG TTT TCC CAG TCA CGA CGC AGA GGT TTT CAG GCC AGT CCT 896 2TRAV19 CCA GGG TTT TCC CAG TCA CGA CTC CAC CAG TTC CTT CAA CTT CAC C 897 2TRAV20 CCA GGG TTT TCC CAG TCA CGA CGC CAC ATT AAC AAA GAA GGA AAG CT 898 2TRAV21 CCA GGG TTT TCC CAG TCA CGA CGC CTC GCT GGA TAA ATC ATC AGG A 899 2TRAV22 CCA GGG TTT TCC CAG TCA CGA CAC GAC TGT CGC TAC GGA ACG CTA 900 2TRAV23 CCA GGG TTT TCC CAG TCA CGA CCA CAA TCT CCT TCA ATA AAA GTG CCA 901 2TRAV24 CCA GGG TTT TCC CAG TCA CGA CAC GAA TAA GTG CCA CTC TTA ATA CCA 902 2TRAV25 CCA GGG TTT TCC CAG TCA CGA CGT TTG GAG AAG CAA AAA AGA ACA GCT 903 2TRAV26.1 CCA GGG TTT TCC CAG TCA CGA CCA GAA GAC AGA AAG TCC AGC ACC T 904 2TRAV26.2 CCA GGG TTT TCC CAG TCA CGA CAT CGC TGA AGA CAG AAA GTC CAG T 905 2TRAV27 CCA GGG TTT TCC CAG TCA CGA CAC TAA CCT TTC AGT TTG GTG ATG CAA 906 2TRAV29 CCA GGG TTT TCC CAG TCA CGA CCT TAA ACA AAA GTG CCA AGC ACC TC 907 2TRAV30 CCA GGG TTT TCC CAG TCA CGA CAA TAT CTG CTT CAT TTA ATG AAA AAA AGC 908 2TRAV34 CCA GGG TTT TCC CAG TCA CGA CCC AAG TTG GAT GAG AAA AAG CAG CA 909 2TRAV35 CCA GGG TTT TCC CAG TCA CGA CCT CAG TTT GGT ATA ACC AGA AAG GA 910 2TRAV36 CCA GGG TTT TCC CAG TCA CGA CGG AAG ACT AAG TAG CAT ATT AGA TAA G 911 2TRAV38 CCA GGG TTT TCC CAG TCA CGA CCT GTG AAC TTC CAG AAA GCA GCC A 912 2TRAV39 CCA GGG TTT TCC CAG TCA CGA CCC TCA CTT GAT ACC AAA GCC CGT 913 2TRAV40 CCA GGG TTT TCC CAG TCA CGA CAG GCG GAA ATA TTA AAG ACA AAA ACT C 914 2TRAV41 CCA GGG TTT TCC CAG TCA CGA CGA TTA ATT GCC ACA ATA AAC ATA CAG G 915 mixβ2 2TRBV2 CCA GGG TTT TCC CAG TCA CGA CGC CTG ATG GAT CAA ATT TCA CTC TG 916 2TRBV3-1 CCA GGG TTT TCC CAG TCA CGA CTC TCA CCT AAA TCT CCA GAC AAA GCT 917 2TRBV4 CCA GGG TTT TCC CAG TCA CGA CCC TGA ATG CCC CAA CAG CTC TC 918 2TRBV5-4,8 CCA GGG TTT TCC CAG TCA CGA CCT CTG AGC TGA ATG TGA ACG CCT 919 2TRBV5-1 CCA GGG TTT TCC CAG TCA CGA CCG ATT CTC AGG GCG CCA GTT CTC T 920 2TRBV6-1 CCA GGG TTT TCC CAG TCA CGA CTG GCT ACA ATG TCT CCA GAT TAA ACA A 921 2TRBV6-2,3 CCA GGG TTT TCC CAG TCA CGA CCC CTG ATG GCT ACA ATG TCT CCA GA 922 2TRBV6-4 CCA GGG TTT TCC CAG TCA CGA CGT GTC TCC AGA GCA AAC ACA GAT GAT T 923 2TRBV6-5,6 CCA GGG TTT TCC CAG TCA CGA CGT CTC CAG ATC AAC CAC AGA GGA T 924 2TRBV6-8 CCA GGG TTT TCC CAG TCA CGA CGT CTC TAG ATT AAA CAC AGA GGA TTT C 925 2TRBV6-9 CCA GGG TTT TCC CAG TCA CGA CGG CTA CAA TGT ATC CAG ATC AAA CA 926 2TRBV7-2 CCA GGG TTT TCC CAG TCA CGA CTC GCT TCT CTG CAG AGA GGA CTG G 927 2TRBV7-3 CCA GGG TTT TCC CAG TCA CGA CCG GTT CTT TGC AGT CAG GCC TGA 928 2TRBV7-8 CCA GGG TTT TCC CAG TCA CGA CCC AGT GAT CGC TTC TTT GCA GAA A 929 2TRBV7-4,6 CCA GGG TTT TCC CAG TCA CGA CTC TCC ACT CTG AAG ATC CAG CGC A 930 2TRBV7-7 CCA GGG TTT TCC CAG TCA CGA CGC AGA GAG GCC TGA GGG ATC CAT 931 2TRBV7-9 CCA GGG TTT TCC CAG TCA CGA CCT GCA GAG AGG CCT AAG GGA TCT 932 2TRBV9 CCA GGG TTT TCC CAG TCA CGA CCT CCG CAC AAC AGT TCC CTG ACT T 933 2TRBV10-1.3 CCA GGG TTT TCC CAG TCA CGA CCA GAT GGC TAT AGT GTC TCT AGA TCA AA 934 2TRBV10-2 CCA GGG TTT TCC CAG TCA CGA CGT TGT CTC CAG ATC CAA GAC AGA GAA 935 2TRBV11 CCA GGG TTT TCC CAG TCA CGA CGC AGA GAG GCT CAA AGG AGT AGA CT 936 2TRBV12-3.4 CCA GGG TTT TCC CAG TCA CGA CGC TAA GAT GCC TAA TGC ATC ATT CTC 937 2TRBV12-5 CCA GGG TTT TCC CAG TCA CGA CCT CAG CAG AGA TGC CTG ATG CAA CT 938 2TRBV13 CCA GGG TTT TCC CAG TCA CGA CTC TCA GCT CAA CAG TTC AGT GAC TA 939 2TRBV14 CCA GGG TTT TCC CAG TCA CGA CGC TGA AAG GAC TGG AGG GAC GTA T 940 2TRBV15 CCA GGG TTT TCC CAG TCA CGA CGA TAA CTT CCA ATC CAG GAG GCC G 941 2TRBV16 CCA GGG TTT TCC CAG TCA CGA CGC TAA GTG CCT CCC AAA TTC ACC C 942 2TRBV18 CCA GGG TTT TCC CAG TCA CGA CGG AAC GAT TTT CTG CTG AAT TTC CCA 943 2TRBV19 CCA GGG TTT TCC CAG TCA CGA CGG TAC AGC GTC TCT CGG GAG AAG A 944 2TRBV20-1 CCA GGG TTT TCC CAG TCA CGA CGG ACA AGT TTC TCA TCA ACC ATG CAA 945 2TRBV24-1 CCA GGG TTT TCC CAG TCA CGA CTG GAT ACA GTG TCT CTC GAC AGG C 946 2TRBV25-1 CCA GGG TTT TCC CAG TCA CGA CCA ACA GTC TCC AGA ATA AGG ACG GA 947 2TRBV27-1 CCA GGG TTT TCC CAG TCA CGA CTA CAA AGT CTC TCG AAA AGA GAA GAG GA 948 2TRBV28 CCA GGG TTT TCC CAG TCA CGA CGG GGT ACA GTG TCT CTA GAG AGA 949 2TRBV29 CCA GGG TTT TCC CAG TCA CGA CGT TTC CCA TCA GCC GCC CAA ACC TA 950 2TRBV30 CCA GGG TTT TCC CAG TCA CGA CCA GAC CCC AGG ACC GGC AGT TCA T 951 pCα2 CAG ACA GAC TTG TCA CTG GAT TTA G 952 pCβ2 CTT TTG GGT GTG GGA GAT CTC TG 953 3rd round of pF3 CCT ACA CGA CGC TCT TCC GAT CTN nested PCR NGC AGA GAT AAG CCC AGG GTT TTC CCA GTC ACG AC (SEQ ID NO: 954) pCα3 CTG CTG AAC CGC TCT TCC GAT CTN NGT TCA GTC ACT GGA TTT AGA GTC TCT CAG (SEQ ID NO: 955) pCβ3 CTG CTG AAC CGC TCT TCC GAT CTN NGT TCA GAG ATC TCT GCT TCT GAT GGC TC (SEQ ID NO: 956)
[0245] Once TCRα/β chain variable regions were determined, primers specific for leader sequence and constant region were used to amplify full-length TCRα/β chains. The PCR to amplify the intact TCRα- or β-chain and the following overlap PCR were performed with KOD Hot Start DNA polymerase (Millipore Sigma) according to the manufacturer's instructions. One μl of cDNA was added as a template in a 25 μl reaction together with the primers: pLα-P2A (based on TCR genotype), pRCα for amplifying full-length a chain or pLβ (based on TCR genotype), and pRCβ-P2A for amplifying full-length β chain. The PCR was performed for 20 cycles (94° C. for 30s, 64° C. for 1 min, 70° C. for 1 min). 2 μl of TCRα and 2 μl of TCRβ chain PCR products were mixed in 25 μl of overlap PCR reaction. The primers pLβ and pRCα were also added. The overlap PCR was performed for 30 cycles (94° C. for 30 s, 60° C. for 1 min, and 70° C. for 90s). The overlap PCR products (˜1.5 kb) were gel-purified with DNA Recovery Kit (Zymo Research) and cloned into pMSGV vector with NEBuilder® HiFi DNA Assembly Master Mix (New England Biolabs).
(17) Retroviral Transduction of TCR in Naïve T Cells
[0246] The pMSGV DNA encoding full-length TCR was transfected into Phoenix-Ampho cells (ATCC) to produce retroviruses. Briefly, 6-well tissue culture plate was coated with Poly-L-lysine (50 μg/ml in PBS) for 10 min at 37° C. and rinsed with PBS once. Phoenix-Ampho cells were seeded onto the coated plate at 1.5×10.sup.6 per well and cultured in DMEM with 10% (v/v) fetal bovine serum containing 1% penicillin-streptomycin at 37° C., 5% CO.sub.2. Four h later, 4 μg of pMSGV plasmids was transfected into each well with 10 μl of Lipofectamine 2000. The culture medium was changed 6 h after the transfection and the retroviral supernatant was collected at 48 or 60 h after the transfection.
[0247] For preparation of activated naïve T cells, 24-well tissue culture plates were coated with 1 ml of PBS containing 0.5 μg/ml OKT3 per well and incubated at 4° C. overnight. After the plate was rinsed with PBS once, PBMCs from normal donors or bead-purified T cells were added in (1.0×10.sup.6/well) and cultured in TCM containing 300 IU/ml IL-2 at 37° C., 5% CO.sub.2 for 48 h. The OKT3-stimulated T cells were ready for transduction after washed twice with TCM. 230. For retroviral transduction of activated T cells, 24-well non-tissue culture plates (Corning) were coated with 0.5 ml of human recombinant RetroNectin per well (Takara, 10m/ml in PBS) 1 day prior to the transduction and incubated at 4° C. overnight. After RetroNectin was removed, the plate was blocked with 2% (w/v) BSA in PBS at room temperature for 30 min. The plate was then washed twice with PBS containing 2.5% (v/v) HEPES. Retroviral supernatant of Phoenix-Ampho cells was added to the RetroNectin-coated plates (2.5 ml per well). The plate was centrifuged at 2000×g, 32° C. for 2 h. After the viral supernatant was discarded, OKT3-stimulated T cells were added to the plate (5×10.sup.5/well) and cultured in TCM containing 150 IU/ml IL-2 at 37° C., 5% CO.sub.2 overnight. A second round of transduction was performed with fresh retroviral supernatant and RetroNectin-coated plates, as described above. The transduced T cells were ready to assay with tumor cells or 293APCs 72 h after transduction.
(18) Statistics
[0248] Comparisons between groups were calculated by Student's t-test using GraphPad Prism (GraphPad Software). P<0.05 was considered as statistically significant.
F. REFERENCES
[0249] Balachandran, V. P., Luksza, M., Zhao, J. N., Makarov, V., Moral, J. A., Remark, R., Herbst, B., Askan, G., Bhanot, U., Senbabaoglu, Y., et al. (2017). Identification of unique neoantigen qualities in long-term survivors of pancreatic cancer. Nature 551, 512-516. [0250] Birnbaum, M. E., Mendoza, J. L., Sethi, D. K., Dong, S., Glanville, J., Dobbins, J., Ozkan, E., Davis, M. M., Wucherpfennig, K. W., and Garcia, K. C. (2014). Deconstructing the peptide-MHC specificity of T cell recognition. Cell 157, 1073-1087. [0251] Bolotin, D. A., Poslaysky, S., Davydov, A. N., Frenkel, F. E., Fanchi, L., Zolotareva, O. I., Hemmers, S., Putintseva, E. V., Obraztsova, A. S., Shugay, M., et al. (2017). Antigen receptor repertoire profiling from RNA-seq data. Nature biotechnology 35, 908-911. [0252] Braumuller, H. et al. T-helper-1-cell cytokines drive cancer into senescence. Nature 494, 361-365, doi:10.1038/nature11824 (2013). [0253] Carpenter, A. C., Wohlfert, E., Chopp, L. B., Vacchio, M. S., Nie, J., Zhao, Y., Shetty, J., Xiao, Q., Deng, C., Tran, B., et al. (2017). Control of Regulatory T Cell Differentiation by the Transcription Factors Thpok and LRF. Journal of immunology 199, 1716-1728. [0254] Carreno, B. M., Magrini, V., Becker-Hapak, M., Kaabinejadian, S., Hundal, J., Petti, A. A., Ly, A., Lie, W. R., Hildebrand, W. H., Mardis, E. R., et al. (2015). Cancer immunotherapy. A dendritic cell vaccine increases the breadth and diversity of melanoma neoantigen-specific T cells. Science 348, 803-808. [0255] Chen, L., and Han, X. (2015). Anti-PD-1/PD-L1 therapy of human cancer: past, present, and future. The Journal of clinical investigation 125, 3384-3391. [0256] Chen, Y. T., Scanlan, M. J., Sahin, U., Tureci, O., Gure, A. O., Tsang, S., Williamson, B., Stockert, E., Pfreundschuh, M., and Old, L. J. (1997). A testicular antigen aberrantly expressed in human cancers detected by autologous antibody screening. ProcNatlAcadSciUSA 94, 1914-1918. [0257] Chowell, D., Morris, L. G. T., Grigg, C. M., Weber, J. K., Samstein, R. M., Makarov, V., Kuo, F., Kendall, S. M., Requena, D., Riaz, N., et al. (2018). Patient HLA class I genotype influences cancer response to checkpoint blockade immunotherapy. Science 359, 582-587. [0258] Cibulskis, K., Lawrence, M. S., Carter, S. L., Sivachenko, A., Jaffe, D., Sougnez, C., Gabriel, S., Meyerson, M., Lander, E. S., and Getz, G. (2013). Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nature biotechnology 31, 213-219. [0259] Cole, D. K. et al. Hotspot autoimmune T cell receptor binding underlies pathogen and insulin peptide cross-reactivity. J. Clin. Invest. 126, 2191-2204, doi:10.1172/JCI85679 (2016). [0260] Cole, D. K., Bulek, A. M., Dolton, G., Schauenberg, A. J., Szomolay, B., Rittase, W., Trimby, A., Jothikumar, P., Fuller, A., Skowera, A., et al. (2016). Hotspot autoimmune T cell receptor binding underlies pathogen and insulin peptide cross-reactivity. The Journal of clinical investigation 126, 3626. [0261] De la Herran-Arita, A. K. et al. CD4+ T cell autoimmunity to hypocretin/orexin and cross-reactivity to a 2009 H.sub.1N1 influenza A epitope in narcolepsy. Sci Transl Med 5, 216ra176, doi:10.1126/scitranslmed.3007762 (2013). [0262] Gubin, M. M., Zhang, X., Schuster, H., Caron, E., Ward, J. P., Noguchi, T., Ivanova, Y., Hundal, J., Arthur, C. D., Krebber, W. J., et al. (2014). Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature 515, 577-581. [0263] Han, A., Glanville, J., Hansmann, L., and Davis, M. M. (2014). Linking T-cell receptor sequence to functional phenotype at the single-cell level. Nature biotechnology 32, 684-692. [0264] He, X., Park, K., and Kappes, D. J. (2010). The role of ThPOK in control of CD4/CD8 lineage commitment. Annual review of immunology 28, 295-320. [0265] Hellmann, M. D., Callahan, M. K., Awad, M. M., Calvo, E., Ascierto, P. A., Atmaca, A., Rizvi, N. A., Hirsch, F. R., Selvaggi, G., Szustakowski, J. D., et al. (2018). Tumor Mutational Burden and Efficacy of Nivolumab Monotherapy and in Combination with Ipilimumab in Small-Cell Lung Cancer. Cancer cell 33, 853-861 e854. [0266] Hung, K. et al. The Central Role of CD4(+) T Cells in the Antitumor Immune Response. J. Exp. Med. 188, 2357-2368 (1998). [0267] Kang, T. W. et al. Senescence surveillance of pre-malignant hepatocytes limits liver cancer development. Nature 479, 547-551, doi:10.1038/nature10599 (2011). [0268] Kastenmuller, W., Gasteiger, G., Gronau, J. H., Baier, R., Ljapoci, R., Busch, D. H., and Drexler, I. (2007). Cross-competition of CD8+ T cells shapes the immunodominance hierarchy during boost vaccination. The Journal of experimental medicine 204, 2187-2198. [0269] Kreiter, S., Vormehr, M., van de Roemer, N., Diken, M., Lower, M., Diekmann, J., Boegel, S., Schrors, B., Vascotto, F., Castle, J. C., et al. (2015). Mutant MHC class II epitopes drive therapeutic immune responses to cancer. Nature 520, 692-696. [0270] La Gruta, N. L., Gras, S., Daley, S. R., Thomas, P. G., and Rossjohn, J. (2018). Understanding the drivers of MHC restriction of T cell receptors. Nature reviews Immunology 18, 467-478. [0271] Laumont, C. M., Vincent, K., Hesnard, L., Audemard, E., Bonneil, E., Laverdure, J. P., Gendron, P., Courcelles, M., Hardy, M. P., Cote, C., et al. (2018). Noncoding regions are the main source of targetable tumor-specific antigens. Science translational medicine 10. [0272] Le, D. T., Durham, J. N., Smith, K. N., Wang, H., Bartlett, B. R., Aulakh, L. K., Lu, S., Kemberling, H., Wilt, C., Luber, B. S., et al. (2017). Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science 357, 409-413. [0273] Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754-1760. [0274] Linnemann, C., van Buuren, M. M., Bies, L., Verdegaal, E. M., Schotte, R., Calis, J. J., Behjati, S., Velds, A., Hilkmann, H., Atmioui, D. E., et al. (2015). High-throughput epitope discovery reveals frequent recognition of neo-antigens by CD4+ T cells in human melanoma. Nat Med 21, 81-85. [0275] Lu, Y. C., Yao, X., Crystal, J. S., Li, Y. F., El-Gamil, M., Gross, C., Davis, L., Dudley, M. E., Yang, J. C., Samuels, Y., et al. (2014). Efficient identification of mutated cancer antigens recognized by T cells associated with durable tumor regressions. Clinical cancer research: an official journal of the American Association for Cancer Research 20, 3401-3410. [0276] Luckey, M. A. et al. The transcription factor ThPOK suppresses Runx3 and imposes CD4(+) lineage fate by inducing the SOCS suppressors of cytokine signaling. Nature immunology 15, 638-645, doi:10.1038/ni.2917 (2014). [0277] Luksza, M., Riaz, N., Makarov, V., Balachandran, V. P., Hellmann, M. D., Solovyov, A., Rizvi, N. A., Merghoub, T., Levine, A. J., Chan, T. A., et al. (2017). A neoantigen fitness model predicts tumour response to checkpoint blockade immunotherapy. Nature 551, 517-520. [0278] Malandro, N., Budhu, S., Kuhn, N. F., Liu, C., Murphy, J. T., Cortez, C., Zhong, H., Yang, X., Rizzuto, G., Altan-Bonnet, G., et al. (2016). Clonal Abundance of Tumor-Specific CD4(+) T Cells Potentiates Efficacy and Alters Susceptibility to Exhaustion. Immunity 44, 179-193. [0279] Manguso, R. T., Pope, H. W., Zimmer, M. D., Brown, F. D., Yates, K. B., Miller, B. C., Collins, N. B., Bi, K., LaFleur, M. W., Juneja, V. R., et al. (2017). In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target. Nature 547, 413-418. [0280] Marty Pyke, R., Thompson, W. K., Salem, R. M., Font-Burgada, J., Zanetti, M., and Carter, H. (2018). Evolutionary Pressure against MHC Class II Binding Cancer Mutations. Cell 175, 1991. [0281] Marty, R., Kaabinejadian, S., Rossell, D., Slifker, M. J., van de Haar, J., Engin, H. B., de Prisco, N., Ideker, T., Hildebrand, W. H., Font-Burgada, J., et al. (2017). MHC-I Genotype Restricts the Oncogenic Mutational Landscape. Cell 171, 1272-1283 e1215. [0282] McGranahan, N., Rosenthal, R., Hiley, C. T., Rowan, A. J., Watkins, T. B. K., Wilson, G. A., Birkbak, N. J., Veeriah, S., Van Loo, P., Herrero, J., et al. (2017). Allele-Specific HLA Loss and Immune Escape in Lung Cancer Evolution. Cell 171, 1259-1271 e1211. [0283] McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., et al. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome research 20, 1297-1303. [0284] Mucida, D. et al. Transcriptional reprogramming of mature CD4(+) helper T cells generates distinct MHC class II-restricted cytotoxic T lymphocytes. Nature immunology 14, 281-289, doi:10.1038/ni.2523 (2013). [0285] Nelson, R. W., Beisang, D., Tubo, N.J., Dileepan, T., Wiesner, D. L., Nielsen, K., Wuthrich, M., Klein, B. S., Kotov, D. I., Spanier, J. A., et al. (2015). T cell receptor cross-reactivity between similar foreign and self peptides influences naïve cell population size and autoimmunity. Immunity 42, 95-107. [0286] Pan, D., Kobayashi, A., Jiang, P., Ferrari de Andrade, L., Tay, R. E., Luoma, A. M., Tsoucas, D., Qiu, X., Lim, K., Rao, P., et al. (2018). A major chromatin regulator determines resistance of tumor cells to T cell-mediated killing. Science 359, 770-775. [0287] Patel, S. J., Sanjana, N. E., Kishton, R. J., Eidizadeh, A., Vodnala, S. K., Cam, M., Gartner, J. J., Jia, L., Steinberg, S. M., Yamamoto, T. N., et al. (2017). Identification of essential genes for cancer immunotherapy. Nature 548, 537-542. [0288] Porter, D. L., Levine, B. L., Kalos, M., Bagg, A., and June, C. H. (2011). Chimeric antigen receptor-modified T cells in chronic lymphoid leukemia. The New England journal of medicine 365, 725-733. [0289] Rizvi, N. A., Hellmann, M. D., Snyder, A., Kvistborg, P., Makarov, V., Havel, J. J., Lee, W., Yuan, J., Wong, P., Ho, T. S., et al. (2015a). Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124-128. [0290] Rizvi, N. A., Hellmann, M. D., Snyder, A., Kvistborg, P., Makarov, V., Havel, J. J., Lee, W., Yuan, J., Wong, P., Ho, T. S., et al. (2015b). Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124-128. [0291] Robbins, P. F., Lu, Y. C., El-Gamil, M., Li, Y. F., Gross, C., Gartner, J., Lin, J. C., Teer, J. K., Cliften, P., Tycksen, E., et al. (2013). Mining exomic sequencing data to identify mutated antigens recognized by adoptively transferred tumor-reactive T cells. Nat Med 19, 747-752. [0292] Robins, H. S., Campregher, P. V., Srivastava, S. K., Wacher, A., Turtle, C. J., Kahsai, O., Riddell, S. R., Warren, E. H., and Carlson, C. S. (2009). Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells. Blood 114, 4099-4107. [0293] Rosenberg, S. A., and Restifo, N. P. (2015). Adoptive cell transfer as personalized immunotherapy for human cancer. Science 348, 62-68. [0294] Rosenberg, S. A., Tran, E., and Robbins, P. F. (2017). T-Cell Transfer Therapy Targeting Mutant KRAS. The New England journal of medicine 376, ell. [0295] Schumacher, T. N., and Schreiber, R. D. (2015). Neoantigens in cancer immunotherapy. Science 348, 69-74. [0296] Schumacher, T. N., Scheper, W., and Kvistborg, P. (2018). Cancer Neoantigens. Annu Rev Immunol. Sercarz, E. E., Lehmann, P. V., Ametani, A., Benichou, G., Miller, A., and Moudgil, K. (1993). Dominance and crypticity of T cell antigenic determinants. Annual review of immunology 11, 729-766.
[0297] Sharma, P., and Allison, J. P. (2015). Immune checkpoint targeting in cancer therapy: toward combination strategies with curative potential. Cell 161, 205-214. [0298] Shastri, N., Schwab, S., and Serwold, T. (2002). Producing nature's gene-chips: the generation of peptides for display by MHC class I molecules. Annu Rev Immunol 20, 463-493. [0299] Shedlock, D. J. & Shen, H. Requirement for CD4 T cell help in generating functional CD8 T cell memory. Science 300, 337-339 (2003). [0300] Starck, S. R. et al. Translation from the 5′ untranslated region shapes the integrated stress response. Science 351, aad3867, doi:10.1126/science.aad3867 (2016). [0301] Stronen, E., Toebes, M., Kelderman, S., van Buuren, M. M., Yang, W., van Rooij, N., Donia, M., Boschen, M. L., Lund-Johansen, F., Olweus, J., et al. (2016). Targeting of cancer neoantigens with donor-derived T cell receptor repertoires. Science 352, 1337-1341. [0302] Sun, J. C. & Bevan, M. J. Defective CD8 T cell memory following acute infection without CD4 T cell help. Science 300, 339-342 (2003). [0303] Topalian, S. L., Drake, C. G., and Pardoll, D. M. (2015). Immune checkpoint blockade: a common denominator approach to cancer therapy. Cancer cell 27, 450-461. [0304] Tran, E., Ahmadzadeh, M., Lu, Y. C., Gros, A., Turcotte, S., Robbins, P. F., Gartner, J. J., Zheng, Z., Li, Y. F., Ray, S., et al. (2015). Immunogenicity of somatic mutations in human gastrointestinal cancers. Science 350, 1387-1390. [0305] Tran, E., Turcotte, S., Gros, A., Robbins, P. F., Lu, Y. C., Dudley, M. E., Wunderlich, J. R., Somerville, R. P., Hogan, K., Hinrichs, C. S., et al. (2014). Cancer immunotherapy based on mutation-specific CD4+ T cells in a patient with epithelial cancer. Science 344, 641-645. [0306] Vacchio, M. S. et al. A ThPOK-LRF transcriptional node maintains the integrity and effector potential of post-thymic CD4+ T cells. Nature immunology 15, 947-956, doi:10.1038/ni.2960 (2014). [0307] Wang, H. Y., Lee, D. A., Peng, G., Guo, Z., Li, Y., Kiniwa, Y., Shevach, E. M., and Wang, R. F. (2004). Tumor-specific human CD4+ regulatory T cells and their ligands: implications for immunotherapy. Immunity 20, 107-118. [0308] Wang, H. Y., Peng, G., Guo, Z., Shevach, E. M., and Wang, R.-F. (2005). Recognition of a new ARTC1 peptide ligand uniquely expressed in tumor cells by antigen-specific CD4+ gegulatory T cells. J Immunol 174, 2661-2670. [0309] Wang, H. Y., Zhou, J., Zhu, K., Riker, A. I., Marincola, F. M., and Wang, R. F. (2002). Identification of a mutated fibronectin as a tumor antigen recognized by CD4+ T cells: its role in extracellular matrix formation and tumor metastasis. The Journal of experimental medicine 195, 1397-1406. [0310] Wang, R.-F. The role of MHC class II-restricted tumor antigens and CD4+ T cells in antitumor immunity. Trends in Immunology 22, 269-276 (2001). [0311] Wang, R. F., and Wang, H. Y. (2017). Immune targets and neoantigens for cancer immunotherapy and precision medicine. Cell research 27, 11-37. [0312] Wang, R.-F., Johnston, S. L., Zeng, G., Schwartzentruber, D. J., and Rosenberg, S. A. (1998). A breast and melanoma-shared tumor antigen: T cell responses to antigenic peptides translated from different open reading frames. J Immunol 161, 3596-3606. [0313] Wang, R. F., Parkhurst, M. R., Kawakami, Y., Robbins, P. F., and Rosenberg, S. A. (1996). Utilization of an alternative open reading frame of a normal gene in generating a novel human cancer antigen. J Exp Med 183, 1131-1140. [0314] Wang, R. F., Wang, X., and Rosenberg, S. A. (1999b). Identification of a novel major histocompatibility complex class II-restricted tumor antigen resulting from a chromosomal rearrangement recognized by CD4(+) T cells. J Exp Med 189, 1659-1668. [0315] Wang, R. F., Wang, X., Atwood, A. C., Topalian, S. L., and Rosenberg, S. A. (1999). Cloning genes encoding MHC class II-restricted antigens: mutated CDCl.sub.27 as a tumor antigen. Science 284, 1351-1354. [0316] Zacharakis, N., Chinnasamy, H., Black, M., Xu, H., Lu, Y. C., Zheng, Z., Pasetto, A., Langhan, M., Shelton, T., Prickett, T., et al. (2018). Immune recognition of somatic mutations leading to complete durable regression in metastatic breast cancer. Nature medicine 24, 724-730.