SYNTHETIC PATHWAY ACTIVATORS

20250270288 ยท 2025-08-28

    Inventors

    Cpc classification

    International classification

    Abstract

    Provided herein are novel synthetic pathway activators comprising multimerization regions, transmembrane domains, and intracellular signaling domains.

    Claims

    1. A nucleic acid encoding a synthetic pathway activator (SPA) peptide comprising a chimeric polypeptide comprising: a. optionally, an extracellular domain; b. a lipid anchor or a transmembrane domain; c. an intracellular signaling domain; and d. a multimerization region.

    2. The nucleic acid of claim 1, wherein the intracellular signaling domain comprises an interleukin receptor intracellular signaling domain or functional fragments thereof.

    3. The nucleic acid of claim 2, wherein the intracellular signaling domain comprises one ore more of a type I cytokine receptor superfamily box1 peptide motif comprising IWPNVDP (SEQ ID NO: 106), a type I cytokine receptor superfamily box2 peptide motif comprising VSVVEIEANDKKP (SEQ ID NO: 107), and/or a tyrosine phosphorylation motif comprising YXXQ or YXPQ.

    4. The nucleic acid of claim 2, wherein the intracellular signaling domain comprises a sequence at least about 80%, 85%, 90%, 95%, or 99% identity to a sequence as set forth in SEQ ID NO: 60, wherein the intracellular signaling domain comprises a type I cytokine receptor superfamily box2 peptide motif comprising VSVVEIEANDKKP (SEQ ID NO: 107), and a tyrosine phosphorylation motif comprising YXPQ.

    5. The nucleic acid of claim 2, wherein the intracellular signaling domain comprises the sequence as set forth in SEQ ID NO: 60.

    6. The nucleic acid of claim 1, wherein the extracellular domain comprises a CD34 epitope, the multimerization region comprises a multimerization region comprising an unpaired cysteine residue, the lipid anchor or transmembrane domain comprises a gp130 transmembrane domain, and the intracellular signaling domain comprises a gp130 intracellular signaling domain, optionally wherein the extracellular domain comprises one or more of a CD34 epitope, a CD34 ectodomain, a BCR ectodomain, a thrombopoietin receptor (TpoR) ectodomain, or an erythropoietin receptor (EpoR) ectodomain.

    7. The nucleic acid of claim 1, wherein the SPA peptide comprises a sequence as set forth in SEQ ID NO: 81.

    8. The nucleic acid of claim 1, wherein the SPA peptide comprises a sequence selected from the sequences set forth in SEQ ID NOs: 1-58 or 63-104.

    9. A synthetic pathway activator (SPA) peptide comprising a chimeric polypeptide comprising: a. optionally, an extracellular domain; b. a lipid anchor or a transmembrane domain; c. an intracellular signaling domain; and d. a multimerization region.

    10. The SPA peptide of claim 9, wherein the extracellular domain comprises a CD34 epitope, the multimerization region comprises a multimerization region comprising an unpaired cysteine residue, the lipid anchor or transmembrane domain comprises a gp130 transmembrane domain, and the intracellular signaling domain comprises a gp130 intracellular signaling domain.

    11. The SPA peptide of claim 9, comprising a sequence selected from the sequences set forth in SEQ ID NOs: 1-58 or 63-104.

    12. The SPA peptide of claim 9 comprising a sequence as set forth in SEQ ID NO: 81.

    13. A multimer comprising the SPA peptide of claim 9.

    14. A multimer comprising the SPA peptide of claim 11.

    15. A multimer comprising the SPA peptide of claim 12.

    16. A vector comprising the nucleic acid of claim 1.

    17. A vector comprising the nucleic acid of claim 7.

    18. A vector comprising the nucleic acid of claim 8.

    19. A system comprising: a. optionally, a nucleic acid encoding a first chimeric polypeptide comprising a priming receptor; b. a nucleic acid encoding a second chimeric polypeptide, optionally comprising a chimeric antigen receptor (CAR); and c. the nucleic acid of claim 1.

    20. A cell or population of cells comprising the nucleic acid of claim 1.

    21. A cell or population of cells comprising the nucleic acid of claim 7.

    22. A cell or population of cells comprising the nucleic acid of claim 8.

    23. A pharmaceutical composition comprising the cell or population of cells of 20, and a pharmaceutically acceptable excipient.

    24. A pharmaceutical composition comprising the cell or population of cells of 21, and a pharmaceutically acceptable excipient.

    25. A pharmaceutical composition comprising the cell or population of cells of 22, and a pharmaceutically acceptable excipient.

    26. A method of engineering a cell, comprising contacting a nucleic acid with the cell, wherein the nucleic acid encodes a synthetic pathway activator (SPA) peptide comprising a chimeric polypeptide comprising: a. optionally, an extracellular domain; b. a lipid anchor or a transmembrane domain; c. an intracellular signaling domain; and d. a multimerization region.

    27. A method of treating a disease in a subject or inhibiting a target cell in a subject, comprising administering the cell or population of cells of claim 20 to the subject.

    28. A method of modulating a target cell comprising contacting the target cell with a cell or population of cells, wherein the contacting modulates the target cell, and wherein the cell or population of cells comprises nucleic acid encoding a synthetic pathway activator (SPA) peptide comprising a chimeric polypeptide comprising: a. optionally, an extracellular domain; b. a lipid anchor or a transmembrane domain; c. an intracellular signaling domain; and d. a multimerization region.

    29. A method of modulating the activity of a cell or an immune cell comprising: a. obtaining a cell or an immune cell comprising the nucleic acid of claim 1; and b. contacting the cell or immune cell with a target cell, wherein the synthetic pathway activator modulates the activity of the cell or immune cell.

    30. A method of making a nucleotide sequence, comprising obtaining the nucleic acid of claim 1; and synthesizing the nucleotide sequence, optionally wherein the synthesizing occurs via replication in a cell.

    Description

    BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

    [0078] These and other features, aspects, and advantages of the present disclosure will become better understood with regard to the following description, and accompanying drawings, where:

    [0079] FIG. 1 provides diagram of the various ICT transgene cassettes expressing Logic Gate 1-5 ICs, shRNA, and SPAs.

    [0080] FIG. 2A shows exemplary synthetic pathway activators and that synthetic pathway activators increase potency and T stem memory phenotype. FIG. 2B shows additional exemplary synthetic pathway activators

    [0081] FIG. 3 shows that all ICT cells constitutively expressed the PrimeR construct.

    [0082] FIG. 4 shows that the ICT cells induced CAR expression when co-cultured with primeR-antigen expressing cell lines.

    [0083] FIG. 5 shows that ICT cells expressing the SPA exhibit approximately two logs higher pSTAT3 expression when compared to the PrimeR-cells lacking a SPA (EGFRt).

    [0084] FIG. 6A shows cytotoxicity against parental K562 cells expressing neither CAR or primeR antigen, FIG. 6B shows cytotoxicity against K562 cells expressing only CAR antigen, FIG. 6C cytotoxicity against K562 cells expressing only primeR antigen. FIG. 6D shows cytotoxicity against K562 cells expressing both primeR antigen and CAR antigen.

    [0085] FIG. 7 shows IFN- production from ICTs expressing Logic Gates 1-5 only in supernatants taken from co-cultures where the target cells expressed both primeR antigen and CAR antigen.

    [0086] FIG. 8A shows that ICTs expressing LG 1-5 ICs demonstrated in vitro cytotoxicity against the endogenous +CAR/+primeR antigen cell line. FIG. 8B shows IFN, TNF, GM-CSF, and IL-2 secretion by ICT cells after co-culture with endogenous +CAR/+primeR antigen cells.

    [0087] FIG. 9 shows that co-culture with HUVEC-primeR antigen cells induced expression of the CAR protein on ICT cells and specific killing of CAR antigen+ cells.

    [0088] FIG. 10A shows the tumor volume post tumor implant in mice treated with ICTs expressing Logic Gates 1-5, RNP or PBS generated from donor 1. FIG. 10B shows the total T cells and expansion of the ICTs on day 12 post inoculation followed by contraction by day 21. FIG. 10C shows total T cells expressing the priming receptor on days 12 and 21. FIG. 10D show the tumor volume post tumor implant in mice treated with ICTs expressing Logic Gates 1-5, RNP or PBS generated from donor 2. FIG. 10E shows the total T cells and expansion of the ICTs on day 12 post inoculation followed by contraction by day 21. FIG. 10F shows total T cells expressing the priming receptor on days 12 and 21.

    [0089] FIG. 11A shows tumor growth inhibition (TGI) in the single positive CAR antigen-only flank. FIG. 11B shows tumor growth inhibition (TGI) in the dual positive primeR antigen/CAR antigen flank.

    [0090] FIG. 12A shows level of pSTAT3 signaling induced by expression of the indicated SPA in a T cell after serum starvation. FIG. 12B shows level of pSTAT1 signaling induced by expression of the indicated SPA in a T cell after serum starvation

    [0091] FIG. 13 provides a pSTAT1 vs pSTAT3 heatmap for the indicated SPAs.

    [0092] FIG. 14 shows the levels of granzyme B (top panel) and IL-10 (bottom panel) induced by the indicated SPA.

    [0093] FIG. 15 shows the tumor cell clearance (top panel) and T cell expansion (bottom panel) of tumor cells expressing the logic gate antigens in a repetitive stimulation assay.

    [0094] FIG. 16 shows the percent of ICT cells expressing that indicated SPA that expressed the indicated cell markers after a repetitive stimulation assay.

    [0095] FIG. 17 shows the tumor volume in vivo after treatment with ICTs expressing the indicated SPA.

    [0096] FIG. 18A shows CD11c RNA and cell surface expression and SPA expression in both CD4+ and CD8+ cells at Day 7 in both tumor and spleen cells. Cells with increased CD11c RNA and cell surface expression correlated with cells expressing the SPA. FIG. 18B shows CD11c (ITGAX) expression in spleen or tumor cells from mice treated with cells expressing a SPA ICT (two donors) as compared to cells that do not express a SPA ICT (one donor) collected on Day 0 and Day 7 post treatment.

    DETAILED DESCRIPTION

    Definitions

    [0097] Terms used in the claims and specification are defined as set forth below unless otherwise specified.

    [0098] As used herein, the term gene refers to the basic unit of heredity, consisting of a segment of DNA arranged along a chromosome, which codes for a specific protein or segment of protein. A gene typically includes a promoter, a 5 untranslated region, one or more coding sequences (exons), optionally introns, and a 3 untranslated region. The gene may further comprise a terminator, enhancers and/or silencers.

    [0099] As used herein, the term locus refers to a specific, fixed physical location on a chromosome where a gene or genetic marker is located.

    [0100] The term safe harbor locus refers to a locus at which genes or genetic elements can be incorporated without disruption to expression or regulation of adjacent genes. These safe harbor loci are also referred to as safe harbor sites (SHS). As used herein, a safe harbor locus refers to an integration site or knock-in site at which a sequence encoding a transgene, as defined herein, can be inserted. In some embodiments the insertion occurs with replacement of a sequence that is located at the integration site. In some embodiments, the insertion occurs without replacement of a sequence at the integration site. Examples of integration sites contemplated are provided in Table D.

    [0101] As used herein, the term insert refers to a nucleotide sequence that is integrated (inserted) at a target locus or safe harbor site. The insert can be used to refer to the genes or genetic elements that are incorporated at the target locus or safe harbor site using, for example, homology-directed repair (HDR) CRISPR/Cas9 genome-editing or other methods for inserting nucleotide sequences into a genomic region known to those of ordinary skill in the art.

    [0102] The term inserting refers to a manipulation of a nucleotide sequence to introduce a non-native sequence. This is done, for example, via the use of restriction enzymes and ligases whereby the DNA sequence of interest, usually encoding the gene of interest, can be incorporated into another nucleic acid molecule by digesting both molecules with appropriate restriction enzymes in order to create compatible overlaps and then using a ligase to join the molecules together. One skilled in the art is very familiar with such manipulations and examples may be found in Sambrook et al. (Sambrook, Fritsch, & Maniatis, Molecular Cloning: A Laboratory Manual, 2.sup.nd ed., Cold Spring Harbor Laboratory, 1989), which is hereby incorporated by reference in its entirety including any drawings, figures and tables.

    [0103] The CRISPR/Cas system refers to a widespread class of bacterial systems for defense against foreign nucleic acid. CRISPR/Cas systems are found in a wide range of cubacterial and archacal organisms. CRISPR/Cas systems include type I, II, and III sub-types. Wild-type type II CRISPR/Cas systems utilize an RNA-mediated nuclease, Cas9 in complex with guide and activating RNA to recognize and cleave foreign nucleic acid. Guide RNAs having the activity of both a guide RNA and an activating RNA are also known in the art. In some cases, such dual activity guide RNAs are referred to as a small guide RNA (sgRNA).

    [0104] Cas9 homologs are found in a wide variety of cubacteria, including, but not limited to bacteria of the following taxonomic groups: Actinobacteria, Aquificae, Bacteroidetes-Chlorobi, Chlamydiae-Verrucomicrobia, Chlroflexi, Cyanobacteria, Firmicutes, Proteobacteria, Spirochaetes, and Thermotogae. An exemplary Cas9 protein is the Streptococcus pyogenes Cas9 protein. Additional Cas9 proteins and homologs thereof are described in, e.g., Chylinksi, et al., RNA Biol. 2013 May 1; 10 (5): 726-737; Nat. Rev. Microbiol. 2011 June; 9 (6): 467-477; Hou, et al., Proc Natl Acad Sci USA. 2013 Sep. 24; 110 (39): 15644-9; Sampson et al., Nature. 2013 May 9; 497 (7448): 254-7; and Jinek, et al., Science. 2012 Aug. 17; 337 (6096): 816-21. The Cas9 nuclease domain can be optimized for efficient activity or enhanced stability in the host cell.

    [0105] As used herein, the term Cas9 refers to an RNA-mediated nuclease (e.g., of bacterial or archeal origin, or derived therefrom). Exemplary RNA-mediated nucleases include the foregoing Cas9 proteins and homologs thereof, and include but are not limited to, CPF1 (See, e.g., Zetsche et al., Cell, Volume 163, Issue 3, p759-771, 22 Oct. 2015). Similarly, as used herein, the term Cas9 ribonucleoprotein complex and the like refers to a complex between the Cas9 protein, and a crRNA (e.g., guide RNA or small guide RNA), the Cas9 protein and a trans-activating crRNA (tracrRNA), the Cas9 protein and a small guide RNA, or a combination thereof (e.g., a complex containing the Cas9 protein, a tracrRNA, and a crRNA guide RNA).

    [0106] As used herein, the phrase immune cell is inclusive of all cell types that can give rise to immune cells, including hematopoietic cells such hematopoietic stem cells, pluripotent stem cells, and induced pluripotent stem cells (iPSCs). In some embodiments, the immune cell is a B cell, macrophage, a natural killer (NK) cell, an induced pluripotent stem cell (iPSC), a human pluripotent stem cell (HSPC), a T cell or a T cell progenitor cell, or dendritic cell. In some embodiments, the cell is an innate immune cell.

    [0107] As used herein, the term primary in the context of a primary cell or primary stem cell refers to a cell that has not been transformed or immortalized. Such primary cells can be cultured, sub-cultured, or passaged a limited number of times (e.g., cultured 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 times). In some cases, the primary cells are adapted to in vitro culture conditions. In some cases, the primary cells are isolated from an organism, system, organ, or tissue, optionally sorted, and utilized, e.g., directly without culturing or sub-culturing. In some cases, the primary cells are stimulated, activated, or differentiated. For example, primary T cells can be activated by contact with (e.g., culturing in the presence of) CD3, CD28 agonists, IL-2, IFN-, or a combination thereof.

    [0108] As used herein, the terms T lymphocyte and T cell are used interchangeably and refer to cells that have completed maturation in the thymus, and identify certain foreign antigens in the body. The terms also refer to the major leukocyte types that have various roles in the immune system, including activation and deactivation of other immune cells. The T cell can be any T cell such as a cultured T cell, e.g., a primary T cell, or a T cell derived from a cultured T cell line, e.g., a Jurkat, SupTI, etc., or a T cell obtained from a mammal. T cells include, but are not limited to, nave T cells, stimulated T cells, primary T cells (e.g., uncultured), cultured T cells, immortalized T cells, helper T cells, cytotoxic T cells, memory T cells, regulatory T cells, natural killer T cells, combinations thereof, or sub-populations thereof. The T cell can be a CD3+ cell. T cells can be CD4.sup.+, CD8.sup.+, or CD4.sup.+ and CD8.sup.+. The T cell can be any type of T cell, CD4+/CD8+ double positive T cells, CD4+ helper T cells (e.g. Th1 and Th2 cells), CD8+ T cells (e.g. cytotoxic T cells), peripheral Including but not limited to blood mononuclear cells (PBMC), peripheral blood leukocytes (PBL), tumor infiltrating lymphocytes (TIL), memory T cells, nave T cells, T cells, regulatory T cells, T cells, etc. It can be any T cell at any stage of development. Additional types of helper T cells include Th3 (Treg) cells, Th17 cells, Th9 cells, or Tfh cells. Additional types of memory T cells include cells such as central memory T cells (Tem cells), effector memory T cells (Tem cells and TEMRA cells). A T cell can also refer to a genetically modified T cell, such as a T cell that has been modified to express a T cell receptor (TCR) or a chimeric antigen receptor (CAR). T cells can also be differentiated from stem cells or progenitor cells.

    [0109] CD4+ T cells refers to a subset of T cells that express CD4 on their surface and are associated with a cellular immune response. CD4+ T cells are characterized by a post-stimulation secretion profile that can include secretion of cytokines such as IFN-, TNF-, IL-2, IL-4 and IL-10. CD4 is a 55 kD glycoprotein originally defined as a differentiation antigen on T lymphocytes, but was also found on other cells including monocytes/macrophages. The CD4 antigen is a member of the immunoglobulin superfamily and has been implicated as an associative recognition element in MHC (major histocompatibility complex) class II restricted immune responses. On T lymphocytes, the CD4 antigen defines a helper/inducer subset.

    [0110] CD8+ T cells refers to a subset of T cells that express CD8 on their surface, are MHC class I restricted, and function as cytotoxic T cells. The CD8 molecule is a differentiation antigen present on thymocytes, as well as on cytotoxic and suppressor T lymphocytes. The CD8 antigen is a member of the immunoglobulin superfamily and is an associative recognition element in major histocompatibility complex class I restriction interactions.

    [0111] As used herein, the phrase hematopoietic stem cell refers to a type of stem cell that can give rise to a blood cell. Hematopoietic stem cells can give rise to cells of the myeloid or lymphoid lineages, or a combination thereof. Hematopoietic stem cells are predominantly found in the bone marrow, although they can be isolated from peripheral blood, or a fraction thereof. Various cell surface markers can be used to identify, sort, or purify hematopoietic stem cells. In some cases, hematopoietic stem cells are identified as c-kit.sup.+ and lin.sup.. In some cases, human hematopoietic stem cells are identified as CD34.sup.+, CD59.sup.+, Thy 1/CD90.sup.+, CD38.sup.lo/, C-kit/CD117.sup.+, lin.sup.. In some cases, human hematopoietic stem cells are identified as CD34.sup., CD59.sup.+, Thy1/CD90.sup.+, CD38.sup.lo/, C-kit/CD117.sup.+, lin.sup.. In some cases, human hematopoietic stem cells are identified as CD133.sup.+, CD59.sup.+, Thy1/CD90.sup.+, CD38.sup.lo/, C-kit/CD117.sup.+, lin.sup.. In some cases, mouse hematopoietic stem cells are identified as CD34.sup.lo/, SCA-1.sup.+, Thy1.sup.+/lo, CD38.sup.+, C-kit.sup.+, lin.sup.. In some cases, the hematopoietic stem cells are CD150.sup.+CD48.sup.CD244.sup..

    [0112] As used herein, the phrase hematopoietic cell refers to a cell derived from a hematopoietic stem cell. The hematopoietic cell may be obtained or provided by isolation from an organism, system, organ, or tissue (e.g., blood, or a fraction thereof). Alternatively, an hematopoietic stem cell can be isolated and the hematopoietic cell obtained or provided by differentiating the stem cell. Hematopoietic cells include cells with limited potential to differentiate into further cell types. Such hematopoietic cells include, but are not limited to, multipotent progenitor cells, lineage-restricted progenitor cells, common myeloid progenitor cells, granulocyte-macrophage progenitor cells, or megakaryocyte-erythroid progenitor cells. Hematopoietic cells include cells of the lymphoid and myeloid lineages, such as lymphocytes, erythrocytes, granulocytes, monocytes, and thrombocytes.

    [0113] As used herein, the term construct refers to a complex of molecules, including macromolecules or polynucleotides.

    [0114] As used herein, the term integration refers to the process of stably inserting one or more nucleotides of a construct into the cell genome, i.e., covalently linking to a nucleic acid sequence in the chromosomal DNA of the cell. It may also refer to nucleotide deletions at a site of integration. Where there is a deletion at the insertion site, integration may further include substitution of the endogenous sequence or nucleotide deleted with one or more inserted nucleotides.

    [0115] As used herein, the term exogenous refers to a molecule or activity that has been introduced into a host cell and is not native to that cell. The molecule can be introduced, for example, by introduction of the encoding nucleic acid into host genetic material, such as by integration into a host chromosome, or as non-chromosomal genetic material, such as a plasmid. Thus, the term, when used in connection with expression of an encoding nucleic acid, refers to the introduction of the encoding nucleic acid into a cell in an expressible form. The term endogenous refers to a molecule or activity that is present in a host cell under natural, unedited conditions. Similarly, the term, when used in connection with expression of the encoding nucleic acid, refers to expression of the encoding nucleic acid that is contained within the cell and not introduced exogenously.

    [0116] The term heterologous refers to a nucleic acid or polypeptide sequence or domain which is not native to a flanking sequence, e.g., wherein the heterologous sequence is not found in nature coupled to the nucleic acid or polypeptide sequences occurring at one or both ends.

    [0117] The term homologous refers to a nucleic acid or polypeptide sequence or domain which is native to a flanking sequence, e.g., wherein the homologous sequence is found in nature coupled to the nucleic acid or polypeptide sequences occurring at one or both ends.

    [0118] As used herein, a polynucleotide donor construct refers to a nucleotide sequence (e.g. DNA sequence) that is genetically inserted into a polynucleotide and is exogenous to that polynucleotide. The polynucleotide donor construct is transcribed into RNA and optionally translated into a polypeptide. The polynucleotide donor construct can include prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and synthetic DNA sequences. For example, the polynucleotide donor construct can be a miRNA, shRNA, natural polypeptide (i.e., a naturally occurring polypeptide) or fragment thereof or a variant polypeptide (e.g. a natural polypeptide having less than 100% sequence identity with the natural polypeptide) or fragments thereof.

    [0119] As used herein, the term complementary or complementarity refers to specific base pairing between nucleotides or nucleic acids. Complementary nucleotides are, generally, A and T (or A and U), and G and C. The guide RNAs described herein can comprise sequences, for example, DNA targeting sequence that are perfectly complementary or substantially complementary (e.g., having 1-4 mismatches) to a genomic sequence in a cell.

    [0120] As used herein, the term transgene refers to a polynucleotide that has been transferred naturally, or by any of a number of genetic engineering techniques from one organism to another. It is optionally translated into a polypeptide. As used, transgene can refer to a polynucleotide that encodes a polypeptide.

    [0121] The terms protein, polypeptide, and peptide are used herein interchangeably.

    [0122] As used herein, the term operably linked or operatively linked refers to the binding of a nucleic acid sequence to a single nucleic acid fragment such that one function is affected by the other. For example, if a promoter is capable of affecting the expression of a coding sequence or functional RNA (i.e., the coding sequence or functional RNA is under transcriptional control by the promoter), the promoter is operably linked thereto. Coding sequences can be operably linked to control sequences in both sense and antisense orientation.

    [0123] As used herein, the term developmental cell states refers to, for example, states when the cell is inactive, actively expressing, differentiating, senescent, etc. developmental cell state may also refer to a cell in a precursor state (e.g., a T cell precursor).

    [0124] As used, the term encoding refers to a sequence of nucleic acids which codes for a protein or polypeptide of interest. The nucleic acid sequence may be either a molecule of DNA or RNA. In preferred embodiments, the molecule is a DNA molecule. In other preferred embodiments, the molecule is a RNA molecule. When present as a RNA molecule, it will comprise sequences which direct the ribosomes of the host cell to start translation (e.g., a start codon, ATG) and direct the ribosomes to end translation (e.g., a stop codon). Between the start codon and stop codon is an open reading frame (ORF). Such terms are known to one of ordinary skill in the art.

    [0125] As used herein, the term subject refers to a mammalian subject. Exemplary subjects include humans, monkeys, dogs, cats, mice, rats, cows, horses, camels, goats, rabbits, pigs and sheep. In certain embodiments, the subject is a human. In some embodiments the subject has a disease or condition that can be treated with an engineered cell provided herein or population thereof. In some aspects, the disease or condition is a cancer.

    [0126] As used herein, the term promoter refers to a nucleotide sequence (e.g. DNA sequence) capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. A promoter can be derived from natural genes in its entirety, can be composed of different elements from different promoters found in nature, and/or may comprise synthetic DNA segments. A promoter, as contemplated herein, can be endogenous to the cell of interest or exogenous to the cell of interest. It is appreciated by those skilled in the art that different promoters can induce gene expression in different tissue or cell types, or at different developmental stages, or in response to different environmental conditions. As is known in the art, a promoter can be selected according to the strength of the promoter and/or the conditions under which the promoter is active, e.g., constitutive promoter, strong promoter, weak promoter, inducible/repressible promoter, tissue specific Or developmentally regulated promoters, cell cycle-dependent promoters, and the like.

    [0127] A promoter can be an inducible promoter (e.g., a heat shock promoter, tetracycline-regulated promoter, steroid-regulated promoter, metal-regulated promoter, estrogen receptor-regulated promoter, HNF1 promoter, etc.). The promoter can be a constitutive promoter (e.g., CMV promoter, UBC promoter). In some embodiments, the promoter can be a spatially restricted and/or temporally restricted promoter (e.g., a tissue specific promoter, a cell type specific promoter, etc.). See for example US Publication 20180127786, the disclosure of which is herein incorporated by reference in its entirety.

    [0128] Gene editing, as contemplated herein, may involve a gene (or nucleotide sequence) knock-in or knock-out. As used herein, the term knock-in refers to an addition of a DNA sequence, or fragment thereof into a genome. Such DNA sequences to be knocked-in may include an entire gene or genes, may include regulatory sequences associated with a gene or any portion or fragment of the foregoing. For example, a polynucleotide donor construct encoding a protein may be inserted into the genome of a cell carrying a mutant gene. In some embodiments, a knock-in strategy involves substitution of an existing sequence with the provided sequence, e.g., substitution of a mutant allele with a wild-type copy. On the other hand, the term knock-out refers to the elimination of a gene or the expression of a gene. For example, a gene can be knocked out by either a deletion or an addition of a nucleotide sequence that leads to a disruption of the reading frame. As another example, a gene may be knocked out by replacing a part of the gene with an irrelevant (e.g., non-coding) sequence.

    [0129] As used herein, the term non-homologous end joining or NHEJ refers to a cellular process in which cut or nicked ends of a DNA strand are directly ligated without the need for a homologous template nucleic acid. NHEJ can lead to the addition, the deletion, substitution, or a combination thereof, of one or more nucleotides at the repair site.

    [0130] As used herein, the term homology directed repair or HDR refers to a cellular process in which cut or nicked ends of a DNA strand are repaired by polymerization from a homologous template nucleic acid. Thus, the original sequence is replaced with the sequence of the template. The homologous template nucleic acid can be provided by homologous sequences elsewhere in the genome (sister chromatids, homologous chromosomes, or repeated regions on the same or different chromosomes). Alternatively, an exogenous template nucleic acid can be introduced to obtain a specific HDR-induced change of the sequence at the target site. In this way, specific mutations can be introduced at the cut site.

    [0131] As used herein, a single-stranded DNA template or a double-stranded DNA template refers to a DNA oligonucleotide that can be used by a cell as a template for HDR. Generally, the single-stranded DNA template or a double-stranded DNA template has at least one region of homology to a target site. In some cases, the single-stranded DNA template or double-stranded DNA template has two homologous regions flanking a region that contains a heterologous sequence to be inserted at a target cut site.

    [0132] The terms vector and plasmid are used interchangeably and as used herein refer to polynucleotide vehicles useful to introduce genetic material into a cell. Vectors can be linear or circular. Vectors can integrate into a target genome of a host cell or replicate independently in a host cell. Vectors can comprise, for example, an origin of replication, a multicloning site, and/or a selectable marker. An expression vector typically comprises an expression cassette. Vectors and plasmids include, but are not limited to, integrating vectors, prokaryotic plasmids, eukaryotic plasmids, plant synthetic chromosomes, episomes, cosmids, and artificial chromosomes.

    [0133] As used herein, the phrase introducing in the context of introducing a nucleic acid or a complex comprising a nucleic acid, for example, an RNP-DNA template complex, refers to the translocation of the nucleic acid sequence or the RNP-DNA template complex from outside a cell to inside the cell. In some cases, introducing refers to translocation of the nucleic acid or the complex from outside the cell to inside the nucleus of the cell. Various methods of such translocation are contemplated, including but not limited to, electroporation, contact with nanowires or nanotubes, receptor mediated internalization, translocation via cell penetrating peptides, liposome mediated translocation, and the like.

    [0134] As used herein the term expression cassette is a polynucleotide construct, generated recombinantly or chemically synthesized, comprising regulatory sequences operably linked to a selected polynucleotide to facilitate expression of the selected polynucleotide in a host cell. For example, the regulatory sequences can facilitate transcription of the selected polynucleotide in a host cell, or transcription and translation of the selected polynucleotide in a host cell. An expression cassette can, for example, be integrated in the genome of a host cell or be present in an expression vector.

    [0135] As used herein, the phrase subject in need thereof refers to a subject that exhibits and/or is diagnosed with one or more symptoms or signs of a disease or disorder as described herein.

    [0136] A chemotherapeutic agent refers to a chemical compound useful in the treatment of cancer. Chemotherapeutic agents include anti-hormonal agents or endocrine therapeutics which act to regulate, reduce, block, or inhibit the effects of hormones that can promote the growth of cancer.

    [0137] The term composition refers to a mixture that contains, e.g., an engineered cell or protein contemplated herein. In some embodiments, the composition may contain additional components, such as adjuvants, stabilizers, excipients, and the like. The term composition or pharmaceutical composition refers to a preparation which is in such form as to permit the biological activity of an active ingredient contained therein to be effective in treating a subject, and which contains no additional components which are unacceptably toxic to the subject in the amounts provided in the pharmaceutical composition.

    [0138] The term in situ refers to processes that occur in a living cell growing separate from a living organism, e.g., growing in tissue culture.

    [0139] The term in vivo refers to processes that occur in a living organism.

    [0140] As used herein, the term ex vivo generally includes experiments or measurements made in or on living tissue, preferably in an artificial environment outside the organism, preferably with minimal differences from natural conditions.

    [0141] The term mammal as used herein includes both humans and non-humans and include but is not limited to humans, non-human primates, canines, felines, murines, bovines, equines, and porcines.

    [0142] The term percent identity, in the context of two or more nucleic acid or polypeptide sequences, refer to two or more sequences or subsequences that have a specified percentage of nucleotides or amino acid residues that are the same, when compared and aligned for maximum correspondence, as measured using one of the sequence comparison algorithms described below (e.g., BLASTP and BLASTN or other algorithms available to persons of skill) or by visual inspection. Depending on the application, the percent identity can exist over a region of the sequence being compared, e.g., over a functional domain, or, alternatively, exist over the full length of the two sequences to be compared.

    [0143] For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.

    [0144] Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by visual inspection (see generally Ausubel et al., infra).

    [0145] One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (ncbi.nlm.nih.gov/).

    [0146] The term sufficient amount means an amount sufficient to produce a desired effect, e.g., an amount sufficient to modulate protein aggregation in a cell.

    [0147] The term therapeutically effective amount is an amount that is effective to ameliorate a symptom of a disease.

    [0148] The term ameliorating refers to any therapeutically beneficial result in the treatment of a disease state, e.g., a cancer disease state, lessening in the severity or progression, remission, or cure thereof.

    [0149] As used herein, the term effective amount refers to the amount of a compound (e.g., a compositions described herein, cells described herein) sufficient to effect beneficial or desired results. An effective amount can be administered in one or more administrations, applications or dosages and is not intended to be limited to a particular formulation or administration route.

    [0150] As used herein, the term treating includes any effect, e.g., lessening, reducing, modulating, ameliorating or eliminating, that results in the improvement of the condition, disease, disorder, and the like, or ameliorating a symptom thereof.

    [0151] The terms modulate and modulation refer to reducing or inhibiting or, alternatively, activating or increasing, a recited variable.

    [0152] The terms increase and activate refer to an increase of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100-fold, or greater in a recited variable.

    [0153] The terms reduce and inhibit refer to a decrease of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100-fold, or greater in a recited variable.

    [0154] With regard to the binding of an antibody to a target molecule, the terms bind, specific binding, specifically binds to, specific for, selectively binds, and selective for a particular antigen (e.g., a polypeptide target) or an epitope on a particular antigen mean binding that is measurably different from a non-specific or non-selective interaction (e.g., with a non-target molecule). For example, an antibody that selectively binds or specifically binds an antigen is an antigen-binding moiety that binds the antigen with high affinity and does not significantly bind other unrelated antigens. Specific binding can be measured, for example, by measuring binding to a target molecule and comparing it to binding to a non-target molecule. Specific binding can also be determined by competition with a control molecule that mimics the epitope recognized on the target molecule. In that case, specific binding is indicated if the binding of the antibody to the target molecule is competitively inhibited by the control molecule.

    [0155] Affinity refers to the strength of the sum total of non-covalent interactions between a single binding site of a molecule (e.g., an antibody) and its binding partner (e.g., an antigen or epitope). Unless indicated otherwise, as used herein, affinity refers to intrinsic binding affinity, which reflects a 1:1 interaction between members of a binding pair (e.g., antibody and antigen or epitope). The affinity of a molecule X for its partner Y can be represented by the dissociation equilibrium constant (K.sub.D). The kinetic components that contribute to the dissociation equilibrium constant are described in more detail below. Affinity can be measured by common methods known in the art, including, but not limited to, surface plasmon resonance (SPR) technology (e.g., BIACORE) or biolayer interferometry (e.g., FORTEBIO).

    [0156] The term hypervariable region or HVR, as used herein, refers to each of the regions of an antibody variable domain which are hypervariable in sequence and/or form structurally defined loops (hypervariable loops). Generally, native four-chain antibodies comprise six HVRs; three in the VH (H1, H2, H3), and three in the VL (L1, L2, L3). HVRs generally comprise amino acid residues from the hypervariable loops and/or from the complementarity determining regions (CDRs), the latter being of highest sequence variability and/or involved in antigen recognition. With the exception of CDR1 in VH, CDRs generally comprise the amino acid residues that form the hypervariable loops. Hypervariable regions (HVRs) are also referred to as complementarity determining regions (CDRs), and these terms are used herein interchangeably in reference to portions of the variable region that form the antigen-binding regions. This particular region has been described by Kabat et al., U.S. Dept. of Health and Human Services, Sequences of Proteins of Immunological Interest (1983) and by Chothia et al., J Mol Biol 196:901-917 (1987), where the definitions include overlapping or subsets of amino acid residues when compared against each other. Nevertheless, application of either definition to refer to a CDR of an antibody or variants thereof is intended to be within the scope of the term as defined and used herein. The exact residue numbers which encompass a particular CDR will vary depending on the sequence and size of the CDR. Those skilled in the art can routinely determine which residues comprise a particular CDR given the variable region amino acid sequence of the antibody.

    [0157] The amino acid sequence boundaries of a CDR can be determined by one of skill in the art using any of a number of known numbering schemes, including those described by Kabat et al., supra (Kabat numbering scheme); Al-Lazikani et al., 1997, J. Mol. Biol., 273:927-948 (Chothia numbering scheme); MacCallum et al., 1996, J. Mol. Biol. 262:732-745 (Contact numbering scheme); Lefranc et al., Dev. Comp. Immunol., 2003, 27:55-77 (IMGT numbering scheme); and Honegge and Plckthun, J. Mol. Biol., 2001, 309:657-70 (AHo numbering scheme); each of which is incorporated by reference in its entirety.

    [0158] Table A provides the positions of CDR-L1, CDR-L2, CDR-L3, CDR-H1, CDR-H2, and CDR-H3 as identified by the Kabat and Chothia schemes. For CDR-H1, residue numbering is provided using both the Kabat and Chothia numbering schemes.

    [0159] CDRs may be assigned, for example, using antibody numbering software, such as Abnum, available at bioinf.org.uk/abs/abnum/, and described in Abhinandan and Martin, Immunology, 2008, 45:3832-3839, incorporated by reference in its entirety.

    [0160] The EU numbering scheme is generally used when referring to a residue in an antibody heavy chain constant region (e.g., as reported in Kabat et al., supra). Unless stated otherwise, the EU numbering scheme is used to refer to residues in antibody heavy chain constant regions described herein.

    [0161] As used herein, the term single chain refers to a molecule comprising amino acid monomers linearly linked by peptide bonds. In a particular such embodiment, the C-terminus of the Fab light chain is connected to the N-terminus of the Fab heavy chain in the single-chain Fab molecule. As described in more detail herein, an scFv has a variable domain of light chain (VL) connected from its C-terminus to the N-terminal end of a variable domain of heavy chain (VH) by a polypeptide chain. Alternately the scFv comprises of polypeptide chain where in the C-terminal end of the VH is connected to the N-terminal end of VL by a polypeptide chain.

    [0162] The Fab fragment (also referred to as fragment antigen-binding) contains the constant domain (CL) of the light chain and the first constant domain (CH1) of the heavy chain along with the variable domains VL and VH on the light and heavy chains respectively. The variable domains comprise the complementarity determining loops (CDR, also referred to as hypervariable region) that are involved in antigen-binding. Fab fragments differ from Fab fragments by the addition of a few residues at the carboxy terminus of the heavy chain CH1 domain including one or more cysteines from the antibody hinge region.

    [0163] F(ab).sub.2 fragments contain two Fab fragments joined, near the hinge region, by disulfide bonds. F(ab).sub.2 fragments may be generated, for example, by recombinant or synthetic methods or by pepsin digestion of an intact antibody. The F(ab) fragments can be dissociated, for example, by treatment with -mercaptoethanol.

    [0164] Fv fragments comprise a non-covalently-linked dimer of one heavy chain variable domain and one light chain variable domain.

    [0165] The Single-chain Fv or scFv includes the VH and VL domains of an antibody, wherein these domains are present in a single polypeptide chain. In one embodiment, the Fv polypeptide further comprises a polypeptide linker between the VH and VL domains which enables the scFv to form the desired structure for antigen-binding. For a review of scFv see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 113, Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994). HER2 antibody scFv fragments are described in WO93/16185; U.S. Pat. Nos. 5,571,894; and 5,587,458.

    [0166] The term single domain antibody or sdAb refers to a molecule in which one variable domain of an antibody specifically binds to an antigen without the presence of the other variable domain. Single domain antibodies, and fragments thereof, are described in Arabi Ghahroudi et al., FEBS Letters, 1998, 414:521-526 and Muyldermans et al., Trends in Biochem. Sci., 2001, 26:230-245, each of which is incorporated by reference in its entirety. Single domain antibodies are also known as sdAbs or nanobodies. Sdabs are fairly stable and easy to express as fusion partner with the Fc chain of an antibody (Harmsen M M, De Haard HJ (207). Properties, production, and applications of camelid single-domain antibody frag ents. Appl. Microbiol Biotechnol. 77 (1): 13-22).

    [0167] It must be noted that, as used in the specification and the appended claims, the singular forms a, an and the include plural referents unless the context clearly dictates otherwise.

    Synthetic Pathway Activators

    [0168] In various aspects, systems disclosed herein employ one or more synthetic pathway activator (SPA) peptides. CAR-expressing immune cells can be limited by the necessity for in vivo expansion following infusion. To achieve robust expansion, T cells use three signals: antigen-stimulation, co-stimulation, and cytokine-induced stimulation. Activation of CARs is sufficient to induce the first two signals, but cannot recapitulate cytokine signaling. Furthermore, the tumor microenvironment is often immunosuppressive and devoid of pro-inflammatory cytokines. SPA peptides can thus be used to stimulate robust in vivo expansion and enhance desirable properties (i.e., increased survival, persistence, and potency) of T cells, e.g., expressing priming receptors and/or CARs as described herein.

    [0169] In one aspect, provided herein are synthetic pathway activator (SPA) peptides comprising a chimeric polypeptide comprising: a lipid anchor or a transmembrane domain; an intracellular signaling domain; and a multimerization region. In another aspect, provided herein are synthetic pathway activator (SPA) peptides comprising a chimeric polypeptide comprising: an extracellular domain, a lipid anchor or a transmembrane domain; an intracellular signaling domain; and a multimerization region. In some embodiments, a SPA peptide further comprises a CD8-alpha hinge domain. In some embodiments, the SPA peptide is constitutively expressed in a cell. In some embodiments, expression of the SPA peptide in a cell is induced, e.g., by T cell activation. For example, an inducible SPA can be expressed after engagement of a CAR T cell with the CAR cognate ligand on a target cell. In some embodiments, expression of the SPA peptide in a cell is induced by the priming receptor and/or CAR signaling.

    SPA Structure

    [0170] In various embodiments, a SPA peptide mimics activation of interleukin signaling. Interleukin receptors are cytokine receptors that signal through Signal Transducer and Activator of Transcription (STAT) transcription factors (e.g., STAT1, STAT3, and STAT5). Interleukin receptors typically function by dimerization in response to ligand binding. Once dimerized, receptors can bind janus-associated kinases (JAKs) to induce JAK cross-phosphorylation and downstream JAK/STAT signaling. Accordingly, induced receptor agonism or ligand-independent dimerization of receptors can be utilized t Synthetic Pathway Activators induce constitutive receptor activity and thus, constitutive cytokine signaling.

    [0171] In various embodiments, SPA peptides comprise interleukin receptors or functional fragments thereof. In some embodiments, SPAs comprise or are derived from interleukin receptor intracellular signaling domains or functional fragments thereof. In some embodiments, SPA peptides comprise or are derived from interleukin-6 signal transducer (IL6ST) polypeptides or functional fragments thereof. Interleukin-6 signal transducer (IL6ST) is also known as glycoprotein 130 (gp130). A SPA can comprise multimerized SPA peptides, e.g., two SPA peptides that are dimerized, e.g., homodimerized. Multimerization encompasses dimerization, trimerization, tetramerization, or higher order combinations of SPA peptides that interact with each other.

    Intracellular Signaling Domain

    [0172] In some embodiments, the intracellular signaling domain induces phosphorylation of STAT1, STAT3, and/or STAT5. In some embodiments, a functional SPA induces phosphorylation of STAT1, STAT3, and/or STAT5. In some embodiments, the intracellular signaling domain comprises a type I cytokine receptor superfamily box 1 (IWPNVDP (SEQ ID NO: 106)) or box2 (VSVVEIEANDKKP (SEQ ID NO: 107)) peptide motif. In some embodiments, the intracellular signaling domain comprises a tyrosine phosphorylation motif comprising YXXQ or YXPQ. In some embodiments, the intracellular signaling domain comprises one or more minimum STAT binding motif from a STAT1, STAT3 and/or STAT5 protein. In some embodiments, the intracellular signaling domain comprises one or more minimum STAT binding motif from a STAT1 and STAT3 protein. In some embodiments, the intracellular signaling domain comprises one or more minimum STAT binding motif from a STAT1 and STAT5 protein. In some embodiments, the intracellular signaling domain comprises one or more minimum STAT binding motif from a STAT3 and STAT5 protein.

    [0173] In some embodiments, the intracellular signaling domain comprises a polypeptide sequence from an interleukin receptor.

    [0174] In some embodiments, the interleukin receptor comprises a gp130 intracellular signaling domain. In some embodiments, the intracellular signaling domain comprises a polypeptide sequence comprising amino acids 642 to 918 of gp130 (SEQ ID NO: 59).

    [0175] In some embodiments, the interleukin receptor comprises a truncated gp130 intracellular signaling domain. In some embodiments, the SPA peptides comprise truncated gp130 intracellular domains. In some embodiments, the truncated gp130 intracellular signaling domain comprises the truncated gp130 intracellular domain of a sequence selected from the group set forth in SEQ ID NOs: 10-16 and 71-77. In some embodiments, the truncated gp130 intracellular signaling domain comprises a sequence with at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to the gp130 intracellular signaling domain as provide in the sequence set forth in SEQ ID NO: 60.

    [0176] In some embodiments, the truncated gp130 intracellular domain comprises a GP130771-811 fragment. In some embodiments, the truncated gp130 intracellular domain comprises a GP130707-755 fragment. In some embodiments, the truncated gp130 intracellular domain comprises a GP130818-901 fragment. In some embodiments, the truncated gp130 intracellular domain comprises one or more truncation selected from the group consisting of GP130707-755, GP130771-811, and GP130818-901. In some embodiments, the truncated gp130 intracellular signaling domain comprises a deletion of amino acids 771 to 811 of gp130 (SEQ ID NO: 59). In some embodiments, the truncated gp130 intracellular signaling domain comprises a deletion of amino acids 707 to 755 of gp130 (SEQ ID NO: 59). In some embodiments, the truncated gp130 intracellular signaling domain comprises a deletion of amino acids 818 to 901 of gp130 (SEQ ID NO: 59). In some embodiments, the truncated gp130 intracellular signaling domain comprises one or more (e.g., one, two, or three) deletions of amino acids selected from the group consisting of amino acids 707 to 755, 771 to 811, and 818-901 of gp130 (SEQ ID NO: 59). In other embodiments, the gp130 intracellular domain comprises a Y759F mutation (resulting in a SOCS-proof mutant). In some embodiments, the gp130 intracellular signaling domain further comprises a Y759F mutation of gp130 (SEQ ID NO: 59). In other embodiments, the gp130 intracellular domain comprises a a Y759F mutation and a 4771-811 truncation (gp130Y759F771-811) of gp130 (SEQ ID NO: 59). In other embodiments, the gp130 intracellular domain comprises a a Y759F mutation and a 4707-755 truncation (gp130Y759F707-755) of gp130 (SEQ ID NO: 59). In other embodiments, the gp130 intracellular domain comprises a a Y759F mutation and a 4818-901 truncation (gp130Y759F818-901) of gp130 (SEQ ID NO: 59).

    [0177] In some embodiments, the gp130 intracellular signaling domain comprises an amino acid sequence at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence set forth in SEQ ID NO: 60. In some embodiments, the gp130 intracellular signaling domain comprises the amino acid sequence set forth in SEQ ID NO: 60.

    [0178] In some embodiments, a SPA peptide can comprise a ligand agonist (e.g., a cytokine, e.g., an interleukin) that allows constitutive activation of the SPA. In some embodiments, the cytokine receptor and a soluble agonist are expressed simultaneously. In some embodiments, the cytokine receptor and a membrane-bound agonist are expressed simultaneously.

    Lipid Anchors and Transmembrane Domains

    [0179] In various embodiments, SPA peptides are anchored to the cellular membrane via a lipid anchor. In other embodiments, SPA peptides comprise a transmembrane domain. In various embodiments, a SPA peptide comprises a lipid anchor or a transmembrane domain comprising a gp130 transmembrane domain, a CD8-alpha transmembrane domain, a prenylation motif, or a myristoylation domain derived from src, fyn, or lck.

    [0180] In some embodiments, a SPA peptide comprises a src-derived myristoylation domain, fyn-derived myristoylation domain, or lck-derived myristoylation domain. In other embodiments, a SPA peptide comprises a prenylation motif. In some embodiments, a SPA peptide comprises an extracellular domain, a transmembrane domain, and an intracellular signaling domain. In some embodiments, a SPA peptide comprises comprise a transmembrane domain of an interleukin receptor.

    [0181] In some embodiments, the transmembrane domain comprises an amino acid sequence at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence set forth in SEQ ID NO: 61. In some embodiments, the transmembrane domain comprises the amino acid sequence set forth in SEQ ID NO: 61.

    Multimerization Regions

    [0182] In various embodiments, one or more structural alterations can be made to confer constitutive activity to a SPA or functional fragment thereof. In some embodiments, structures or mutations can be added to induce SPA multimerization (e.g, dimerization, trimerization, tetramerization, or higher order multimers). In some embodiments, one or more amino acids can be mutated to a cysteine to allow formation of one or more disulfide bond(s), e.g., between two receptor monomers. In some embodiments, one or more amino acids can be inserted into a wild-type receptor polypeptide to promote dimerization, e.g., through formation of one or more disulfide bond(s). In some embodiments, the multimerization domain comprises one or more unpaired cysteine residues. The one or more amino acids mutated to an unpaired cysteine can be in the extracellular domain, the transmembrane domain, a hinge domain, the intracellular signaling domain, or any linker used to link such domains. In some embodiments, the multimerization domain comprises one or more VASP domains. In some embodiments, the multimerization domain comprises one or more VASP tetramerization domains. In some embodiments, the multimerization domain comprises one or more leucine zippers. In some embodiments, the multimerization domain is intracellular or extracellular.

    [0183] In some embodiments, an exogenous polypeptide is operatively linked to a cytokine receptor or functional fragment thereof to cause their multimerization or dimerization. In some embodiments, a leucine zipper polypeptide is operatively linked to a cytokine receptor or functional fragment thereof. In some embodiments, the leucine zipper polypeptide is a c-Jun leucine zipper. In some embodiments, an exogenous scaffold is operatively linked to a cytokine receptor or functional fragment thereof. In some embodiments, the exogenous scaffold is a CD34 ectodomain (e.g., SEQ ID NO: 242), erythropoietin receptor (EpoR) ectodomain, or thrombopoietin receptor (TpoR) ectodomain.

    [0184] In some embodiments, multimerization of the chimeric polypeptide via the multimerization region results in constitutive activity of the intracellular signaling domain. In some embodiments, the SPA peptide comprises one or more multimerization region. In some embodiments, the SPA peptide comprises two or more multimerization regions. In some embodiments, the multimerization region comprises at least one of one or more unpaired cysteine residues, a leucine zipper, a BCR domain, and a VASP domain. An exemplary BCR domain is provided in SEQ ID NO: 239. An exemplary VASP domain is provided in SEQ ID NO: 240. In some embodiments, the multimerization region comprises at least one or more unpaired cysteine residues. In some embodiments, the multimerization region comprises at least one or more unpaired cysteine residue and a leucine zipper. In some embodiments, the SPA peptide comprises one or more VASP domain polypeptides to promote multimerization (e.g., SEQ ID NO: 240). In some embodiments, the VASP domain is the tetramerization domain of VASP. In some embodiments, the BCR domain comprises a coiled coil tetramerization region. In such embodiments, the BCR ectodomain can result in multimerization through non-covalent interactions.

    [0185] In some embodiments, the multimerization region is intracellular when expressed by a cell. In some embodiments, the multimerization region is extracellular when expressed by a cell. For example, the SPA peptide can comprise, in an N terminus to C terminus direction, i) one or more multimerization domainsan extracellular domaina transmembrane domainan intracellular signaling domain, ii) an extracellular domainone or more multimerization domainsa transmembrane domainan intracellular signaling domain, iii) one or more multimerization domainsa transmembrane domainan intracellular signaling domain, iv) one or more multimerization domainsa lipid anchoran intracellular signaling domain, v) an extracellular domaina transmembrane domainone or more multimerization domainsan intracellular signaling domain, vi) a lipid anchorone or more multimerization domainsan intracellular signaling domain, vii) an extracellular domaina transmembrane domainan intracellular signaling domain-one or more multimerization domains, or viii) a lipid anchoran intracellular signaling domainone or more multimerization domains, or any combination thereof.

    Extracellular Domain

    [0186] A SPA peptide disclosed herein can also comprise an extracellular domain. In some embodiments, the extracellular domain conveys constitutive activity to the intracellular signaling domain. In some embodiments, the extracellular domain comprises a CD34 ectodomain (e.g., a CD34 extracellular domain, SEQ ID NO: 238). In some embodiments, the extracellular domain comprises a CD34 epitope (e.g. a QBEND10 epitope, SEQ ID NO: 238). In some embodiments, the extracellular domain comprises a type I cytokine receptor extracellular domain (e.g., a thrombopoietin receptor (TpoR) ectodomain or erythropoietin receptor (EpoR) ectodomain). In some embodiments, the type I cytokine receptor extracellular domain (e.g., a thrombopoietin receptor (TpoR) ectodomain or erythropoietin receptor (EpoR) ectodomain) further comprises a type I cytokine receptor transmembrane domain (e.g., a thrombopoietin receptor (TpoR) transmembrane domain or erythropoietin receptor (EpoR) transmembrane domain). In some embodiments, the extracellular domain comprises one or more of a CD34 epitope (e.g. a QBEND10 epitope, SEQ ID NO: 238), a CD34 ectodomain (SEQ ID NO: 242), a BCR ectodomain (SEQ ID NO: 239), a thrombopoietin receptor (TpoR) domain (SEQ ID NO: 243), or an erythropoietin receptor (EpoR) ectodomain (SEQ ID NO: 241). In various embodiments, the thrombopoietin receptor (TpoR) ectodomain or erythropoietin receptor (EpoR) ectodomain further comprise one or more unpaired cysteines. In some embodiments, the BCR ectodomain comprises a coiled coil tetramerization region. In such embodiments, the BCR ectodomain can result in multimerization through non-covalent interactions.

    Exemplary SPAs

    [0187] In some embodiments, the SPA peptide comprises a leucine zipper-gp130 (referred to interchangeably herein as L-gp130 or gp130) or an L-gp130 intracellular signaling domain. L-gp130 comprises a homodimer, with each monomer comprising (a) an extracellular domain comprising an inserted cysteine residue that forms a disulfide linkage with another monomer and a c-Jun leucine zipper; and (b) an IL6ST (GP130) transmembrane domain and intracellular signaling domain. The cysteine residue and the leucine zipper on each polypeptide can induce the formation of stable homodimers that mimic constitutive IL-6R activation. Additional details on the construction of L-gp130 are described in Stuhlmann-Laeisz et al. Mol Biol Cell. 2006 July; 17 (7): 2986-95 and in WO2020200325, which are hereby incorporated by reference in their entirety. Diagrams of L-gp130 and other exemplary SPAs described herein are provided in FIG. 2B.

    [0188] In some embodiments, the SPA peptide comprises, from N terminus to C terminus, an extracellular domain comprising a CD34 epitope or CD34 extracellular domain, a multimerization region comprising one or more unpaired cysteine residues, a GP130 (IL6ST) transmembrane domain, and a GP130 (IL6ST) intracellular signaling domain. In some embodiments, the SPA peptide further comprises a leader sequence at the N terminus. In some embodiments, the leader sequence is a CD8a signal sequence, a GP130 (IL6ST) signal sequence, a CD34 signal sequence, or an Erythropoietin receptor (EpoR) signal sequence. In some embodiments, the leader sequence comprises

    TABLE-US-00001 (SEQIDNO:108) MALPVTALLLPLALLLHAARP, (SEQIDNO:109) MLVRRGARAGPRMPRGWTALCLLSLLPSGFM, (SEQIDNO:110) MDHLGASLWPQVGSLCLLLAGAAW, or (SEQIDNO:111) MLTLQTWLVQALFIFLTTESTG.

    [0189] In some embodiments, the SPA peptide comprises a sequence selected from the group set forth in SEQ ID NOs: 1-58 or 63-104. SPAs with a leader sequence at the N terminus are provided in SEQ ID NOs: 1-58. SPAs without a leader sequence at the N terminus are provided in SEQ ID NOs: 63-104. In some embodiments, the SPA peptide comprises a sequence as set forth in SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, or 104. In some embodiments, the SPA peptide comprises a sequence selected from the group set forth in SEQ ID NOs: 1-58 or 63-104. In some embodiments, the SPA peptide comprises a sequence with about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to a sequence as set forth in SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, or 104. In some embodiments, the SPA peptide comprises the sequence set forth in SEQ ID NO: 20. In some embodiments, the SPA peptide comprises the sequence set forth in SEQ ID NO: 81. In some embodiments, the SPA peptide comprises a sequence with about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identity to a sequence as set forth in SEQ ID NOs: 20 or 81.

    Logic Gate Systems

    [0190] As used herein, a logic gate, circuit, circuit receptor, system or system receptor refers to a two part protein expression system comprising a priming receptor and a chimeric antigen receptor. The system can be encoded on at least one nucleic acid inserted into a cell, where the priming receptor is expressed in the cell. The intracellular domain of the priming receptor is cleaved from the transmembrane domain upon binding of the priming receptor to its target antigen. The intracellular domain is then capable of translocating into a cell nucleus where it induces expression of the chimeric antigen receptor.

    [0191] In one aspect, provided herein are systems comprising a priming receptor that binds to a target antigen and a chimeric antigen receptor that binds to a target antigen, wherein the transcription factor of the intracellular domain of the priming receptor is capable of inducing expression of the CAR and/or SPA. Such systems are alternatively termed logic gates or circuits. In some aspects, the system is encoded by nucleic acid transgenes inserted into an immune cell. The system can be encoded on a single nucleic acid insert or fragment that comprises both transgenes, or can be encoded on two nucleic acids that encode the system transgenes individually. The priming receptor and CAR of the system can be placed in any order on the single nucleic acid. For example, the priming receptor can be at the 5 end and the CAR can be at the 3 end, or the CAR can be at the 5 end and the priming receptor can be at the 3 end.

    [0192] A constitutive promoter can be operably linked to the nucleotide sequence encoding the priming receptor and/or SPA. An inducible promoter can also be operably linked to the nucleotide sequence encoding the CAR. In some embodiments, when the system is encoded on a single nucleic acid insert or fragment that comprises both transgenes, the nucleic acid can comprise, in a 5 to 3 direction, the constitutive promoter; the nucleotide sequence encoding priming receptor; the inducible promoter; and the nucleotide sequence encoding chimeric antigen receptor. Alternatively, the nucleic acid can comprise, in a 5 to 3 direction, the inducible promoter; the nucleotide sequence encoding chimeric antigen receptor; the constitutive promoter; the nucleotide sequence encoding priming receptor. In some embodiments, the inducible promoter comprises one or more HNF1 enhancer elements (e.g., one or more, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more HNF1 enhancer elements). In some embodiments, the constitutive promoter comprises an EF1 promoter.

    Priming Receptors

    [0193] Provided herein are priming receptors comprising an extracellular antigen-binding domain that specifically binds target antigens and one or more intracellular domains from or derived from a transcriptional regulator and/or a DNA-binding domain.

    [0194] In certain aspects of the present disclosure, the priming receptor is a synthetic receptor based on the Notch protein. Binding of a natural Notch receptor to a cognate ligand, such as those from the Delta family of proteins, causes intramembrane proteolysis that cleaves an intracellular fragment of the Notch protein. This intracellular fragment is a transcriptional regulator that only functions when cleaved from Notch. Cleavage may occur by sequential proteolysis by ADAM metalloprotease and the gamma-secretase complex. This intracellular fragment enters the nucleus of a cell and activates cell-cell signaling genes. In contrast to a natural Notch protein, a synthetic notch priming receptor replaces the natural Notch intracellular fragment with one that causes a gene encoding a protein of choice, such as a CAR, to be transcribed upon release of the intracellular fragment from the priming receptor.

    [0195] Notch receptors have a modular domain organization. The ectodomains of Notch receptors consist of a series of N-terminal epidermal growth factor (EGF)-like repeats that are responsible for ligand binding. In synthetic Notch receptors or priming receptors, the Notch ligand-binding domain is replaced with a ligand binding domain that binds a selected target ligand or antigen. The EGF repeats are followed by three LIN-12/Notch repeat (LNR) modules, which are unique to Notch receptors, and are widely reported to participate in preventing premature receptor activation. The heterodimerization (HD) domain of Notchl is divided by furin cleavage, so that its N-terminal part terminates the extracellular subunit, and its C-terminal half constitutes the beginning of the transmembrane subunit. Following the extracellular region, the receptor has a transmembrane segment and an intracellular domain (ICD), which includes a transcriptional regulator.

    [0196] Multiple forms of priming receptors can be used in the methods, cells, and nucleic acids as described herein. One type of priming receptor contemplated for use in the methods and cells herein comprise a heterologous extracellular ligand binding domain, a linking polypeptide having substantial sequence identity with a Notch receptor including the NRR, a TMD, and an ICD. Fn Notch receptors comprise a heterologous extracellular ligand binding domain, a linking polypeptide having substantial sequence identity with a Robo receptor (such as a mammalian Robo1, Robo2, Robo3, or Robo4), followed by 1, 2, or 3 fibronectin repeats (Fn), a TMD, and an ICD. Mini Notch receptors comprise a heterologous extracellular ligand binding domain, a linking polypeptide having substantial sequence identity with a Notch receptor (lacking the NRR), a TMD, and an ICD. Minimal Linker Notch receptors comprise a heterologous extracellular ligand binding domain, a linking polypeptide lacking substantial sequence identity with a Notch receptor (e.g., a synthetic (GGS) n polypeptide sequence), a TMD, and an ICD. Hinge Notch receptors comprise a heterologous extracellular ligand binding domain, a hinge sequence comprising an oligomerization domain (i.e., a domain that promotes dimerization, trimerization, or higher order multimerization with a synthetic receptor and/or an existing host receptor), a TMD, and an ICD. All of these receptor classes are synthetic, recombinant, and do not occur in nature. In some embodiments, the non-naturally occurring receptors disclosed herein bind a target cell-surface displayed ligand, which triggers proteolytic cleavage of the receptors and release of a transcriptional regulator that modulates a custom transcriptional program in the cell. In some embodiments, the priming receptor does not include a LIN-12-Notch repeat (LNR) and/or a heterodimerization domain (HD) of a Notch receptor.

    Priming Receptor Extracellular Domain

    [0197] In some embodiments, the extracellular domain includes the ligand-binding portion of a receptor. In some embodiments, the extracellular domain includes an antigen-binding moiety that binds to one or more target antigens. In some embodiments, the antigen-binding moiety includes one or more antigen-binding determinants of an antibody or a functional antigen-binding fragment thereof. In some embodiments, the antigen-binding moiety is selected from the group consisting of an antibody, a nanobody, a diabody, a triabody, or a minibody, a F(ab)2 fragment, a Fab fragment, a single chain variable fragment (scFv), and a single domain antibody (sdAb), or a functional fragment thereof. In some embodiments, the antigen-binding moiety comprises an scFv. The antigen-binding moiety can include naturally-occurring amino acid sequences or can be engineered, designed, or modified so as to provide desired and/or improved properties, e.g., increased binding affinity.

    [0198] In various embodiments, a priming receptor comprises means for binding a target protein, optionally binding a human target protein. In some embodiments, the means binds a target protein. In some embodiments, the means binds a human target protein. In some embodiments, the means is an antibody or antigen-binding fragment or equivalent thereof (e.g., a full length antibody or a F(ab) 2 fragment, a Fab fragment, a single chain variable fragment (scFv), and a single domain antibody (sdAb), or a functional fragment thereof).

    Transmembrane Domain

    [0199] In some embodiments, the priming receptor comprises a hinge domain. In some embodiments, the hinge domain is a CD8 hinge.

    [0200] As described above, the priming receptor comprises a transmembrane domain (TMD) comprising one or more ligand-inducible proteolytic cleavage sites.

    [0201] In some embodiments, the TMD comprises a Notch1 transmembrane domain.

    [0202] Generally, the TMD suitable for the chimeric receptors disclosed herein can be any transmembrane domain of a Type 1 transmembrane receptor including at least one gamma-secretase cleavage site. Detailed description of the structure and function of the gamma-secretase complex as well as its substrate proteins, including amyloid precursor protein (APP) and Notch, can, for example, be found in a recent review by Zhang et al, Frontiers Cell Neurosci (2014). Non limiting suitable TMDs from Type 1 transmembrane receptors include those from CLSTN1, CLSTN2, APLP1, APLP2, LRP8, APP, BTC, TGBR3, SPN, CD44, CSF1R, CXCL16, CX3CL1, DCC, DLL1, DSG2, DAG1, CDH1, EPCAM, EPHA4, EPHB2, EFNB1, EFNB2, ErbB4, GHR, HLA-A, and IFNAR2, wherein the TMD includes at least one gamma secretase cleavage site. Additional TMDs suitable for the compositions and methods described herein include, but are not limited to, transmembrane domains from Type 1 transmembrane receptors ILIR1, ILIR2, IL6R, INSR, ERN1, ERN2, JAG2, KCNE1, KCNE2, KCNE3, KCNE4, KL, CHL1, PTPRF, SCN1B, SCN3B, NPR3, NGFR, PLXDC2, PAM, AGER, ROBO1, SORCS3, SORCS1, SORL1, SDC1, SDC2, SPN, TYR, TYRP1, DCT, YASN, FLT1, CDH5, PKHD1, NECTIN1, PCDHGC3, NRG1, LRP1B, CDH2, NRG2, PTPRK, SCN2B, Nradd, and PTPRM. In some embodiments, the TMD of the chimeric polypeptides or Notch receptors of the disclosure is a TMD derived from the TMD of a member of the calsyntenin family, such as, alcadein alpha and alcadein gamma. In some embodiments, the TMD of the chimeric polypeptides or Notch receptors of the disclosure is a TMD known for Notch receptors. In some embodiments, the TMD of the chimeric polypeptides or Notch receptors of the disclosure is a TMD derived from a different Notch receptor. For example, in a Mini Notch based on human Notch1, the Notch1 TMD can be substituted with a Notch2 TMD, Notch3 TMD, Notch4 TMD, or a Notch TMD from a non-human animal such as Danio rerio, Drosophila melanogaster, Xenopus lacvis, or Gallus gallus.

    [0203] In some embodiments, the priming receptor comprises a Notch cleavage site, such as S2 or S3. Additional proteolytic cleavage sites suitable for the compositions and methods disclosed herein include, but are not limited to, ADAM10, a metalloproteinase cleavage site for a MMP selected from collagenase-1, -2, and -3 (MMP-1, -8, and -13), gelatinase A and B (MMP-2 and -9), stromelysin 1, 2, and 3 (MMP-3, -10, and -11), matrilysin (MMP-7), and membrane metalloproteinases (MT1-MMP and MT2-MMP). Another example of a suitable protease cleavage site is a plasminogen activator cleavage site, e.g., a urokinase plasminogen activator (uPA) or a tissue plasminogen activator (tPA) cleavage site. Another example of a suitable protease cleavage site is a prolactin cleavage site. Specific examples of cleavage sequences of uPA and tPA include sequences comprising Yal-Gly-Arg. Another example of a protease cleavage site that can be included in a proteolytically cleavable linker is a tobacco etch vims (TEV) protease cleavage site, e.g., Glu-Asn-Leu-Tyr-Thr-Gln-Ser (SEQ ID NO: 112), where the protease cleaves between the glutamine and the serine. Another example of a protease cleavage site that can be included in a proteolytically cleavable linker is an enterokinase cleavage site, e.g., Asp-Asp-Asp-Asp-Lys (SEQ ID NO: 113), where cleavage occurs after the lysine residue. Another example of a protease cleavage site that can be included in a proteolytically cleavable linker is a thrombin cleavage site, e.g., Leu-Val-Pro-Arg (SEQ ID NO: 114). Additional suitable linkers comprising protease cleavage sites include sequences cleavable by the following proteases: a PreScission protease (a fusion protein comprising human rhinovirus 3C protease and glutathione-S-transferase), a thrombin, cathepsin B, Epstein-Barr vims proteas, MMP-3 (stromelysin), MMP-7 (matrilysin), MMP-9; thermolysin-like MMP, matrix metalloproteinase 2 (MMP-2), cathepsin L; cathepsin D, matrix metalloproteinase 1 (MMP-1), urokinase-type plasminogen activator, membrane type 1 matrixmetalloprotemase (MT-MMP), stromelysin 3 (or MMP-11), thermo lysin, fibroblast collagenase and stromelysin-1, matrix metalloproteinase 13 (collagenase-3), tissue-type plasminogen activator (tPA), human prostate-specific antigen, kallikrein (hK3), neutrophil elastase, and calpain (calcium activated neutral protease). Proteases that are not native to the host cell in which the receptor is expressed (for example, TEV) can be used as a further regulatory mechanism, in which activation of the receptor is reduced until the protease is expressed or otherwise provided. Additionally, a protease may be tumor-associated or disease-associated (expressed to a significantly higher degree than in normal tissue), and serve as an independent regulatory mechanism. For example, some matrix metalloproteases are highly expressed in certain cancer types.

    [0204] In some embodiments, the amino acid substitution(s) within the TMD includes one or more substitutions within a GV motif of the TMD. In some embodiments, at least one of such substitution(s) comprises a substitution to alanine. Additional sequences and substitutions are described in WO2021061872, hereby incorporated by reference in its entirety.

    Intracellular Domain

    [0205] In some embodiments, the priming receptor comprises one or more intracellular domains from or derived from a transcriptional regulator and/or a DNA-binding domain. In some embodiments, the intracellular domain comprises means for modulating transcription of one or more genes. In some embodiments, the means for modulating transcription of one or more genes comprises a transcriptional regulator, e.g., a transcriptional regulator provided herein or an equivalent thereof. In some embodiments, the priming receptor comprises one or more intracellular domains from or derived from a transcriptional regulator and/or a DNA-binding domain. In some embodiments, the intracellular domain comprises an HNF1/p65 domain or a Gal4/VP64 domain.

    [0206] Transcriptional regulators either activate or repress transcription from cognate promoters. Transcriptional activators typically bind nearby to transcriptional promoters and recruit RNA polymerase to directly initiate transcription. Transcriptional repressors bind to transcriptional promoters and sterically hinder transcriptional initiation by RNA polymerase. Other transcriptional regulators serve as either an activator or a repressor depending on where it binds and cellular conditions. Accordingly, as used herein, a transcriptional activation domain refers to the domain of a transcription factor that interacts with transcriptional control elements and/or transcriptional regulatory proteins (i.e., transcription factors, RNA polymerases, etc.) to increase and/or activate transcription of one or more genes. Non-limiting examples of transcriptional activation domains include: a herpes simplex virus VP16 activation domain, VP64 (which is a tetrameric derivative of VP16), HIV TAT, a NFkB p65 activation domain, p53 activation domains 1 and 2, a CREB (CAMP response element binding protein) activation domain, an E2A activation domain, NFAT (nuclear factor of activated T-cells) activation domain, yeast Gal4, yeast GCN4, yeast HAP1, MLL, RTG3, GLN3, OAF1, PIP2, PDR1, PDR3, PHO4, LEU3 glucocorticoid receptor transcription activation domain, B-cell POU homeodomain protein Oct2, plant Ap2, or any others known to one or ordinary skill in the art. In some embodiments, the transcriptional regulator is selected from Gal4-VP16, Gal4-VP64, tetR-VP64, ZFHD1-YP64, Gal4-KRAB, and HAP1-VP16. In some embodiments, the transcriptional regulator is Gal4-VP64. A transcriptional activation domain can comprise a wild-type or naturally occurring sequence, or it can be a modified, mutant, or derivative version of the original transcriptional activation domain that has the desired ability to increase and/or activate transcription of one or more genes. In some embodiments, the transcriptional regulator can further include a nuclear localization signal.

    [0207] In some embodiments, the priming receptor comprises one or more intracellular DNA-binding domains (or DB domains). Such DNA-binding domains refer to sequence-specific DNA binding domains that bind a particular DNA sequence element. Accordingly, as used herein, a sequence-specific DNA-binding domain refers to a protein domain portion that has the ability to selectively bind DNA having a specific, predetermined sequence. A sequence-specific DNA binding domain can comprise a wild-type or naturally occurring sequence, or it can be a modified, mutant, or derivative version of the original domain that has the desired ability to bind to a desired sequence. In some embodiments, the sequence-specific DNA binding domain is engineered to bind a desired sequence. Non-limiting examples of proteins having sequence-specific DNA binding domains that can be used in synthetic proteins described herein include HNF1a, Gal4, GCN4, reverse tetracycline receptor, THY1, SYN1, NSE/RU5, AGRP, CALB2, CAMK2A, CCK, CHAT, DLX6A, EMX1, zinc finger proteins or domains thereof, CRISPR/Cas proteins, such as Cas9, Cas3, Cas4, Cas5, Cas5e (or CasD), Cash, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas10, Cas10d, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Csc2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csz1, Csx15, Csf1, Csf2, Csf3, Csf4, and Cu196, and TALES.

    [0208] In those embodiments where a CRISPR/Cas-like protein is used, the CRISPR/Cas-like protein can be a wild type CRISPR/Cas protein, a modified CRISPR/Cas protein, or a fragment of a wild type or modified CRISPR/Cas protein. The CRISPR/Cas-like protein can be modified to increase nucleic acid binding affinity and/or specificity, alter an enzymatic activity, and/or change another property of the protein. For example, nuclease (i.e., DNase, RNase) domains of the CRISPR/Cas-like protein can be modified, deleted, or inactivated. Alternatively, the CRISPR/Cas-like protein can be truncated to remove domains that are not essential for the functions of the systems described herein. For example, a CRISPR enzyme that is used as a DNA binding protein or domain thereof can be mutated with respect to a corresponding wild-type enzyme such that the mutated CRISPR or domain thereof lacks the ability to cleave a nucleic acid sequence containing a DNA binding domain target site. For example, a D10A mutation can be combined with one or more of H840A, N854A, or N863A mutations to produce a Cas9 enzyme substantially lacking all DNA cleavage activity.

    Juxtamembrane Domain

    [0209] The ECD and the TMD, or the TMD and the ICD, can be linked to each other with a linking polypeptide, such as a juxtamembrane domain. SynNotch or synthetic notch receptors comprise a heterologous extracellular ligand-binding domain, a linking polypeptide having substantial sequence identity with a Notch receptor JMD (including the NRR), a TMD, and an ICD. Fn Notch receptors comprise a heterologous extracellular ligand binding domain, a linking polypeptide having substantial sequence identity with a Robo receptor (such as a mammalian Robo1, Robo2, Robo3, or Robo4), followed by 1, 2, or 3 fibronectin repeats (Fn), a TMD, and an ICD. Mini Notch receptors comprise a heterologous extracellular ligand binding domain, a linking polypeptide having substantial sequence identity with a Notch receptor JMD but lacking the NRR (the LIN-12-Notch repeat (LNR) modules, and the heterodimerization domain), a TMD, and an ICD. Minimal Linker Notch receptors comprise a heterologous extracellular ligand-binding domain, a linking polypeptide lacking substantial sequence identity with a Notch receptor (for example, without limitation, having a synthetic (GGS) n polypeptide sequence), a TMD, and an ICD. Hinge Notch receptors comprise a heterologous extracellular ligand-binding domain, a hinge sequence comprising an oligomerization domain (i.e., a domain that promotes dimerization, trimerization, or higher order multimerization with a synthetic receptor and/or an existing host receptor), a TMD, and an ICD.

    [0210] In some embodiments, the priming receptor comprises a juxtamembrane domain (JMD) peptide in between the extracellular domain and the transmembrane domain. In some embodiments, the priming receptor comprises a juxtamembrane domain (JMD) peptide in between the transmembrane domain and the intracellular domain. In some embodiments, the JMD peptide comprises an LWF motif. The use of LWF motifs in receptor constructs is described in U.S. Pat. No. 10,858,443, hereby incorporated by reference in its entirety. In some embodiments, the JMD peptide has substantial sequence identity to the JMD of Notch1, Notch2, Notch3, and/or Notch4. In some embodiments, the JMD peptide has substantial sequence identity to the Notch1, Notch2, Notch3, and/or Notch4 JMD, but does not include a LIN-12-Notch repeat (LNR) and/or a heterodimerization domain (HD) of a Notch receptor. In some embodiments, the JMD peptide does not have substantial sequence identity to the Notch1, Notch2, Notch3, and/or Notch4 JMD. In some embodiments, the JMD peptide includes an oligimerization domain which promotes formation of dimers, trimers, or higher order assemblages of the receptor. Such JMD peptides are described in WO2021061872, hereby incorporated by reference in its entirety.

    [0211] In the Mini Notch receptor, the linking polypeptide is derived from a Notch JMD sequence after deletion of the NRR and HD domain. The Notch JMD sequence may be the sequence from Notch1, Notch2, Notch3, or Notch4, and can be derived from a non-human homolog, such as those from Drosophila, Gallus, Danio, and the like. Four to 50 amino acid residues of the remaining Notch sequence can be used as a polypeptide linker. In some embodiments, the length and amino acid composition of the linker polypeptide sequence are varied to alter the orientation and/or proximity of the ECD and the TMD relative to one another to achieve a desired activity of the chimeric polypeptide, such as the signal transduction level when ligand induced or in the absence of ligand.

    [0212] In the Minimal Linker Notch receptor, the linking polypeptide does not have substantial sequence identity to a Notch JMD sequence, including the Notch JMD sequence from Notch1, Notch2, Notch3, or Notch4, or a non-human homolog thereof. Four to 50 amino acid residues can be used as a polypeptide linker. In some embodiments, the length and amino acid composition of the linker polypeptide sequence are varied to alter the orientation and/or proximity of the ECD and the TMD relative to one another to achieve a desired activity of the chimeric polypeptide of the disclosure. The Minimal Linker sequence can be designed to include or omit a protease cleavage site, and can include or omit a glycosylation site or sites for other types of post-translational modification. In some embodiments, the Minimal Linker does not comprise a protease cleavage site or a glysosylation site.

    [0213] In some embodiments, the priming receptor further comprises a hinge. Hinge linkers that can be used in the priming receptor can include an oligomerization domain (e.g., a hinge domain) containing one or more polypeptide motifs that promote oligomer formation of the chimeric polypeptides via intermolecular disulfide bonding. In these instances, within the chimeric receptors disclosed herein, the hinge domain generally includes a flexible polypeptide connector region disposed between the ECD and the TMD. Thus, the hinge domain provides flexibility between the ECD and TMD and also provides sites for intermolecular disulfide bonding between two or more chimeric polypeptide monomers to form an oligomeric complex. In some embodiments, the hinge domain includes motifs that promote dimer formation of the chimeric polypeptides disclosed herein. In some embodiments, the hinge domain includes motifs that promote trimer formation of the chimeric polypeptides disclosed herein (e.g., a hinge domain derived from OX40). Hinge polypeptide sequences suitable for the compositions and methods of the disclosure can be naturally-occurring hinge polypeptide sequences (e.g., those from naturally-occurring immunoglobulins) or can be engineered, designed, or modified so as to provide desired and/or improved properties, e.g., modulating transcription. Suitable hinge polypeptide sequences include, but are not limited to, those derived from IgA, IgD, and IgG subclasses, such as IgG1 hinge domain, IgG2 hinge domain, IgG3 hinge domain, and IgG4 hinge domain, or a functional variant thereof. In some embodiments, the hinge polypeptide sequence contains one or more CXXC motifs. In some embodiments, the hinge polypeptide sequence contains one or more CPPC motifs (SEQ ID NO: 115).

    [0214] Hinge polypeptide sequences can also be derived from a CD8a hinge domain, a CD28 hinge domain, a CD152 hinge domain, a PD-1 hinge domain, a CTLA4 hinge domain, an OX40 hinge domain, and functional variants thereof. In some embodiments, the hinge domain includes a hinge polypeptide sequence derived from a CD8 a hinge domain or a functional variant thereof. In some embodiments, the hinge domain includes a hinge polypeptide sequence derived from a CD28 hinge domain or a functional variant thereof. In some embodiments, the hinge domain includes a hinge polypeptide sequence derived from an OX40 hinge domain or a functional variant thereof. In some embodiments, the hinge domain includes a hinge polypeptide sequence derived from an IgG4 hinge domain or a functional variant thereof.

    [0215] The Fn Notch linking polypeptide is derived from the Robo1 JMD, which contains a fibronectin repeat (Fn) domain, with a short polypeptide sequence between the Fn repeats and the TMD. The Fn Notch linking polypeptide does not contain a Notch negative regulatory region (NRR), or the Notch HD domain. The Fn linking polypeptide can contain 1, 2, 3, 4, or 5 Fn repeats. In some embodiments, the chimeric receptor comprises a Fn linking polypeptide having about 1 to about 5 Fn repeats, about 1 to about 3 Fn repeats, or about 2 to about 3 Fn repeats. The short polypeptide sequence between the Fn repeats and the TMD can be from about 2 to about 30 amino acid residues. In some embodiments, the short polypeptide sequence can be between about 5 and about 20 amino acids, of any sequence. In some embodiments, the short polypeptide sequence can be between about 5 and about 20 naturally-occurring amino acids, of any sequence. In some embodiments, the short polypeptide sequence can be between about 5 and about 20 amino acids, of any sequence but having no more than one proline. In some embodiments, the short polypeptide sequence can be between about 5 and about 20 amino acids, and about 50% or more of the amino acids are glycine. In some embodiments, the short polypeptide sequence can be between about 5 and about 20 amino acids, where the amino acids are selected from glycine, serine, threonine, and alanine. In some embodiments, the length and amino acid composition of the Fn linking polypeptide sequence can be varied to alter the orientation and/or proximity of the ECD and the TMD relative to one another to achieve a desired activity of the chimeric polypeptide of the disclosure.

    Stop-Transfer Sequence

    [0216] In some embodiments, the priming receptor further comprises a stop-transfer sequence (STS) in between the transmembrane domain and the intracellular domains. The STS comprises a charged, lipophobic sequence. Without being bound by any theory, the STS serves as a membrane anchor, and is believed to prevent passage of the intracellular domain into the plasma membrane. The use of STS domains in priming receptors is described in WO2021061872, hereby incorporated by reference in its entirety. Non-limiting exemplary STS sequences include APLP1, APLP2, APP, TGBR3, CSF1R, CXCL16, CX3CL1, DAG1, DCC, DNER, DSG2, CDH1, GHR, HLA-A, IFNAR2, IGF1R, ILIR1, ERN2, KCNE1, KCNE2, CHL1, LRP1, LRP2, LRP18, PTPRF, SCN1B, SCN3B, NPR3, NGFR, PLXDC2, PAM, AGER, ROBO1, SORCS3, SORCS1, SORL1, SDC1, SDC2, SPN, TYR, TYRP1, DCT, VASN, FLT1, CDH5, PKTFD1, NECTIN1, KL, IL6R, EFNB1, CD44, CLSTN1, LRP8, PCDHGC3, NRG1, LRP1B, JAG2, EFNB2, DLL1, CLSTN2, EPCAM, ErbB4, KCNE3, CDH2, NRG2, PTPRK, BTC, EPHA4, ILIR2, KCNE4, SCN2B, Nradd, PTPRM, Notch1, Notch2, Notch3, and Notch4 STS sequences. In some embodiments, the STS is heterologous to the transmembrane domain. In some embodiments, the STS is homologous to the transmembrane domain. STS sequences are described in WO2021061872, hereby incorporated by reference in its entirety.

    Chimeric Antigen Receptors

    [0217] In another aspect, provided herein are chimeric antigen receptors comprising an extracellular antigen-binding domain that specifically binds to a target antigen or ligand.

    [0218] In some embodiments, the chimeric antigen receptor includes an extracellular portion comprising an antigen binding domain. The antigen recognition domain of a receptor such as a CAR can be linked to one or more intracellular signaling components, such as signaling components that mimic activation through an antigen receptor complex, such as a TCR complex, in the case of a CAR, and/or signal via another cell surface receptor. Thus, in some embodiments, the extracellular binding component (e.g., ligand-binding or antigen-binding domain) is linked to one or more transmembrane and intracellular signaling domains. In some embodiments, the transmembrane domain is fused to the extracellular domain. In one embodiment, a transmembrane domain that naturally is associated with one of the domains in the receptor, e.g., CAR, is used. In some instances, the transmembrane domain is selected or modified by amino acid substitution to avoid binding of such domains to the transmembrane domains of the same or different surface membrane proteins to minimize interactions with other members of the receptor complex.

    [0219] In some aspects, the chimeric antigen receptor includes an extracellular portion comprising an antigen binding domain described herein and an intracellular signaling domain. In some embodiments, an antibody or fragment includes an scFv, a VH, or a single-domain VH antibody and the intracellular domain contains an ITAM. In some aspects, the intracellular signaling domain includes a signaling domain of a zeta chain of a CD3-zeta (CD3) chain. In some embodiments, the chimeric antigen receptor includes a transmembrane domain linking the extracellular domain and the intracellular signaling domain.

    [0220] In some aspects, the transmembrane domain contains a transmembrane portion of CD8a or CD28. The extracellular domain and transmembrane can be linked directly or indirectly. In some embodiments, the extracellular domain and transmembrane are linked by a spacer, such as any described herein. In some embodiments, the chimeric antigen receptor contains an intracellular domain of a T cell costimulatory molecule, such as between the transmembrane domain and intracellular signaling domain. In some aspects, the T cell costimulatory molecule is CD28 or 41BB.

    Chimeric Antigen Receptor Extracellular Domain

    [0221] In some embodiments, the extracellular domain includes the ligand-binding portion of a receptor. In some embodiments, the extracellular domain includes an antigen-binding moiety that binds to one or more target antigens. In some embodiments, the antigen-binding moiety includes one or more antigen-binding determinants of an antibody or a functional antigen-binding fragment thereof. In some embodiments, the antigen-binding moiety is selected from the group consisting of an antibody, a nanobody, a diabody, a triabody, or a minibody, a F(ab).sub.2 fragment, a Fab fragment, a single chain variable fragment (scFv), and a single domain antibody (sdAb), or a functional fragment thereof. In some embodiments, the antigen-binding moiety comprises an scFv. The antigen-binding moiety can include naturally-occurring amino acid sequences or can be engineered, designed, or modified so as to provide desired and/or improved properties, e.g., increased binding affinity.

    [0222] In various embodiments, a CAR comprises means for binding a target protein. In some embodiments, the means binds a target protein. In some embodiments, the means binds a human target protein. In some embodiments, the means is an antibody or antigen-binding fragment or equivalent thereof (e.g., a full length antibody or a F(ab) 2 fragment, a Fab fragment, a single chain variable fragment (scFv), and a single domain antibody (sdAb), or a functional fragment thereof) means for binding a target protein.

    CAR Transmembrane Domain

    [0223] The transmembrane domain in some embodiments is derived either from a natural or from a synthetic source. Where the source is natural, the domain in some aspects is derived from any membrane-bound or transmembrane protein. Transmembrane regions include those derived from (i.e. comprise at least the transmembrane region(s) of) the alpha, beta or zeta chain of the T-cell receptor, CD28, CD3 epsilon, CD45, CD4, CD5, CDS, CD9, CD 16, CD22, CD33, CD37, CD64, CD80, CD86, CD 134, CD137, and/or CD 154. Alternatively the transmembrane domain in some embodiments is synthetic. In some aspects, the synthetic transmembrane domain comprises predominantly hydrophobic residues such as leucine and valine. In some aspects, a triplet of phenylalanine, tryptophan and valine will be found at each end of a synthetic transmembrane domain. In some embodiments, the linkage is by linkers, spacers, and/or transmembrane domain(s).

    [0224] In some embodiments, the transmembrane domain of the receptor, e.g., the CAR, is a transmembrane domain of human CD28 or variant thereof, e.g., a 27-amino acid transmembrane domain of a human CD28 (Accession No.: P10747.1).

    [0225] In some embodiments, the CAR comprises a CD8a or CD28 TMD.

    CAR Hinge

    [0226] In some embodiments, the CAR further includes a spacer, which may be or include at least a portion of an immunoglobulin constant region or variant or modified version thereof, such as a hinge region, e.g., a CD8a hinge, an IgG4 hinge region, and/or a CH1/CL and/or Fc region. In some embodiments, the constant region or portion is of a human IgG, such as IgG4 or IgG1. In some aspects, the portion of the constant region serves as a spacer region between the antigen-recognition component, e.g., scFv, and transmembrane domain. The spacer can be of a length that provides for increased responsiveness of the cell following antigen binding, as compared to in the absence of the spacer. In some examples, the spacer is at or about 12 amino acids in length or is no more than 12 amino acids in length. Exemplary spacers include those having at least about 10 to 229 amino acids, about 10 to 200 amino acids, about 10 to 175 amino acids, about 10 to 150 amino acids, about 10 to 125 amino acids, about 10 to 100 amino acids, about 10 to 75 amino acids, about 10 to 50 amino acids, about 10 to 40 amino acids, about 10 to 30 amino acids, about 10 to 20 amino acids, or about 10 to 15 amino acids, and including any integer between the endpoints of any of the listed ranges. In some embodiments, a spacer region has about 12 amino acids or less, about 119 amino acids or less, or about 229 amino acids or less. Exemplary spacers include CD8a hinge, IgG4 hinge alone, IgG4 hinge linked to CH2 and CH3 domains, or IgG4 hinge linked to the CH3 domain. Exemplary spacers include, but are not limited to, those described in Hudecek et al. (2013) Clin. Cancer Res., 19:3153 or international patent application publication number WO2014031687. In some embodiments, the CAR hinge comprises a CD8a CD8a, truncated CD8a, or CD28 hinge domain.

    [0227] Among the intracellular signaling domains are those that mimic or approximate a signal through a natural antigen receptor, a signal through such a receptor in combination with a costimulatory receptor, and/or a signal through a costimulatory receptor alone. In some embodiments, a short oligo- or polypeptide linker, for example, a linker of between 2 and 10 amino acids in length, such as one containing glycines and serines, e.g., glycine-serine doublet, is present and forms a linkage between the transmembrane domain and the cytoplasmic signaling domain of the receptor.

    CAR Intracellular Domain

    [0228] In some embodiments, upon ligation of the CAR, the cytoplasmic domain or intracellular signaling domain of the receptor activates at least one of the normal effector functions or responses of the immune cell, e.g., T cell engineered to express the receptor. In some embodiments, the CAR comprises means for activating at least one of the normal effector functions or responses of the immune cell, e.g., T cell engineered to express the receptor. For example, in some contexts, the receptor induces a function of a T cell such as cytolytic activity or T-helper activity, such as secretion of cytokines or other factors. In some embodiments, a truncated portion of an intracellular signaling domain of an antigen receptor component or costimulatory molecule is used in place of an intact immunostimulatory chain, for example, if it transduces the effector function signal. In some embodiments, the intracellular signaling domain or domains include the cytoplasmic sequences of the T cell receptor (TCR), and in some aspects also those of co-receptors that in the natural context act in concert with such receptor to initiate signal transduction following antigen receptor engagement, and/or any derivative or variant of such molecules, and/or any synthetic sequence that has the same functional capability. In some embodiments, the means for at least one of the normal effector functions or responses of the immune cell comprises an CAR intracellular activation domain, e.g., an intracellular activation domain provided herein or an equivalent thereof. In some embodiments, the means for at least one of the normal effector functions or responses of the immune cell comprises an CAR intracellular activation domain and a CAR co-stimulatory domain, e.g., a co-stimulatory domain provided herein or an equivalent thereof.

    [0229] In some aspects, the receptor includes a primary cytoplasmic signaling sequence that regulates primary activation of the TCR complex. Primary cytoplasmic signaling sequences that act in a stimulatory manner may contain signaling motifs which are known as immunoreceptor tyrosine-based activation motifs or ITAMs. Examples of ITAM containing primary cytoplasmic signaling sequences include those derived from TCR or CD3 zeta, FcR gamma, FcR beta, CD3 gamma, CD3 delta, CD3 epsilon, CDS, CD22, CD79a, CD79b, and CD66d. In some embodiments, cytoplasmic signaling molecule(s) in the CAR contain(s) a cytoplasmic signaling domain, portion thereof, or sequence derived from CD3 zeta. In some embodiments, the intracellular activation domain comprises a CD33 domain.

    [0230] In some embodiments, the intracellular signaling domain comprises a human CD3 zeta stimulatory signaling domain or functional variant thereof, such as a 112 AA cytoplasmic domain of isoform 3 of human CD3.zeta. (Accession No.: P20963.2) or a CD3 zeta signaling domain as described in U.S. Pat. No. 7,446,190 or U.S. Pat. No. 8,911,993.

    [0231] The receptor, e.g., the CAR, can include at least one intracellular signaling component or components. In some embodiments, the receptor includes an intracellular component of a TCR complex, such as a TCR CD3 chain that mediates T-cell activation and cytotoxicity, e.g., CD3 zeta chain. Thus, in some aspects, the extracellular domain is linked to one or more cell signaling modules. In some embodiments, cell signaling modules include CD3 transmembrane domain, CD3 intracellular signaling domains, and/or other CD transmembrane domains. In some embodiments, the receptor, e.g., CAR, further includes a portion of one or more additional molecules such as Fc receptor-gamma, CD8, CD4, CD25, or CD16. For example, in some aspects, the CAR includes a chimeric molecule between CD3-zeta or Fc receptor-gamma and CD8, CD4, CD25 or CD16.

    [0232] In some embodiments, the intracellular domain comprises an intracellular costimulatory signaling domain of 41BB or functional variant or portion thereof, such as a 42-amino acid cytoplasmic domain of a human 4-1BB (Accession No. Q07011.1) or functional variant or portion thereof.

    [0233] In some embodiments, the receptor encompasses one or more, e.g., two or more, costimulatory domains and an activation domain, e.g., primary activation domain, in the cytoplasmic portion. Exemplary receptors include intracellular components of CD3-zeta, CD28, and 4-1BB. In some embodiments, the chimeric antigen receptor contains an intracellular domain of a T cell costimulatory molecule. In some aspects, the T cell costimulatory molecule is 4-1BB.

    [0234] In some embodiments, the receptor includes a signaling domain and/or transmembrane portion of a costimulatory receptor, such as CD28, 4-1BB, OX40, DAP10, and ICOS. In some aspects, the same receptor includes both the activating and costimulatory components.

    [0235] In certain embodiments, the intracellular signaling domain comprises a CD8a transmembrane and signaling domain linked to a CD3 (e.g., CD3-zeta) intracellular domain. In some embodiments, the intracellular signaling domain comprises a 4-1BB (CD137, TNFRSF9) co-stimulatory domains, linked to a CD3 zeta intracellular domain. In some embodiments, the CAR comprises a 4-1BB co-stimulatory domain.

    [0236] In some embodiments, the CAR or other antigen receptor further includes a marker, such as a cell surface marker, which may be used to confirm transduction or engineering of the cell to express the receptor, such as a truncated version of a cell surface receptor, such as truncated EGFR (tEGFR). In some aspects, the marker includes all or part (e.g., truncated form) of CD34, a nerve growth factor receptor (NGFR), or epidermal growth factor receptor (e.g., tEGFR). In some embodiments, the nucleic acid encoding the marker is operably linked to a polynucleotide encoding for a linker sequence, such as a cleavable linker sequence or a ribosomal skip sequence, e.g., T2A. See WO2014031687. In some embodiments, introduction of a construct encoding the CAR and EGFRt separated by a T2A ribosome switch can express two proteins from the same construct, such that the EGFRt can be used as a marker to detect cells expressing such construct. In some embodiments, a marker, and optionally a linker sequence, can be any as disclosed in published patent application No. WO2014031687. For example, the marker can be a truncated EGFR (tEGFR) that is, optionally, linked to a linker sequence, such as a T2A ribosomal skip sequence.

    [0237] In some embodiments, the marker is a molecule, e.g., cell surface protein, not naturally found on T cells or not naturally found on the surface of T cells, or a portion thereof.

    [0238] In some embodiments, the molecule is a non-self molecule, e.g., non-self protein, i.e., one that is not recognized as self by the immune system of the host into which the cells will be adoptively transferred.

    [0239] In some embodiments, the marker serves no therapeutic function and/or produces no effect other than to be used as a marker for genetic engineering, e.g., for selecting cells successfully engineered. In other embodiments, the marker may be a therapeutic molecule or molecule otherwise exerting some desired effect, such as a ligand for a cell to be encountered in vivo, such as a costimulatory or immune checkpoint molecule to enhance and/or dampen responses of the cells upon adoptive transfer and encounter with ligand.

    [0240] The CAR may comprise one or modified synthetic amino acids in place of one or more naturally-occurring amino acids. Exemplary modified amino acids include, but are not limited to, aminocyclohexane carboxylic acid, norleucine, -amino n-decanoic acid, homoserine, S-acetylaminomethylcysteine, trans-3- and trans-4-hydroxyproline, 4-aminophenylalanine, 4-nitrophenylalanine, 4-chlorophenylalanine, 4-carboxyphenylalanine, (3-phenylserine (3-hydroxyphenylalanine, phenylglycine, -naphthylalanine, cyclohexylalanine, cyclohexylglycine, indoline-2-carboxylic acid, 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, aminomalonic acid, aminomalonic acid monoamide, N-benzyl-N-methyl-lysine, N,N-dibenzyl-lysine, 6-hydroxylysine, ornithine, -aminocyclopentane carboxylic acid, -aminocyclohexane carboxylic acid, -aminocycloheptane carboxylic acid, -(2-amino-2-norbomane)-carboxylic acid, ,-diaminobutyric acid, ,-diaminopropionic acid, homophenylalanine, and -tertbutylglycine.

    [0241] For example, in some embodiments, the CAR includes an antibody or fragment thereof, including single chain antibodies (sdAbs, e.g. containing only the VH region), VH domains, and scFvs, described herein, a spacer such as a CD8a hinge, a CD8a transmembrane domain, a 4-1BB intracellular signaling domain, and a CD3 zeta signaling domain. In some embodiments, the CAR includes an antibody or fragment, including sdAbs and scFvs described herein, a spacer such as a CD8a hinge, a CD8a transmembrane domain, a 4-1BB intracellular signaling domain, and a CD3 zeta signaling domain.

    [0242] Transgenes expressing the priming receptor and CAR system may be introduced into cells, such as a T cell, using, for example, a site-specific technique. With site specific integration of the transgenes (e.g. priming receptor and CAR), the transgenes may be targeted to a safe harbor locus or TRAC. Examples of site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9.

    [0243] The engineered cells have applications to immune-oncology. The priming receptor and CAR, for example, can be selected to target different specific tumor antigens. Examples of cancers that can be effectively targeted using such cells are blood cancers or solid cancers. In some embodiments, immune cell therapy can be used to treat solid tumors.

    Nucleic Acids and Vectors

    [0244] In another aspect, provided herein are one or more nucleic acids, wherein the one or more nucleic acids encode a synthetic pathway activator described herein. In another aspect, provided herein are one or more nucleic acids, wherein the one or more nucleic acids encode a sequence selected from the group consisting of SEQ ID NOS: 1-58 or 63-104.

    [0245] In some embodiments, the one or more nucleic acid(s) further comprises a 5 homology directed repair arm and/or a 3 homology directed repair arm complementary to an insertion site in a host cell chromosome. In some embodiments, the one or more nucleic acid(s) comprises the 5 homology directed repair arm and the 3 homology directed repair arm. In some embodiments, the one or more nucleic acid(s) is incorporated into an expression cassette or an expression vector. In some embodiments, the expression cassette or the expression vector further comprises a constitutive promoter upstream of the one or more nucleic acid(s).

    [0246] In some embodiments, the priming receptor, CAR, and synthetic pathway activator are incorporated into a single expression cassette or a single expression vector. In some embodiments, the priming receptor, CAR, and synthetic pathway activator are incorporated into two or more expression cassettes or expression vectors. In some embodiments, the expression vector(s) is a non-viral vector.

    [0247] In some embodiments, the present disclosure contemplates nucleic acid DNA template inserts that comprise one or more transgenes encoding the synthetic pathway activators as described herein. In some embodiments, the DNA template insert encodes a synthetic pathway activator. In some embodiments, the nucleic acid DNA template further comprises a priming receptors and/or CAR. In some embodiments, the DNA template insert encodes a priming receptor transgene. In some embodiments, the DNA template insert encodes a chimeric antigen receptor transgene. In some embodiments, the DNA template insert comprises a synthetic pathway activator, a priming receptor transgene, and a chimeric antigen receptor transgene.

    [0248] In some embodiments, the one or more nucleic acid(s) are encoded on a single DNA template insert. In some embodiments, the one or more nucleic acid(s) are encoded on multiple DNA template inserts. For example, the one or more nucleic acid(s) can be encoded on two, three, or four DNA template inserts.

    [0249] The DNA template insert can also comprise a self-cleaving peptide. Examples of self-cleaving peptides include, but are not limited to, self-cleaving viral 2A peptides, for example, a porcine teschovirus-1 (P2A) peptide, a Thosea asigna virus (T2A) peptide, an equine rhinitis A virus (E2A) peptide, or a foot-and-mouth disease virus (F2A) peptide. Self-cleaving 2A peptides allow expression of multiple gene products from a single construct. (See, for example, Chang et al. Cleavage efficient 2A peptides for high level monoclonal antibody expression in CHO cells, MAbs 7 (2): 403-412 (2015)).

    [0250] The DNA template insert can also comprise a WPRE element. WPRE elements are generally described in Higashimoto, T., et al. Gene Ther 14, 1298-1304 (2007); and Zufferey, R., et al. J Virol. 1999 April; 73 (4): 2886-92., both of which are hereby incorporated by reference.

    [0251] The DNA template insert can also comprise an SV40 polyA tail.

    Cells

    [0252] Also provided herein are cells or immune cells comprising at least one DNA template non-virally inserted into a target region of the genome of the cell, wherein DNA template encodes one or more of the synthetic pathway activators as described herein. In some embodiments, the DNA template further encodes a priming receptor and CAR system as described herein.

    [0253] A cell comprising a DNA template insert at a target locus or safe harbor site as described in the present disclosure can be referred to as an engineered cell. In some embodiments, the cell or immune cell is any cell that can give rise to a pluripotent immune cell. In some embodiments, the immune cell is a primary immune cell. In some embodiments, the immune cell can be an induced pluripotent stem cell (iPSC) or a human pluripotent stem cell (HSPC). In some embodiments, the immune cell comprises primary hematopoietic cells or primary hematopoietic stem cells. In some embodiments, that engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an adaptive immune cell, an innate immune cell, a natural killer (NK) cell, a T cell, a CD8+ cell, a CD4+ cell, or a T cell progenitor cell. In some embodiments, the immune cells are T cells. In some embodiments, the T cells are regulatory T cells, effector T cells, or nave T cells. In some embodiments, the T cells are CD8.sup.+ T cells. In some embodiments, the T cells are CD4.sup.+ T cells. In some embodiments, the T cells are CD4.sup.+CD8.sup.+ T cells.

    [0254] In some embodiments, the engineered cell is a stem cell, a human cell, a primary cell, an hematopoietic cell, an hematopoietic stem cell, an adaptive immune cell, an innate immune cell, a T cell or a T cell progenitor. Non-limiting examples of immune cells that are contemplated in the present disclosure include T cell, B cell, natural killer (NK) cell, NKT/INKT cell, macrophage, myeloid cell, and dendritic cells. Non-limiting examples of stem cells that are contemplated in the present disclosure include pluripotent stem cells (PSCs), embryonic stem cells (ESCs), induced pluripotent stem cells (iPSCs), embryo-derived embryonic stem cells obtained by nuclear transfer (ntES; nuclear transfer ES), male germline stem cells (GS cells), embryonic germ cells (EG cells), hematopoietic stem/progenitor stem cells (HSPCs), somatic stem cells (adult stem cells), hemangioblasts, neural stem cells, mesenchymal stem cells and stem cells of other cells (including osteocyte, chondrocyte, myocyte, cardiac myocyte, neuron, tendon cell, adipocyte, pancreocyte, hepatocyte, nephrocyte and follicle cells and so on). In some embodiments, the engineered cells is a T cell, NK cells, iPSC, and HSPC. In some embodiments, the engineered cells used in the present disclosure are human cell lines grown in vitro (e.g. deliberately immortalized cell lines, cancer cell lines, etc.).

    [0255] Also provided herein are populations of cells comprising a plurality of the cells or immune cell. In some embodiments, the genome of at least 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99% or greater of the cells comprises the priming receptor and CAR system as described herein.

    Method of Treating Immune-Related Condition of Disease

    [0256] In another aspect, the disclosure provides methods of treating an immune-related condition (e.g., cancer) in an individual comprising administering to the individual an effective amount of a composition comprising a synthetic pathway activator described herein, such as a cell comprising a synthetic pathway activator described herein. In some embodiments, the composition further comprises a priming receptor that specifically binds to a target antigen and a chimeric antigen receptor that specifically binds to a target antigen. In another aspect, the disclosure provides methods of enhancing an immune response in an individual comprising administering to the individual an effective amount of a composition comprising a synthetic pathway activator described herein, such as such as a cell comprising a synthetic pathway activator described herein. In some embodiments, the composition further comprises a priming receptor that specifically binds to a target antigen and a chimeric antigen receptor that specifically binds to a target antigen.

    [0257] In some embodiments, the methods provided herein are useful for the treatment of an immune-related condition in an individual. In one embodiment, the individual is a human.

    [0258] In some embodiments, the methods provided herein (such as methods of enhancing an immune response) are useful for the treatment of cancer and as such an individual receiving the synthetic pathway activator described herein has cancer. In some embodiments, the cancer is a solid cancer. In some embodiments, the cancer is immunoevasive. In some embodiments, the cancer is immunoresponsive. In some embodiments, the cancer is immunoresponsive. In particular embodiments, the cancer is kidney cancer, renal cell carcinoma, clear cell renal cell carcinoma (ccRcc), colorectal cancer, or lung cancer. In some embodiments, the cancer is mesothelioma.

    [0259] In another aspect, the disclosure provides methods of inhibiting (e.g., killing, disabling, or preventing growth or expansion) a target cell that expresses both of the CAR antigen and the priming receptor antigen. In another aspect, the invention provides methods of killing a target cell that expresses both of the CAR antigen and the priming receptor antigen. In some embodiments, the target cell is a cancer cell.

    [0260] In some embodiments, the treatment results in a decrease in the cancer volume or size. In some embodiments, the treatment is effective at reducing a cancer volume as compared to the cancer volume prior to administration of the antibody. In some embodiments, the treatment results in a decrease in the cancer growth rate. In some embodiments, the treatment is effective at reducing a cancer growth rate as compared to the cancer growth rate prior to administration of the antibody. In some embodiments, the treatment is effective at eliminating the cancer.

    [0261] In some embodiments, the CAR antigen and/or the priming receptor antigen are expressed at a higher level in the cancer as compared to a non-cancer cell. Levels of CAR antigen and/or the priming receptor antigen can be assessed by any technique known in the field, including, but not limited to, protein assays or nucleic assays such as FACS, Western blot, ELISA, immunoprecipitation, immunohistochemistry, immunofluorescence, radioimmunoassay, dot blotting, immunodetection methods, HPLC, surface plasmon resonance, optical spectroscopy, mass spectrometry, HPLC, qPCR, RT-qPCR, multiplex qPCR or RT-qPCR, RNA-seq, microarray analysis, SAGE, MassARRAY technique, and FISH, and combinations thereof.

    Method of Immune Modulation

    [0262] Methods of administration of a cell comprising a synthetic pathway activator described herein can result in modulation of an immune response. Modulation can be an increase or decrease in an immune response. In some embodiments, modulation is an increase in an immune response.

    [0263] In one aspect, administration of a cell comprising a synthetic pathway activator described herein can result in induction of pro-inflammatory molecules, such as cytokines or chemokines. Generally, induced pro-inflammatory molecules are present at levels greater than that achieved with isotype control. Such pro-inflammatory molecules in turn result in activation of anti-tumor immunity, including, but not limited to, T cell activation, T cell proliferation, T cell differentiation, MI-like macrophage activation, and NK cell activation. Thus, the administration of a cell comprising a synthetic pathway activator described herein can induce multiple anti-tumor immune mechanisms that lead to tumor destruction. In some embodiments, the immune activity of the cell is cytolytic activity.

    [0264] In another aspect, provided herein are methods of increasing an immune response in an individual comprising administering to the individual an effective amount of a cell comprising a synthetic pathway activator described herein. In some embodiments, the method of increasing an immune response in a subject comprises administering to the subject a cell comprising a synthetic pathway activator described herein.

    [0265] In some embodiments, the cell is present in a pharmaceutical composition further comprising a pharmaceutically acceptable excipient.

    [0266] In any and all aspects of increasing an immune response as described herein, any increase or decrease or alteration of an aspect of characteristic(s) or function(s) is as compared to a cell not comprising a composition comprising a synthetic pathway activator described herein.

    [0267] Increasing an immune response can be both enhancing an immune response or inducing an immune response. For instance, increasing an immune response encompasses both the start or initiation of an immune response, or ramping up or amplifying an on-going or existing immune response. In some embodiments, the treatment induces an immune response. In some embodiments, the induced immune response is an adaptive immune response. In some embodiments, the induced immune response is an innate immune response. In some embodiments, the treatment enhances an immune response. In some embodiments, the enhanced immune response is an adaptive immune response. In some embodiments, the enhanced immune response is an innate immune response. In some embodiments, the treatment increases an immune response. In some embodiments, the increased immune response is an adaptive immune response. In some embodiments, the increased immune response is an innate immune response. In some embodiments, the immune response is started or initiated by administration of a cell comprising a synthetic pathway activator described herein. In some embodiments, the immune response is enhanced by administration of cell comprising a synthetic pathway activator described herein. In some embodiments, the immune response is enhanced by administration of cell comprising a synthetic pathway activator and a priming receptor and CAR system described herein.

    [0268] In another aspect, the present application provides methods of genetically editing a cell with a synthetic pathway activator described herein, which results in the modulation of the immune function of the cell. The modulation can be increasing an immune response. In some embodiments, the modulation is an increase in immune function. In some embodiments, the modulation of function leads to the expression of cytokine or interleukin. In some embodiments, the modulation of function leads to the activation of an immune cell.

    [0269] In some embodiments, the cell is a natural killer (NK) cell, a T cell, a CD8+ T cell, a CD4+ T cell, a primary T cell, or a T cell progenitor.

    [0270] In some embodiments, the modulation of function of the cells comprising a synthetic pathway activator as described herein leads to an increase in the cells' abilities to stimulate both native and activated T-cells, for example, by increasing cytokine or chemokine secretion by the cells expressing the synthetic pathway activator described herein. In some embodiments, the modulation of function enhances or increases the cells' ability to produce cytokines, chemokines, CARs, or costimulatory or activating receptors. In some embodiments, the modulation increases the T-cell stimulatory function of the cells expressing a synthetic pathway activator described herein, including, for example, the cells' abilities to trigger T-cell receptor (TCR) signaling, T-cell proliferation, or T-cell cytokine production.

    [0271] In some embodiments, the increased immune response is secretion of cytokines and chemokines. In some embodiments, a synthetic pathway activator described herein induces increased expression of at least one cytokine or chemokine in a cell as compared to an isotype control cell. In some embodiments, the at least one cytokine or chemokine is selected from the group consisting of: IL-2 and IFN. In some embodiments, the cytokine or chemokine is IL-2. In some embodiments, the cytokine or chemokine is IFN. In some embodiments, the cytokine or chemokine secretion is increased a between bout 1-100-fold 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 1-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, or 90-100 fold as compared to an untreated cell or a cell treated with an isotype control antibody. In some embodiments, the chemokine is IL-2 and the secretion is increased between about 1-100-fold, 1-fold, 5-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold, 1-10-fold, 10-20-fold, 20-30-fold, 30-40-fold, 40-50-fold, 50-60-fold, 60-70-fold, 70-80-fold, 80-90-fold, or 90-100-fold as compared to an untreated cell or a cell treated with an isotype control antibody. In some embodiments, the cytokine is IFN and the secretion is increased between about 1-100-fold, 1-fold, 5-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold, 1-10-fold, 10-20-fold, 20-30-fold, 30-40-fold, 40-50-fold, 50-60-fold, 60-70-fold, 70-80-fold, 80-90-fold, or 90-100-fold as compared to an untreated cell or a cell treated with an isotype control antibody.

    [0272] In some embodiments, the enhanced immune response is anti-tumor immune cell recruitment and activation.

    [0273] In some embodiments, the cell expressing a synthetic pathway activator described herein induces a memory immune response as compared to an isotype control cell. In general, a memory immune response is a protective immune response upon a subsequent exposure to pathogens or antigens that the immune system encountered previously. Exemplary memory immune responses include the immune response after infection or vaccination with an antigen. In general, memory immune responses are mediated by lymphocytes such as T cells or B cells. In some embodiments, the memory immune response is a protective immune response to cancer, including cancer cell growth, proliferation, or metastasis. In some embodiments, the memory immune response inhibits, prevents, or reduces cancer cell growth, proliferation, or metastasis.

    Methods of Editing Cells

    [0274] The terms gene editing or genome editing, as used herein, refer to a type of genetic manipulation in which DNA is inserted, replaced, or removed from the genome using artificially manipulated nucleases or molecular scissors. It is a useful tool for elucidating the function and effect of sequence-specific genes or proteins or altering cell behavior (e.g. for therapeutic purposes).

    [0275] Currently available genome editing tools include zinc finger nucleases (ZFN) and transcription activator-like effector nucleases (TALENs) to incorporate genes at safe harbor loci (.e.g. the adeno-associated virus integration site 1 (AAVS1) safe harbor locus). The DICE (dual integrase cassette exchange) system utilizing phiC31 integrase and Bxb1 integrase is a tool for target integration. Additionally, clustered regularly interspaced short palindromic repeat/Cas9 (CRISPR/Cas9) techniques can be used for targeted gene insertion.

    [0276] Site specific gene editing approaches can include homology dependent mechanisms or homology independent mechanisms.

    [0277] All methods known in the art for targeted insertion of gene sequences are contemplated in the methods described herein to insert constructs at gene targets or safe harbor loci.

    [0278] Provided herein are methods of inserting nucleotide sequences greater than about 5 kilobases in length into the genome of a cell, in the absence of a viral vector. In some embodiments, the nucleotide sequence greater than about 5 kilobase in length can be inserted into the genome of a primary immune cell, in the absence of a viral vector

    [0279] Integration of large nucleic acids, for example nucleic acids greater than 5 kilobase in size, into cells, can be limited by low efficiency of integration, off-target effects and/or loss of cell viability. Described herein are methods and compositions for achieving integration of a nucleotide sequence, for example, a nucleotide sequence greater than about 5 kilobases in size, into the genome of a cell. In some methods the efficiency of integration is increased, off-target effects are reduced and/or loss of cell viability is reduced.

    [0280] The plasmid can be introduced into an immune cell with a nuclease, such as a CRISPR-associated system (Cas). The nuclease can be introduced in a ribonucleoprotein format with a guide RNA (gRNA) that targets a specific site on the genome of the immune cell. The nuclease cuts the genomic DNA at this specific site. The specific site may be a portion of the genome that encodes an endogenous immune cell receptor. Thus, cutting the genome at this site will cause the immune cell to no longer express an endogenous immune cell receptor.

    [0281] The plasmid may include 5 and 3 homology-directed repair arms complementary to sequences at a specific site on the genome of the immune cell. The complementary sequences are on either side of the site cut by the nuclease, which allows the plasmid to be incorporated at a specified insertion site on the immune cell's genome. Once the plasmid is incorporated, the cell will express the SPA peptide. In examples where the SPA peptide is also co-expressed with a system comprising a priming receptor and CAR, the priming receptor is also expressed by the cell. However, as explained, the design of the transgene cassette ensures that non-virally delivered circuit system receptors do not express CAR until the priming receptor binds to its cognate ligand and releases the cleavable transcription factor.

    [0282] Initially, a T cell is activated. The T cell may be obtained from a patient. Thus, the present disclosure provides methods in which immune cells, such as T cells, are harvested from a patient. Then, the plasmid that encodes the CAR and priming receptor are introduced into a T cell. Advantageously, the plasmids of the present disclosure can be introduced using electroporation. When introducing the plasmid via electroporation, the nuclease may also be introduced. By using electroporation, methods of the present disclosure avoid the use of viral vectors for introducing transgenes, which is a known bottleneck in immune cell engineering. The T cells are then expanded and co-cultured to create a sufficient quantity of engineered immune cells to be used as a therapeutic treatment.

    [0283] Methods for editing the genome of a cell can include a) providing a Cas9 ribonucleoprotein complex (RNP)-DNA template complex comprising: (i) the RNP, wherein the RNP comprises a Cas9 nuclease domain and a guide RNA, wherein the guide RNA specifically hybridizes to a target region of the genome of the cell, and wherein the Cas9 nuclease domain cleaves the target region to create an insertion site in the genome of the cell; and (ii) a double-stranded or single-stranded DNA template, wherein the size of the DNA template is greater than about 200 nucleotides, wherein the 5 and 3 ends of the DNA template comprise nucleotide sequences that are homologous to genomic sequences flanking the insertion site, and wherein the molar ratio of RNP to DNA template in the complex is from about 3:1 to about 100:1; and b) introducing the RNP-DNA template complex into the cell.

    [0284] In some embodiments, the methods described herein provide an efficiency of delivery of the RNP-DNA template complex of at least about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, 99.5%, 99%, or higher. In some cases, the efficiency is determined with respect to cells that are viable after introducing the RNP-DNA template into the cell. In some cases, the efficiency is determined with respect to the total number of cells (viable or non-viable) in which the RNP-DNA template is introduced into the cell.

    [0285] As another example, the efficiency of delivery can be determined by quantifying the number of genome edited cells in a population of cells (as compared to total cells or total viable cells obtained after the introducing step). Various methods for quantifying genome editing can be utilized. These methods include, but are not limited to, the use of a mismatch-specific nuclease, such as T7 endonuclease I; sequencing of one or more target loci (e.g., by sanger sequencing of cloned target locus amplification fragments); and high-throughput deep sequencing.

    [0286] In some embodiments, loss of cell viability is reduced as compared to loss of cell viability after introduction of naked DNA into a cell or introduction of DNA into a cell using a viral vector. The reduction can be a reduction of at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or any percentage in between these percentages. In some embodiments, off-target effects of integration are reduced as compared to off-target integration after introduction of naked DNA into a cell or introduction of DNA into a cell using a viral vector. The reduction can be a reduction of at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or any percentage in between these percentages.

    [0287] In some cases, the methods described herein provide for high cell viability of cells to which the RNP-DNA template has been introduced. In some cases, the viability of the cells to which the RNP-DNA template has been introduced is at least about 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, 99.5%, 99%, or higher. In some cases, the viability of the cells to which the RNP-DNA template has been introduced is from about 20% to about 99%, from about 30% to about 90%, from about 35% to about 85% or 90% or higher, from about 40% to about 85% or 90% or higher, from about 50% to about 85% or 90% or higher, from about 50% to about 85% or 90% or higher, from about 60% to about 85% or 90% or higher, or from about 70% to about 85% or 90% or higher.

    [0288] In the methods provided herein, the molar ratio of RNP to nucleic acid (e.g., DNA template) can be from about 3:1 to about 100:1. For example, the molar ratio can be from about 3:1 to 10:1, from about 3:1 to about 15:1, 3:1 to about 20:1; 3:1 to about 25:1; from about 3:1 to 50:1, from about 3:1 to 75:1, from about 3:1 to 100:1; from about 5:1 to 10:1, from about 5:1 to about 15:1, 5:1 to about 20:1; 5:1 to about 25:1; from about 5:1 to 50:1, from about 5:1 to 75:1, from about 5:1 to 100:1; from about 8:1 to about 12:1; from about 8:1 to about 15:1, from about 8:1 to about 20:1, from about 8:1 to about 25:1, from about 8:1 to 50:1, from about 8:1 to 75:1, from about 8:1 to 100:1; from about 10:1 to about 15:1, 10:1 to about 20:1, 10:1 to about 25:1; from about 10:1 to 50:1, from about 10:1 to 75:1, or from about 10:1 to 100:1.

    [0289] In some embodiments, the DNA template is at a concentration of about 2.5 pM to about 25 pM. For example, the concentration of DNA template can be about 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, 10, 10.5, 11, 11.5, 12, 12.5, 13, 13.5, 14, 14.5, 15, 15.5, 16, 16.5, 17, 17.5, 18, 18.5, 19, 19.5, 20, 20.5, 21, 21.5, 22, 22.5, 23, 23.5, 24, 24.5, 25 pM or any concentration in between these concentrations.

    [0290] In some embodiments, the size or length of the nucleic acid (e.g., DNA template) is greater than about 4.5 kb, 5.0 kb, 5.1 kb, 5.2 kb, 5.3 kb, 5.4 kb, 5.5 kb, 5.6 kb, 5.7 kb, 5.8 kb, 5.9 kb, 6.0 kb, 6.1 kb, 6.2 kb, 6.3 kb, 6.4 kb, 6.5 kb, 6.6 kb, 6.7 kb, 6.8 kb, 6.9 kb, 7.0 kb, 7.1 kb, 7.2 kb, 7.3 kb, 7.4 kb, 7.5 kb, 7.6 kb, 7.7 kb, 7.8 kb, 7.9 kb, 8.0 kb, 8.1 kb, 8.2 kb, 8.3 kb, 8.4 kb, 8.5 kb, 8.6 kb, 8.7 kb, 8.8 kb, 8.9 kb, 9.0 kb, 9.1 kb, 9.2 kb, 9.3 kb, 9.4 kb, 9.5 kb, 9.6 kb, 9.7 kb, 9.8 kb, 9.9 kb, 10 kb, 11 kb, 12 kb, 13 kb, 14 kb, 15 kb, or 16 kb or any size of nucleic acid (e.g., DNA template) in between these sizes. For example, the size of the DNA template can be about 4.5 kb to about 15 kb, about 4.5 kb to about 14 kb, about 4.5 kb to about 10 kb, about 5 kb to about 15 kb, about 5 kb to about 14 kb, about 5 kb to about 10 kb, about 5 kb to about 9 kb, about 5 kb to about 8 kb, about 5 kb to about 7 kb, about 5 kb to about 6 kb, about kb 6 to about 15 kb, about kb 6 to about 14 kb, about kb 6 to about 10 kb, about 6 kb to about 9 kb, about 6 kb to about 8 kb, about 6 kb to about 7 kb, about 7 kb to about 15 kb, about 7 kb to about 14 kb, about 7 kb to about 10 kb, about 7 kb to about 9 kb, about 7 kb to about 8 kb, about 8 kb to about 15 kb, about 8 kb to about 14 kb, about 8 kb to about 10 kb, about 8 kb to about 9 kb, about 9 kb to about 15 kb, about 9 kb to about 14 kb, about 9 kb to about 13 kb, about 9 kb to about 12 kb, about 9 kb to about 11 kb, about 9 kb to about 10 kb, about 10 kb to about 15 kb, about 10 kb to about 14 kb, about 10 kb to about 13 kb, about 10 kb to about 12 kb, or about 10 kb to about 11 kb.

    [0291] In some embodiments, the amount of DNA template is about 1 g to about 10 g. For example, the amount of DNA template can be about 1 g to about 2 g, about 1 g to about 3 g, about 1 g to about 4 g, about 1 g to about 5 g, about 1 g to about 6 g, about 1 g to about 7 g, about 1 g to about 8 g, about 1 g to about 9 g, about 1 g to about 10 g. In some embodiments the amount of DNA template is about 2 g to about 3 g, about 2 g to about 4 g, about 2 g to about 5 g, about 2 g to about 6 g, about 2 g to about 7 g, about 2 g to about 8 g, about 2 g to about 9 g, or 2 g to about 10 g. In some embodiments the amount of DNA template is about 3 g to about 4 g, about 3 g to about 5 g, about 3 g to about 6 g, about 3 g to about 7 g, about 3 g to about 8 g, about 3 g to about 9 g, or about 3 g to about 10 g. In some embodiments, the amount of DNA template is about 4 g to about 5 g, about 4 g to about 6 g, about 4 g to about 7 g, about 4 g to about 8 g, about 4 g to about 9 g, or about 4 g to about 10 g. In some embodiments, the amount of DNA template is about 5 g to about 6 g, about 5 g to about 7 g, about 5 g to about 8 g, about 5 g to about 9 g, or about 5 g to about 10 g. In some embodiments, the amount of DNA template is about 6 g to about 7 g, about 6 g to about 8 g, about 6 g to about 9 g, or about 6 g to about 10 g. In some embodiments, the amount of DNA template is about 7 g to about 8 g, about 7 g to about 9 g, or about 7 g to about 10 g. In some embodiments, the amount of DNA template is about 8 g to about 9 g, or about 8 g to about 10 g. In some embodiments, the amount of DNA template is about 9 g to about 10 g.

    [0292] In some cases, the size of the DNA template is large enough and in sufficient quantity to be lethal as naked DNA. In some embodiments, the DNA template encodes a heterologous protein or a fragment thereof. In some embodiments, the DNA template encodes at least one gene. In some embodiments, the DNA template encodes at least two genes. In some embodiments, the DNA template encodes one, two, three, four, five, six, seven, eight, nine, ten, or more genes.

    [0293] In some embodiments, the DNA template includes regulatory sequences, for example, a promoter sequence and/or an enhancer sequence to regulate expression of the heterologous protein or fragment thereof after insertion into the genome of a cell.

    [0294] In some cases, the DNA template is a linear DNA template. In some cases, the DNA template is a single-stranded DNA template. In some cases, the single-stranded DNA template is a pure single-stranded DNA template. As used herein, by pure single-stranded DNA is meant single-stranded DNA that substantially lacks the other or opposite strand of DNA. By substantially lacks is meant that the pure single-stranded DNA lacks at least 100-fold more of one strand than another strand of DNA.

    [0295] In some cases, the RNP-DNA template complex is formed by incubating the RNP with the DNA template for less than about one minute to about thirty minutes, at a temperature of about 20 C. to about 25 C. For example, the RNP can be incubated with the DNA template for about 5 seconds, 10 seconds, 15 seconds, 20 seconds, 25 seconds, 30 seconds, 35 seconds, 40 seconds, 45 seconds, 50 seconds, 55 seconds, 1 minute, 2 minutes, 3 minutes, 4 minutes, 5 minutes, 6 minutes, 7 minutes, 8 minutes, 9 minutes, 10 minutes, 11 minutes, 12 minutes, 13 minutes, 14 minutes, 15 minutes, 16 minutes, 17 minutes, 18 minutes, 19 minutes, 20 minutes, 21 minutes, 22 minutes, 23 minutes, 24 minutes, 25 minutes, 26 minutes, 27 minutes, 28 minutes, 29 minutes or 30 minutes or any amount of time in between these times, at a temperature of about 20 C., 21 C., 22 C., 23 C., 24 C., or 25 C. In another example, the RNP can be incubated with the DNA template for less than about one minute to about one minute, for less than about one minute to about 5 minutes, for less than about 1 minute to about 10 minutes, for about 5 minutes to 10 minutes, for about 5 minutes to 15 minutes, for about 10 to about 15 minutes, for about 10 minutes to about 20 minutes, or for about 10 minutes to about 30 minutes, at a temperature of about 20 C. to about 25 C. In some embodiments, the RNP-DNA template complex and the cell are mixed prior to introducing the RNP-DNA template complex into the cell.

    [0296] In some embodiments introducing the RNP-DNA template complex comprises electroporation. Methods, compositions, and devices for electroporating cells to introduce a RNP-DNA template complex can include those described in the examples herein. Additional or alternative methods, compositions, and devices for electroporating cells to introduce a RNP-DNA template complex can include those described in WO/2006/001614 or Kim, J. A. et al. Biosens. Bioelectron. 23, 1353-1360 (2008). Additional or alternative methods, compositions, and devices for electroporating cells to introduce a RNP-DNA template complex can include those described in U.S. Patent Appl. Pub. Nos. 2006/0094095; 2005/0064596; or 2006/0087522. Additional or alternative methods, compositions, and devices for electroporating cells to introduce a RNP-DNA template complex can include those described in Li, L. H. et al. Cancer Res. Treat. 1, 341-350 (2002); U.S. Pat. Nos. 6,773,669; 7,186,559; 7,771,984; 7,991,559; 6,485,961; 7,029,916; and U.S. Patent Appl. Pub. Nos: 2014/0017213; and 2012/0088842, all of which are hereby incorporated by reference. Additional or alternative methods, compositions, and devices for electroporating cells to introduce a RNP-DNA template complex can include those described in Geng, T. et al., J. Control Release 144, 91-100 (2010); and Wang, J., et al. Lab. Chip 10, 2057-2061 (2010), all of which are hereby incorporated by reference.

    [0297] In some embodiments, the Cas9 protein can be in an active endonuclease form, such that when bound to target nucleic acid as part of a complex with a guide RNA or part of a complex with a DNA template, a double strand break is introduced into the target nucleic acid. The double strand break can be repaired by NHEJ to introduce random mutations, or HDR to introduce specific mutations. Various Cas9 nucleases can be utilized in the methods described herein. For example, a Cas9 nuclease that requires an NGG protospacer adjacent motif (PAM) immediately 3 of the region targeted by the guide RNA can be utilized. Such Cas9 nucleases can be targeted to any region of a genome that contains an NGG sequence. As another example, Cas9 proteins with orthogonal PAM motif requirements can be utilized to target sequences that do not have an adjacent NGG PAM sequence. Exemplary Cas9 proteins with orthogonal PAM sequence specificities include, but are not limited to, CFP1, those described in Nature Methods 10, 1116-1121 (2013), and those described in Zetsche et al., Cell, Volume 163, Issue 3, p759-771, 22 Oct. 2015, both of which are hereby incorporated by reference.

    [0298] In some cases, the Cas9 protein is a nickase, such that when bound to target nucleic acid as part of a complex with a guide RNA, a single strand break or nick is introduced into the target nucleic acid. A pair of Cas9 nickases, each bound to a structurally different guide RNA, can be targeted to two proximal sites of a target genomic region and thus introduce a pair of proximal single stranded breaks into the target genomic region. Nickase pairs can provide enhanced specificity because off-target effects are likely to result in single nicks, which are generally repaired without lesion by base-excision repair mechanisms. Exemplary Cas9 nickases include Cas9 nucleases having a D10A or H840A mutation.

    [0299] In some embodiments, the RNP comprises a Cas9 nuclease. In some embodiments, the RNP comprises a Cas9 nickase. In some embodiments, the RNP-DNA template complex comprises at least two structurally different RNP complexes. In some embodiments, the at least two structurally different RNP complexes contain structurally different Cas9 nuclease domains In some embodiments, the at least two structurally different RNP complexes contain structurally different guide RNAs. In some embodiments, wherein the at least two structurally different RNP complexes contain structurally different guide RNAs, each of the structurally different RNP complexes comprises a Cas9 nickase, and the structurally different guide RNAs hybridize to opposite strands of the target region.

    [0300] In some cases, a plurality of RNP-DNA templates comprising structurally different ribonucleoprotein complexes is introduced into the cell. For example a Cas9 protein can be complexed with a plurality (e.g., 2, 3, 4, 5, or more, e.g., 2-10, 5-100, 20-100) of structurally different guide RNAs to target insertion of a DNA template at a plurality of structurally different target genomic regions.

    [0301] In the methods and compositions provided herein, cells include, but are not limited to, eukaryotic cells, prokaryotic cells, animal cells, plant cells, fungal cells and the like. Optionally, the cell is a mammalian cell, for example, a human cell. The cell can be in vitro, ex vivo or in vivo. The cell can also be a primary cell, a germ cell, a stem cell or a precursor cell. The precursor cell can be, for example, a pluripotent stem cell, or a hematopoietic stem cell. In some embodiments, the cell is a primary hematopoietic cell or a primary hematopoietic stem cell. In some embodiments, the primary hematopoietic cell is an immune cell. In some embodiments, the immune cell is a T cell. In some embodiments, the T cell is a regulatory T cell, an effector T cell, or a nave T cell. In some embodiments, the T cell is a CD4.sup.+ T cell. In some embodiments, the T cell is a CD8.sup.+ T cell. In some embodiments, the T cell is a CD4.sup.+CD8.sup.+ T cell. In some embodiments, the T cell is a CD4.sup.CD8.sup.T cell. Populations of any of the cells modified by any of the methods described herein are also provided. In some embodiments, the methods further comprise expanding the population of modified cells.

    [0302] In some cases, the cells are removed from a subject, modified using any of the methods described herein and administered to the patient. In other cases, any of the constructs described herein is delivered to the patient in vivo. See, for example, U.S. Pat. No. 9,737,604 and Zhang et al. Lipid nanoparticle-mediated efficient delivery of CRISPR/Cas9 for tumor therapy, NPG Asia Materials Volume 9, page e441 (2017), both of which are hereby incorporated by reference.

    [0303] In some embodiments, the RNP-DNA template complex is introduced into about 110.sup.5 to about 210.sup.6 cells. For example, the RNP-DNA template complex can be introduced into about 110.sup.5 to about 510.sup.5 cells, about 110.sup.5 to about 110.sup.6, 110.sup.5 to about 1.510.sup.6, 110.sup.5 to about 210.sup.6, about 110.sup.6 to about 1.510.sup.6 cells or about 110.sup.6 to about 210.sup.6.

    [0304] In some cases, the methods and compositions described herein can be used for generation, modification, use, or control of recombinant T cells, such as chimeric antigen receptor T cells (CAR T cells). Such CAR T cells can be used to treat or prevent cancer, an infectious disease, or autoimmune disease in a subject. For example, in some embodiments, one or more gene products are inserted or knocked-in to a T cell to express a heterologous protein (e.g., a chimeric antigen receptor (CAR) or a priming receptor).

    Insertion Sites

    [0305] Methods for editing the genome of a T cell, specifically, include a method of editing the genome of a human T cell comprise inserting a nucleic acid sequence or construct into a target region in exon 1 of the TCR- subunit (TRAC) gene in the human T cell. In some embodiments, the target region is in exon 1 of the constant domain of TRAC gene. In other embodiments, the target region is in exon 1, exon 2 or exon 3, prior to the start of the sequence encoding the TCR- transmembrane domain. In some embodiments, the target region is the GS94 genomic safe harbor.

    [0306] Methods for editing the genome of a T cell also include a method of editing the genome of a human T cell comprise inserting a nucleic acid sequence or construct into a target region in exon 1 of a TCR- subunit (TRBC) gene in the human T cell. In some embodiments, the target region is in exon 1 of the TRBC1 or TRBC2 gene.

    [0307] Methods for editing the genome of a T cell, specifically, include a method of editing the genome of a human T cell comprise inserting a nucleic acid sequence or construct into a target region of a genomic safe harbor (GSH).

    [0308] Methods for editing the genome of a T cell also include a method of editing the genome of a human T cell comprise inserting a nucleic acid sequence or construct into a GS94 target region (locus chr11: 128340000-128350000).

    [0309] In some embodiments, the target region is the GS94 locus.

    [0310] Gene editing therapies include, for example, vector integration and site specific integration. Site-specific integration is a promising alternative to random integration of viral vectors, as it mitigates the risks of insertional mutagenesis or insertional oncogenesis (Kolb et al. Trends Biotechnol. 2005 23:399-406; Porteus et al. Nat Biotechnol. 2005 23:967-973; Paques et al. Curr Gen Ther. 2007 7:49-66). However, site specific integration continues to face challenges such as poor knock-in efficiency, risk of insertional oncogenesis, unstable and/or anomalous expression of adjacent genes or the transgene, low accessibility (e.g. within 20 KB of adjacent genes), etc., These challenges can be addressed, in part, through the identification and use of safe harbor loci or safe harbor sites (SHS), which are sites in which genes or genetic elements can be incorporated without disruption to expression or regulation of adjacent genes.

    [0311] The most widely used of the putative human safe harbor sites is the AAVS1 site on chromosome 19q, which was initially identified as a site for recurrent adenoassociated virus insertion. Other potential SHS have been identified on the basis of homology, with sites first identified in other species (e.g., the human homolog of the permissive murine Rosa26 locus) or among the growing number of human genes that appear non-essential under some circumstances. One putative SHS of this type is the CCR5 chemokine receptor gene, which, when disrupted, confers resistance to human immunodeficiency virus infection. Additional potential genomic SHS have been identified in human and other cell types on the basis of viral integration site mapping or gene-trap analyses, as was the original murine Rosa26 locus. The three top SHS, AAVS1, CCR5, and Rosa26, are in close proximity to many protein coding genes and regulatory elements. (See Sadelain, M., et al. (2012). Safe harbours for the integration of new DNA in the human genome. Nature reviews Cancer, 12 (1), 51-58, the relevant disclosures of which are herein incorporated by reference in their entirety).

    [0312] The AAVS1 (also known as the PPP1R12C locus) on human chromosome 19 is a known SHS for hosting transgenes (e.g. DNA transgenes) with expected function. It is at position 19q13.42. It has an open chromatin structure and is transcription-competent. The canonical SHS locus for AAVS1 is chr19: 55,625,241-55,629,351. See Pellenz et al. New Human Chromosomal Sites with Safe Harbor Potential for Targeted Transgene Insertion. Human gene therapy vol. 30, 7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference. An exemplary AAVS1 target gRNA and target sequence are provided below:

    TABLE-US-00002 AAVS1-gRNAsequence: (SEQIDNO:116) ggggccactagggacaggatGTTTTAGAGCTAGAAATAGCAAGTTAAAA TAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTT TTTTT AAVS1targetsequence: (SEQIDNO:117) ggggccactagggacaggat.

    [0313] CCR5, which is located on chromosome 3 at position 3p21.31, encodes the major co-receptor for HIV-1. Disruption at this site in the CCR5 gene has been beneficial in HIV/AIDS therapy and prompted the development of zinc-finger nucleases that target its third exon. The canonical SHS locus for CCR5 is chr3: 46,414,443-46,414,942. See Pellenz et al. New Human Chromosomal Sites with Safe Harbor Potential for Targeted Transgene Insertion. Human gene therapy vol. 30, 7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.

    [0314] The mouse Rosa26 locus is particularly useful for genetic modification as it can be targeted with high efficiency and is expressed in most cell types tested. Irion et al. 2007 (Identification and targeting of the ROSA26 locus in human embryonic stem cells. Nature biotechnology 25.12 (2007): 1477-1482, the relevant disclosure of which are herein incorporated by reference) identified the human homolog, human ROSA26, in chromosome 3 (position 3p25.3). The canonical SHS locus for human Rosa26 (hRosa26) is chr3: 9,415,082-9,414,043. See Pellenz et al. New Human Chromosomal Sites with Safe Harbor Potential for Targeted Transgene Insertion. Human gene therapy vol. 30,7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference.

    [0315] Additional examples of safe harbor sites are provided in Pellenz et al. New Human Chromosomal Sites with Safe Harbor Potential for Targeted Transgene Insertion. Human gene therapy vol. 30, 7 (2019): 814-828, the relevant disclosures of which are herein incorporated by reference. Examples of additional integration sites are provided in Table D.

    [0316] In some embodiments, the safe harbor sites allow for high transgene expression (sufficient to allow for transgene functionality or treatment of a disease of interest) and stable expression of the transgene over several days, weeks or months. In some embodiments, knockout of the gene at the safe harbor locus confers benefit to the function of the cell, or the gene at the safe harbor locus has no known function within the cell. In some embodiments the safe harbor locus results in stable transgene expression in vitro with or without CD3/CD28 stimulation, negligible off-target cleavage as detected by iGuide-Seq or CRISPR-Seq, less off-target cleavage relative to other loci as detected by iGuide-Seq or CRISPR-Seq, negligible transgene-independent cytotoxicity, negligible transgene-independent cytokine expression, negligible transgene-independent chimeric antigen receptor expression, negligible deregulation or silencing of nearby genes, and positioned outside of a cancer-related gene.

    [0317] As used, a nearby gene can refer to a gene that is within about 100 KB, about 125 kB, about 150 kB, about 175 kB, about 200 kB, about 225 kB, about 250 kB, about 275 kB, about 300 KB, about 325 kB, about 350 KB, about 375 KB, about 400 kB, about 425 kB, about 450 kB, about 475 kB, about 500 KB, about 525 kB, about 550 kB away from the safe harbor locus (integration site).

    [0318] In some embodiments, the present disclosure contemplates inserts that comprise one or more transgenes. The transgene can encode a therapeutic protein, an antibody, a peptide, or any other gene of interest. The transgene integration can result in, for example, enhanced therapeutic properties. These enhanced therapeutic properties, as used herein, refer to an enhanced therapeutic property of a cell when compared to a typical immune cell of the same normal cell type. For example, a T cell having enhanced therapeutic properties has an enhanced, improved, and/or increased treatment outcome when compared to a typical, unmodified and/or naturally occurring T cell. The therapeutic properties of immune cells can include, but are not limited to, cell transplantation, transport, homing, viability, self-renewal, persistence, immune response control and regulation, survival, and cytotoxicity. The therapeutic properties of immune cells are also manifested by: antigen-targeted receptor expression; HLA presentation or lack thereof; tolerance to the intratumoral microenvironment; induction of bystander immune cells and immune regulation; improved target specificity with reduction; resistance to treatments such as chemotherapy.

    [0319] As used herein, the term insert size refers to the length of the nucleotide sequence being integrated (inserted) at the target locus or safe harbor site. In some embodiments, the insert size comprises at least about 4.5 kilobasepairs (kb) to about 10 kilobasepairs (kb). In some embodiments, the insert size comprises about 5000 nucleotides or more basepairs. In some embodiments, the insert size comprises up to 4.5, 4.8, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp (kilo basepairs) or the sizes in between. In some embodiments, the insert size is greater than 4.5, 4.8, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 kbp or the sizes in between. In some embodiments, the insert size is within the range of 4.5-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 4.8-8.3 kbp or is any number in that range. In some embodiments, the insert size is within the range of 5-8.3 kbp or is any number in that range. In some embodiments, the insert size is within the range of 5-15 kbp or is any number in that range. In some embodiments, the insert size is within the range of 4.5-20 kbp or is any number in that range. In some embodiments, the insert size is 5-10 kbp. In some embodiments, the insert size is 4.5-10, 5-10, 6-10, 7-10, 8-10, 9-10 kbp. In some embodiments, the insert size is 4.5-11, 6-11, 7-11, 8-11, 9-11, or 10-11 kbp. In some embodiments, the insert size is 4.5-12, 6-12, 7-12, 8-12, 9-12, 10-12, or 11-12 kbp. In some embodiments, the insert size is 4.5-13, 6-13, 7-13, 8-13, 9-13, 10-13, 11-13, or 12-13 kbp. In some embodiments, the insert size is 4.5-14, 6-14, 7-14, 8-14, 9-14, 10-14, 11-14, 12-14 or 13-14 kbp. In some embodiments, the insert size is 4.5-15, 6-15, 7-15, 8-15, 9-15, 10-15, 11-15, 12-15, 13-15, or 14-15 kbp. In some embodiments, the insert size is 4.5-16, 6-16, 7-16, 8-16, 9-16, 10-16, 11-16, 12-16, 13-16, 14-16 or 15-16 kbp. In some embodiments, the insert size is 4.5-17, 6-17, 7-17, 8-17, 9-17, 10-17, 11-17, 12-17, 13-17, or 14-17, 15-17 or 16-17 kbp. In some embodiments, the insert size is 4.5-18, 6-18, 7-18, 8-18, 9-18, 10-18, 11-18, 12-18, 13-18, 14-18, 15-18, 16-18 or 17-18 kbp. In some embodiments, the insert size is 4.5-19, 6-19, 7-19, 8-19, 9-19, 10-19, 11-19, 12-19, 13-19, 14-19, 15-19, 16-19, 17-19, or 18-19 kbp. In some embodiments, the insert size is 4.5-20, 6-20, 7-20, 8-20, 9-20, 10-20, 11-20, 12-20, 13-20, 14-20, 15-20, 16-20, 17-20, 18-20, or 19-20 kbp.

    [0320] The inserts of the present disclosure refer to nucleic acid molecules or polynucleotide inserted at a target locus or safe harbor site. In some embodiments, the nucleotide sequence is a DNA molecule, e.g., genomic DNA, or comprises deoxy-ribonucleotides. In some embodiments, the insert comprises a smaller fragment of DNA, such as a plastid DNA, mitochondrial DNA, or DNA isolated in the form of a plasmid, a fosmid, a cosmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and/or any other sub-genome segment of DNA. In some embodiments, the insert is an RNA molecule or comprises ribonucleotides. The nucleotides in the insert are contemplated as naturally occurring nucleotides, non-naturally occurring, and modified nucleotides. Nucleotides may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications. The polynucleotides can be in any topological conformation, including single-stranded, double-stranded, partially duplexed, triplexed, hairpinned, circular conformations, and other three-dimension conformations contemplated in the art.

    [0321] The inserts can have coding and/or non-coding regions. The insert can comprises a non-coding sequence (e.g., control elements, e.g., a promoter sequence). In some embodiments, the insert encodes transcription factors. In some embodiments, the insert encodes an antigen binding receptors such as single receptors, T-cell receptors (TCRs), priming receptors, CARs, mAbs, etc. In some embodiments, the the insert is a human sequence. In some embodiments, the insert is chimeric. In some embodiments, the insert is a multi-gene/multi-module therapeutic cassette. A multi-gene/multi-module therapeutic cassette refers to an insert or cassette having one or more than one receptor (e.g., synthetic receptors such as a CAR or a priming receptor), other exogenous protein coding sequences, non-coding RNAs, transcriptional regulatory elements, and/or insulator sequences, etc.

    [0322] In some embodiments, the nucleic acid sequence is inserted into the genome of the cell such as an immune cell or T cell via non-viral delivery. In non-viral delivery methods, the nucleic acid can be naked DNA, or in a non-viral plasmid or vector. Non-viral delivery techniques can be site-specific integration techniques, as described herein or known to those of ordinary skill in the art. Examples of site-specific techniques for integration into the safe harbor loci include, without limitation, homology-dependent engineering using nucleases and homology independent targeted insertion using Cas9 or other CRISPR endonucleases.

    [0323] In some embodiments, the insert is integrated at a safe harbor site by introducing into the engineered cell, (a) a targeted nuclease that cleaves a target region in the safe harbor site to create the insertion site; and (b) the nucleic acid sequence (insert), wherein the insert is incorporated at the insertion site by, e.g., HDR. Examples of non-viral delivery techniques that can be used in the methods of the present disclosure are provided in U.S. Pat. Nos. 11,033,584B2 and 11,814,624B2, the relevant disclosures of which are herein incorporated by reference in their entirety.

    [0324] Examples of integration sites contemplated are provided in Table D.

    TABLE-US-00003 TABLED sgRNAsequences Median(% Modified), summarized SEQ from2 sgRNA ID sgRNAstart sgRNA Integration donors,2 ID sgRNASequence NO: coorGRCH38 TargetLoci Site primersets sgRNA_ GCACCTGAATACC 118 chr16:88811818 APRT APRT 79.28 1 ACGCCTG sgRNA_ CGCCTGCGATGTA 119 chr16:88811551 APRT APRT 78.60 2 GTCGATG sgRNA_ CAGGACGGGCGA 120 chr16:88811640 APRT APRT 85.25 3 GATGTCCC sgRNA_ CTGAATCTTTGGA 121 chr15:44715425 B2M B2M 78.51 4 GTACCTG sgRNA_ GGCCACGGAGCGA 122 chr15:44711550 B2M B2M 94.75 5 GACATCT sgRNA_ AAGTCAACTTCAA 123 chr15:44715515 B2M B2M 70.97 6 TGTCGGA sgRNA_ GCTTGGAGGCCTG 124 chr19:36141111 CAPNS1 CAPNS1 89.34 7 ATCAGCG sgRNA_ CTTATCTCTTCGC 125 chr19:36142301 CAPNS1 CAPNS1 91.09 8 AGCGAGG sgRNA_ CACACATTACTCC 126 chr19:36142676 CAPNS1 CAPNS1 71.98 9 AACATTG sgRNA_ TTCCGCAAAATAG 127 chr3:105746019 CBLB CBLB 91.55 10 AGCCCCA sgRNA_ TGCACAGAACTAT 128 chr3:105751622 CBLB CBLB 91.43 11 CGTACCA sgRNA_ GCAATAAGACTCT 129 chr3:105853470 CBLB CBLB 76.18 12 TTAAAGA sgRNA_ CAAAGAGATTACG 130 chr1:116754658 CD2 CD2 89.80 13 AATGCCT sgRNA_ CAAGGCACCCCAG 131 chr1:116754663 CD2 CD2 92.70 14 GTTTCCA sgRNA_ TTACGAATGCCTT 132 chr1:116754666 CD2 CD2 92.82 15 GGAAACC sgRNA_ CAGAGACGCATCT 133 chr11:118315540 CD3E CD3E 90.96 16 GACCCTC sgRNA_ CATGCAGTTCTCA 134 chr11:118313715 CD3E CD3E 87.47 17 CACACTG sgRNA_ GTGTGAGAACTGC 135 chr11:118313715 CD3E CD3E 86.65 18 ATGGAGA sgRNA_ TCTCATTTCAGGA 136 chr11:118349748 CD3G CD3G 87.24 19 AACCACT sgRNA_ AGTCATACACCTT 137 chr11:118349754 CD3G CD3G 87.99 20 AACCAAG sgRNA_ TTCAAGGAAACCA 138 chr11:118352458 CD3G CD3G 86.55 21 GTTGAGG sgRNA_ GAGCCTTGCCTGG 139 chr11:61118177 CD5 CD5 84.03 22 AAATCTG sgRNA_ AAGCGTCAAAAGT 140 chr11:61118324 CD5 CD5 89.19 23 CTGCCAG sgRNA_ CGTTCCAACTCGA 141 chr11:61118121 CD5 CD5 83.11 24 AGTGCCA sgRNA_ GAGCGACTGGGAC 142 chr9:136866246 EDF1 EDF1 88.84 25 ACGGTGA sgRNA_ GCTGCGCAAGAAG 143 chr9:136866211 EDF1 EDF1 91.04 26 GGCCCTA sgRNA_ TTGTTCTGGCCAG 144 chr9:136863433 EDF1 EDF1 85.98 27 CAGCCCC sgRNA_ CTTCCAGAGCCAC 145 chr19:48965791 FTL FTL 93.10 28 ATCATCG sgRNA_ GGGACTCACCAGA 146 chr19:48965601 FTL FTL 88.86 29 GAGAGGT sgRNA_ CGGTCGAAATAGA 147 chr19:48965770 FTL FTL 93.14 30 AGCCCTA sgRNA_ AAAAGGATATTGT 148 chr10:87933015 PTEN PTEN 92.37 31 GCAACTG sgRNA_ TGTGCATATTTAT 149 chr10:87933183 PTEN PTEN 90.64 32 TACATCG sgRNA_ TTTGTGAAGATCT 150 chr10:87933087 PTEN PTEN 85.36 33 TGACCAA sgRNA_ TGTCATGCTGAAC 151 chr18:12830972 PTPN2 PTPN2 87.94 34 CGCATTG sgRNA_ CCACTCTATGAGG 152 chr18:12859219 PTPN2 PTPN2 92.45 35 ATAGTCA sgRNA_ TTGACATAGAAGA 153 chr18:12836828 PTPN2 PTPN2 93.96 36 GGCACAA sgRNA_ GAGTACTACACTC 154 chr12:6952098 PTPN6 PTPN6 89.61 37 AGCAGCA sgRNA_ TCACGCACAAGAA 155 chr12:6954872 PTPN6 PTPN6 82.74 38 ACGTCCA sgRNA_ AGGTCTCGGTGAA 156 chr12:6951610 PTPN6 PTPN6 91.27 39 ACCACCT sgRNA_ AGCATTATCCAAA 157 chr1:198696873 PTPRC PTPRC 88.88 40 GAGTCCG sgRNA_ ATATTAATTCTTA 158 chr1:198692370 PTPRC PTPRC 88.95 41 CCAGTGG sgRNA_ AGCTTTAAATCAA 159 chr1:198756176 PTPRC PTPRC 96.89 42 GGTTCAT sgRNA_ ATCCCGAGCCCTA 160 chr11:67436325 PTPRCAP PTPRCAP 84.08 43 AGGTGCA sgRNA_ GGCAGCGCGGAG 161 chr11:67436285 PTPRCAP PTPRCAP 97.74 44 GACAGCGT sgRNA_ CTCAGGGGGCTAC 162 chr11:67436170 PTPRCAP PTPRCAP 91.50 45 TACCACC sgRNA_ GTCACCGACGAGA 163 chr5:82277810 RPS23 RPS23 79.40 46 CCAGAAG sgRNA_ GTCGTGGACTTCG 164 chr5:82277843 RPS23 RPS23 83.07 47 TACTGCT sgRNA_ TAATTTTTAGGCA 165 chr5:82277860 RPS23 RPS23 61.94 48 AGTGTCG sgRNA_ TTAGCTGTTAGAC 166 chr14:51993810 RTRAF RTRAF 85.50 49 TTGAATA sgRNA_ CGAGAGCCGTCAA 167 chr14:51989652 RTRAF RTRAF 85.64 50 CTTGCGT sgRNA_ CGGCTTCAACTGC 168 chr14:51989700 RTRAF RTRAF 88.77 51 AAAGGTG sgRNA_ TATGAAAAAGCAG 169 chr15:43793025 SERF2 SERF2 89.61 52 AGCGACT sgRNA_ TCTGGCGGGCGAG 170 chr15:43792989 SERF2 SERF2 86.73 53 CTCACGC sgRNA_ CTCACGCTGGTTA 171 chr15:43792977 SERF2 SERF2 80.57 54 CCGCCTA sgRNA_ AAAGATTACGAAC 172 chr12:46207559 SLC38A1 SLC38A1 92.24 55 TTCCCTG sgRNA_ GTTAAAAACAGAC 173 chr12:46229232 SLC38A1 SLC38A1 91.51 56 ATGCCTA sgRNA_ ATGCCTAAGGAGG 174 chr12:46229246 SLC38A1 SLC38A1 79.48 57 TTGTACC sgRNA_ CTCCAGGTATCCC 175 chr18:47869418 SMAD2 SMAD2 79.53 58 ATCGAAA sgRNA_ CACCAAATACGAT 176 chr18:47870532 SMAD2 SMAD2 86.61 59 AGATCAG sgRNA_ TGGCGGCGTGAAT 177 chr18:47896729 SMAD2 SMAD2 82.91 60 GGCAAGA sgRNA_ TAGGATGGTAGCA 178 chr16:11255478 SOCS1 SOCS1 92.25 61 CACAACC sgRNA_ CAGCAGCAGAGCC 179 chr16:11255432 SOCS1 SOCS1 83.79 62 CCGACGG sgRNA_ CGGCGTGCGAACG 180 chr16:11255296 SOCS1 SOCS1 84.24 63 GAATGTG sgRNA_ TATAGACGCTGCC 181 chr15:40038895 SRP14 SRP14 95.12 64 CGACGTC sgRNA_ TCCAAAGAAGGGT 182 chr15:40038368 SRP14 SRP14 92.14 65 ACTGTGG sgRNA_ ACAGTACCCTTCT 183 chr15:40038358 SRP14 SRP14 65.82 66 TTGGAAT sgRNA_ GCGACGGGCGCAT 184 chr12:120469572 SRSF9 SRSF9 83.68 67 CTACGTG sgRNA_ CCCGACCTCCATA 185 chr12:120465700 SRSF9 SRSF9 92.56 68 AGTCCTG sgRNA_ GGGGTCCTCGAAG 186 chr12:120469426 SRSF9 SRSF9 89.94 69 CGCACGA sgRNA_ TGCTCTGTTTAGA 187 chr5:32591641 SUB1 SUB1 79.36 70 AGATGAC sgRNA_ ATATTCTTTTCTAG 188 chr5:32591566 SUB1 SUB1 70.93 71 TTAAAG sgRNA_ CCTGTAAAGAAAC 189 chr5:32591614 SUB1 SUB1 93.66 72 AAAAGAC sgRNA_ TGGAGAAAGACGT 190 chr4:105234315 TET2 TET2 83.53 73 AACTTCG sgRNA_ TCTGCCCTGAGGT 191 chr4:105234747 TET2 TET2 90.97 74 ATGCGAT sgRNA_ ATTCCGCTTGGTG 192 chr4:105235656 TET2 TET2 89.62 75 AAAACGA sgRNA_ CAGGCACAATAGA 193 chr3:114295571 TIGIT TIGIT 92.65 76 AACAACG sgRNA_ CCATTTGTAATGC 194 chr3:114295700 TIGIT TIGIT 60.75 77 TGACTTG sgRNA_ CTGGGTCACTTGT 195 chr3:114295634 TIGIT TIGIT 87.99 78 GCCGTGG sgRNA_ GTCAGGGTTCTGG 196 chr14:22547508 TRAC TRAC 98.20 79 ATATCTG sgRNA_ TGGATTTAGAGTC 197 chr14:22547541 TRAC TRAC 88.15 80 TCTCAGC sgRNA_ CTGCGGCTGTGGT 198 chr14:22550661 TRAC TRAC 94.77 81 CCAGCTG sgRNA_ ACAAAACTGTGCT 199 chr14:22547658 TRAC TRAC 87.86 82 AGACATG sgRNA_ TTCTTCCCCAGCC 200 chr14:22547778 TRAC TRAC 89.85 83 CAGGTAA sgRNA_ CGTCATGAGCAGA 201 chr14:22550625 TRAC TRAC 95.81 84 TTAAACC sgRNA_ GAGAGCGCCTGCG 202 chr19:58544980 TRIM28 TRIM28 89.44 85 ACCCGAG sgRNA_ CCAGCGGGTGAAG 203 chr19:58544869 TRIM28 TRIM28 94.79 86 TACACCA sgRNA_ GGAGCGCTTTTCG 204 chr19:58544839 TRIM28 TRIM28 91.81 87 CCGCCAG sgRNA_ TGAGGCCTGGACC 205 chr10:33134193 chr10: desert_1 69.44 88 TTATGCA 33130000- (GS88) 33140000 sgRNA_ CCTGGTGGAGTGA 206 chr10:33132917 chr10: desert_1 95.25 89 ACCATGA 33130000- (GS89) 33140000 sgRNA_ CAAGCACTTAGGT 207 chr10:33134633 chr10: desert_1 91.13 90 TCCCCTG 33130000- (GS90) 33140000 sgRNA_ GGTCTCCCTACAA 208 chr10:72294568 chr10: desert_2 92.02 91 TTCAGCG 72290000- (GS91) 72300000 sgRNA_ CACAGCGCGTGAC 209 chr10:72298268 chr10: desert_2 90.22 92 TGCAATG 72290000- (GS92) 72300000 sgRNA_ TCTGGGGCACCAA 210 chr10:72292786 chr10: desert_2 86.35 93 TTCTAGG 72290000- (GS93) 72300000 sgRNA_ GAGCCATGCTTGG 211 chr11:128342576 chr11: desert_3 91.24 94 CTTACGA 128340000- (GS94) 128350000 sgRNA_ GTACAAGTACTTA 212 chr11:128343592 chr11: desert_3 89.02 95 TCTCATG 128340000- (GS95) 128350000 sgRNA_ GAGATAACAACAT 213 chr11:128347170 chr11: desert_3 96.47 96 AACAACA 128340000- (GS96) 128350000 sgRNA_ CATATTCCATAGT 214 chr11:65425000 chr11: desert_4 88.54 97 CTTTGGG 65425000- (GS97) 65427000 (NEAT1) sgRNA_ CTGCCCCTTAGCA 215 chr11:65425507 chr11: desert_4 92.76 98 ACTTAGG 65425000- (GS98) 65427000 (NEAT1) sgRNA_ TGTTTAAAAATAT 216 chr11:65426264 chr11: desert_4 90.76 99 GTTGACA 65425000- (GS99) 65427000 (NEAT1) sgRNA_ CCAGGAATGGAAA 217 chr15:92830315 chr15: desert_5 87.84 100 CTCACGC 92830000- (GS100) 92840000 sgRNA_ GAGGCCGCTGAAT 218 chr15:92831850 chr15: desert_5 85.32 101 TAACCCG 92830000- (GS101) 92840000 sgRNA_ ATACACGCACACT 219 chr15:92831131 chr15: desert_5 99.92 102 TGCAGAA 92830000- (GS102) 92840000 sgRNA_ GAGCAGACAGAA 220 chr16:11225670 chr16: desert_6 87.92 103 ACCCAGGG 11220000- (GS103) 11230000 sgRNA_ TGAGTCTCCAAAC 221 chr16:11226284 chr16: desert_6 88.53 104 AGAACAG 11220000- (GS104) 11230000 sgRNA_ TAATATCACTGAC 222 chr16:11225029 chr16: desert_6 87.65 105 TTCACGG 11220000- (GS105) 11230000 sgRNA_ TACACACAATGTA 223 chr2:87467461 chr2: desert_7 71.79 106 AGCAGCA 87460000- (GS106) 87470000 sgRNA_ GGGAGCTCAATTC 224 chr2:87468809 chr2: desert_7 65.89 107 GAAACCA 87460000- (GS107) 87470000 sgRNA_ TTGGACAGGTGAG 225 chr2:87467001 chr2: desert_7 72.64 108 ACAGTCG 87460000- (GS108) 87470000 sgRNA_ AAGCTCACTCAGA 226 chr3:186511316 chr3: desert_8 76.89 109 TAGTGTG 186510000- GS109) 186520000 sgRNA_ CAGGAGAACCACC 227 chr3:186515260 chr3: desert_8 86.31 110 TTACACG 186510000- (GS110) 186520000 sgRNA_ GGACAGACCCTGA 228 chr3:186519655 chr3: desert_8 85.47 111 TTCACAA 186510000- (GS111) 186520000 sgRNA_ ACATGGCAGTCTA 229 chr3:59451154 chr3: desert_9 87.77 112 TGAACAG 59450000- (GS112) 59460000 sgRNA_ CCTATAGAGAGTA 230 chr3:59456416 chr3: desert_9 79.33 113 CTACTTG -59450000 (GS113) 59460000 sgRNA_ CCAACCGGGTCTT 231 chr3:59457029 chr3: desert_9 92.21 114 CATTACG 59450000- (GS114) 59460000 sgRNA_ TCAAGCGTAGAGT 232 chr8:127993006 chr8: desert_10 93.07 115 TCCGAGT 127980000- (GS115) 128000000 sgRNA_ TCATGCAATTATG 233 chr8:127994663 chr8: desert_10 89.40 116 GACCCAG 127980000- (GS116) 128000000 sgRNA_ CGGGAAAGTGACT 234 chr8:127996766 chr8: desert_10 87.45 117 GGCCATG 127980000- (GS117) 128000000 sgRNA_ TGAGATTGAAATC 235 chr9:7974159 chr9: desert_11 84.84 118 AAATCGG 7970000- (GS118) 7980000 sgRNA_ TATGCAATATTCA 236 chr9:7977914 chr9: desert_11 85.44 119 TCACGCG 7970000- (GS119) 7980000 sgRNA_ AATGTGTTAAATC 237 chr9:7976895 chr9: desert_11 83.48 120 AAATGCA 7970000- (GS120) 7980000

    CRISPR-Cas Editing

    [0325] One effective example of gene editing is the CRISPR-Cas approach (e.g. CRISPR-Cas9). This approach incorporates the use of a guide polynucleotide (e.g. guide ribonucleic acid or gRNA) and a cas endonuclease (e.g. Cas9 endonuclease).

    [0326] As used herein, a polypeptide referred to as a Cas endonuclease or having Cas endonuclease activity refers to a CRISPR-related (Cas) polypeptide encoded by a Cas gene, wherein a Cas polypeptide is a target DNA sequence that can be cleaved when operably linked to one or more guide polynucleotides (see, e.g., U.S. Pat. No. 8,697,359). Also included in this definition are variants of Cas endonuclease that retain guide polynucleotide-dependent endonuclease activity. The Cas endonuclease used in the donor DNA insertion method detailed herein is an endonuclease that introduces double-strand breaks into DNA at the target site (e.g., within the target locus or at the safe harbor site).

    [0327] As used herein, the term guide polynucleotide relates to a polynucleotide sequence capable of complexing with a Cas endonuclease and allowing the Cas endonuclease to recognize and cleave a DNA target site. The guide polynucleotide can be a single molecule or a double molecule. The guide polynucleotide sequence can be an RNA sequence, a DNA sequence, or a combination thereof (RNA-DNA combination sequence). A guide polynucleotide comprising only ribonucleic acid is also referred to as guide RNA. In some embodiments, a polynucleotide donor construct is inserted at a safe harbor locus using a guide RNA (gRNA) in combination with a cas endonuclease (e.g. Cas9 endonuclease).

    [0328] The guide polynucleotide includes a first nucleotide sequence domain (also referred to as a variable targeting domain or VT domain) that is complementary to a nucleotide sequence in the target DNA, and a second nucleotide that interacts with a Cas endonuclease polypeptide. It can be a double molecule (also referred to as a double-stranded guide polynucleotide) comprising a sequence domain (referred to as a Cas endonuclease recognition domain or CER domain). The CER domain of this double molecule guide polynucleotide comprises two separate molecules that hybridize along the complementary region. The two separate molecules can be RNA sequences, DNA sequences and/or RNA-DNA combination sequences.

    [0329] Genome editing using CRISPR-Cas approaches relies on the repair of site-specific DNA double-strand breaks (DSBs) induced by the RNA-guided Cas endonuclease (e.g. Cas 9 endonuclease). Homology-directed repair (HDR) of these DSBs enables precise editing of the genome by introducing defined genomic changes, including base substitutions, sequence insertions, and deletions. Conventional HDR-based CRISPR/Cas9 genome-editing involves transfecting cells with Cas9, gRNA and donor DNA containing homologous arms matching the genomic locus of interest.

    [0330] HITI (homology independent targeted insertion) uses a non-homologous end joining (NHEJ)-based homology-independent strategy and the method can be more efficient than HDR. Guide RNAs (gRNAs) target the insertion site. For HITI, donor plasmids lack homology arms and DSB repair does not occur through the HDR pathway. The donor polynucleotide construct can be engineered to include Cas9 cleavage site(s) flanking the gene or sequence to be inserted. This results in Cas9 cleavage at both the donor plasmid and the genomic target sequence. Both target and donor have blunt ends and the linearized donor DNA plasmid is used by the NHEJ pathway resulting integration into the genomic DSB site. (See, for example, Suzuki, K., et al. (2016). In vivo genome editing via CRISPR/Cas9 mediated homology-independent targeted integration. Nature, 540 (7631), 144-149, the relevant disclosures of which are herein incorporated in their entirety).

    [0331] Methods for conducing gene editing using CRISPR-Cas approaches are known to those of ordinary skill in the art. (See, for example, U.S. application Ser. Nos. 16/312,676, 15/303,722, and 15/628,533, the disclosures of which are herein incorporated by reference in their entirety). Additionally, uses of endonucleases for inserting transgenes into safe harbor loci are described, for example, in U.S. application Ser. No. 13/036,343, the disclosures of which are herein incorporated by reference in their entirety.

    [0332] The guide RNAs and/or mRNA (or DNA) encoding an endonuclease can be chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide. Non-limiting examples of such moieties include lipid moieties such as a cholesterol moiety, cholic acid, a thioether, a thiocholesterol, an aliphatic chain (e.g., dodecandiol or undecyl residues), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, adamantane acetic acid, a palmityl moiety and an octadecylamine or hexylamino-carbonyl-t oxycholesterol moiety. See for example US Patent Publication No. 20180127786, the disclosure of which is herein incorporated by reference in its entirety.

    Therapeutic Applications

    [0333] For therapeutic applications, the engineered cells, populations thereof, or compositions thereof are administered to a subject, generally a mammal, generally a human, in an effective amount. The engineered cells may be administered to a subject by infusion (e.g., continuous infusion over a period of time) or other modes of administration known to those of ordinary skill in the art.

    [0334] The engineered cells provided herein not only find use in gene therapy but also in non-pharmaceutical uses such as, e.g., production of animal models and production of recombinant cell lines expressing a protein of interest.

    [0335] The engineered cells of the present disclosure can be any cell, generally a mammalian cell, generally a human cell that has been modified by integrating a transgene at a safe harbor locus described herein. Exemplary cells are provided in the Recombinant Cells section.

    [0336] The engineered cells, compositions and methods of the present disclosure are useful for therapeutic applications such as CAR T cell therapy and TCR T cell therapy. In some embodiments, the insertion of a sequence encoding a transgene within a safe harbor locus maintains the TCR expression relative to instances when there is no insertion and enables transgene expression while maintaining TCR function.

    [0337] In some embodiments, the present disclosure provides methods of treating a subject in need of treatment by administering to the subject a composition comprising any of the engineered cells described herein. In some embodiments, administration of the engineered cell composition results in a desired pharmacological and/or physiological effect. That effect can be partial or complete cure of the disease and/or adverse effects resulting from the disease. In some embodiments, treatment encompasses any treatment of a disease in a subject (e.g., mammal, e.g., human). Further, treatment may stabilize or reduce undesirable clinical symptoms in subjects (e.g., patients). The cells provided herein populations thereof, or compositions thereof may be administered during or after the occurrence of the disease.

    [0338] In certain embodiments, the subject has a disease, condition, and/or injury that can be treated and/or ameliorated by cell therapy. In some embodiments, the subject in need of cell therapy is a subject having an injury, disease, or condition, thereby causing cell therapy (e.g., therapy in which cellular material is administered to the subject). However, it is contemplated that it is possible to treat, ameliorate and/or reduce the severity of at least one symptom associated with the injury, disease or condition.

    Method of Administration

    [0339] An effective amount of the immune cell comprising the SPA peptide comprises may be administered for the treatment of cancer. The appropriate dosage of the immune cell comprising the SPA peptide may be determined based on the type of cancer to be treated, the type of the immune cell comprising the SPA peptide, the severity and course of the cancer, the clinical condition of the individual, the individual's clinical history and response to the treatment, and the discretion of the attending physician.

    Determining Expression of CD11c

    [0340] Also provided herein are methods of treating a cancer in a subject in need thereof comprising: determining or having determined the expression of CD11c in a cell comprising a synthetic pathway activator (SPA) peptide disclosed herein, optionally wherein the SPA is inserted into a target region of the genome of the cell; and administering or having administered to the subject the cell comprising the SPA.

    [0341] CD11c is also known as Integrin Subunit Alpha X, Integrin, Alpha X (Complement Component 3 Receptor 4 Subunit), or ITGAX (HGNC: 6152, NCBI Gene: 3687, UniProtKB/Swiss-Prot: P20702).

    [0342] In some aspects, provided herein are methods of determining an expression level of CD11c protein in a sample from a subject comprising contacting the sample with an anti-CD11c antibody and performing a FACS, Western blot, ELISA, immunoprecipitation, immunohistochemistry, monoplex immunohistochemistry, multiplex immunohistochemistry, immunofluorescence, radioimmunoassay, dot blotting, immunodetection methods, surface plasmon resonance, optical spectroscopy, mass spectrometry assay, or any combination thereof. In some aspects, provided herein are methods of determining an expression level of CD11c mRNA in a sample from a subject comprising performing qPCR, RT-qPCR, multiplex qPCR or RT-qPCR, RNA-seq, microarray analysis, SAGE, MassARRAY technique, Luminex, MSD, or FISH, and combinations thereof.

    [0343] In some aspects, provided herein are methods of producing a CD11c-positive (CD11c+) cell comprising inserting a nucleic acid encoding for any SPA peptide disclosed herein and expressing the SPA peptide in the cell. In other aspects, provided herein are methods of detecting a SPA, optionally a functional SPA, in a cell comprising a SPA peptide disclosed herein, optionally wherein the SPA is inserted into the genome of the cell.

    [0344] In some aspects, provided herein are methods of screening a cell for expression of a SPA, optionally wherein the SPA is a functional SPA, comprising expressing one or more SPAs in a cell and detecting CD11c expression in the cell, wherein the detection of CD11c indicates a functional SPA peptide. In such embodiments, a functional SPA is one that has functional signaling (e.g., can stimulate the cell, such as a T cell, via phosphorylation of STAT1, STAT3, and/or STAT5).

    [0345] In some aspects, provided herein are assays to detect cells, such as primary cells and/or immune cells, engineered to express a SPA comprising: determining or having determined CD11c expression in the primary cell, wherein CD11c expression indicates that the cell expresses the SPA.

    [0346] In some aspects, provided herein are methods of treating a patient with an engineered CD11c-expressing T cell comprising: administering a T cell comprising a SPA disclosed herein to the patient.

    [0347] In some embodiments, the CD11c expression is determined in the T cell, from a biological sample from a patient administered the cell expressing the SPA peptide, or in the tumor of a patient administered the cell expressing the SPA peptide. In some embodiments, the cell is an immune cell. In some embodiments, the immune cell is a primary immune cell. In some embodiments, the immune cell is a hematopoietic cell, an adaptive immune cell, an innate immune cell, a natural killer (NK) cell, a T cell, a CD8+ cell, a CD4+ cell, or a T cell progenitor cell. In some embodiments, the immune cells are T cells. In some embodiments, the T cells are regulatory T cells, effector T cells, or nave T cells. In some embodiments, the T cells are CD8.sup.+ T cells. In some embodiments, the T cells are CD4.sup.+ T cells. In some embodiments, the T cells are CD4.sup.+CD8.sup.+ T cells.

    [0348] In some embodiments, the expression level of CD11c comprises the mRNA expression level of CD11c. In some embodiments, the expression level of CD11c comprises the protein expression level of CD11c. In some embodiments the expression level of CD11c is detected in the sample using a method selected from the group consisting of FACS, Western blot, ELISA, immunoprecipitation, immunohistochemistry, monoplex immunohistochemistry, multiplex immunohistochemistry, immunofluorescence, radioimmunoassay, dot blotting, immunodetection methods, HPLC, surface plasmon resonance, optical spectroscopy, mass spectrometry, qPCR, RT-qPCR, multiplex qPCR or RT-qPCR, RNA-seq, microarray analysis, SAGE, MassARRAY technique, Luminex, MSD, and FISH, and combinations thereof.

    Pharmaceutical Compositions

    [0349] The engineered recombinant cells provided herein can be administered as part of a pharmaceutical compositions. These compositions can comprise, in addition to one or more of the recombinant cells, a pharmaceutically acceptable excipient, carrier, buffer, stabiliser or other materials well known to those skilled in the art. Such materials should be non-toxic and should not interfere with the efficacy of the active ingredient. The precise nature of the carrier or other material can depend on the route of administration, e.g. oral, intravenous, cutaneous or subcutaneous, nasal, intramuscular, intraperitoneal routes. The pharmaceutical composition may comprise one or more pharmaceutical excipients. Any suitable pharmaceutical excipient may be used, and one of ordinary skill in the art is capable of selecting suitable pharmaceutical excipients. Accordingly, the pharmaceutical excipients provided below are intended to be illustrative, and not limiting. Additional pharmaceutical excipients include, for example, those described in the Handbook of Pharmaceutical Excipients, Rowe et al. (Eds.) 6th Ed. (2009), incorporated by reference in its entirety.

    [0350] Various modes of administering the additional therapeutic agents are contemplated herein. In some embodiments, the additional therapeutic agent is administered by any suitable mode of administration.

    [0351] A composition can be administered alone or in combination with other treatments, either simultaneously or sequentially dependent upon the condition to be treated.

    Kits and Articles of Manufacture

    [0352] The present application provides kits comprising any one or more of the SPA peptides or cell compositions described herein along with instructions for use. The instructions for use can be present in the kits as a package insert, in the labeling of the container of the kit or components thereof, or can be in digital form (e.g. on a CD-ROM, via a link on the internet). A kit can include one or more of a genome-targeting nucleic acid, a polynucleotide encoding a genome-targeting nucleic acid, a site-directed polypeptide, and/or a polynucleotide encoding a site-directed polypeptide. Additional components within the kits are also contemplated, for example, buffer (such as reconstituting buffer, stabilizing buffer, diluting buffer), and/or one or more control vectors.

    [0353] In some embodiments, the kits further contain a component selected from any of secondary antibodies, reagents for immunohistochemistry analysis, pharmaceutically acceptable excipient and instruction manual and any combination thereof. In one specific embodiment, the kit comprises a pharmaceutical composition comprising any one or more of the antibody compositions described herein, with one or more pharmaceutically acceptable excipients.

    [0354] The present application also provides articles of manufacture comprising any one of the antibody compositions or kits described herein. Examples of an article of manufacture include vials (including sealed vials).

    EXAMPLES

    [0355] Below are examples of specific embodiments for carrying out the present disclosure. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the present disclosure in any way. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, etc.), but some experimental error and deviation should, of course, be allowed for.

    [0356] The practice of the present disclosure will employ, unless otherwise indicated, conventional methods of protein chemistry, biochemistry, recombinant DNA techniques and pharmacology, within the skill of the art. Such techniques are explained fully in the literature. See, e.g., T. E. Creighton, Proteins: Structures and Molecular Properties (W.H. Freeman and Company, 1993); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current addition); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Remington's Pharmaceutical Sciences, 18th Edition (Easton, Pennsylvania: Mack Publishing Company, 1990); Carey and Sundberg Advanced Organic Chemistry 3.sup.rd Ed. (Plenum Press) Vols A and B (1992).

    Example 1: Synthesis and In Vitro Characterization of Logic Gate Circuits Comprising Synthetic Pathway Activators

    Materials and Methods

    ICT Construct Expression in T Cells

    [0357] Integrated circuit T (ICT) cells were generated through site directed CRISPR mediated knock in (KI). T cells were activated for two days using CD3-CD28 beads. At day 2, beads were removed followed by the delivery of the ICT transgene to the GS94 site in the genome of the T cells. Transgene integration was performed using a CRISPR-based process and electroporation step that combined activated T cells, CRISPR/Cas9 RNP targeting the GS94 non-coding autosomal integration site, and plasmid DNA constituting a repair template to effect insertion of the transgene cassette via cellular DNA repair machinery.

    [0358] The GS94 CRISPR/Cas9 RNP used was generated by complexing single guide RNA (sgRNA) with recombinant Streptococcus pyogenes Cas9 (SpCas9). The sgRNA contained a protospacer sequence directing the CRISPR/Cas9 RNP to the GS94-transgene integration site. The plasmid DNA repair template contained the ICT transgene cassette, flanked by 450 base pair (bp) sequences homologous to the regions flanking the integration site to effect repair-mediated insertion.

    [0359] A diagram of five ICT transgene cassettes generated is provided in FIG. 1. The ICT constructs 1, 2, 3, and 4 comprised a constitutively expressed priming receptor, an inducible CAR (collectively forming a Logic Gate or LG), constitutively expressed shRNAs, and a synthetic pathway activator (SPA). ICT 5 included LNGFR in place of the SPA.

    [0360] Following electroporation, cells were recovered and expanded in T cell media for 7 days. When indicated, negative control T cells were generated using a mock electroporation process that edited T cells with ribonucleoprotein (RNP) in the absence of donor plasmid and are referred to as RNP control.

    [0361] ICT cells were assessed for transgene KI and the expression of the PrimeR and CAR using flow based staining. Constructs contained tags myc and flag on the distal extracellular portion of the PrimeR and CAR respectively following the signal peptide. ICT cells at day 7 post activation were stained with myc, flag and CD3 antibodies for 30 min at 4c. Following activation, cells were washed in FACs buffer and run by flow cytometry. ICTs were analyzed for PrimeR and CAR expression following gating each sample for live CD3+ cells.

    ICT Induction of CARS

    [0362] ICTs were generated as described above from the T cells of 2 donors. On day 11 post activation, ICTs were measured for CAR and PrimeR expression by Flag and Myc staining. % KI was quantify by summing the % of T cells in a sample that were PrimeR+ or CAR+. Before co-culture setup, ICTs were normalized to the same % KI using the addition of donor matched RNP only cell. 110.sup.7 ICTs were co-cultured with 110.sup.7 target cells or media for 72 hours and stained to calculate the % of CAR+ cells using flag staining. Basal CAR expression was measured during assay set up.

    Synthetic Pathway Activators

    [0363] Synthetic Pathway Activators (SPAs) constitutively drive STAT signaling without the need for external cytokine input. SPAs can be designed to engage activity of multiple STAT family transcription factors at variable levels through rational design. Exemplary Class I SPAs primarily increase pSTAT3 activity and exemplary Class II SPAs primarily increase pSTAT5 activity. FIG. 2 shows the structures of exemplary synthetic pathway activators.

    [0364] A synthetic pathway activator (SPA) based on gp130 was constructed as shown in SEQ ID NOs: 20 and 81. SEQ ID NO: 20 includes a leader sequence. The SPA comprises the transmembrane region and intracellular domain of gp130 linked to an ectodomain derived from the cell adhesion protein CD34. An unpaired cysteine residue was introduced into the receptor ectodomain to permit formation of a covalent bond and subsequent dimerization of individual synthetic gp130 monomers. The SPA drives constitutive recruitment and phosphorylation of STAT1 and STAT3 transcription factors. (FIG. 5 and data not shown)

    [0365] To demonstrate the ability of the SPA module to drive constitutive STAT3 phosphorylation, ICTs expressing the SPA module under non-stimulated conditions were fixed, permeabilized, and stained for pSTAT3 and the myc epitope tag to distinguish between edited and non-edited cells expressing the ICT.

    Cytotoxicity, Engineered K562 Cells

    [0366] ICT cells expressing the integrated circuits comprising Logic Gate 1 IC, Logic Gate 2 IC, Logic Gate 3 IC, Logic Gate 4 IC, or Logic Gate 5 IC (LG-15 IC) with shRNA and a SPA were co-cultured with K562_EFG, K562_EFG_CAR, K562_EFG_primeR, or K562_EFG_CAR_primeR target antigens at varying E:T ratios for 72 hours at 37 C. Following incubation, cytotoxicity was measured using a luciferase reporter assay. Data are presented as the meanstandard deviation of 4 donors.

    Cytokine Secretion

    [0367] To further assess the specificity and function of ICT cells expressing Logic Gates 1-5, supernatants were collected from K562 target cytotoxicity co-cultures (Effector:Target ratio of 1:1, 72 hour co-culture). Following incubation, supernatants were collected at endpoint and cytokine release levels were measured using a Luminex assay. Data from 4 donors are shown.

    Cytotoxicity in Endogenous CAR Cells

    [0368] ICT cells expressing Logic Gates 1-5 were co-cultured with cells endogenously expressing a CAR target and engineered to express a primeR target at varying E:T ratios for 72 hours at 37 C. Following incubation, cytotoxicity was measured using a luciferase reporter assay. Data are presented as the meanstandard deviation of 4 donors. Prior to luciferase readout described above, supernatants were collected at endpoint and cytokines IFN-g, TNFa, GM-CSF, and IL-2 were measured using a Luminex assay. Data from 4 donors are shown.

    Mixed Co-Culture Cytotoxicity

    [0369] ICT cells expressing LG 1-5 ICs were co-cultured with primeR target+/CAR target HUVEC and luciferase expressing primeR target/CAR target+ cells (K562-EFG-CAR) at varying E:T ratios for 72 hours at 37 C. Following incubation, cytotoxicity was measured using a luciferase reporter assay. Data are from one normal donor. ICT-mediated CAR+ target cell killing was evaluated relative to an RNP-electroporated negative control using a luciferase reporter assay.

    Results

    [0370] All ICT cells constitutively expressed the PrimeR construct as shown by myc expression (FIG. 3). The inducible CAR was not expressed at basal state in the ICT cells, as indicated by the lack of FLAG expression (FIG. 3), indicating that the priming receptor had not induced expression of the CAR.

    [0371] As shown in FIG. 4, the ICT cells induced CAR expression when co-cultured with primeR target antigen expressing cell lines. Numbers shown in FIG. 4 are the (% CAR)/(% KI normalized to at the start of the assay)*100. Thus, the logic gate circuit functioned correctly by not expressing the CAR in the absence of binding of the primeR to primeR target antigen (FIG. 3) and induction of expression of the CAR upon binding of the primeR to its cognate ligand on a target cell (FIG. 4).

    [0372] As shown in FIG. 5, flow cytometry analysis revealed that ICT cells expressing the SPA (LG 1-4 ICs) exhibit approximately one log higher pSTAT3 expression when compared to the PrimeR-cells lacking a SPA (EGFRt). Edited T cells with a EGFRt (non-signaling) module in the place of a SPA does not exhibit increased pSTAT3 staining when compared to PrimeR-cells. Overall, the results indicate that ICTs expressing the SPA module exhibit increased STAT3 phosphorylation.

    [0373] ICTs expressing LG 1-5 IC's demonstrated cytotoxicity against only dual CAR target antigen and primeR target antigen expressing cells as compared to unedited control cells (RNP). FIG. 6A shows cytotoxicity against parental K562 cells expressing neither target antigen, FIG. 6B shows cytotoxicity against K562 cells expressing only CAR target antigen, FIG. 6C shows cytotoxicity against K562 cells expressing only primeR target antigen, and FIG. 6D shows cytotoxicity against K562 cells expressing both primeR target antigen and CAR target antigen. As shown in FIG. 6D, the ICTs exhibited cytotoxicity against only cells expressing both primeR target antigen and CAR target antigen as compared to unedited cells (RNP).

    [0374] IFN- production from ICTs expressing LG 1-5 ICs was observed only in supernatants taken from co-cultures where the target cells expressed both primeR target antigen and CAR target antigen (FIG. 7). Results from the cytokine analysis were consistent with the cytotoxicity data. Together, these data further demonstrate that ICT activity is driven by co-expression of primeR target antigen and CAR target antigen.

    [0375] ICTs expressing LG 1-5 ICs demonstrated in vitro cytotoxicity against the cell line expressing endogenous CAR target antigen and engineered primeR target antigen (FIG. 8A). ICTs also secrete cytokines after co-culture with the CAR/primeR target antigen cell line. FIG. 8B shows IFN, TNF, GM-CSF, and IL-2 secretion by ICT cells after co-culture with +CAR/+primeR target antigen cells. Thus, ICTs expressing Logic Gate 1-5 ICs secreted cytokines and killed ccRCC cell lines that express endogenous CAR antigen in the presence of primeR target antigen.

    [0376] Co-culture with HUVEC-primeR antigen+ cells induced expression of the CAR protein on ICT cells and specific killing of CAR antigen positive cells was confirmed (FIG. 9). Thus, ICTs expressing Logic Gates 1-5 were capable of inducing CAR expression through interaction with primeR target antigen positive endothelial cells and subsequently specifically engaging and killing CAR target antigen positive tumor cells. Therefore, without wishing to be bound by theory, ICTs can be primed by binding to endothelial cells expressing primeR target antigen in order to express the CAR and then kill CAR target tumor cells.

    [0377] Thus, logic gated ICT cells with signal pathway activators that utilize the presence of two antigens to trigger tumor cell killing to improve the therapeutic index of CAR T cells were developed, thereby enhancing tumor specificity. Induction of the CAR was gated on the expression of primeR target antigen found on the tumor neovasculature of ccRCC. When the priming receptor (PrimeR) binds the primeR target antigen, PrimeR engagement triggers proteolytic release of a transcription factor that induces expression of a CAR. The feasibility of vascular priming was confirmed using a transwell assay where ICTs were primed by a primeR target antigen expressing endothelial cell line and then migrated across the transwell membrane to kill CAR antigen expressing RCC cells.

    [0378] When constitutively expressed in the ICT cells, the SPA resulted in significant enhancements in T-cell potency and expansion. Repetitive stimulation assays, wherein T cells were challenged with tumor cells every 2 days, show that Class I SPAs result in 6-log or higher improved tumor cell clearance over a 2-week assay period. (data not shown) Across various mouse xenograft models (FIGS. 2, 10A, 10D, 11B, and data not shown), SPA-expressing ICTs reach at least 6-fold improved tumor growth inhibition. RNAseq and ATACseq analysis indicate changes to gene expression profiles in T cells expressing Class I SPAs, with maintenance of T cell stem-like phenotypes, and restricted accessibility of various exhaustion marker genes (FIG. 2 and data not shown). Importantly, despite significantly increased levels of expansion, ICTs equipped with SPAs are not immortalized, showing no signs of cytokine-independent outgrowth. (data not shown) In addition, SPA-expressing ICT cells rapidly contract following tumor clearance in-vivo (FIGS. 10B, 10C, 10E, 10F).

    [0379] Collectively, these results demonstrate that primeR/CAR plus SPA ICT cells can (i) selectively target antigens that cannot generally be safely targeted by conventional CARs; and (ii) overcome multiple suppressive mechanisms in the tumor microenvironment.

    Example 2: In Vivo Efficacy of primeR and CAR Logic Gate T Cells Expressing a Synthetic Pathway Activator

    Materials and Methods

    RCC Efficacy Model

    [0380] Human ccRCC cells express endogenous levels of the CAR target antigen and were engineered to express physiological levels of the primeR target antigen. 210.sup.6 primeR target antigen cells were inoculated into the right dorsal flank of five-six weeks old, female NSG MHC I/II DKO mice. Day 35 post tumor inoculation, mean tumor volume of 150 mm.sup.3 was reached and tumor-bearing animals were randomized into various treatment groups such that mean tumor volume per group was within 10% of the overall mean. Seven mice/group were injected intravenously with a single dose of 0.1510.sup.6 of PrimeR+ ICT cells expressing one of the five LG ICTs described in Example 1 (LG 1 IC, LG 2 IC, LG 3 IC, LG 4 IC, or LG 5 IC) RNP or PBS. The study was repeated with ICTs generated from two different normal donors. Tumor volumes and body weight were recorded bi-weekly. Tumor volume was calculated as per formula *L*W.sup.2, where L is tumor length and W is tumor width.

    [0381] Blood pharmacokinetics demonstrated the expansion of ICTs on day 14 followed by complete contraction by day 42 post T cell injection. PrimeR+ ICTs in mouse blood were quantified to track expansion of ICTs using flow cytometry with count bright beads for T cell quantification/volume. Mean and SEM plotted.

    Dual Flank Model

    [0382] Human ccRCC 786-O cells were engineered to express either the CAR target antigen and primeR target antigen or the CAR target antigen only. 210.sup.6 786-O-CAR+ and 786-O-CAR antigen+-primeR antigen+ cells were inoculated into the left and right dorsal flank respectively of five-six weeks old, female NSG MHC I/II DKO mice. Day 35 post tumor inoculation, mean tumor volume of 150-200 mm.sup.3 was reached on each flank and tumor-bearing animals were randomized into various treatment groups such that mean tumor volume per group on the right flank was within 10% of the overall mean. Seven mice/group were injected intravenously with a single dose of 0.2510.sup.6 or 110.sup.6 of PrimeR+ ICT cells, constitutive CAR T cells, RNP or PBS control. Tumor volumes and body weight were recorded bi-weekly. Tumor volume was calculated as per formula *L*W.sup.2, where L is tumor length and W is tumor width. (B) tumor volumes on the 786-O CAR only flank (left), and (C) tumor volumes on the 786-O-CAR+primeR+ flank (right). Data represents a single donor study with 7 mice per group, mean and SEM plotted.

    Results

    A498 RCC Efficacy Model

    [0383] ICTs expressing LG 1-5 ICs showed tumor elimination in a ccRCC model. FIGS. 10A and 10D show the tumor volume post tumor implant in mice treated with ICTs expressing Logic Gates 1-5, RNP or PBS generated from T cells from either donor 1 (FIGS. 10A-C) or donor 2 (FIGS. 10D-F). FIGS. 10B and 10E show the total T cells and expansion of the ICTs on day 12 post inoculation followed by contraction by day 21. FIGS. 10C and 10F show total T cells expressing the priming receptor on days 12 and 21. In both replicates, the ICT cells demonstrated significant tumor-growth inhibition in mice (P<0.05).

    Dual Flank Model

    [0384] The ICTs expressing LG 1-5 ICs showed specificity in a dual flank model (FIG. 11A-B). Greater tumor growth inhibition (TGI) was observed in the dual positive primeR/CAR flank (FIG. 11B) than the single positive CAR-only flank (FIG. 11A). Thus, the dual flank xenograft model shows that logic gated circuits (ICTs) more selectively killed tumors that express both CAR and primeR target antigens, and not tumors that express CAR alone.

    Example 3: Generation and Characterization of Novel Synthetic Pathway Activators

    Methods

    Generation and Synthesis

    [0385] A library of novel gp130-based STAT1/3 synthetic pathway activators (SPAs) was generated through diversification of extracellular and intracellular domains, multimerization modalities, membrane anchor modalities, and epitope tags/signaling peptides. The SPAs were generated to explore the optimization and improvement of SPA performance such as enhanced potency, reduced immunogenicity, detection capabilities, and reduced gene size. Various forms of the gp130 intracellular domain (ICD) were used, including the full length gp130 ICD, as well as truncations comprising deletions of the gp130 ICD 707-755, 771-811, 818-901, and combinations therein. A gp130 Y759F mutation was also included in some ICD constructs. Multimerization modalities used were unpaired cysteines, a leucine zipper, a BCR ectodomain (SEQ ID NO: 239), and a VASP tetramerization domain (SEQ ID NO: 240). Extracellular domains used were the full CD34 ectodomain (SEQ ID NO: 242), a CD34 epitope (SEQ ID NO: 238), a BCR ectodomain (SEQ ID NO: 239), a thrombopoetin receptor domain (SEQ ID NO: 243), and an erythropoietin receptor (EpoR) ectodomain (SEQ ID NO: 241). Membrane anchor modalities used were prenylation and myristoylation domains derived from src, fyn, or lck. The prenylation modification was also used at the C terminus of some SPAs. Some SPAs also comprised a CD8alpha hinge domain (FACD). In some SPAs, the SPA expression was inducible based on T cell activation. Other SPAs were constitutively expressed. The sequences of the novel SPAs are provided in SEQ ID NOs: 1-58 and 63-104. The SPAs were screened first by a pSTAT activation screen, then an in vitro functional assessment, and finally an in vivo efficacy assessment.

    [0386] Activity of the novel pathway activators were assessed by intracellular pSTAT staining following a 24 hour serum starve. ICTs expressing a SPA construct module were starved for 24 hours with no antigen stimulation or cytokine support, then fixed, permeabilized, and stained for pSTAT proteins. pSTAT3 and pSTAT1 MFI values were measured for two independent donors and the average value was plotted in a heat map.

    Cytokine Induction

    [0387] A diverse range of ICTs expressing novel STAT1/STAT3 SPAs were cultured with K562 cells at a 1:1 ratio. Following 72 hours of co-culture, supernatants were isolated and analyzed by Luminex for granzyme B and IL10 production. Representative cytokines displaying diverse expression levels are shown (plotted in groups according to pSTAT1 levels).

    Repetitive Stimulation and Memory Phenotype

    [0388] T cells expressing a logic gate (ICTs) and a diverse range of novel Class I STAT1/STAT3 SPAs were challenged in a 14-day repetitive stimulation assay with K562 cells expressing primeR and CAR antigens with IL-2 supplementation. ICTs and tumor cells were re-normalized to a fixed number every other day to maintain a 1:1 E:T ratio. The total tumor cell growth and T-cell expansion over the course of the experiment was normalized to the EGFRt control and plotted in groups according to pSTAT1 levels. L-gp130 was used as a control SPA. Additional controls included expressing cJun, EGFRt, mbIL-15, and IL7Ra-IL7 in the ICTs.

    [0389] Memory phenotype was measured by flow cytometry for CD45RA and CD27 expression at day 0 and repetitive stimulation assay endpoint (day 14) and plotted in groups according to pSTAT1 profile.

    Results

    [0390] Approximately 60% of the novel SPAs demonstrated elevated levels of pSTAT1 (FIG. 12B) and/or pSTAT3 (FIG. 12A) relative to a logic gate ICT expressing an inert truncated EGFR molecule in place of a SPA. The pSTAT1 vs pSTAT3 heatmap highlights the combinatorial diversity achieved through modification of the SPA architecture (FIG. 13). Thus, without wishing to be bound by theory, STAT-inducing cell-surface receptors demonstrate remarkable flexibility in their architecture, permitting diversification of STAT profiles.

    [0391] The diversity in pSTAT1 and pSTAT3 signaling resulted in varied cytokine secretion, including Granzyme B (top panel FIG. 14) and IL-10 (bottom panel FIG. 14). Thus, pSTAT1/3 signaling diversity achieved through the novel SPAs drove cytokine responses.

    [0392] All of the novel STAT1/STAT3 SPAs demonstrated dramatic levels of tumor clearance in the repetitive stimulation assay. FIG. 15 (top panel) shows the tumor expansion as compared to the control ICTs under chronic antigen stimulation. ICTs expressing the novel SPAs significantly reduced tumor expansion and resulted in increased tumor clearance as compared to control ICTs. FIG. 15 (bottom panel) shows T cell expansion. Without wishing to be bound by theory, the novel Class I SPAs thus demonstrated both superior anti-tumor activity as compared to L-gp130 (SEQ ID NO: 62) and also demonstrated a favorable safety profile by rapidly contracting after tumor clearance.

    [0393] The novel SPAs drove diverse memory phenotypes and functional phenotypes based on pSTAT profiles (FIG. 16). FIG. 16 shows the percent of ICT cells expressing the novel SPAs that expressed CD45RA and/or CD27 and the corresponding T cell type (e.g., Tscm, Tem, Teff) after the repetitive stimulation assay.

    [0394] Without wishing to be bound by theory, the novel SPAs thus represent a novel, improved, tunable, T cell intrinsic approach for engineering cell fates that result in potent anti-tumor properties, e.g., as compared to L-gp130.

    Example 4: In Vivo Efficacy of Second primeR and CAR Logic Gate T Cells Expressing a Synthetic Pathway Activator

    Materials and Methods

    [0395] In a mesothelioma (MSTO) solid tumor model, mice were engrafted with MSTO tumors expressing different primeR and CAR antigens as compared to Example 2. 300,000 ICT cells expressing the exemplary primeR and CAR logic gate and the novel Class I SPAs (SPAs of SEQ ID NOs: 20, 30, or 16) or the exemplary primeR and CAR logic gate and no additional SPA were administered intravenously when tumors reached 100 mm.sup.3. T cells expressing the exemplary primeR and CAR logic gate and L-gp130 (SPA001, SEQ ID NO: 62) were used as an additional control. Blood was drawn weekly for pharmacokinetics (PK) studies on days 7, 14, 21, and 28 post T cell injection. Tumor growth inhibition (TGI) was measured 3weekly for 45 days post T cell injection.

    Results

    [0396] The ICT cells expressing the logic gate and the novel Class I SPA variants of SEQ ID NOs: 20, 30, and 16 showed improved anti-tumor efficacy as compared to ICT cells expressing a logic gate and L-gp130 (SPA01, FIG. 17). The novel Class I SPA variants also exhibited improved T-cell expansion compared to the foundation SPA01 molecule. Thus, the novel SPAs outperformed L-gp130 in an in vivo experiment assessing anti-cancer efficacy.

    Example 5: SPA Expression in T Cells Induces CD11c Expression

    Materials and Methods

    [0397] 5-6 week old female NSG MHC DKO mice were implanted with 786-O B2M KO tumors expressing the primeR antigen and CAR antigen from Example 2. Day 36 post tumor implant mice were intravenously injected with LG 1 IC T cells or the non-SPA IC T cells (LG 5 ICT). Tumor and spleen samples were harvested from 15 mice treated with either LG 1 IC T cells (2 donors) or non-SPA ICT (1 donor) at day 7 post ICT injection. Tumors were lysed in DMEM containing Dnase/Collagenase/hyaluronidase. Tumor dissociation was performed in gentleMacs (Octo) using a custom built program. Spleen was mechanically digested using a syringe plunger handle-end over 70 m filter. Post digestion, cell suspension was filtered and subjected to RBC lysis and staining with live dead Zombie NIR dye for 15 min at RT in dark. After live-dead staining cells were centrifuged, washed and stained with cell surface antibody cocktail containing murineCD45, murineGR-1, humanCD3, CAR idiotype (CAR receptor), Prime idiotype (Prime receptor) in the presence of human and mouse Fc blocks for 30 minutes at RT in dark. Cells were then centrifuged at 400 g for 10 min, washed and suspended in flow staining buffer (BD Biosciences). Cells were then sorted using BD FACS Aria. Receptor positive cells were sorted and subjected to RNA seq and CITE seq.

    CITE Labeling Protocol

    [0398] TotalSeq-C Human Universal Cocktail, V1.0 (Biolegend Cat #399905) antibody mix was reconstituted in 26 ul PBS with 1% BSA. Cells were blocked with Fc Block and 13 uL antibody mix was added to 500,000 cells. Following 30 min. incubation on ice cells were washed, resuspended and transferred for GEM generation and barcoding with the 10 Genomics Chromium Next Gem 5 Single Cell kit (Cat #1000263). Cells were combined with GEM Master mix, loaded on a 10K chip into the Chromium X controller for GEM generation, then incubated in a thermal cycler for reverse transcription. Following GEM clean up and cDNA amplification, gene expression sequencing libraries were generated according to 10 Genomics published protocols. Using the supernatant fraction from the cDNA amplification clean up step, CITE-Seq libraries were constructed using Dual Index Plate TN Set A from 10 Genomics (Cat #1000250) with 8 cycles of PCR amplification. Following library preparation, samples were sequenced on an Illumina NovaSeq 6000.

    Bioinformatics Analysis

    [0399] Single-cell RNA-seq and CITE-seq data was first analyze by Cellranger 7.0 to generate gene by cell matrix. Cells were filtered to exclude low UMI cells, high mitochondrial content cells and non T cells. The filtered matrix was analyzed by scvi 0.20 to generate low dimensional visualizations for the individual cells. The filtered matrix was also analyzed by Seurat 5.0 to define differentially expressed genes, which are genes significantly up or down-regulated in SPA positive cells compared to SPA negative cells (SPA-vs-no SPA). The genes that up-regulated in SPA-vs-no SPA all of D0, D7 Tumor and D7 Spleen samples were intersected to generate a shared list. The same analysis was performed for both RNA-seq and CITE-seq to generate two shared lists. The two shared lists were further intersected to nominate the final list of cell surface protein genes that are most associated with SPA expression. CD11c was nominated by manually examining the final nominated genes.

    CD11c Detection by Flow Cytometry

    [0400] Peripheral blood samples were collected into EDTA-coated tubes. Red blood cells were lysed with ammonium chloride and the remaining cellular fraction was stained with a fixable amine-reactive viability dye and Fc receptors were blocked using anti-CD16/CD32 monoclonal antibodies. After washing, cell surface antigens were stained using fluorochrome-conjugated monoclonal antibodies against humanCD45, murineCD45, murineGr-1, Flag tag (chimeric antigen receptor) & Myc tag (priming receptor). The cells were then fixed with Cytofix fixation buffer (BD Biosciences) and permeabilized with Phosflow Perm Buffer III (BD Biosciences) following the manufacturers recommendation. After permeabilization, the cells were stained using fluorochrome-conjugated monoclonal antibodies against CD11c (CD11c is also known as Integrin, alpha X or ITGAX) and intracellular pSTAT3. Samples were analyzed on an Attune NT flow cytometer (Thermo Fisher Scientific).

    Results

    [0401] There is a strong correlation of CD11c RNA and cell surface expression with SPA expression in both CD4+ and CD8+ cells at D7 in both tumor and spleen samples (FIG. 18A). Increased expression of CD11c was observed in the CD8 and CD4 cells expressing the SPA as compared to the CD8 and CD4 cells that did not express the SPA. FIG. 18B shows CD11c (ITGAX) expression in spleen or tumor cells from mice treated with cells expressing a SPA ICT (two donors) as compared to cells that do not express a SPA ICT (one donor) collected on Day 0 and Day 7 post treatment. CD11c (i.e., ITGAX) was expressed at higher levels in SPA containing T cells isolated from the tumor and spleen as compared to non-SPA containing T cells (FIG. 18B). Thus, quantification of expression of CD11c (mRNA or protein) can be used as a marker for SPA expression in both CD4+ and CD8+ cells.

    [0402] While the disclosure has been particularly shown and described with reference to a preferred embodiment and various alternate embodiments, it will be understood by persons skilled in the relevant art that various changes in form and details can be made therein without departing from the spirit and scope of the disclosure.

    [0403] All references, issued patents and patent applications cited within the body of the instant specification are hereby incorporated by reference in their entirety, for all purposes.

    TABLE-US-00004 informalsequencelisting SEQ ID NO Name Sequence 1 QBEND10- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN Cys-L- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK GP130Y759F KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE withleader DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVH sequence SGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQH ESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPG TEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 2 QBEND10- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN Cys-L- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK GP130Y759F KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE 4771-811with DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVH leader SGYRHQQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSG sequence QMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 3 QBEND10- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN Cys-L- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK GP1304771- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE 811with DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH leader SGYRHQQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSG sequence QMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 4 QBEND10-L- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELAS GP130with TANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP leader NVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL sequence DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRH QVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPD ISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQV ERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 5 QBEND10-L- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELAS GP130Y759F TANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP withleader NVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL sequence DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRH QVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPD ISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQV ERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 6 QBEND10-L- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELAS GP130Y759F TANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP 4771-811with NVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL leader DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRH sequence QQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMF QEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 7 QBEND10-L- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELAS GP1304771- TANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP 811with NVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL leader DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRH sequence QQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMF QEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 8 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L-GP130 KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE withleader DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH sequence SGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQH ESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPG TEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 9 SPA-L- MRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKRDLIKKHIWPNVPDPS gp130_ICD- KSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKE hKRas-CAAX KINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQ VFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFER SKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETV GMEAATDEGMPKSYLPQTVRQGGYMPQMSKDGKKKKKKSKTKCVIM 10 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ TVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQ trunc1with YFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEV leader SAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ sequence 11 SPA MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130771- DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH 811(ICD_ SGYRHQQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSG trunc2)with QMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ leader sequence 12 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH trunc3with SGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQPKSYL leader PQTVRQGGYMPQ sequence 13 SPA MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ TVQYSTVVHSGYRHQQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISD trunc4with HISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQ leader GGYMPQ sequence 14 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH trunc5with SGYRHQQQYFKQPKSYLPQTVRQGGYMPQ leader sequence 15 SPA MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ TVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQ trunc6with YFKQPKSYLPQTVRQGGYMPQ leader sequence 16 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK Cys-L- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_ICD_ TVQYSTVVHSGYRHQQQYFKQPKSYLPQTVRQGGYMPQ trunc7with leader sequence 17 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELAS QBEND10-L- TANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP GP130with NVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL leader DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRH sequence QVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPD ISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQV ERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 18 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSMVDPVGFAEAWKAQFPDSEPP QBEND10- RMELRSVGDIEQELERCKASIRRLEQEVNQERFRMIYLQTLLAKEAIVVPVCLAFLLT BCR-GP130 TLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTD withleader VSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENE sequence SSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGD GILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQ MKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 19 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAF QBEND10- LLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGN cys-GP130 FTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSD withleader ENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVD sequence GGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCG SGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 20 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLL QBEND10- GVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSV cys- VEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQ GP130_TM_ NTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGIL ICDwithleader PRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKM sequence FQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 21 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAF QBEND10- LLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGN cys-GP130-L FTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSD withleader ENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVD sequence GGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCG SGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQR IARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 22 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLL QBEND10- GVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSV cys- VEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQ GP130_TM_ NTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGIL ICD-Lwith PRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKM leader FQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQRIARLEE sequence KVKTLKAQNSELASTANMLREQVAQLKQKVMN 23 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAF QBEND10- LLTTLLGVLFCFNKRDLIKKHRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV cys- MNIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE GP130_TM_ DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH L-ICDwith SGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQH leader ESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPG sequence TEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 24 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLL QBEND10- GVLFCFNKRDLIKKHRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNIWPN GP130_TM_ VPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLD -ICDwith LFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQ Lleader VPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDI sequence SHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVE RFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 25 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSFACDIYIWAPLAGTCGVLLLS QBEND10- LVITLYCNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVS FACD- VVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESS CD8a_TMD_ QNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGI GP130_ICD LPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMK withleader MFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ sequence 26 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSTTTPAPRPPTPAPTIASQPLS QBEND10- LRPEACRPAAGGAVHTRGLDFACDIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHI CD8a_hinge- WPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLK TMD_GP130 SLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGY ICDwith RHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESS leader PDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEG sequence QVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 27 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGIYIWAPLAGTCGVLLL QBEND10- SLVITLYCNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDV cys- SVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENES CD8a_TMD_ SQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDG GP130_ICD ILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQM withleader KMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ sequence 28 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGIYIWAPLAGTCGVLLL QBEND10- SLVITLYCNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDV cys- SVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENES CD8a_TMD_ SQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDG GP130_ICD-L ILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQM withleader KMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQRIARL sequence EEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 29 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN QBEND10- SELASTANMLREQVAQLKQKVMNIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIW cys-L- PNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKS CD8a_TMD_ LDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYR GP130_ICD HQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSP withleader DISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQ sequence VERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 30 SPA-C7- MLVRRGARAGPRMPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSY gp130with QETTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSV leader ISTVFTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIKAE sequence IKCSGIREVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCSLL LAQSEVRPQCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKT PILLTCPTISILSFFSVALLVILACVLWNKRDLIKKHIWPNVPDPSKSHIAQWSPHTP PRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIG GSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLL DSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDE VRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPK SYLPQTVRQGGYMPQ 31 SPA-CD34- MLVRRGARAGPRMPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSY long-gp130 QETTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSV withleader ISTVFTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIKAE sequence IKCSGIREVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCSLL LAQSEVRPQCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKT AQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTP PRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIG GSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLL DSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDF VRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPK SYLPQTVRQGGYMPQ 32 SPA-CD34- MLVRRGARAGPRMPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSY gp130_ICD QETTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSV withleader ISTVFTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIKAE sequence IKCSGIREVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCSLL LAQSEVRPQCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKT AIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFN SKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMS SSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERP EDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQ ISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQT VRQGGYMPQ 33 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSPSSSDYSDLQRVKQELLEEVK QBEND10- KELQKVKEEIIEAFVQELRKRGSPAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLI VASP- KKHIWPNVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKKPFP GP130_ICD EDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVV withleader HSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQ sequence HESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGP GTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 34 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSAQGEIEAIVVPVCLAFLLTTL QBEND10- LGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVS GP130_ICD- VVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESS VASPwith QNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGI leader LPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMK sequence MFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQPSSSDY SDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKRGSP 35 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGPSSSDYSDLQRVKQEL QBEND10- LEEVKKELQKVKEEIIEAFVQELRKRGSPAQGEIEAIVVPVCLAFLLTTLLGVLFCFN cys-VASP- KRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEAND GP130_ICD KKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQ withleader YSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFK sequence QNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAA DAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 36 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAF QBEND10- LLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGN cys- FTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSD GP130_ICD- ENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVD VASPwith GGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCG leader SGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQP sequence SSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKRGSP 37 SPA-CD34- MLVRRGARAGPRMPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSY TpoR-gp130 QETTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSV ISTVFTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIKAE IKCSGIREVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCSLL LAQSEVRPQCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKT ISLVTALLLVLGLNAVLGLLLLRKQFPAHYRRLRHAIWPNVPDPSKSHIAQWSPHTPP RHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGG SSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLD SEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFV RLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKS YLPQTVRQGGYMPQ 38 SPA-TpoR- MALPVTALLLPLALLLHAARPSDPTRVETATETAWISLVTALLLVLGLNAVLGLLLLR gp130_ICD KQFPAHYRRLRHAIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVE withleader IEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNT sequence SSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPR QQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQ EVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 39 SPA-EpoR- MDHLGASLWPQVGSLCLLLAGAAWAPPPNLPDPKFESKAALLAARGPEELLCFTERLE gp130_ICD DLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSS withleader FVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGHVVLRWLPPPETPM sequence TSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGF WSAWSEPVSLLTPSDLDPLILTLSLILVVILVLLTVLALLSNKRDLIKKHIWPNVPDP SKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKK EKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSV QVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFE RSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFET VGMEAATDEGMPKSYLPQTVRQGGYMPQ 40 SPA-EpoR- MDHLGASLWPQVGSLCLLLAGAAWAPPPNLPDPKFESKAALLAARGPEELLCFTERLE gp130with DLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSS leader FVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGHVVLRWLPPPETPM sequence TSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGF WSAWSEPVSLLTPSDLDPAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPS KSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLEKKE KINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQ VFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFER SKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETV GMEAATDEGMPKSYLPQTVRQGGYMPQ 41 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSAPPPNLPDPKFESKAALLAAR QBEND10- GPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAV EpoR- RFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGH gp130_ICD VVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFA withleader VRARMAEPSFGGFWSAWSEPVSLLTPSDLDPLILTLSLILVVILVLLTVLALLSNKRD sequence LIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKP FPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYST VVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNC SQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAF GPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 42 SPA- MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSAPPPNLPDPKFESKAALLAAR QBEND10- GPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAV EpoR-gp130 RFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGH withleader VVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFA sequence VRARMAEPSFGGFWSAWSEPVSLLTPSDLDPAIVVPVCLAFLLTTLLGVLFCFNKRDL IKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPF PEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTV VHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCS QHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFG PGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 43 SPA MGSSKSKPKDPSQRRRRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR Myr(Src)-L- DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKK gp130 PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 44 SPA- MGSSKSKPKDPSQRRRNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY Myr(Src)- SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI gp130-L SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 45 SPA- MGCVQCKDKEATKLTERIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR Myr(Fyn)-L- DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK gp130 PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 46 SPA- MGCVQCKDKEATKLTENKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY Myr(Fyn)- SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI gp130-L SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 47 SPA- MGCGCSSHPEDDWMENNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY Myr(Lck)- SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI gp130-L SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 48 SPA- MGCGCSSHPEDDWMENRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR Myr(Lck)-L- DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK gp130 PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 49 SPA- MGSSKSKPKDPSQRRRPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKR Myr(Src)- GSPGSNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVV VASP-gp130 EIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQN ICD TSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILP RQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMF QEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 50 SPA- MGSSKSKPKDPSQRRRNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY Myr(Src)- SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI gp130ICD- SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV VASP DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQGSPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKRGSP 51 Myr(Src)-L- MGSSKSKPKDPSQRRRRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR gp130 DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 52 Myr(Src)- MGSSKSKPKDPSQRRRNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY gp130-L SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 53 Myr(Fyn)- MGCVQCKDKEATKLTENKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY gp130-L SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 54 Myr(Lck)- MGCGCSSHPEDDWMENNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY gp130-L SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMN 55 Myr(Lck)-L- MGCGCSSHPEDDWMENRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR gp130 DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 56 Myr(Src)- MGSSKSKPKDPSQRRRPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKR VASP-gp130 GSPGSNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVV ICD EIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQN TSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILP RQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMF QEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 57 Inducible MALPVTALLLPLALLLHAARPELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQN (CAR-SPA) SELASTANMLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIK QBEND10- KHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPE Cys-L-GP130 DLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVH withleader SGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQH sequence ESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPG TEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 58 myr(Src)-JUN- MGSSKSKPKDPSQRRRRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNNKR gp130 DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 59 Gp130 MLTLQTWLVQALFIFLTTESTGELLDPCGYISPESPVVQLHSNFTAVCVLKEKCMDYF (IL6ST) HVNANYIVWKTNHFTIPKEQYTIINRTASSVTFTDIASLNIQLTCNILTFGQLEQNVY UniProt GITIISGLPPEKPKNLSCIVNEGKKMRCEWDGGRETHLETNFTLKSEWATHKFADCKA P40189 KRDTPTSCTVDYSTVYFVNIEVWVEAENALGKVTSDHINFDPVYKVKPNPPHNLSVIN SEELSSILKLTWTNPSIKSVIILKYNIQYRTKDASTWSQIPPEDTASTRSSFTVQDLK PFTEYVFRIRCMKEDGKGYWSDWSEEASGITYEDRPSKAPSFWYKIDPSHTQGYRTVQ LVWKTLPPFEANGKILDYEVTLTRWKSHLQNYTVNATKLTVNLTNDRYLATLTVRNLV GKSDAAVLTIPACDFQATHPVMDLKAFPKDNMLWVEWTTPRESVKKYILEWCVLSDKA PCITDWQQEDGTVHRTYLRGNLAESKCYLITVTPVYADGPGSPESIKAYLKQAPPSKG PTVRTKKVGKNEAVLEWDQLPVDVQNGFIRNYTIFYRTIIGNETAVNVDSSHTEYTLS SLTSDTLYMVRMAAYTDEGGKDGPEFTFTTPKFAQGEIEAIVVPVCLAFLLTTLLGVL FCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEI EANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTS STVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQ QYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQE VSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 60 Gp130 NKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEAN (IL6ST) DKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTV intracellular QYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYF signaling KQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSA domain ADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 61 Gp130 AIVVPVCLAFLLTTLLGVLFCF (IL6ST) transmembrane domain 62 L-gp130 MALPVTALLLPLALLLHAARPDYKDDDDKELCGGRIARLEEKVKTLKAQNSELASTAN MLREQVAQLKQKVMNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVP DPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLF KKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVP SVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISH FERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERF ETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 63 QBEND10- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV Cys-L- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH GP130Y759F TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG IGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRHQVPSVQVFSRSESTQP LLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEE DFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGM PKSYLPQTVRQGGYMPQ 64 QBEND10- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV Cys-L- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH GP130Y759F TPPRHNfNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG 4771-811 IGGSSCMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRHQQQYFKQNCSQHESSP DISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQ VERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 65 QBEND10- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV Cys-L- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH GP1304771- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG 811 IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQQQYFKQNCSQHESSP DISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQ VERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 66 QBEND10-L- ELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQG GP130 EIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRH NFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSS CMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSE ERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRL KQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYL PQTVRQGGYMPQ 67 QBEND10-L- ELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQG GP130Y759F EIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRH NFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSS CMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRHQVPSVQVFSRSESTQPLLDSE ERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRL KQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYL PQTVRQGGYMPQ 68 QBEND10-L- ELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQG GP130Y759F EIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRH 4771-811 NFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSS CMSSSRPSISSSDENESSQNTSSTVQFSTVVHSGYRHQQQYFKQNCSQHESSPDISHF ERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFE TVGMEAATDEGMPKSYLPQTVRQGGYMPQ 69 QBEND10-L- ELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQG GP1304771- EIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRH 811 NFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSS CMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQQQYFKQNCSQHESSPDISHF ERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFE TVGMEAATDEGMPKSYLPQTVRQGGYMPQ 70 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L-GP130 TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQP LLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEE DFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGM PKSYLPQTVRQGGYMPQ 71 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPETVQYSTVVHSGYRHQVPSVQV GP130_ICD_ FSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERS trunc1 KQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVG MEAATDEGMPKSYLPQTVRQGGYMPQ 72 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG GP1304771- IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQQQYFKQNCSQHESSP 811(ICD_trunc2) DISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQ VERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 73 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG GP130_ICD_ IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQP trunc3 LLDSEERPEDLQLVDHVDGGDGILPRQQYFKQPKSYLPQTVRQGGYMPQ 74 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPETVQYSTVVHSGYRHQQQYFKQ GP130_ICD_ NCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAAD trunc4 AFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 75 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG GP130_ICD_ IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQQQYFKQPKSYLPQTV trunc5 RQGGYMPQ 76 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNfNSKDQMYSDGNFTDVSVVEIEANDKKPFPETVQYSTVVHSGYRHQVPSVQV GP130_ICD_ FSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQPKSYLPQTVRQGGYMPQ trunc6 77 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH Cys-L- TPPRHNfNSKDQMYSDGNFTDVSVVEIEANDKKPFPETVQYSTVVHSGYRHQQQYFKQ GP130_ICD_ PKSYLPQTVRQGGYMPQ trunc7 78 SPA- ELPTQGTFSNVSTNVSRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQG QBEND10-L- EIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRH GP130 NFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSS CMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSE ERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRL KQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYL PQTVRQGGYMPQ 79 SPA- ELPTQGTFSNVSTNVSMVDPVGFAEAWKAQFPDSEPPRMELRSVGDIEQELERCKASI QBEND10- RRLEQEVNQERFRMIYLQTLLAKEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWP BCR-GP130 NVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSL DLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRH QVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPD ISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQV ERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 80 SPA ELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKH QBEND10- IWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDL cys-GP130 KSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSG YRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHES SPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTE GQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 81 SPA- ELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVP QBEND10- DPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLF cys- KKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVP GP130_TM_ SVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISH ICD FERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERF ETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 82 SPA- ELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKH IWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDL QBEND10- KSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSG cys-GP130-L YRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHES SPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTE GQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQRIARLEEKVKTLKAQNSELAST ANMLREQVAQLKQKVMN 83 SPA- ELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVP QBEND10- DPSKSHIAQWSPHTPPRHNfNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLF cys- KKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVP GP130_TM_ SVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISH ICD-L FERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERF ETVGMEAATDEGMPKSYLPQTVRQGGYMPQRIARLEEKVKTLKAQNSELASTANMLRE QVAQLKQKVMN 84 SPA- ELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKH QBEND10- RIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNIWPNVPDPSKSHIAQWSPH cys- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG GP130_TM_L- IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQP ICD LLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEE DFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGM PKSYLPQTVRQGGYMPQ 85 SPA- ELPTQGTFSNVSTNVSELCGGAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHRIARLE QBEND10- EKVKTLKAQNSELASTANMLREQVAQLKQKVMNIWPNVPDPSKSHIAQWSPHTPPRHN GP130_TM_L- FNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSC ICD MSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEE RPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLK QQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLP QTVRQGGYMPQ 86 SPA- ELPTQGTFSNVSTNVSFACDIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIWPNV QBEND10- PDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDL FACD- FKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQV CD8a_TMD_ PSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDIS GP130_ICD HFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVER FETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 87 SPA- ELPTQGTFSNVSTNVSTTTPAPRPPTPAPTIASQPLSLRPEACRPAAGGAVHTRGLDF QBEND10- ACDIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPP CD8a_hinge- RHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGG TMD_GP130_ SSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLD ICD SEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFV RLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKS YLPQTVRQGGYMPQ 88 SPA- ELPTQGTFSNVSTNVSELCGGIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIWPN QBEND10- VPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLD cys- LFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQ CD8a_TMD_ VPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDI GP130_ICD SHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVE RFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 89 SPA- ELPTQGTFSNVSTNVSELCGGIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIWPN QBEND10- VPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLD cys- LFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQ CD8a_TMD_ VPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDI GP130_ICD-L SHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVE RFETVGMEAATDEGMPKSYLPQTVRQGGYMPQRIARLEEKVKTLKAQNSELASTANML REQVAQLKQKVMN 90 SPA- ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV QBEND10- MNIYIWAPLAGTCGVLLLSLVITLYCNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPR cys-L- HNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGS CD8a_TMD_ SCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDS GP130_ICD EERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVR LKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSY LPQTVRQGGYMPQ 91 SPA-C7- SLDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVSQHGNEATTNITET gp130 TVKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKPSLSPGNVSDLS TTSTSLATSPTKPYTSSSPILSDIKAEIKCSGIREVKLTQGICLEQNKTSSCAEFKKD RGEGLARVLCGEEQADADAGAQVCSLLLAQSEVRPQCLLLVLANRTEISSKLQLMKKH QSDLKKLGILDFTEQDVASHQSYSQKTPILLTCPTISILSFFSVALLVILACVLWNKR DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 92 SPA-CD34- SLDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVSQHGNEATTNITET long-gp130 TVKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKPSLSPGNVSDLS TTSTSLATSPTKPYTSSSPILSDIKAEIKCSGIREVKLTQGICLEQNKTSSCAEFKKD RGEGLARVLCGEEQADADAGAQVCSLLLAQSEVRPQCLLLVLANRTEISSKLQLMKKH QSDLKKLGILDFTEQDVASHQSYSQKTAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKR DLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKK PFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYS TVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQN CSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADA FGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 93 SPA-CD34- LDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVSQHGNEATTNITETT gp130_ICD VKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKPSLSPGNVSDLST TSTSLATSPTKPYTSSSPILSDIKAEIKCSGIREVKLTQGICLEQNKTSSCAEFKKDR GEGLARVLCGEEQADADAGAQVCSLLLAQSEVRPQCLLLVLANRTEISSKLQLMKKHQ SDLKKLGILDFTEQDVASHQSYSQKTAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHI WPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLK SLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGY RHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESS PDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEG QVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 94 SPA- ELPTQGTFSNVSTNVSPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKR QBEND10 GSPAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSP VASP- HTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSS GP130_ICD GIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQ PLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNE EDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEG MPKSYLPQTVRQGGYMPQ 95 SPA- ELPTQGTFSNVSTNVSAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNV QBEND10- PDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDL GP130_ICD- FKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQV VASP PSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDIS HFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVER FETVGMEAATDEGMPKSYLPQTVRQGGYMPQPSSSDYSDLQRVKQELLEEVKKELQKV KEEIIEAFVQELRKRGSP 96 SPA- ELPTQGTFSNVSTNVSELCGGPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQ QBEND10- ELRKRGSPAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHI cys-VASP- AQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINT GP130_ICD EGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSR SESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQV SSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEA ATDEGMPKSYLPQTVRQGGYMPQ 97 SPA- ELPTQGTFSNVSTNVSELCGGAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKH QBEND10- IWPNVPDPSKSHIAQWSPHTPPRHNENSKDQMYSDGNFTDVSVVEIEANDKKPFPEDL cys- KSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSG GP130_ICD- YRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHES VASP SPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTE GQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQPSSSDYSDLQRVKQELLEEVKK ELQKVKEEIIEAFVQELRKRGSP 98 SPA-CD34- SLDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVSQHGNEATTNITET TpoR-gp130 TVKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKPSLSPGNVSDLS TTSTSLATSPTKPYTSSSPILSDIKAEIKCSGIREVKLTQGICLEQNKTSSCAEFKKD RGEGLARVLCGEEQADADAGAQVCSLLLAQSEVRPQCLLLVLANRTEISSKLQLMKKH QSDLKKLGILDFTEQDVASHQSYSQKTISLVTALLLVLGLNAVLGLLLLRKQFPAHYR RLRHAIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKP FPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYST VVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNC SQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAF GPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ 99 SPA-TpoR- SDPTRVETATETAWISLVTALLLVLGLNAVLGLLLLRKQFPAHYRRLRHAIWPNVPDP gp130_ICD SKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKK EKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSV QVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFE RSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFET VGMEAATDEGMPKSYLPQTVRQGGYMPQ 100 SPA-EpoR- APPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQ gp130_ICD LEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHIN EVVLLDAPVGLVACLADESGHVVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEI LEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGFWSAWSEPVSLLTPSDLDPLILTLS LILVVILVLLTVLALLSNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQM YSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPS ISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQL VDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHI SQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGG YMPQ 101 SPA-EpoR- APPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQ gp130 LEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHIN EVVLLDAPVGLVACLADESGHVVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEI LEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGFWSAWSEPVSLLTPSDLDPAIVVPV CLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMY SDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSI SSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLV DHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHIS QSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGY MPQ 102 SPA- ELPTQGTFSNVSTNVSAPPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEE QBEND10- AASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSSFVPLELRV EpoR- TAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGHVVLRWLPPPETPMTSHIRYEV gp130_ICD DVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGFWSAWSEPV SLLTPSDLDPLILTLSLILVVILVLLTVLALLSNKRDLIKKHIWPNVPDPSKSHIAQW SPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGH SSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSES TQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSV NEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATD EGMPKSYLPQTVRQGGYMPQ 103 SPA- ELPTQGTFSNVSTNVSAPPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEE QBEND10- AASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSSFVPLELRV EpoR-gp130 TAASGAPRYHRVIHINEVVLLDAPVGLVACLADESGHVVLRWLPPPETPMTSHIRYEV DVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGFWSAWSEPV SLLTPSDLDPAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWS PHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHS SGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSEST QPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVN EEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDE GMPKSYLPQTVRQGGYMPQ 104 Inducible ELPTQGTFSNVSTNVSELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKV (CAR-SPA) MNAQGEIEAIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPH QBEND10- TPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSG Cys-L-GP130 IGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQP LLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEE DFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGM PKSYLPQTVRQGGYMPQ 105 L-gp130 DYKDDDDKELCGGRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNAQGEIE AIVVPVCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNEN SKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMS SSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERP EDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQ ISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQT VRQGGYMPQ 238 QBEND10 ELPTQGTFSNVSTNVS (CD34 epitope) 239 BCRdomain MVDPVGFAEAWKAQFPDSEPPRMELRSVGDIEQELERCKASIRRLEQEVNQERERMIY LQTLLAKE 240 VASPdomain PSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKRGSPAQGEIE 241 EpoRdomain APPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQ withcysteine LEDEPWKLCRLHQAPTARGAVRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHIN mutation EVVLLDAPVGLVACLADESGHVVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEI LEGRTECVLSNLRGRTRYTFAVRARMAEPSFGGFWSAWSEPVSLLTPSDLDP 242 CD34 SLDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVSQHGNEATTNITET ectodomain TVKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKPSLSPGNVSDLS TTSTSLATSPTKPYTSSSPILSDIKAEIKCSGIREVKLTQGICLEQNKTSSCAEFKKD RGEGLARVLCGEEQADADAGAQVCSLLLAQSEVRPQCLLLVLANRTEISSKLQLMKKH QSDLKKLGILDFTEQDVASHQSYSQKT 243 TpoRdomain ISLVTALLLVLGLNAVLGLLLL