RAMP TAG FOR INSULIN OVEREXPRESSION AND METHOD FOR MANUFACTURING INSULIN USING SAME
20230272004 · 2023-08-31
Inventors
Cpc classification
C07K2319/35
CHEMISTRY; METALLURGY
C07K2319/20
CHEMISTRY; METALLURGY
International classification
Abstract
The present invention relates to a ramp tag for insulin overexpression represented by an amino acid sequence of RGSX.sub.1GGX.sub.2R (wherein, X.sub.1 means any amino acids in the number of 0-8, and X.sub.2 means S or T) and a method for manufacturing insulin, in which the ramp tag is applied, and when the ramp tag according to the present invention is used, in the process of manufacturing recombinant proteins, without changing the ORF sequence, such as the codon optimization method, translation efficiency of insulin is increased, thereby providing the effect of remarkably increasing the expression amount of insulin.
Claims
1. A ramp tag for insulin overexpression represented by a following amino acid sequence: RGSX.sub.1GGX.sub.2R, wherein, X.sub.1 represents 0-8 arbitrary amino acids and X.sub.2 represents S or T.
2. The ramp tag according to claim 1, wherein the ramp tag is represented by an amino acid sequence selected from the group consisting of SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 19, and SEQ ID NO: 24 to SEQ ID NO: 31.
3. A ramp tag represented by an amino acid sequence selected from the group consisting of SEQ ID NOS: 1 to 4, SEQ ID NOS: 6 to 8, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NOS: 13 to 18, and SEQ ID NOS: 20 to 23.
4. A nucleic acid encoding the ramp tag according to claim 1.
5. The nucleic acid according to claim 4, wherein the nucleic acid is represented by a nucleic acid sequence selected from the group consisting of SEQ ID NO: 22 to SEQ ID NO: 42.
6. An gene construct comprising the nucleic acid according to claim 4 and a nucleic acid encoding insulin or an insulin precursor.
7. The gene construct according to claim 6, further comprising a nucleic acid encoding a tag other than the ramp tag and/or a nucleic acid encoding a fusion protein.
8. The gene construct according to claim 7, wherein the tag other than the ramp tag is one or more selected from the group consisting of a His tag, HAK tag, T7 tag, S-tag, Flag-tag, HA-tag, V5 epitope, PelB, and Xpress epitope.
9. The gene construct according to claim 7, wherein the fusion protein is one or more selected from the group consisting of COMP, GST, MBP, NusA, CBP, GFP, Thioredoxin, Mistic, Sumo, and DSB.
10. The gene construct according to claim 6, wherein the insulin precursor is an insulin glargine precursor, an insulin aspart precursor, or an insulin lispro precursor.
11. A recombinant vector comprising the gene construct according to claim 6.
12. A recombinant cell comprising the gene construct according to claim 6.
13. The recombinant cell according to claim 12, wherein the recombinant cell is configured such that the gene construct is introduced into a host cell selected from the group consisting of Escherichia coli, yeast, a Chinese hamster ovary cell (CHO cell), and a human cell line.
14. A method of manufacturing insulin or an insulin precursor comprising: (a) expressing insulin or an insulin precursor by culturing the recombinant cell according to claim 12; and (b) recovering the expressed insulin or insulin precursor.
Description
DESCRIPTION OF DRAWINGS
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
MODE FOR INVENTION
[0036] Unless otherwise defined, all technical and scientific terms used herein have the same meanings as those typically understood by those skilled in the art to which the present invention belongs. Generally, the nomenclature used herein and the test method described below are well known in the art and are typical.
[0037] In the present invention, in order to increase the expression level of insulin in the process of producing insulin using a recombinant cell, various ramp tag sequences capable of increasing the expression level of insulin when expressed along with insulin are designed, and it is experimentally confirmed that the expression of insulin is significantly improved when such a ramp tag is applied to insulin expression.
[0038] Accordingly, an aspect of the present invention pertains to a ramp tag for insulin overexpression represented by the following amino acid sequence:
TABLE-US-00002 RGSX.sub.1GGX.sub.2R
[0039] (wherein, X.sub.1 represents 0-8 arbitrary amino acids and X.sub.2 represents S or T).
[0040] The ramp tag may be represented by an amino acid sequence selected from the group consisting of SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 19, and SEQ ID NO: 24 to SEQ ID NO: 31, but is not limited thereto.
[0041] In addition, the present invention pertains to a ramp tag represented by an amino acid sequence selected from the group consisting of SEQ ID NOS: 1 to 4, SEQ ID NOS: 6 to 8, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NOS: 13 to 18, and SEQ ID NOS: 20 to 23.
[0042] Another aspect of the present invention pertains to a nucleic acid encoding the ramp tag.
[0043] The nucleic acid is preferably represented by a sequence of 21 to 45 nucleic acids (in which the number of nucleic acids is 3n) comprising a nucleic acid sequence of AGA GGA TCA at the 5′ end and a sequence of GGA GGA TCA (or ACA) CGA at the 3′ end, but is not limited thereto.
[0044] In addition, the present invention pertains to a nucleic acid encoding the ramp tag, in which the nucleic acid is represented by a nucleic acid sequence selected from the group consisting of SEQ ID NO: 22 to SEQ ID NO: 42.
[0045] Still another aspect of the present invention pertains to an insulin construct comprising the nucleic acid described above and a nucleic acid encoding insulin or an insulin precursor.
[0046] In the present invention, the nucleic acid encoding the ramp tag and the nucleic acid encoding the insulin or insulin precursor may be comprised in the gene construct such that these nucleic acids are expressed in a fused form or independently. Also, the gene construct may further comprise a nucleic acid encoding a tag other than the ramp tag and/or a nucleic acid encoding a fusion protein. In a preferred embodiment, the tag other than the ramp tag and the fusion protein may be operably linked to the N-terminus, C-terminus or an internal portion of the ramp tag, to which insulin or an insulin precursor is further operably linked, so these may be expressed in a form of insulin in which the ramp tag, the other tag, and the fusion protein are fused together. In the present invention, when the tag other than the ramp tag or the fusion protein, which performs a special function with regard to solubility, detection, or localization, is bound to the inside or outside of the ramp tag and expressed along with insulin, it is possible not only to ensure the function of the other tag or the fusion protein but also to improve the expression level of insulin.
[0047] In the present invention, the tag other than the ramp tag may be one or more selected from the group consisting of a His tag, T7 tag, S-tag, Flag-tag, HA-tag, V5 epitope, PelB, and Xpress epitope, but is not limited thereto.
[0048] Also, in the present invention, the fusion protein may be one or more selected from the group consisting of COMP, GST, MBP, NusA, CBP, GFP, Thioredoxin, Mistic, Sumo, and DSB, but is not limited thereto.
[0049] The fused form of the ramp tag that may be used in the present invention and the tag other than the ramp tag may be represented by a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 76 to 88, but is not limited thereto.
[0050] The fused form of the ramp tag that may be used in the present invention and the fusion protein may be represented by a nucleic acid sequence selected from the group consisting of SEQ ID NO: 93 to SEQ ID NO: 96, but is not limited thereto.
[0051] In the present invention, the insulin precursor may be an insulin glargine precursor, an insulin aspart precursor, or an insulin lispro precursor, but is not limited thereto.
[0052] In the present invention, the insulin glargine precursor may be represented by the nucleic acid sequence of SEQ ID NO: 97, the insulin aspart precursor may be represented by the nucleic acid sequence of SEQ ID NO: 98, and the insulin lispro precursor may be represented by the nucleic acid sequence of SEQ ID NO: 99. However, in order to synthesize the amino acid sequence of the insulin glargine precursor, the insulin aspart precursor, or the insulin lispro precursor, each of the above nucleic acid sequences may be appropriately modified and applied within a range that is able to be easily modified and implemented by those skilled in the art.
[0053] Yet another aspect of the present invention pertains to a recombinant vector comprising the gene construct.
[0054] In the present invention, the gene construct may be introduced into a vector selected from the group consisting of a pQE30 vector, a pET vector, a pBAD vector, a pACYC vector, a pUC18 vector, a pUC1 vector, a pAPT vector, series vectors thereof, and other overexpression induction vectors known in the art, but the present invention is not limited thereto.
[0055] Still yet another aspect of the present invention pertains to a recombinant cell comprising the gene construct.
[0056] In the present invention, the recombinant cell is configured such that the gene construct is introduced into a host cell selected from the group consisting of Escherichia coli, yeast, a Chinese hamster ovary cell (CHO cell), and a human cell line. Here, the host cell is preferably selected from the group consisting of E. coli K-12, Saccharomyces cerevisiae, CHO-K1, and HEK 293t, and is more preferably E. coli K-12, but the scope of the present invention is not limited to the above examples.
[0057] A further aspect of the present invention pertains to a method of manufacturing insulin or an insulin precursor comprising:
[0058] (a) expressing insulin or an insulin precursor by culturing the recombinant cell; and
[0059] (b) recovering the expressed insulin or insulin precursor.
[0060] As used herein, the term “insulin precursor” refers to a single-stranded peptide comprising an insulin A-chain and an insulin B-chain, with a C-peptide therebetween, and may be used interchangeably with “proinsulin”. In the present invention, the insulin precursor conceptually comprises all precursor forms such as native insulin precursors, insulin analogue precursors, and derivatives thereof.
[0061] As used herein, the term “insulin” refers to a protein that controls blood sugar in the body. Native insulin is a hormone secreted by the pancreas, and typically promotes intracellular glucose uptake and inhibits the breakdown of fat, and thus plays a role in controlling blood sugar in the body. In the present invention, insulin conceptually comprises all forms such as native insulin, insulin analogues, and derivatives thereof.
[0062] For insulin, an insulin precursor (proinsulin) having no blood sugar control function is processed into insulin having a blood sugar control function. Insulin is composed of two polypeptide chains, particularly an A-chain and a B-chain, each comprising 21 and 30 amino acid residues, which are linked by two disulfide bridges. The A-chain and B-chain of native insulin may comprise the following amino acid sequences.
TABLE-US-00003 A-chain: Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys- Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn B-chain: Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu- Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg- Gly-Phe-Phe-Tyr-Thr-Pro-Lys-Thr
[0063] The insulin precursor and insulin used in the present invention may be of human origin, but the present invention is not limited thereto.
[0064] In the present invention, the insulin analogue comprises one in which the amino acid of the B-chain or the A-chain is mutated compared to the native type. The in-vivo blood sugar control function of the insulin analogue may be the same as or may correspond to that of native insulin. Specifically, the insulin analogue precursor or insulin analogue may be configured such that at least one amino acid of native insulin is subjected to any variation selected from the group consisting of substitution, addition, deletion, modification, and combinations thereof, but the present invention is not limited thereto.
[0065] The insulin analogue that may be used in the present invention comprises an insulin analogue made by genetic recombination technology, and the insulin analogue conceptually comprises inverted insulin, insulin variants, insulin fragments, and the like.
[0066] The derivative has a blood sugar control function in the body, exhibits homology to each of the amino acid sequences of the A-chain and B-chain of the native insulin or insulin analogue, and comprises a peptide in a form in which some groups of one amino acid residue are chemically substituted (e.g. alpha-methylation, alpha-hydroxylation), removed (e.g. deamination), or modified (e.g. N-methylation). The insulin fragment is a form in which at least one amino acid is added to or deleted from insulin, and the added amino acid may be an amino acid that does not exist in nature (e.g. a D-type amino acid), and such an insulin fragment plays a blood sugar control function in the body.
[0067] The insulin variant is a peptide having a sequence in which at least one amino acid is different from that of insulin, and plays a blood sugar control function in the body.
[0068] The insulin analogue, derivative, fragment and variant of the present invention may be used independently or in combination. For example, a peptide, which has a sequence in which at least one amino acid is different, in which the amino-terminal amino acid residue is subjected to deamination, and which plays a blood sugar control function in the body, is also comprised in the scope of the present invention.
[0069] Specifically, in an embodiment of the present invention, the insulin analogue may be insulin glargine. Insulin glargine is stabilized by substituting asparagine, which is the 21.sup.st amino acid of the A-chain of native insulin, with glycine, and is also made soluble at a weakly acidic pH by adding two arginines to the carboxy terminus of the B-chain. Here, insulin glargine is an insulin analogue developed such that it forms a microprecipitate in subcutaneous tissue when administered with an acidic solution (pH 4.0) and is slowly dissolved and released from the microprecipitate, which is an insulin glargine hexamer, whereby the action time is prolonged up to 24 hours. The A-chain and B-chain of insulin glargine may comprise the following amino acid sequences (U.S. Pat. No. 5,656,722).
TABLE-US-00004 A-chain: Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys- Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Gly B-chain: Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu- Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg- Gly-Phe-Phe-Tyr-Thr-Pro-Lys-Thr-Arg-Arg
[0070] In another embodiment, the insulin analogue in the present invention may be insulin aspart.
[0071] Insulin aspart is an insulin analogue developed such that proline, which is the 28.sup.th amino acid of the B-chain of native insulin, is substituted with aspartic acid to thus reduce the tendency of insulin molecules to self-associate in a hexamer form due to the repulsive force of the charge, so it is absorbed and acts faster than native insulin. The A-chain and B-chain of insulin aspart may comprise the following amino acid sequences (U.S. Pat. No. 5,618,913).
TABLE-US-00005 A-chain: Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser- Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn B-chain: Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val- Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg-Gly-Phe- Phe-Tyr-Thr-Asp-Lys-Thr
[0072] In another embodiment, the insulin analogue in the present invention may be insulin lispro.
[0073] Insulin lispro is an insulin analogue developed such that the positions of proline, which is the 28.sup.th amino acid, and lysine, which is the 29.sup.th amino acid, of the B-chain of native insulin, are exchanged to thus place lysine at the 28.sup.th amino acid and proline at the 29.sup.th amino acid, thereby increasing the uptake rate in the body. The A-chain and B-chain of insulin lispro may comprise the following amino acid sequences (EP 0383472).
TABLE-US-00006 A-chain: Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser- Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn B-chain: Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val- Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg-Gly-Phe- Phe-Tyr-Thr-Lys-Pro-Thr
[0074] The precursor of the insulin analogue described above may be represented by the following nucleic acid and amino acid sequences.
[0075] Nucleic acid sequence of insulin glargine precursor (SEQ ID NO: 97)
TABLE-US-00007 TTTGTGAACC AACACCTGTG CGGCTCACAC CTGGTGGAAG CTCTCTACCT AGTGTGCGGG GAACGAGGCT TCTTCTACAC ACCCAAGACC CGCCGGGAGG CAGAGGACCT GCAGGTGGGG CAGGTGGAGC TGGGCGGGGG CCCTGGTGCA GGCAGCCTGC AGCCCTTGGC CCTGGAGGGG TCCCTGCAGA AGCGTGGCAT TGTGGAACAA TGCTGTACCA GCATCTGCTC CCTCTACCAG CTGGAGAACT ACTGCGGCTA A
[0076] Nucleic acid sequence of insulin aspart precursor (SEQ ID NO: 98)
TABLE-US-00008 TTTGTGAATC AGCATCTGTG TGGTAGCCAT CTGGTTGAAG CACTGTATCT GGTTTGTGGT GAACGTGGTT TTTTCTACAC CGATAAAACC GAAG CAGAAGATCT GCAGGTTGGT CAGGTTGAAT TAGGTGGTGG TCCTGGTGCA GGTAGCCTGC AGCCGCTGGC ACTGGAAGGT AGTCTGCAGA AACGTGGTAT TGTTGAACAG TGTTGTACCA GCATTTGTAG CCTGTATCAG CTGGAAAACT ATTGCAACTA A
[0077] Nucleic acid sequence of insulin lispro precursor (SEQ ID NO: 99)
TABLE-US-00009 TTTGTTAATC AGCATCTGTG TGGTAGCCAT CTGGTTGAAG CACTGTATCT GGTTTGTGGT GAACGTGGTT TTTTCTATAC CAAACCGACA GAAG CAGAAGATCT GCAGGTTGGT CAGGTTGAAT TAGGTGGTGG TCCTGGTGCA GGTAGCCTGC AGCCGCTGGC ACTGGAAGGT AGTCTGCAGA AACGTGGTAT TGTTGAACAG TGTTGTACCA GCATTTGTAG CCTGTATCAG CTGGAAAACT ATTGCAACTA A
[0078] Amino acid sequence of insulin glargine precursor (SEQ ID NO: 100)
TABLE-US-00010 FVNQHLCGSH LVEALYLVCG ERGFFYTPKT RREAEDLQVG QVELGGGPGA GSLQPLALEG SLQKRGIVEQ CCTSICSLYQ LENYCG
[0079] Amino acid sequence of insulin aspart precursor (SEQ ID NO: 101)
TABLE-US-00011 FVNQHLCGSH LVEALYLVCG ERGFFYTDKT EAEDLQVG QVELGGGPGA GSLQPLALEG SLQKRGIVEQ CCTSICSLYQ LENYCN
[0080] Amino acid sequence of insulin lispro precursor (SEQ ID NO: 102)
TABLE-US-00012 FVNQHLCGSH LVEALYLVCG ERGFFYTKPT EAEDLQVG QVELGGGPGA GSLQPLALEG SLQKRGIVEQ CCTSICSLYQ LENYCN
[0081] Amino acid sequence of insulin aspart precursor (SEQ ID NO: 103)
TABLE-US-00013 FVNQHLCGSH LVEALYLVCG ERGFFYTDKT RREAEDLQVG QVELGGGPGA GSLQPLALEG 60 SLQKRGIVEQ CCTSICSLYQ LENYCN
[0082] Amino acid sequence of insulin lispro precursor (SEQ ID NO: 104)
TABLE-US-00014 FVNQHLCGSH LVEALYLVCG ERGFFYTKPT RREAEDLQVG QVELGGGPGA GSLQPLALEG 60 SLQKRGIVEQ CCTSICSLYQ LENYCN
[0083] For reference, all of insulin glargine, aspart, and lispro may have the amino acid sequence of RR between the B-chain and the C-chain in the precursor state, and may show activity as short-acting insulin when the RR sequence belongs to the C-chain and is removed, depending on the type of cleavage enzyme that is used, and may act as long-acting insulin when the RR sequence remains in the B-chain. Accordingly, SEQ ID NO: 101 and SEQ ID NO: 102 may use a sequence in a form in which Arg-Arg is added between the B-chain and the C-chain.
[0084] As used herein, the term “ramp tag” refers to a synthetic tag that is able to solve the problem of instability of a translation rate caused by poor compatibility between the codon in a foreign gene and a host when expressing a recombinant protein in a recombinant cell, and is based on the principle by which the expression level of a protein is regulated only by allowing the tag to behave together inside/outside of the target gene in the transcription or translation step (inducing adoption of tRNA and providing reuse thereof) without changing the original codon sequence, unlike the existing codon optimization method that solves the problem of rare codons. A detailed description of the ramp tag in the present invention may comprise the contents set forth in Korean Patent No. 10-1446054.
[0085] As used herein, the term “gene construct” refers to a structure comprising at least one nucleic acid sequence encoding a polypeptide. In an embodiment, there is provided a gene construct comprising a nucleic acid encoding the ramp tag; and a nucleic acid encoding a tag other than the ramp tag and/or a nucleic acid encoding a fusion protein, and in another embodiment, there is provided a gene construct comprising a nucleic acid encoding the ramp tag; and a nucleic acid encoding insulin, and in still another embodiment, there is provided a gene construct comprising a nucleic acid encoding the ramp tag; a nucleic acid encoding a tag other than the ramp tag and/or a nucleic acid encoding a fusion protein; and a nucleic acid encoding insulin, in which a nucleic acid encoding a polypeptide having other functions may be further comprised, as necessary. Here, these nucleic acids are preferably operably linked so as to be expressed by one promoter, but the nucleic acid encoding the ramp tag may be located so as to be separately translated by a separate promoter.
[0086] As used herein, the term “operably linked” refers to a functional linkage between a nucleic acid expression control sequence (e.g. a promoter, a signal sequence, or a series of transcription-factor-binding sites) and a heterologous nucleic acid sequence. Here, the expression control sequence affects the transcription and/or translation of a nucleic acid corresponding to the heterologous nucleic acid sequence.
[0087] As used herein, the term “recombinant” refers to any material (e.g. a vector or microorganism) that may be formed as a result of genetic manipulation.
[0088] As used herein, the term “vector” refers to a DNA product containing a DNA sequence that is operably linked to a suitable control sequence capable of expressing DNA in a suitable host. The vector may be a plasmid, a phage particle, or a simple potential genomic insert. When transformed into an appropriate host, the vector may replicate and function independently of the host genome, or in some cases may be integrated into the genome itself. Since a plasmid is currently the most commonly used form of the vector, the terms “plasmid” and “vector” are sometimes used interchangeably in the specification of the present invention. For the purposes of the present invention, it is preferred to use a plasmid vector. A typical plasmid vector that may be used for this purpose is configured to comprise (a) a replication origin that allows efficient replication to comprise ones to hundreds of plasmid vectors in each host cell, (b) an antibiotic resistance gene that enables selection of a host cell transformed with a plasmid vector, and (c) a restriction enzyme cleavage site into which a foreign DNA fragment may be inserted. Even when an appropriate restriction enzyme cleavage site does not exist, the vector and foreign DNA may be easily ligated using a synthetic oligonucleotide adapter or linker according to a typical method. After ligation, the vector has to be transformed into an appropriate host cell. Transformation may be easily accomplished using a calcium chloride (CaCl.sub.2) process or an electroporation process (Neumann, et al., EMBO J., 1:841, 1982).
[0089] As the vector that is used for overexpression of the gene according to the present invention, an expression vector known in the art may be used. A base sequence is said to be “operably linked” when placed in a functional relationship with another nucleic acid sequence. This may be a gene and control sequence(s) linked in such a way as to enable gene expression when an appropriate molecule (e.g. a transcriptional activation protein) is bound to the control sequence(s). For example, DNA for a pre-sequence or secretory leader is operably linked to DNA for a polypeptide when expressed as a preprotein that participates in the secretion of the polypeptide, a promoter or enhancer is operably linked to a coding sequence when it affects the transcription of the sequence, a ribosome-binding site is operably linked to a coding sequence when it affects transcription of the sequence, or a ribosome-binding site is operably linked to a coding sequence when placed to facilitate translation. In general, “operably linked” means that the linked DNA sequence is in contact therewith, or that a secretory leader is in contact therewith and is present in the reading frame. However, the enhancer need not be in contact therewith. The linking of these sequences is accomplished by ligation (linkage) at a convenient restriction enzyme site. When no such site exists, a synthetic oligonucleotide adapter or linker according to a typical method is used.
[0090] As is well known in the art, in order to increase the expression level of a transformed gene in a host cell, the gene has to be operably linked to a transcriptional and translational expression control sequence that functions in the selected expression host. Preferably, the expression control sequence and the corresponding gene are comprised in a single recombinant vector that comprises both a bacterial selection marker and a replication origin.
[0091] The recombinant cell into which the gene construct described above is introduced or which is transformed by the recombinant vector constitutes another aspect of the present invention. As used herein, the term “transformation” refers to introduction of DNA into a host such that DNA becomes replicable as an extrachromosomal factor or through chromosomal integration.
[0092] Here, it is to be understood that not all vectors function equally in expressing the DNA sequence of the present invention. Likewise, not all hosts function equally for the same expression system. However, those skilled in the art will be able to make an appropriate selection from among various vectors, expression control sequences and hosts without undue experimentation and without departing from the scope of the present invention. For example, a vector may be selected in consideration of the host. This is because the vector has to be replicated in the host. The number of copies of a vector, ability to control the number of copies, and expression of another protein encoded by the vector, for example, an antibiotic marker, also have to be taken into consideration.
[0093] Moreover, the gene construct according to the present invention may be introduced into the genome of a host cell and may be present as a chromosomal factor. It will be apparent to those skilled in the art to which the present invention belongs that, even when the gene construct is inserted into the genome chromosome of a host cell, it has the same effect as when the recombinant vector is introduced into the host cell as described above.
[0094] A better understanding of the present invention may be obtained through the following examples. These examples are merely set forth to illustrate the present invention, and are not to be construed as limiting the scope of the present invention.
EXAMPLES
Example 1: Design of Ramp Tag Fused to Insulin Gene
[0095] For the configuration of a rare codon for manufacturing a ramp tag, a rare codon table of E. coli as shown in Table 1 below was created by choosing a method that takes into consideration codon frequency or isoacceptor tRNA, and, only for sequences that are likely to be depleted of amino acids, normal codons were mixed, or the length of tags was limited, using a translational pairing technique.
[0096] A ramp tag suitable for an insulin protein was designed as follows.
[0097] 1. Creating a rare codon table depending on a host
[0098] 2. Preparing the DNA sequence of an insulin gene with codons
[0099] 3. Analyzing the frequency of appearance and location of codons in the rare codon table of E. coli on the ORF of a target gene
[0100] 4. Collecting rare codons and placing the same in the order of appearance on the ORF
[0101] 5. Inducing expression by substituting the 5′ or 3′ of the target gene or the N-terminus (1-20 codons corresponding to the tag length) of the target protein with a tag.
TABLE-US-00015 2 Box tRNA Set AA condon Frequency tRNA H His CAC 0.97 1 C Cys UGU 0.50 UGC 0.64 1 4 Box tRNA Set G GGA GGA 0.79 1 P Pro CCU 0.70 CCC 0.55 1 CCA 0.84 1 T Thr ACU 0.89 ACA 0.70 1 6 Box tRNA Set S Ser UCU 0.80 UCC 0.86 2 UCA 0.71 1 R Arg CGA 0.35 AGG 0.11 1 AGA 0.20 1 L Leu CUA 0.39 1 3 Box tRNA Set I Ile AUA 0.43
Example 2: Sequence and Configuration of Ramp Tag Fused to Insulin Gene
[0102] In order to provide a ramp tag that induces overexpression of insulin, which is a protein that is difficult to express, for the purpose of enabling mass production in E. coli, the functionality thereof was demonstrated in a form fused with a target gene.
[0103] A gene construct in which a ramp tag customized to improve the low expression level of difficult-to-express proteins and the other tag were fused was designed. As shown in
TABLE-US-00016 TABLE 2 Amino acid sequence Nucleic acid sequence
indicates data missing or illegible when filed
Example 3: Analysis of Expression Level of Insulin by Fusion of Ramp Tag
[0104] Among the designed ramp tags in Table 2, the ramp tags of Ramp 5 and Ramp 9 were selected to induce expression in the form fused with insulin in an induction vector capable of causing a lack of tRNA. Here, the nucleic acid sequence of the ramp tag was located in front of the insulin gene to induce expression in the form fused with the insulin protein. pQE30 and pAPT were used as induction vectors for confirming the overexpression effect by the ramp tag. As a control group, a fusion protein in which MBP or GST was fused with insulin was used.
[0105] The cloning of the gene for expression of the fusion protein was carried out as shown in
[0106] The expression levels of the proteins after addition of IPTG in the induction vector were compared using SDS-PAGE, and the results thereof are shown in
Example 4: Effect of Simultaneous Application of Histidine Tag and Ramp Tag
[0107] Whether it is possible to induce overexpression of an insulin protein even when the ramp tag is used together with various tags for other purposes was evaluated. To this end, as shown in Table 3 below, a protein in which a histidine tag, which is commonly used as a universal tag, was fused together with the ramp tag was expressed. In addition, a complex histidine synthetic tag (histidine-alanine-lysine-(4x-alanine-histidine)-glycine-histidine-alanine-histidine (HAK)) in which histidine, alanine and lysine were used in combination was manufactured in the form of a modified histidine tag so that it was co-expressed between the ramp tags to thus express the insulin fusion protein. The size of the insulin protein comprising the histidine tag and the ramp tag was 1 kDa, and the cloning process was applied in the same manner as in Example 3.
TABLE-US-00017 TABLE 3 Gene notation Amino acid sequence Base sequence Ramp i-His RGSHHHHHHLLRIRIFTGGSR AGA GGA TCA CAT CAC CAT CAC (SEQ ID NO: 63) CAT CAC CTA CTA CGC ATA CGC ATA TTT ACA GGA GGA TCA CGA (SEQ ID NO: 76) Ramp 5-His RGSHHHHHHRCRCFTFTGGSR AGA GGA TCA CAT CAC CAT CAC (SEQ ID NO: 64) CAT CAC CGC TGT CGC TGT TTT ACA TTT ACA GGA GGA TCA CGA (SEQ ID NO: 77) Ramp 9-His RGSHHHHHHLLRCRCFTGGSR AGA GGA TCA CAT CAC CAT CAC (SEQ ID NO: 65) CAT CAC CTA CTA CGC TGT CGC TGT TTT ACA GGA GGA TCA CGA (SEQ ID NO: 78) Ramp H-His RGSHHHHHHGGSR AGA GGA TCA CAT CAC CAT CAC (SEQ ID NO: 66) CAT CAC GGA GGA TCA CGA (SEQ ID NO: 79) Ramp D5-His RGSDHHHHHGGSR AGA GGA TCA GAC CAT CAC CAT (SEQ ID NO: 67) CAC CAT GGA GGA TCA CGA (SEQ ID NO: 80) Ramp D6-His RGSDHHHHHHGGSR AGA GGA TCA GAC CAT CAC CAT (SEQ ID NO: 68) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 81) Ramp G5-His RGSGHHHHHGGSR AGA GGA TCA GGC CAT CAC CAT (SEQ ID NO: 69) CAC CAT GGA GGA TCA CGA (SEQ ID NO: 82) Ramp G6-His RGSGHHHHHHGGSR AGA GGA TCA GGC CAT CAC CAT (SEQ ID NO: 70) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 83) Ramp A-His RGSAHHHHHHGGSR AGA GGA TCA GCC CAT CAC CAT (SEQ ID NO: 71) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 84) Ramp K-His RGSKHHHHHHGGSR AGA GGA TCA AAA CAT CAC CAT (SEQ ID NO: 72) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 85) Ramp N-His RGSNHHHHHHGGSR AGA GGA TCA AAC CAT CAC CAT (SEQ ID NO: 73) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 86) Ramp R-His RGSRHHHHHHGGSR AGA GGA TCA CGT CAT CAC CAT (SEQ ID NO: 74) CAC CAT CAC GGA GGA TCA CGA (SEQ ID NO: 87) Ramp HAK RGSHAKAHAHAHAHGHAHGGSR AGA GGA TCA CAC GCA AAA GCG (SEQ ID NO: 75) CAC GCC CAC GCA CAC GCG CAT GGA CAC GCT CAT GGA GGA TCA CGA (SEQ ID NO: 88)
[0108] As shown in
Example 5: Effect of Simultaneous Application of Fusion Protein COMP and Ramp Tag
[0109] Whether it is possible to induce overexpression of an insulin protein even when the ramp tag is used together with fusion proteins for other purposes was evaluated. To this end, as shown in Table 4 below, COMP, a small fusion protein, was simultaneously applied with each of four ramp tags (Ramp 5, Ramp 9, Ramp 12, and Ramp 19) among the ramp tags shown in Table 2 to express an insulin protein. In Table 4 below, the ramp tag sequence is shown in bold text, and the COMP sequence, which is a fusion protein, is shown in plain text. The fusion protein COMP was located at the C-terminus of the ramp tag, the size of the insulin protein in which the ramp tag and COMP were fused was 18 kDa, and the cloning process was applied in the same manner as in Example 3.
TABLE-US-00018 TABLE 4 Fusion protein Amino acid sequence Base sequence Ramp 5 COMP RGSRCRCFTFTGGSRGDLGPQML AGAGGATCACGCTGTCGCTGTTTTACATTTACAG RELQETNAALQDVRDYLRQLVRE GAGGATCACGAGGAGACCTGGGCCCGCAGATGCT ITFLKNTVMECDACGMQQ GCGTGAACTGCAGGAAACCAACGCTGCTCTGCAG (SEQ ID NO: 89) GACGTTCGTGACTACCTGCGTCAGCTGGTTCGTG AAATCACCTTCCTGAAAAACACCGTTATGGAATG CGACGCTTGCGGTATGCAGCAG (SEQ ID NO: 93) Ramp 9 RGSLLRCRCFTGGSRGDLGPQML AGAGGATCATTGCTACGCTGTCGCTGTTTTACAG RELQETNAALQDVRDYLRQLVRE GAGGATCACGAGGAGACCTGGGCCCGCAGATGCT ITFLKNTVMECDACGMQQ GCGTGAACTGCAGGAAACCAACGCTGCTCTGCAG (SEQ ID NO: 90) GACGTTCGTGACTACCTGCGTCAGCTGGTTCGTG AAATCACCTTCCTGAAAAACACCGTTATGGAATG CGACGCTTGCGGTATGCAGCAG (SEQ ID NO: 94) Ramp 12 RGSLLLLRTRTGGSRGDLGPQML AGAGGATCATTGCTATTGCTACGCACACGCACAG RELQETNAALQDVRDYLRQLVRE GAGGATCACGAGGAGACCTGGGCCCGCAGATGCT ITFLKNTVMECDACGMQQ GCGTGAACTGCAGGAAACCAACGCTGCTCTGCAG (SEQ ID NO: 91) GACGTTCGTGACTACCTGCGTCAGCTGGTTCGTG AAATCACCTTCCTGAAAAACACCGTTATGGAATG CGACGCTTGCGGTATGCAGCAG (SEQ ID NO: 95) Ramp 19 RGSFTLLLLRCGGSRGDLGPQML AGAGGATCATTTACATTGCTATTGCTACGCTGTG RELQETNAALQDVRDYLRQLVRE GAGGATCACGAGGAGACCTGGGCCCGCAGATGCT ITFLKNTVMECDACGMQQ GCGTGAACTGCAGGAAACCAACGCTGCTCTGCAG (SEQ ID NO: 92) GACGTTCGTGACTACCTGCGTCAGCTGGTTCGTG AAATCACCTTCCTGAAAAACACCGTTATGGAATG CGACGCTTGCGGTATGCAGCAG (SEQ ID NO: 96)
[0110] As shown in
[0111] From the above description, those skilled in the art will appreciate that the present invention may be embodied in other specific forms without changing the technical spirit or essential features thereof. In this regard, the embodiments described above should be understood to be non-limiting and illustrative in every way. The scope of the present invention is defined by the claims below rather than the aforementioned detailed description, and all changes or modified forms that are capable of being derived from the meaning, range, and equivalent concepts of the appended claims should be construed as being comprised in the scope of the present invention.
INDUSTRIAL APPLICABILITY
[0112] When a ramp tag according to the present invention is used, in the process of manufacturing recombinant insulin, without changing the ORF sequence, such as a codon optimization method, the translation rate of insulin is increased, thereby notably increasing the expression level of insulin.