TRANSLATION ENHANCER FOR USE IN CELL-FREE PROTEIN SYNTHESIS SYSTEM AND USE THEREOF
20180119154 · 2018-05-03
Inventors
Cpc classification
C12N2830/50
CHEMISTRY; METALLURGY
C12N15/1068
CHEMISTRY; METALLURGY
C12N15/67
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N15/63
CHEMISTRY; METALLURGY
International classification
C12N15/63
CHEMISTRY; METALLURGY
Abstract
This specification provides a translation enhancer that makes it possible to obtain translation template mRNA efficiently in a cell-free protein synthesis system and, in turn, to realize excellent translation efficiency. To that end, in this specification, a nucleic acid of at most 200 bases in length, serving as a 3′ untranslated region linked adjacent to the 3′ end of a coding region that encodes the amino acid sequence of a desired protein, is used as a translation enhancer in a cell-free protein synthesis system.
Claims
1-29. (canceled)
30. A method for producing transcription template DNA to be used for a cell-free protein synthesis system, wherein the method comprises a step for carrying out a nucleic acid amplification reaction on DNA including a coding region of a desired protein using a first forward primer and a first reverse primer, a step for carrying out a nucleic acid amplification reaction on DNA including the coding region of the desired protein using a second forward primer and a second reverse primer, the second reverse primer comprises at least a part of translation enhancer which is a nucleic acid as a 3′ untranslated region linked adjacent to the 3′ end of the coding region that encodes an amino acid sequence of the desired protein, the 3′ untranslated region has a poly A sequence of two or more consecutive A and bases linked adjacent to the 3′ end of the poly A sequence.
31. The method for producing transcription template DNA of claim 30, wherein the 3′ untranslated region is at most 200 bases in length.
32. The method for producing transcription template DNA of claim 30, wherein the poly A sequence is from 5 to 40.
33. The method for producing transcription template DNA of claim 32, wherein the poly A sequence is from 5 to 20 and total length of the 3′ untranslated region is at most 50.
34. The method for producing transcription template DNA of claim 30, wherein the 3′ untranslated region is a base sequence represented by SEQ ID NO: 6 or 26, a base sequence in which from one to five bases have been substituted, added, inserted, or deleted in the said sequences, or a base sequence having identity of at least 85% with a base sequence represented by SEQ ID NO: 6 or 26, and has a base sequence having 3′ untranslated region activity.
35. The method for producing transcription template DNA of claim 30, wherein the poly A sequence is from 10 to 20 and the 3′ end of the poly A sequence does not include a base.
36. The method for producing transcription template DNA of claim 30, wherein the desired protein is a fusion protein provided with a protein tag at the C end of an arbitrary protein.
37. The method for producing transcription template DNA of claim 30, wherein the desired protein is a fusion protein provided with a protein tag at the N end of an arbitrary protein.
38. A translation enhancer to be used for a cell-free protein synthesis system, wherein the translation enhancer comprises a nucleic acid of at most 200 bases in length as a 3′ untranslated region linked adjacent to the 3′ end of a coding region that encodes an amino acid sequence of a desired protein, the 3′ untranslated region has a poly A sequence of two or more consecutive A and bases linked adjacent to the 3′ end of the poly A sequence.
39. The translation enhancer of claim 38, wherein the poly A sequence is from 5 to 40.
40. The translation enhancer of claim 39, wherein the poly A sequence is from 5 to 20 and total length of the 3′ untranslated region is at most 50.
41. The translation enhancer of claim 38, wherein the 3′ untranslated region is a base sequence represented by SEQ ID NO: 6 or 26, a base sequence in which from one to five bases have been substituted, added, inserted, or deleted in the said sequences, or a base sequence having identity of at least 85% with a base sequence represented by SEQ ID NO: 6 or 26, and has a base sequence having 3′ untranslated region activity.
42. The translation enhancer of claim 38, wherein the poly A sequence is from 10 to 20 and the 3′ end of the poly A sequence does not include base.
43. A template nucleic acid to be used for a cell-free protein synthesis system, wherein the template nucleic acid comprises a promoter region, a coding region that encodes an amino acid sequence of a desired protein linked operably by the promoter region, and, a 3′ untranslated region of the coding region, comprising the translation enhancer of claim 30.
44. The template nucleic acid of claim 43, wherein the coding region is a region that encodes a fusion protein provided with a protein tag at the C end of an arbitrary protein as the desired protein.
45. The template nucleic acid of claim 43, wherein the coding region is a region that encodes a fusion protein provided with a protein tag at the N end of an arbitrary protein as the desired protein.
46. The template nucleic acid of claim 43, wherein the template nucleic acid is transcription template DNA.
47. The template nucleic acid of claim 43, wherein the template nucleic acid is translation template mRNA.
48. A method for producing a translation template to be used for a cell-free protein synthesis system, wherein the method comprises a step for synthesizing translation template mRNA using the template nucleic acid of claim 43 in the absence of cells and in the presence of elements for transcribing transcription template DNA into mRNA.
49. A method for producing a protein, wherein the method comprises a step for synthesizing a protein using the template nucleic acid of claim 47 in the absence of cells and in the presence of elements for translating translation template mRNA into a protein.
50. An array of translation template DNA, wherein the array carries a plurality of the template nucleic acids of claim 43 corresponding to a plurality of proteins as transcription template DNA.
51. An RNA stabilizer, wherein the RNA stabilizer comprises a nucleic acid of at most 200 bases in length as a 3′ untranslated region linked adjacent to the 3′ end of a coding region that encodes an amino acid sequence of a desired protein, the 3′ untranslated region has a poly A sequence of two or more consecutive A and bases linked adjacent to the 3′ end of the poly A sequence.
52. The RNA stabilizer of claim 51, wherein the poly A sequence is from 5 to 40.
53. The RNA stabilizer of claim 52, wherein the poly A sequence is from 5 to 20 and total length of the 3′ untranslated region is at most 50.
54. The RNA stabilizer of claim 51, wherein the 3′ untranslated region is a base sequence represented by SEQ ID NO: 6 or 26, a base sequence in which from one to five bases have been substituted, added, inserted, or deleted in the said sequences, or a base sequence having identity of at least 85% with a base sequence represented by SEQ ID NO: 6 or 26, and has a base sequence having 3′ untranslated region activity.
55. The RNA stabilizer of claim 51, wherein the poly A sequence is from 10 to 20 and the 3′ end of the poly A sequence does not include base.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0066]-[0083] (The descriptions of the individual drawings are not reproduced in this text.)
DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0084] The disclosure of this specification relates to a translation enhancer for use in a cell-free protein synthesis system, template nucleic acids, primer sets, a method for producing transcription template DNA, a method for producing translation template mRNA, a method for producing a protein, a transcription template DNA array, and the like.
[0085] The translation enhancer disclosed in this specification, even though it is at most 200 bases in length, functions as the 3′ UTR of the coding region of translation template mRNA (also referred to as the 3′ UTR in transcription template DNA) in a cell-free protein synthesis system, and can contribute to efficient translation into a protein.
[0086] In addition, since the 3′ UTR is constituted of at most 200 bases, the length of the transcription template DNA can be shortened, and transcription template DNA can be obtained efficiently by a nucleic acid amplification reaction.
[0087] Furthermore, since the 3′ UTR is at most 200 bases in length, this makes it possible to easily obtain transcription template DNA for synthesizing a fusion protein provided with a protein tag at the C end of an arbitrary protein.
[0088] Specifically, since the 3′ UTR is short, a reverse primer provided with part or all of the 3′ UTR as a tag region or hybridization region can be designed. Transcription template DNA that encodes the desired protein provided with a protein tag at the C end can be obtained easily by conducting a nucleic acid amplification reaction using this reverse primer.
[0089] The template nucleic acids and primer sets used in a cell-free protein synthesis system, methods for producing such transcription template DNA, methods for producing translation template mRNA, methods for producing a protein by a cell-free protein synthesis system, and a transcription template DNA array, which are other embodiments of this disclosure, make it possible to synthesize a desired protein easily and efficiently by using a 3′ UTR of such a base length as a translation enhancer, and make it possible to supply the protein for various uses.
[0090] Furthermore, the term cell-free protein synthesis system in this specification means a translation system (so-called two-step method) constituted by an element composition that realizes a translation step from at least mRNA derived from a cell to a protein in the absence of cells. A cell-free protein synthesis system also means a system (so-called one-step method) provided with both a translation system and a transcription system constituted by an element composition that realizes a transcription step from DNA to mRNA prior to the above translation step in the absence of cells.
[0091] Cell-free protein synthesis systems are well known to those skilled in the art. Cell-free protein synthesis systems derived from prokaryotes such as Escherichia coli and eukaryotes such as wheat, rabbits, insects, and the like are commercially available to those skilled in the art, and can be prepared appropriately by those skilled in the art. Various synthesis systems such as systems derived from Escherichia coli, from wheat germ, from rabbit reticulocytes, and from cultured insect cells can be given as examples of cell-free protein synthesis systems. In addition, a cell-free synthesis system can also be constructed as an artificial synthesis system by individually combining the elements of a transcription system and a translation system; such systems are also available commercially. The cell-free protein synthesis systems used in the various embodiments of this disclosure are preferably derived from eukaryotes, more preferably derived from wheat germ.
[0092] Representative and nonlimiting concrete examples of this disclosure are explained in detail below with reference to the drawings as appropriate. This detailed explanation is intended simply to show those skilled in the art the details for carrying out preferred examples of the present invention; it is not intended to limit the scope of the disclosure. Additional features and inventions also disclosed below can be used separately from or together with other features or inventions to provide a translation enhancer for use in an improved cell-free protein synthesis system, and the use thereof.
[0093] In addition, the features and steps disclosed by the following detailed explanation are not essential for carrying out this disclosure in the broadest sense, but are described only to explain representative concrete examples of this disclosure. Furthermore, the various features of the representative concrete examples described above and below and the various features of what is described in the independent and dependent claims do not have to be combined as in the concrete examples described here or in the order listed in providing additional and useful embodiments of this disclosure.
[0094] All of the features described in this specification and/or in the claims are intended to be disclosed individually and independently of each other as an initial disclosure of the application and a limitation to the specific items claimed, separate from the constitution of features described in the examples and/or claims. Furthermore, all of the numerical ranges and descriptions relating to groups or populations have the intention of disclosing intermediate configurations thereof as an initial disclosure of the application and a limitation to the specific items claimed.
[0095] Various forms of this disclosure are explained below with reference to the drawings as appropriate.
[0096] (Translation Enhancer)
[0097] The translation enhancer of this disclosure (also referred to as this enhancer hereinafter) is an agent to be used in a cell-free protein synthesis system (also referred to simply as synthesis system hereinafter). As shown in
[0098] For this enhancer, the desired protein to be synthesized is preferably a fusion protein provided with a peptide heterologous to an arbitrary protein, such as a protein tag at the C end of the arbitrary protein. This is because, in the past, such a protein required a very complex procedure associated with cloning and the like before it could be used in the synthesis system, and high-throughput synthesis was difficult. Similarly, for this enhancer, the desired protein to be synthesized is preferably a fusion protein provided with a heterologous peptide such as a protein tag at the N end of the arbitrary protein.
[0099] The heterologous peptide of the protein tag that can be provided at the C end and/or N end of the arbitrary protein is not particularly restricted. Examples of this protein tag include, but are not limited to, a His tag, GST tag, MBP tag, myc tag, FLAG tag, and BCCP tag. In addition, examples of tags that permit visual detection include, but are not limited to, GFP (green fluorescent protein), BFP (blue fluorescent protein), CFP (cyan fluorescent protein), RFP (red fluorescent protein), YFP (yellow fluorescent protein), EGFP (enhanced green fluorescent protein), ECFP (enhanced cyan fluorescent protein), ERFP (enhanced red fluorescent protein), EYFP (enhanced yellow fluorescent protein), TMR (tetramethyl-rhodamine), luciferase, and the like.
[0100] Furthermore, the protein tag is linked directly or via a suitable linker to the N end and/or C end of the arbitrary protein.
[0101] This enhancer preferably comprises a nucleic acid no more than 200 bases in length. The base length of this enhancer is far shorter, approximately - 1/15th, in comparison to the base length of a conventional 3′ UTR. A longer 3′ UTR was said to be effective in the past from the viewpoint of improving translation template stability and improving translation efficiency in the synthesis system. The 3′ UTR length was generally at least 500 bases in length, typically from 1000 to 3000 bases in length. In contrast to this, this enhancer has a very short base length of no more than 200 bases in length, and its use as a 3′ UTR, regardless of its base sequence, makes it possible to synthesize a protein surprisingly efficiently in the synthesis system; also, such a short 3′ UTR makes it possible to easily obtain a transcription template and translation template for a fusion protein provided with a protein tag in unprecedented form and, as a result, to easily synthesize the fusion protein.
[0102] This enhancer is provided as a DNA double strand on the 3′ end side of the DNA when used in the transcription template DNA of the synthesis system. In addition, this enhancer is provided as single-stranded RNA on the 3′ end side of the mRNA when used in the translation template mRNA of the synthesis system.
[0103] As shown in
[0104] This enhancer is preferably shorter than 200 bases in length as long as a certain translation efficiency can be maintained. More preferred is at most 150 bases in length, even more preferred is at most 100 bases in length, even more preferred is at most 80 bases in length, and even more preferred is at most 60 bases in length. Even more preferred is at most 40 bases in length. Most preferred is at most 30 bases in length or 20 bases in length.
[0105] In addition, the base length of this enhancer is preferably shorter from the viewpoint of primer design when one considers that the transcription template DNA including a region that encodes the desired protein to be synthesized is obtained by a nucleic acid amplification reaction such as PCR. Considering this viewpoint, when the base length of this enhancer is at most 100 bases in length or 80 bases in length, about two or three reverse primers suffice to add this enhancer to the 3′ end of the coding region of the transcription template DNA. Furthermore, when the base length of this enhancer is at most 60 bases in length or 40 bases in length, about one or two reverse primers suffice to add it to the 3′ end of the coding region. Moreover, when the base length of this enhancer is at most 30 bases in length or 20 bases in length, in the same way, about one reverse primer suffices.
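The rough correspondence between enhancer length and the number of reverse primers described above can be summarized as a small lookup. The following Python sketch is illustrative only: the function name approx_reverse_primers is hypothetical, and the thresholds are simply the approximate figures stated in the text (at most 30, at most 60, and at most 100 bases), not exact design rules.

```python
from typing import Tuple


def approx_reverse_primers(enhancer_len: int) -> Tuple[int, int]:
    """Return the approximate (fewest, most) reverse primers for an enhancer length.

    The cut-offs mirror the approximate figures given in the text; they are
    assumptions for illustration, not precise primer-design limits.
    """
    if enhancer_len <= 0:
        raise ValueError("enhancer length must be positive")
    if enhancer_len <= 30:
        return (1, 1)   # about one reverse primer suffices
    if enhancer_len <= 60:
        return (1, 2)   # about one or two reverse primers
    if enhancer_len <= 100:
        return (2, 3)   # about two or three reverse primers
    raise ValueError("the text gives no estimate above 100 bases")
```

For example, a 40-base enhancer falls in the range for which about one or two reverse primers are said to suffice.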
[0106] In addition, the base length of this enhancer may be at most 15 bases in length, even at most 10 bases in length.
[0107] The lower limit of the base length of this enhancer is not particularly restricted, but at least 5 bases in length is preferred. This is because there is a possibility that the intended coding region will not be translated when the length is less than 5 bases. More preferred is at least 10 bases in length.
[0108] A base sequence from the first base of the 5′ end of the base sequence (1200 bases) represented by SEQ ID NO: 1 to a base corresponding to a predetermined base length can be given as an example of a preferred embodiment of this enhancer. This enhancer preferably uses a base sequence of from 5 to 400 bases in length, more preferably from 5 to 200 bases in length, from the 5′ end of a base sequence represented by SEQ ID NO: 1 as a 3′ UTR. More preferred is from 5 to at most 100 bases in length, even more preferred is from 5 to at most 80 bases in length, even more preferred is from 5 to at most 60 bases in length, and even more preferred is from 5 to at most 40 bases in length. The 3′ UTR of each of these embodiments more preferably comprises only a nucleotide represented by a base sequence of a predetermined base length from the 5′ side of the base sequence represented by SEQ ID NO: 1.
[0109] This enhancer can also use a nucleic acid having 3′ UTR activity that is a base sequence in which from one to several bases have been substituted, added, inserted, or deleted in a base sequence (SEQ ID NO: 6) of 40 bases in length from the 5′ end of the base sequence represented by SEQ ID NO: 1. For example, consecutive A on the 5′ side are preferably at least 8, but may be at most 20, in the base sequence represented by SEQ ID NO: 1. Preferred is from 10 to 16.
[0110] A nucleic acid having 3′ UTR activity that is a base sequence having at least 85% identity with the base sequence represented by SEQ ID NO: 6 can also be used. The identity to the base sequence represented by SEQ ID NO: 6 is more preferably at least 90%, even more preferably at least 95%, even more preferably at least 96%, even more preferably at least 97%, even more preferably at least 98%, and even more preferably at least 99%.
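As a rough illustration of an identity threshold such as 85%, the following Python sketch computes percent identity over two sequences that have already been aligned (gaps written as "-"). This is a simplified stand-in and not the BLAST procedure referred to elsewhere in this specification. The function name and the example sequences are hypothetical; the sequences are not SEQ ID NO: 6.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Assumption: both strings have equal length and use '-' for gaps.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("alignment has no usable columns")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 85.0) -> bool:
    """True when the percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With a hypothetical 20-base reference and a variant differing at two positions, the identity is 90%, which would satisfy an 85% threshold but not a 95% threshold.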
[0111] The base sequence represented by SEQ ID NO: 6 has a base sequence that can take on a stem-loop structure. Therefore, it is preferable that the base sequence derived from this base sequence also retains this characteristic.
[0112] When the base sequence represented by SEQ ID NO: 6 or a nucleic acid provided with a base sequence derived from this base sequence as described above is used as a 3′ UTR, the region where the base sequence is provided is preferably at the 5′ end or as close as possible to the 5′ end of the 3′ UTR. As long as a base sequence represented by SEQ ID NO: 6 or a sequence derived therefrom is provided, the base sequence on the 3′ end side is not particularly restricted, and various embodiments can be adopted.
[0113] This enhancer can have a poly A sequence of at least two consecutive A (adenine). The number of consecutive A in such a poly A sequence is not particularly restricted, but preferred is at least three consecutive A, more preferably at least five consecutive A. Preferred is at least 10 consecutive A, more preferably at least 15, and even more preferably at least 20. The upper limit, for example, is preferably at most 40 consecutive A, more preferably at most 30. Furthermore, for example, the poly A sequence may have from 5 to 30 consecutive A, or from 5 to 20 consecutive A, even from 10 to 30 consecutive A, and even from 10 to 20 consecutive A.
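The ranges of consecutive A given above can be checked mechanically. The following Python sketch is an illustration under stated assumptions: the function names are hypothetical, the default bounds of 5 and 40 are taken from one of the ranges mentioned in the text, and the input is assumed to be the sense-strand sequence of the 3′ UTR.

```python
import re


def longest_poly_a(seq: str) -> int:
    """Length of the longest run of consecutive A in the sequence (0 if none)."""
    runs = re.findall(r"A+", seq.upper())
    return max((len(run) for run in runs), default=0)


def poly_a_in_range(seq: str, low: int = 5, high: int = 40) -> bool:
    """True when the longest run of consecutive A lies within [low, high]."""
    return low <= longest_poly_a(seq) <= high
```

Other ranges named in the text, such as from 10 to 20 consecutive A, can be tested by passing different bounds.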
[0114] This enhancer may have a plurality of poly A sequences. In this case, the plurality of poly A sequences can be provided in a state in which one or several G, T, and C other than A are interposed. There are preferably at most five interposed bases (nucleotides), more preferably at most three, even more preferably at most two, and even more preferably one. The plurality of adjacent poly A sequences having these interposed bases or sequences can, as a whole, be called a poly A motif.
[0115] This enhancer can be provided with a poly A sequence in its polynucleotide, for example, in certain embodiments, at the 5′ end or on the 5′ end side of this enhancer. In addition, in certain embodiments, a poly A sequence can be provided in the middle of the 5′ end and 3′ end. Furthermore, in certain embodiments, a poly A sequence can be provided at the 3′ end or on the 3′ end side. For example, when this enhancer is added as a 3′ UTR to the coding region of transcription template DNA, a poly A sequence can be provided immediately after the stop codon, immediately at the 3′ end of the stop codon. In this case, the poly A sequence in the polynucleotide of this enhancer is provided on the 5′ end side.
[0116] When this enhancer has a poly A sequence, it may be constituted from only a poly A sequence or a poly A motif, or may have another sequence comprising arbitrary bases in addition to the poly A sequence or poly A motif. In this case, this enhancer, by including a poly A sequence, can be, as a whole, from 10 to 60 bases in length. Preferably, this enhancer can be from 10 to 50 bases in length. Furthermore, this enhancer can be from 10 to 40 bases in length. For example, this enhancer can have a base sequence of position 17 onward in the base sequence represented by SEQ ID NO: 1. For example, this enhancer may be provided with a consecutive base sequence of from several to at most 100, for example, at most 80, or for example, at most 70, or for example, at most 60, or for example, at most 50, from position 17 of such a base sequence. Having such an additional sequence in addition to the poly A sequence tends to further improve the RNA stabilization and translation enhancing performance.
[0117] Such other sequences, for example, preferably have a GC content in the whole (AGTC) of at least 50%, more preferably at least 55%, and even more preferably at least 58%.
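The GC content referred to here is the fraction of G and C among all bases (A, G, T, C) of the sequence. A minimal Python sketch of that calculation follows; the function name gc_content is hypothetical, and the input is assumed to contain only the four DNA bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among all bases of the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)
```

A sequence would meet the first preference stated above when gc_content returns at least 50.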
[0118] The suitable number of consecutive A when this enhancer includes a poly A sequence can be determined as follows. Various primer sets including reverse primers are designed so that template nucleic acids providing poly A sequences of various numbers of consecutive A (for example, from about 5 to 50, preferably about 40 at most) as a 3′ UTR can be produced. Template nucleic acids including poly A sequences of various lengths are produced by these primer sets and used in the cell-free protein synthesis system in which this enhancer is to be used. The suitable poly A length can then be decided based on the amount of protein obtained.
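The screening procedure above can be sketched as two steps: enumerate candidate poly A lengths, then select the length that gave the most protein. The following Python sketch is an assumption-laden illustration. The function names, the step size of 5, and the idea of passing measured protein amounts as a dictionary are all hypothetical choices; the measurements themselves must come from the synthesis system actually used.

```python
from typing import Dict, Iterable


def candidate_poly_a_utrs(lengths: Iterable[int] = range(5, 41, 5)) -> Dict[int, str]:
    """Map each candidate poly A length to the corresponding sense-strand 3' UTR."""
    return {n: "A" * n for n in lengths}


def best_poly_a_length(protein_amount_by_length: Dict[int, float]) -> int:
    """Return the poly A length that gave the largest measured protein amount."""
    if not protein_amount_by_length:
        raise ValueError("no measurements supplied")
    return max(protein_amount_by_length, key=protein_amount_by_length.get)
```

Each candidate sequence would be built into a reverse primer, and the resulting templates compared in the same synthesis system before calling best_poly_a_length on the measured amounts.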
[0119] 3′ UTR activity means that the intended protein can be synthesized when a nucleic acid comprising a predetermined base sequence is used as a 3′ UTR in the synthesis system. The method for evaluating the existence and magnitude of 3′ UTR activity is not particularly restricted. For example, 3′ UTR activity is activity exhibiting a translation level of at least 50%, preferably at least 60%, more preferably at least 70%, even more preferably at least 80%, even more preferably 100%, and even more preferably more than 100%, relative to the translation level by a nucleic acid comprising a base sequence represented by SEQ ID NO: 1, 6, or 26 when a predetermined nucleic acid is used as the 3′ UTR under the same conditions except that a nucleic acid comprising a base sequence represented by SEQ ID NO: 1, 6, or 26 is used as the 3′ UTR. Furthermore, a synthesis system derived from wheat germ is preferably used as the synthesis system; more preferably a synthesis system derived from wheat germ disclosed in the examples is used.
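The relative translation level used in this definition is a simple ratio of the translation level obtained with the test 3′ UTR to that obtained with the reference 3′ UTR under otherwise identical conditions. The Python sketch below illustrates the arithmetic only. The function names are hypothetical, and the inputs are assumed to be translation levels measured in the same units by whatever method is used.

```python
def relative_translation_level(test_level: float, reference_level: float) -> float:
    """Translation level of the test 3' UTR as a percentage of the reference."""
    if reference_level <= 0:
        raise ValueError("reference translation level must be positive")
    return 100.0 * test_level / reference_level


def has_utr_activity(test_level: float, reference_level: float,
                     threshold: float = 50.0) -> bool:
    """True when the relative translation level reaches the threshold (default 50%)."""
    return relative_translation_level(test_level, reference_level) >= threshold
```

The stricter preferences in the text (60%, 70%, 80%, and so on) correspond to larger values of the threshold argument.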
[0120] Identity or similarity in this specification is as known in the art, is decided by comparing sequences, and is a relationship between two or more proteins or two or more polynucleotides. Identity in the art means the degree of sequence universality between proteins or polynucleotides as determined by alignment between the protein or polynucleotide sequences or, in some cases, by alignment between a series of such sequences. Similarity means the degree of correlation by alignment between protein or polynucleotide sequences or, in some cases, between a series of partial sequences. More specifically, similarity is decided by the identity and conservancy (substitution to maintain physicochemical properties in specific amino acids or sequences in the sequence). Furthermore, similarity is referred to as similarity in the BLAST homology search results described below. The method for deciding identity and similarity is preferably a method designed for the longest alignment between the sequences compared. Methods for deciding identity and similarity are provided as programs available to the public. For example, a decision can be made using the BLAST (basic local alignment search tool) program of Altschul et al. (for example, Altschul S F, Gish W, Miller W, Myers E W, Lipman D J, J. Mol. Biol., 215: p. 403-410 (1990); Altschul S F, Madden T L, Schaffer A A, Zhang J, Zhang Z, Miller W, Lipman D J, Nucleic Acids Res., 25: p. 3389-3402 (1997)). The conditions when using software such as BLAST are not particularly restricted, but it is preferable to use the default values.
[0121] This enhancer can be obtained based on chemical or genetic engineering nucleic acid synthesis methods that are themselves known. However, as described below, this enhancer is used as an element of a template nucleic acid or primer, and is commonly obtained as a template nucleic acid or primer.
[0122] (RNA Stabilizer)
[0123] This enhancer improves the stability of mRNA and, as a result, can improve the translation efficiency to the protein. Specifically, this enhancer can also function as an RNA stabilizer to improve the stability of RNA such as mRNA. Therefore, the base sequences of embodiments in which the base sequence of various embodiments of this enhancer described above has been converted into RNA can serve as a base sequence possessed by RNA as an RNA stabilizer. When mRNA has this sequence on its 3′ side, the mRNA is stabilized, translation to the protein can be enhanced, and synthesis of cDNA by reverse transcriptase can also be enhanced. Various other useful types of RNA can also be stabilized.
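Converting the base sequence of the enhancer into RNA, as mentioned above, amounts to replacing T with U in the sense-strand DNA sequence. The following Python sketch illustrates that conversion. The function name is hypothetical, and the input is assumed to be a sense-strand DNA sequence containing only A, C, G, and T.

```python
def dna_to_rna(sense_dna: str) -> str:
    """Return the RNA sequence corresponding to a sense-strand DNA sequence."""
    seq = sense_dna.upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq.replace("T", "U")
```

A poly A sequence is unchanged by this conversion, since it contains no T.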
[0124] (Expression Vector)
[0125] The expression vector of this disclosure (simply referred to as this vector hereinafter) can be provided with this enhancer as part thereof. This enhancer is provided as double-stranded DNA in this vector. This vector can preferably be provided with this enhancer on the 3′ side of the insertion site of the coding region that encodes the amino acid sequence of a desired protein. The insertion site will be described below.
[0126] This vector can include one or more of this enhancer. When this vector includes a promoter region, this enhancer may be incorporated in the forward direction (5′→3′) on the 3′ downstream side of the promoter region, or may be incorporated in the reverse direction. The one or more of this enhancer may be the same as or different from each other. In addition, when two or more of this enhancer are provided, their directions may be the same as or different from each other.
[0127] This vector can be provided with a region that encodes a protein tag on the 5′ side of this enhancer. This makes it possible to obtain the desired protein provided with a protein tag at the C end of an arbitrary protein. In this case, the region that encodes the protein tag can be provided on the 3′ end side of the insertion site described below. In addition, this vector may also be provided with a region that encodes a protein tag on the 5′ end side of the insertion site. Furthermore, the protein tag is described below.
[0128] This vector can also be provided with a promoter region on the 5′ side of this enhancer. Examples of the promoter region include, but are not limited to, a conventionally known T7 promoter sequence, SP6 promoter sequence, T3 promoter sequence, and the like.
[0129] This vector can be provided with an insertion site of a coding region that encodes the amino acid sequence of a desired protein. General examples of the insertion site include a sequence used as a conventionally known multicloning site, a sequence for homologous recombination, or the like. The insertion site is incorporated on the 5′ side of this enhancer.
[0130] This vector need not be provided with a poly A sequence and a terminator region on the 3′ side of this enhancer. This is because this enhancer itself can enhance translation.
[0131] This vector can also be provided with conventionally known drug resistance markers to maintain stability in the host and conventionally known replication origins such as pBR322 Ori, pUC Ori, SV40 Ori, and the like for self-replication in the host.
[0132] This vector can be produced using conventionally known gene recombination technology.
[0133] Transcription template DNA can be constructed as a template nucleic acid by inserting a region that encodes a desired protein into this vector.
[0134] (Template Nucleic Acid)
[0135] The template nucleic acid of this disclosure (simply referred to as this template nucleic acid hereinafter) is one element used in the synthesis system. The template nucleic acid can be transcription template DNA or translation template mRNA. Transcription template DNA may be a circular form such as a plasmid in addition to a linear form synthesized by PCR or the like. Furthermore, in this specification, template nucleic acid means a form of DNA double strand that can be used in a cell-free protein synthesis system. When the template nucleic acid is provided with this enhancer and the sense strand has, for example, a poly A sequence as this enhancer, the antisense strand has a corresponding T sequence. In addition, when transcription template DNA and translation template mRNA are provided with this enhancer, this enhancer is provided on the 3′ end side.
[0136] This template nucleic acid can be provided with a promoter region, a coding region that encodes the amino acid sequence of a desired protein operably linked by the promoter region, and a 3′ UTR comprising the above translation enhancer. This template nucleic acid can be obtained efficiently since the 3′ UTR is at most 200 bases in length, and, as a result, the desired protein can be synthesized efficiently by the synthesis system. This template nucleic acid is also advantageous in that it does not require elements commonly used on the 3′ side of the coding region such as a terminator region, poly A signal, and the like.
[0137] A suitable, known promoter region can be used in accordance with the type of synthesis system to be used as the promoter region in this template nucleic acid.
[0138] This template nucleic acid can be provided with a coding region that encodes the amino acid sequence of a desired protein. As was already mentioned, the coding region is preferably provided with a heterologous peptide such as a protein tag on the C end and/or N end of an arbitrary protein as the abovementioned desired protein.
[0139] This template nucleic acid can be obtained by known chemical or genetic engineering techniques. However, as described below, it is preferably obtained using a gene or cDNA as a template by utilizing a nucleic acid amplification reaction such as PCR or the like. In addition, translation template mRNA can be obtained by a known translation template mRNA synthesis method used in a two-step method or the like.
[0140] (Primer Set)
[0141] The primer set of this disclosure (simply referred to as this primer set hereinafter) is one element used in the synthesis system. This primer set is used to obtain the transcription template DNA that is this template nucleic acid by a nucleic acid amplification reaction using DNA polymerase.
[0142] This primer set, as shown in
[0143] Furthermore, the primer sets shown in
[0144] The reverse primer provided with this enhancer can take on various embodiments since the configuration adopted and necessary number differ depending on the base length of this enhancer and the configuration of the 3 end of the coding region. For example, as shown in
[0145] In addition, as shown in
[0146] Also, for example, as shown in
[0147] In addition, as shown in
[0148] Furthermore, reverse primers of various embodiments can be provided suitably with a region that encodes a peptide linker for linking a protein tag or the like to an arbitrary protein.
[0149] This primer set can be provided with one or more forward primers. The forward primers, as shown in
[0150] Furthermore, the reverse primer in this primer set can include this enhancer. In this case, the reverse primer has a base sequence corresponding to the antisense strand of the template nucleic acid. Therefore, for example, when a DNA double-stranded template nucleic acid having a sense strand in which the above poly A sequence is provided in the 3 UTR is provided, the reverse primer will include a sequence complementary to the 3 UTR of the sense strand as this enhancer in part thereof. For example, when this enhancer has a poly A sequence, the reverse primer including this enhancer will include a poly T sequence.
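The relationship between a reverse primer and the sense strand described above can be illustrated with a short sketch. It takes the common 5′ portion shared by the enhancer-bearing reverse primers listed in the Examples and computes its reverse complement. The function name `reverse_complement` and the variable names are illustrative assumptions and are not part of this disclosure.

```python
# Illustrative sketch: a reverse primer is written as the antisense strand,
# so its reverse complement gives the sense-strand sequence it encodes.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Common 5' portion of the enhancer-bearing reverse primers in the Examples:
# a 24-base segment followed by a run of 12 T.
primer_tail = "CCTTATGGCCGGATCCAAGAGCTC" + "T" * 12

# On the sense strand the run of T appears as a run of A.
sense_tail = reverse_complement(primer_tail)
print(sense_tail)
```

The run of T in the primer therefore corresponds to a run of A at the 5′ side of the sense-strand tail, consistent with the statement that a reverse primer including a poly A enhancer will include a poly T sequence.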
[0151] This primer set can generally be obtained by chemical nucleic acid synthesis methods. This primer set explained above can serve as an element of a kit for a synthesis system of this disclosure described below.
[0152] (Kit)
[0153] The cell-free protein synthesis kit in this disclosure (simply referred to as this kit hereinafter) can be provided with this primer set and a composition for cell-free protein synthesis. Furthermore, this kit can be provided with nucleic acid amplification reagents such as DNA polymerase, various nucleotides, and various reagents such as buffers for carrying out a nucleic acid amplification reaction by PCR using this primer set. Furthermore, this kit can also be provided with solid-phase carriers of various forms such as an array form to hold a plurality of transcription template DNAs synthesized by this primer set. Examples of solid-phase carriers include, but are not limited to, glass well plates and the like.
[0154] The composition for cell-free protein synthesis (simply referred to as this composition hereinafter) is a composition of elements necessary for protein synthesis derived from various cells or configured artificially based on a two-step method or one-step method. Such compositions are commonly known to those skilled in the art, can be prepared as needed by those skilled in the art, and are also commercially available.
[0155] Examples of such compositions include, but are not limited to, extracts and extract liquids obtained from conventionally known sources such as Escherichia coli, germs of plant seeds (for example, wheat, barley, rice, corn, and other such plants of the Gramineae family, as well as spinach), rabbit reticulocytes, and the like. Commercial products of these can be used, and they can also be prepared by methods that are themselves already known; specifically, in the case of E. coli extract liquid, according to the method described in Zubay G., Ann. Rev. Genet., Vol. 7, p. 267-287 (1973), and the like. Examples of commercial cell extract liquids for protein synthesis include E. coli S30 extract for linear templates (manufactured by Promega Inc.) and the like among those derived from E. coli; rabbit reticulocyte lysate systems (manufactured by Promega Inc.) and the like among those derived from rabbit reticulocytes; and wheat germ extract (manufactured by Promega Inc.), PROTEIOS (manufactured by Toyobo Co., Ltd.), and the like among those derived from wheat germ.
[0156] In addition, this composition may include extracts derived from arthropods such as silkworms, extracts derived from mammalian cultured cells, and the like. An extract derived from silkworm tissue is preferably produced by the method described in Japanese Laid-open Patent 2003-235598, and an extract liquid derived from cultured cells is preferably produced by the method described in Japanese Laid-open Patent 2004-215651.
[0157] The composition provided in this kit can, for example, be a composition derived from wheat germ.
[0158] This kit can be provided with a composition containing one or more of various conventionally known components necessary respectively in a translation system or transcription/translation system, in addition to these extract liquids or extracts. Examples include, but are not limited to, nucleic acid-degrading enzyme inhibitors necessary for protein synthesis, various ions, substrates, phosphotransferase, energy sources, and other such additives for various protein synthesis reactions, as well as RNA polymerase, adenosine triphosphate, guanosine triphosphate, and other such nucleotides, a transcription template or translation template for encoding the protein, and, if desired, a stabilizer containing at least one component selected from inositol, trehalose, mannitol, and the like. Examples of additives for a protein synthesis reaction include amino acids that serve as a substrate, energy sources, potassium salts, magnesium salts, and other various ion sources, buffers, an ATP regeneration system (phosphotransferase), nucleic acid-degrading enzyme inhibitors, tRNA, DTT and other such reducing agents, polyethylene glycol, 3′,5′-cAMP, folates, antimicrobials, and the like. Creatine phosphate and creatine kinase may also be included. Furthermore, the concentration at which each of these components is added can be decided by one skilled in the art based on known combination ratios.
[0159] The various components necessary to this translation system or transcription/translation system may be prepared separately from the extract or extract liquid, and some or all may be included in the extract or extract liquid in advance.
[0160] Furthermore, this kit may be provided with this vector in place of this primer set or together with this primer set.
[0161] (Method for Producing Transcription Template DNA)
[0162] The method for producing transcription template DNA for the cell-free protein synthesis system of this disclosure can be provided with a step for synthesizing transcription template DNA by carrying out a nucleic acid amplification reaction on DNA including a coding region of the desired protein using this primer set. Transcription template DNA is to be obtained as a PCR product by carrying out a nucleic acid amplification reaction using suitable conventionally known PCR amplification reaction conditions on DNA including a coding region that encodes the amino acid sequence of the desired protein using this primer set. Furthermore, when this primer set includes two or more forward primers and/or two or more reverse primers, the intended transcription template DNA is obtained as an amplification product by utilizing two or more amplification reaction conditions (especially temperature cycle and the like) as needed.
[0163] Furthermore, transcription template DNA can also be obtained using this vector. Specifically, a template nucleic acid can be obtained by inserting DNA that includes at least a coding region that encodes the amino acid sequence of this protein into this vector. A vector produced in this way can itself be used as transcription template DNA, and a DNA fragment corresponding to the transcription template DNA can also be cut from this vector and used.
[0164] The transcription template DNA may be used in the synthesis system, for example, as a PCR reaction solution (specifically, without purifying the transcription template DNA), or may be used in the synthesis system after suitable purification.
[0165] (Method for Producing Translation Template mRNA)
[0166] The method for producing a translation template for the cell-free protein synthesis system of this disclosure can be provided with a step for synthesizing translation template mRNA using transcription template DNA in the absence of cells and in the presence of elements for transcribing the transcription template DNA into mRNA. This method, for example, can be carried out in vitro in the presence of at least the various components necessary for the transcription reaction using the composition for cell-free protein synthesis already described. More specifically, translation template mRNA can be obtained by incubating a PCR reaction solution including transcription template DNA, or transcription template DNA derived from this vector, in the presence of a composition including the components necessary for the transcription reaction, such as an RNA polymerase compatible with the promoter region provided in the transcription template DNA and substrates for RNA synthesis (four ribonucleoside triphosphates), for example, for a suitable length of time at approximately 20-60° C., preferably approximately 30-42° C.
[0167] This method can be carried out as a synthesis system as part of a transcription/translation system, or can be carried out as a step preceding the application of the translation template mRNA to a translation system. The reaction solution of translation template mRNA obtained in this way can be used in the translation system.
[0168] (Method for Producing Protein)
[0169] The method for producing a protein of this disclosure can be provided with a step for synthesizing a protein using translation template mRNA in the absence of cells and in the presence of elements for translating the translation template mRNA into the protein. This method can also be provided with a step for synthesizing translation template mRNA using transcription template DNA in the absence of cells and in the presence of elements for transcribing the transcription template DNA into mRNA. Furthermore, this method can also be provided with a step for synthesizing the transcription template DNA by carrying out a nucleic acid amplification reaction on DNA that includes a coding region of the desired protein. This method makes it possible to conduct protein synthesis efficiently because translation template mRNA and transcription template DNA including this enhancer are used.
[0170] The production of a protein in this method may be realized using a translation system or may be realized using a transcription/translation system.
[0171] This method, for example, can be conducted in vitro using the composition for cell-free protein synthesis already described in the presence of at least the various components necessary for a translation reaction. More specifically, the method can be carried out on translation template mRNA by incubating for a suitable length of time at a temperature suited to the translation reaction in the presence of the necessary or appropriate level of amino acids that serve as a substrate, energy sources, various ions, buffers, an ATP regeneration system, various degrading enzyme inhibitors, tRNA, reducing agents, polyethylene glycol, 3′,5′-cAMP, folates, antimicrobials, and the like.
[0172] (Transcription Template Array and Use Thereof)
[0173] The array of transcription template DNA of this disclosure can hold, as transcription template DNA, a plurality of these template nucleic acids corresponding to a plurality of desired proteins. This array makes it possible to synthesize a plurality of desired proteins at once by applying the synthesis system to the templates held on the array. This array can be used preferably for various analyses involving proteins, such as protein-protein interaction analysis, protein-DNA interaction analysis, protein post-translational modification analysis, protein structure analysis, synthesis of antigenic proteins, and the like.
[0174] Examples of this array include, but are not limited to, 96-well plates that can hold transcription template DNA in each well, and plates, disks, and strips on which transcription template DNA is immobilized on a suitable solid-phase carrier, and the like. Furthermore, examples of solid-phase carriers include nonporous materials and porous materials comprising glass materials, ceramic materials, plastic materials, and the like.
EXAMPLES
[0175] The present invention is explained concretely below with reference to examples relating to this disclosure. However, the following examples explain this disclosure and do not limit this disclosure.
Example 1
[0176] (Protein Synthesis Evaluation Against 3′ UTR Lengths of 1200-40 bp)
[0177] This example is outlined in
[0178] Proteins were synthesized by a cell-free protein synthesis reaction using DNA fragments provided with 3′ UTRs of various lengths. The protocol appears below.
[0179] (1) Primer Sequences for Obtaining 3′ UTRs of Different Lengths
[0180] The primer sets for the first PCR and the second PCR on the plasmid vector appear below. Furthermore, separate forward primers, as shown below, were used in the first and second PCR, whereas the same reverse primer was used in both.
TABLE-US-00001 TABLE 1
NAME | 3′ UTR LENGTH | SEQUENCE | SEQ. ID.
Forward primer 1 | | CCAGCAGGGAGGTACTATGGACGGTTCTTCGTTT | 7
Forward primer 2 | | CCCGCGAAATTAATACGACTCACTATAG | 8
 | | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
Reverse primer 1, 2 | 1200 bp | CTTTTTGATAATCTCATGACC | 10
 | 600 bp | AGTCCTGTCGGGTTTCG | 11
 | 300 bp | ATTCATTAATGCAGCTGGC | 12
 | 100 bp | AATTAACCCTCACTAAAGG | 13
 | 50 bp | AGGATCAGGCCCTTATG | 14
 | 40 bp | CCTTATGGCCGGATCC | 15
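The two-stage PCR relies on each second-round forward primer overlapping the primer that precedes it. As a minimal sketch, the overlaps among the forward primers of Table 1 can be checked as follows. The function `overlap_length` and the variable names are hypothetical helpers introduced only for illustration; the sequences are SEQ ID NOS: 7 to 9 as listed in Table 1.

```python
# Illustrative sketch: find the longest 3' end of one primer that matches
# the 5' end of the next primer, i.e. the region through which the primers
# can extend on one another during the second PCR.
def overlap_length(upstream: str, downstream: str) -> int:
    """Length of the longest suffix of `upstream` that is a prefix of `downstream`."""
    for n in range(min(len(upstream), len(downstream)), 0, -1):
        if upstream.endswith(downstream[:n]):
            return n
    return 0


SEQ7 = "CCAGCAGGGAGGTACTATGGACGGTTCTTCGTTT"  # forward primer 1 (first PCR)
SEQ8 = "CCCGCGAAATTAATACGACTCACTATAG"        # forward primer 2, upstream primer
SEQ9 = (                                     # forward primer 2, long primer
    "CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAAC"
    "TTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG"
)

print(overlap_length(SEQ8, SEQ9))  # overlap between SEQ ID NO: 8 and 9
print(overlap_length(SEQ9, SEQ7))  # overlap between SEQ ID NO: 9 and 7
```

Under these sequences, SEQ ID NO: 8 shares its 3′ end with the 5′ end of SEQ ID NO: 9, and SEQ ID NO: 9 shares its 3′ end with the 5′ end of SEQ ID NO: 7, which is what allows the promoter-side elements to be appended in the second PCR.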
[0181] (2) PCR and Protein Synthesis
[0182] PCR to produce transcription template DNA was conducted under the following conditions using KOD-Plus-Neo (manufactured by Toyobo Co., Ltd.) as the DNA polymerase.
TABLE-US-00002 TABLE 2
Reagents | Amount
10X PCR buffer | 5 µl
2 mM dNTPs | 5 µl
25 mM MgSO4 | 3 µl
10 µM 1st forward primer | 1 µl
10 µM 1st reverse primer | 1 µl
Plasmid | 1.25 ng
KOD | 1 µl
Sterile water | X µl
Total | 50 µl

Temp. | Time | Cycle
94° C. | 5 min | 1
98° C. | 10 sec | 30
55° C. | 30 sec | (same 30 cycles)
68° C. | 3 min (1 min/kb) | (same 30 cycles)
72° C. | 2 min | 1
20° C. | |
[0183] In addition, the second PCR was carried out using the reaction solution composition and reaction conditions shown below in Tables 3 and 4. In addition, 1 µL of the first PCR product was used.
TABLE-US-00003 TABLE 3
Reagents | Amount
10X PCR buffer | 5 µl
2 mM dNTPs | 5 µl
25 mM MgSO4 | 3 µl
10 µM SpT7u primer | 1 µl
100 nM SpT7dFLAG/His/HA/BAP | 1 µl
10 µM Reverse_U | 1 µl
100 nM deReverse | 1 µl
1st PCR product | 1-5 µl*
KOD | 1 µl
Sterile water | X µl
Total | 50 µl
TABLE-US-00004 TABLE 4
Temp. | Time | Cycle
98° C. | 1 min | 1
98° C. | 10 sec | 10
60° C. | 1 min | (same 10 cycles)
68° C. | 3 min (1 min/kb) | (same 10 cycles)
98° C. | 10 sec | 30
60° C. | 15 sec | (same 30 cycles)
68° C. | 3 min (1 min/kb) | (same 30 cycles)
72° C. | 2 min | 1
20° C. | |
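For planning purposes, the second PCR program of Table 4 can be written down as data and its programmed hold time summed. The following is only a sketch: the structure `SECOND_PCR` and the function `programmed_seconds` are illustrative assumptions, ramp times between temperatures and the final 20° C. hold are ignored, and the 68° C. step is fixed at the 3 min shown in Table 4.

```python
# Illustrative sketch: each stage is (list of (temperature in deg C, seconds), cycles).
SECOND_PCR = [
    ([(98, 60)], 1),                         # initial denaturation
    ([(98, 10), (60, 60), (68, 180)], 10),   # first block of cycles
    ([(98, 10), (60, 15), (68, 180)], 30),   # second block of cycles
    ([(72, 120)], 1),                        # final extension
]


def programmed_seconds(program) -> int:
    """Sum of all programmed hold times, ignoring ramping between steps."""
    return sum(cycles * sum(sec for _, sec in steps) for steps, cycles in program)


total = programmed_seconds(SECOND_PCR)
print(f"{total} s, about {total / 60:.0f} min excluding ramp times")
```

If a shorter amplicon is used, the 68° C. entries would be reduced following the 1 min/kb guide given in the table.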
[0184]
[0185] (Transcription Reaction)
[0186] Next, translation template mRNA was produced using the transcription template DNA produced. The transcription reaction was conducted for three hours at 37° C. using the following reaction solution of a MEGAscript T7 Transcription Kit (Invitrogen) and 2.5 µL of the second PCR reaction solution (containing transcription template DNA) previously produced.
TABLE-US-00005 TABLE 5
Reagents (one reaction) | Amount
10X Transcription buffer (TB) | 2.5 µl
25 mM NTPs | 2.5 µl
RNase Inhibitor (RI) | 0.06 µl
0.1 M DTT | 1.25 µl
1X T7 RNA Polymerase | 1 µl
2nd PCR product | 2.5 µl
RNase-free water (DEPC) | 15.19 µl
Total | 25 µl
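The water volume in Table 5 follows from subtracting the other components from the 25 µl total. A minimal sketch of that arithmetic, together with scaling to several reactions, is given below. The names `TRANSCRIPTION_MIX_UL`, `water_volume`, and `scale_mix` are hypothetical and introduced only for illustration; `Decimal` is used so that volumes such as 0.06 µl are summed exactly.

```python
from decimal import Decimal

# Per-reaction volumes in microliters, taken from Table 5 (water excluded).
TRANSCRIPTION_MIX_UL = {
    "10X transcription buffer": Decimal("2.5"),
    "25 mM NTPs": Decimal("2.5"),
    "RNase inhibitor": Decimal("0.06"),
    "0.1 M DTT": Decimal("1.25"),
    "T7 RNA polymerase": Decimal("1"),
    "2nd PCR product": Decimal("2.5"),
}


def water_volume(components: dict, total_ul: Decimal) -> Decimal:
    """Volume of water needed to bring the listed components to the total volume."""
    water = total_ul - sum(components.values())
    if water < 0:
        raise ValueError("components exceed the total reaction volume")
    return water


def scale_mix(components: dict, reactions: int) -> dict:
    """Multiply every component volume by the number of reactions."""
    return {name: vol * reactions for name, vol in components.items()}


print(water_volume(TRANSCRIPTION_MIX_UL, Decimal("25")))
print(scale_mix(TRANSCRIPTION_MIX_UL, 4))
```

The computed water volume for one reaction agrees with the 15.19 µl listed in Table 5.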
[0187] After the transcription reaction had been completed, 1 µL of transcription product was confirmed using agarose gel electrophoresis. The results are shown on the left side of
[0188] (Translation Reaction)
[0189] Next, the translation reaction was conducted for approximately 10 hours by incubating at 16° C. using a translation reaction solution of the following composition. A solution containing every component of the following composition except the translation template mRNA was prepared first; the translation template mRNA was added after this solution had returned to room temperature, and the reaction solution was mixed by gentle pipetting so as not to create foam.
TABLE-US-00006 TABLE 6
Reagents | Amount
Wheat germ extract | 20 µl
4X Amino acid mix | 20 µl
mRNA | 70 µl
Total | 110 µl
[0190] After the reaction, the reaction solution was recovered in an Eppendorf tube and centrifuged (15,000 rpm, 10 min, 4° C.), and the supernatant was stored at -80° C. The results of Western blotting of the proteins obtained are shown in the right side of
[0191] As shown in
Example 2
[0192] (Evaluation of Three Kinds of 3′ UTR)
[0193] In this example, the translation efficiency obtained with the 3′ UTRs of two vectors, pENTR/D-TOPO and pDONR221, and with the 3′ UTR used in Example 1 was evaluated.
[0194] As shown in
TABLE-US-00007 TABLE 7
PCR | NAME | SEQUENCE | SEQ. ID.
1st | Forward (WRKY67) | CCAGCAGGGAGGTACTATGGTTTCCAACATTGAT | 16
1st | Forward (UPB1) | CCAGCAGGGAGGTACTATGGGTGTAACATTAGAA | 17
1st | A Reverse | GGGATATCAGCTGGATGGCAA | 18
1st | B Reverse (WRKY67) | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTAATCAAAAGCAGAAATGTT | 19
1st | B Reverse (UPB1) | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTAAACACAGTTAGTTTCGGT | 20
1st | C Reverse | GGGATATCAGCTGGATGGC | 21
2nd | SpT7u | CCCGCGAAATTAATACGACTCACTATAG | 8
2nd | SpT7dFLAG | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
2nd | A Reverse | AGTGACCTGTTCGTTGCAAC | 22
2nd | B Reverse_U | GGCCCCCCCTCGAAGG | 23
2nd | B deReverse | CCCTCGAAGGATCAGGCCCTTATGGCCGGATCCAA | 24
2nd | C Reverse | GACTGATAGTGACCTGTTCG | 25
[0195] Translation template mRNA was prepared from the transcription template DNA obtained, again in the same way as in Example 1, and proteins were synthesized. The results of Western blotting of the proteins synthesized are also shown in
[0196] As shown in
Example 3
[0197] (N End Tag Addition by a 3′ UTR Approximately 40 Bases in Length)
[0198] In this example, proteins having a FLAG tag added to the N end of each of six proteins (MKK1, MKK2, MKK4, MKK5, MKK6, MKK9) were synthesized using, as a translation enhancer, the 3′ UTR 37 bases in length (SEQ ID NO: 26) evaluated in Example 2, by preparing transcription template DNA based on the primer design shown in
[0199] First and second PCR were conducted in the same way as in Example 1 on pDONR221 vectors into which the cDNA that encodes each of these proteins had been cloned, using the following primer sets, and transcription template DNA was prepared.
TABLE-US-00008 TABLE 8
PCR | PRIMER | NAME | SEQUENCE | SEQ. ID.
1st | Forward primer | MKK1 | CCAGCAGGGAGGTACTATGAACAGAGGAAGCTTA | 27
1st | Forward primer | MKK2 | CCAGCAGGGAGGTACTATGAAGAAAGGTGGATTC | 28
1st | Forward primer | MKK4 | CCAGCAGGGAGGTACTATGAGACCGATTCAATCG | 29
1st | Forward primer | MKK5 | CCAGCAGGGAGGTACTATGAAACCGATTCAATCT | 30
1st | Forward primer | MKK6 | CCAGCAGGGAGGTACTATGGTGAAGATCAAATCG | 31
1st | Forward primer | MKK9 | CCAGCAGGGAGGTACTATGGCTTTAGTACGTGAA | 32
1st | Reverse primer | MKK1 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTAGTTAGCAAGTGGGGGAAT | 33
1st | Reverse primer | MKK2 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTACACGGAGAACGTACCAGA | 34
1st | Reverse primer | MKK4 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTATGTGGTTGGAGAAGAAGA | 35
1st | Reverse primer | MKK5 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTAAGAGGCAGAAGGAAGAGG | 36
1st | Reverse primer | MKK6 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTATCTAAGGTAGTTAACAGG | 37
1st | Reverse primer | MKK9 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTAAAGATCTTCCCGGAGAAA | 38
2nd | Forward primer | SpT7u | CCCGCGAAATTAATACGACTCACTATAG | 8
2nd | Forward primer | FLAG-tag | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
2nd | Reverse primer | Reverse_U | GGCCCCCCCTCGAAGG | 23
2nd | Reverse primer | deReverse | CCCTCGAAGGATCAGGCCCTTATGGCCGGATCCAA | 24
[0200] As shown in
[0201] Based on the above, it was understood that fusion proteins in which a FLAG tag is provided at the N end of each protein could be obtained even when using a 3′ UTR approximately 40 bases in length.
Example 4
[0202] (C End Tag Addition by a 3′ UTR Approximately 40 Bases in Length)
[0203] In this example, proteins having a FLAG tag added to the C end of each of six proteins (ERF1, WRKY18, TGA2, NPR1, MYC2, phyB) were synthesized using, as a translation enhancer, the 3′ UTR 37 bases in length (SEQ ID NO: 26) evaluated in Example 2, by preparing transcription template DNA based on the primer design shown in
TABLE-US-00009 TABLE 9
Reagents | Amount
10X PCR buffer | 5 µl
2 mM dNTPs | 5 µl
25 mM MgSO4 | 3 µl
10 µM T7Eu primer | 1 µl
100 nM FLAG/His/HA/BAP-deReverse | 1 µl
10 µM Reverse_U | 1 µl
1st PCR product | 1-5 µl*
KOD | 1 µl
Sterile water | X µl
Total | 50 µl
[0204] First and second PCR were conducted in the same way as in Example 1 on pDONR221 vectors into which the cDNA that encodes each of these proteins had been cloned, using the following primer sets, and transcription template DNA was prepared.
TABLE-US-00010 TABLE 10
PCR | PRIMER | NAME | SEQUENCE | SEQ. ID.
1st | Forward primer | ERF1 | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGATCCATTTTTAATT | 39
1st | Forward primer | WRKY18 | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACGGTTCTTCGTTT | 40
1st | Forward primer | TGA2 | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGCTGATACCAGTCCG | 41
1st | Forward primer | NPR1 | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACACCACCATTGAT | 42
1st | Forward primer | MYC2 | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGACTGATTACCGGCTA | 43
1st | Forward primer | phyB | CACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGTTTCCGGAGTCGGG | 44
1st | Reverse primer | ERF1 | AGTACCTCCCTGCTGGAGACCCCAAGTCCCACTATTTTC | 45
1st | Reverse primer | WRKY18 | AGTACCTCCCTGCTGGAGACCTGTTCTAGATTGCTCCAT | 46
1st | Reverse primer | TGA2 | AGTACCTCCCTGCTGGAGACCCTCTCTGGGTCGAGCAAG | 47
1st | Reverse primer | NPR1 | AGTACCTCCCTGCTGGAGACCCCGACGACGATGAGAGAG | 48
1st | Reverse primer | MYC2 | AGTACCTCCCTGCTGGAGACCACCGATTTTTGAAATCAA | 49
1st | Reverse primer | phyB | AGTACCTCCCTGCTGGAGACCATATGGCATCATCAGCAT | 50
2nd | Forward primer | T7Eu-primer | CCCGCGAAATTAATACGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCC | 51
2nd | Reverse primer | Reverse_U | GGCCCCCCCTCGAAGG | 23
2nd | Reverse primer | FLAG-deReverse | CCCTCGAAGGATCAGGCCCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTACTTGTCATCGTCATCCTTGTAGTCAGTACCTCCCTGCTGG | 52
[0205] As shown in
[0206] Based on the above, it was understood that fusion proteins in which a FLAG tag is provided at the C end of each protein could be obtained even when using a 3′ UTR approximately 40 bases in length.
Example 5
[0207] (Example of Truncation)
[0208] In this example, a protein (WRKY18) was truncated as shown in
[0209] First and second PCR were conducted in the same way as in Example 1 on a pDONR221 vector into which the cDNA that encodes this protein had been cloned, using the following primer sets, and transcription template DNA was prepared.
TABLE-US-00011 TABLE 11
PCR | PRIMER | NAME | SEQUENCE | SEQ. ID.
1st | Forward primer | 1 | CCAGCAGGGAGGTACTATGGAGGGTTCTTCGTTT | 53
1st | Forward primer | 2 | CCAGCAGGGAGGTACTATGGACGGTTCTTCGTTT | 54
1st | Forward primer | 3 | CCAGCAGGGAGGTACTACTGAAACATCGGACACA | 55
1st | Forward primer | 4 | CCAGCAGGGAGGTACTATGAATGCTTCTGAAGGG | 56
1st | Reverse primer | 1 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTATGTTCTAGATTGCTCCAT | 57
1st | Reverse primer | 2 | CCTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTACAAGCTTGTGTCCGA | 58
1st | Reverse primer | 3 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTATGTAGCATCCCCTTC | 59
1st | Reverse primer | 4 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTATGTTCTAGATTGCTCCAT | 60
2nd | Forward primer | SpT7u | CCCGCGAAATTAATACGACTCACTATAG | 8
2nd | Forward primer | SpT7dFLAG | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
2nd | Reverse primer | Reverse_U | GGCCCCCCCTCGAAGG | 23
2nd | Reverse primer | deReverse | CCCTCGAAGGATCAGGCCCTTATGGCCGGATCCAA | 24
[0210] As shown in
[0211] Based on the above, it was understood that truncated fragments of each type could be obtained as fusion proteins in which a FLAG tag is provided at the N end of each fragment even when using a 3′ UTR approximately 40 bases in length.
Example 6
[0212] (Evaluation of Enhancer Base Length)
[0213] In this example, the sequence and base length that can be used as a 3′ UTR were evaluated. Poly A sequences with different numbers of consecutive A were added as a 3′ UTR after the stop codon of base sequences that encode the Arabidopsis transcriptional cofactor NPR1 and the transcription factor AREB2 using the following primers, and the amount of each protein synthesized was evaluated.
[0214] Specifically, primer sets including a reverse primer for obtaining a template nucleic acid having a poly A sequence of 5, 10, 20, 30, and 40 consecutive A added immediately after the stop codon of the base sequence that encodes each protein were synthesized. Furthermore, forward primers were designed to be able to add a FLAG protein to the N end of each protein. First and second PCR were conducted on a vector pDONR221 cloned to include the coding region of these transcription factors and the like in the same way as in Example 1 using these primer sets, transcription template DNA and mRNA were prepared, and proteins were finally obtained by a cell-free protein synthesis system.
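TABLE-US-00012 TABLE 12
PCR | PRIMER | NAME | SEQUENCE | SEQ. ID.
1st | Forward primer | NPR1 | CCAGCAGGGAGGTACTATGGACACCACCATTGAT | 61
1st | Forward primer | AREB2 | CCAGCAGGGAGGTACTATGGGAACTCACATCAAT | 62
1st | Reverse primer | NPR1-A5 | TTTTTTTACCGACGACGATGAGAGAG | 63
1st | Reverse primer | NPR1-A10 | TTTTTTTTTTTTACCGACGACGATGAGAGAG | 64
1st | Reverse primer | NPR1-A20 | TTTTTTTTTTTTTTTTTTTTTTACCGACGACGATGAGAGAG | 65
1st | Reverse primer | NPR1-A30 | TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACCGACGACGATGAGAGAG | 66
1st | Reverse primer | NPR1-A40 | TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACCGACGACGATGAGAGAG | 67
1st | Reverse primer | AREB2-A5 | TTTTTTTACCATGGTCCGGTTAATGT | 68
1st | Reverse primer | AREB2-A10 | TTTTTTTTTTTTACCATGGTCCGGTTAATGT | 69
1st | Reverse primer | AREB2-A20 | TTTTTTTTTTTTTTTTTTTTTTACCATGGTCCGGTTAATGT | 70
1st | Reverse primer | AREB2-A30 | TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACCATGGTCCGGTTAATGT | 71
1st | Reverse primer | AREB2-A40 | TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACCATGGTCCGGTTAATGT | 72
2nd | Forward primer | SpT7u | CCCGCGAAATTAATACGACTCACTATAG | 8
2nd | Forward primer | SpT7dFLAG | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
2nd | Reverse primer | NPR1-A5 | Same primer as used in the 1st PCR | 63
2nd | Reverse primer | NPR1-A10 | Same primer as used in the 1st PCR | 64
2nd | Reverse primer | NPR1-A20 | Same primer as used in the 1st PCR | 65
2nd | Reverse primer | NPR1-A30 | Same primer as used in the 1st PCR | 66
2nd | Reverse primer | NPR1-A40 | Same primer as used in the 1st PCR | 67
2nd | Reverse primer | AREB2-A5 | Same primer as used in the 1st PCR | 68
2nd | Reverse primer | AREB2-A10 | Same primer as used in the 1st PCR | 69
2nd | Reverse primer | AREB2-A20 | Same primer as used in the 1st PCR | 70
2nd | Reverse primer | AREB2-A30 | Same primer as used in the 1st PCR | 71
2nd | Reverse primer | AREB2-A40 | Same primer as used in the 1st PCR | 72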
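The reverse primers of this example follow a regular pattern: the reverse complement of the 3′ end of the coding region, a stop codon, and the desired number of consecutive A. A minimal sketch of that construction is given below. The function names are hypothetical, and the TAA stop codon and the 18-base NPR1 coding-region end are inferred here from the NPR1-A5 primer (SEQ ID NO: 63) rather than stated in the text.

```python
# Illustrative sketch: build a reverse primer that appends a stop codon and
# a poly A sequence of chosen length directly after the coding region.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def poly_a_reverse_primer(cds_3prime_end: str, n_a: int, stop: str = "TAA") -> str:
    """Reverse primer encoding: coding-region end + stop codon + n_a consecutive A."""
    if n_a < 0:
        raise ValueError("n_a must be non-negative")
    sense = cds_3prime_end.upper() + stop + "A" * n_a
    return reverse_complement(sense)


# Sense-strand 3' end of the NPR1 coding region, inferred from SEQ ID NO: 63.
NPR1_END = "CTCTCTCATCGTCGTCGG"

for n in (5, 10, 20, 30, 40):
    print(n, poly_a_reverse_primer(NPR1_END, n))
```

With these assumptions, the primers generated for 5 and 10 consecutive A reproduce NPR1-A5 and NPR1-A10 as listed above; other coding regions would only require a different `cds_3prime_end`.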
[0215] As shown in
[0216] In addition, adequate protein synthesis could be assured for the AREB2 protein by a poly A sequence of from 5 to 40 consecutive A as the 3′ UTR. Good protein synthesis could be assured by a poly A sequence of 10-20 consecutive A.
[0217] Based on the above, it was understood that the mRNA is stabilized and protein can be synthesized by a cell-free protein synthesis system even with a poly A sequence of as few as five consecutive A as the 3′ UTR. In addition, it was understood that the number of consecutive A in the poly A sequence can be selected as appropriate within a range of from about 5 to 40, as needed.
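The ranges discussed in this specification (a 3′ UTR of at most 200 bases, and a poly A sequence of about 5 to 40 consecutive A) can be checked on a candidate sequence with a few lines of code. This is only a sketch: the function names and the returned dictionary are illustrative assumptions, and the input is assumed to be the sense-strand sequence that follows the stop codon.

```python
import re


def leading_poly_a(utr: str) -> int:
    """Number of consecutive A at the 5' end of the given sequence."""
    return len(re.match(r"A*", utr.upper()).group(0))


def check_utr(utr: str, min_a: int = 5, max_a: int = 40, max_len: int = 200) -> dict:
    """Report the length, the leading poly A run, and whether both are in range."""
    n_a = leading_poly_a(utr)
    return {
        "length": len(utr),
        "poly_a": n_a,
        "in_range": len(utr) <= max_len and min_a <= n_a <= max_a,
    }


# Hypothetical candidate: 10 consecutive A followed by 24 further bases.
candidate = "A" * 10 + "GAGCTCTTGGATCCGGCCATAAGG"
print(check_utr(candidate))
```

The default limits simply mirror the ranges stated in the text and can be changed through the keyword arguments.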
Example 7
[0218] (Evaluation of Enhancer Base Sequence)
[0219] In this example, the base sequence that can be used as a 3′ UTR was evaluated. Three sequences were added as a 3′ UTR after the stop codon of a base sequence that encodes the Arabidopsis transcriptional cofactor NPR1 using the following primers, and the amount of each protein synthesized was evaluated: a poly A sequence of 10 consecutive A; a sequence comprising a total of 20 bases provided with a poly A sequence of 10 consecutive A and, following the poly A, the base sequence from positions 17 to 26 of the base sequence represented by SEQ ID NO: 1; and a sequence comprising a total of 47 bases provided with a poly A sequence of 10 consecutive A and, following the poly A, the base sequence from positions 17 to 63 of the base sequence represented by SEQ ID NO: 1.
[0220] Specifically, primer sets were synthesized that include a reverse primer for obtaining a template nucleic acid having a poly A sequence of 10 consecutive A directly following the stop codon of the base sequence that encodes the protein and also having, adjacent to the poly A sequence of 10 consecutive A, a base sequence comprising the bases from position 17 to a predetermined position within the above base sequence represented by SEQ ID NO: 1. Furthermore, forward primers were designed to be able to add a FLAG tag to the N end of the protein. First and second PCR were conducted in the same way as in Example 1 on a pDONR221 vector cloned to include the coding region of this transcriptional cofactor using these primer sets, transcription template DNA and mRNA were prepared, and proteins were finally obtained by a cell-free protein synthesis system.
TABLE-US-00013 TABLE 13
PCR | PRIMER | NAME | SEQUENCE | SEQ. ID.
1st | Forward primer | NPR1 | CCAGCAGGGAGGTACTATGGACACCACCATTGAT | 61
1st | Reverse primer | NPR1-A10 | TTTTTTTTTTTTACCGACGACGATGAGAGAG | 64
1st | Reverse primer | NPR1-A10+20 | ATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTACCGACGACGATGAGAGAG | 65
1st | Reverse primer | NPR1-A10+47 | CCTTATGGCCGGATCCAAGAGCTCTTTTTTTTTTTTACCGACGACGATGAGAGAG | 72
2nd | Forward primer | SpT7u | CCCGCGAAATTAATACGACTCACTATAG | 8
2nd | Forward primer | SpT7dFLAG | CGACTCACTATAGGGCTCACCTATCTCTCTACACAAAACATTTCCCTACATACAACTTTCAACTTCCTATTATGGACTACAAGGATGACGATGACAAGCTCCAGCAGGGAGGTACTATG | 9
2nd | Reverse primer | NPR1-A10 | TTTTTTTTTTTTACCGACGACGATGAGAGAG | 64
2nd | Reverse primer | NPR1-A10+20 | ATGGCCGGATCCAAGAGCTC | 73
2nd | Reverse primer | NPR1-A10+47 | GGCCCCCCCTCGAAGGATCAGGCCCTTATG | 74
[0221] As shown in
SEQUENCE LISTING FREE TEXT
[0222] SEQ ID NOS: 1-6, 26: artificial 3′ UTR
[0223] SEQ ID NOS: 7-25, 27-74: primer