Artificial salt tolerant protein, its synthetic method and coding gene thereof and use of said coding gene

10570410 ยท 2020-02-25

Assignee

Inventors

Cpc classification

International classification

Abstract

This invention relates to an artificial salt tolerant protein NLEA with the amino acid sequence shown in SEQ ID No.1 and a synthetic method of salt tolerant protein NLEA comprising the steps of retrieving different types of LEA proteins from LEA database; making multiple sequence alignment on different types of LEA proteins to obtain conserved short peptides; selecting hydrophilic short peptides with a hydrophilicity index higher than 3.5 from conserved short peptides; arranging and splicing hydrophilic short peptides in the order of isoelectric point size from large to small, to obtain salt tolerant protein NLEA. This invention involves bioinformatics analysis by retrieving different LEA conserved amino acid sequences of LEA protein data. Physical properties are analyzed to find short peptides of high hydrophilicity, and such short peptides are arranged in the order of isoelectric point size and spliced to get a new hydrophilic amino acid sequence.

Claims

1. An artificial salt tolerant protein NLEA, possessing the amino acid sequence shown as SEQ ID NO: 1.

2. A method of making said salt tolerant protein NLEA according to claim 1, comprising the following steps: 1) retrieving different types of Late Embryogenesis Abundant (LEA) proteins from a LEA database; 2) making multiple sequence alignments on the different types of LEA proteins retrieved in step 1) to obtain conserved short peptides; 3) selecting hydrophilic short peptides with a hydrophilicity index higher than 3.5 from the conserved short peptides obtained in step 2); 4) arranging and splicing the hydrophilic short peptides obtained in step 3) in the order of isoelectric point size from large to small, to get the salt tolerant protein NLEA; wherein the amino acid sequence of said salt tolerant protein NLEA is shown as SEQ ID NO: 1; 5) synthesizing the sequence of a nlea gene of the NLEA protein by polymerase chain reaction (PCR), connecting said sequence of the nlea gene into a PYES2 vector to construct an expression vector PYES2-NLEA, in order to express the NLEA protein; wherein said sequence of the nlea gene is shown as SEQ ID NO: 2.

3. The method according to claim 2, wherein the hydrophilicity index of step 3) is calculated by using the Formula I below: hydrophilicityindex = .Math. i = 1 n f ( i ) n formula I f(i) is the hydrophilicity number of single amino acid; n=11.

4. The method according to claim 2, further comprising the step of making a clustering analysis of said conserved short peptides before selecting the hydrophilic short peptides in said step 3), to get the conserved short peptides clustering in different branches.

5. A gene coding the salt absorbent protein NLEA according to claim 1, comprising a nucleotide sequence of SEQ ID NO: 2.

6. A recombinant expression vector comprising the gene of claim 5.

7. The recombinant expression vector according to claim 6, characterized in that, said recombinant expression vector is pYES2-nlea.

8. A host comprising the recombinant expression vector according to claim 6.

9. The host according to claim 8, wherein said host comprises S. cerevisiae, E. coli or Agrobacterium.

10. A method of using the gene according to claim 5 as a selection marker in microorganism transgenic cell culture comprising the step of culturing transgenic cells or host cells under saline conditions.

11. A host comprising the recombinant expression vector according to claim 7.

Description

DESCRIPTION OF DRAWINGS

(1) FIG. 1 shows the principle of In-Fusion technology;

(2) FIG. 2 is a diagram of agarose gel electrophoresis of the vectors after double digestion in example 2;

(3) FIG. 3 shows an expression vector containing salt tolerant coding gene nlea in example 3;

(4) FIG. 4 shows the growth curve of S. cerevisiae under salt stress in example 5;

(5) FIG. 5 shows phenotype observations of S. cerevisiae with NLEA transferred and wild strains under different salt stresses in example 6.

(6) FIG. 6 shows observation on performance of wild-type Arabidopsis thaliana plant and transgenic Arabidopsis thaliana plant with nlea gene under salt stress with concentration of 100 m mol/L in example 7.

EMBODIMENTS

(7) This invention provides an artificial salt tolerant protein NLEA, possessing the amino acid sequence shown as SEQ ID NO:1.

(8) This invention provides a synthetic method of said salt tolerant protein NLEA, comprising the following steps:

(9) 1) retrieving different types of LEA proteins from LEA database;

(10) 2) making multiple sequence alignment on the different types of LEA proteins retrieved in step 1) to obtain conserved short peptides;

(11) 3) selecting hydrophilic short peptides with a hydrophilicity index higher than 3.5 from the conserved short peptides obtained in step 2);

(12) 4) arranging and splicing the hydrophilic short peptides obtained in step 3) in the order of isoelectric point size from large to small, to get salt tolerant protein NLEA.

(13) In this invention, different types of LEA proteins are retrieved from LEA database.

(14) In this invention, said LEA database is preferably Late Embryogenesis Abundant Proteins DataBase. The quantity of said different types of LEA proteins is preferably 1,643 LEA proteins sourced from different plants.

(15) After obtaining different types of LEA proteins, multiple sequence alignment of said proteins is made to obtain conserved short peptides.

(16) In this invention, the tool of said multiple sequence alignment is bioinformatics software BioEdit preferably. Said method of multiple sequence alignment is subject to no special restriction, and any common multiple sequence alignment methods known by a person skilled in the art can be used.

(17) In this invention, the length of said conserved short peptides is preferably 11 amino acids. When the length of conserved short peptides is greater than 11, the conserved short peptides whose whole hydrophilicity is higher will be selected preferably in the invention.

(18) After getting the conserved short peptides, hydrophilic short peptides with a hydrophilicity index higher than 3.5 are selected from said conserved short peptides.

(19) In this invention, said hydrophilicity index is preferably calculated by using formula I below:

(20) hydrophilicityindex = .Math. i = 1 n f ( i ) n formula I
f(i) is the hydrophilicity number of single amino acid; n=11.

(21) In this invention, said hydrophilicity numbers of single amino acid are shown in Table 1.

(22) TABLE-US-00001 TABLE 1 hydrophilicity numbers of common amino acids Abbreviation Hydrophilicity Abbreviation Hydrophilicity of amino acid number of amino acid number Arg (R) 4.5 Gly (G) 0.4 Lys (K) 3.9 Ser (S) 0.8 Asn (N) 3.5 Thr (T) 0.7 Asp (D) 3.5 Ala (A) 1.8 Gln (Q) 3.5 Met (M) 1.9 Glu (E) 3.5 Cys (C) 2.5 His (H) 3.2 Phe (F) 2.8 Pro (P) 1.6 Leu (L) 3.8 Tyr (Y) 1.3 Val (V) 4.2 Trp (W) 0.9 Ile (I) 4.5

(23) In this invention, preferably, before selecting the hydrophilic short peptides, clustering analysis is made on said conserved short peptides to obtain short peptides of different branches. The purpose of said clustering analysis is to select conserved short peptides composed of different amino acids.

(24) In this invention, preferably, the tool of said clustering is bioinformatics software MEGA6. The method of said clustering is subject to no special restriction, and any common clustering methods known by a person skilled in the art can be used.

(25) After obtaining the hydrophilic short peptides, said hydrophilic short peptides are arranged in the order of isoelectric point size from large to small, and then spliced to obtain salt tolerant protein NLEA.

(26) In this invention, molecular weights, isoelectric points and electric charge polarity of said hydrophilic short peptides are analyzed. Preferably, the said tool of analysis is online software SMART.

(27) In this invention, the properties of part of said hydrophilic short peptides such as molecular weight, isoelectric point and electric charge polarity are shown in Table 2.

(28) TABLE-US-00002 TABLE2 Listofpropertiesofhydrophilicshortpeptidessuchasmolecularweight, isoelectricpointandelectricchargepolarity Average Number Iso- Total hydro- Hydrophilic ofamino Molecular electric Molecular number philicity shortpeptides acids weight point formula ofatoms index KKRRRKERKEK 11 1541.8 11.59 C.sub.64H.sub.124N.sub.28O.sub.16 232 -4.045 KRRRKERKEKK 11 1541.8 11.59 C.sub.64H.sub.124N.sub.28O.sub.16 232 -4.045 EKRKKKKKEKK 11 1457.8 10.47 C.sub.64H.sub.124N.sub.22O.sub.16 226 -3.882 KRKKKKKEKKE 11 1457.8 10.47 C.sub.64H.sub.124N.sub.22O.sub.17 226 -3.882 RKKKKKEKKEK 11 1457.8 10.47 C.sub.64H.sub.124N.sub.22O.sub.18 226 -3.882 EKKKKKEKKKK 11 1429.8 10.3 C.sub.64H.sub.124N.sub.20O.sub.16 224 -3.827 KKKKKEKKKKD 11 1415.7 10.3 C.sub.63H.sub.122N.sub.20O.sub.16 221 -3.827 KKKKKEKKKKD 11 1571.6 9.4 C.sub.60H.sub.106N.sub.28O.sub.22 216 -3.955 EEKKRRRKERK 11 1542.8 11 C.sub.63H.sub.119N.sub.27O.sub.18 227 -4.009 EKRKKKKEKKK 11 1457.83 10.47 C.sub.64H.sub.124N.sub.22O.sub.16 226 -3.882

(29) In this invention, the method of said splicing is subject to no special restriction, and any common splicing methods known by a person skilled in the art can be used.

(30) In this invention, most amino acid residues of said salt tolerant protein NLEA are alkaline and hydrophilic amino acids, without cysteine or tryptophan, with strong hydrophilicity and conservativeness, and without complex tertiary structure.

(31) This invention provides a coding gene nlea of said salt tolerant protein, which has the nucleotide sequence shown as SEQ ID NO:2.

(32) In this invention, the selection method of said coding gene nlea preferably includes the following steps:

(33) A. translating the amino acid sequence of the salt tolerant protein to nucleotide sequence in accordance with that one amino acid corresponds to one or more codons;

(34) When one amino acid corresponds to several codons, dominant codon is selected.

(35) Said dominant codon is determined based on the use frequencies of different codons of the same amino acid in microzyme genome in Genetic Codes database. The codon with the highest use frequency is the dominant one.

(36) B. Adjusting said nucleotide sequence obtained by translation to exclude sequences with a GC content higher than 60%.

(37) This invention also provides a recombinant expression vector containing said coding gene. In this invention, preferably, said recombinant expression vector is pYES2-nlea.

(38) In this invention, the construction method of said recombinant expression vector preferably includes the following steps:

(39) (1) taking the coding gene NLEA sequence synthesized by the company as template, conducting PCR amplification, recycling the product of PCR amplification, and obtaining PCR product;

(40) (2) Performing double digestion on plasmid pYES2 by using SacI and EcoRI enzymes to get linearized plasmid PYES2;

(41) (3) Connecting the gel extraction product of linearized plasmid pYES2 and the gel extraction PCR product of NLEA by means of In-Fusion reaction to get recombinant expression vector.

(42) Said Step (1) and Step (2) are subject to no restriction in time sequence.

(43) In this invention, the synthesized sequence of coding gene nlea is taken as template for PCR amplification. In this invention, the nucleotide sequence of the forward primer for said PCR amplification is shown as SEQ ID NO:3; the nucleotide sequence of the reverse primer for said PCR amplification is shown as SEQ ID NO:4.

(44) In this invention, said PCR amplification system preferably includes DNA template 1 L, 5 Phushion HF buffer 4 L, 10 mmol/L dNTPs 0.4 L, 10 M gene specific primers 1 L, Phusion DNA polymerase 0.2 L, DMSO 0.6 L, and the whole reaction volume is made up a deficiency to 20 L by using water.

(45) In this invention, preferably, said PCR amplification procedure is 98 C. 30 s, taking 98 C. 10 s, 60 C. 30 s, 72 C. 1 min as one cycle, conducting 35 cycles, and finally 72 C. 5 min.

(46) After said PCR amplification, the PCR amplification products are reclaimed. Said reclaiming of amplification products is conducted preferably by using reagent kit. The type of said reagent kit is subject to no special restriction, and any common reagent kit for the PCR amplification products known by a person skilled in the art can be used.

(47) In this invention, double digestion is performed on plasmid PYES2 by using SacI and EcoRI enzymes.

(48) In this invention, the source of said SacI and EcoRI enzymes is subject to no special restriction, and any common SacI and EcoRI enzymes known to a person skilled in the art can be used. In the embodiments of this invention, said SacI and EcoRI enzymes were purchased from Beijing NEB Company.

(49) In this invention, the source of said plasmid pYES2 is subject to no special restriction, and any common sources known to a person skilled in the art can be used. In the examples of this invention, said plasmid pYES2 was presented by China Agricultural University.

(50) In this invention, the reaction system of said digestion is 20,000 unit/ml SacI 1 L, EcoRI 1 L, 10 CutSmart 2.5 L, pYES2 5 L; the digestion temperature is preferably 50 C., incubating for 1 h; deactivating at 65 C. for 15 min.

(51) After obtaining plasmid pYES2 and gel extraction PCR product of NLEA, the gel extraction product of linearized plasmid pYES2 and the gel extraction PCR product of nlea are connected to get recombinant expression vector.

(52) In this invention, said connection is preferably performed by means of In-Fusion reaction. The principle of said In-Fusion reaction is 15 homologous bases at the ends of inserted sections and the linearized vector (see FIG. 1).

(53) In this invention, the system of said In-Fusion reaction is as below: the linearized vector pYES2 3 L, PCR product 5 L, 5 In-Fusion HD Enzyme Premix 2 L, reaction temperature 50 C., incubating for 15 min.

(54) In this invention, said recombinant expression vectors are preferably verified before use. The method of said verification includes the following steps:

(55) {circle around (1)} transforming the recombinant expression vector into E. coli competent cell DH5a through thermal shock transformation, and then screening by using ampicillin;

(56) {circle around (2)} Performing bacterium shaking and plasmid extraction on the obtained bacterial plaque, saving the bacterial colony for PCR appraisal; there are T7 primer and infusion-nlea reverse primer in the upstream section of PYES2-NLEA; PCR appraisal is performed by taking the transformed plasmid as template to get the positive clone;
{circle around (3)} Sequencing the positive clone, alignment between the sequenced sequence and the original sequence is conducted, and the positive clones with a similarity of 100% is the successfully transformed recombinant E. coli.

(57) This invention also provides a host containing said recombinant expression vectors.

(58) In this invention, the host preferably includes S. cerevisiae, E. coli or Agrobacterium.

(59) In this invention, the host containing said recombinant expression vectors has good salt tolerance in sodium chloride solution with molar concentrations within the range of 0.5 mol/L0.8 mol/L.

(60) In this invention, preferably, the preparation method of said host is transforming said recombinant expression vectors into the hosts.

(61) In this invention, the method of said transformation is subject to no special restriction, and any common transformation methods known to a person skilled in the art can be used.

(62) This invention also provides the use of said coding gene as selection marker in microorganism transgenic cell culture.

(63) In this invention, said use as selection marker is specifically as follows: the transgenic cells or hosts are cultured under saline condition, only the cells or hosts with normal growth are successful transgenic material, and hosts without transformed salt tolerant protein cannot grow under saline condition.

(64) An artificial salt tolerant protein, its synthetic method and coding gene thereof, and use of gene provided in this invention are described in detail below in combination with examples. However, these examples should not be understood as definition of the protection scope of this invention.

(65) Unless defined otherwise, all terms of art, notations and other technical and scientific terms or terminology used herein are intended to have the same meaning as is commonly understood by one of ordinary skill in the art to which the claimed subject matter pertains. In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, and the inclusion of such definitions herein should not necessarily be construed to represent a substantial difference over what is generally understood in the art. Many of the techniques and procedures described or referenced herein are well understood and commonly employed using conventional methodology by those skilled in the art.

(66) Unless defined otherwise, all terms of art, notations and other technical and scientific terms or terminology used herein are intended to have the same meaning as is commonly understood by one of ordinary skill in the art to which the claimed subject matter pertains. In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, and the inclusion of such definitions herein should not necessarily be construed to represent a substantial difference over what is generally understood in the art. Many of the techniques and procedures described or referenced herein are well understood and commonly employed using conventional methodology by those skilled in the art.

(67) All headings are for the convenience of the reader and should not be used to limit the meaning of the text that follows the heading, unless so specified.

(68) As used herein and in the appended claims, the singular forms a, an, and the include plural referents unless the context clearly dictates otherwise. For example, a or an means at least one or one or more.

Example 1

(69) Plasmid with synthesized gene determined by sequencing is taken as template, and the synthesized gene is also taken as template. The gene synthesized by the company is directly connected to puc57 vector which is made by the company itself, and this gene cannot exist alone. Amplification is conducted by using the positive primer shown in SEQ ID NO:3 and the reverse primer shown in SEQ ID NO:4, with the synthesized gene as template. The PCR amplification system is as follows: the total reaction system includes puc57-NLEA 1 L, 5 phushion HF buffer 4 L, 10 mmol/L dNTPs 0.4 L, 10 M gene specific primers 1 L, Phusion DNA polymerase 0.2 L, DMSO 0.6 L, and the whole reaction volume is made up a deficiency by using 11.8 L water. The PCR amplification procedure is: 98 C. 30 s; and taking 98 C. 10 s, 60 C. 30 s, 72 C. 1 min as one cycle, conducting 35 cycles; and finally 72 C. 5 min. The whole sample of 50 L is applied for electrophoresis, and PCR products are reclaimed after agarose electrophoresis. puc57 vector in this example can be purchased.

Example 2

(70) Double Enzyme Digestion on Vector PYES2

(71) Plasmid PYES2 appraised by sequencing is taken as template, and SacI and EcoRI enzymes are used for the double enzyme digestion. The reaction system is as follows: 20,000 unit/ml SacI 1 L, EcoRI 1 L, 10 CutSmart 2.5 L, PYES2 5 L; the digestion temperature is preferably 50 C., with incubation 1 h; 65 C., with deactivation 15 min. The whole sample of 25 L is applied for eletrophoresis, and digestion products are reclaimed after agarose electrophoresis.

(72) FIG. 2 is a diagram of agarose gel electrophoresis of the vectors PYES2 after double enzyme digestion.

Example 3

(73) Construction of Expression Vector PYES2-NLEA

(74) The properly appraised gel extraction product of linearized plasmid PYES2 and the gel extraction PCR product of NLEA are connected by means of In-Fusion reaction. The reaction system is as follows: linearized vector PYES2 3 L, PCR product 5 L, 5 In-Fusion HD Enzyme Premix 2 L, reaction temperature 50 C., incubating for 15 min.

(75) The constructed recombinant expression vector is transformed into E. coli DH5a competent cell through thermal shock transformation, which specifically includes that, 2 L connection product is transformed into 50 L DH5 competent cell. The reactant is placed in ice bath for 30 min, thermal shock at 42 C. for 50 s, and placed onto ice quickly to keep for 2 min; 700 L liquid LB culture media is added and shaken for 1 h at 37 C., 250 rpm. The reactant is centrifuged for 1 min at 5,000 rpm. The supernatant of 600 L is removed, and the rest 150 L is blown and sucked to suspend thalli. All the residual bacteria liquid is spread on solid LB plate containing 100 mg/ml Amp, to screen by ampicillin. Bacterium shaking and plasmid extraction for the bacterial plaque obtained by transformation are performed, and the bacterial colony is saved for PCR appraisal; there are T7 primer and infusion-nlea reverse primer in the upstream section of PYES2-NLEA, and PCR appraisal is performed by using the transformed plasmid as template. The positive clone is sequenced, and alignment between the sequencing result and the coding gene is conducted. Sequence of complete consistence indicates successful construction of PYES2-NLEA. FIG. 3 shows an expression vector containing salt tolerant coding gene NLEA.

Example 4

(76) Transformation of Expression Vector pYES2-nlea into S. cerevisiae

(77) 1) 50 L S. cerevisiae competent cells is melt on ice, and then 24, constructed recombinant vector is added.

(78) 2) 5 L denatured linearized DNA is added into 50 L PEG/LiAc, mixed up gently, and placed on ice for half an hour.

(79) 2) 20 L DMSO is added into the mixture of step 1).

(80) 4) the reactant is placed to water bath for 15 min at 42 C.

(81) 5) high speed centrifugation 15 s.

(82) 6) the supernatant is discarded, and thalli are suspended with 1 ml YPD Plus, and shaken for 1 h at 30 C.

(83) 7) After high speed centrifugation for 15 s, the supernatant is discarded, and cells are suspended with 0.9% NaCl solution.

(84) 8) the bacteria solution is diluted with NaCl solution respectively by 10 times and 100 times.

(85) 9) 100 L diluted bacteria solution is spread on auxotrophic culture medium SC-Ura.

(86) 10) being cultured for 3-5 days at 30 C., the positive clones are selected.

(87) The auxotrophic culture medium is used to screen out whether the recombinant plasmid is successfully transformed into microzyme. As such culture media is uracil auxotrophic culture medium, and pYES2 contains uracil gene, only microzyme with successfully transcribed pYES2 can grow in this culture medium.

Example 5

(88) Phenotype Appraisal of Transgenic Microzyme

(89) S. cerevisiae with empty vector pYES2 (as control group) and S. cerevisiae with pYES2-nlea are respectively placed into selection culture medium with galactose induced liquid uracil, to observe their growth curve. If s shown from the observation results that, the S. cerevisiae with PYES2-NLEA had a better growth than the S. cerevisiae with empty vector PYES2. Then, high temperature sterilization is performed.

(90) The growth conditions of the S. cerevisiae with empty vector PYES2 and S. cerevisiae with pYES2-nlea are observed after three days of growth (FIG. 4). If s shown from FIG. 4 that, S. cerevisiae with pYES2-nlea had significantly better growth than S. cerevisiae with empty vector pYES2 under the treatment condition of 600 mM NaCl.

Example 6

(91) S. cerevisiae with empty vector pYES2 (as control group) and S. cerevisiae with pYES2-nlea are respectively placed into selection culture medium with galactose induced solid uracil. This culture medium includes salt treatments of different concentrations, respectively 0M, 0.1M, 0.2M, 0.3M, 0.4M, 0.5M, 0.6M, 0.7M, 0.8M, 0.9M and 1.0M.

(92) The phenotype of NLEA transformed S. cerevisiae and wild strains under different salt stresses are observed, and the phenotype observations of NLEA transformed S. cerevisiae and wild strains under different salt stresses are shown in FIG. 5. If s shown from FIG. 5 that, the salt tolerance of NLEA transformed S. cerevisiae in sodium chloride solution with molar concentrations of 0.5 mol/L and 0.6 mol/L is significantly different from that of the control group, and the highest salt tolerance of NLEA transformed S. cerevisiae is up to 0.8 mol/L NaCl.

Example 7

(93) Plant expression vector pGWB14-NLEA which was constructed through Gateway technology was transformed into Agrobacterium GV3101 cells. Model plant of Arabidopsis thaliana were transfected by Agrobacterium GV3101 cells with pGWB14-NLEA transformed, and transgenic plants over-expressing nlea gene were then obtained by Agrobacterium-medicated transfection technology. Positive transfected plant were identified by kanamycin selective screen.

(94) Wild type and transgenic Arabidopsis plants were sown on 1/2 MS solid media with different concentrations of NaCl as follows: 0 mM, 50 mM, 100 mM, 150 mM and 200 mM, and their growth under different salt stress were observed. It's shown by the observing results that, wild type plants turned to white, while transgenic plants stayed green after 20 days treatments. By transferring all plants to fresh 0 mM NaCl media, transgenic plants survived, while wildtype plants can't recover from salt treatments and all died (see FIG. 6).

(95) The aforesaid embodiments are only preferred examples of present invention. It should be noted that a common person skilled in the art may make some improvements and modifications based on the principles of present invention. Such improvements and modifications should be deemed within the protection scope of present invention.