Peptide sequence of a guide protein for the production of a peptide of interest, an expression vector, a host cell, and a method for the production of a peptide of interest

11261472 · 2022-03-01

Assignee

Inventors

Cpc classification

International classification

Abstract

A peptide sequence of a guide protein for the production of a peptide of interest; a peptide sequence that has a similarity of at least 90% to SEQ. ID. NO 1; a nucleotide sequence encoding the guide protein; an expression vector comprising the nucleotide sequence; a host cell that expresses a fusion protein comprising a peptide of interest; a method of production of a peptide of interest, comprising the steps of A) constructing an expression vector; B) inserting the expression vector into a host cell; C) expressing the fusion protein, culturing the host cell in a culture medium; D) recovering the accumulated fusion protein in the host cell; E) cleaving the fusion protein; F) purifying the peptide of interest.

Claims

1. An isolated peptide sequence of a guide protein for the production of a peptide of interest, comprising the sequence SEQ ID NO: 1.

2. An isolated peptide sequence of a guide protein for the production of a peptide of interest, comprising a sequence comprising at least 92% identity with SEQ ID NO: 1.

3. An isolated nucleotide sequence encoding the guide protein of claim 1, comprising the sequence SEQ ID NO: 2.

4. An expression vector of a fusion protein for the production of a peptide of interest, comprising: a sequence encoding for the guide protein, according to any of claim 1, (ii) a sequence that encodes a cleavage site, and (iii) at least one sequence encoding a peptide of interest, wherein the sequence which encodes the cleavage site is located between the sequences of (i) and (ii).

5. The expression vector of claim 4, further comprising a sequence encoding a purification tag.

6. The expression vector of claim 4, further comprising a sequence of an expression promoter operably linked to the sequence encoding the fusion protein.

7. The expression vector of claim 4, further comprising a sequence encoding at least one selection marker.

8. A host cell, transformed with the expression vector of claim 4.

9. The host cell of claim 8, selected from the group of microorganisms, consisting of Escherichia coli, Bacillus subtilis, and Saccharomyces cerevisiae.

10. A method for producing a peptide of interest, comprising: A) constructing an expression vector according to claim 4; B) inserting the expression vector of step A) into a host cell; C) expressing the fusion protein, culturing the host cell of step B) in a culture medium; D) recovering the fusion protein accumulated in the host cell; E) cleaving the fusion protein and F) purifying the peptide of interest.

11. The method of claim 10, wherein in step C) the host cell is cultured at a temperature between 20 and 40° C.

12. The method of claim 11, wherein in step C) the culture medium comprises a nutritive medium, a selection marker, and an inducer.

13. The method of claim 12, wherein the nutrient medium is Luria Bertani; the selection marker is ampicillin; and the inducer is isopropyl-β-D-1-thiogalactopyranoside.

14. The method of claim 10, wherein in step D) the separation and purification techniques are selected from centrifugation, chromatography, or precipitation.

15. The method of claim 10, wherein in step E) the fusion protein is solubilized in a cleavage solution by at least one step of dialysis, dilution, retention in column chromatography, or precipitation and dissolution.

16. The method of claim 15, wherein the cleavage solution is a semi-denaturing solution.

17. The method of claim 10, wherein in step E) the peptide is separated from the fusion protein by protease cleavage, chemical cleavage or metal-catalyzed cleavage.

18. The method of claim 10, wherein in step F) the peptide is purified by a chromatographic and/or precipitation method.

19. The method of claim 18, wherein the chromatographic method is selected from: metal affinity, size exclusion, or reverse phase.

20. An expression vector of a fusion protein for the production of a peptide of interest, comprising: (i) a sequence encoding for the guide protein, according to any of claim 2, (ii) a sequence that encodes a cleavage site, and (iii) at least one sequence encoding a peptide of interest, wherein the sequence which encodes the cleavage site is located between the sequences of (i) and (iii).

Description

DESCRIPTION OF THE FIGURES

(1) FIG. 1: Comparison of expression of the fusion protein using SSD guide protein (FIG. 1A), vs using KSI guide protein (FIG. 1B) for the production of the peptide of interest p53pAnt. In this figure it is shown a comparison of the SDS-PAGE analysis of the intracellular protein 1: total protein, 2: soluble protein, 3: insoluble protein. In column 1, it is observed that when using the SSD guide protein, a higher total fusion protein productivity is obtained with respect to the productivity obtained when using the KSI guide protein; in column 2, it is observed that with the SSD guide protein, fusion protein is obtained in soluble form, whereas with KSI it is not obtained; In column 3 it is observed that in the SSD case the insoluble fusion protein is in a higher degree of purity than when using the KSI guide protein, since no contaminating band is observed.

(2) FIG. 2: SDS-PAGE analysis of the purified fusion protein from total protein in a single chromatography step. 1: Purified fusion protein.

(3) FIG. 3: SDS-PAGE analysis of precipitation of the fusion protein induced by temperature. 1: total intracellular protein; 2: soluble intracellular protein; 50° C. for 10 min, 3: pellet, 4: supernatant; 45° C. for 15 min, 5: pellet; 6: supernatant; 40° C. for 20 min, 7: pellet; 8: supernatant; 35° C. for 30 minutes, 9: pellet; 10: supernatant; 30° C. for 35 min, 11: pellet; 12: supernatant. In column 11, selective precipitation of the fusion protein with an approximate purity of 75% is observed.

(4) FIG. 4: Comparison of fusion protein productivity obtained using KSI guide protein at 37° C., SSD at 37° C., SSD at 25° C. It is observed that the productivity obtained with SSD guide protein at 37° C. is higher than that obtained with the guide protein of the state of art, also the productivity obtained by expressing at 25° C. is higher than that obtained at 37° C.

DESCRIPTION OF THE INVENTION

(5) The present application refers to a peptide sequence, according to SEQ ID NO: 1, of a guide protein, here called SSD guide protein, which is used for the recombinant production of peptides of interest, included in a recombinant fusion protein, as well as the nucleotide sequence SEQ ID NO: 2, which encodes for said SSD guide protein.

(6) The present application also discloses an expression vector comprising said sequence encoding said SSD guide protein, and a sequence encoding at least one peptide of interest. This expression vector is used for the recombinant production of a fusion protein.

(7) The present application also describes a host cell transformed with said expression vector. This host cell, when cultured in the form of batch culture, semi-continuous culture, or continuous culture, expresses the fusion protein comprising the SSD guide protein, and the peptide of interest.

(8) The present application also describes a method of production of recombinant peptides of interest. Said method comprises the construction of an expression vector for the recombinant production of a fusion protein. Said fusion protein comprises the SSD guide protein, and at least one copy of a peptide of interest. Said vector is introduced into a host cell, which expresses the fusion protein. The fusion protein produced further comprises a cleavage site between the sequences of the SSD guide protein and the peptide of interest, and a purification tag to facilitate the recovery of the recombinant protein.

(9) This method includes the stages of

(10) A) constructing expression vector,

(11) B) inserting expression vector into host cell,

(12) C) expressing the fusion protein,

(13) D) recovering the fusion protein,

(14) E) cleaving the fusion protein,

(15) F) purifying the peptide of interest.

(16) These stages are described in detail below:

(17) A) Constructing Expression Vector.

(18) Constructing an expression vector, which includes: a nucleotide sequence coding for the SSD guide protein of sequence SEQ ID NO: 1, or a nucleotide sequence coding for a guide protein with a similarity of at least 90% with respect to SEQ ID NO: 1, at least one sequence that encodes a peptide of interest, a sequence encoding a cleavage site, with which it is then possible to separate the peptide of interest from the rest of the protein, wherein the cleavage site is preferably a recognition site for the protease thrombin, and a sequence encoding a purification tag, with which it is then possible to facilitate the purification of the fusion protein, wherein the purification tag is preferably a polyhistidine tag.

(19) B) Inserting Expression Vector into a Host Cell.

(20) Introduce the expression vector constructed in step A, in a host cell, wherein the host cell is preferably selected from Escherichia coli, Bacillus subtilis or Saccharomyces cerevisiae.

(21) C) Expressing the Fusion Protein.

(22) This step consists of culturing the transformed host cells of step B, in a culture medium for the expression of the fusion protein, wherein the culture medium comprises a nutritive medium, a selection marker, and an inducer. Preferably said nutrient medium is Luria Bertani (LB medium); said selection marker is ampicillin; and said inducer is isopropyl-β-D-1-thiogalactopyranoside (IPTG).

(23) In this culture, the transformed host cells produce the fusion protein comprising the peptide of interest, and the SSD guide protein. This fusion protein is accumulated by the host cell preferably in inclusion bodies.

(24) This stage of expression of the fusion protein is carried out in a temperature range between 20 and 40° C., preferably between 23 and 26° C., and for a time between 1 and 24 hours for batch cultures, or for an indefinite period for a continuous crop.

(25) This preferred temperature range between 23 and 26° C. is different from the preferred ranges commonly used in the state of art, which are usually between 35 and 40° C., preferably at 37° C.

(26) In this preferred temperature range between 23 and 26° C., the host cell obtains a productive result superior to that described in the state of art. (FIG. 4)

(27) This superior productive result is not obtainable with any guide protein described in the state of art, besides the consequent reduction of the working temperature, generating an energy saving.

(28) D) Recovering the Fusion Protein

(29) Once the fusion protein containing the peptide of interest is produced and accumulated within the host cell, the present application describes a step of separating and purifying the fusion protein.

(30) Said separation and purification step comprises lysing the host cell and then recovering the fusion protein in whole or in part using traditional separation and purification methods, such as centrifugation, precipitation and chromatography, wherein the chromatography method is preferably metal affinity chromatography.

(31) E) Cleaving the Fusion Protein

(32) The recovered fusion protein is solubilized in cleavage solution by at least one step of dialysis, dilution, retention in chromatography column, or precipitation and dissolution, and then to remove the SSD guide protein, it is processed by proteolytic methods, such as enzymatic proteolysis, chemical proteolysis by protease cleavage, chemical cleavage or metal-catalyzed cleavage, and where the cleavage solution is preferably a semi-denaturing solution and the proteolytic method is enzymatic proteolysis with thrombin.

(33) F) Purifying the Peptide of Interest

(34) Finally, the peptide of interest is recovered by chromatography and/or precipitation methods, preferably metal affinity chromatography, size exclusion, or reverse phase.

(35) Among the technical advantages of the present application is that the method provides an improved result of fusion protein productivity when using a temperature range between 23 and 26° C., a value that is lower than that usually used between 35 and 40° C., saving energy (FIG. 4).

(36) The described SSD guide protein allows the chromatography step to be optional for the recovery of the fusion protein, giving the flexibility to choose between: 1) to recover in a single chromatography step the totality of the fusion protein produced, (FIG. 2 column 1); or 2) to recover only the fraction of fusion protein produced in insoluble form without the need for chromatography (FIG. 1A column 3); 3) to recover only the soluble fraction of the fusion protein, by means of a selective precipitation by incubation at a specific temperature, without the need of chromatography (FIG. 3, column 11).

(37) These three forms are not obtainable using a guide protein known from the state of art. This is a technical result obtained due to the molecular properties of the SSD guide protein.

(38) This method allows a gross intracellular productivity greater than 130 mg of peptide of interest per gram of dry cell, which corresponds to a substantial yield improvement. This improvement is not obtainable from the use of the guide proteins described in the state of art.

(39) Up to 100 mg of peptide of interest is recovered per gram of dry cells, depending on the recovery method of the fusion protein used, which corresponds to an improvement in final productivity with respect to that described in the state of art.

(40) The present invention allows an improvement in yield with respect to the prior art. The use of this SSD guide protein allows the evaluation of different recovery alternatives of the recombinant protein.

(41) In addition to the obtaining of an increased productivity in expression and recovery of any recombinant peptide of interest, the method is carried out in a temperature range lower than that of previously described in the state of art.

APPLICATION EXAMPLES

Example

(42) Step A: Constructing Expression Vector

(43) An expression vector containing a coding sequence for the SSD guide protein of sequence SEQ ID NO: 1 was constructed. A sequence coding for a polyhistidine peptide at the N-terminus of the SSD guide protein and a coding sequence for a cutting site with thrombin at the C-terminal end were also incorporated. Finally, recognition sites were incorporated for the restriction enzymes AvrII and Pad, to be used as a cloning site of any nucleotide sequence coding for some peptide of interest. The design allows this last sequence to be inserted following the coding sequence for thrombin cleavage site. The DNA was incorporated into the expression vector pET22b (+) using the restriction sites recognized by the NdeI and XhoI enzymes, leaving the entire coding region under the control of the T7lac promoter.

(44) The coding sequence for the peptide of interest p53pAnt was inserted into the expression vector containing the coding sequence for the SSD guide protein. For this, the nucleotide sequence encoding the p53pAnt peptide was synthesized as two pairs of complementary single-stranded oligonucleotides. The oligonucleotides were synthesized with the restriction sites for the AvrII and PacI enzymes, for cloning into the expression vector, in the reading frame of the guiding protein. This gene was digested, simultaneously with the expression vector, with the AvrII and PacI enzymes and then both fragments were ligated.

(45) Step B: Inserting the Expression Vector into Host Cell

(46) E. Coli BL21(DE3) cells were transformed with the ligation product, using resistance to ampicillin as selection marker.

(47) Step C: Expressing the Fusion Protein

(48) A culture of Escherichia coli cells transformed with the expression vector was grown in LB medium with 100 ug/mL of ampicillin for 15 hours and an inoculum for cell growth in fresh LB medium was used, with 100 μg/mL of ampicillin. The cells were grown up to an optical density of about 1.0 to 600 nm. Then, the IPTG inducer was added to a final concentration of 1 mM and cells were incubated at 25° C. for 3.5 hours. The cells were collected by centrifugation (5 min at 5,000×g) and the pellet was frozen at −20° C. for further methoding. In FIG. 1, it can be seen that the fusion protein is accumulated within the cell, reaching about 60% of the total intracellular protein (column 1).

(49) Step D: Recovering the Fusion Protein

(50) The recovery of the fusion protein was carried out using 3 different forms.

(51) Recovery Form 1: Recovery of the Total Fusion Protein Expressed.

(52) The Escherichia coli cells containing the fusion protein inside comprising the sequence of the SSD guide protein fused to the peptide sequence of interest were solubilized in 40 mM Tris solution, 500 mM NaCl, 8 M Urea, pH 8.0, and lysed by sonication. The solution was centrifuged 10 min at 10,000×g and the supernatant containing the fusion protein was collected. The fusion protein was purified by metal affinity chromatography using a nickel column. Using this method, it was possible to obtain 400 mg of fusion protein per gram of dry cell weight, with a purity greater than 95% in a single chromatographic step. In FIG. 2, column 1, the total protein recovered is shown.

(53) Recovery Form 2: Partial Recovery of the Expressed Fusion Protein: Insoluble Fraction

(54) The Escherichia coli cells containing the fusion protein inside were solubilized in 40 mM Tris solution, 500 mM NaCl, pH 8.0, and lysed by sonication. The solution was centrifuged 10 min at 10,000×g and the pellet was collected, which was solubilized in 40 mM Tris, 500 mM NaCl, 8 M Urea, pH 8.0. The solubilized protein was centrifuged for 10 min at 10,000×g and the supernatant containing the fusion protein was recovered. By this method it was possible to obtain about 300 mg of fusion protein per gram of dry cell weight, with a purity greater than 90% without the need of purification by chromatography. In FIG. 1, column 3, the insoluble protein recovered is shown.

(55) Recovery Form 3: Partial Recovery of the Expressed Fusion Protein: Soluble Fraction

(56) The Escherichia coli cells containing the fusion protein inside were solubilized in 40 mM Tris solution, 500 mM NaCl, pH 8.0, and lysed by sonication. The solution was centrifuged 10 min at 10,000×g and the supernatant was recovered. The fusion protein contained in the supernatant was recovered by selective precipitation by temperature. About 100 mg of fusion protein was obtained, with a purity of approximately 75%, by incubation at 30° C. for 35 minutes and without the need of purification by chromatography. In FIG. 3, column 11, the recovered soluble protein is shown.

(57) Stage E: Cleaving the Fusion Protein

(58) The peptide of interest was recovered from the fusion protein. For this, a cutting sequence between the SSD guide protein and the peptide of interest was previously introduced in the construction of the fusion protein.

(59) The fusion protein was solubilized in cleavage solution 20 mM Tris, 150 mM NaCl, pH 8.0, 0.3% sarkosyl to be digested by thrombin protease. For this, the fusion protein recovered according to form 1 was dialyzed against cleavage solution; the fusion protein recovered according to form 2 was precipitated by dilution and then dissolved in cleaving solution; the fusion protein recovered according to form 3, was dissolved directly in cleavage solution. Enzymatic cleavage resulted in the digestion of up to 95% of the fusion protein.

(60) Stage F: Purifying the Peptide of Interest

(61) The peptide of interest released in the enzymatic cleavage was purified by metal affinity chromatography using a nickel column, recovering 80% of the free peptide.

(62) In the case where the fusion protein was recovered in stage E, according to form 1, a final productivity of 100 mg of peptide per gram of dry cell weight was achieved; in the case where the fusion protein was recovered according to form 2, a final productivity of 55 mg of peptide per gram of dry cell weight was achieved; and in the case where the fusion protein was recovered according to form 3, a final productivity of 20 mg of peptide per gram of dry cell weight was achieved.

(63) TABLE-US-00001 SEQUENCE LIST SEQ ID NO 1: SSD guide protein: DETGKELILVLYDYQEKSPRELTIKKGDILTLLNSTNKDWWKVEVND RQGFFPAANLKKLD SEQ ID NO 2: Nucleotide sequence GAT GAA ACC GGT AAA GAA CTT ATC CTG GTT CTG TAC GAT TAT CAA GAG AAA AGC CCG CGC GAA TTG ACT ATT AAG AAA GGC GAT ATT TTA ACC CTG CTC AAT TCT ACC AAC AAG GAT TGG TGG AAA GTG GAA GTC AAC GAC CGT CAG GGC TTC TTT CCA GCG GCC AAC CTG AAA AAA CTG GAC