EXPRESSION AND LARGE-SCALE PRODUCTION OF PEPTIDES

Abstract

The invention provides a method for the large-scale preparation of small peptides using recombinant DNA technology. Overexpression of small peptides, such as liraglutide precursor, as concatemers, improves the overall efficiency of the process due to increased yields per batch of the biologically active peptide. Digestion of these concatemers by combinations of specific enzymes yields the desired peptide monomer in large quantities. More particularly, the invention relates to the production of recombinant peptide precursor of liraglutide

Claims

1. A concatemeric DNA construct for producing a peptide of SEQ ID 1, wherein the concatemeric DNA construct comprises: a. DNA construct encoding a peptide of SEQ ID 1, codon optimized for expression in a suitable host b. wherein each unit of (a) is linked at its 3 .sup. end to a monomeric or polymeric codon optimized spacer DNA sequence to encode for monomeric or polymeric units of the amino acids X.sub.1X.sub.2, wherein X.sub.1 is Lys or Arg and X.sub.2 is Lys or Arg; c. obtaining concatemeric DNA construct for cloning into a suitable host capable of being expressed as multimers of SEQ ID 1; and d. obtaining multimers of SEQ ID 1, and treating with a combination of at least two proteases to obtain monomeric units of SEQ ID 1.

2. The concatemeric DNA construct of claim 1, wherein the concatemer comprises of at least about 6 monomeric units.

3. The concatemeric DNA construct of claim 1, wherein the DNA construct is at least about 500 bps.

4. The concatemeric DNA construct of claim 1, wherein the DNA construct is expressed in a prokaryotic or eukaryotic host.

5. A multimeric peptide of SEQ ID 1, obtainable from the DNA construct of claim 1.

6. A monomeric peptide of SEQ ID 1, obtainable from the DNA construct of claim 1.

7. A process for producing a peptide of SEQ ID 1, the process comprising: a. obtaining a codon optimized concatemeric DNA construct encoding for multimers of peptide of SEQ ID 1 for expression in a suitable host; b. cloning concatemeric DNA construct of (a) into a suitable vector for expression in a suitable host; c. expressing the concatemeric DNA construct of (a) to produce multimers of peptide of SEQ ID 1 as inclusion bodies; d. simultaneously or sequentially contacting multimeric units of (c) with at least two proteases to obtain the peptide of SEQ ID 1.

8. The process as claimed in claim 7, wherein the vector is a pET vector.

9. The process as claimed in claim 7, wherein at least two inducers are used to induce expression of the concatemeric DNA construct.

10. The process as claimed i n claim 7, wherein the inducers are arabinose and IPTG.

11. The process of claim 1, wherein the proteases are Kex2 protease and Carboxypeptidase B.

12. The process as claimed i n claim 7, wherein the contact with kex2 protease and carboxypeptidase B is simultaneous.

13. The process as claimed i n claim 7, wherein the contact with kex2 protease and carboxypeptidase B is sequential.

Description

BRIEF DESCRIPTION OF ACCOMPANYING FIGURES

[0034] FIG. 1 gives a schematic representation of the concatemer strategy with mGLP peptide as an example.

[0035] FIG. 2 shows the SDS PAGE gel picture of the E. coli concatemer clones displaying a high level expression of 35 kDa.

[0036] FIG. 3 illustrates the digestion profile of K34R GLP-1(7-37) inclusion bodies using varied concentrations of kex2 protease.

[0037] FIG. 4 illustrates the CPB digestion profile of kex2 protease-digested inclusion bodies

DETAILED DESCRIPTION OF THE INVENTION

[0038] As used herein, the term small peptide_or peptides_refers to those having molecular weight ranging from about 2 to 10 kDa, used as a bio-therapeutic or for diagnostic and research purposes, wherein the preferred peptide is the peptide precursor for liraglutide, namely, K34R GLP-1 (7-37), the mGLP. The above-mentioned precursor contains amino acid residues from 7 to 37 of the glucagon-like peptide-1 (GLP-1) wherein the Lys at position 34 in the naturally occurring GLP-1 is substituted by Arg.

[0039] Especially in case of low molecular weight peptides, like the desired peptide, recombinant technology techniques are used to further enhance yield by expressing tandem gene repeats of the desired peptide that have been referred to herein as :concatemer which is defined as a long continuous DNA molecule that contains serially linked multiple copies of a smaller DNA sequence that codes for a monomer of the desired peptide. A concatemer may comprise 2-20 repeats of the monomer.

[0040] In the concatemer, individual DNA sequences coding for the monomer were separated by short cleavable dipeptidyl spacer sequences between every monomeric units. Many inactive precursors of bioactive peptides contain processing signal sequences made of a pair of basic dipeptides like Arg-Arg, Lys-Lys, Arg-Lys, Lys-Arg that are processed by specific enzymes to give the physiologically active peptides. Several proteases are known to show strict primary and secondary specificities to the above mentioned dipeptides and cleave precisely at the C- or N-terminus of or between the dipeptide. Particularly, this method is effective only when the desired peptide does not contain such a sequence recognizable by the excising enzyme. The preferred peptide K34R GLP-1 (7-37) being free of such basic di peptides in its sequence is an excellent candidate for the above method.

[0041] In the present invention, a concatemeric gene construct possessing intervening codons for the requisite excision sites was synthesized and inserted into a suitable expression vector. As used herein, the term expression vector_refers to a DNA molecule used as a vehicle to artificially carry foreign genetic material into bacterial cell, where it can be replicated and over-expressed.

[0042] The concatemeric gene construct was placed downstream of a T7 promoter in the expression vector. As used herein, the term promoter_refers to a regulatory region of DNA usually located upstream of the inserted gene of interest, providing a control point for regulated gene transcription.

[0043] For cloning, suitable host cells such as E. coli host cells were transformed by the recombinant expression vector. As used herein, an E. coli host_refers to E. coli strains ranging from BL21, BL21 DE3, BL21 Al and others which are routinely used for expression of recombinant proteins.

[0044] In another embodiment, the expressed concatemer was isolated from the cell culture_by one or more steps including lysing of the cells using a homogenizer or a cell press, centrifugation of the resulting homogenate to obtain the target protein as insoluble aggregates.

[0045] In an embodiment, the concatemer was expressed as insoluble inclusion bodies that inherently possessed specific dipeptide sites which, upon digestion with specific enzymes, released the desired monomeric peptide precursors. In a preferred embodiment, the intervening Lys-Arg (K R) sites were cleaved using sequential action of kex2 protease and carboxypeptidase B.

[0046] In another embodiment, the invention relates to a process of producing a biologically active GLP-1 (7-37), the process comprising: [0047] a. creating a concatemeric gene construct containing 9 - 15 repeats of K34R GLP-1 (7-37) gene with each repeat separated from the adjacent one by codons for the K R dipeptide [0048] b. cloning the above concatemeric construct into E. coli using a suitable expression vector [0049] c. expressing the concatemeric gene in E. coli by inducing with arabinose and IPT G [0050] d. isolating the expressed protein from the cell culture in the form of inclusion bodies [0051] e. solubilizing the inclusion bodies at optimal conditions [0052] f. digesting the solubilized inclusion bodies under optimal conditions by sequentially subjecting them to specific enzymes essentially consisting of kex2 protease (kexin) and carboxypeptidase B (CPB)

[0053] Experimental Section

[0054] K34R GLP-1 (7-37) was produced by recombinant DNA technology using genetically engineered E. coli cells. The E. coli cells were cultured and concatemers of the peptide precursor for liraglutide were obtained in the form of inclusion bodies, post induction. Inclusion bodies were processed by (subjected to) solubilization and sequential digestion to release the biologically active K34RGL P-1 (7-37) monomers.

EXAMPLE 1

Synthesis of Concatemer DNA

[0055] The nucleotide sequence derived from the amino acid sequence for K34R GLP-1 (7-37) monomer (SEQ ID 1) was codon optimized for E. coli (SEQ ID 2) to synthesize the K34R GLP-1 (7-37) concatemer (SEQ ID 3) as illustrated in FIG. 1.

EXAMPLE 2

Cloning of GLP Concatemer in pET 24a Expression Vector

[0056] The concatemer was synthesized and cloned into pET24a vector within the cloning sites, Nde I and Hind III. The vector pET24a possesses a strong T7 promoter for the expression of recombinant protein and a kanamycin resistance gene for selection and screening. The digested pET24a vector was ligated to the concatemer to provide the recombinant vector which was used to transform the E. coli host. The clones were screened by colony PCR and confirmed by restriction digestion with Nde I and Hind III and sequence analysis of the clone.

EXAMPLE 3

Expression of Concatemeric Protein

[0057] E. coli BL21 A1 cell line was used as the expression host. Other cell lines that may be used include BL21 DE3 or any other cell line that contains the T7 RNA polymerase. BL21 A1 cells transformed with the recombinant pET24a-GLP concatemer were induced (OD.sub.6001) with 13 mM arabinose and 1 mM IPTG. The cells were harvested about 4 hours after induction. Determination of expression levels by SDS PAGE analysis of the whole cell lysate showed the presence of a 35 kDa band for the multimeric precursor peptide (FIG. 2, lanes 3, 4).

EXAMPLE 4

Solubilization of Inclusion Bodies

[0058] The cell lysate was further homogenized by sonication and centrifuged to separate inclusion bodies and soluble fractions. A bout 0.125 g inclusion bodies were weighed and dissolved in 3.0 mL of 2% SDS and 1.2 mL of 500 mM HEPES buffer (pH 7.5) diluted with milliQ water to make the volume to 6 mL. Complete solubilization (15-30 min) of the inclusion bodies was carried out by vortexing followed by centrifugation to obtain the K34R GLP-1 (7-37) multimer molecules in the supernatant. The solubilized inclusion bodies were further diluted 10 times in a final buffer composition of 50 mM HEPES, pH 7.5, 10 mM CaCl.sub.2 and 2% Triton-X -100.

EXAMPLE 5

Protease Digestion with kex2 Protease and Carboxypeptidase B

[0059] Protease digestion studies were carried out independently using 2.5, 5 and 20 .Math. g of kex2 protease (kex2 P) per mg of solubilized inclusion bodies for 20-28 h at room temperature. A band at 3 kDa observed by SDS PAGE (FIG. 3, lane 10 and 13) pertained to the monomer. Optimizing the quantities of kex2 protease by lowering from 20 .Math. g to 5 .Math. g and further to 2.5 .Math. g per mg of solubilized inclusion bodies showed complete digestion at 5 and 20 .Math. g and partial digestion at 2.5 .Math. g of kex2 protease used, with extended incubation with Kex2 protease for about 24-28 h. (FIG. 3).

[0060] A similar experiment was carried out with digestion of solubilized inclusion bodies with 5, 10 and 20 .Math. g of kex2 protease per mg of solubilized inclusion bodies for 16 h at room temperature. This was followed by further addition of 5 .Math. L (0.67 U/mL carboxypeptidase B (CPB) per mg of solubilized inclusion bodies at 37 C for 2 hours. The resulting digestion mixture was analyzed by SDS PAGE (FIG. 4). The same was ascertained by comparison of its R P-H PLC peaks with that of a commercial GLP peptide from Sigma (data not shown).

DETAILED DESCRIPTION OF FIGURES

[0061] FIG. 1: Schematic representation of concatemer strategy with GLP precursor peptide (mGLP peptide) as an example. The KR is a dipeptide which acts as recognition and cleavage site for kex2 protease enzyme. The kex2 enzyme will cleave the concatemer at the C terminus of the dipeptide resulting into peptide monomers along with the dipeptide, except last monomer. The dipeptides are removed through CPB digestion which specifically removes Lysine and Arginine residues at the C terminus.

[0062] FIG. 2: SDS PAGE analysis of whole cell lysate of E. coli concatemer clones. High level expression of multimeric mGLP is observed at 35 kDa level.

[0063] Lane 1: Molecular weight marker

[0064] Lane 2: Uninduced whole cell lysate of mGLP concatemer

[0065] Lane 3: Induced whole cell lysate of mGLP concatemer clone #1

[0066] Lane 4: Induced whole cell lysate of mGLP concatemer clone #2

[0067] FIG. 3: Optimization of kex2 protease digestion of mGLP inclusion bodies. As seen in FIG. 5 .Math. g and 20 .Math. g of Kex2 protease completely digested inclusion bodies to 3 kDA mGLP peptide, while 2.5 .Math. g of Kex2 protease partially digested the inclusion bodies, where a ladder of differentially digested peptide is visible.

[0068] Lane 1: Molecular weight marker

[0069] Lane 2 .sup. mGLP (concatemer) undigested_20 h

[0070] Lane 3 .sup. mGLP (concatemer) undigested_24 h

[0071] Lane 4 .sup. mGLP (concatemer) undigested_28 h

[0072] Lane 5 .sup. mGLP (concatemer) +2.5 g of Kex2 protease/mg of mGLP concatemer.sup. 20 h

[0073] Lane 6 .sup. mGLP (concatemer) +2.5 g of Kex2 protease /mg of concatemer.sup. 24 h

[0074] Lane 7 .sup. mGLP (concatemer) +2.5 g of Kex2 protease/mg of mGLP concatemer.sup. 28 h

[0075] Lane 8 .sup. mGLP (concatemer) +5 g of Kex2 protease/mg of mGLP concatemer.sup. 20 h

[0076] Lane 9 .sup. mGLP (concatemer) +5 g of Kex2 protease/mg of mGLP concatemer.sup. 24 h

[0077] Lane 10 .sup. mGLP (concatemer) +5 g of Kex2 protease/mg of mGLP concatemer.sup. 28 h

[0078] Lane 11 .sup. mGLP (concatemer) +20 g of Kex2 protease/mg of mGLP concatemer.sup. 20 h

[0079] Lane 12 .sup. mGLP (concatemer) +20 g of Kex2 protease/mg of mGLP concatemer.sup. 24 h

[0080] Lane 13 .sup. mGLP (concatemer) +20 g of Kex2 protease/mg of mGLP concatemer.sup. 28 h

[0081] FIG. 4: Kex2 protease digestion of mGLP inclusion bodies followed by CPB treatment.

[0082] Lane 1: Molecular weight marker

[0083] Lane 2 .sup. mGLP (concatemer) undigested - 16 h

[0084] Lane 3 .sup. No loading

[0085] Lane 4 .sup. mGLP (concatemer) +20 g of Kex2 protease/mg of mGLP concatemer.sup. 16 h

[0086] Lane 5 .sup. mGLP (concatemer) +10 g of Kex2 protease/mg of mGLP concatemer.sup. 16 h

[0087] Lane 6 .sup. mGLP (concatemer) +5 g of Kex2 protease/mg of mGLP concatemer.sup. 16 h

[0088] Sequences

TABLE-US-00001 SequenceID1 HAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG SequenceID2 ATGAAACGTCACGCGGAAGGCACCTTTACGTCCGATGTGAGCTCTTATCT GGAAGGCCAGGCGGCCAAAGAATTTATTGCCTGGCTGGTCCGTGGCCGCG GTAAACGTCATGCCGAAGGCACCTTTACGAGCGACGTGAGTTCCTACCTG GAAGGTCAAGCAGCTAAAGAATTTATCGCATGGCTGGTTCGTGGCCGCGG CAAACGCCACGCTGAAGGCACCTTTACGTCTGATGTCTCATCGTATCTGG AAGGCCAAGCCGCGAAAGAATTTATCGCCTGGCTGGTGCGTGGCCGCGGC AAACGTCACGCAGAAGGCACCTTCACGAGTGACGTTAGCTCTTACCTGGA AGGCCAGGCCGCCAAAGAATTTATTGCTTGGTTAGTTCGTGGCCGCGGTA AACGCCATGCCGAAGGCACCTTCACGTCCGATGTGAGTTCCTATCTGGAA GGCCAAGCTGCCAAAGAATTTATCGCTTGGTTAGTGCGTGGCCGCGGAAA GCGCCACGCGGAAGGCACCTTCACGTCAGACGTCTCATCGTACCTGGAAG GCCAGGCGGCGAAAGAATTTATCGCGTGGTTAGTACGTGGCCGCGGAAAA CGCCACGCCGAGGGCACCTTTACGTCGGATGTTAGCTCTTATCTGGAAGG CCAAGCAGCGAAAGAATTTATTGCATGGTTGGTTCGTGGCCGCGGAAAGC GTCATGCAGAGGGCACCTTTACGAGCGATGTGAGTTCCTACCTGGAAGGG CAGGCCGCTAAGGAATTTATCGCGTGGCTTGTTCGTGGCCGCGGAAAACG TCATGCGGAGGGCACCTTTACGTCTGACGTCTCATCGTATCTGGAAGGCC AGGCCGCGAAGGAATTTATCGCCTGGTTAGTCCGTGGCCGCGGCAAGCGC CATGCGGAGGGCACCTTCACGAGCGACGTTAGCTCTTACCTGGAAGGTCA AGCGGCGAAAGAATTTATTGCGTGGCTGGTCCGTGGTCGTGGCTAATGA SequenceID3 MKRHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGKRHAEGTFTSDVSSYL EGQAAKEFIAWLVRGRGKRHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRG KRHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGKRHAEGTFTSDVSSYLE GQAAKEFIAWLVRGRGKRHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGK RHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGKRHAEGTFTSDVSSYLEG QAAKEFIAWLVRGRGKRHAEGTFTSDVSSYLEGQAAKEFIAWLVRGRGKR HAEGTFTSDVSSYLEGQAAKEFTAWLVRGRG

EXPRESSION AND LARGE-SCALE PRODUCTION OF PEPTIDES

Assignee

Inventors

Cpc classification

Classification Explorer

C07K14/605

CHEMISTRY; METALLURGY

Classification Explorer

C12Y304/21061

CHEMISTRY; METALLURGY

Classification Explorer

C12Y304/17002

CHEMISTRY; METALLURGY

Classification Explorer

C12P21/06

CHEMISTRY; METALLURGY

International classification

Classification Explorer

C07K14/605

CHEMISTRY; METALLURGY

Classification Explorer

C12P21/06

CHEMISTRY; METALLURGY

Abstract

Claims

Description