Protease specific for a novel sumo-derived protease cleavage site

12209260 · 2025-01-28

Assignee

Inventors

Cpc classification

International classification

Abstract

The present invention relates to a fusion protein, comprising the structure N-PCS.sup.Y-degSig.sub.N-M-PCS.sup.X-degSig.sub.C-C; wherein N represents the N-terminus; PCS.sup.Y and PCS.sup.X each represent a protease cleavage site (PCS), which differ from each other in at least one amino acid residue: degSig.sub.N represents a degradation signal which promotes degradation of the fusion protein in a host cell if PCS.sup.Y is cleaved by a protease such that the first amino acid of degSig.sub.N becomes the new N-terminus of the remaining fusion; M represents a cytoplasmic selection marker; and degSig.sub.C represents a second degradation signal which promotes degradation of the fusion protein in a host cell if PCS.sup.X is not cleaved by a protease; and C represents the C-terminus. Further provided is a nucleic acid construct, comprising a nucleic acid sequence coding for said fusion protein, a nucleic acid expression construct library, comprising a plurality of such nucleic acid expression constructs in diversified form, and methods using the fusion protein and nucleic acid constructs coding therefor. Finally, the present invention provides variants of bdSUMO and bdSENP1 which have been identified by the methods of the present disclosure, and which exhibit improved properties over existing orthogonal protease/protease cleavage site-pairs which are currently used with wild-type bdSUMO and wildtype bdSENP1.

Claims

1. A variant protease, wherein said variant protease has at least 96% sequence identity over the full-length of the polypeptide of SEQ ID NO:6, wherein said variant protease comprises an amino acid substitution at the position corresponding to position 33 of the protein of SEQ ID NO: 6, wherein the amino acid at said position is substituted by an amino acid selected from the group consisting of S, H, Q, A, G, and C, and wherein said variant protease cleaves a first protease cleavage site (PCS) in a first fusion protein that comprises a maltose-binding protein (MBP) and said first PCS, wherein said first PCS is fused to the N-terminus of the MBP (first PCS-MBP fusion protein), wherein said first PCS has the amino acid sequence of SEQ ID NO: 41 and the MBP has the amino acid sequence of SEQ ID NO: 71, more efficiently after the C-terminal Gly-Gly motif of the first PCS of SEQ ID NO: 41 than said variant protease cleaves (i) a second PCS in a second fusion protein that comprises a MBP and said second PCS, wherein said second PCS is fused to the N-terminus of the MBP (second PCS-MBP fusion protein), wherein said second PCS has the amino acid sequence of SEQ ID NO: 4 and the MBP has the amino acid sequence of SEQ ID NO: 71, wherein the cleavage occurs after the C-terminal Gly-Gly motif of the second PCS of SEQ ID NO: 4, or (ii) a third PCS in a third fusion protein that comprises a MBP and said third PCS, wherein said third PCS is fused to the N-terminus of the MBP (third PCS-MBP fusion protein), wherein said third PCS has the amino acid sequence of SEQ ID NO: 3 and the MBP has the amino acid sequence of SEQ ID NO: 71, wherein the cleavage occurs after the C-terminal Gly-Gly motif of the third PCS of SEQ ID NO: 3, when tested in a buffer under standard conditions of 1 hour incubation at 21 C., wherein said buffer comprises an initial concentration of the first, second and third PCS-MBP fusion proteins of 100 M, and wherein the buffer comprises a pH of 7.5, Tris/HCl at a concentration of 45 mM, NaCl at a concentration of 250 mM, MgCl.sub.2 at a concentration of 2 mM, sucrose at a concentration of 250 mM, and DTT at a concentration of 10 mM.

2. The variant protease of claim 1, wherein said variant protease cleaves the first PCS-MBP fusion protein at said standard conditions and at an at least 500-fold molar excess of said PCS-MBP fusion protein compared to the amount of said protease.

3. The variant protease of claim 1, wherein the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is selected from the group consisting of S, H, Q, and A.

4. The variant protease of claim 1, wherein said variant protease further comprises an amino acid substitution at the position corresponding to position 99 of the protein of SEQ ID NO: 6, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of E, S, V, Y, and L.

5. The variant protease of claim 4, wherein the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is substituted by an amino acid selected from the group consisting of E, S, and V.

6. The variant protease of claim 4, wherein said variant protease further comprises an amino acid substitution at the position corresponding to position 22 of the protein of SEQ ID NO: 6, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of E, S, P, K, and V.

7. The variant protease of claim 4, wherein said variant protease further comprises an amino acid substitution at the position corresponding to position 103 of the protein of SEQ ID NO: 6, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of M, E, V, G, T, and R.

8. The variant protease of claim 7 wherein the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is substituted by an amino acid selected from the group consisting of M, E, V, G and T.

9. The variant protease of claim 1, wherein said variant protease further comprises two or three amino acid substitutions at positions corresponding to positions of the protein of SEQ ID NO: 6 selected from the group consisting of positions 99, 22, and 103, wherein the amino acid substitution at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is a substitution by an amino acid selected from the group consisting of E, S, V, Y, and L, the amino acid substitution at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is a substitution by an amino acid selected from the group consisting of E, S, P, K, and V, and the amino acid substitution at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is a substitution by an amino acid selected from the group consisting of M, E, V, G, T, and R.

10. The variant protease of claim 1, wherein said variant protease comprises a combination of amino acid substitutions selected from the group consisting of (i) amino acid substitutions at the positions corresponding to positions 33 and 99 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is S, and the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is E; (ii) amino acid substitutions at the positions corresponding to positions 22, 33 and 103 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is H, the amino acid at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is S, and the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is V; (iii) amino acid substitutions at the positions corresponding to positions 22, 33, 99 and 103 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is P, the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is A, the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is E, and the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is M; (iv) amino acid substitutions at the positions corresponding to positions 22, 33, 99 and 103 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is K, the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is H, the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is E, and the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is E; (v) amino acid substitutions at the positions corresponding to positions 22, 33, 99 and 103 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is E, the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is S, the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is S, and the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is T; and (vi) amino acid substitutions at the positions corresponding to positions 22, 33, 99 and 103 of the protein of SEQ ID NO: 6, wherein the amino acid at the position corresponding to position 22 of the protein of SEQ ID NO: 6 is V, the amino acid at the position corresponding to position 33 of the protein of SEQ ID NO: 6 is Q, the amino acid at the position corresponding to position 99 of the protein of SEQ ID NO: 6 is V, and the amino acid at the position corresponding to position 103 of the protein of SEQ ID NO: 6 is G.

11. The variant protease of claim 1, wherein said variant protease comprises all of SEQ ID NO: 6, except for: (a) the amino acid substitution at the position corresponding to position 33 of the protein of SEQ ID NO: 6, or (b) the amino acid substitution at the position corresponding to position 33 of the protein of SEQ ID NO: 6 and one or more amino acid substitutions at positions corresponding to positions of the protein of SEQ ID NO: 6 selected from 22, 99, and 103.

12. The variant protease of claim 1 having the amino acid sequence of any one of SEQ ID NO: 56, 57, 58, 60, 61, 62, 63, 64, 65 or 66.

13. A process of purifying a protein of interest, comprising the steps of (i) providing a protein of interest to be purified, wherein said protein comprises an affinity tag fused to said protein via a protease cleavage site (PCS); (ii) binding the protein of step (i) to an affinity matrix via said affinity tag; and (iii) eluting the protein from the affinity matrix using the variant protein of claim 1, thereby purifying the protein.

14. The process of claim 13, wherein said PCS has at least 96% sequence identity over the full-length of the polypeptide of SEQ ID NO: 3, wherein said PCS comprises an amino acid substitution at the position corresponding to position 67 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of K, R, N, A and H, and wherein said PCS comprises a C-terminal Gly-Gly motif.

15. The process of claim 14, wherein said PCS further comprises: (a) an amino acid substitution at the position corresponding to position 75 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of R, W, A, H, M, I, P, and F; and/or (b) an amino acid substitution at the position corresponding to position 60 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is substituted by another amino acid selected from the group consisting of S, N, K, P, HR, and Q.

16. The process of claim 14, wherein said PCS comprises a combination of amino acid substitutions selected from the group consisting of (i) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is K, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is R; (ii) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is P, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is W; (iii) amino acid substitutions at the positions corresponding to positions 67 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is R; (iv) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is S, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is H; (v) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is S, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is W; (vi) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is S, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is A; (vii) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is N, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is W; and (viii) amino acid substitutions at the positions corresponding to positions 67, 60 and 75 of the protein of SEQ ID NO: 3, wherein the amino acid at the position corresponding to position 67 of the protein of SEQ ID NO: 3 is K, the amino acid at the position corresponding to position 60 of the protein of SEQ ID NO: 3 is N, and the amino acid at the position corresponding to position 75 of the protein of SEQ ID NO: 3 is A.

17. The process of claim 13, wherein said PCS has the amino acid sequence of SEQ ID NO: 3 except for an amino acid substitution at the position corresponding to position 67 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is K.

18. The process of claim 13, wherein said PCS has the amino acid sequence of SEQ ID NO: 3 except for (a) an amino acid substitution at the position corresponding to position 67 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is K, and an amino acid substitution at the position corresponding to position 60 of the protein of SEQ ID NO: 3; (b) an amino acid substitution at the position corresponding to position 67 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is K, and an amino acid substitution at the position corresponding to position 75 of the protein of SEQ ID NO: 3; or (c) an amino acid substitution at the position corresponding to position 67 of the protein of SEQ ID NO: 3, wherein the amino acid at said position is K, and additional amino acid substitutions at positions corresponding to positions 60 and 75 of the protein of SEQ ID NO: 3.

19. The process of claim 13, wherein said PCS has an amino acid sequence selected from the group consisting of SEQ ID NO: 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54 and 55.

Description

DESCRIPTION OF THE FIGURES

(1) FIG. 1 Diagram of the in vivo system for the selection of proteases and protease-cleavage sites with orthogonal specificities. The system is based on the survival of E. coli cells in selective medium containing hygromycin B. Cells co-express a SUMO-specific protease and the fusion protein SUMO.sup.Y-Degron.sup.NER-HygB-SUMO.sup.X-ssrA as a selection marker. Cells survive only if a given SUMO-specific protease shows orthogonal specificity to two different protease cleavage sites (SUMO.sup.Y and SUMO.sup.X).

(2) FIG. 2 Validation of the in vivo selection system. (A) The SUMO protein (bdSUMO) and the SUMO-specific protease 1 (bdSENP1) from B. distachyon were used as model proteins to the test the functionality of the selection system. A non-cleavable SUMO mutant (SUMO*) was used to account against cleavage. (B) Cells expressing a HygB construct lacking both protein degradation signals were used as positive control for cellular growth in selective media. (C) Only cells expressing SUMO*-Degron.sup.NER-HygB-bdSUMO-ssrA survived due to the permanent inactivation of both degradation signals. (D, E, C and F). Bacteria co-expressing bdSENP1 and a different reporter protein do not survive after the degradation of HygB due to activation of the Degron.sup.NER and/or the lack of inactivation of the ssrA signal.

(3) FIG. 3 (A) Illustration of the fusion proteins used to select for bdSUMO mutants (bdSUMO.sup.MutX), which are not cleaved by the SUMOstar protease. (B) Sequence alignment of amino acids 56-79 of wild-type bdSUMO (bdSUMO.sup.wt) and ten different bdSUMO mutants (SEQ ID NOs: 9-19). Identical residues are highlighted in black boxes and the numbering of the sequence is according to the full-length wild type (wt) bdSUMO protein. The bdSUMO mutant that belongs to the SUMOvera system is underlined (bdSUMO.sup.Mut1).

(4) FIG. 4 (A) Reporter construct used to select for bdSENP1 mutants (bdSENP1.sup.MutX) that cleave bdSUMO.sup.Mut1 but do not recognize SUMOstar as substrate. (B) Sequence alignment of amino acids 254-354 of wild type bdSENP1 and six different functional bdSENP1 mutants (SEQ ID NO: 25-31). The bdSENP1 mutant that belongs to the SUMOvera system is shown underlined (bdSENP1.sup.MutB). The numbering of the sequencing is set according to the full-length bdSENP1 protein (set forth in SEQ ID NO: 73) and identical residues are highlighted in black.

(5) FIG. 5 Effect of each mutation in bdSUMO.sup.Mut1 for the cleavage resistance against SUMOstar protease. Different variants of the bdSUMO-MBP fusion protein containing one, two or three mutations as in bdSUMO.sup.Mut1 were incubated with a large amount of SUMOstar protease (10 M) for 2 h at 25 C. Samples were resolved by SDS-PAGE to separate the fusion protein from the C-terminal cleaved MBP. The same bdSUMO-MBP fusion proteins were used to evaluate the contribution of each mutation in bdSUMO.sup.Mut1 for the recognition by bdSENP1.sup.MutB. Samples containing 100 M of each MBP fusion protein were incubated with 200 nM of bdSENP1.sup.MutB protease for 1 h at 4 C.

(6) FIG. 6 Proteolytic efficiency of the bdSENP1.sup.MutB protease and other site-specific proteases. The proteolytic efficiency of different site-specific proteases was tested in solution for 1 h at 0 C. Different amount of a given protease were incubated with 100 M of the corresponding tagged MBP. Samples were resolved by SDS-PAGE to separate the full-length fusion protein from the C-terminal cleaved MBP. Samples highlighted with a black box indicate the lowest protease concentration at which 95% of the cognate substrate is cleaved. The experiments are ordered in the figure from the most to the least efficient protease used in this experiment.

(7) FIG. 7 bdSENP1.sup.MutB and SUMOstar proteases have fully orthogonal specificities. Samples containing 100 M of a given H14-SUMO-MBP fusion protein were incubated with increasing concentration of either bdSENP1.sup.MutB or SUMOstar proteases for 1 h at 0 C. The full-length SUMO-fusion protein was separated from the C-terminal cleaved MBP by SDS-PAGE. The asterisks represent the protease concentration at which 95% of the cognate SUMO-fusion protein is cleaved.

(8) FIG. 8 bdSUMO.sup.Mut1-fusion proteins are stable in S. cerevisiae. (A) Scheme of the SUMO fusion proteins over-expressed in S. cerevisiae (H14: poly-histidine tag, ZZ: double copy of the Z-domain from staphylococcal protein A). (B) After 6h of protein over-expression at 30 C., the stability of different SUMO fusion proteins was analyzed by western blot using an anti-citrine antibody. Cyan fluorescent protein fused to histone 2B (H2B-CFP) was used as an internal control to confirm even loading of the samples. Cross-reactivity between the anti-citrine antibody and the ZZ-tag is indicated by *. An empty yeast lysate was used as a negative control.

(9) FIG. 9 bdSUMO.sup.Mut1 is a stable fusion tag in eukaryotic lysates. (A) Figure of fusion proteins used to test the stability of different SUMO proteins in several eukaryotic lysates. (H14: poly-histidine tag, MBP: E. coli Maltose Binding Protein). (B) The stability of different SUMO fusion proteins was analyzed by western blot using an anti-MBP antibody after incubation in highly concentrated eukaryotic extracts for 2 h at 30 C. The presence of a C-terminal cleaved MBP indicates the cleavage of the SUMO tag by the endogenous SUMO specific proteases. Samples with a protease mix (Ulp1, SUMOstar protease, bdSENP1 and bdSENP1.sup.MutB protease, 1 M each) were included to rule out the presence of protease inhibitory substances present in the lysates.

(10) FIG. 10 Purification of a stoichiometric protein complex in yeast. (A) As a model for a hetero-dimeric complex, an anti-citrine nanobody (Nb) tagged with bdSUMO.sup.Mut1 and SUMOstar-tagged citrine were used. (B) Scheme of the purification processes composed of two consecutive chromatographic steps. The dimeric complex was co-expressed in S. cerevisiae as a soluble form for 6 h at 30 C. (soluble material). First, the complex was isolated through a poly-histidine tag (H14) fused to bdSUMO.sup.Mut1 using a Ni.sup.2+ chelate matrix (first chromatographic step). The complex was then eluated by in-column cleavage using bdSENP1.sup.MutB (On-column cleavage eluate 1). For the second purification step, the eluate 1 was loaded onto an anti-Z domain matrix to bind to the ZZ-tag fused to SUMOstar. The stoichiometric and highly pure complex was then finally eluted by on-column cleavage using the SUMOstar protease (On-column cleavage eluate 2). (C) Protein samples corresponding to 35mOD units of cells or 1/1000 of the total purified protein were analyzed by SDS-PAGE and further stained by coomassie blue. Labels on the middle of both images side define the protein identity of each band in both gels.

(11) FIG. 11 bdSENP1.sup.MutB protease can be ectopically over-expressed in S. cerevisiae. The viability of the yeast cells over-expressing a SUMO-specific protease was tested. Cells were subjected to different 10-fold dilutions (from 1:25 to 1:3125 v/v) and further spotted on plates containing glucose or galactose to repress and induce protein expression, respectively. Cells transformed with an empty vector or encoding a catalytically dead protease (C440S) were used as negative controls.

(12) FIG. 12 (A) Analysis of the amino acid frequency in the three mutagenized resides (T60X, D67X and Q75X) for the all the bdSUMO mutants selected by phage display. (B) Sequence alignment of amino acids 56-79 of wild-type bdSUMO (bdSUMO.sup.wt) and five different bdSUMO mutants that contain the most frequent mutations after selection by phage display (SEQ ID NOs: 9 and 20-24). Identical residues are highlighted in black boxes and the numbering of the sequence is according to the full-length wild type (wt) bdSUMO protein.

(13) FIG. 13 (A) Selection construct used in E. coli for the selection of bdSENP1 mutants (bdSENP1.sup.MutX) that cleavage preferentially bdSUMO.sup.Mut 1 and not wild type scSUMO. (B) Sequence alignment of amino acids 265-354 of wild-type bdSENP1 (bdSENP.sup.wt), bdSENP1.sup.Mut B and the six most abundant bdSENP1 mutants (SEQ ID NOs: 25, 27 and 32-40) after selection using the construct described in A. The numbering of the residues is according to the full-length protein (set forth in SEQ ID NO: 73) and the - denotes an amino acid deletion in the corresponding bdSENP1 mutant. Residues highlighted in black are strictly conserved within all mutants and the wild type (wt) bdSENP1 protease.

(14) FIG. 14 Analogues mutations of the bdSUMO.sup.Mut1 system can be used in wild type SUMO/SUMO proteases systems. scSUMO-MBP and hsSUMO2-MBP fusion proteins, containing mutations at the equivalent positions to the ones in the bdSUMO.sup.Mut1 and bdSUMO.sup.Mut11, were incubated for 1 h at 25 C. together with different SUMO-specific proteases. Numbering of the residues in scSUMO and hsSUMO2 is according to the full-length protein sequence. Samples were resolved by SDS-PAGE to separate the full-length fusion protein from the C-terminal cleaved MBP.

(15) FIG. 15 Substrate specificity of different wild type and mutant SUMO-specific proteases. (A) Samples with 100 M of MBP fused to a SUMO variant (wild type SUMO or different bdSUMO mutants) were incubated for 1 h at 25 C. with different SUMO-specific proteases. The protease concentration used in each assay is sufficient to completely cleave the cognate SUMO protein at the conditions mentioned above. Samples corresponding to around 2 g of the SUMO fusion were analyzed by SDS-PAGE to separate the full-length fusion protein from the cleaved MBP. (B, C) List of the mutations in the bdSUMO and SENP1 variants tested in A, respectively

SEQUENCES

(16) TABLE-US-00002 (Degron.sup.NER) SEQIDNO:1 FLFVQ (ssrA) SEQIDNO:2 AADENYALAA (WTbdSUMOaminoacids1-97) SEQIDNO:3 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMTAIAFLFDGRRLRAEQTPDELEMEDGDEIDAMLHQTGG (WTscSUMOaminoacids1-98) SEQIDNO:4 MSDSEVNQEAKPEVKPEVKPETHINLKVSDGSSEIFFKIKKTTPLRRLM EAFAKRQGKEMDSLRFLYDGIRIQADQTPEDLDMEDNDIIEAHREQIGG (WThsSUMO2;HomosapiensSUMO2,aminoacids1-93) SEQIDNO:5 MADEKPKEGVKTENNDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCE RQGLSMRQIRFRFDGQPINETDTPAQLEMEDEDTIDVFQQQTGG (WTbdSENP1aminoacids248-481) SEQIDNO:6 PFVPLTDEDEDNVRHALGGRKRSETLSVHEASNIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR RWTTKRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (scUlp1aminoacids403-621) SEQIDNO:7 LVPELNEKDDDQVQKALASRENTQLMNRDNIEITVRDFKTLAPRRWLND TIIEFFMKYIEKSTPNTVAFNSFFYTNLSERGYQGVRRWMKRKKTQIDK LDKIFTPINLNQSHWALGIIDLKKKTIGYVDSLSNGPNAMSFAILTDLQ KYVMEESKHTIGEDFDLIHLDCPQQPNGYDCGIYVCMNTLYGSADAPLD FDYKDAIRMRRFIAHLILTDALK (hsSENP2;HomosapiensSENP2,aminoacids419-644) SEQIDNO:8 EFPEITEEMEKEIKNVFRNGNQDEVLSEAFRLTITRKDIQTLNHLNWLN DEIINFYMNMLMERSKEKGLPSVHAFNTFFFTKLKTAGYQAVKRWTKKV DVFSVDILLVPIHLGVHWCLAVVDFRKKNITYYDSMGGINNEACRILLQ YLKQESIDKKRKEFDTNGWOLFSKKSQEIPQQMNGSDCGMFACKYADCI TKDRPINFTQQHMPYFRKRMVWEILHRKLL (WtbdSUMO;aminoacids56-79) SEQIDNO:9 SVDMTAIAFLFDGRRLRAEQTPDE (Mut1bdSUMO;aminoacids56-79) SEQIDNO:10 SVDMKAIAFLFKGRRLRAERTPDE (Mut2bdSUMO;aminoacids56-79) SEQIDNO:11 SVDMTAIAFLFKGRRLRAECTPDE (Mut3bdSUMO;aminoacids56-79) SEQIDNO:12 SVDMHAIAFLFKGRRLRAEKTPDE (Mut4bdSUMO;aminoacids56-79) SEQIDNO:13 SVDMRAIAFLFRGRRLRAEVTPDE (Mut5bdSUMO;aminoacids56-79) SEQIDNO:14 SVDMTAIAFLFKGRRLRAEFTPDE (Mut6bdSUMO;aminoacids56-79) SEQIDNO:15 SVDMHAIAFLFKGRRLRAEQTPDE (Mut7bdSUMO;aminoacids56-79) SEQIDNO:16 SVDMDAIAFLFRGRRLRAECTPDE (Mut8bdSUMO;aminoacids56-79) SEQIDNO:17 SVDMPAIAFLFKGRRLRAEWTPDE (Mut9bdSUMO;aminoacids56-79) SEQIDNO:18 SVDMAAIAFLFKGRRLRAEYTPDE (Mut10bdSUMO;aminoacids56-79) SEQIDNO:19 SVDMTAIAFLFKGRRLRAERTPDE (Mut11bdSUMO;aminoacids56-79) SEQIDNO:20 SVDMSAIAFLFKGRRLRAEWTPDE (Mut12bdSUMO;aminoacids56-79) SEQIDNO:21 SVDMSAIAFLFKGRRLRAEHTPDE (Mut13bdSUMO;aminoacids56-79) SEQIDNO:22 SVDMSAIAFLFKGRRLRAEATPDE (Mut14bdSUMO;aminoacids56-79) SEQIDNO:23 SVDMNAIAFLFKGRRLRAEWTPDE (Mut15bdSUMO;aminoacids56-79) SEQIDNO:24 SVDMNAIAFLFKGRRLRAEATPDE (WtbdSENP1aminoacids265-354) SEQIDNO:25 GGRKRSETLSVHEASNIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLK-CHFFNTFFYKKLINGGYDYKSVRRWTTKRKLG (MutAbdSENP1aminoacids265-354) SEQIDNO:26 GGRKPSETLSVHEASGIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLK-CHFFNTFFYKKLINGGYDYKSVREWTTPRKLG (MutBbdSENP1aminoacids265-354) SEQIDNO:27 GGRKRSETLSVHEASSIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLK-CHFFNTFFYKKLINGGYDYKSVREWTTKRKLG (MutCbdSENP1aminoacids265-354) SEQIDNO:28 GGRKSSETLSVHEASAIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLK-CHFFNTFFYKKLINGGYDYKSVRGWTTVRKLG (MutDbdSENP1aminoacids265-354) SEQIDNO:29 GGRKPSETLSVHEASEIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLK-CHFFNTFFYKKLINGGYDYKSVREWTTQRKLG (MutEbdSENP1aminoacids265-354) SEQIDNO:30 GGRKRSETLSVHEASGIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRYWTTARKLG (MutFbdSENP1aminoacids265-354) SEQIDNO:31 GGRKPSETLSVHEASCIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRLWTTRRKLG (MutGbdSENP1aminoacids265-354) SEQIDNO:32 GGRKSSETLSVHEASHIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRRWTTV-KLG (MutHbdSENP1aminoacids265-354) SEQIDNO:33 GGRKPSETLSVHEASAIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLIN-GYDYKSVREWTTMRKLG (MutibdSENP1aminoacids265-354) SEQIDNO:34 GGRKKSETLSVHEASHIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVREWTTRRKLG (MutJbdSENP1aminoacids265-354) SEQIDNO:35 GGRKESETLSVHEASSIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRSWTTTRKLG (MutKbdSENP1aminoacids265-354) SEQIDNO:36 GGRKVSETLSVHEASQIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRVWTTGRKLG (MutLbdSENP1aminoacids265-354) SEQIDNO:37 GGRKLSETLSVHEASVIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRPWTTARKLG (MutMbdSENP1aminoacids265-354) SEQIDNO:38 GGRKASETLSVHEASWIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRRWTTERKLG (MutNbdSENP1aminoacids265-354) SEQIDNO:39 GGRKSSETLSVHEASPIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRRWTTRRKLG (MutObdSENP1aminoacids265-354) SEQIDNO:40 GGRKRSETLSVHEASRIVITREILQCLNDKEWLNDEVINLYLELLKERE LREPNKFLKCHFFNTFFYKKLINGGYDYKSVRGWTTLRKLG (Mut1bdSUMOresidues1-97) SEQIDNO:41 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMKAIAFLFKGRRLRAERTPDELEMEDGDEIDAMLHQTGG (Mut2bdSUMOresidues1-97) SEQIDNO:42 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMTAIAFLFKGRRLRAECTPDELEMEDGDEIDAMLHQTGG (Mut3bdSUMOresidues1-97) SEQIDNO:43 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMHAIAFLFKGRRLRAEKTPDELEMEDGDEIDAMLHQTGG (Mut4bdSUMOresidues1-97) SEQIDNO:44 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMRAIAFLFRGRRLRAEVTPDELEMEDGDEIDAMLHQTGG (Mut5bdSUMOresidues1-97) SEQIDNO:45 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMTAIAFLFKGRRLRAEFTPDELEMEDGDEIDAMLHQTGG (Mut6bdSUMOresidues1-97) SEQIDNO:46 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMHAIAFLFKGRRLRAEQTPDELEMEDGDEIDAMLHQTGG (Mut7bdSUMOresidues1-97) SEQIDNO:47 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMDAIAFLFRGRRLRAECTPDELEMEDGDEIDAMLHOTGG (Mut8bdSUMOresidues1-97) SEQIDNO:48 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMPAIAFLFKGRRLRAEQTPDELEMEDGDEIDAMLHQTGG (Mut9bdSUMOresidues1-97) SEQIDNO:49 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMAAIAFLFKGRRLRAEYTPDELEMEDGDEIDAMLHQTGG (Mut10bdSUMOresidues1-97) SEQIDNO:50 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLM NAYCDRQSVDMTAIAFLFKGRRLRAERTPDELEMEDGDEIDAMLHQTGG (Mut11bdSUMOresidues1-97) SEQIDNO:51 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMSAIAFLFKGRRLRAEWTPDELEMEDGDEIDAMLHQTGG (Mut12bdSUMOresidues1-97) SEQIDNO:52 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLM NAYCDRQSVDMSAIAFLFKGRRLRAEHTPDELEMEDGDEIDAMLHQTGG (Mut13bdSUMOresidues1-97) SEQIDNO:53 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLM NAYCDRQSVDMSAIAFLFKGRRLRAEATPDELEMEDGDEIDAMLHQTGG (Mut14bdSUMOresidues1-97) SEQIDNO:54 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMNAIAFLFKGRRLRAEWTPDELEMEDGDEIDAMLHQTGG (Mut15bdSUMOresidues1-97) SEQIDNO:55 MSAAGGEEDKKPAGGEGGGAHINLKVKGQDGNEVFFRIKRSTQLKKLMN AYCDRQSVDMNAIAFLFKGRRLRAEATPDELEMEDGDEIDAMLHQTGG (MutAbdSENP1residues248-481) SEQIDNO:56 PFVPLTDEDEDNVRHALGGRKPSETLSVHEASGIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR EWTTPRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutBbdSENP1residues248-481) SEQIDNO:57 PFVPLTDEDEDNVRHALGGRKRSETLSVHEASSIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR EWTTKRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutCbdSENP1residues248-481) SEQIDNO:58 PFVPLTDEDEDNVRHALGGRKSSETLSVHEASAIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR GWTTVRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutDbdSENP1residues248-481) SEQIDNO:59 PFVPLTDEDEDNVRHALGGRKPSETLSVHEASEIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR EWTTQRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutEbdSENP1residues248-481) SEQIDNO:60 PFVPLTDEDEDNVRHALGGRKRSETLSVHEASGIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR YWTTARKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutFbdSENP1residues248-481) SEQIDNO:61 PFVPLTDEDEDNVRHALGGRKPSETLSVHEASCIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR LWTTRRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHIMSYFRRRTAKEILDLKAG (MutGbdSENP1residues248-481) SEQIDNO:62 PFVPLTDEDEDNVRHALGGRKSSETLSVHEASHIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR RWTTVKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYMD MKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMFM LKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutHbdSENP1residues248-481) SEQIDNO:63 PFVPLTDEDEDNVRHALGGRKPSETLSVHEASAIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGYDYKSVRE WTTMRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYMD MKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMFM LKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutibdSENP1residues248-481) SEQIDNO:64 PFVPLTDEDEDNVRHALGGRKKSETLSVHEASHIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR EWTTRRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutJbdSENP1residues248-481) SEQIDNO:65 PFVPLTDEDEDNVRHALGGRKESETLSVHEASSIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR SWTTTRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHIMSYFRRRTAKEILDLKAG (MutKbdSENP1residues248-481) SEQIDNO:66 PFVPLTDEDEDNVRHALGGRKVSETLSVHEASQIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR VWTTGRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutLbdSENP1residues248-481) SEQIDNO:67 PFVPLTDEDEDNVRHALGGRKLSETLSVHEASVIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR PWTTARKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutMbdSENP1residues248-481) SEQIDNO:68 PFVPLTDEDEDNVRHALGGRKASETLSVHEASWIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR RWTTERKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MutNbdSENP1residues248-481) SEQIDNO:69 PFVPLTDEDEDNVRHALGGRKSSETLSVHEASPIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR RWTTRRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHIMSYFRRRTAKEILDLKAG (MutObdSENP1residues248-481) SEQIDNO:70 PFVPLTDEDEDNVRHALGGRKRSETLSVHEASRIVITREILQCLNDKEW LNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKSVR GWTTLRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLGYM DMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCGMF MLKYIDFYSRDMELVFGQKHMSYFRRRTAKEILDLKAG (MBP) SEQIDNO:71 AGTGTSKTEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKL EEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPF TWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKA KGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKA GLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSK VNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDE GLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMS AFWYAVRTAVINAASGRQTVDEALKDAQTN Thefirstfourresiduesarepartofashortlinker betweenSUMOsandMBP. (WTbdSENP1aminoacids1-481) SEQIDNO:73 MGALTDSRKRVSADHRLHPSFPPSPPPPSKRTKLAPLLPVSSPPPLHYA SPSSAAPGPSSSAAAAAATASTSSHSSLPHPRRRLPPAPPISRPIHGPQ RVRRSFRGGNSRPNSNPPWYSPSPPPKPLGLDQYADLVYSVTHPPRPTP AVHVPRGTEAIPEVVMVDDNEDIRQDKEDEQDVEEEAKAKVVGRKVPLY KELYEKSSRQRDARLRTLEFEVQLAEKGRLGLERLAEVLPRITPNKEEV PEPFVPLTDEDEDNVRHALGGRKRSETLSVHEASNIVITREILQCLNDK EWLNDEVINLYLELLKERELREPNKFLKCHFFNTFFYKKLINGGYDYKS VRRWTTKRKLGYNLIDCDKIFVPIHKDVHWCLAVINIKEKKFQYLDSLG YMDMKALRILAKYLVDEVKDKSGKQIDVHAWKQEGVQNLPLQENGWDCG MFMLKYIDFYSRDMELVFGQKHIMSYFRRRTAKEILDLKAG

EXAMPLES

Example 1 Cleavage Efficiency bdSENP1.SUP.MutB .and Other Site Specific Proteases

(17) So far, bdSENP1 and Ulp1 are the most active Ubl-specific proteases known (Frey & Grlich 2014a). Around 15-50 nM of these two proteases were required to efficiently cleave the cognate substrate for 1 h at 0 C. (FIG. 6). Other commonly used Ubl-specific proteases (i.e. xlAtg4, xlSub2 and bdNEDP1) are from 15 to 150-fold less efficient than bdSENP1 and Ulp1. Here, we showed that bdSENP1.sup.MutB protease was only 5-fold less efficient as compared bdSENP1 or Ulp1. bdSENP1.sup.MutB protease is therefore a more attractive option to some wild type Ubl-specific proteases.

(18) This example also shows that the bdSENP1.sup.MutB protease is around 1000-fold more efficient than the site-specific protease from the tobacco etch virus (TEV protease). For instance, only 200 nM of bdSENP1.sup.MutB protease were enough to cleave 95% of the cognate substrate (bdSUMOmut 1), while even 10 M of TEV protease were not enough to cleave the same amount of fusion protein at the same conditions of incubation (FIG. 6). Although TEV protease is the most extensively used protease for tag removal, our data proves that bdSENP1.sup.MutB represents a more powerful tool for the same propose.

Materials and Methods

(19) Cleavage reactions were carried out using cleavage buffer (45 mM Tris/HCl PH 7.5, 250 mM NaCl, 2 mM MgCl.sub.2, 250 mM sucrose, 10 mM DTT) in a total volume of 20 l. Prior to the reaction, substrates and proteases were diluted with cleavage buffer to 2-fold of the concentration required for the reaction. Equal volumes of diluted substrate and proteases were mixed in order to start the reaction. For every reaction, 100 M of each MBP-tagged substrate were incubated with various amounts of a given protease (from 10 nM to 10 M) for 1h on ice. The cleavage reactions were stopped by adding 180 l of SDS sample buffer (3% SDS, 125 mM Tris/HCl (pH 6.8), 50 mM DTT, 1M sucrose and Coomassie brilliant blue G250). A sample corresponding to 2.5 g of the bdSUMO.sup.Mut1-tagged substrate was resolved by SDS-PAGE and further stained by Coomassie blue. Control samples without protease were included in the assays. The proteases tested were: bdSENP1 from B. distachyon, Ulp1 from S. cerevisiae, SUMOstar protease, bdSENP1.sup.MutB protease, bdNEDP1 from B. distachyon, xlATG4B and xlSub2 from X. leavis and TEV protease.

Example 2 bdSENP1.SUP.MutB .and SUMOstar are Orthogonal Proteases

(20) To test the orthogonality between the SUMOvera and SUMOstar systems, bdSUMO.sup.Mut1- and SUMOstar-MBP fusions were incubated for 1 h at 0 C. with increasing concentrations of either bdSENP1.sup.MutB or SUMOstar protease (FIG. 7). On one hand, bdSENP1.sup.MutB protease only cleaved its cognate substrate at a concentration of 200 nM, whereas the SUMOstar-MBP fusion remained intact even at the highest protease concentration of 10 M. On the other hand, SUMOstar protease only recognized SUMOstar-MBP and left intact the bdSUMO.sup.Mut1-MBP fusion protein even at the highest SUMOstar protease concentration of 10 M. Note that 10 M of either protease represents up to 1000-fold more protease needed for complete cleavage of the cognate substrate, and even so no protease cross-reactivity was observed. Therefore, the SUMOvera and the SUMOstar systems have indeed truly orthogonal specificities.

Materials and Methods

(21) Cleavage reactions were performed, stopped and analyzed as described in Example 1. The only exceptions are that bdSUMO.sup.Mut1 and SUMOstar fusion proteins were incubated with increasing concentration of each protease.

Example 3 Expression of bdSUMO.SUP.Mut1.-Fusion Proteins in S. cerevisiae

(22) Virtually, any protein tagged with a wild type SUMO protein is immediately cleaved if expressed in any eukaryotic host. SUMOstar is so far the only example of a SUMO protein that is a stable tag in different eukaryotic systems (Liu et al. 2008; Peroutka et al. 2008). We tested whether bdSUMO.sup.Mut1 would also be resistant to cleavage by Ulp1 in vivo and therefore stable in yeast cells.

(23) To this end, we over-expressed citrine tagged with scSUMO, SUMOstar, bdSUMO or bdSUMO.sup.Mut1 in S. cerevisiae to test if the fusion protein would remain as a full-length product. The stability of the fusion proteins was analyzed by western blot using an anti-citrine antibody (FIG. 8). As expected, scSUMO- and bdSUMO-tagged citrines were completely cleaved, whereas SUMOstar- and bdSUMO.sup.Mut1-tagged proteins remained intact even after 6h of over-expression. In fact, bdSUMO.sup.Mut1 was the most stable fusion tag during recombinant protein expression in yeast (even more than SUMOstar) since up to 100% of the overexpressed bdSUMO.sup.Mut1-tagged protein remained as full-length and almost no cleaved citrine was detected. These results suggest that bdSUMO.sup.Mut1 represents even a better choice than SUMOstar as a SUMO fusion tag for expression in S. cerevisiae.

Materials and Methods

(24) For the over-expression of the SUMO-citrine fusion proteins, the respective expression plasmids were transformed in S. cerevisiae strain SFY123 (MATa, ADE2, his3-11, 15 leu2-3, 112 LYS2, trp1-1, ura3 can1-100, H2B-CFP::Trp1) using the PEG/LiAc protocol described in (Gietz & Schiestl 2007). Next, a single transformed colony was picked to inoculate a starting preculture of SD-medium supplemented with 2% (w/v) of glucose. After overnight incubation at 30 C., cells were pelleted by centrifugation for 5 min at 2,000 rpm and further resuspended using fresh medium supplemented with 2% (w/v) of glucose and 2% (w/v) of raffinose. This last process was repeated twice. Centrifugation and subsequent resuspension of cells with fresh medium supplemented with 2% (w/v) of glucose and 2% (w/v) of raffinose were repeated twice. Resuspended cells were then used to inoculate 250 ml of SD-medium supplemented with 2% (w/v) of raffinose to an initial concentration of OD.sub.6000.2. The culture was incubated at 30 C. with shaking until exponential growth phase was reached (OD.sub.6000.8-1.0). Over-expression of the SUMO-citrine fusion proteins was induced by the addition of 2% (w/v) of galactose for 6 h at 30 C. while shaking. After protein over-expression, yeast cells were pelleted by centrifugation for 10 min at 2,000 rpm and 25 C. and further resuspended in resuspension buffer (45 mM Tris/HCl PH 7.5, 250 mM NaCl, 20 mM imidazole, 5 mM DTT).

(25) To analyze the stability of the SUMO-citrine proteins, the resuspended cells were used to generate lysates by TCA/NaOH extraction as described in (Zuk 1998). Cell lysates corresponding to 35mOD of cells expressing the citrine fusion proteins were resolved by SDS-PAGE and analyzed by western blot. An anti-GFP primary antibody was used to detect the presence of cleaved citrine and/or the full-length SUMO-citrine fusion protein. A cell lysate lacking a SUMO-tagged citrine was used as negative control.

Example 4 bdSUMO.SUP.Mut1 .is a Stable Tag in Eukaryotic Cellular Extracts

(26) We also analyzed the stability of scSUMO-, SUMOstar-, bdSUMO- and bdSUMO.sup.Mut1-MBP fusion proteins in different eukaryotic extracts. Each MBP fusion protein was incubated in a highly concentrated extract for 2 h at 25 C. and further analyzed by western blot (FIG. 9). Proteins tagged with SUMOstar and bdSUMO.sup.Mut1 were not cleaved in all samples, whereas the scSUMO as well as the bdSUMO fusions proteins were cleaved to different extents. Samples including a protease mix (Ulp1, SUMOstar protease, bdSENP1 and bdSENP1.sup.MutB protease, 1 M each) showed that there was not any inhibitory substance that could have prevented endogenous SUMO-specific proteins to cleave either the SUMOstar or the bdSUMO.sup.Mut1 fusion proteins. Therefore, these results confirm that bdSUMO.sup.Mut1 is also a suitable fusion tag that can be used in virtually any eukaryotic host.

Materials and Methods

(27) The stability of the different SUMO-tagged MBP fusion proteins was tested in different eukaryotic extracts (wheat germ extract, Xenopus laevis egg extract, rabbit reticulocytes extract, HeLa cell extract and Drosophila S2 cell extract). The preparation of the lysates was performed as described in (Mureev et al. 2009; Kovtun et al. 2010; Blow & Laskey 1986; Crevel & Cotterill 1991; Endo et al. 2010; Jackson & Hunt 1983). For a 12.5 l volume reaction, 1 M of SUMO-tagged substrate was incubated with 10 l of each lysate for 2 h at 25 C. in the presence and absence of a protease mix containing 1 M of different SUMO-specific proteases (Ulp1, SUMOstar, bdSENP1 and bdSENP1.sup.MutB protease). Finally, the reaction was stopped by adding SDS sample buffer (3% SDS, 125 mM Tris/HCl, (pH 6.8), 50 mM DTT, 1 M sucrose and coomassie brilliant blue G250) to a final volume of 100 l. The stability of the substrates was analyzed by western blot using an anti-MBP primary antibody.

Example 5 Purification of a Hetero-Dimeric Complex in Yeast

(28) Two or more site-specific proteases are used to purify protein complexes with defined subunit stoichiometry (Frey & Grlich 2014b). Although this technology is straightforward and requires only of proteases with mutually exclusive substrate specificity, it has been so far apply in prokaryotic systems such as E. coli. Here, we show that the SUMOvera system together with the SUMOstar system can be used to purify dimeric protein complexes that are expressed in S. cerevisiae. As proof of principle, we selected the high affinity hetero-dimeric complex composed of the anti-GFP nanobody (Nb) (Kirchhofer et al. 2010) and the GFP-like protein named citrine (Heikal et al. 2000). Nb was cloned as an H14-bdSUMO.sup.Mut1-fusion protein, whereas citrine was fused to an N-terminal ZZ-SUMOstar tag (FIG. 10-A).

(29) After the co-expression of both proteins, we used two consecutive capture-and-realize chromatographic steps as described in (Frey & Grlich 2014b) for the purification of the Nb.Math.Citrine complex (FIG. 10-B). For the first chromatographic step, the cellular lysate containing the complex (soluble material) was loaded onto a column containing a Ni.sup.2+ chelate matrix to immobilize the dimeric complex via the H14-bdSUMO.sup.Mut1-Nb fusion protein. The non-bound material is then removed from the column after washing the matrix (non-bound material). The elution of the protein complex was then achieved by on-column protein cleavage using bdSENP1.sup.MutB protease. Notably, untagged citrine was present in the eluted complex due to the partial cleavage ZZ-SUMOstar tag by endogenous Ulp1 as observed in (FIG. 8). After a second affinity chromatographic step, the stoichiometric protein complex was obtained since the surplus of untagged Nb from elute 1 as well as the untagged Citrine were removed from the protein preparation (non-bound material 2). The stoichiometric complex is obtained by a second on-column protein cleavage using SUMOstar protease. Finally, the use of bdSENP1.sup.MutB and the SUMOstar proteases allowed obtaining an untagged and a purer complex as both affinity tags and all contaminants remained bound to both affinity matrices (Post elution with imidazole and SDS).

Materials and Methods

(30) The transformation of the Nb.Math.Citrine complex was performed in S. cerevisiae strain SFY123 (MATa, ADE2, his3-11, 15 leu2-3, 112 LYS2, trp1-1, ura3, can1-100). One plasmid codified for the fusion protein H14-bdSUMO.sup.Mut1-Nb and the second for the ZZ-SUMOstar-Citrine protein. Expression was performed as described in Example 3. After expression, the yeast pellet was resuspended in resuspension buffer (50 mM Tris/HCl PH 7.5, 150 mM NaCl, 20 mM imidazole, 5 mM DTT) to a final OD.sub.600 of 20-50/ml. A cocktail of different protease inhibitors was added to the resuspended cells to a final concentration of 1. The stock (500) of protease inhibitors contained the following compounds: 5 mg/ml aprotinin, 5 mg/ml leupeptin, 2.5 mg/ml elastatinal, 2.5 mg/ml chymostatin and 0.5 mg/ml pepstatin A. Cells were snap-frozen in liquid nitrogen and immediately thawn in a hot water bath for 10-15 min. The cellular membrane was disrupted using glass beads and constant vortexing to generate a cell lysate. Cell debris and insoluble material was removed from the lysed cells by ultracentrifugation at 38,000 rpm and 4 C. for 1.5h.

(31) The purification of binary protein complexes was performed as described in (Frey & Grlich 2014b). Briefly, the cleared yeast lysate was incubated with Ni.sup.2+ chelate beads for 1 h at 4 C. Subsequently, beads were place in a column and contaminant proteins were removed by adding 2 column volume (CV) of resuspension buffer. The protein complex was then eluted by adding 1CV elution buffer containing 200 nM of bdSENP1.sup.MutB protease for 1 h at 4 C. For the second purification step, the complex was immobilized via the ZZ-tag (tandem repeat of the Z domain from staphylococcal protein A) using silica beads coupled to an anti ZZ-domain affibody. The sample was incubated with 2 ml of anti Z-domain beads for 1 h at 4 C. After incubation, beads were washed with 2CV of resuspension buffer. For protein elution, silica beads were incubated with resuspension buffer supplemented with 100 nM of SUMOstar protease for 1 h at 4 C. After elution, samples were pooled and frozen in liquid nitrogen for storage at 80. Protein samples were taken at different steps of the expression and purification of the complex and analyzed by SDS-PAGE.

Example 6. Over-Expression of bdSENP1.SUP.MutB .Protease in S. cerevisiae

(32) SUMO-specific proteases are the most efficient proteases for the removal of affinity tag from the protein of choice (Frey & Grlich 2014a; Malakhov et al. 2004). Unfortunately, this process can be only performed in vitro as the use of any exogenous SUMO-specific protease in vivo would compromise the viability of any eukaryotic cell. Since the site-specific proteolysis in vivo and specially using SUMO-specific proteases is of high relevance, we decided to test whether over-expression of bdSENP1.sup.MutB protease could be achieved without affecting the viability of yeast cells. Yeast cells were transformed with a high-copy vector encoding for Ulp1, bdSENP1, SUMOstar protease or bdSENP1.sup.MutB protease (FIG. 11). As a negative control, we transformed cells with a plasmid codifying for a bdSENP1 mutant (C440S) that is completely inactive, and therefore is not able to hydrolyze scSUMO. All transformed cells were able to grow when the expression of the proteases was repressed by the presence of glucose. After induction protease over-expression by the addition of galactose for 72 h at 30 C., only yeast cells expressing either bdSENP1.sup.MutB protease or bdSENP1 (C440S) grew to a very similar. In contrast, over-expression of Ulp1, bdSENP1 and SUMOstar protease led to complete cellular death in the presence of galactose. bdSENP1.sup.MutB protease is therefore the only SUMO-specific protease that could be use to perform site-directed proteolysis in living yeast cells.

Materials and Methods

(33) S. cerevisiae cells strain SFY123 (MATa, ADE2, his3-11, 15 leu2-3, 112 LYS2, trp1-1, ura3, can1-100) were used to test their viability after over-expression of a SUMO-specific protease. First, cells were transformed with a galactose inducible expression plasmid using the protocol described in (Gietz & Schiestl 2007). Transformed cells were inoculated in SD-medium supplemented with 2% (w/v) of glucose and further inoculated for 16 h at 30 C. Cells were then pelleted and resuspended using fresh SD-medium supplemented with 2% (w/v) glucose and 2% (w/v) raffinose. Resuspended cells were incubated in of SD-medium containing 2% (w/v) of raffinose until exponential growth phase was reached (OD.sub.6001.0). Next, cells were sequentially diluted in 10-fold steps and 5 l of each dilution were spotted in plates containing either galactose (0.02% and 0.2%) or glucose (2%). Plates were incubated for 72 h at 30 C. and further scanned.

(34) TABLE-US-00003 TABLE 1 Cleavage efficiency of different bdSUMO mutants by SUMO-specific proteases. scSUMO hsSUMO2 bdSUMO mutants .sup.C Protease .sup.A,B wt wt wt D67K Mut1 Mut8 Mut10 Mut11 Mut12 Mut13 Mut14 Mut15 Ulp1 ++++ + ++++ resistant resistant resistant resistant resistant resistant resistant resistant resistant SUMOstar ++++ + ++++ resistant resistant resistant resistant resistant resistant resistant resistant resistant protease hsSENP2 ++++ ++++ ++ resistant resistant resistant resistant resistant resistant resistant resistant resistant bdSENP1 ++ ++ ++++ + + + + + + ++ + + bdSENP1.sup.MutB resistant resistant + +++ ++++ ++ ++++ ++++ ++++ ++++ ++++ ++++ bdSENP1.sup.MutG + ++++ ++ +++ ++++ + ++++ +++ ++ +++ ++ +++ bdSENP1.sup.MutH resistant resistant + +++ ++++ +++ ++++ ++++ ++++ ++++ ++++ ++++ bdSENP1.sup.Muti resistant resistant + + ++++ + ++ ++ ++ +++ ++ ++ bdSENP1.sup.MutJ + ++ ++ ++++ ++++ ++ ++++ ++++ ++ ++++ ++ ++++ bdSENP1.sup.MutK resistant resistant ++ ++++ ++++ +++ ++++ ++++ ++++ ++++ ++++ ++++ .sup.A: The protease concentration used for the reaction is sufficient to completely cleave 100 M of the cognate SUMO protein within 1h at 25 C. .sup.B: Mutations of the tested bdSENP1 variants are shown in FIG. 15 .sup.C: Mutations of the tested bdSUMO variants are shown in FIG. 15 +: 25% cleavage of the SUMO fusion protein ++: 50% cleavage of the SUMO fusion protein +++: 75% cleavage of the SUMO fusion protein ++++: ~100% cleavage of the SUMO fusion protein

LIST OF REFERENCES

(35) Amor-Mahjoub, M. et al., 2006. The effect of the hexahistidine-tag in the oligomerization of HSC70 constructs. Journal of Chromatography B: Analytical Technologies in the Biomedical and Life Sciences, 844 (2), pp. 328-334. Bachmair, a, Finley, D. & Varshavsky, a, 1986. In vivo half-life of a protein is a function of its amino-terminal residue. Science (New York, N.Y.), 234 (4773), pp. 179-186. BOHNSACK, M. T., 2004. Exportin 5 is a RanGTP-dependent dsRNA-binding protein that mediates nuclear export of pre-miRNAs. RNA, 10 (2), pp. 185-191. Butt, T., Malakhova, O. & Malakhov, M., 2010. Methods and compositions for enhanced protein expression and purification. U.S. Pat. No. 7,655,413. Butt, T. R. et al., 2005. SUMO fusion technology for difficult-to-express proteins. Protein Expression and Purification, 43 (1), pp. 1-9. Chan, P. et al., 2011. Purification of Heterotrimeric G Protein Subunits by GST-Ric-8 Association: PRIMARY CHARACTERIZATION OF PURIFIED Golf. Journal of Biological Chemistry, 286 (4), pp. 2625-2635. Chant, A. et al., 2005. Attachment of a histidine tag to the minimal zinc finger protein of the Aspergillus nidulans gene regulatory protein AreA causes a conformational change at the DNA-binding site. Protein Expression and Purification, 39 (2), pp. 152-159. Chen, X., Pham, E. & Truong, K., 2010. TEV protease-facilitated stoichiometric delivery of multiple genes using a single expression vector. Protein Science, 19 (12), pp. 2379-2388. Frey, S. & Grlich, D., 2014a. A new set of highly efficient, tag-cleaving proteases for purifying recombinant proteins. Journal of Chromatography A, 1337, pp. 95-105. Frey, S. & Grlich, D., 2014b. Purification of protein complexes of defined subunit stoichiometry using a set of orthogonal, tag-cleaving proteases. Journal of Chromatography A, 1337, pp. 106-115. Harder, B. et al., 2008. TEV protease-mediated cleavage in Drosophila as a tool to analyze protein functions in living organisms. BioTechniques, 44 (6), pp. 765-772. Harper, S. & Speicher, D. W., 2011. Purification of Proteins Fused to Glutathione S-Transferase. In Methods in molecular biology (Clifton, N.J.). pp. 259-280. Hendriks, I. A. & Vertegaal, A. C. O., 2016. A comprehensive compilation of SUMO proteomics. Nature reviews. Molecular cell biology, 17 (9), pp. 581-95. Herrmann, J., Lerman, L. O. & Lerman, A., 2007. Ubiquitin and ubiquitin-like proteins in protein regulation. Circulation Research, 100 (9), pp. 1276-1291. Himeno, H., Kurita, D. & Muto, A., 2014. TmRNA-mediated trans-translation as the major ribosome rescue system in a bacterial cell. Frontiers in Genetics, 5 (APR), pp. 1-13. Katzmann, D. J., Babst, M. & Emr, S. D., 2001. Ubiquitin-dependent sorting into the multivesicular body pathway requires the function of a conserved endosomal protein sorting complex, ESCRT-I. Cell, 106 (2), pp. 145-155. Keiler, K. C., 2008. Biology of trans-Translation. Annual Review of Microbiology, 62 (1), pp. 133-151. Kerscher, O., Felberbaum, R. & Hochstrasser, M., 2006. Modification of proteins by ubiquitin and ubiquitin-like proteins. Annual review of cell and developmental biology, 22, pp. 159-80. Khorasanizadeh, S., Peters, I. D. & Roder, H., 1996. Evidence for a three-state model of protein folding from kinetic analysis of ubiquitin variants with altered core residues. Nature structural biology, 3 (2), pp. 193-205. Kimple, M. E., Brill, A. L. & Pasker, R. L., 2013. Overview of affinity tags for protein purification. Current Protocols in Protein Science, (SUPPL.73), pp. 608-616. Kosobokova, E. N., Skrypnik, K. A. & Kosorukov, V. S., 2016. Overview of fusion tags for recombinant proteins. Biochemistry (Moscow), 81 (3), pp. 187-200. Kostelansky, M. S. et al., 2007. Molecular Architecture and Functional Model of the Complete Yeast ESCRT-I Heterotetramer. Cell, 129 (3), pp. 485-498. Kuwata, T. & Nakamura, T., 2008. BCL11A is a SUMOylated protein and recruits SUMO-conjugation enzymes in its nuclear body. Genes to Cells, 13 (9), pp. 931-940. Malakhov, M. P. et al., 2004. SUMO fusions and SUMO-specific protease for efficient expression and purification of proteins. Journal of Structural and Functional Genomics, 5 (1-2), pp. 75-86. Marblestone, J. G. et al., 2006. Comparison of SUMO fusion technology with traditional gene fusion systems: enhanced expression and solubility with SUMO. Protein science: a publication of the Protein Society, 15 (1), pp. 182-9. McCoy, J. & La Ville, E., 1997. Expression and Purification of Thioredoxin Fusion Proteins. In Current Protocols in Protein Science. Hoboken, NJ, USA: John Wiley & Sons, Inc., p. 6.7.1-6.7.14. Pattenden, L. K. & Thomas, W. G., 2008. Amylose Affinity Chromatography of Maltose-Binding Protein. In Affinity Chromatography. Totowa, NJ: Humana Press, pp. 169-190. Rao, R. N., Allen, N. E. & Hobbs, J. N., 1983. Genetic and enzymatic basis of hygromycin B resistance in Escherichia coli. Genetic and Enzymatic Basis of Hygromycin B Resistance in Escherichia coli. Antimicrobial Agents and Chemotherapy, 24 (5), pp. 689-695. Reverter, D. & Lima, C. D., 2004. A basis for SUMO protease specificity provided by analysis of human Senp2 and a Senp2-SUMO complex. Structure, 12 (8), pp. 1519-1531. Reverter, D. & Lima, C. D., 2006. Structural basis for SENP2 protease interactions with SUMO precursors and conjugated substrates. Nature Structural & Molecular Biology, 13 (12), pp. 1060-1068. Sato, M. & Toda, T., 2007. Alp7/TACC is a crucial target in Ran-GTPase-dependent spindle formation in fission yeast. Nature, 447 (7142), pp. 334-337. Shen, L. et al., 2006. SUMO protease SENP1 induces isomerization of the scissile peptide bond. Nature Structural & Molecular Biology, 13 (12), pp. 1069-1077. Suh-Lailam, B. B. & Hevel, J. M., 2009. Efficient cleavage of problematic tobacco etch virus (TEV)-protein arginine methyltransferase constructs. Analytical Biochemistry, 387 (1), pp. 130-132. Taxis, C. & Knop, M., 2012. TIPI: TEV Protease-Mediated Induction of Protein Instability. In Methods in Molecular Biology. pp. 611-626. Vertegaal, A. C. O. et al., 2004. A proteomic study of SUMO-2 target proteins. Journal of Biological Chemistry, 279 (32), pp. 33791-33798. Wang, K. H. et al., 2008. Tuning the strength of a bacterial N-end rule degradation signal. Journal of Biological Chemistry, 283 (36), pp. 24600-24607. Woestenenk, E. A. et al., 2004. His tag effect on solubility of human proteins produced in Escherichia coli: A comparison between four expression vectors. Journal of Structural and Functional Genomics, 5 (3), pp. 217-229. Xu, Z. et al., 2006. Crystal structure of the SENP1 mutant C603S-SUMO complex reveals the hydrolytic mechanism of SUMO-specific protease. The Biochemical journal, 398 (3), pp. 345-352. YAN, Y., ORCUTT, S. J. & STRICKLER, J. E., 2009. The use of SUMO as a fusion system for protein expression and purification. Chimica oggi, 27 (6). Zuo, X., Li, S., et al., 2005. Enhanced expression and purification of membrane proteins by SUMO fusion in Escherichia coli. Journal of Structural and Functional Genomics, 6 (2-3), pp. 103-111. Zuo, X., Mattern, M. R., et al., 2005. Expression and purification of SARS coronavirus proteins using SUMO-fusions. Protein Expression and Purification, 42 (1), pp. 100-110.