Targeted protein characterization by mass spectrometry

10877044 · 2020-12-29

    Abstract

    The invention provides methods for characterizing a target protein wherein a mass spectrum of digest peptides of the target protein is acquired and compared with measured reference mass spectra of digest peptides of reference proteins or of proteins of reference host cells. The comparison comprises determining similarity scores of the intensity patterns of the mass spectrum and the reference mass spectra. The characterization comprises assigning the target protein to a reference protein having a reference mass spectrum with a similarity score above a predetermined threshold.

    Claims

    1. A method for characterizing a target protein, comprising the steps of: providing a library of measured reference mass spectra of reference proteins wherein each reference mass spectrum is acquired for an enzymatic digest of one reference protein and the conditions of the enzymatic digest are substantially equal for all reference proteins; enzymatically digesting the target protein under the same conditions used for the reference proteins; acquiring a mass spectrum for the enzymatically digested target protein; determining similarity scores for intensity patterns of the mass spectrum and the reference mass spectra; and characterizing the target protein by assigning the target protein to a reference protein having a reference mass spectrum with a similarity score above a predetermined threshold.

    2. The method according to claim 1, wherein the enzymatic digest uses at least one of trypsin, IdeS and Lys-C.

    3. The method according to claim 2, wherein the target protein and the reference proteins are equally denatured by a reducing agent and by a denaturing agent prior to the enzymatic digest.

    4. The method according to claim 1, wherein the mass spectrum of the digested target protein and the reference mass spectra are acquired by a MALDI-TOF mass spectrometer.

    5. The method according to claim 1, wherein the reference proteins have different amino acid sequences, the enzymatic digests of the reference proteins comprise digest peptides each having masses specific to the corresponding reference protein and the target protein is identified by assignment to one of the reference proteins.

    6. The method according to claim 1, wherein the target protein is extracted from a complex sample mixture by affinity capture.

    7. The method according to claim 6, wherein the complex mixture is one of urine, plasma, serum, spinal fluid and lysed tissue cells.

    8. The method according to claim 6, wherein an antibody is used for the affinity capture.

    9. The method according to claim 1, wherein the target protein is a biopharmaceutical.

    10. The method according to claim 9, wherein the reference proteins are different protein isoforms of the biopharmaceutical which are formed by at least one of alternative splicings, sequence variations, post-transcriptional modifications and stress-induced modifications, and the target protein is identified as one of the isoforms by assignment to one of the reference proteins.

    11. The method according to claim 1, wherein the target protein is an antibody.

    12. The method according to claim 11 wherein the reference proteins are different modifications of the antibody, the target protein and the reference proteins are enzymatically digested into domains and the target protein is identified to be modified by assignment to one of the reference proteins.

    13. The method according to claim 12, wherein the modifications are at least one of glycosylation, oxidation, acetylation and amidation.

    14. The method according to claim 13, wherein the enzyme IdeS is used to prepare Fc/2 antibody domains for glycoprofiling.

    15. The method according to claim 12, wherein one reference protein is the antibody and the target protein is identified to be the antibody by assignment to this reference protein.

    16. The method according to claim 1, wherein redundant mass signals which are present in a majority of the measured reference mass spectra are removed, and reduced reference mass spectra are provided and/or used for the determining step.

    17. The method according to claim 16, wherein the reference proteins have different amino acid sequences, the enzymatic digest of the reference proteins comprises digest peptides with masses which are characteristic for the corresponding reference protein and reduced reference mass spectra comprise only the characteristic mass signals.

    18. The method according to claim 16, wherein the reduced reference mass spectra additionally comprise some abundant mass signals which are present in substantially all reduced reference mass spectra and the number of the abundant mass signals is lower than the number of the characteristic mass signals.

    19. The method according to claim 1, wherein the similarity scores are determined by a cosine similarity measure or a cross-correlation.

    20. A method for characterizing proteins of target host cells, comprising the steps of: providing a library of measured reference mass spectra of proteins of reference host cells wherein each reference mass spectrum comprises mass signals of digest peptides which are generated by an enzymatic digest of multiple proteins of one type of reference host cells and are extracted by affinity capture using multiple affinity agents after the enzymatic digest; enzymatically digesting the proteins of the target host cells under the same conditions used for the proteins of the reference host cells and then extracting the digest peptides by affinity capture using the multiple affinity agents; acquiring a mass spectrum for the extracted digest peptides of the target host cells; determining similarity scores of the intensity pattern of the mass spectrum and the reference mass spectra; and characterizing the target host cells by assigning the proteins of the target host cells to a type of reference host cell having a reference spectrum with a similarity score above a predetermined threshold.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    (1) FIG. 1 illustrates a schematic workflow of a method for characterizing a target protein according to the present invention.

    (2) FIG. 2 illustrates the preferred workflow for a fast denaturing and digestion of a target protein.

    DETAILED DESCRIPTION

    (3) The invention provides a preferred method for identifying a target protein by peptide mass fingerprinting (PMF) of the peptide mixture produced by enzymatic digestion of the target protein by a protease such as trypsin, particularly a method for affirming the identity for quality assurance. The identification will be based, quite differently from the usual peptide mass fingerprinting, on the similarity of the measured intensity pattern of the target protein with the measured intensity pattern of the reference proteins.

    (4) Whereas known peptide mass fingerprinting relies on peptide masses virtually calculated from amino acid sequences found in libraries, the new method compares the similarities of masses and intensities of a measured mass spectrum with those of measured reference mass spectra of reference proteins in a library. A particular advantage of this approach is that it accounts for the fact that even identical digest peptides of two highly similar reference proteins may be represented in the two corresponding reference mass spectra, but produce mass signals that vary drastically in their relative intensities due to different physicochemical microenvironments near the cleavage sites.

    (5) Any suitable program may be used which determines the intensity pattern similarity between the mass spectrum of the target protein and the reference mass spectra of the reference proteins that form the library. As an example, the similarity of the intensity pattern can be rapidly determined by forming the cosine of the angle between the vectors formed by the intensity pattern (cosine similarity score) or by cross-correlation.
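The cosine similarity scoring described above can be sketched as follows. This is a minimal illustration, not the program used in the invention: the function names, the fixed-width m/z binning, the bin width and the example peak lists are all assumptions made for the sketch. The threshold of 0.9 follows the acceptance criterion mentioned in the rapid identity testing example.

```python
import math


def bin_spectrum(peaks, bin_width=1.0):
    """Sum intensities into fixed-width m/z bins (assumed alignment scheme).

    peaks is a list of (mz, intensity) tuples from a centroided spectrum.
    """
    binned = {}
    for mz, intensity in peaks:
        key = int(mz // bin_width)
        binned[key] = binned.get(key, 0.0) + intensity
    return binned


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse intensity vectors."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def assign_target(target_peaks, library, threshold=0.9, bin_width=1.0):
    """Score the target against every reference; return (assignment, scores).

    The assignment is None when no reference scores above the threshold.
    """
    target = bin_spectrum(target_peaks, bin_width)
    scores = {
        name: cosine_similarity(target, bin_spectrum(peaks, bin_width))
        for name, peaks in library.items()
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > threshold else None), scores


# Hypothetical peak lists, for illustration only.
library = {
    "mAb_A": [(1000.5, 10.0), (1500.2, 5.0), (2000.7, 1.0)],
    "mAb_B": [(1000.5, 1.0), (1500.2, 5.0), (2300.1, 10.0)],
}
target = [(1000.5, 20.0), (1500.2, 10.0), (2000.7, 2.0)]
assignment, scores = assign_target(target, library)
```

Because the cosine measure is insensitive to overall scaling, the target above matches "mAb_A" even though its absolute intensities are doubled. Fixed-width binning is the simplest alignment choice; a real implementation would more likely match peaks within a mass tolerance to avoid bin-edge effects.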

    (6) The reference spectra of reference proteins are preferably reduced to a smaller set of specific mass signals with their intensities (subset profile), which results in an improved specificity of the assignment. The reduced set preferably comprises approximately three to fifteen peptides. There are different ways to generate such subset profiles, including typical statistical methods for spectra classification such as principal component analysis (PCA), hierarchical clustering (HC) or receiver operating characteristic (ROC) curves. If the reference proteins are antibodies, characteristic digest peptides can be determined from the complementarity determining regions (CDRs). The selection of the specific mass signals has to be repeated if the library is enlarged by the addition of further reference mass spectra. Optionally, a few abundant redundant peptide mass signals can be added for quality control.
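One simple way to build such subset profiles is to drop every mass signal that occurs in a majority of the reference spectra, as recited in claim 16. The sketch below illustrates only that majority filter; it does not implement PCA, hierarchical clustering or ROC analysis, and the function name, the binning scheme and the example library are assumptions.

```python
from collections import Counter


def reduce_library(library, bin_width=1.0):
    """Remove mass signals found in more than half of the reference spectra.

    library maps a reference name to a list of (mz, intensity) tuples.
    Returns (reduced_library, redundant_bins).
    """
    # Set of occupied m/z bins for each reference spectrum.
    occupied = {
        name: {int(mz // bin_width) for mz, _ in peaks}
        for name, peaks in library.items()
    }
    counts = Counter(b for bins in occupied.values() for b in bins)
    majority = len(library) / 2
    redundant = {b for b, n in counts.items() if n > majority}
    reduced = {
        name: [(mz, i) for mz, i in peaks if int(mz // bin_width) not in redundant]
        for name, peaks in library.items()
    }
    return reduced, redundant


# Hypothetical library, for illustration only.
library = {
    "ref_A": [(1000.5, 10.0), (1500.2, 5.0), (2000.7, 1.0)],
    "ref_B": [(1000.5, 8.0), (1500.2, 6.0), (2300.1, 9.0)],
    "ref_C": [(1000.5, 9.0), (1800.3, 4.0), (2600.9, 7.0)],
}
reduced, redundant = reduce_library(library)
```

Since the redundant set depends on which spectra are in the library, the reduction has to be recomputed whenever reference spectra are added, which matches the requirement stated above.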

    (7) The method is preferably performed with rapid protein denaturing and digesting methods, typically in less than 30 minutes and, in particular, in less than 15 minutes. The resulting peptide mixture will be prepared with matrix solution to produce a sample spot on a mass spectrometric sample support plate for use in time-of-flight mass spectrometers with ionization by matrix-assisted laser desorption (MALDI). Reasonably priced table-top time-of-flight reflector mass spectrometers may be used for spectrum acquisition.

    (8) The method of the invention can be used particularly in antibody production quality control, or in clone selection workflows during pharmaceutical development, e.g., to screen glycan profiles in intact Fc-domains or for clones properly carrying the target sequence, based on differentiating relative intensity patterns in the reference mass spectra of the digest peptides of the reference proteins. For the characterization of an antibody, protocols can be used to achieve a result from providing the antibody to the automatic identification within 15 minutes, in particular protocols based on trypsin/Lys-C digests.

    (9) The target proteins and the reference proteins may be rapidly denatured using a mixture of reducing and denaturing agents such as dithiothreitol and trifluoroethanol, and rapidly digested by a protease such as trypsin or by a serial or parallel double digest (e.g., trypsin/Lys-C). The peptide mixture may be prepared with a matrix substance for ionization by matrix-assisted laser desorption (MALDI). HCCA (α-cyano-4-hydroxycinnamic acid) may be used as matrix substance.

    (10) The methods, however, can also be applied to targeted proteins which are present in a complex substance mixture. The targeted protein then has to be extracted and purified. To avoid time-consuming methods such as HPLC, for instance, fast purification methods have to be applied. Examples are solid phase extraction, affinity capture on column or on magnetic beads, and the like.

    (11) Example for a rapid protein identity testing embodiment: The digest time including denaturation, reduction and proteolytic digestion is reduced to 15 minutes, with subsequent MALDI sample preparation including a simple on-target purification step. The quality of the MALDI peptide mass fingerprints achieved from all tested antibody digests is high (70% sequence coverage) and enables an identity assay that is substantially based on the differentiating peptides only, i.e., peptides derived from the variable N-termini of 120 residues of the antibodies; 4-13 peptides are used in these profiles in addition to six abundant common peptides. Profiles of these antibodies allow for their distinction based on cosine similarity scoring (CSS) with CSS > 0.9 as acceptance criterion; non-matching identities yield CSS values of 0.2-0.6. In addition, butterfly plots allow the visual confirmation of the ID provided by the software.

    (12) Example for an embodiment procedure for clone selection: Digest and sample preparation time of IdeS digestion and MALDI sample preparation is about 30 minutes. Major glycans such as G0F, G1F, G2F and G3F are assayed by direct profiling of the Fc-domain of monoclonal antibodies together with the proper state of the Fc C-terminus. Spectra acquisition and processing are completed in less than 10 sec/sample. Different attributes, such as the match of the glycan profile with a reference profile with a certain score or the test for G0F as being the base peak glycan, are reported in the software. Automation and parallelization of sample processing in the clone selection workflow permit hundreds of samples to be assessed per day with a high degree of automation and drastically accelerate clone selection based on major Fc-glycans.
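The base-peak attribute mentioned above can be illustrated with a short sketch. It assumes the Fc/2 glycoform signals have already been picked and labelled; the function name, the report layout and the intensity values are hypothetical and do not reflect the actual software.

```python
def glycan_report(intensities, expected_base="G0F"):
    """Report relative glycoform abundances and check the base peak glycan.

    intensities maps a glycoform label to its measured peak intensity.
    """
    total = sum(intensities.values())
    if total <= 0:
        raise ValueError("no glycoform signal detected")
    fractions = {name: value / total for name, value in intensities.items()}
    base_peak = max(intensities, key=intensities.get)
    return {
        "fractions": fractions,
        "base_peak": base_peak,
        "base_peak_ok": base_peak == expected_base,
    }


# Hypothetical Fc/2 glycoform intensities for one clone.
report = glycan_report({"G0F": 60.0, "G1F": 30.0, "G2F": 10.0, "G3F": 0.0})
```

A clone whose most intense glycoform is not G0F would be reported with `base_peak_ok` set to `False`, giving a quick pass/fail attribute alongside the profile similarity score.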

    (13) The invention further provides a preferred method for characterizing proteins of target host cells. Here, anti-peptide antibodies are used to extract digest peptides of the proteins of the target host cells after a tryptic digestion. Typically, stable-isotope-labelled standard peptides (SIS-peptides) can additionally be added after digestion, so that the SIS-peptides and the native peptides obtained by trypsin digestion can be jointly extracted with an anti-peptide antibody. Multiple antibodies (either a polyclonal antibody or a pool of monoclonal antibodies) and SIS-peptides can then be used together in the analysis to extract 5, 10 or more peptides specific to a host cell protein (HCP) analysis. The profile is then defined by the SIS-peptide mass signals, with the intensity at a level relevant for each host cell protein. The native peptide intensity can then be determined from the peak intensity ratio of native to SIS peptides, and deviations from the target HCP levels can be automatically detected and accurately quantified.
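The native/SIS ratio evaluation can be sketched as follows. The sketch assumes that a native peptide and its SIS counterpart ionize with equal response, so that the intensity ratio equals the amount ratio; the function names, the 20% tolerance and the example values are hypothetical.

```python
def quantify_native(native_intensity, sis_intensity, sis_amount):
    """Estimate the native peptide amount from the native/SIS intensity ratio.

    Assumes equal ionization response of the native and SIS peptide.
    """
    if sis_intensity <= 0:
        raise ValueError("SIS peptide signal missing")
    return native_intensity / sis_intensity * sis_amount


def flag_deviations(measurements, target_levels, tolerance=0.2):
    """Return {peptide: (amount, flagged)} for each measured peptide.

    measurements maps peptide -> (native_intensity, sis_intensity, sis_amount).
    A peptide is flagged when its amount deviates from the target level
    by more than the relative tolerance (an assumed criterion).
    """
    results = {}
    for peptide, (native, sis, sis_amount) in measurements.items():
        amount = quantify_native(native, sis, sis_amount)
        target = target_levels[peptide]
        flagged = abs(amount - target) > tolerance * target
        results[peptide] = (amount, flagged)
    return results


# Hypothetical intensities and spiked SIS amounts (arbitrary units).
measurements = {
    "pepA": (500.0, 1000.0, 10.0),
    "pepB": (3000.0, 1000.0, 10.0),
}
results = flag_deviations(measurements, {"pepA": 5.0, "pepB": 10.0})
```

In this example "pepA" matches its target level, while "pepB" is three times above target and is flagged.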