METHOD FOR DETECTING CROSSLINKED PEPTIDES VIA REPRODUCIBLE FRAGMENTATION IN A MASS SPECTROMETER
20190234956 ยท 2019-08-01
Assignee
Inventors
- William R. Montfort (Tucson, AZ, US)
- Jessica A. Wales (Tucson, AZ, US)
- Linda A. Breci (Tucson, AZ, US)
Cpc classification
G01N33/6845
PHYSICS
G01N33/6803
PHYSICS
International classification
Abstract
Methods of identifying a crosslinking site or a binding site on a protein are described herein. The protein may comprise a binding site in a vicinity of the crosslinking site and identification of the crosslinking site may aid in identifying a location of the binding site in the protein. Methods of identifying a peptide or protein from a complex mixture are also described herein. The invention features a crosslinking agent that is configured to interact with a binding site of a protein and comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment.
Claims
1. A method of identifying a binding site of a protein, the protein comprising a crosslinking site, the method comprising: a. reacting said protein with a crosslinking agent at the crosslinking site to form a crosslinked protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; b. cleaving said protein into two or more peptides, wherein at least one of the peptides is tagged by the crosslinking agent; and c. analyzing the peptides by tandem mass spectrometry to detect the tagged peptide, wherein said tagged peptide fragments to yield the signature mass fragment, and wherein the signature mass fragment is detected to identify the tagged peptide; wherein the crosslinking agent comprises (1) an agonist or antagonist that binds to the binding site, and (2) a reactive moiety that reacts with the crosslinking site on the protein, and wherein identification of the tagged peptide indicates the crosslinking site and the binding site on the protein.
2. The method of claim 1, wherein the binding site is in a vicinity of the crosslinking site.
3. The method of claim 1, wherein the crosslinking site is at a first distance away from the binding site, the first distance being about the same as a second distance stretching between the agonist or antagonist to the reactive moiety on the crosslinking agent.
4. The method of claim 1, wherein the binding site is a receptor.
5. The method of claim 1, wherein the crosslinking agent comprises a photo-cleavable diazirine moiety which transforms into a reactive carbene radical upon UV irradiation, wherein the carbene radical reacts with the protein to form the crosslinked protein.
6. (canceled)
7. The method of claim 1, wherein a bond linking an amide functional group and an ether functional group is broken during the analysis of the tagged peptide, thereby forming the signature mass fragment having an m/z ratio value.
8. The method of claim 1, wherein the substituent configured to fragment is according to the following structure: ##STR00003##
9. The method of claim 1, wherein the m/z ratio value of the signature mass fragment corresponds to an ion comprising the tagged peptide mass.
10. The method of claim 1, wherein the signature mass fragment corresponds to a singly charged fragment ion which comprises a biotin moiety with an amide substituent and has a m/z ratio of 270.127.
11. The method of claim 1, wherein the peptides are dissolved in acidic solvent before being analyzed by tandem mass spectrometry.
12. The method of claim 1, wherein the crosslinking agent is according to the following structure: ##STR00004##
13. A method of identifying a binding site on a protein, wherein the method comprises: a. reacting said protein with a crosslinking agent at a crosslinking site in a vicinity of the binding site, to form a crosslinked protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; b. cleaving said protein into two or more peptides, wherein at least one of the peptides is tagged by the crosslinking agent; and c. analyzing the peptides by tandem mass spectrometry to detect the tagged peptide, wherein said tagged peptide fragments to yield the signature mass fragment, and wherein the signature mass fragment is detected to identify the tagged peptide; wherein identification of the tagged peptide indicates the crosslinking site on the protein, and wherein identification of the crosslinking site aids in identifying a location of the binding site in the protein.
14. The method of claim 13, wherein the protein comprises a guanylyl or guanylate cyclase, a truncated version of guanylyl cyclase or a bacterial H-NOX homolog.
15.-16. (canceled)
17. The method of claim 13, wherein the binding site is a drug binding site and the crosslinking agent comprises a modified drug.
18.-24. (canceled)
25. A method of identifying a binding site on a peptide or protein in a complex mixture, wherein identification of the binding site allows for identification of the peptide or protein which comprises the binding site, wherein the method comprises; a. providing the complex mixture; b. introducing a crosslinking agent which selectively interacts with the peptide or protein which comprises the binding site, and reacts with the peptide or protein to form a tagged peptide or protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; and c. analyzing a portion of the complex mixture by tandem mass spectrometry to detect the tagged peptide or protein, wherein said tagged peptide or protein fragments to yield the signature mass fragment, and wherein the signature mass fragment is detected to identify the tagged peptide or protein.
26. The method of claim 25, wherein the complex mixture comprises at least one of the following: a peptide, protein, biomolecule, biopolymer, cell, cell lysate, pharmaceutical agent, DNA strand, organelle, or small-molecule.
27. The method of claim 25, wherein the crosslinking agent is configured to interact with said binding site.
28. The method of claim 27, wherein the binding site is a receptor and the crosslinking agent comprises an agonist or antagonist of said receptor.
29. The method of claim 27, wherein the binding site is a drug binding site and the crosslinking agent comprises a modified drug.
30. The method of claim 25, wherein the crosslinking agent comprises a photo-cleavable diazirine moiety which transforms into a reactive carbene radical upon UV irradiation, wherein the carbene radical reacts with the peptide or protein to form the tagged peptide or protein.
31.-36. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
DESCRIPTION OF PREFERRED EMBODIMENTS
[0030] In one embodiment, the present invention features a method of identifying a crosslinking site on a protein. As a non-limiting example, the method may comprise: reacting said protein with a crosslinking agent at the crosslinking site to form a crosslinked protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; cleaving said protein into two or more peptides, wherein at least one of the peptides is tagged by the crosslinking agent; and analyzing the peptides by tandem mass spectrometry to detect the tagged peptide, wherein said tagged peptide fragments to yield the signature mass fragment, wherein the signature mass fragment is detected to identify the tagged peptide and wherein identification of the tagged peptide indicates the crosslinking site on the protein. In some embodiments, the protein may comprise a binding site in a vicinity of the crosslinking site and identification of the crosslinking site may aid in identifying a location of the binding site in the protein.
[0031] In another embodiment, the present invention features a method of identifying a binding site on a protein. As a non-limiting example, the method may comprise: reacting said protein with a crosslinking agent at a crosslinking site in a vicinity of the binding site, to form a crosslinked protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; cleaving said protein into two or more peptides, wherein at least one of the peptides is tagged by the crosslinking agent; and analyzing the peptides by tandem mass spectrometry to detect the tagged peptide, wherein said tagged peptide fragments to yield the signature mass fragment, and wherein the signature mass fragment is detected to identify the tagged peptide; wherein identification of the tagged peptide indicates the crosslinking site on the protein and wherein identification of the crosslinking site aids in identifying a location of the binding site in the protein.
[0032] In still another embodiment, the present invention may feature a method of identifying a peptide or protein from a complex mixture. As a non-limiting example, the method may comprise providing the complex mixture; introducing a crosslinking agent which selectively interacts with the peptide or protein, and reacts with the peptide or protein to form a tagged peptide or protein, wherein the crosslinking agent comprises a substituent configured to fragment in tandem mass spectrometry to yield a signature mass fragment; and analyzing a portion of the complex mixture by tandem mass spectrometry to detect the tagged peptide or protein, wherein said tagged peptide or protein fragments to yield the signature mass fragment, and wherein the signature mass fragment is detected to identify the tagged peptide or protein. In some embodiments, the complex mixture may comprise at least one of the following: a peptide, protein, biomolecule, biopolymer, cell, cell lysate, pharmaceutical agent, DNA strand, organelle, or small-molecule.
[0033] In some embodiments, the crosslinking agent may be configured to interact with said binding site. As a non-limiting example, the binding site may be a receptor and the crosslinking agent may comprise an agonist or antagonist of said receptor. In another embodiment, the binding site may be a drug binding site and the crosslinking agent may comprise a modified drug. In yet another embodiment, the protein may comprise a guanylyl or guanylate cyclase, a truncated version of guanylyl cyclase or a bacterial H-NOX homolog.
[0034] According to one embodiment, the crosslinking agent may comprise a photo-cleavable diazirine moiety which transforms into a reactive carbene radical upon UV irradiation and the carbene radical reacts with the protein to form the crosslinked or tagged protein or peptide. In preferred embodiments, the duration of UV irradiation of the crosslinking agent may be about 5 min. In other embodiments, the duration of UV irradiation of the crosslinking agent may be about 0.5, 1, 2, 3, 7, 10, 15, 20 or 30 min.
[0035] According to some preferred embodiments, a bond linking an amide functional group and an ether functional group may be broken during the analysis of the tagged peptide or protein, thereby forming the signature mass fragment having an m/z ratio value. Without wishing to limit the invention to any theory or mechanism, it may be that the nitrogen or the amide is protonated and this protonation aids in the breaking of the bond linking the amide group to the ether group. In other embodiments, the m/z ratio value of the signature mass fragment may correspond to an ion comprising the tagged peptide or protein mass. In still another embodiment, the signature mass fragment may correspond to a singly charged fragment ion which comprises a biotin moiety with an amide substituent and has a m/z ratio of 270.127. As a non-limiting example, the substituent configured to fragment may be according to the following structure:
##STR00001##
[0036] According to one embodiment, the peptides or proteins may be dissolved in acidic solvent before being analyzed by tandem mass spectrometry. In another embodiment, the crosslinking agent is according to the following structure:
##STR00002##
[0037] The following is provided as a non-limiting example of the invention. Equivalents or substitutes are within the scope of the invention
[0038] Use of a Photolyzable Stimulator, IWP-854 (Developed by Ironwood Pharmaceuticals).
[0039] To localize stimulator binding to sGC, a photolabile compound called IWP-854 was utilized. The IWP-854 core motif is based on IWP-051, which replaces the fused-ring system with two five-membered rings capable of free rotation (
[0040] Diazirines generate a highly reactive carbene in response to irradiation with 350-365 nm UV light, which rapidly inserts into neighboring CC, CH, OH, and NH bonds. Robust cross-linking is seen to all 20 amino acid side chains as well as to peptide backbone atoms. These properties have made diazirines popular in photoaffinity labeling experiments seeking drug/protein binding sites and are ideal for localizing stimulator binding to sGC.
[0041] IWP-854 Retains Stimulator Activity.
[0042] Stimulation of recombinant human (Hs) sGC was examined and IWP-854 was found to stimulate to a similar extent as BAY 41-2272 (
[0043] A truncated version of sGC from Manduca sexta was developed for analyses of compound-enhanced CO binding. Herein, these constructs are referred to as Ms SGC-NT. Ms sGC-NT constructs are fully heme-loaded and stable in the ferrous (functional) state, as indicated by Soret band absorption. One hallmark of stimulator compounds is their ability to enhance CO and NO binding to the heme domain. CO binding to Ms sGC-NT23 in the absence of stimulator compound displays KdCO=710 nM (
[0044] IWP-854 and BAY 41-2272 Share a Common Binding Site.
[0045] Cross-linking with IWP-854 was visualized by probing the biotin affinity tag through western blot (
[0046] To assess compound specificity, IWP-854 cross-linking was monitored in the presence of increasing concentrations of BAY 41-2272 (
[0047] Stimulator binding to Cb SONO was characterized, as well as three H-NOX proteins from Nostoc sp. 7120 (Ns H-NOX), Shewanella oneidensis (So H-NOX), and Shewanella woodyi (Sw H-NOX). IWP-854 labeled all four bacterial H-NOX proteins (
[0048] Identifying Cross-Linked Residues in sGC and Bacterial H-NOX Proteins.
[0049] Residues labeled by IWP-854 were identified by liquid chromatography tandem mass spectrometry (LC-MS/MS) using an LTQ Velos Orbitrap mass spectrometer. Initial examination of IWP-854 alone (molecular mass 1,450.743 Da) revealed a distinct and highly reproducible fragmentation pattern (
[0050] The availability of multiple Ms sGC-NT constructs in high purity and abundance allowed for numerous experiments to be undertaken under varying conditions. Hs sGC and four bacterial H-NOX proteins were also examined using similar conditions to those initially developed with Ms sGC-NT. Results from a total of 43 experiments are reported in Table 1 and Table 2.
[0051] Identification of peptides was to high mass accuracy in all cases; however, certain peptides were detected more often than others (Table 1). The majority of labeled residues identified in Ms sGC-NT and Hs sGC are expected to originate from the stimulator-binding pocket, as evidenced by diminished cross-linking in the presence of excess BAY 41-2272 (
[0052] Labeling of Ms sGC-NT23 remained the same NO or CO, consistent with a stimulator binding site that does not greatly change upon heme ligation. Likewise, labeling did not appreciably differ in the presence (Ms sGC-NT13) or absence (Ms sGC-NT23) of the 1 pseudo H-NOX domain, indicating this domain does not harbor the stimulator binding site. IWP-854 labeling of Ms sGC-1 is similar to the other Ms sGC-NT constructs despite lacking the 1 chain and displaying poor competition with BAY 41-2272. Labeled peptides identified in the Ms sGC-NT constructs agreed well with those in full-length Hs sGC and were found nearly exclusively in the 1 subunit, as expected from the western blot analyses. Likewise, many labeled peptides identified in the bacterial H-NOX proteins overlapped with those from sGC.
[0053] The most frequently observed cross-links cluster together in previous models for Ms sGC-NT. IWP-854 predominately labeled the sGC 1 chain on H-NOX alpha helix A (Ms residues 1 6-8), H-NOX helix C (Hs residues 1 48-51), and the coiled-coil domain (Ms residues 1 361-362, 365-366; Hs residues 1 370-371, 374-375, 385). Cross-linking to helix A was also seen for the bacterial H-NOX proteins Cb SONO and Sw H-NOX, while cross-linking to helix C was seen for Ns H-NOX, So H-NOX, and Sw H-NOX. Helix D was not labeled in any sGC constructs but was detected in bacterial H-NOX proteins Ns H-NOX, Sw H-NOX, and Cb SONO. Labeled residues in A, C, and D localize around the interface of two subdomains in the 1 H-NOX and are predicted to reside near cross-linked residues in the coiled-coil and the linker connecting the PAS and coiled-coil domains of Ms sGC-NT (Ms residues 1 328-331).
[0054] Cross-links to residues Ms sGC-NT 1 195-198 were seen in 11 of 26 measurements; however, these residues lie in the linker between the H-NOX and PAS domains and in a different region of the model. This discrepancy could be due to limitations in modeling, a slight degree of non-specific binding or high dynamics in this loop.
[0055] A number of additional cross-linked peptides were detected on a less frequent basis in these experiments, and are listed in Table 1 and Table 2. Modifications to the 1 H-NOX/PAS linker were observed in all three Ms sGC-NT constructs, but not Hs sGC. Additionally, a variety of cross-links were detected in individual bacterial H-NOX proteins that do not agree with the most common binding arrangement. These are likely the result of unspecific labeling introduced by increased cross-linker concentrations and/or changes in compound affinity, as evidenced by incomplete elimination of IWP-854 cross-linking by competition with BAY 41-2272. For this reason, only labeled residues that were identified in multiple bacterial H-NOX proteins were considered to be part of the binding site.
[0056] Stimulators have been proposed to bind to a pseudosymmetric site in the cyclase domains similar to forskolin binding to adenylyl cyclase. Ms sGC-NT constructs retain stimulator binding and response despite lacking both cyclase domains, suggesting the primary stimulator binding site resides in the N-terminal two-thirds of the protein. The possibility of a secondary stimulator binding site in the catalytic domains was examined using photoaffinity cross-linking of full length Hs sGC. A single cross-link was found in cyclase domain (Hs residue 1 629, Table 2), which lies on the surface of the protein near where the coiled-coil attaches. No cross-links were found to residues in the cyclase domain active site or pseudosymmetric site, rendering the possibility of a secondary stimulator binding site unlikely. One additional cross-link to the human 1 chain was observed (Hs peptide 1 45-47, Table 2). The two 1 chain cross-links may represent the non-specific 1 cross-link found by western blotting.
[0057] Molecular Modeling of Stimulator Binding to sGC.
[0058] The cross-linking data indicate the primary binding site for stimulator compounds is in the 1 H-NOX domain. While a high-resolution structure of an sGC H-NOX domain has not been reported, crystal structures for several bacterial homologues are known, including those from Thermoanaerobacter tengcongensis (Tt H-NOX, recently renamed as Caldanaerobacter subterraneus, Cs H-NOX), Nostoc sp. (Ns H-NOX and Shewanella oneidensis (So H-NOX). These structures display the same overall fold and provide a solid scaffold for understanding H-NOX structure in sGC function. The overall H-NOX fold is 180 residues long and displays an N-terminal sub-domain encompassing residues 1-60, which is dominated by a 3-helix bundle, followed by a larger mixed helix/sheet sub-domain that contains the heme pocket. Alignment of the larger sub-domains of several H-NOX structures indicate the smaller and larger domains can move independently of one another, altering the orientation of the two domains, which may be key for signal transduction by H-NOX domains and proteins.
[0059] Most of the residues implicated in compound binding by cross-linking lie at the interface of the small and large H-NOX domains (
[0060] Modeling of compound into heterodimeric sGC was more challenging since atomic-level models are unavailable. For this, a model for Ms sGC-NT based on small-angle X-ray scattering (SAXS) was utilized, chemical cross-linking and homology modeling of the H-NOX, PAS and coiled-coil domains. The cross-links found most frequently in the present example were to the coiled-coil. Encouragingly, these residues residue near the labeled residues in the A, C, and D helixes of 1 H-NOX domain (
[0061] Discussion
[0062] In the present example, a photoactivatable stimulator compound coupled with LC-MS/MS was used to narrow the binding site to the .sub.1 H-NOX domain. No binding to the cyclase domain active site or pseudosymmetric site was observed, nor was there binding to the 1 H-NOX domain. The discrepancy likely results from choice of cross-linker, with the present example utilizing a diazirine versus an aryl azide in the former. Diazirines improve upon aryl azides with quicker reaction times and a lower frequency of stable intermediates that are capable of diffusing away from the binding site, which in the case of aryl azides, includes ketenimine decay products that react strongly with nucleophiles such as cysteines. Since both residues previously identified were cysteines, the reaction may have been with the ketenimine.
[0063] A possible binding complex in which stimulators bind at the interface of the two H-NOX subdomains was modeled, where the majority of the cross-links were located (
[0064] In summary, the present example has shown that stimulator compounds bind to the H-NOX domain of sGC and that binding also occurs in bacterial homologs. Binding likely occurs at the interface of the H-NOX large and small sub-domains, and may act through both inducing an active conformation and through directly blocking a tunnel for gas release to bulk solvent. These data provide insight into sGC function and stimulator action, and provide a roadmap for improved compounds targeting cardiovascular disease. A photolyzable cross-linking compound with a signature cleavage pattern that may be of general applicability has also been described.
Methods
Example 1: The Following are Exemplary Synthetic Procedures, and are Included Herein as Non-Limiting Example
[0065] Syntheses.
[0066] IWP-051 and IWP-898 (identical to phosphodiesterase 9A inhibitor PF-044447943) were produced. Photoactive cross-linker IWP-854 was synthesized and purified in a manner similar to IWP-051.
[0067] Materials:
[0068] Chemicals were purchased from Sigma Aldrich unless otherwise indicated. Uniformly labeled 15N-ammonium chloride and deuterium oxide (D2O) were purchased from Cambridge Isotope Laboratories. 2-(N,N-Diethylamino)diazenolate-2-oxide (DEA/NO) was provided by Dr. Katrina Miranda from the University of Arizona. Full-length human sGC was purchased from Enzo Life Sciences, Inc. HEK293T cells were acquired from the American Type Culture Collection (ATCC). Turbofect was purchased from Fermentas. DMEM media was purchased from Gibco Life Technologies. Fetal bovine serum was obtained from the University of Arizona Cancer Center. cGMP was measured using a commercially available homogenous time-resolved fluorescence (HTRF) immunoassay (CisBio). Sequencing grade trypsin was acquired from Promega, and C18 zip-tips were purchased from Pierce Thermo Fisher Scientific.
[0069] Generation of Ms sGC-NT Constructs.
[0070] Multiple N-terminal truncations of Manduca sexta sGC (Ms sGC-NT) were utilized in the present example (
[0071] cDNA coding for Ms sGC residues .sub.1 49-459 was PCR amplified from plasmid pETDuet-1-msGC-NT2 using primers F1 and R1. The PCR product was subcloned into vector pGEM-T. After digestion with BamHI and NotI, the new .sub.1 subunit was cloned into plasmid pETDuet-1-NT2 following removal of the original .sub.1 subunit using the same restriction enzymes. The Ms sGC .sub.1 subunit was shortened by inserting a stop codon (TAA) after Leu 389 using the QuikChange Lightning Site-Derected Mutagensis Kit with primers F2 and R2. The resulting pETDuet-1-NT25 plasmid codes for Ms sGC-NT25.
[0072] Ms sGC-NT23 was generated from Ms sGC-NT25 by removing the .sub.1 pseudo H-NOX coding sequence (.sub.1 49-271). To accomplish this, an equimolar mixture of DNA oligonucleotides F3 and R3 were annealed by heating at 95 C. for 2 min and cooling to room temperature, yielding a 37-base double-stranded DNA with 4-base overhangs on each end. This was ligated into the pETDuet-1-NT25 plasmid with T4 ligase after removal of the .sub.1 H-NOX domain with BamHI and NheI restriction enzymes. The resulting plasmid, pETDuet-1-NT23, codes for Ms sGC-NT23.
[0073] Expression and Purification of Ms sGC-NT.
[0074] Ms sGC-NT13 and Ms sGC-NT21 were expressed and purified from Eschercheria coli. Ms sGC- (1 1-380) was purified by washing Ms sGC-NT21 bound to a TALON Superflow Metal Affinity column with 50 mM sodium phosphate buffer (pH 7.4), 300 mM NaCl saturated with 1 mM CO. CO binding to Ms sGC-NT21 led to selective elution of the 1 subunit while the al subunit remained bound to the column.
[0075] Ms sGC-NT23 was expressed in the same manner as Ms sGC-NT13 and Ms sGC-NT21. For purification of Ms sGC-NT23, pellets were resuspended in 50 mM sodium phosphate buffer (pH 7.4), 300 mM NaCl, 0.75 mM DNase I, and protease inhibitors (2 mM MgCl2, 1 mM PMSF, 1 g/mL aprotinin, 1 g/mL leupeptin, 0.25 mg/mL lysozyme, 1 mM benzamine), and lysed by French press. Cell debris was removed by ultracentrifugation at 40,000 rpm in a Ti45 rotor for 35 min at 4 C. The sample was loaded onto a TALON Superflow Metal Affinity column and washed with 50 mM sodium phosphate (pH 7.4), 300 mM NaCl. Protein was eluted by supplementing wash buffer with 30 mM EDTA. Protein fractions were loaded onto a Strep-tactin Sepharose column and eluted with 3 mM desthiobiotin in 50 mM sodium phosphate buffer (pH 7.4), 100 mM NaCl, 5% glycerol. Samples were concentrated using Viva spin 30 kDa protein concentrators and flash frozen in liquid nitrogen for storage at 80 C.
[0076] Expression and Purification of Bacterial H-NOX Proteins.
[0077] The genes coding for H-NOX proteins from Nostoc sp. PCC 7120 (Ns H-NOX; residues 1-182; NCBI Ref Seq: WP_010996435.1), Shewanella oneidensis (So H-NOX; residues 1-181; NCBI Ref Seq: WP_011072197.1), Shewanella woodyi (Sw H-NOX; residues 1-182; NCBI Ref Seq: WP_012325363.1), and Clostridium botulinum (Cb SONO; residues 1-186; NCBI Ref Seq: WP_012048396.1) were synthesized and cloned into the pET21b+ vector between restriction sites NdeI and XhoI by GenScript Biotech Corporation. All bacterial H-NOX proteins were engineered to contain a C-terminal TEV cleavage sequence and a 6 poly-histidine purification tag (ENLYFQSLEHHHHHH).
[0078] With the exception of Sw H-NOX, the plasmid coding for the bacterial H-NOX protein of interest was transformed into Rosetta competent cells and grown to an OD.sub.600 of 0.8-1.0 in Terrific Broth media at 37 C. while shaking at 225 rpm. Protein expression was initiated by adding 0.5 mM IPTG and 0.1 mM -aminolevulinic acid, and continued for 18-22 hours at 16 C. The plasmid coding for Sw H-NOX was transformed into Escherichia coli Tuner (DE3) pLysS competent cells and grown to an OD.sub.600 of 0.8-1.0 in M9 media at 37 C. Protein expression was induced with 0.1 mM IPTG and 0.05 mM -aminolevulinic acid, and expressed for 18-22 hours at 22 C. All bacterial cultures were harvested in the same manner as Ms sGC-NT constructs.
[0079] Pellets containing were resuspended in 50 mM sodium phosphate buffer (pH 7.4), 300 mM NaCl, 0.75 mM DNase I, 1 mM PMSF and lysed by French press. Cell debris was removed by ultracentrifugation at 40,000 rpm in a Ti45 rotor for 35 min at 4 C. The supernatant was loaded onto a Ni-NTA affinity column and washed with 50 mM sodium phosphate (pH 7.4), 300 mM NaCl. Ns H-NOX and So H-NOX were eluted with wash buffer supplemented with 30 mM EDTA. Sw H-NOX and Cb SONO required higher salt concentrations for elution and were eluted in 50 mM sodium phosphate buffer (pH 7.4), 500 mM NaCl, 30 mM EDTA. Buffer was exchanged to 50 mM sodium phosphate (pH 7.4), 90 mM NaCl, <0.1 mM EDTA using Viva spin 10 kDa protein concentrators and the sample was incubated with a 1:100 (protease:protein) ratio of TEV protease (purified in-house) at 4 C. overnight. The reaction solution was passed over a Ni-NTA column and cleaved protein was collected in the flow through. The protein of interest was further purified by gel filtration using Superdex 200 that was pre-equilibrated with 50 mM sodium phosphate buffer (pH 7.4), 100 mM NaCl. Protein concentrations were assessed spectroscopically based on Soret and with the Bradford Protein Assay. All samples were frozen in liquid nitrogen for storage at 80 C.
[0080] Ns H-NOX and So H-NOX were purified in the ferrous state (soret maximum 431 nm), while Cb SONO and Sw H-NOX were purified in the ferric state (soret maximum 408 nm and 416 nm, respectively). Cb SONO and Sw H-NOX were fully reduced to the ferrous state (soret maximum 431 nm) immediately prior to use by incubating the sample with 2 mM sodium dithionite in degassed buffer for 10 minutes at room temperature. Sodium dithionite was removed from samples used in PAL experiments using 10 kDa Zebra spin desalting columns.
[0081] Determination of CO Dissociation Constants:
[0082] A 1 mM CO solution was obtained by bubbling 50 mM sodium phosphate buffer (pH 7.5), 100 mM NaCl, 5% glycerol with CO for 10-20 minutes. The Ms SGC-NT or bacterial H-NOX construct to be analyzed (25 nM) was suspended in 50 mM sodium phosphate buffer, pH 7.5, 100 mM NaCl, 5% glycerol in the presence or absence of 5-10 M IWP-854 or BAY 41-2272. A final concentration of 5% EtOH was added to the solution to maintain solubility of BAY 41-2272. CO titrations were measured using a Cary350 UV/visible spectrophotometer using a cuvette with 10 cm pathlength. K.sub.d.sup.CO measurements were calculated.
[0083] Generation of Human sGC Constructs:
[0084] cDNA sequences for Hs SGC- (1 1-690-Strep-tag II) and Hs SGC-1 (1 1-619-His.sub.6) were PCR amplified using primers F4, R4, F5 and R5. PCR products for Hs sGC-1 were cloned into plasmid pCMV_3 TAG9 between restriction sites BAMHI and HindIII and PCR products for Hs SGC-1 were cloned into pCMV_3 TAG3A between sites SacI and XhoI to create the plasmids pCMV_1 and pCMV_1, which code for Hs SGC- and Hs SGC-, respectively.
[0085] Expression of human sGC.
[0086] HEK 293T cells were grown in Dulbecco's Modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum (FBS) at 37 C. with 5% CO.sub.2. A transfection mixture containing 22.5 g pCMV_1, 2.5 g pCMV_1, and 31.25 L Turbofect was assembled in 2.5 mL serum-free DMEM. The transfection mixture was incubated for 30 min at room temperature and added drop wise to a 10 cm dish containing HEK 293T cells grown to 75% confluency. Protein expression continued for 24 h at 37 C. with 5% CO.sub.2. Transfection and protein expression were confirmed by western blot, probing for the Strep and His.sub.6 purification tags found on the C-terminus of sGC sGC-1 and sGC-1, respectively.
[0087] cGMP Measurement.
[0088] HEK 293T cells transfected with human sGC were washed twice with PBS and suspended in 50 mM Tris buffer (pH 7.4), 100 mM NaCl, 8 mM MgCl.sub.2, 0.5 mM IMBX, protease inhibitors (1 mM PMSF, 1 g/mL aprotinin, 1 g/mL leupeptin, 1:100 dilution of protease inhibitor cocktail) and lysed with 30 stokes of a 25-gauge needle. Cell debris was removed by centrifugation at 16,000 g for 20 minutes at 4 C. Transfected HEK 293T lysate was divided into equal aliquots and combined with 5 M stimulator (IWP-854 or BAY 41-2272) and/or 100 M DEA/NO, as indicated. Samples were incubated at room temperature for 10 min to permit NO release and the reaction initiated with 1 mM GTP. Reactions proceeded for 5 min at 37 C. before quenching with 1.5% glacial acetic acid. Precipitated protein was removed by centrifuging at 16,000 g for 10 min at 4 C., and the supernatant was diluted 1:100 in 50 mM sodium phosphate (pH 7.0), 0.2% BSA, 0.2% NaN.sub.3. cGMP was quantified using a commercially available homogenous time resolved fluorescence (HTRF) assay. All samples were measured in duplicate using a SynergyH1 fluorescent plate reader. HTRF measurements were analyzed according to the manufacturer's instructions using Sigma Plot.
[0089] Photoaffinity Labeling sGC with IWP-854.
[0090] The sGC or bacterial H-NOX construct of interest (1 M for human sGC or Ms SGC-NT, 10 M for bacterial H-NOX proteins) was suspended in 50 mM sodium phosphate buffer, pH 7.5, 100 mM NaCl. DEA/NO (100 M), CO (50 M), and/or varying concentrations of BAY 41-2272 (1-25 M for human sGC or Ms SGC-NT, 10-100 M for bacterial H-NOX proteins) were added. For competition assays, a final concentration of 5% ethanol was added to maintain the solubility of BAY 41-2272. Samples were equilibrated for 10 min at room temperature before adding IWP-854 (1 M for sGC constructs, 10 M for bacterial H-NOX proteins). Following a 10 min incubation at room temperature in the dark, samples were placed in a 96-well tissue culture plate and irradiated for 15 minutes on ice with broad band UV light (366 nm maximum) using a multiband UVGL-58 lamp. Protein stability throughout the experiment was assessed based on heme content, which was monitored spectroscopically using a Cary350 UV/visible spectrophotometer.
[0091] Western Blot Analysis of Photoaffinity Labeled Samples.
[0092] Photoaffinity labeling of IWP-854 to sGC or bacterial H-NOX proteins was visualized by probing the biotin affinity-tag via western blot. Total protein was visualized by probing for Strep or His.sub.6 purification tags, by probing the 1 subunit directly, or by Ponceau stain. With the exception of Hs sGC, all samples were suspended 1SDS loading buffer and a total of 1 g protein was run on a 15% bis-acrylamide gel for 90 minutes at 96 V. Samples were transferred to a nitrocellulose membrane at 100 V for 1 h at 4 C. The membrane was blocked for 1 h in 5% BSA dissolved in PBST, and incubated in a 1:1000 dilution of primary antibody (Cell Signal #5597 for probing the biotin affinity-tag of IWP-854; Abcam ab76949 for probing the Strep-tag, QED Biosciences #18814-01 for probing His.sub.6) over night at 4 C. The membrane washed three times in PBST and incubated in secondary antibody (IRDye 680 goat anti-rabbit secondary mAB, Licor 925-68071) for 2 h while shaking at room temperature. Membranes were washed an additional three times and imaged using the Odyssey Infrared Imaging System.
[0093] For Hs sGC, samples were suspended in 2 of loading buffer and a total of 200 ng of protein was run on a NuPAGE 4-12% Bis-Tris gel for 60 min at 160 volts. Samples were transferred to a nitrocellulose membrane at 150 volts for 45 min at room temperature. The membrane was blocked for 1 h in PBS with 5% dry fat free milk. The membrane was washed three times with TBS containing 0.1% Tween-20 and incubated for 1 h at room temperature in a 1:2000 dilution of IRdye800 streptavidin, into Odyssey blocking buffer containing 0.2% Tween-20. The membrane was washed three times with TBS-T and imaged using Odyssey CLx imager. Then, as a loading control, membrane was incubated overnight at 4C in 1:1000 dilution of antibody against sGC 1 subunit into TBS-T with 5% BSA, The membrane was washed three times with TBS-T and incubated for 1 h at room temperature in a 1:15000 dilution of IRdye680 secondary antibody into Odyssey blocking buffer containing 0.2% Tween-20. The membranes were washed three times with TBS-T and imaged using Odyssey CLx imager.
[0094] Preparing Samples for Mass Spectrometry.
[0095] Photoaffinity samples were buffer exchanged to 100 mM ammonium bicarbonate (pH 8.0) using 10 kDa centrifugal filters. Samples were reduced with 12 mM dithiothreitol for 45 min at 56 C. and alkylated with 20 mM iodoacetic acid for 30 min at room temperature in the dark, followed by overnight digestion with a protease:protein ratio of 1:30 trypsin at 37 C. or 1:50 of chymotrypsin at 30 C. A final concentration of 1 mM CaCl.sub.2) was added to samples digested with chymotrypsin. Digested peptides were cleaned using C18 zip tips according to manufacturer's instructions and dried via speedvac prior to storage in 20 C.
[0096] LC-MS/MS Analysis of Photoaffinity Labelled Samples.
[0097] Digested peptides were analyzed by LC-MS/MS using a LTQ Velos Orbitrap mass spectrometer. Peptides were eluted from a C18 pre-column (100 M i.d.2 cm) onto an analytical column (75 M i.d.2 cm). Solvent A was 0.1% FA. Solvent B (ACN, 0.1% FA) was applied as follows: 5-20% B over 75 min, 20-35% B over 25 min, 35% B for 19 min, 3 min ramp to 95% B and held for 18 min. Flow rates were 400 nL/min directed to an Advion NanoMate nano-ESI source held at 1.75 eV applied voltage.
[0098] Data-dependent scanning was performed with Xcalibur v 2.1.0 software using a survey mass scan at 60,000 resolution in the Orbitrap analyzer scanning mass/charge (m/z) range of 350-1600, followed by collision-induced dissociation tandem mass spectrometry (MS/MS) of the 6 most intense ions in the linear ion trap analyzer at 7,500 resolution. Precursor ions were selected by the monoisotopic precursor selection setting, with the instrument set to observe fragment ions once and then excluded from analysis for 45 seconds, allowing for interrogation of lower abundance ions. Ions were excluded with a 10 ppm window. Precursor ions with a charge less than +3 were excluded from selection since ionization of IWP-854 increases the charge state of the peptide, making labeled peptides prone to higher charge states (+3 and above) and allowing for exclusion of peptides with a charge below +3.
[0099] Tandem mass spectra were searched against a protein database made by combining sequences for the sGC and H-NOX constructs analyzed with the proteome for Escherichia coli BL21, and common human contaminants. The combined database contained 5200 entries. MS/MS spectra were searched against the described protein database using Thermo Proteome Discovered 1.3, version 1.3.0.339 considering the tryptic peptides with up to 2 missed cleavage sites. Iodacetamide derivatives of cysteines and oxidation of methionines were specified as variable modifications. Modification by IWP-854 (1422.717 Da) was searched against all 20 amino acid residues in an iterative fashion. Proteins and peptides were visualized using the Discoverer software. IWP-854 modified residues were confirmed by the presence of a peak corresponding to the mass of the precursor ion minus the mass of 270.127 (
[0100] Molecular Modeling.
[0101] A molecular model for Ms sGC-NT13 was previously assembled using small-angle X-ray scattering (SAXS), chemical cross-linking and domain homology modeling. Models for compounds IWP-051 and IWP-864 were prepared by first generating a SMILES string in ChemDraw and submitting the string to the Grade Web Server (http://grade.globalphasing.org/cgi-bin/grade/server.cgi), which generates energy-minimized coordinates and refinement parameters using known structures for the Cambridge Structural Database. Modeling of compound binding was done manually in COOT followed by energy minimization in REFMAC5 as encoded in CCP4i. Structure figures were prepared using the Discovery Studio Visualizer 4.1 and PyMOL Molecular Graphics System, Version 1.8 Schrdinger, LLC.
TABLE-US-00001 TABLE 1 Summary of Residues Multiply Modified by IWP-854 Modified Residues.sup.a Construct Occurance.sup.b Error.sup.c H-NOX Domain 1 6-8 Ms sGC-NT.sup.d 18/26 3.2 2.0 Sw H-NOX 2/2 0.8 0.8 Cb SONO 3/3 1.78 1 41-48 Hs sGC 1/2 1.4 So H-NOX 4/4 1.5 2.3 Ns H-NOX 3/3 0.4 0.1 Sw H-NOX 2/2 2.1 0.4 1 48-51 Hs sGC 2/2 2.1 1.0 Ns H-NOX 3/3 1.1 0.7 1 76-82 Ns H-NOX 2/3 1.3 1.0 Sw H-NOX 2/2 3.5 0.1 Cb SONO 3/3 1.07 Linker (H-NOX Domain - PAS Domain) 1 195-198 Ms sGC-NT.sup.d 11/26 2.4 1.0 Linker (PAS Domain - CC Domain) 1 328-331 Ms sGC-NT.sup.d 11/26 3.1 1.7 CC Domain 1 361-362 Ms SGC-NT.sup.d 26/26 2.6 1.7 1 365-366 Ms sGC-NT.sup.d 26/26 2.6 1.7 1 370-371 Hs sGC 2/2 1.4 0.3 1 374-375 Hs sGC 2/2 1.4 0.3 1 385 Hs sGC 2/2 2.2 2.8 .sup.aIncluded are peptides and sequence regions modified in more than one species. Modified residues are listed where known. A range of residues is listed where the exact modified residue could not be determined due to incomplete fragmentation. All peptides were in either the +3 or +4 charge states and had masses between 2200 and 3900 Da. Complete mass and charge information can be found in Table 2. .sup.bThe number of times a peptide was identified out of the total number of experiments conducted. .sup.cErrors listed are the average and standard deviation of mass discrepancies for all peptides identified. For n = 2, the range is presented, and for n = 1, the single value is listed for M. .sup.dFor Ms sGC-NT experiments, results for all constructs and ligation states (NO, CO, etc.) are combined. In each case, the modified peptide was identified in all Ms sGC-NT constructs analyzed (Ms sGC-NT23, Ms sGC-NT13, Ms sGC-1).
[0102] Table 2: Detailed Summary of Peptides Modified by IWP-854.
[0103] Depicted is a summary of peptides modified by IWP-854 from Ms sGC-NT23, Ms sGC-NT13, Ms sGC 1 (1-380), Hs sGC, Ns H-NOX, So H-NOX, Sw H-NOX, and Cb SONO. Samples were photoaffinity labeled with IWP-854 and digested with trypsin or chymotrypsin as indicted. Digested samples were then analyzed by LC-MS/MS with both rounds of MS measured in high resolution mode. Peptides labelled by IWP-854 were identified both manually and using the Discoverer program
TABLE-US-00002 TABLE 2 Modified Modified Mass Error Construct Peptide Residues.sup.a n.sup.b X.sub.corr (Da).sup.c Charge [ppm].sup.d H-NOX Domain Ms sGC-NT.sup.e 1 1-15 N6, Y7, A8 18/26 4.19 3213.67 +3, +4 3.2 2.0 (Trypsin) 3229.65 3245.64 Hs sGC 1 42-53 P45-C47 1/2 0.97 2736.40 +4 1.4 (Trypsin) 1 41-47 I41-D44 1/2 0.57 2276.15 +3 1.4 1 48-57 T48-L51 2/2 0.97 2461.27 +4 2.1 1.0 So H-NOX 41-57 Y42-E46 4/4 4.35 3364.68 +3, +4 1.5 2.3 (Trypsin) Ns H-NOX 38-43 M40-Y43 2/3 0.51 2092.01 +3 0.9 0.5 (Chymotrypsin) 38-49 S44-D46 2/3 2.05 2772.27 +3, +4 0.4 0.1 2788.27 0.3 44-49 S44-D46 3/3 1.05 2122.00 +3 0.7 0.7 44-59 S44-D46 2/3 1.65 3098.54 +4 0.2 50-59 H50-V52 2/3 1.24 2418.27 +3, +4 0.7 0.4 50-66 H50-V52 2/3 3.58 3142.65 +3, +4 1.1 0.7 71-74 G71-E72 2/3 0.92 1976.94 +3 1.0 0.7 78-86 S79-G84 2/3 1.75 2407.14 +3 1.3 1.0 78-87 S79-G84 2/3 0.96 2520.21 +3, +4 1.1 0.7 168-177 D173-D175 3/3 0.80 2641.25 +3, +4 0.8 0.5 182-187 E182-A184 3/3 0.64 2218.07 +3 0.3 0.3 Cb SONO 2-10 T7 3/3 0.82 2383.27 +3 1.78 (Trypsin) 76-95 A76-Y81 3/3 1.76 3888.86 +3 0.78 176-181 N176-Y177 3/3 0.49 2237.13 +3 1.07 Sw H-NOX 6-15 T6-G7 1/2 1.00 2086.05 +3 0.8* (Chymotrypsin) 22-30 S28-L30 1/2 1.97 2445.18 +3 2.27* 26-36 S28-L30 1/2 0.92 2680.24 +3 3.04* 31-42 I35-Y36 1/2 1.89 2713.30 +3 1.89* 37-47 E46-L48 1/2 0.99 2658.26 +3, +4 2.1 0.4 37-48 E46-L48 1/2 0.83 2771.35 +3, +4 2.7 0.8 43-48 E46-L48 2/2 0.46 2177.09 +3 1.1 0.1 49-65 P61-L65 2/2 2.02 3249.76 +3, +4 1.8 1.7 51-65 P61-L65 1/2 1.99 3008.61 +3, +4 2.7 0.7 55-65 E56-V57 1/2 2.59 2569.30 +3, +4 2.5 1.2 55-69 E56-V57 1/2 2.46 3084.62 +4 2.2 74-92 V75-L77 1/2 2.45 3627.76 +4, +5 3.5 0.1 96-107 E105-Y107 1/2 2.18 2859.45 +4 1.95 146-152 L146 1/2 0.23 2193.04 +3 6.14* 146-153 E151 1/2 0.30 2340.10 +3 7.27* 157-174 D167-E169 1/2 0.30 3361.64 +4 1.2* H-NOX/PAS Linker Ms sGC-NT.sup.e 1 189-205 A195-E198 11/26 2.45 3248.71 +3, +4 2.4 1.0 (Trypsin) PAS Domain Ms sGC-NT.sup.e 1 206-213 F211-R213 5/26 0.60 2303.16 +3, +4 3.2 1.9 (Trypsin) Ms sGC-NT23 1 279-286 F284-K286 2/26 0.52 2247.15 +3 0.1 0.0 (Trypsin) PAS/Coiled-Coil Linker Ms sGC-NT.sup.e 1 328-341 G328-I331 11/26 2.83 2917.51 +3, +4 3.1 1.7 (Trypsin) Coiled-Coil Ms sGC-NT.sup.e 1 356-366 E361, V362 26/26 1.63 2696.43 +3, +4 2.6 1.7 (Trypsin) D365, K366 Ms sGC-1 1 342-355 E351-D353 2/26 1.66 3110.50 +4 0.9 2.5 (Trypsin) 3126.51 Hs sGC 1 375 E370, I371 2/2 1.19 2753.44 +3, +4 1.4 0.3 (Trypsin) D374, R375 1 382-387 D385 2/2 0.39 2127.07 +3 2.2 2.8 Cyclase Hs sGC 1 628-637 C629 1/2 0.82 2561.25 +4 1.14 (Trypsin) .sup.aResidues modified by IWP-854 are listed where known. Where the exact modified residue cannot be determined due to incomplete fragmentation, a range of residues is listed. .sup.bThe number of times a peptide was identified out of the total number of experiments conducted (n). For Ms sGC-NT experiments, results for all constructs and ligation states (NO, CO, etc.) are combined. .sup.cAll peptides were in either the +3 or +4 charge states and had masses between 2200 and 3900 Da. .sup.dErrors listed are the average and standard deviation of mass discrepancies for all peptides identified. For n = 2, the range is presented, and for n = 1, the single value is listed for M. .sup.ePeptides that were modified by IWP-854 in all Ms sGC-NT constructs analyzed (Ms sGC-NT23, Ms sGC-NT13, Ms sGC 1).
[0104] As used herein, the term about refers to plus or minus 10% of the referenced number.
[0105] Various modifications of the invention, in addition to those described herein, will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. Each reference cited in the present application is incorporated herein by reference in its entirety.
[0106] Although there has been shown and described the preferred embodiment of the present invention, it will be readily apparent to those skilled in the art that modifications may be made thereto which do not exceed the scope of the appended claims. Therefore, the scope of the invention is only to be limited by the following claims. In some embodiments, the figures presented in this patent application are drawn to scale, including the angles, ratios of dimensions, etc. In some embodiments, the figures are representative only and the claims are not limited by the dimensions of the figures. In some embodiments, descriptions of the inventions described herein using the phrase comprising includes embodiments that could be described as consisting of, and as such the written description requirement for claiming one or more embodiments of the present invention using the phrase consisting of is met.