Supramolecular Complex of a Biomolecule
20250257111 ยท 2025-08-14
Inventors
- Oren Scherman (Cambridge, Camrbidgeshire, GB)
- Xiaoyi Chen (Cambridge, Cambridgeshire, GB)
- Zehuan Huang (Cambridge, Cambridgeshire, GB)
- Jade McCune (Cambridge, Cambridgeshire, GB)
Cpc classification
C07K1/34
CHEMISTRY; METALLURGY
C07K1/1136
CHEMISTRY; METALLURGY
International classification
Abstract
The invention provides a ternary complex of a cucurbituril host, such as CB[8], with a first aryl guest and second aryl guest, wherein the first aryl guest is a group within a biomolecule, such as insulin, and the first and second aryl guests are different. Also provided is a method of decomplexing the complex, the method comprising the step of treating the complex with a competitor guest, and permitting the competitor guest to displace at least the first aryl guest from the complex.
Claims
1. A ternary complex of a cucurbituril host with a first aryl guest and second aryl guest, wherein the first aryl guest is a group within a biomolecule, and the first and second aryl guests are different.
2. The complex of claim 1, wherein the cucurbituril host is CB[8] or a variant or derivative thereof.
3. The complex of claim 2, wherein the cucurbituril host is CB[8].
4. The complex of any one of the preceding claims, wherein the first aryl guest is selected from phenyl, naphthyl, such as naphth-2-yl, hydroxyphenyl, such as 4-hydroxyphenyl, and indolyl, such as indol-3-yl
5. The complex of any one of the preceding claims, wherein the second aryl guest is a fluorophenyl, such as perfluorophenyl.
6. The complex of any one of the preceding claims, wherein the second aryl guest is covalently linked, directly or indirectly, to a solid phase.
7. The complex of any one of the preceding claims, wherein the biomolecule is a polypeptide.
8. The complex of any one of the preceding claims, wherein the first aryl guest is provided within the side chain of a phenylalanine, tyrosine, indole or naphthylalanine residue.
9. The complex of any one of the preceding claims, wherein the biomolecule is insulin, such as human insulin, such as recombinant human insulin.
10. The complex of claim 9, wherein the first guest is the phenyl group within the phenylalanine residue at position 1 of insulin.
11. A method of decomplexing a complex according to any one of claims 1 to 10, the method comprising the step of treating the complex with a competitor guest, and permitting the competitor guest to displace at least the first aryl guest from the complex.
12. The method of claim 11, wherein the competitor guest is or comprises adamantane amine (ADA) or memantine (DMADA).
13. A method of forming a complex according to any one of claims 1 to 10, the method comprising the step of mixing a cucurbituril host together with a biomolecule having a first aryl guest, and a second aryl guest, and permitting the host and guests to self-assemble.
14. The method of claim 13, comprising the step of at least partially purifying the complex, such as by filtration.
Description
SUMMARY OF THE FIGURES
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
DETAILED DESCRIPTION OF THE INVENTION
[0033] The present inventors have found that a CB[8] host may be used to form a complex together with a biomolecule, such as a polypeptide, and a second component, which is selected for the formation of a heterogenous pairing with a group of the biomolecules in the host cavity.
[0034] The complex has been found to stabilise a biomolecule, such as insulin, for storage, and the biomolecule may be released on demand for use, and without loss of biomolecule integrity, including structural and functional integrity.
[0035] Li et al. describe the use of a CB[7] to form a binary complex with insulin or human growth hormone (hGH). Here, the CB[7] is covalently connected to the solid phase. The insulin and HGA are releasable from the complex upon treatment with a competitor guest. However, Li et al. do not show or suggest that the formation of the complex is associated with a stabilisation of the insulin or hGH biomolecule.
[0036] Also, the preparation of CB[7], and its covalent connection to the solid-phase, is not straightforward, and is onerous. Thus, accessibility and scalability are a problem. In contrast, the use of readily available and easily prepared CB[8], and related larger cucurbituril hosts, as described in the present case, avoids the problems associated with the use of CB[7] and its derivatives.
Complex
[0037] A complex of the invention comprises a cucurbituril host, such as CB[8], holding non-covalently within its cavity a first guest and a second guest. This is supramolecular complex, and it is also a heterogenous complex, and the first and second guests differ.
[0038] The cucurbituril forms a non-covalent bond to each guest. The present inventors have previously established that complexes of cucurbituril are readily formed and can be used to provide robust non-covalent connections between compounds.
[0039] A complex may be formed between the cucurbituril and a first guest from a biomolecule, and the formation of the complex is tolerant of many functionalities within the monomer.
[0040] The complex also comprises the second guest, and in the preferred embodiments of the invention, this second guest is connected to a solid phase. The formation of the complex is also tolerant of such solid phase functionality.
[0041] Where the complex comprises two guests within the cucurbituril cavity, the association constant, K.sub.a, for that complex is at least 10.sup.3 M.sup.2, at least 10.sup.4 M.sup.2, at least 10.sup.5 M.sup.2, at least 10.sup.6 M.sup.2, at least 10.sup.7 M.sup.2, at least 10.sup.8 M.sup.2, at least 10.sup.9 M.sup.2, at least 10.sup.10 M.sup.2, at least 10.sup.11 M.sup.2, or at least 10.sup.12 M.sup.2. Preferably the association constant is at least 10.sup.10 M.sup.2.
[0042] The cucurbituril hosts two guest molecules, and the guest molecules are different. The formation of a ternary host-guest complex is believed to proceed via an intermediate binary complex.
[0043] The references in the present case to first and second guests is not a description of the order in which the guests complex with the cucurbituril host. Thus, the intermediate binary may form with the second guest and the host, or the first guest and the host.
[0044] The formation of the complex is reversible. The separation of a guest from the cucurbituril host, may be referred to as decomplexation.
[0045] The decomplexation of the complex to separate the host and the guests may occur in response to an external stimulus, including, for example, a competitor guest compound, light, an oxidising or reducing agent, electrochemical potential, and temperature changes amongst others. In the methods of the present invention, the decomplexation of the complex is undertaken under conditions that will not damage or interfere with the structure of the biomolecule. For this reason, displacement methods involving a benign competitor guest compound are favoured.
[0046] A competitive guest for use in the decomplexation of a CB[8]-based network is or comprises adamantane amine (ADA) or memantine (DMADA). A competitor guest may be used in excess to the amount (mole amount) of guests present in the complex. In one embodiment, the competitive guest has a higher association constant than a guest of the complex.
[0047] A decomplexation reaction is reversible. Thus, a complex may be permitted to reform, if needed, after the removal of the external stimulus, including for example the removal of a competitor guest compound.
Cucurbituril and CB[8]
[0048] The complex of the invention has a cucurbituril host. The cucurbituril is a cucurbituril capable of hosting two guests. Thus, the host may be a CB[8] host, including variant and derivative forms.
[0049] Cucurbiturils that are capable of forming ternary complexes and binary complexes find use in the present invention. There are many examples in the art of cucurbiturils for use in forming such complexes.
[0050] Recent work has shown that cucurbituril compounds have high in vitro and in vivo biocompatibility and have extremely low toxicity (see Uzunova et al. Org. Biomol. Chem. 2010, 8, 2037-2042). Thus, when used together with non-toxic polymer components, the present hydrogels are also suitable for use in biological systems.
[0051] The cucurbituril should be capable of forming a ternary complex. For example, CB[8], is capable of forming a ternary complex. As too are CB[10] and CB[12] compounds.
[0052] The cucurbituril may be a CB[8], CB[10] or CB[12] compound.
[0053] In one embodiment, the cucurbituril is a CB[8] compound.
[0054] References to a cucurbituril compound are references to variants and derivatives thereof.
[0055] A variant of CB[8] may include a structure having one or more repeat units that are structurally analogous to glycoluril. The repeat unit may include an ethylurea unit. Where all the units are ethylurea units, the variant is a hemicucurbituril. The variant may be a hemicucurbit[12] uril (shown below, see also Lagona et al. Angew. Chem. Int. Ed. 2005, 44, 4844).
##STR00001##
[0056] Cucurbituril derivatives are provided and find use in the methods described herein. A derivative of a cucurbituril is a structure having one, two, three, four or more substituted glycoluril units. A substituted cucurbituril compound may be represented by the structure below:
##STR00002##
wherein: [0057] n is an integer of at least 8; [0058] and for each glycoluril unit [0059] each X is O, S or NR.sup.3, and [0060] R.sup.1 and R.sup.2 are each independently selected from H and the following optionally substituted groups: R.sup.3, OH, OR.sup.3, COOH, COOR.sup.3, NH.sub.2, NHR.sup.3 and N(R.sup.3).sub.2 where R.sup.3 is independently selected from C.sub.1-20alkyl, C.sub.5-20carboaryl, and C.sub.5-20heteroaryl, or where R.sup.1 and/or R.sup.2 is N(R.sup.3).sub.2, both R.sup.3 together form a C.sub.5-20heterocyclic ring; or together R.sup.1 and R.sup.2 are C.sub.4-6alkylene forming a C.sub.6-8carbocyclic ring together with the uracil frame.
[0061] In one embodiment, one of the glycoluril units is a substituted glycoluril unit. Thus, R.sup.1 and R.sup.2 are each independently H for n1 of the glycoluril units.
[0062] In one embodiment, n is 8, 9, 10, 11 or 12.
[0063] In one embodiment, n is 8, 10 or 12.
[0064] In one embodiment, n is 8.
[0065] In one embodiment, each X is O.
[0066] In one embodiment, each X is S.
[0067] In one embodiment, R.sup.1 and R.sup.2 are each independently H.
[0068] In one embodiment, for each unit one of R.sup.1 and R.sup.2 is H and the other is independently selected from H and the following optionally substituted groups: R.sup.3, OH, OR.sup.3, COOH, COOR.sup.3, NH.sub.2, NHR.sup.3 and N(R.sup.3).sub.2. In one embodiment, for one glycoluril unit, one of R.sup.1 and R.sup.2 is H and the other is independently selected from H and the following optionally substituted groups: R.sup.3, OH, OR.sup.3, COOH, COOR.sup.3, NH.sub.2, NHR.sup.3 and N(R.sup.3).sub.2. In this embodiment, the remaining glycoluril units are such that R.sup.1 and R.sup.2 are each independently H.
[0069] Preferably R.sup.3 is C.sub.1-20alkyl, most preferably C.sub.1-6 alkyl. The C.sub.1-20alkyl group may be linear and/or saturated. Each group R.sup.3 may be independently unsubstituted or substituted.
[0070] Preferred substituents are selected from: R.sup.4, OH, OR.sup.4, SH, SR.sup.4, COOH, COOR.sup.4, NH.sub.2, NHR.sup.4 and N(R.sup.4).sub.2, wherein R.sup.4 is selected from C.sub.1-20 alkyl, C.sub.6-20 carboaryl, and C.sub.5-20heteroaryl. The substituents may be independently selected from COOH and COOR.sup.4.
[0071] In some embodiments, R.sup.4 is not the same as R.sup.3. In some embodiments, R.sup.4 is preferably unsubstituted.
[0072] Where R.sup.1 and/or R.sup.2 is OR.sup.3, NHR.sup.3 or N(R.sup.3).sub.2, then R.sup.3 is preferably C.sub.1-6alkyl. In some embodiments, R.sup.3 is substituted with a substituent OR.sup.4, NHR.sup.4 or N(R.sup.4).sub.2. Each R.sup.4 is C.sub.1-6 alkyl and is itself preferably substituted.
Cucurbituril Guest
[0073] As noted above, a guest is a compound that is capable of forming a host-guest complex with a cucurbituril. The term complexation therefore refers to the establishment of the host-guest complex. The host-guest complex is a ternary complex comprising the cucurbituril host and a first guest and a second guest. A first guest is provided as a group on a biomolecule. A second guest is provided as a group on a second component, which component is optionally a solid phase.
[0074] Typically such complexes are based around CB[8] and variants and derivatives thereof. The first guest and the second guest are not the same.
[0075] In principle, any compound having a suitable binding affinity (such as those mentioned in the discussion of the complex above) may be used in the present invention. The compound used may be selected based on the size of the moieties that are thought to interact with the cavity of the cucurbituril. The size of these moieties may be sufficiently large to permit complexation only with larger cucurbituril forms.
[0076] In the present case, each of the first guest and the second guest is an aryl group.
[0077] The aryl group may be a carboaryl group or a heteroaryl group, and is preferably a carboaryl group.
[0078] The aryl group may be a monocyclic, bicyclic or tricyclic aryl group.
[0079] The aryl group may be a monocyclic or bicyclic aryl group.
[0080] The aryl group may be unsubstituted or substituted, such as monosubstituted.
[0081] Examples of the carboaryl group include phenyl, naphthyl and anthracenyl.
[0082] Examples of the heteroaryl group include indolyl.
[0083] The first and the second guests are not the same. In some embodiments, the aryl ring may be the same, but the guests differ in the number and/or identity of the substituents to the ring. For example in the worked examples of the present case, a phenyl first guest is used with a second guest that has a phenyl ring, but this ring is perfluorinated.
[0084] Each aryl group substituent may be selected from the group consisting of alkyl, halo, OH, OR.sup.1, NH.sub.2, NHR.sup.1, NR.sup.1R.sup.2, where each R.sup.1 and R.sup.2 is independently alkyl.
[0085] Alternatively, each aryl group substituent may be selected from the group consisting of alkyl, halo, OH, OR.sup.1, CN, NO.sub.2, NH.sub.2, NHR.sup.1, NR.sup.1R.sup.2, where each R.sup.1 and R.sup.2 is independently alkyl.
[0086] An alkyl group may be C.sub.1-6 alkyl, such as C.sub.1-4 alkyl, such as methyl or ethyl, such as methyl.
[0087] A halo group may be selected from F, Cl, Br and I, and is preferably F.
[0088] Typically, one aryl guest is halogenated, such as fluorinated, whilst the other aryl guest is not. It is preferred that the second aryl guest is halogenated, such as fluorinated. Here, it is also preferred that the first aryl guest is not halogenated, for example it is not fluorinated.
[0089] The inventors have found that an electron-poor guest, such as a halogenated aryl guest, such as perfluorphenyl, advantageously forms heterocomplexes with an electron-rich guest, such as a substituted aryl guest.
[0090] The first guest is typically an aryl group that is unsubstituted or monosubstituted.
[0091] The second guest is typically an aryl group that is substituted.
[0092] The first guest may be selected from the group consisting of phenyl, naphthyl, such as naphth-2-yl, hydroxyphenyl, such as 4-hydroxyphenyl, and indolyl, such as indol-3-yl.
[0093] The first aryl guest may be covalently connected to a methylene group.
[0094] The first guest may be phenyl or indolyl, including indol-3-yl.
[0095] It is preferred that the first guest is phenyl.
[0096] In another embodiment, the first guest is indolyl, including indol-3-yl.
[0097] The first guest may be present within a side chain of an amino acid residue.
[0098] Where the first guest is phenyl, the amino acid residue may be phenylalanine.
[0099] Where the first guest is hydroxyphenyl, an example amino acid residue where the guest is 4-hydroxyphenyl is tyrosine.
[0100] Where the first guest is naphthyl, an example amino acid residue where the guest is naphth-2-yl is (2-naphthyl)alanine.
[0101] Where the first guest is indolyl, an example amino acid residue where the guest is indol-3-yl is tryptophan.
[0102] The first aryl guest may be provided within the side chain of a phenylalanine, tyrosine, indole or naphthylalanine residue.
[0103] The second aryl guest may be selected from substituted phenyl, indolyl, such as indol-3-yl, naphthyl, such as naphth-2-yl, and anthracenyl, such as anthracene-2-yl. Here, the aryl group is substituted with one or more substituents. The aryl group may be per-substituted.
[0104] That is, each available ring position may be substituted. Here, each substituent may be a halo, and preferably each substituent is fluoro.
[0105] The second aryl guest may be covalently connected to a methylene group.
[0106] The second aryl guest may be phenyl having five halo substituents, such as five fluoro substituents. Thus, the second aryl guest may be perfluorophenyl (pentafluorphenyl), and this is preferably connected to a methylene group.
[0107] The second aryl guest may be indolyl, such as indol-3-yl, having from 1 to 5 halo substituents, such as 1 to 5 fluoro substituents.
[0108] The second aryl guest may be naphthyl, such as naphth-2-yl, having from 1 to 7 halo substituents, such as 1 to 7 fluoro substituents.
[0109] The second aryl guest may be anthracenyl, such as anthracene-2-yl, having from 1 to 9 halo substituents, such as 1 to 9 fluoro substituents.
[0110] The second aryl guest may be present within a side chain of an amino acid residue. Where the second guest is perfluorophenyl, the amino acid residue may be perfluorophenylalanine.
[0111] An amino acid residue containing the second aryl guest may be provided within a polypeptide sequence having two or more contiguous amino acid residues, such as three or more, such as 4 or more amino acid residues, such as 5 or more, such as 6 or more amino acid residues.
[0112] The amino acid residue having the second aryl guest may be provided at the N terminal of the polypeptide sequence. The terminal amino acid residue may be connected to a polypeptide sequence having one or more, such as three or more, such as 4 or more, such as 5 or more amino acid residues that are contiguous with the terminal amino acid residue.
[0113] Preferably, the polypeptide sequence has an amino acid residue with the second aryl guest at the N terminal, and this is connected to one or more, such as three or more, such as 4 or more, such as 5 or more glycine residues that are contiguous with the terminal amino acid residue.
[0114] The second aryl guest may be covalently linked, directly or indirectly, to a solid phase.
[0115] The solid phase may be a polymeric phase, such a polystyrene resin or poly(ethylene glycol) (PEG) resin, such as a Rink Amide ChemMatrix Resin. Here, the formation of the complex connects the biomolecule to the solid-phase via the non-covalent interaction in the complex.
[0116] Once the biomolecule is immobilised, it may be separated from other components, which are not insolubilized, by simple filtration. Subsequent release of the biomolecule from the complex, for example using the decomplexation methods described herein, then provides the biomolecules in a higher purity form.
[0117] The aryl guest may be connected to an optionally substituted imidazolium group, such as connected to the optionally substituted imidazolium group through methylene. The aryl guest may be connected, such as through methylene, to the 1-position of the imidazolium ring nitrogen.
[0118] The 2-, 4- and 5-positions of the imidazolium group are preferably unsubstituted. The 3-position of the imidazolium group is optionally substituted and may be substituted.
[0119] The imidazolium 3-position may be substituted with alkyl, such as C.sub.1-4 alkyl, such as methyl or ethyl, such as methyl.
[0120] The imidazolium 3-position may be connected to a methylene group.
[0121] The imidazole group may be covalently linked, directly or indirectly, to a solid phase. The connection to the solid phase may be made via the imidazolium 3-position, such as through a methylene group connected to the imidazolium 3-position.
[0122] An amino acid residue as described herein may be an -amino acid residue, a -amino acid residue or a -amino acid residue.
[0123] An amino acid residue as described herein may be an -amino acid residue.
[0124] The amino acid residue may be an L- or D-form, and is typically L-form.
[0125] In a preferred embodiment, both the first guest and the second guest are groups within the side-chain of an amino acid residue. Preferably, each amino acid residue has the same stereoform, such as both amino acid residues having the L-form.
Biomolecule
[0126] The biomolecule may be selected from a polypeptide, a polynucleotide and a polysaccharide.
[0127] The present inventors have found that the ternary complex of the invention stabilised a biomolecule, such as insulin, preventing its degradation. The complex may be stored without appreciable loss of structural or functional activity.
[0128] The biomolecule may inherently contain a group that is suitable for use as a guest in a ternary complex of the invention. For example, a polypeptide may contain a phenylalanine residue, and the phenyl group in the side chain of that residue may be a suitable guest.
[0129] In other embodiments a biomolecule may be modified to include a guest aryl group so that the biomolecule may be held in complex.
[0130] A guest group present within a biomolecule is available for interaction with the cucurbituril host. Thus, a first aryl guest may be provided at a terminal of a biomolecule, where it is more likely to be freely available for interaction with a host. However, a guest group may also be provided with mid-chain if that guest is available for interaction. It is preferred that the first aryl guest is provided at a terminal region of a biomolecule.
[0131] In one embodiment, the biomolecule has a molecular weight of at least 100, at least 200, at least 300, at least 1,000 (1 k), at least 5,000 (5 k), at least 10,000 (10 k), at least 15,000 (15 k), at least 20,000 (20 k), at least 50,000 (50 k), at least 100,000 (100 k) or at least 200,000 (200 k).
[0132] The polypeptide may be a protein, such as a peptide hormone or an antibody.
[0133] Thus, the biomolecule may be selected from the group of polypeptides consisting of: [0134] Blood factors such as Factor VIII and Factor IX; [0135] Thrombolytic agents, such as tissue plasminogen activator; [0136] Peptide hormones, such as insulin, glucagon, growth hormone, gonadotrophins; [0137] Haematopoietic growth factors, such as Erythropoietin, colony-stimulating factors; [0138] Interferons, such as Interferons-, -, -; [0139] Interleukin-based products, such as Interleukin-2; [0140] Vaccines, such as Hepatitis B surface antigen; [0141] Antibodies, including Monoclonal antibodies; and [0142] Other proteins such as tumour necrosis factor, and therapeutic enzymes.
[0143] Preferably, the biomolecule is a peptide hormone, such as insulin. The polypeptide includes proteins having post-translation modifications. The polypeptide may also include non-natural amino acid residues, and the polypeptide may also include with amino acid residues that are modified during post-translation modification.
[0144] A polypeptide may have at least 10 contiguous amino acid residues, such as at least 20, at least 50, at least 100 or at least 500 amino acid residues.
[0145] The worked examples in the present case use insulin as an exemplary polypeptide, and connections are formed to the 1-Phe residue within insulin within chain B. Webber et al. have also formed cucurbituril complexes with the 1-Phe residue within insulin, albeit using CB[7] and in binary complex only. Webber et al. also shown that cucurbituril complexation with other biomolecules is possible, such as other polypeptides, including glucagon and an anti-CD20 antibody. A skilled person will appreciate that the present invention may be used to form complexes with biomolecules other than insulin.
[0146] The first guest may be provided at the N or C terminal regions of a polypeptide, and preferably the N terminal. Here, a terminal region may be regarded as the three contiguous amino acid residues at the N or C terminal. Typically, the first guest is present within the amino acid residue at the N or C terminal itself, and preferably the N terminal. The inventors have found that the association constant for the first guest may be greatest when the first guest is present with the amino acid residue at the terminal itself, with the association constants decreasing as the first guest is provided at further distance from the N terminal, such as within the second and third amino resides from the N terminal.
[0147] Where, the biomolecule is a polypeptide, the first guest is typically present in an arylalanine residue within the polypeptide.
[0148] Where, the biomolecule is a polypeptide, the first guest is typically present in a phenylalanine, tyrosine, indole or naphthylalanine residue within the polypeptide.
[0149] A polynucleotide may be a DNA, RNA or a related from such as a PNA, and may be single stranded.
[0150] A polynucleotide may have at least 10 contiguous nucleotides, such as at least 50, at least 100 or at least 500 nucleotides.
[0151] The first guest may be provided at the 3 or 5 terminal region of the polynucleotide.
[0152] A polysaccharide may contain two or more, such as three or more contiguous sugar groups.
[0153] A biomolecule may a naturally occurring biomolecule or it may be a variant, such as a mutant, of such a biomolecule.
[0154] The biomolecule may also be a modified form, for example one that contains a label for detection or capture. Typically, such functionality is provided away from the first guest, so as not to interfere with the formation of the complex.
[0155] A biomolecule may also be in protected from, such as chemically protected form.
[0156] Typically, the biomolecules is not itself immobilised on a solid phase. Rather, the formation of the complex may serve to non-covalently connect the biomolecule to a solid phase through its interaction with a second guest in the complex, where the second guest is connected to the solid phase, as present in a preferred embodiment of the invention.
[0157] A biomolecule may be provided in an aqueous mixture, and the biomolecule may be provided together with other components. Here, the formation of the complex, preferentially comprising the biomolecule over the other components, may serve as a process for purifying the biomolecule from those components.
[0158] These other components may not possess a suitable guest group for formation of a complex, or a functional group may be present, but unavailable for formation (for example, because that guest is internalised within the other component).
Insulin
[0159] The present invention is well suited to the formation of complexes where the biomolecule is insulin.
[0160] The insulin may be a polypeptide having the amino acid sequence corresponding to human insulin.
[0161] The insulin may be an insulin that is recombinantly produced. Thus, the insulin may be human recombinant insulin.
[0162] In a preferred embodiment, the first guest is the phenyl group present in the phenylalanine residue present at position 1 of insulin, particularly human insulin.
[0163] The insulin need not be provided in sustainably purified form. The methods of the invention may allow for the preferential formation of the complex with insulin over other compounds present. Once formed, the complex may be readily purified from other components, for example by simple filtration where the complex includes a solid-phase bound component, such as the second compound.
Formation and Decomplexation of a Complex
[0164] The invention provides a method of preparing a complex of the invention, the method comprising the step of mixing a cucurbituril host together with a biomolecule having a first aryl guest, and a second aryl guest, and permitting the host and guests to self-assemble.
[0165] The method may further comprise the step of at least partially purifying the complex. For example, the complex may be a least partially separate from one or more other components that are not in the complex.
[0166] It is preferred that the second aryl guest is bound to the solid phase, and it follows that the resulting complex is also bound to the solid phase. Here the complex may be at least partially purified by collection of the solid phase, for example by filtration, to separate the complex from other components.
[0167] The invention also provides a method of decomplexing a complex of the invention, the method comprising the step of treating the complex an external stimulus, and permitting the external stimulus to displace at least the first aryl guest from the complex.
[0168] The release of the biomolecule may be achieved as needed, and may be regarded as on-demand release.
[0169] The external stimulus may the addition of a competitor guest. The competitor guest may be or comprise adamantane amine (ADA) or memantine (DMADA).
[0170] The formation and the decomplexation may be performed under aqueous conditions.
[0171] Adventurously, the formation and decomplexation steps may also be performed at ambient temperatures, such as a temperature in the range 5 to 30 C.
[0172] The complexation and decomplexation may be performed at temperatures that are not associated with a degradation of the biomolecule, and the integrity of the biomolecule may be preserved in the complexation and decomplexation processes.
Stabilization of a Biomolecule
[0173] The complexes of the invention may be used to stabilise a biomolecule for storage, and the biomolecule may be easily and rapidly released from the complex when needed. The complex formation thereby limits the degradation of the biomolecule and maintains its activity upon release.
[0174] Here, the complexes of the invention are preferably bound to the solid phase. For example, the second guest is connected to the solid phase, such as a polymeric phase.
[0175] Accordingly, there is provided the use of a complex to stabilise a biomolecule, such as a protein.
[0176] The stability of a biomolecule may be assessed by structural analysis, such as circular dichroism, and may include a comparison of the secondary and tertiary structures of the biomolecules before and after complexation. The satiability of a biomolecule may be assessed by a suitable activity assay, which may include cells-based studies.
Purification of a Biomolecule
[0177] The selective capture of the biomolecule in a complex also allows for the biomolecule to be separated from other components in a mixture, thereby also allowing the at least partial purification of the biomolecules. For example, the complex may be connected to the solid phase allowing the complex to be collected by simple filtration, with separation from other solubilised components of a mixture.
[0178] Accordingly, there is provided a method for at least partially purifying a biomolecule from a mixture, the method comprising the step of providing a biomolecule in a mixture, such as an aqueous mixture, where the biomolecule has a first aryl guest; providing a cucurbituril host and a second aryl guest in the mixture; permitting the host and guests to self-assemble; and subsequently separating the resulting complex from the mixture.
Experimental
[0179] On account of their high binding affinity and range of guests, cucurbit[n] uril (CB[n]) macrocyclic hosts are ideal to bind peptides.sup.6-11 Urbach and co-workers reported a homopeptide dimer between two FGG tripeptides and CB[8], displaying high binding strength (K ca. 10.sup.11 M.sup.2).sup.6. This homodimer has been adopted as a versatile building block in the design and fabrication of supramolecular oligomers,.sup.12 polymers,.sup.13,14 hydrogels,.sup.15,16 and protein/peptide assemblies..sup.17 Although significant advances have been made, a major challenge remains favorable, quantitative formation of heteropeptide dimers without homodimerization.
[0180] Herein, we employ CB[8] to mediate heterodimerization of a canonical aromatic peptide and a non-canonical L-perfluorophenylalanine (F)-containing peptide,
[0181] Five model aromatic tripeptides containing Lperfluorophenylalanine20 (FGG), L-tryptophan (WGG), L-phenylalanine (FGG), L-tyrosine (YGG), and L-(2-naphthyl)alanine (NpGG) at the N-termini were designed and prepared,
##STR00003## ##STR00004##
[0182] .sup.1H and .sup.19F NMR titrations were performed to probe heteropeptide dimerization within CB[8],
[0183] Isothermal titration calorimetry (ITC) was employed to study binding thermodynamics of heteropeptide dimerization (
[0184] Table 1 shows that all XGG peptides exhibited high binding strengths (K.sub.a>10.sup.4 M.sup.1), confirming thermodynamic stability of heteropeptide dimerization. YGG (
[0185] To understand influence of aromatic position on heterodimerization, K.sub.a of GXG and GGX (X=W, F, Y, Np) with FGG-CB[8] were determined by ITC. Shifting aromatic residues from N to C-termini led to a notable decrease in K.sub.a, Table 1. The increased distance between the positive charge at the N-terminus and the aromatic motif weakens ion-dipole interactions at the CB[8] portal, reducing the secondary binding affinity. This is exemplified by GGY, where no secondary binding to FGG-CB[8] was observed. Seven new, derivatized heteropeptide dimers, FGG-CB[8]-GXG and FGG-CB[8]-GGX, expand the scope of host-enhanced heteropeptide dimerization. The exhibited binding selectivity to aromatic residues is an advantage of this system, enabling access to a range of peptides and proteins. Compared to previous reports on CB[8]-peptide heteroternary complexes,.sup.27 the system described here is simply based upon a F amino acid, easily accessible for ligation in chemical biology and biochemistry.
TABLE-US-00001 TABLE 1 Thermodynamic data for secondary association of aromatic residue-containing tripeptides and control tripeptides with the 1:1 complex of FGG-CB[8].sup.a model K.sub.a H.sub.a TS.sub.a peptide (10.sup.3M.sup.1) (kcal mol.sup.1) (kcal mol.sup.1) WGG 460 53 13.3 0.2 5.6 0.2 GWG 103 6 11.9 0.1 5.0 0.1 GGW 90 7 14.5 0.3 7.8 0.4 FGG 356 26 14.4 0.4 6.8 0.3 GFG 58 3 9.1 0.4 2.6 0.4 GGF 22 1 7.5 0.2 1.5 0.2 YGG 3 5 9.6 0.1 3.6 0.2 GYG 4 0.3 4.7 0.2 0.2 0.3 GGY .sup.b .sup.b .sup.b NpGG 382 22 14.6 0.2 7.0 0.2 GNpG 165 10 12.7 0.4 5.6 0.4 GGNp 90 2 12.0 0.1 5.2 0.1 KGG .sup.b .sup.b .sup.b EGG .sup.b .sup.b .sup.b LGG .sup.b .sup.b .sup.b Note: .sup.aAveraged with three replicates. .sup.bNot detected.
[0186] The thermodynamic mechanism behind peptide heterodimerization is attributed to social self-sorting,
[0187] After elucidation of thermodynamics, we demonstrated the utility of heteropeptide dimers to achieve on-resin recognition of aromatic peptides,
[0188] UV experiments were performed to test absorption efficiency through quantification of aromatic peptides present before and after on-resin treatment,
[0189] Multi-cycle on-resin recognition was performed to evaluate recyclability,
[0190] To extend applicability of this approach, we exploited interfacial recognition for insulin stabilization and its on demand release from the resin,
[0191] In conclusion, we have introduced a new type of quantitative heteropeptide dimerization. Through host-enhanced polar-interactions, the binding affinity between aryl and perfluorophenyl groups from two different peptides is significantly enhanced with a K.sub.a up to 10.sup.5 M.sup.1 and a K.sub.1 K.sub.a up to 101 M.sup.2, ensuring exclusive formation of heteropeptide dimers. To demonstrate utility, the solution-phase host-guest complex (FGG-CB[8]-XGG) was transferred to a solid-liquid interface achieving on-resin recognition and isolation of aromatic peptides as well as stabilization and on demand release of insulin under ambient conditions.
[0192] This generic approach enables accumulation and separation of aromatic-abundant biomacromolecules useful in biomedical research. We anticipate that this work will inspire research into exploitation of heteropeptide dimerization as a versatile strategy for a wide range of life science applications.
Materials and Instrumentation
Materials
[0193] Unless noted, all the starting materials were purchased from commercial suppliers and were used without further purification: (L)-Fmoc-Glu(OtBu)OH, (L)-Fmoc-Gly-OH, (L)-Fmoc-Leu-OH, (L)-Fmoc-Lys(Boc)-OH, (L)-Fmoc-Phe-OH, (L)-Fmoc-Trp(Boc)-OH, (L)-Fmoc-Tyr(tBu)-OH, (L)-Fmoc-3-(2-naphthyl)-alanine-OH, piperidine, diethyl ether, (L)-Fmoc-pentafluorophenylalanine-OH, diisopropylethylamine (DIPEA), trifluoroacetic acid (TFA), triisopropylsilane (TIPS), N,N,N,N-tetramethyl-O-(1H-benzotriazol-1-yl)uronium hexafluorophosphate (HBTU), dimethylformamide (DMF), dichloromethane (DCM), monobasic sodium phosphate (NaH.sub.2PO.sub.4), dibasic sodium phosphate (Na.sub.2HPO.sub.4), deuterium oxide (D2O, D, 99.8%), 1,3-dimethylaminoadamantane hydrochloride (memantine hydrochloride, DMADA), Rink Amide-AM Resin, H-Rink Amide ChemMatrix Resin (surface loading of functionalization ca. 0.5 mmol/g), Insulin (Human Recombinant). Phosphate buffer solution was prepared by adding 10 mM NaH.sub.2PO.sub.4 solution to 10 mM Na.sub.2HPO.sub.4 solution until pH equal to 7.0. Cucurbit[8] uril (CB[8]) was synthesized and purified according to a previous report (see Kim), and the molecular weight for CB[8] was calibrated as 1600 g/mol. Water was obtained from a Milli-Q Integral Water Purification System (22.5 M.Math.cm).
Peptide Synthesis and Purification
[0194] All the tripeptides (XGG, GXG, GGX) were synthesized on an automated microwave peptide synthesizer (Liberty Blue, CEM) using standard 9-fluorenylmethoxycarbonyl (Fmoc) chemistry. Peptides were cleaved from the Rink Amide-AM resin with a mixture of 95% TFA, 2.5% TIPS, and 2.5% H2O and shaken for 1 hr at room temperature. The resin was filtered and washed with the cleavage mixture. After evaporating most of the cleavage mixture, the peptides were crushed out and washed three times with cold diethyl ether. The crude peptides were purified using reverse-phase high-performance liquid chromatography (RP-HPLC) on a Phenomenex C18 Kinetic-Evo column with 5 m pore size, 110 particle size and 15021.2 mm dimensions. The mobile phase for water contained 0.1% TFA. The purification method gradient was set from 5% acetonitrile and 95% water to 100% acetonitrile. Dry peptides were obtained (as TFA salt) by lyophilization and then verified by NMR and ESI-MS. On resin peptide FGGGGG were synthesized using H-Rink Amide ChemMatrix Resin using the same method above. The FGGGGG-functionalized resin was washed with DMF for 3 times and then left in D.sub.2O/H.sub.2O (for 3 days, change with fresh D.sub.2O/H.sub.2O each day). Then, the washed FGGGGG resin was preserved in D.sub.2O/H.sub.2O in the fridge at 277 K.
Nuclear Magnetic Resonance Spectroscopy (NMR)
[0195] 1H, .sup.13C & .sup.19F NMR spectra of pure peptides were acquired in D.sub.2O at 298 K on a Bruker AVANCE 400 (400 MHz) apparatus and a BrukerAVANCE 500 apparatus with TCI Cryoprobe (500 MHz). .sup.1H & .sup.19F NMR spectra of other experiments (NMR titration, on-resin recognition tests) were acquired in phosphate buffer (D.sub.2O) with pH 7.0 on a Bruker AVANCE 400 (400 MHz) apparatus. Chemical shifts were referenced to the residual solvent peak of HDO at 4.79 ppm.
Isothermal Titration Calorimetry (ITC)
[0196] ITC experiments were operated on a Malvern MicroCal Auto-ITC200 apparatus at 298 K in 10 mM phosphate buffer with pH 7.0. In a of 1 injection of 0.6 L and 32 consecutive injections of 1.2 L with 90 s intervals between injections. The first data point was removed before analysis. The ITC data were fitted by MicroCal Analysis Centre software using one set of sites model. In a typical experiment for homoternary complexation, the host molecule (CB[8]) was loaded in the sample cell with a concentration of 0.1 mM, and the guest peptide was loaded in the syringe with a concentration of 2.0 mM. The injection and interval settings were the same as above. The first data point was removed from the data set before analysis. The obtained ITC curves were fitted by MicroCal Analysis Centre software using sequential binding model. All parameters were averaged with three replicates, s.d. n=3. K.sub.a values <10.sup.2 M.sup.1 were too low to be detected.
Electrospray Ionization Mass Spectrometry (ESI-MS)
[0197] ESI-MS data were obtained on a Thermo Fisher Q Exactive Orbitrap mass spectrometry with a nanospraying ion source. Positive mode was chosen for all experiments at a working 35 temperature of 320 C. and a capillary voltage of 1.5 kV. All the sample solutions were prepared in pure water.
Ultraviolet (UV) Spectroscopy
[0198] UV spectra were obtained on a Cary 4000 Ultravioletvisible Spectrometer. A pair of quartz cuvettes with 2 mm gap was used. Double beam mode with a fixed 2 nm spectral bandwidth and baseline correction were selected. In a typical run, 300 l of 1 mM solution was transferred to the quartz cuvette and was scanned compared to phosphate buffer in D.sub.2O. UV spectra were obtained in the linear absorption range (0.2-0.8) for better quantification.
Confocal Microscopy
[0199] On-resin imaging was performed on a Leica Stellaris 5 confocal microscope (Leica Microsystems) with a 40 oil immersion objective and numerical aperture of 1.3. Suspensions of resin beads were dropped on a coverslip and stacks of 50 images with a resolution of 10241024 pixels at _EXC 405 nm were acquired. The stack boundaries were selected at the point where there was no distinguishable difference of signal-to-noise for both extremes (edges) of the beads. The 3D dataset was processed by Fiji/ImageJ and the maximum intensity Z-projection was obtained to compare the mean fluorescent intensity across resin beads. 3 groups of resin beads were tested: A group, original unmodified ChemMatrix resin (resin) and FG5 functionalized resin (resin-G5F); B group, resin-G5F with WGG but no CB[8](resin-G5F+WGG), resin-G5F with WGG and CB[8](resin-G5F+WGG+CB[8]), resin-G5F with WGG, CB[8] and then DMADA solution (resin-G5F+WGG+CB[8]+DMAMA); C group, resin-G5F with WG5-dansyl but no CB[8](resin-G5F+WG5-dansyl), resin-G5F with WG5-dansyl and CB[8](resin-G5F+WG5-dansyl+CB[8]), resin G5F with WG5-dansyl, CB[8] and then DMADA solution (resin-G5F+WG5-dansyl+CB[8]+DMAMA). 3 groups of resin beads were prepared according to the procedures in the UV Spectroscopy above. All samples then were drained and added with the same amount of phosphate buffer with pH 7.0 to yield suspensions (10 mM) for imaging.
Circular Dichroism (CD) Spectroscopy
[0200] CD measurements were performed on a Chirascan spectrometer (Applied Photophysics). Scans were carried out at 298 K from 200-500 nm at 1.0 nm intervals. The acquisition time was 0.5 second in a 0.1 cm path length quartz cuvette. CD spectra were collected using solutions from the above UV experiments of the 8-day storage and release experiments. The original insulin solution was 0.2 mM. All solutions were diluted 2 before the measurements. All scans were collected and averaged out over three scans with background noise correction.
Procedures for On-Resin Recognition
[0201] A standard procedure of NMR quantification for on-resin recognition within a peptide mixture is shown as follows. A solution of mixed peptides containing WGG, KGG, LGG, EGG, CB[8]([XGG]=CB[8]=1 mM) was used as the original cycle 0 solution. For cycle 1, 1.0 ml of the mixed peptides (1 mM) was added to the FGGGGG-functionalized resin (10 mM) and shaken for 10 min at room temperature. After draining, 1 ml of 2 mM DMADA solution was added, shaken for 45 min, and collected as the released solution. For cycle 2, fresh CB[8](2.88 mg, 1.8 mol) was added to 0.9 ml of the released solution (2 mM DMADA, 1.8 mol) from cycle 1 to bind with excess DMADA and the isolated WGG. Around 0.1 ml solution was lost after each cycle. This new solution was added with the resin, shaken for 45 min, drained and then 0.9 ml of 2 mM DMADA solution was added to generate the released solution from cycle 2. For cycle 3, fresh CB[8](2.56 mg, 1.6 mol) was added to 0.8 ml of the released solution (2 mM DMADA, 1.6 mol) from cycle 2. This new solution was added with the resin, shaken for 45 min, drained and then 0.8 ml of DMADA solution was added to generate the released solution from cycle 3. The original cycle 0 solution and the subsequently released solutions from each cycle were tested with NMR. After NMR analysis, the relative amount of WGG was determined by integrating a set of doublet peak at 3.36 ppm; the relative amount of KGG was determined by integrating a set of triplet peak at 3.02 ppm; the relative amount of EGG was determined by integrating a set of triplet peak at 2.37 ppm. There was a singlet peak of KGG, LGG, EGG that overlapped at 3.95 ppm. Thus, the relative amount of LGG was determined by the relative amount of KGG and EGG subtracted from the integration of peak at 3.95 ppm.
[0202] A standard procedure of UV quantification for peptide on-resin recognition is shown as follows. Using a model tripeptide WGG as an example, 10 mol FGGGGG equivalent of resin D.sub.2O suspension was added to a 12 ml syringe with a polyester frit (r=8.0 mm, h=2.0 mm). After D.sub.2O was drained, 1 ml of 1 mM WGG:CB[8]=1:1 solution (original solution, in D.sub.2O, phosphate buffer, pH=7.0) was added to the resin. Then, the plunge and a stopper were assembled to the syringe to form a sealed chamber. The syringe was transferred onto a wrist-action shaker and shaken for 10 min. Then, the liquid (resin-treated solution) in syringe was drained thoroughly and the resin was washed with D.sub.2O once. After that, 1 ml of 2 mM DMADA solution (in D.sub.2O, phosphate buffer, pH=7.0) was added to the resin and shaken for an additional 45 min. The solution (released solution) was drained for further testing. For efficiency test over 3 cycles (
[0203] A standard procedure of UV quantification for insulin on-resin recognition is shown as follows. Insulin (Human Recombinant) was suspended at 0.2 mM using 0.22 m filtered H2O. 1 N HCl was immediately added to reach acid concentration of 5 mM. Solution was gently stirred until fully dissolved and was filtered through 0.22 m filter and stored at 277 K. Method was slightly modified from a Cold Spring Harbor Protocol (see Recipe: Insulin). Insulin solution was warmed to room temperature before being used in on-resin recognition. In a typical experiment, 8 mol FGGGGG equivalent of resin H2O suspension was added to a 12 ml syringe with a polyester frit (r=8.0 mm, h=2.0 mm). After H2Owas drained, 8 ml of 0.1 mM CB[8] solution was added to the resin and the plunge and a stopper were assembled to the syringe to form a sealed chamber. The syringe was transferred onto a wrist-action shaker and shaken for 12 hours. Then, after CB[8] solution was drained and resin was washed once with H2O, 0.8 ml of 0.2 mM insulin solution was added to the resin and shaken gently for 3 hours. Then, the liquid (resin-treated solution) in syringe was drained thoroughly and the resin was washed with H2Oonce. After that, 0.8 ml of 2 mM DMADA solution (in 5 mM of HCl) was added to the resin and shaken gently for an additional 3 hours. The solution (released solution) was drained. The origin insulin solution, resin-treated solution and released solution were measured on UV spectroscopy immediately.
[0204] The 8-day storage and release experiment was conducted as follows. 9 syringes were prepared each with 8 mol FGGGGG equivalent of resin H2Osuspension. On day 0, as the standard procedure of UV quantification for insulin on-resin recognition described above, all 9 syringes were drained, bound with CB[8], washed, bound with insulin, drained (to get resin-treated solutions), washed. All syringes with the resin were kept at room temperature. Starting from day 0 to day 8, on each day, a syringe with resin was added with 0.8 ml of 2 mM DMADA solution (to get released solution, one per day). All yielded solutions were analyzed on UV spectroscopy immediately. Numbers to calculate released percentage in
Synthesis & Characterization of Model Peptides
(S)-2-amino-N-(2-((2-amino-2-oxoethyl)amino)-2-oxoethyl)-3-(perfluorophenyl) propenamide (FGG)
[0205] .sup.1H NMR (500 MHz, D.sub.2O, ppm): 6=4.32 (t, 1H), 3.99 (q, 2H), 3.93 (s, 2H), 3.37 (d, 2H); .sup.13C NMR (125 MHz, D.sub.2O): 6=174.01, 170.89, 168.81, 146.39, 144.44, 141.90, 139.89, 138.49, 136.49, 107.29, 52.04, 42.30, 41.89, 23.85; .sup.19F NMR (375 MHz, D.sub.2O, ppm): =142.65, -154.41, -162.18; ESI MS for [M-H]+: calc. m/z=369.10, found m/z=369.17.
(S)N-(2-amino-2-oxoethyl)-2-(2-aminoacetamido)-3-(perfluorophenyl) propanamide (GFG)
[0206] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =4.77-4.74 (m, 1H), 3.95-3.86 (m, 2H), 3.85-3.79 (m, 2H), 3.32 (dd, 1H), 3.20 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.56, 172.10, 166.91, 146.25, 144.32, 141.27, 139.28, 138.29, 136.33, 109.48, 52.55, 42.00, 40.22, 24.43; .sup.19F NMR (375 MHz, D.sub.2O, ppm): =143.48, 155.99, 162.89; ESI MS for [M-H]+: calc. m/z=369.10, found m/z=369.17.
(S)-2-(2-(2-aminoacetamido)acetamido)-3-(perfluorophenyl)propanamide (GGF)
[0207] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =4.73-4.70 (m, 1H), 3.96 (s, 2H), 3.88 (s, 2H), 3.31 (dd, 1H), 3.16 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =174.32, 170.86, 167.64, 146.27, 144.34, 141.17, 139.17, 138.24, 136.27, 109.89, 52.04, 42.05, 40.28, 24.52; .sup.19F NMR (375 MHz, D.sub.2O, ppm): =143.47, 156.31, 163.04; ESI MS for [M-H]+: calc. m/z=369.10, found m/z=369.17.
(S)-2-amino-N-(2-((2-amino-2-oxoethyl)amino)-2-oxoethyl)-3-(1H-indol-3-yl) propanamide (WGG)
[0208] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.64 (d, 1H), 7.54 (d, 1H), 7.35 (s, 1H), 7.29 (t, 1H), 7.21 (t, 1H), 4.36 (t, 1H), 3.92 (d, 1H), 3.91-3.75 (m, 3H), 3.49-3.38 (m, 2H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.90, 171.22, 170.24, 136.11, 126.42, 125.23, 122.21, 119.61, 117.98, 112.03, 106.24, 53.68, 42.36, 41.83, 26.70; ESI MS for [M-H]+: calc. m/z=318.16, found m/z=318.34.
(S)N-(2-amino-2-oxoethyl)-2-(2-aminoacetamido)-3-(1H-indol-3-yl)propanamide (GWG)
[0209] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.67 (d, 1H), 7.52 (d, 1H), 7.30-7.24 (m, 2H), 7.19 (t, 1H), 4.77-4.69 (m, 1H), 3.84 (d, 1H), 3.81-3.73 (m, 2H), 3.70 (d, 1H), 3.36-3.23 (m, 2H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.90, 173.77, 166.95, 136.04, 126.64, 124.54, 122.00, 119.38, 118.25, 111.89, 108.60, 54.89, 42.00, 40.23, 26.96; ESI MS for [M-H]+: calc. m/z=318.16, found m/z=318.34.
(S)-2-(2-(2-aminoacetamido)acetamido)-3-(1H-indol-2-yl)propanamide (GGW)
[0210] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.70 (d, 1H), 7.52 (d, 1H), 7.30-7.24 (m, 2H), 7.19 (t, 1H), 4.68 (dd, 1H), 3.99-3.87 (m, 2H), 3.77 (s, 2H), 3.34 (dd, 1H), 3.23 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =176.11, 170.80, 167.56, 136.05, 126.84, 124.49, 121.89, 119.30, 118.29, 111.84, 108.86, 54.13, 42.11, 40.23, 26.96; ESI MS for [M-H]+: calc. m/z=318.16, found m/z=318.34.
(S)-2-amino-N-(2-((2-amino-2-oxoethyl)amino)-2-oxoethyl)-3-phenylpropanamide (FGG)
[0211] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.47-7.36 (m, 3H), 7.35-7.29 (m, 2H), 4.32 (t 1H), 4.01 (d, 1H), 3.94-3.87 (m, 3H), 3.29-3.20 (m, 2H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): _=173.97, 171.27, 169.82, 133.68, 129.35, 129.14, 128.00, 54.41, 42.26, 41.90, 36.69; ESI MS for [M-H]+: calc. m/z=279.15, found m/z=279.34.
(S)N-(2-amino-2-oxoethyl)-2-(2-aminoacetamido)-3-phenylpropanamide (GFG)
[0212] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.44-7.37 (m, 2H), 7.37-7.31 (m, 1H), 7.34-7.28 (m, 2H), 4.68 (t, 1H), 3.86 (t, 2H), 3.81-3.74 (m, 2H), 3.16 (dd, 1H), 3.07 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.76, 173.49, 166.96, 136.05, 129.14, 128.80, 127.28, 55.32, 41.99, 40.18, 36.93; ESI MS for [M-H]+: calc. m/z=279.15, found m/z=279.34.
(S)-2-(2-(2-aminoacetamido)acetamido)-3-phenylpropanamide (GGF)
[0213] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.43-7.36 (m, 2H), 7.36-7.29 (m, 2H), 7.30 (t, 1H), 4.62 (dd, 1H), 3.94 (q, 2H), 3.85 (s, 2H), 3.19 (dd, 1H), 3.01 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =175.70, 170.82, 167.57, 136.41, 129.14, 128.69, 127.11, 54.68, 42.01, 40.29, 36.98; ESI MS for [M-H]+: calc. m/z=279.15, found m/z=279.34
(S)-2-(2-(2-aminoacetamido)acetamido)-3-(4-hydroxyphenyl)propanamide (GGY)
[0214] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.20-7.13 (m, 2H), 6.90-6.83 (m, 2H), 4.56 (dd, 1H), 3.95 (q, 2H), 3.86 (s, 2H), 3.10 (dd, 1H), 2.94 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =175.77, 170.79, 167.57, 154.37, 130.49, 128.22, 115.38, 54.84, 42.01, 40.30, 36.18; ESI MS for [M-H]+: calc. m/z=295.14, found m/z=295.34.
(S)-2-amino-N-(2-((2-amino-2-oxoethyl)amino)-2-oxoethyl)-3-(naphthalen-2-yl) propanamide (NpGG)
[0215] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.97 (t, 2H), 7.95-7.90 (m, 1H), 7.83-7.79 (m, 1H), 7.64-7.55 (m, 2H), 7.46 (dd, 1H), 4.44-4.39 (m, 1H), 3.89 (s, 2H), 3.74-3.62 (m, 2H), 3.44 (dd, 1H), 3.37 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.74, 171.04, 169.72, 133.04, 132.38, 131.40, 128.84, 128.33, 127.72, 127.62, 126.92, 126.84, 126.61, 54.27, 42.23, 41.68, 36.87; ESI MS for [M-H]+: calc. m/z=329.16, found m/z=329.34.
(S)N-(2-amino-2-oxoethyl)-2-(2-aminoacetamido)-3-(naphthalen-2-yl)propanamide (GNpG)
[0216] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.93 (td, 3H), 7.80-7.76 (m, 1H), 7.57 (tt, 2H), 7.46 (dd, 1H), 4.77-4.74 (m, 1H), 3.84 (dd, 2H), 3.73 (dd, 2H), 3.32 (dd, 1H), 3.23 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =173.67, 173.45, 166.94, 133.76, 133.04, 132.10, 128.31, 127.75, 127.62, 127.47, 127.24, 126.62, 126.21, 55.25, 41.96, 40.16, 37.13; ESI MS for [M-H]+: calc. m/z=329.16, found m/z=329.34.
(S)-2-(2-(2-aminoacetamido)acetamido)-3-(naphthalen-2-yl)propanamide (GGNp)
[0217] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =7.97-7.89 (m, 3H), 7.79-7.75 (m, 1H), 7.57 (tt, 2H), 7.46 (dd, 1H), 4.73 (dd, 1H), 3.90 (q, 2H), 3.76 (s, 2H), 3.36 (dd, 1H), 3.16 (dd, 1H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =175.66, 170.78, 167.47, 134.17, 133.03, 132.06, 128.19, 127.71, 127.61, 127.49, 127.31, 126.51, 126.11, 54.53, 42.02, 40.21, 37.16; ESI MS for [MH]+: calc. m/z=329.16, found m/z=329.34.
(S)-2,6-diamino-N-(2-((2-amino-2-oxoethyl)amino)-2-oxoethyl)hexanamide (KGG)
[0218] .sup.1H NMR (500 MHz, D.sub.2O, ppm): =4.12-4.02 (m, 3H), 3.95 (s, 2H), 3.02 (t, 2H), 2.01-1.90 (m, 2H), 1.73 (p, 2H), 1.55-1.43 (m, 2H); .sup.13C NMR (125 MHz, D.sub.2O, ppm): =05, 25.74; ESI MS for [M-H]+: calc. m/z=261.12, found m/z=261.34.
(S)-2-amino-N-(14-amino-2,5,8,11,14-pentaoxo-3,6,9,12-tetraazatetradecyl)-3-(perfluorophenyl) propenamide (FGGGGG)
[0219] .sup.1H NMR (700 MHz, D.sub.2O, ppm): =4.30 (t, 1H), 4.02 (m, 8H), 3.95 (s, 2H), 3.37 (d, 2H); .sup.19F NMR (375 MHz, D.sub.2O, ppm): =142.63, 154.54, 162.22; ESI MS for [M-H]+: calc. m/z=540.16, found m/z=540.67.
2-amino-N-(14-amino-2,5,8,11,14-pentaoxo-3,6,9,12-tetraazatetradecyl)acetamide (GGGGGG)
[0220] .sup.1H NMR (700 MHz, D.sub.2O, ppm): =4.09 (s, 2H), 4.04 (d, 4H), 4.02 (s, 2H), 3.95 (s, 2H), 3.93 (s, 2H); ESI MS for [M-H]+: calc. m/z=360.16, found m/z=360.50.
(S)-2-amino-N-(17-((5-(dimethylamino)naphthalene)-1-sulfonamido)-2,5,8,11,14-pentaoxo-3,6,9,12,15-pentaazaheptadecyl)-3-(1H-indol-3-yl)propenamiden(WGGGGG-dansyl)
[0221] .sup.1H NMR (700 MHz, D.sub.2O, ppm): =8.68 (d, 1H), 8.47 (d, 1H), 8.36 (d, 1H), 7.97-7.85 (m, 3H), 7.62 (d, 1H), 7.53 (d, 1H), 7.35 (s, 1H), 7.26 (t, 1H), 7.17 (t, 1H), 4.41 (t, 1H), 4.07 (s, 2H), 4.03 (s, 2H), 4.02-3.92 (m, 4H), 3.69 (s, 2H), 3.48-3.40 (m, 2H), 3.37 (s, 6H), 3.21 (t, 2H), 3.14 (t, 2H); ESI MS for [M-H]+: calc. m/z=765.31, found m/z=765.34.
On-Resin Recognition of Insulin
[0222] As shown in
[0223] Comparing to the 77% efficiency of treating FGGGGG resin with WGG (1.0 mM) solution, a higher efficiency of 94% is observed when treating FGGGGG resin with insulin (0.2 mM) solution. This phenomenon is actually one of the advantages of on-resin insulin stabilisation. Considering the Ka values of WGG and FGG are quite close, the difference of absorption efficiency between WGG and insulin is not simply related to the equilibrium calculations. Unlike WGG, insulin has two relatively long peptide chains consisting of 51 amino acids, thus its relatively large size could make it kinetically trapped into the crosslinked PEG(material of ChemMatrix resin) resin network. This can increase the absorption efficiency to about 94%, which is considerably higher than free small tripeptide WGG.
TABLE-US-00002 TABLE 2 UV absorbance data at 276 nm for on-resin storage and on-demand release of insulin on day 0, 2, 5, 8 of an 8-day experiment insulin insulin insulin insulin day 0 (abs) day 2 (abs) day 5 (abs) day 8 (abs) original 0.23508 0.23508 0.23508 0.23508 resin-treated 0.01213 0.00845 0.01349 0.01162 released 0.17598 0.17676 0.16827 0.17394 absorption 94.8% 96.4% 94.3% 95.1% efficiency (%) recycling 100% 100.4% 95.6% 98.8% efficiency (%) Note: .sup.a The efficiency results for FIG. 4g were calculated from Table S8 using the following equations:
Secondary Structure Verification of Released Insulin
[0224] As shown in
TABLE-US-00003 TABLE 3 CD absorbance data at 208 nm for on-resin storage and on-demand release of insulin on day 2, 5, 8 of an 8-day experiment insulin, original insulin, day 2 insulin, day 5 insulin, day 8 (m .Math. deg) (m .Math. deg) (m .Math. deg) (m .Math. deg) repeat 1 58.1 58.1 53.0 52.4 repeat 2 55.2 55.4 48.9 52.3 repeat 3 55.8 55.7 49.1 52.2 signal 56.4 56.4 50.3 52.3 std dev 1.5 1.5 2.3 0.1 active 100.0% 89.2% 92.7% ratio
Additional Experimental
On-Resin Selective Isolation of Aromatic Peptides
[0225] The release DMADA-CB[8] complex might have some weak interactions (i.e. electrostatic interactions, hydrogen bonds) with the remaining tripeptides, which could lead to some changes in chemical shifts after release. The yield of the selectively separated WGG for a single cycle was 77% (data not shown), so the final yield of WGG after 3 cycles of selectively separation was 77% x77% x77%=45.6% (see
TABLE-US-00004 TABLE 4 .sup.1H NMR integration analysis for on-resin recognition within a peptide mixture.sup.a WGG KGG EGG KGG + LGG + EGG LGG area percentage (%) area percentage (%) area percentage (%) total area area percentage (%) cycle 0 1000.0 24.5 919.6 22.6 1129.5 27.7 3078.1 1029.0 25.2 cycle 1 1000.0 72.7 108.8 7.9 175.3 12.7 376.1 92.0 6.7 cycle 2 1000.0 86.7 25.6 2.2 40.5 3.5 153.2 87.1 7.6 cycle 3 1000.0 95.0 16.4 1.6 25.4 2.4 52.7 10.9 1.0 Note: .sup.aAll the above results of percentages were shown in FIG. 4d.
REFERENCES
[0226] (1) Call et al. Cell 2006, 127, 355-368. [0227] (2) Luo et al. Chem. Rev. 2016, 116, 13571-13632. [0228] (3) Tang et al. Chem. Soc. Rev. 2014, 43, 7013-7039. [0229] (4) Zhang et al. Nat. Chem. 2016, 8, 120-128. [0230] (5) Crowley, P. B., Ed. Supramolecular protein chemistry; Monographs in supramolecular chemistry; The Royal Society of Chemistry, 2021; pp 1-312. [0231] (6) Heitmann et al. J. Am. Chem. Soc. 2006, 128, 12574-12581. [0232] (7) Logsdon et al. J. Am. Chem. Soc. 2011, 133, 17087-17092. [0233] (8) Biedermann et al. Angew. Chem. Int. Ed. 2014, 53, 5694-5699. [0234] (9) Clarke et al. J. Am. Chem. Soc. 2021, 143, 6323-6327. [0235] (10) Wang et al. CCS Chem. 2021, 3, 1413-1425. [0236] (11) Sonzini et al. Angew. Chem. Int. Ed. 2016, 55, 14000-14004. [0237] (12) de Vink et al. Chem. Sci. 2021, 12, 6726-6731. [0238] (13) Hou et al. Angew. Chem. Int. Ed. 2013, 52, 5590-5593. [0239] (14) Li et al. Chem. Commun. 2017, 53, 5870-5873. [0240] (15) Xu et al. ACS Appl. Mater. Interfaces 2017, 9, 11368-11372. [0241] (16) Parkins et al. Biomaterials 2021,120919. [0242] (17) de Vink et al. Angew. Chem. Int. Ed. 2017, 56, 8998-9002. [0243] (18) Huang et al. J. Am. Chem. Soc. 2020, 142, 7356-7361. [0244] (19) Huang et al. Nat. Mater. 2022, 21, 103-109. [0245] (20) Filler et al. Nature 1965, 205, 1105-1105. [0246] (21) Sogah et al. J. Am. Chem. Soc. 1979, 101, 3035-3042. [0247] (22) Kataki-Anastasakou et al. J. Am. Chem. Soc. 2020, 142, 20513-20518. [0248] (23) Huang et al. Langmuir 2016, 32, 12352-12360. [0249] (24) Biedermann et al. J. Am. Chem. Soc. 2012, 134, 15318-15323. [0250] (25) Zheng et al. Angew. Chem. Int. Ed. 2010, 49, 8635-8639. [0251] (26) Pace et al. Angew. Chem. Int. Ed. 2012, 51, 103-107. [0252] (27) Urbach et al. J. Chem. 2011, 51, 664-678. [0253] (28) Patterson et al. Lancet 2009, 373, 2027-2033. [0254] (29) Sluzky et al. Proc. Natl. Acad. Sci. U.S.A. 1991, 88, 9377-9381. [0255] (30) Chinai et al. J. Am. Chem. Soc. 2011, 133, 8810-8813. [0256] (31) Webber et al. Proc. Natl. Acad. Sci. U.S.A. 2016, 113,14189-14194. [0257] (32) Maikawa et al. Nat. Biomed. Eng. 2020, 4, 507-517. [0258] Kim et al. J. Am. Chem. Soc. 2000, 122, 540-541. [0259] Recipe: Insulin (0.5 mg/mL). Cold Spring Harbor Laboratory Press, 2014. [0260] Li et al. Chem. Commun. 2016, 52, 8537.