Method for the synthesis of a bifunctional complex
09885035 ยท 2018-02-06
Assignee
Inventors
Cpc classification
C12N15/1065
CHEMISTRY; METALLURGY
C12N15/1068
CHEMISTRY; METALLURGY
International classification
C12N15/10
CHEMISTRY; METALLURGY
Abstract
Disclosed is a method for obtaining a bifunctional complex comprising a display molecule part and a coding part, wherein a nascent bifunctional complex comprising a chemical reaction site and a priming site for enzymatic addition of a tag is reacted at the chemical reaction site with one or more reactants, and provided with respective tag(s) identifying the reactant(s) at the priming site is using one or more enzymes.
Claims
1. A split-and-mix method for synthesizing a library of bifunctional complexes comprising a reaction product and a double-stranded identifier oligonucleotide, the method comprising: (1) performing a first synthesis round comprising: providing nascent bifunctional complexes to two or more compartments, each nascent bifunctional complex comprising one or more chemical reaction sites and a site for the addition of an oligonucleotide tag, reacting in each compartment the one or more chemical reaction sites with one or more reactants, adding in each compartment one or more oligonucleotide tags identifying the one or more reactants to the site for addition of the oligonucleotide tag, and pooling together the contents of two or more compartments to provide an admixture of bifunctional molecules; (2) performing a second synthesis round comprising: providing an admixture of second round nascent bifunctional complexes to two or more compartments, wherein the second round nascent bifunctional molecules are obtained from a method comprising one or more first synthesis rounds, reacting in each compartment one or more chemical reaction sites on the second round nascent bifunctional complexes with one or more reactants, adding in each compartment one or more oligonucleotide tags identifying the one or more reactants to a site on the second round nascent bifunctional complexes for addition of an oligonucleotide tag, and producing an admixture of bifunctional complexes each comprising a reaction product and an identifier oligonucleotide comprising the oligonucleotide tags identifying the reactants that participated in the formation of the reaction product; (3) contacting one or more bifunctional complexes obtained from a method comprising one or more second synthesis rounds with a complimentary oligonucleotide capable of hybridizing with at least two oligonucleotide tags of the one or more bifunctional complexes, and (4) obtaining a library of bifunctional complexes in which each member of the library comprises a reaction product and a double-stranded oligonucleotide identifier.
2. The method of claim 1, wherein the reaction product is a small, non-polymeric molecule.
3. The method of claim 2, wherein the small, non-polymeric molecule is a scaffolded molecule.
4. The method of claim 1, wherein the library contains from 10.sup.5 to 10.sup.8 different bifunctional complexes.
5. The method of claim 1, wherein the complimentary oligonucleotide is attached to an attachment entity, and wherein the reaction product of a bifunctional complex is reacted with the attachment entity after hybridization of the identifier oligonucleotide to the complimentary oligonucleotide.
6. The method of claim 5, wherein the complimentary oligonucleotide is prepared by an extension reaction prior to reacting the reaction product of a bifunctional complex with the attachment entity.
7. The method of claim 5, wherein one or more attachment entities comprises a scaffold.
8. The method of claim 1, wherein the complimentary oligonucleotide comprises an assembly platform comprising a unique region that hybridizes to at least two oligonucleotide tags of an identifier oligonucleotide of a first bifunctional complex.
9. The method of claim 8, wherein the assembly platform also hybridizes to an identifier oligonucleotide of a second bifunctional complex.
10. The method of claim 9, wherein the assembly platform comprises anti-tags that hybridize to at least two oligonucleotide tags of an identifier oligonucleotide of a first and/or second bifunctional complex.
11. The method of claim 10, further comprising reacting the reaction products of first and second bifunctional complexes hybridized to the assembly platform.
12. The method of claim 11, wherein a region of a complimentary oligonucleotide capable of hybridizing to an identifier oligonucleotide of a first and/or second bifunctional complex is prepared by an extension reaction prior to reacting the reaction products of the first and second bifunctional complexes.
13. The method of claim 1, wherein oligonucleotide tags are added by enzymatic ligation.
14. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed in a single stranded state.
15. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed in a double stranded state.
16. The method of claim 14, wherein enzymatic ligation of oligonucleotide tags is performed in a double stranded state in which two oligonucleotide tags are kept together by a complementing oligonucleotide which hybridizes with ends of the two oligonucleotide tags to be ligated.
17. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed by a ligase selected from the group consisting of Taq DNA ligase, T4 DNA ligase, T4 RNA ligase, T7 DNA ligase, and E. coli DNA ligase.
18. The method of claim 1, wherein oligonucleotide identifier tags are added by chemical ligation.
19. The method of claim 1, wherein at least one oligonucleotide tag of an identifier oligonucleotide of a bifunctional complex is attached by an enzymatic reaction, and wherein further oligonucleotide tags are attached by chemical or enzymatic means.
20. The method of claim 1, wherein the complementary oligonucleotide comprises anti-tags, and wherein hybridization of the complementary oligonucleotide to at least two oligonucleotide tags of an identifier oligonucleotide of a bifunctional complex is random or semi-random.
21. The method of claim 1, wherein the complementary oligonucleotide is randomly or semi-randomly designed.
22. The method of claim 20, wherein the complementary oligonucleotide is randomly or semi-randomly designed by split-and-mix synthesis.
23. The method of claim 1, wherein the identifier oligonucleotide is randomly or semi-randomly designed.
24. The method of claim 23, wherein the identifier oligonucleotide is randomly or semi-randomly designed by split-and-mix synthesis.
25. The method of claim 1, wherein oligonucleotide tags are separated by a binding region having from 1 to 20 optionally modified nucleotides.
26. The method of claim 25, wherein the backbone of one or more nucleotides in the binding region is modified, said binding region modified nucleotides being selected from the group consisting of nucleotides having a 2-O-methyl substitution on the ribose, nucleotides having a 2-fluoro substitution on the ribose, peptide nucleic acids (PNA), and nucleotides having a 2-4 O-methylene cyclisation on the ribose.
27. The method of claim 1, wherein the identity of a reaction product is determined by sequencing oligonucleotide tags of an identifier oligonucleotide, or by sequencing anti-tags of a complementary oligonucleotide.
28. The method of claim 1, wherein reactants are generated by split-and-mix synthesis.
29. The method of claim 1, wherein identifier oligonucleotides or complementary oligonucleotides are generated by split-and-mix synthesis.
30. The method of claim 1, wherein identifier oligonucleotides and complementary oligonucleotides are generated by split-and-mix synthesis.
31. The method of claim 1, wherein one or more chemical reaction sites on a nascent bifunctional molecule comprises a scaffold.
32. The method of claim 1, wherein one or more identifier oligonucleotides are amplifiable.
33. The method of claim 1, wherein the bifunctional complexes comprise a linker moiety that identifies the one or more chemical reaction sites of the nascent bifunctional molecules having reacted with the one or more reactants in the first synthesis round.
34. The method of claim 33, wherein the linker moieties of the bifunctional complexes comprise an oligonucleotide tag identifying the one or more chemical reaction sites.
35. The method of claim 1, wherein the bifunctional complexes comprise a cleavable linker linking the reaction product and the identifier oligonucleotide.
36. The method of claim 35, wherein the cleavable linker is selected among linkers comprising enzymatic cleavage sites, linkers comprising a chemical degradable component, and linkers cleavable by electromagnetic radiation.
37. The method of claim 1, wherein the bifunctional complexes comprise a polyethylene glycol (PEG) linker.
38. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags contain natural nucleotides.
39. The method of claim 38, wherein the natural nucleobases of the nucleotides of the identifier oligonucleotide are selected from the group consisting of adenine, guanine, thymine, cytosine, 5-methylcytosine, and uracil.
40. The method of claim 38, wherein the natural nucleotides are selected from the group consisting of deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine, and from the ribonucleotide (RNA) constituents adenosine, guanosine, uridine, cytidine, and inosine.
41. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags contain natural and non-natural nucleotides.
42. The method of claim 41, wherein the natural nucleobases of the nucleotides of the identifier oligonucleotide comprise ribonucleotides selected from the group consisting of adenine, guanine, thymine, cytosine, 5- methylcytosine, and uracil.
43. The method of claim 41, wherein the natural nucleotides comprise deoxyribonucleotides selected from the group consisting of deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine, and from the ribonucleotide (RNA) constituents adenosine, guanosine, uridine, cytidine, and inosine.
44. The method of claim 41, wherein the nucleobase of the nucleotides is selected from the group consisting of adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N6-methyladenine, 7-deazaxanthine, 7-deazaguanine, N4,N4-ethanocytosin, N6,N6-ethano-2,6-diamino-purine, 5-methylcytosine, 5-(C3-C6)-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, and inosine.
45. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags comprise one or more non-natural nucleotides.
46. The method of claim 45, wherein the ribose moieties of the non-natural nucleotides are selected from the group consisting of 2-0-methyl-ribose, 2-flour-ribose, and 2-4-0-methylene-ribose (LNA).
47. The method of claim 45, wherein the internucleoside linkages of the non- natural nucleotides are selected from the group consisting of natural phosphodiester linkages, phosphorothiote linkages, phosphodithioate linkages, methylphosphonate linkages, phosphoramidate linkages, phosphotriester linkages, and non-phosphorous-containing internucleoside linkage.
48. The method of claim 45, wherein the nucleobase of the nucleotides is selected from the group consisting of adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N6-methyladenine, 7-deazaxanthine, 7-deazaguanine, N4,N4-ethanocytosin, N6,N6-ethano-2,6-diamino-purine, 5-methylcytosine, 5-(C3-C6)-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, and inosine.
49. The method of claim 1, wherein identifier oligonucleotide tag backbone units are selected from the group consisting of ##STR00049## ##STR00050## ##STR00051## ##STR00052## wherein B denotes a nucleobase.
50. The method of claim 1, wherein the number of reaction compartments is in the first and second synthesis rounds is from 5 to 5,000.
51. The method of claim 1, wherein three synthesis rounds are performed, and the reaction product is identified by three oligonucleotide tags.
52. The method of claim 1, wherein chemical entity reactions are selected from the group consisting of an acylation reaction; an alkylation reaction; a vinylation reaction; an alkenylidation reaction; a transition metal catalyzed arylation reaction, a transition metal catalyzed hetarylation reaction, and a transition metal catalyzed vinylation reaction.
Description
BRIEF DESCRIPTION OF THE FIGURES
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
(33)
(34)
(35)
(36)
(37)
(38)
(39)
(40)
(41)
(42)
(43)
(44)
(45)
(46)
(47)
DETAILED DESCRIPTION OF THE FIGURES
(48)
(49) The building block comprises a functional entity attached to an oligonucleotide which is sufficiently complementary to the identifier region to allow for a hybridisation product to be formed. The functional entity is able to be transferred to the attachment entity through a chemical reaction. The complementing identifier region further comprises a unique codon at the 3 or 5 end thereof. The unique codon identifies the functional entity in an unequivocal way.
(50) Following the formation of the hybridisation product between the identifier and the building block, the functional entity and the unique anti-codon are transferred to the identifier. In an aspect of the invention, the linker connecting the functional entity and the complementing identifier region is cleaved simultaneously with the reaction with the attachment entity resulting in a transfer of the functional entity to the attachment entity. Prior to, simultaneously with or subsequent to the transfer, the transcription of the codon occurs. The transcription is performed by an enzyme capable of polymerisation or oligomerisation of oligonucleotides using a template oligonucleotide to form a complementary stand. Usually a polymerase, such as the Pfu polymerase is used together with suitable dNTPs, i.e. a mixture of ATP, CTP, GTP, and TTP, to form the unique codon as an extension of the identifier strand using the unique anti-codon of the building block as template.
(51)
(52)
(53) After the transfer of functional entity and codon, respectively, the spent building block is separated from the identifier by increasing the stringency. Usually, the stringency is increased by a increasing the temperature, changing the pH or by increasing the ionic strength. After the rupture of the duple helix structure, the identifier is recovered. In one aspect of the invention the identifier is immobilized to ease the separation from the spent building block. In another aspect the spent building block is degraded chemically or enzymatically. Following the recovery of the identifier a new cycle can be initiated by contacting the identifier with a further building block.
(54) The final product after four cycles of transfer is a bifunctional complex, which comprises a reaction product at one end and an encoding region at the other. The reaction product comprises constituents from the transferred functional entities and the initial scaffold. The encoding region comprises a genetic code for which entities that have been transferred in which order. Thus, the synthetic history may be decoded from the encoding region.
(55)
(56) In
(57)
(58) The use of internal codons is of particular importance when several rounds of selection are anticipated, especially when the encoded molecule is formed from a PCR product of a previous round. The internal anti-codons in the building block may completely or partly match the identifier sequence or may comprise one or more universal bases to provide for affinity but not for specificity. The role of the internal unique codons is only to guide the annealing between the identifier molecule and the building block molecule. The correct encoding is taken care of by the unique codons which are created in the extension process. These unique codons are passed on to the next generation of molecules and used to decode the synthetic history of the displayed molecules. This system will not be totally dependent on an accurate encoding function by the internal unique codons in order to pass the correct genotype to the next generation of identifier molecules.
(59) In panel A the hybridisation product provides for a spatial proximity between the functional entity and the attachment entity, thus increasing the probability that a reaction occurs. The unique codon templates the codon on the identifier sequence by an enzymatic extension reaction. In panel B a binding region is introduced between each unique coding sequence to provide for affinity of the two strands to each other even though one or more mis-matching bases appear in the codon:non-coding domain of a previously used codon.
(60)
(61) The sub-library may be reduced by subjecting it to the target again and use more stringent conditions. However, to obtain a higher number of each of the members of the sub-library before a second selection, it is generally preferred to amplify the complex. Thus, a primer which is loaded with a scaffold is initially annealed to a primer site at one end of the encoding region. Subsequently a transcript is formed. A reverse primer is preferably present to obtain a duple stranded PCR product having a scaffold attached thereto.
(62) This PCR is the basis for the generation of en amplification of the sub-library. The identifier sequence is segregated into a number of internal unique codons, abbreviated IUC in the drawing. The number of the IUCs corresponds to the number of functional entities participating in the formation of the display molecule. The sequence of the IUCs expresses the identity of the individual functional entities and the order of the IUCs indicates the order of reaction of the functional entities. Preferably, a primer region is presented adjacent to the sequence of IUCs to allow for a later amplification of the nucleic acid sequence.
(63) The sub-library is contacted with a plurality of building blocks comprising a transferable functional entity and an internal unique anti-codon (IUA) complementary to at least one of the IUCs. The complementing identifier region is provided with sufficient complementarity to provide for a hybridisation with the oligonucleotide identifier region. In a preferred embodiment the IUCs not identifying a functional entity to be transferred is opposed in the complementary identifier region with a neutral binding region. As mentioned above the neutral binding region may comprise universal bases, i.e. bases that have the ability to be paired with two or more of the naturally occurring nucleobases. Adjacent to the region comprising specific base-pairing sequences and non-specific base-pairing sequences, i.e. the complementary identifier region is a unique anticodon (UA). The UA comprises the same information as the IUA of the complementing identifier region, typically the UA and the IUA has the same sequence on nucleotides.
(64) The transfer step and the reaction step are conducted in several cycles as described above to form a bifunctional complex. In
(65) Using the above encoding method, allows for the amplification of more and more focused sub-libraries to obtain a sufficient amount of material to allow decoding.
(66) The encoding method shown in
(67)
(68) In
(69) In
(70)
(71)
(72)
(73) The nascent bifunctional molecule is divided into a plurality of compartments, usually wells of a microtiter plate or similar equipment that allow easy handling of multiple spatially separated containers. Each of the compartments is reacted with a specific small molecule fragment, also referred to herein as a reactant. Thus, in a first compartment, the nascent bifunctional molecule is reacted with a first small molecule fragment (F.sub.1), in a second compartment; the nascent bifunctional molecule is reacted with a second small molecule fragment (F.sub.2), etc. The number of compartments may in principle be indefinite, however, for practical reasons; the number is usually between 5 and 5000, such as 10 and 500. In each of the compartments the small molecule fragments may be identical or different as the case may be. In each compartment, one, two, or more reactants may participate in the reaction. After the reaction between the drug fragment and the nascent bifunctional molecule has occurred in each compartment, a tag is added, said tag identifying the small molecule fragment. In certain aspects of the invention, the tag is a nucleic acid. Thus, in the first compartment, a first nucleic acid tag (T.sub.1) is added to the priming site of the reaction product, in the second compartment, a second nucleic acid tag (T.sub.2) is added to the priming site of the second reaction product, etc. Various methods for enzymatic encoding are contemplated and discussed herein. Following the enzymatic addition of the tags in each of the compartments, the contents of the compartments are collected.
(74) In a second round the mixture of bifunctional molecules is split into compartments again. The number of compartments of the second round need not be the same as the number of compartments in the first round. In each compartment the products of the previous round serves as the nascent bifunctional molecule. Thus, a reactive group appearing on the reaction product between the scaffold and the small molecule fragment of the first round is reacted with one or more small molecule fragments of the second round. Thus, in a first compartment, the mixed reaction products of the first round are reacted with a first small molecule fragment (F.sub.1), in a second compartment, the mixed reaction products of the first round are reacted with a second small molecule fragment (F.sub.2), etc.
(75) The small molecule fragments F.sub.1, F.sub.2, . . . F.sub.X of the second round may be identical or different from the small molecule fragments used in the first round.
(76) After the reactions have been allowed to occur, a tag specifying the small molecule fragment is added. The tag added in the first round usually comprises a priming site that can be used for addition of the tag in the second round so as to produce a linear identifier comprising the tags. In the first compartment, the reacted product is added a first tag which identifies the reactant of the second round that has reacted with the reactive reaction site of the nascent bifunctional molecule; in a second compartment, the product reacted with the second small molecule fragment of the second round is added the tag identifying said reactant, etc. Following the addition of the tags in each compartment, the content of the compartments are mixed in a common pool. The split-reaction-combining cycle can be repeated an appropriate number of times to obtain a library of bifunctional molecules comprising a display molecule part and a coding part. The library may be used in a selection process disclosed elsewhere herein.
(77) Above, the general principle for split-and-mix is disclosed, in which the reaction of the small molecule fragment and the chemical reaction site occurs prior to the encoding step. Obviously, the events can occur in the reverse order or simultaneously.
(78)
(79) Then the content of each well is combined and, optionally, divided into a range of wells again for a second round of reaction and encoding. In final step, the combined contents of the wells are used in a selection or partition step, as disclosed herein.
(80)
(81)
(82)
(83) Free Reactant/Polymerase Encoding:
(84) A nascent bifunctional complex comprises a scaffold (=chemical reaction site) comprising a reactive group and an oligonucleotide part comprising a codon identifying the scaffold. The codon is associated with an oligonucleotide binding region capable of forming a hybridisation product with a complementing binding region of an anti-codon oligonucleotide. The hybridisation product is subjected to an extension reaction, in which the scaffold oligonucleotide is extended over the anti-codon, thereby providing the scaffold oligonucleotide with a codon. Subsequent, simultaneously with or prior to the extension reaction, a free reactant coded for by the anti-codon is reacted with the scaffold.
(85) Zipper Building Block/Polymerase:
(86) A nascent bifunctional complex comprises a scaffold (=chemical reaction site) comprising a reactive group and an oligonucleotide part comprising a codon identifying the scaffold. The codon is associated with two oligonucleotide binding region capable of forming a hybridisation product with a complementing binding region of an anti-codon oligonucleotide and a complementing binding region of the reactant. The hybridisation product is subjected to an extension reaction, in which the scaffold oligonucleotide is extended over the anti-codon, thereby providing the scaffold oligonucleotide with a codon. Subsequent, simultaneously with or prior to the extension reaction, a functional entity coded for by the anti-codon is reacted with the scaffold. The selection of polymerase may determine the order of reaction and encoding as some polymerase, such as Sequenase, displaces the binding region attached to the functional entity, while other polymerases, like Taq polymerase, do not perform the displacement of the binding region. When a zipper building block is used a close proximity between the scaffold and the functional entity is obtained thereby promoting a reaction to take place.
(87) E2 Building Block/Polymerase Encoding:
(88) A nascent bifunctional complex comprises a chemical scaffold and an oligonucleotide part comprising the codon identifying the scaffold. The oligonucleotide part comprises two binding region on each sides of the codon. An E2 building block anneals to the scaffold oligonucleotide such that the functional entity comes in close proximity as to the scaffold and a double helix is formed just before the anti-codon, thus enable a polymerase to recognize the double helix as a binding area. Applying appropriate conditions and substrates enable the extension of the identifier oligonucleotide over the anti-codon, thus transcribing the genetic information of the function entity to the identifier. Opposing the scaffold codon is a stretch of universal binding nucleotides, such as inosine. Use of an E2 building block allows for one-pot synthesis of a library.
(89) Loop Building Block/Polymerase Encoding:
(90) A nascent bifunctional complex comprises a chemical scaffold and an oligonucleotide part comprising the codon identifying the scaffold. The oligonucleotide part comprises two binding region on each sides of the codon. A loop building block anneals to the scaffold oligonucleotide such that the functional entity comes in close proximity as to the scaffold and a double helix is formed just before the anti-codon, thus enable a polymerase to recognize the double helix as a binding area. Applying appropriate conditions and substrates enable the extension of the identifier oligonucleotide over the anti-codon, thus transcribing the genetic information of the function entity to the identifier. As no sequence on the building block complements the scaffold codon sequence, this codon sequence loops out. Use of a loop building block allows for one-pot synthesis of a library.
(91) N Building Block/Polymerase Encoding:
(92) A nascent bifunctional complex comprises a chemical scaffold attached to a scaffold codon through a linker. On one or each side of the codon a binding region is present. An N building block comprises a binding region which is complementary to the scaffold binding region and an anti-codon. A functional entity is attached to the codon or a binding region. Under hybridisation conditions the complementary binding regions hybridise and a polymerase extends in both directions, thereby transferring the genetic information of the anti-codon to the oligonucleotide covalently connected to the scaffold. Before, after or simultaneously with the extension reaction, the reaction between the functional entity and the scaffold may take place. Usually, the functional entity is attached to the anti-codon oligonucleotide via a cleavable linker so as to allow for transfer of the functional entity to the scaffold structure.
(93) Free Reactant/Ligase:
(94) A scaffold entity is attached to an oligonucleotide comprising a codon. The scaffold oligonucleotide further comprises a priming site to which a codon oligonucleotide is ligated. The ligation is performed by a ligase. The ligation can take place in a single stranded or double stranded form. In the single stranded form, a 3 OH (or 5-phosphate) of the scaffold oligonucleotide is ligated to a 5-phosphate (or 3-OH) of the codon oligonucleotide. In the double stranded form, an oligonucleotide complementing the ends of the scaffold and codon oligonucleotides, respectively, is used and designed so that the ends abuts each other. Optionally, the ligation occurs between two double stranded oligonucleotides, i.e. a double stranded scaffold oligonucleotide with an over hang (sticky end) is ligated to a double stranded codon oligonucleotide provided with a complementing overhang. The type of ligation depends on the selected enzyme. Usually, the double stranded ligation is preferred because the reaction is faster due to the guiding effect of the oligonucleotide complementing the ends. The complementing oligonucleotide is also referred to herein as the splint oligonucleotide. Following, preceding, or simultaneously with the ligation of the codon oligonucleotide to the scaffold oligonucleotide a reaction between the free reactant and the scaffold takes place.
(95) Zipper Building Block/Ligase:
(96) A scaffold entity is attached to an oligonucleotide comprising a codon and binding region between the scaffold and the codon. The scaffold oligonucleotide further comprises a priming site to which a codon oligonucleotide is ligated. The ligation is performed by a ligase. The ligation can take place in a single stranded or double stranded form. In the single stranded form, a 3 OH (or 5-phosphate) of the scaffold oligonucleotide is ligated to a 5-phosphate (or 3-OH) of the codon oligonucleotide. In the double stranded form, an oligonucleotide complementing the ends of the scaffold and codon oligonucleotides, respectively, is used and designed so that the ends abuts each other. Optionally, the ligation occurs between two double stranded oligonucleotides, i.e. a double stranded scaffold oligonucleotide with an over hang (sticky end) is ligated to a double stranded codon oligonucleotide provided with a complementing overhang. The type of ligation depends on the selected enzyme. Usually, the double stranded ligation is preferred because the reaction is faster due to the guiding effect of the oligonucleotide complementing the ends. The complementing oligonucleotide is also referred to herein as the splint oligonucleotide. A zipper building block is a functional entity attached to a binding oligonucleotide. The binding oligonucleotide is complementing the binding region of the scaffold oligonucleotide, thus forming a hybridisation product under hybridisation conditions. Following, preceding, or simultaneously with the ligation of the codon oligonucleotide to the scaffold oligonucleotide a reaction between the functional entity and the scaffold takes place. The use of the binding region on the reactant ensures a close proximity between the functional entity and the scaffold.
(97) E2 Building Block/Ligational Encoding:
(98) Initially is provided a nascent bifunctional complex comprising a scaffold attached to an oligonucleotide, said oligonucleotide comprising a codon and a binding region between the scaffold codon and the scaffold codon. The scaffold oligonucleotide also comprises a priming site to which a codon oligonucleotide can be ligated. The scaffold oligonucleotide is hybridised to an E2 building block which carries a double stranded part. The oligonucleotide complementing the anticodon as ligated to the scaffold oligonucleotide using the E2 building block as a template. Before, after or simultaneously with the ligation a reaction takes place between the functional entity and the scaffold.
(99) Loop Building Block/Ligational Encoding:
(100) A bifunctional complex is provided comprising a scaffold attached to an oligonucleotide, wherein the scaffold oligonucleotide comprises a codon flanked by two binding regions. A loop building block is provided which has binding regions complementing the binding regions of the scaffold oligonucleotide. Upon hybridisation, the codon part of the scaffold oligonucleotide loops out. The loop building block also comprises a double stranded codon part. The oligonucleotide complementing the anti-codon part of the loop building block is ligated to the free binding region of the scaffold oligonucleotide. Before, after or simultaneously with the ligation a reaction takes place between the functional entity and the scaffold.
(101) N Building Block/Ligational Encoding:
(102) A nascent bifunctional complex is initially provided in which a scaffold via a suitable linker is attached the codon identifying said scaffold or attached to a binding region connect to the codon. A building block having a functional entity connected to a codon is the ligated to the scaffold oligonucleotide to connect the scaffold oligonucleotide with functional entity oligonucleotide. The ligation may be performed in a single stranded or in a double stranded state, depending on the particular enzyme selected for the ligation. Subsequently, the functional entity is reacted with the scaffold. In the alternative, the functional entity and the scaffold are reacted prior to ligation of the respective oligonucleotides.
(103) When a round, i.e. a reaction with and a tagging of the nascent bifunctional complex, has been completed in accordance with any of the above encoding methods, a new round maybe in initialized according to any of the above reaction/encoding methods. Thus, the encoding and reaction in a first round may be the same or different in a subsequent second or further round. A single bifunctional complex or a library of complexes may be generated. When a library is contemplated, one-pot-synthesis can be conducted with the building blocks in which a covalent link between the functional entity and the codon/anti-codon is used, i.e. the columns of E2 building block, loop building block, and N building block. Split and mix synthesis can be performed, when no covalent link between the functional entity/reactant and the codon/anti-codon is present, i.e. in the columns indicating the free reactant and the zipper building block.
(104)
(105) To the right is illustrated an example, in which the above method is applied on 100 different scaffold oligonucleotides and 100 building blocks. The hybridisation product between the scaffold oligonucleotides and the building block oligonucleotides is divided into 100 different wells. In each of the wells the extension, addition of anti-codon oligonucleotide and reaction with specific free reactant is allowed. In total 10.sup.6 different bifunctional molecules are generated.
(106)
(107)
(108) To the right a particular embodiment for carrying out the present invention is indicated. Accordingly, each codon is 5 nucleotides in length and the binding regions flanking the scaffold are 20 nucleotides each. The building blocks designed to hybridise to the binding regions of the scaffold comprises a 20 nucleotide complementing sequence as well as a 5 nucleotide codon.
(109) An embodiment of the enrichment method of the present invention is shown on
(110) The purified matter may comprise anti-tag sequences not hybridised to a cognate sequence. As these anti-tag sequences are not coupled to a chemical compound to be selected for, the enrichment sequences may remain in the media. However, in some applications it may be preferably to make the excess anti-tag sequences double stranded, as illustrated in
(111) The purified fraction is in step 4 is subjected to a selection process. The selection comprises probing for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its purification handle.
(112) In step 5 isolated anti-tags are optionally amplified through the use of PCR or RTPCR.
(113) In step 6, the initial library of tagged entities produced in step 1, may undergo further rounds of complexation and screening, i.e. the anti-tags from step 5 may be added the library of tagged entities of step 1 and then be submitted to step 3, step 4 and step 5. Step 6 may be repeated.
(114) In step 7, the isolated anti-tags of step 5 may be cloned and their identity be revealed. E.g. in the case of DNA, sequencing may be applied whereby the identity of specific entities with selected properties in the library of tagged entities will be revealed.
(115) The embodiment shown in
(116) In
(117) In step 2, all tags of tagged entities are made double stranded by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of nucleases.
(118) The mixture, is probed for a set of properties in step 3, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @-handle.
(119) Isolated anti-tags may optionally be amplified in step 4 through the use of PCR or RTPCR.
(120) In step 5, the library of tagged entities of step 1, may undergo complexation to the isolated and optionally amplified anti-tags of step 3 and 4.
(121) Single stranded components are being digested in step 6 by use of e.g. nucleases. The remaining double stranded subset of the library is optionally subjected to a renewed enrichment of the library according to step 3-6. Steps 3-6 may be repeated as sufficient number of times to obtain an appropriate chemical entity having the desired property.
(122) In step 7, the isolated anti-tags of step 4 can be cloned and their identity be revealed, e.g. in the case of DNA, sequencing may be applied, whereby the identity of specific entities in the library of tagged entities is revealed.
(123)
(124) In step 2, mixtures of tagged entities are combined with a set of complementary anti-tags. Anti-tags may be, but is not limited to nucleotide derivatives. Anti-tags may optionally carry a @-handle. The tag and the anti-tags are allowed to form a complex. The complexation may be, but is not limited to hybridization. Some anti-tags will not form a complex with a tagged entity and some tagged entities will not form a complex with an anti-tag.
(125) Non-complexed components is digested in step 3 using e.g. nucleases when the tags and/or anti-tags are composed of DNA or RNA or hybrids thereof.
(126) The mixture of step 3, is probed for a set of properties in step 4, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @handle. Step 4 may be repeated one or more times.
(127) Isolated anti-tags may optionally be amplified through the use of PCR or RTPCR as illustrated in step 5. Anti-tags may then also be used as described in
(128) The isolated anti-tags may be cloned and their identity be revealed in step 6, e.g. in the case of DNA, sequencing may be applied, whereby the identity of specific entities in the library of tagged entities will be revealed.
(129) According to
(130) All tags of tagged entities are made double stranded in step 2 by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of e.g. nucleases.
(131) In step 3, the mixture is probed for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to tags having appended entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @-handle. Step 3 may be repeated one or more times.
(132) According to step 4, isolated anti-tags may optionally be amplified through the use of PCR or RTPCR. Anti-tags may then also be used as described in
(133) The isolated anti-tags may be cloned in step 5 and their identity be revealed, e.g. in the case of DNA, sequencing may be applied. Whereby, the identity of specific entities in the library of tagged entities will be revealed.
(134)
(135) In step 2, the mixture is probed for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Step 2 may be repeated.
(136) All tags of tagged entities are made double stranded in step 3 by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of e.g. nucleases.
(137) Anti-tags complexed to tags of entities binding to the specific protein may be recovered/be isolated in step 4 through e.g. the use of its @-handle. Anti-tags may optionally be amplified through the use of PCR or RTPCR. Anti-tags may then also be used as described in
(138) The isolated anti-tags may be cloned in step 5 and their identity be revealed, e.g. in the case of DNA, sequencing may be applied, whereby, the identity of specific entities in the library of tagged entities is revealed.
(139)
(140)
(141)
(142)
EXAMPLES
Example 1: Loading of a Scaffold Onto Identifier Molecules
(143) An amino-modifier C6 5-labeled identifier oligo (5-X-TCGTAACGACTGAATGACGT-3, (SEQ ID NO: 5) wherein X may be obtained from Glen research, cat. #10-1039-90) was loaded with a peptide scaffold (Cys-Phe-Phe-Lys-Lys-Lys, CFFKKK, SEQ ID NO: 6) using SPDP activation (see below). The SPDP-activation of amino-oligo was performed using 160 l of 10 nmol oligo in 100 mM Hepes-KOH, pH=7.5, and 40 l 20 mM SPDP and incubation for 2 h at 30 C. The activated amino-oligo was extracted 3 times with 500 l EtOAc, dried for 10 min in a speed-vac and purified using micro bio-spin column equilibrated with 100 mM Hepes-KOH. The loading of scaffold was then performed by adding 10 l of 100 mM attachment entity and incubating overnight at 30 C.
(144) The loaded identifier oligo was precipitated with 2 M NH.sub.4OAc and 2 volume 96% ethanol for 15 min at 80 C. and then centrifuged for 15 min at 4 C. and 15.000 g. The pellet was re-suspended in water and the precipitation was repeated. Wash of the oligo-pellet was done by adding 100 l of 70% ethanol and then briefly centrifuged. The oligo was re-dissolved in 50 l H.sub.2O and analysed by MS. The MS analysis was performed after 100 pmol oligo in 10 l water was treated with 10 l of ion exchanger resin and incubated minimum 2 h at 25 C. on a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed using a mass spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass, as can be seen in
(145) ##STR00040##
(146) In the above figure, the DNA sequence is SEQ ID NO:5 and the peptide sequence (which, in the figure, is in reverse order, C-terminal to N-terminal is SEQ ID NO:6.
Example 2: Loading of Functional Entities Onto Building Blocks
(147) Loading of functional entities onto building block molecules can be done using a thiol-oligo (see below). An Biotin 5 labeled and thio-modifier C6 SS (obtainable from Glen Research, cat #10-1936-90) 3-labeled building block oligo (5-BTGCAGACGTCATTCAGTCGTTACGA-3 SEQ ID NO: 7) was converted to an NHS-oligo using NHM.
(148) 10 nmol oligo was dried in speed-vac, re-dissolved in 50 l 100 mM DTT, 100 mM sodium-phosphate pH 8.0 and incubated at 37 C. for 1 hour. The thiol-oligo was then purified using micro bio-spin column equilibrated with 100 mM Hepes-KOH, pH 7.5. The thiol-oligo was converted to NHS-oligo by adding 100 mM NHM in 100 mM Hepes-KOH pH. 7.5. The sample was incubated at 25 C. over night. The NHS-oligo was then purified using bio-spin column equilibrated with MS-grade H.sub.2O.
(149) ##STR00041##
(150) In the above figure, the DNA sequence is SEQ ID NO:7.
(151) The MS analysis was performed after 100 pmol oligo in 10 l water was treated with 10 l of ion exchanger resin and incubated minimum 2 h at 25 C. on a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed using a mass spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass as can be seen in
(152) The NHS-oligo was then used to load functional entities. EDC activation of the functional entity (4-pentynoic acid) was performed mixing 50 l of 200 mM functional entity in DMF with 50 l of 200 mM EDC in DMF and incubated for 30 min at 25 C. on a shaker. The loading was then performed using 1 nmol NHS-oligo lyophilized in a speed-vac and 10 l of the activated building block (see below). This was incubated at 25 C. for 5 min and then mixed with 30 l 100 mM MES pH. 6.0. The loaded NHS-oligo was purified using bio-spin column equilibrated with 100 mM MES pH 6.0. The loaded building block oligo is then used immediately for the transfer reaction without any MS analysis. This is due to the unstable structure of the functional entity during the conditions used for the MS measurements.
(153) ##STR00042##
(154) In the above Figure, the DNA sequence is SEQ ID NO:7.
(155) This experiment exemplifies a complete loading of a functional entity onto a building block molecule ready for transfer to an recipient reactive group when annealed to the complementary identifier molecule.
(156) Another example of a functional entity that can be loaded as described above onto a building block is a 5-hexynoic acid as shown below. Again, no MS analysis was performed on this compound due to the unstable structure of the functional entity in the conditions used in the MS measurements.
(157) ##STR00043##
(158) In the above figure, the DNA sequence is SEQ ID NO:7
Example 3: Transfer of Functional Entities from the Building Block to the Identifier Molecule
(159) The attachment entity (AE) in the following experiments are either a scaffold, e.g. the peptide, CFFKKK (SEQ ID NO: 134), loaded on an identifier as prepared in Example 1 or a recipient reactive group exemplified by an amino modified oligonucleotide used as starting material in Example 1. These attachment entities allow transfer of three or one functional entities, respectively.
(160) The identifier used in this experiment is an identifier oligonucleotide loaded with CFFKKK as described in Example 1. The functional entity (FE) in this experiment is the 4-Pentynoic acid, the loading of which was described in Example 2. The identifier molecule loaded with the scaffold is annealed to the loaded building block molecule to bring the attachment entity and the functional entity in close proximity. The annealing is directed by the identifier region in the identifier molecule and the complementary sequence in the building block molecule.
(161) ##STR00044##
(162) After the annealing step between the identifier and building block molecules, the transfer reaction takes place where the functional entity is transferred to the identifier molecule.
(163) The annealing was performed using 600 pmol of the building block and 400 pmol identifier molecules in 0.1 M MES buffer at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transferred to the one of the amino group on the attachment entity on the identifier molecule during the annealing (see below). After annealing the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 in a shaker. After incubation the resin was centrifuged down and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile. The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass (see
(164) ##STR00045##
(165) In the above figure, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.
(166) Another example of transfer of functional entity is shown below using the amino oligo directly as the AE on the identifier molecule. The functional entity on the building block molecule used in this experiment was 4-pentynoic acid, as disclosed in example 2.
(167) The annealing was performed using 500 pmol of the building block and the identifier molecules in 0.1 M MES buffer and incubating the mixture at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transfer to the amino group on the identifier molecule during the annealing (see below). After annealing and transfer the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 in a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile.
(168) ##STR00046##
(169) In the above figure, in both the starting materials and the products, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.
(170) The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass was 6398.04 Da, which correspond well with the calculated mass, 6400.00 Da. Thus, the MS spectra of the identifier molecule after transfer of the functional entity show a mass corresponding to the transferred functional entity on the identifier molecule. This example shows that functional entities can be transferred using this setup of a building block molecule and an identifier molecule.
(171) Another example of transfer of functional entity is shown below using the amino oligo directly as the identifier molecule. The functional entity used in this experiment was 5-Hexpoic acid, prepared as shown in example 2.
(172) The annealing was performed using 500 pmol of the building block and 500 pmol of the identifier molecules in 0.1 M MES buffer incubated at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transferred to the amino group on the identifier molecule (see below). After annealing and transfer the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 C. in a shaker. After incubation the resin was removed by centrifugion and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile.
(173) ##STR00047##
(174) In the above figure, in both the starting materials and the products, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.
(175) The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass was 6411.96 Da, which correspond well with the calculated mass, 6414 Da. Thus, the MS spectra of the identifier molecule after transfer of the functional entity show a mass corresponding to the transferred functional entity onto the identifier molecule. This example shows that functional entities can be transferred using this setup of a building block molecule and an identifier molecule.
Example 4: Extension of the Identifier Molecule to Transfer Unique Codons
(176) After the transfer of the functional entity (FE) to the attachment entity (AE) on the identifier molecule, the identifier molecule is extended in order to transfer the unique codon, that identifies the transferred functional entity, to the identifier molecule. This is accomplished by adding a suitable polymerase and a polymerase buffer containing the wild type nucleotides (dATP, dTTP, dCTP, dGTP). This will extend the identifier molecule in the 3-end towards the end of the 5-end of the building block molecule. The extension of the identifier molecule to transfer the unique anticodon(s) is preferably performed after the transfer of the FE as shown below.
(177) ##STR00048##
(178) The extension was performed using 15 units Taq polymerase in a buffer containing 0.4 mM of each nucleotide in an extension buffer (20 mM HEPES-KOH, 40 mM KCl, 8 mM MgCl.sub.2, pH=7.4). After the extension reaction the sample was analyzed using MS. The MS analysis was performed using about 100 pmol purified extension mixture in a half volume of ion exchanger resin and incubated minimum 2 h at 25 C. in a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus).
(179) The MS data for extension on the identifier molecule with a transferred 4-Pentynoic acid is shown
(180) The observed mass was 7922.53 Da, which correspond well with the calculated mass, 7924.00 Da. The MS spectra of the identifier molecule after the transfer reaction of the functional entity and extension reaction of the encoding region (the unique codon) showed a mass corresponding to the transferred functional entity and the extension on the identifier molecule. This example shows that functional entities can be transferred using this setup with a longer building block molecule than the identifier molecule and that the identifier molecule can be extended using a polymerase after the transfer process. This shows the possibility to transfer both the functional entity and the unique codon from the same building block to an identifier molecule.
(181) Another example showing transfer and extension is for the building block with the functional entity 5-Hexynoic acid. The MS data for extension on the identifier molecule with a transferred 5-Hexynoic acid is shown in
(182) The observed mass was 7936.99 Da, which correspond well with the calculated mass, 7938.00 Da. The MS spectra of the identifier molecule after transfer reaction of the functional entity and extension reaction of the encoding region (the unique codon) showed a mass corresponding to the transferred functional entity and the extension on the identifier molecule. This example also shows that functional entities can be transferred using this setup with a longer building block molecule than the identifier molecule and the identifier molecule can be extended using a polymerase after the transfer process. This exemplifies the possibility to transfer both the functional entity and the unique codon from one building block molecule to one identifier molecule.
Example 5: Library Design
(183) The identifier molecule can be designed to operate optimal under various conditions. However, it should contain a few elements that are vital for the function. The identifier molecule should comprise of a sequence that can anneal to the building block and an attachment entity that can accommodate various functional entities. Below is an example on how an identifier molecule can be designed in the extension region. The region that becomes extended during each step of transfer and encoding can be designed using various approaches. Importantly, there must be a base-pair match between the building block and the identifier to allow efficient extension using a polymerase. This can be accomplished using either a region that is constant, the binding region as described in
(184) The first step in the extension process needs no special binding region due to the match of the identifier and the building block molecules (step 1 shown below). However, the subsequently steps needs a binding region sufficient complementary to the identifier molecule to allow for hybridisation because the enzyme, preferably a polymerase must be able to bind to the douplex and perform an extension. The example below shows four steps in the encoding procedure. This process of extension can be continued to obtain the suitable number of transfer of building blocks. The binding region in this example contains 6 nucleotides, but this can be varied dependent on the design of the building blocks.
(185) A possibility to accommodate the possible mismatches in the previous anticodon is to use universal nucleobases, i.e. a nucleobases with the ability to base pair with more than one of the natural nucleobases. A possible base is inosine which can form base pairs with cytidine, thymidine, and adenosine (although the inosine:adenosine pairing presumably does not fit quite correctly in double stranded DNA, so there may be an energetic penalty to pay when the helix bulges out at this purine:purine pairing). In principle, any design that allows extension of the unique codons is possible to use.