Method for the synthesis of a bifunctional complex

Abstract

Disclosed is a method for obtaining a bifunctional complex comprising a display molecule part and a coding part, wherein a nascent bifunctional complex comprising a chemical reaction site and a priming site for enzymatic addition of a tag is reacted at the chemical reaction site with one or more reactants, and provided with respective tag(s) identifying the reactant(s) at the priming site is using one or more enzymes.

Claims

1. A split-and-mix method for synthesizing a library of bifunctional complexes comprising a reaction product and a double-stranded identifier oligonucleotide, the method comprising: (1) performing a first synthesis round comprising: providing nascent bifunctional complexes to two or more compartments, each nascent bifunctional complex comprising one or more chemical reaction sites and a site for the addition of an oligonucleotide tag, reacting in each compartment the one or more chemical reaction sites with one or more reactants, adding in each compartment one or more oligonucleotide tags identifying the one or more reactants to the site for addition of the oligonucleotide tag, and pooling together the contents of two or more compartments to provide an admixture of bifunctional molecules; (2) performing a second synthesis round comprising: providing an admixture of second round nascent bifunctional complexes to two or more compartments, wherein the second round nascent bifunctional molecules are obtained from a method comprising one or more first synthesis rounds, reacting in each compartment one or more chemical reaction sites on the second round nascent bifunctional complexes with one or more reactants, adding in each compartment one or more oligonucleotide tags identifying the one or more reactants to a site on the second round nascent bifunctional complexes for addition of an oligonucleotide tag, and producing an admixture of bifunctional complexes each comprising a reaction product and an identifier oligonucleotide comprising the oligonucleotide tags identifying the reactants that participated in the formation of the reaction product; (3) contacting one or more bifunctional complexes obtained from a method comprising one or more second synthesis rounds with a complimentary oligonucleotide capable of hybridizing with at least two oligonucleotide tags of the one or more bifunctional complexes, and (4) obtaining a library of bifunctional complexes in which each member of the library comprises a reaction product and a double-stranded oligonucleotide identifier.

2. The method of claim 1, wherein the reaction product is a small, non-polymeric molecule.

3. The method of claim 2, wherein the small, non-polymeric molecule is a scaffolded molecule.

4. The method of claim 1, wherein the library contains from 10.sup.5 to 10.sup.8 different bifunctional complexes.

5. The method of claim 1, wherein the complimentary oligonucleotide is attached to an attachment entity, and wherein the reaction product of a bifunctional complex is reacted with the attachment entity after hybridization of the identifier oligonucleotide to the complimentary oligonucleotide.

6. The method of claim 5, wherein the complimentary oligonucleotide is prepared by an extension reaction prior to reacting the reaction product of a bifunctional complex with the attachment entity.

7. The method of claim 5, wherein one or more attachment entities comprises a scaffold.

8. The method of claim 1, wherein the complimentary oligonucleotide comprises an assembly platform comprising a unique region that hybridizes to at least two oligonucleotide tags of an identifier oligonucleotide of a first bifunctional complex.

9. The method of claim 8, wherein the assembly platform also hybridizes to an identifier oligonucleotide of a second bifunctional complex.

10. The method of claim 9, wherein the assembly platform comprises anti-tags that hybridize to at least two oligonucleotide tags of an identifier oligonucleotide of a first and/or second bifunctional complex.

11. The method of claim 10, further comprising reacting the reaction products of first and second bifunctional complexes hybridized to the assembly platform.

12. The method of claim 11, wherein a region of a complimentary oligonucleotide capable of hybridizing to an identifier oligonucleotide of a first and/or second bifunctional complex is prepared by an extension reaction prior to reacting the reaction products of the first and second bifunctional complexes.

13. The method of claim 1, wherein oligonucleotide tags are added by enzymatic ligation.

14. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed in a single stranded state.

15. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed in a double stranded state.

16. The method of claim 14, wherein enzymatic ligation of oligonucleotide tags is performed in a double stranded state in which two oligonucleotide tags are kept together by a complementing oligonucleotide which hybridizes with ends of the two oligonucleotide tags to be ligated.

17. The method of claim 13, wherein enzymatic ligation of oligonucleotide tags is performed by a ligase selected from the group consisting of Taq DNA ligase, T4 DNA ligase, T4 RNA ligase, T7 DNA ligase, and E. coli DNA ligase.

18. The method of claim 1, wherein oligonucleotide identifier tags are added by chemical ligation.

19. The method of claim 1, wherein at least one oligonucleotide tag of an identifier oligonucleotide of a bifunctional complex is attached by an enzymatic reaction, and wherein further oligonucleotide tags are attached by chemical or enzymatic means.

20. The method of claim 1, wherein the complementary oligonucleotide comprises anti-tags, and wherein hybridization of the complementary oligonucleotide to at least two oligonucleotide tags of an identifier oligonucleotide of a bifunctional complex is random or semi-random.

21. The method of claim 1, wherein the complementary oligonucleotide is randomly or semi-randomly designed.

22. The method of claim 20, wherein the complementary oligonucleotide is randomly or semi-randomly designed by split-and-mix synthesis.

23. The method of claim 1, wherein the identifier oligonucleotide is randomly or semi-randomly designed.

24. The method of claim 23, wherein the identifier oligonucleotide is randomly or semi-randomly designed by split-and-mix synthesis.

25. The method of claim 1, wherein oligonucleotide tags are separated by a binding region having from 1 to 20 optionally modified nucleotides.

26. The method of claim 25, wherein the backbone of one or more nucleotides in the binding region is modified, said binding region modified nucleotides being selected from the group consisting of nucleotides having a 2-O-methyl substitution on the ribose, nucleotides having a 2-fluoro substitution on the ribose, peptide nucleic acids (PNA), and nucleotides having a 2-4 O-methylene cyclisation on the ribose.

27. The method of claim 1, wherein the identity of a reaction product is determined by sequencing oligonucleotide tags of an identifier oligonucleotide, or by sequencing anti-tags of a complementary oligonucleotide.

28. The method of claim 1, wherein reactants are generated by split-and-mix synthesis.

29. The method of claim 1, wherein identifier oligonucleotides or complementary oligonucleotides are generated by split-and-mix synthesis.

30. The method of claim 1, wherein identifier oligonucleotides and complementary oligonucleotides are generated by split-and-mix synthesis.

31. The method of claim 1, wherein one or more chemical reaction sites on a nascent bifunctional molecule comprises a scaffold.

32. The method of claim 1, wherein one or more identifier oligonucleotides are amplifiable.

33. The method of claim 1, wherein the bifunctional complexes comprise a linker moiety that identifies the one or more chemical reaction sites of the nascent bifunctional molecules having reacted with the one or more reactants in the first synthesis round.

34. The method of claim 33, wherein the linker moieties of the bifunctional complexes comprise an oligonucleotide tag identifying the one or more chemical reaction sites.

35. The method of claim 1, wherein the bifunctional complexes comprise a cleavable linker linking the reaction product and the identifier oligonucleotide.

36. The method of claim 35, wherein the cleavable linker is selected among linkers comprising enzymatic cleavage sites, linkers comprising a chemical degradable component, and linkers cleavable by electromagnetic radiation.

37. The method of claim 1, wherein the bifunctional complexes comprise a polyethylene glycol (PEG) linker.

38. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags contain natural nucleotides.

39. The method of claim 38, wherein the natural nucleobases of the nucleotides of the identifier oligonucleotide are selected from the group consisting of adenine, guanine, thymine, cytosine, 5-methylcytosine, and uracil.

40. The method of claim 38, wherein the natural nucleotides are selected from the group consisting of deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine, and from the ribonucleotide (RNA) constituents adenosine, guanosine, uridine, cytidine, and inosine.

41. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags contain natural and non-natural nucleotides.

42. The method of claim 41, wherein the natural nucleobases of the nucleotides of the identifier oligonucleotide comprise ribonucleotides selected from the group consisting of adenine, guanine, thymine, cytosine, 5- methylcytosine, and uracil.

43. The method of claim 41, wherein the natural nucleotides comprise deoxyribonucleotides selected from the group consisting of deoxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine, and from the ribonucleotide (RNA) constituents adenosine, guanosine, uridine, cytidine, and inosine.

44. The method of claim 41, wherein the nucleobase of the nucleotides is selected from the group consisting of adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N6-methyladenine, 7-deazaxanthine, 7-deazaguanine, N4,N4-ethanocytosin, N6,N6-ethano-2,6-diamino-purine, 5-methylcytosine, 5-(C3-C6)-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, and inosine.

45. The method of claim 1, wherein the identifier oligonucleotide tags and/or the complementary oligonucleotide anti-tags comprise one or more non-natural nucleotides.

46. The method of claim 45, wherein the ribose moieties of the non-natural nucleotides are selected from the group consisting of 2-0-methyl-ribose, 2-flour-ribose, and 2-4-0-methylene-ribose (LNA).

47. The method of claim 45, wherein the internucleoside linkages of the non- natural nucleotides are selected from the group consisting of natural phosphodiester linkages, phosphorothiote linkages, phosphodithioate linkages, methylphosphonate linkages, phosphoramidate linkages, phosphotriester linkages, and non-phosphorous-containing internucleoside linkage.

48. The method of claim 45, wherein the nucleobase of the nucleotides is selected from the group consisting of adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N6-methyladenine, 7-deazaxanthine, 7-deazaguanine, N4,N4-ethanocytosin, N6,N6-ethano-2,6-diamino-purine, 5-methylcytosine, 5-(C3-C6)-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2-hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, and inosine.

49. The method of claim 1, wherein identifier oligonucleotide tag backbone units are selected from the group consisting of ##STR00049## ##STR00050## ##STR00051## ##STR00052## wherein B denotes a nucleobase.

50. The method of claim 1, wherein the number of reaction compartments is in the first and second synthesis rounds is from 5 to 5,000.

51. The method of claim 1, wherein three synthesis rounds are performed, and the reaction product is identified by three oligonucleotide tags.

52. The method of claim 1, wherein chemical entity reactions are selected from the group consisting of an acylation reaction; an alkylation reaction; a vinylation reaction; an alkenylidation reaction; a transition metal catalyzed arylation reaction, a transition metal catalyzed hetarylation reaction, and a transition metal catalyzed vinylation reaction.

Description

BRIEF DESCRIPTION OF THE FIGURES

(1) FIG. 1 shows the components of the identifier and the building block

(2) FIG. 2 shows the principle of encoding by extension

(3) FIG. 3 shows the extension region of the building block

(4) FIG. 4 shows the components of the identifier and the building block with internal codons

(5) FIG. 5 shows the principle of encoding by extension with specific annealing

(6) FIG. 6 shows the encoding of scaffolded and polymer molecules

(7) FIG. 7 shows the encoding by extension using three-strand assembly principle

(8) FIG. 8 shows encoding by extension using three-strand assembly principle with specific annealing

(9) FIG. 9 shows the synthesis of three-strand identifier-displayed molecules using a solid-phase approach.

(10) FIG. 10 shows the sequential reaction/extension using platform assembly.

(11) FIG. 11 discloses a general scheme for alternating parallel synthesis of a combinatorial library.

(12) FIG. 12 discloses an encoding method using ligational encoding and a free reactant.

(13) FIG. 13 discloses a library generating method in which a reaction is followed be an encoding step.

(14) FIG. 14 discloses a library generation method using polymerase encoding.

(15) FIG. 15 discloses various embodiments for single encoding methods.

(16) FIG. 16 discloses a double encoding method.

(17) FIG. 17 discloses various double encoding methods.

(18) FIG. 18 discloses encoding using an loop building block.

(19) FIG. 19 discloses a method in which a flexible linker is used in the building block.

(20) FIG. 20 discloses a gel showing the result of an experiment according to example 6.

(21) FIG. 21 discloses a triple encoding method.

(22) FIG. 22 shows the setup used in example 9.

(23) FIG. 23 shows the split-and-mix structure used in example 9.

(24) FIG. 24 discloses an embodiment of library enrichment, amplification and identification.

(25) FIG. 25 shows an embodiment in which anti-tag sequences not hybridised to a identifier sequence are made double stranded and thus inert.

(26) FIG. 26 shows an embodiment in which an enrichment step is before the purification step.

(27) FIG. 27 shows a general principle of library enrichment, amplification, and identification.

(28) FIG. 28 shows a general principle of library enrichment, amplification, and identification omitting the intermediate amplification step between subsequent enrichment procedures.

(29) FIG. 29 shows a general principle of library enrichment, amplification, and identification in which the initial single stranded library is made double stranded prior to enrichment.

(30) FIG. 30 shows a general principle for library enrichment, in which the anti-tag is not formed until after the one and more enrichment processes.

(31) FIG. 31 shows two gels reported in example 13.

(32) FIG. 32 shows the result of the experiment reported in Example 14.

(33) FIG. 33 shows the result of the experiment reported in Example 14.

(34) FIG. 34 is a mass spectrogram showing the observed mass (7244.93 Da) for the sample of example 1.

(35) FIG. 35 is a mass spectrogram showing the observed mass (8369.32 Da) for the sample of example 2.

(36) FIG. 36 is a mass spectrogram showing the observed mass (7323.45 Da) for the first sample of example 3.

(37) FIG. 37 is a mass spectrogram showing the observed mass (6398.04 Da) for the second sample of example 3.

(38) FIG. 38 is a mass spectrogram showing the observed mass (6411.96 Da) for the third sample of example 3.

(39) FIG. 39 is a mass spectrogram showing the observed mass (7922.53 Da) for the first sample of Example 4.

(40) FIG. 40 is a mass spectrogram showing the observed mass (7936.99 Da) for the second sample of example 4.

(41) FIG. 41 is a mass spectrogram showing the observed masses or the template (15452.14 Da) and the extended primer (15328.92 Da) in the first experiment of example 5.

(42) FIG. 42 is a mass spectrogram showing the observed mass for the extended primer (28058.14 Da) for the second experiment of example 5.

(43) FIG. 43 is a flow chart for the production of one embodiment of a library of bifunctional complexes, as set forth in Example 7. DF: Drug fragment/functional entity; B: Biotin; SA: Streptavidin.

(44) FIG. 44 shows the retention time of the complex of Example 8 on a size-exclusion column.

(45) FIG. 45 is a mass spectrogram showing the observed mass (66716.11 Da) for the loaded oligo in Example 9, section 9.1.

(46) FIG. 46 is a mass spectrogram showing the observed masses for the starting benzaldehyde loaded L1 oligo (A) and the UGI product (B) in Example 9, section 9.2.

(47) FIG. 47 is a mass spectrogram showing the observed masses for the starting benzaldehyde loaded L1 oligo (A), dilcetopiperazine (B), UGI product (C) and amine product (H) in Example 9, section 9.3.

DETAILED DESCRIPTION OF THE FIGURES

(48) FIG. 1 discloses in panel A a hybridisation product between a nascent bifunctional complex and a building block. The nascent bifunctional complex, for short the Identifier, comprises an attachment entity connected to an oligonucleotide identifier region by a linker moiety. The attachment entity may be a single recipient reactive group having been adapted to receive a functional entity or may be a scaffold structure comprising one or more recipient reactive groups. In panel A the attachment entity is indicated as a scaffold having four reactive groups capable of receiving functional entities.

(49) The building block comprises a functional entity attached to an oligonucleotide which is sufficiently complementary to the identifier region to allow for a hybridisation product to be formed. The functional entity is able to be transferred to the attachment entity through a chemical reaction. The complementing identifier region further comprises a unique codon at the 3 or 5 end thereof. The unique codon identifies the functional entity in an unequivocal way.

(50) Following the formation of the hybridisation product between the identifier and the building block, the functional entity and the unique anti-codon are transferred to the identifier. In an aspect of the invention, the linker connecting the functional entity and the complementing identifier region is cleaved simultaneously with the reaction with the attachment entity resulting in a transfer of the functional entity to the attachment entity. Prior to, simultaneously with or subsequent to the transfer, the transcription of the codon occurs. The transcription is performed by an enzyme capable of polymerisation or oligomerisation of oligonucleotides using a template oligonucleotide to form a complementary stand. Usually a polymerase, such as the Pfu polymerase is used together with suitable dNTPs, i.e. a mixture of ATP, CTP, GTP, and TTP, to form the unique codon as an extension of the identifier strand using the unique anti-codon of the building block as template.

(51) FIG. 1, panel B illustrates a typical setup for a second transfer of functional entity. The identifier has been provided with a first functional entity and has been extended by a codon. Furthermore, the codon also comprises a binding region as an extension of the codon. The binding region is usually a constant region transferred to the identifier in the first transfer cycle by the first building block. The identifier forms a hybridisation product with a second building block. The second building block comprises a second functional entity connected to an oligonucleotide sufficient complementary to the identifier region of the identifier to allow for a hybridisation. A part of the complementing identifier region comprises a non-coding region and a region complementing the binding region. The non-coding region opposes the codon transferred in the first cycle and the complementing binding region is complementary to the binding region to allow for a hybridisation which is sufficiently strong for an enzyme to bind to the helix. A second unique anti-codon is attached to the complementary binding region and identifies the second functional entity. The second codon is transferred to the identifier using the second anti-codon as template in the same manner as described above for the first codon.

(52) FIG. 2 illustrates four cycles of functional entity and codon transfer. In the first cycle, a hybridisation product is formed between the identifier and building block. The hybridisation product ensures that the functional entity and the scaffold are brought into close spatial proximity, thus increasing the probability that a reaction will take place. The formation of a duplex between the two oligonucleotides also provides a binding region for a polymerase. In the presence of a polymerase, a mixture of dNTPs and a suitable puffer such as an aqueous solution containing 20 mM HEPES-KOH, 40 mM KCl and 8 mM MgCl.sub.2 and a pH adjusted to 7.4, the unique anti-codon (UA.sub.1) is transferred to the identifier as a codon.

(53) After the transfer of functional entity and codon, respectively, the spent building block is separated from the identifier by increasing the stringency. Usually, the stringency is increased by a increasing the temperature, changing the pH or by increasing the ionic strength. After the rupture of the duple helix structure, the identifier is recovered. In one aspect of the invention the identifier is immobilized to ease the separation from the spent building block. In another aspect the spent building block is degraded chemically or enzymatically. Following the recovery of the identifier a new cycle can be initiated by contacting the identifier with a further building block.

(54) The final product after four cycles of transfer is a bifunctional complex, which comprises a reaction product at one end and an encoding region at the other. The reaction product comprises constituents from the transferred functional entities and the initial scaffold. The encoding region comprises a genetic code for which entities that have been transferred in which order. Thus, the synthetic history may be decoded from the encoding region.

(55) FIG. 3 shows examples of the design of the coding area. Panel A, depicts a detailed view of an example of a design according to FIG. 1, panel B. The unique codon transferred in a first cycle is opposed by a partly mis-matching region. To compensate for the decrease in affinity a binding region is following the codon. The binding region is opposed by a matching complementary binding region of the building block.

(56) In FIG. 3, panel B the unique codon incorporated in a first cycle is opposed by a second building block having incorporated in the complementing identifier region a neutral binding region. The neutral binding region is not capable of discriminating between varieties of unique codons, but is able to show some kind of affinity towards the each of the codons. Usually, the neutral binding region comprises one or more universal bases and more preferred the neutral binding region comprises a sequence of universal bases opposing at least a part of the codon region on the identifier.

(57) FIG. 4 shows a hybridisation product between an identifier and a building block wherein the identifier has internal codons and the building block has corresponding anti-codons. The identifier region and the complementing identifier region can also contain specific unique codons and anti-codons, respectively.

(58) The use of internal codons is of particular importance when several rounds of selection are anticipated, especially when the encoded molecule is formed from a PCR product of a previous round. The internal anti-codons in the building block may completely or partly match the identifier sequence or may comprise one or more universal bases to provide for affinity but not for specificity. The role of the internal unique codons is only to guide the annealing between the identifier molecule and the building block molecule. The correct encoding is taken care of by the unique codons which are created in the extension process. These unique codons are passed on to the next generation of molecules and used to decode the synthetic history of the displayed molecules. This system will not be totally dependent on an accurate encoding function by the internal unique codons in order to pass the correct genotype to the next generation of identifier molecules.

(59) In panel A the hybridisation product provides for a spatial proximity between the functional entity and the attachment entity, thus increasing the probability that a reaction occurs. The unique codon templates the codon on the identifier sequence by an enzymatic extension reaction. In panel B a binding region is introduced between each unique coding sequence to provide for affinity of the two strands to each other even though one or more mis-matching bases appear in the codon:non-coding domain of a previously used codon.

(60) FIG. 5 shows an embodiment useful when an amplification step is involved between selections. Initially, a library of complexes is produced as depicted in FIG. 2. The library of the complexes may be subjected to a selection process. The selection process may involve presenting the display molecule on the complex to a target and subsequent selecting the display molecules which shows a desired interaction with the target. It may be advantageously to use relatively mild conditions during the selection process, to obtain a sub-library. The sub-library may be decoded to obtain information on the synthetic history for the entire sub-library. However, it is usually preferred to reduce the sub-library further before a decoding is performed.

(61) The sub-library may be reduced by subjecting it to the target again and use more stringent conditions. However, to obtain a higher number of each of the members of the sub-library before a second selection, it is generally preferred to amplify the complex. Thus, a primer which is loaded with a scaffold is initially annealed to a primer site at one end of the encoding region. Subsequently a transcript is formed. A reverse primer is preferably present to obtain a duple stranded PCR product having a scaffold attached thereto.

(62) This PCR is the basis for the generation of en amplification of the sub-library. The identifier sequence is segregated into a number of internal unique codons, abbreviated IUC in the drawing. The number of the IUCs corresponds to the number of functional entities participating in the formation of the display molecule. The sequence of the IUCs expresses the identity of the individual functional entities and the order of the IUCs indicates the order of reaction of the functional entities. Preferably, a primer region is presented adjacent to the sequence of IUCs to allow for a later amplification of the nucleic acid sequence.

(63) The sub-library is contacted with a plurality of building blocks comprising a transferable functional entity and an internal unique anti-codon (IUA) complementary to at least one of the IUCs. The complementing identifier region is provided with sufficient complementarity to provide for a hybridisation with the oligonucleotide identifier region. In a preferred embodiment the IUCs not identifying a functional entity to be transferred is opposed in the complementary identifier region with a neutral binding region. As mentioned above the neutral binding region may comprise universal bases, i.e. bases that have the ability to be paired with two or more of the naturally occurring nucleobases. Adjacent to the region comprising specific base-pairing sequences and non-specific base-pairing sequences, i.e. the complementary identifier region is a unique anticodon (UA). The UA comprises the same information as the IUA of the complementing identifier region, typically the UA and the IUA has the same sequence on nucleotides.

(64) The transfer step and the reaction step are conducted in several cycles as described above to form a bifunctional complex. In FIG. 5 four cycles are performed, however, it will be appreciated that less than cycles, such as 3 or 2 cycles can be performed to produce a reaction product comprising constituent from 3 or 2 functional entities respectively. Also more, than four cycles may be performed, such as 5 to 20 to form a more diverse library of display molecules. The complexes resulting form the cycles are a reaction product between the functional entities and the scaffold, and an oligonucleotide. The oligonucleotide can be divided into a guiding region, that is, the region that guided the annealing of the individual building blocks, and an encoding region, which comprises the unique codons which have been transferred from the building blocks to the identifier.

(65) Using the above encoding method, allows for the amplification of more and more focused sub-libraries to obtain a sufficient amount of material to allow decoding.

(66) The encoding method shown in FIG. 6 can create both monomer and polymer encoded molecules. Panel A: Complex reaction products can be created using an attachment entity which has reacted with multiple functional entities. Panel B: Polymers can be created using one attachment entity with one reactive group allowing attachment with a functional entity having at least two reactive groups.

(67) FIG. 7 illustrates a three strand assembly procedure for the encoding by extension principle. A: The identifier and building block can be assembled on an assembly platform. This assembly platform contains a unique anticodon region and a unique anticodon where these two elements are directly linked through their sequences. There may be a connecting region linking the unique anticodon region together with the complementing identifier region. B: Describes all the components of the identifier, building block and the assembly platform used in the consecutive reaction, where the identifier also contain a unique codon and a binding region and the assembly platform also contains a non-coding region and a complementing binding region.

(68) In FIG. 8 it is shown that internal codons can also be used for the three-strand assembly principle. This will be useful when selection will be performed in multiple rounds with intermediate amplification steps.

(69) In FIG. 8A the identifier comprises an attachment entity whereas in FIG. 8B it comprises FE.sub.1. Also, in FIG. 8B the identifier comprises a unique codon ER.sub.1 and a binding region, whereas in FIG. 8A it does not. Also, in FIG. 8B the assembly platform comprises a non-coding region and a complementing binding region between the complementing identifier region and the unique anticodon, whereas in FIG. 8A it does not.

(70) FIG. 9 shows a solid-phase three-strand displayed-molecule synthesis. The assembly platform molecule is attached to a solid support to allow sequential attachment of building blocks to the attachment entity. Different libraries of assembly platform molecules, which is extended with suitable non-coding regions and complementing binding regions, can be used in each step in separate vials. This will allow the use of identical building block and identifier molecules in each step.

(71) FIG. 10 shows the sequential transfer/extension using the assembly platform principle. Each well contains a library of platform molecules. The platform molecule is extended with one unique anticodon in the subsequent wells. A library of identifier and building block molecule is added to the first well which allows specific annealing and transfer of functional entities. The reaction mixture is the transferred to the next wells which finally generates the identifier-displayed library.

(72) FIG. 11 discloses a general scheme for alternating parallel synthesis of combinatorial libraries. In a first step a nascent bifunctional molecule is provided. The nascent bifunctional molecule comprises as one part of the molecule a reactive group, which may appear on a chemical scaffold, and some times referred to herein as a chemical reactive site. Another part of the bifunctional molecule comprises a priming site for addition of a tag. The priming site may be a 3-OH group or a 5-phosphate group of a nucleotide in case the tag is a nucleotide. The chemical reactive site and the priming site may optionally be spaced by a linking group. In the event that the linking group is resent it may be a nucleotide or a sequence of nucleotides. The spacing entity may further comprise a hydrophilic linker, such as a polyethylene or polypropylene, to distance the chemical reactive site from the nucleotide. Also comprised in the linking moiety may be a selective cleavable linker that allows the experimenter to separate the display molecule from the coding part.

(73) The nascent bifunctional molecule is divided into a plurality of compartments, usually wells of a microtiter plate or similar equipment that allow easy handling of multiple spatially separated containers. Each of the compartments is reacted with a specific small molecule fragment, also referred to herein as a reactant. Thus, in a first compartment, the nascent bifunctional molecule is reacted with a first small molecule fragment (F.sub.1), in a second compartment; the nascent bifunctional molecule is reacted with a second small molecule fragment (F.sub.2), etc. The number of compartments may in principle be indefinite, however, for practical reasons; the number is usually between 5 and 5000, such as 10 and 500. In each of the compartments the small molecule fragments may be identical or different as the case may be. In each compartment, one, two, or more reactants may participate in the reaction. After the reaction between the drug fragment and the nascent bifunctional molecule has occurred in each compartment, a tag is added, said tag identifying the small molecule fragment. In certain aspects of the invention, the tag is a nucleic acid. Thus, in the first compartment, a first nucleic acid tag (T.sub.1) is added to the priming site of the reaction product, in the second compartment, a second nucleic acid tag (T.sub.2) is added to the priming site of the second reaction product, etc. Various methods for enzymatic encoding are contemplated and discussed herein. Following the enzymatic addition of the tags in each of the compartments, the contents of the compartments are collected.

(74) In a second round the mixture of bifunctional molecules is split into compartments again. The number of compartments of the second round need not be the same as the number of compartments in the first round. In each compartment the products of the previous round serves as the nascent bifunctional molecule. Thus, a reactive group appearing on the reaction product between the scaffold and the small molecule fragment of the first round is reacted with one or more small molecule fragments of the second round. Thus, in a first compartment, the mixed reaction products of the first round are reacted with a first small molecule fragment (F.sub.1), in a second compartment, the mixed reaction products of the first round are reacted with a second small molecule fragment (F.sub.2), etc.

(75) The small molecule fragments F.sub.1, F.sub.2, . . . F.sub.X of the second round may be identical or different from the small molecule fragments used in the first round.

(76) After the reactions have been allowed to occur, a tag specifying the small molecule fragment is added. The tag added in the first round usually comprises a priming site that can be used for addition of the tag in the second round so as to produce a linear identifier comprising the tags. In the first compartment, the reacted product is added a first tag which identifies the reactant of the second round that has reacted with the reactive reaction site of the nascent bifunctional molecule; in a second compartment, the product reacted with the second small molecule fragment of the second round is added the tag identifying said reactant, etc. Following the addition of the tags in each compartment, the content of the compartments are mixed in a common pool. The split-reaction-combining cycle can be repeated an appropriate number of times to obtain a library of bifunctional molecules comprising a display molecule part and a coding part. The library may be used in a selection process disclosed elsewhere herein.

(77) Above, the general principle for split-and-mix is disclosed, in which the reaction of the small molecule fragment and the chemical reaction site occurs prior to the encoding step. Obviously, the events can occur in the reverse order or simultaneously.

(78) FIG. 12 schematically shows a 96 well microtiter plate to the left. In each well or in a selected number of wells, the process to the right occurs. Initially, a bifunctional molecule is provided. The bifunctional molecule comprise a chemical reaction site (oval) attached to a codon (rectangle) through a linker (line). To the left of the codon a binding region is provided. Next, a codon oligonucleotide and a splint oligonucleotide are added. The codon oligonucleotide is provided with a codon and flanking binding regions. The splint is designed with sequences complementing the binding region of the nascent bifunctional molecule and a binding region of the codon oligonucleotide such that the ends abut each other under hybridisation conditions. The nascent bifunctional complex, the splint and the codon oligonucleotide forms a hybridisation product under appropriate conditions. A ligase is added to couple the codon oligo to the nascent bifunctional complex. In a second step, a drug fragment, i.e. a reactant, is added and conditions providing for a reaction with the chemical reaction site is instituted.

(79) Then the content of each well is combined and, optionally, divided into a range of wells again for a second round of reaction and encoding. In final step, the combined contents of the wells are used in a selection or partition step, as disclosed herein.

(80) FIG. 13 outlines an embodiment with the encoding and reaction step reversed compared to the embodiment shown in FIG. 12. In a variety of wells a nascent bifunctional complex having a reactive group (Rx) attached to an oligonucleotide (horizontal line) is dispensed. In a first step, the reactive group in each compartment is reacted with a reactant, in a second step a codon oligonucleotide and a splint is added together with a ligase to ligate covalently the codon oligonucleotide to the reacted nascent bifunctional complex, and in a third step the ligation product is recovered. The content of the wells may subsequently be combined and used as a library of bifunctional complexes or recycled for another round of reaction and addition of tag.

(81) FIG. 14 discloses the use of the library produced in accordance FIG. 13, or any other library having a coding part and display molecule part, in a further round. Initially, the combined contents of the wells from the embodiment of FIG. 13 are dispensed in separate wells. Then an anti-codon oligonucleotide having a binding region which is complementary to the binding region of the nascent bifunctional molecule is added under hybridisation conditions, i.e. conditions which favour the assembly of the hybridisation product between the nascent bifunctional complex and the anti-codon oligonucleotide. Subsequently, or simultaneously with the addition of the anti-codon oligonucleotide, a polymerase, a collection of dNTP (usually, dATP, dGTP, dCTP, and dTTP), and appropriate salts and buffer are added to provide for an extension to occur. The extension (dotted arrow) transcribe the anti-codon to the identifier, thus attaching a tag that encodes the identity of the reactant subsequently reacted at the chemical reaction site. The anti-codon oligonucleotide is connected to a biotin (B) to allow for removal of the oligonucleotide.

(82) FIG. 15 discloses a scheme of various encoding methods combined with a collection of reactants. All the combinations are in according the invention.

(83) Free Reactant/Polymerase Encoding:

(84) A nascent bifunctional complex comprises a scaffold (=chemical reaction site) comprising a reactive group and an oligonucleotide part comprising a codon identifying the scaffold. The codon is associated with an oligonucleotide binding region capable of forming a hybridisation product with a complementing binding region of an anti-codon oligonucleotide. The hybridisation product is subjected to an extension reaction, in which the scaffold oligonucleotide is extended over the anti-codon, thereby providing the scaffold oligonucleotide with a codon. Subsequent, simultaneously with or prior to the extension reaction, a free reactant coded for by the anti-codon is reacted with the scaffold.

(85) Zipper Building Block/Polymerase:

(86) A nascent bifunctional complex comprises a scaffold (=chemical reaction site) comprising a reactive group and an oligonucleotide part comprising a codon identifying the scaffold. The codon is associated with two oligonucleotide binding region capable of forming a hybridisation product with a complementing binding region of an anti-codon oligonucleotide and a complementing binding region of the reactant. The hybridisation product is subjected to an extension reaction, in which the scaffold oligonucleotide is extended over the anti-codon, thereby providing the scaffold oligonucleotide with a codon. Subsequent, simultaneously with or prior to the extension reaction, a functional entity coded for by the anti-codon is reacted with the scaffold. The selection of polymerase may determine the order of reaction and encoding as some polymerase, such as Sequenase, displaces the binding region attached to the functional entity, while other polymerases, like Taq polymerase, do not perform the displacement of the binding region. When a zipper building block is used a close proximity between the scaffold and the functional entity is obtained thereby promoting a reaction to take place.

(87) E2 Building Block/Polymerase Encoding:

(88) A nascent bifunctional complex comprises a chemical scaffold and an oligonucleotide part comprising the codon identifying the scaffold. The oligonucleotide part comprises two binding region on each sides of the codon. An E2 building block anneals to the scaffold oligonucleotide such that the functional entity comes in close proximity as to the scaffold and a double helix is formed just before the anti-codon, thus enable a polymerase to recognize the double helix as a binding area. Applying appropriate conditions and substrates enable the extension of the identifier oligonucleotide over the anti-codon, thus transcribing the genetic information of the function entity to the identifier. Opposing the scaffold codon is a stretch of universal binding nucleotides, such as inosine. Use of an E2 building block allows for one-pot synthesis of a library.

(89) Loop Building Block/Polymerase Encoding:

(90) A nascent bifunctional complex comprises a chemical scaffold and an oligonucleotide part comprising the codon identifying the scaffold. The oligonucleotide part comprises two binding region on each sides of the codon. A loop building block anneals to the scaffold oligonucleotide such that the functional entity comes in close proximity as to the scaffold and a double helix is formed just before the anti-codon, thus enable a polymerase to recognize the double helix as a binding area. Applying appropriate conditions and substrates enable the extension of the identifier oligonucleotide over the anti-codon, thus transcribing the genetic information of the function entity to the identifier. As no sequence on the building block complements the scaffold codon sequence, this codon sequence loops out. Use of a loop building block allows for one-pot synthesis of a library.

(91) N Building Block/Polymerase Encoding:

(92) A nascent bifunctional complex comprises a chemical scaffold attached to a scaffold codon through a linker. On one or each side of the codon a binding region is present. An N building block comprises a binding region which is complementary to the scaffold binding region and an anti-codon. A functional entity is attached to the codon or a binding region. Under hybridisation conditions the complementary binding regions hybridise and a polymerase extends in both directions, thereby transferring the genetic information of the anti-codon to the oligonucleotide covalently connected to the scaffold. Before, after or simultaneously with the extension reaction, the reaction between the functional entity and the scaffold may take place. Usually, the functional entity is attached to the anti-codon oligonucleotide via a cleavable linker so as to allow for transfer of the functional entity to the scaffold structure.

(93) Free Reactant/Ligase:

(94) A scaffold entity is attached to an oligonucleotide comprising a codon. The scaffold oligonucleotide further comprises a priming site to which a codon oligonucleotide is ligated. The ligation is performed by a ligase. The ligation can take place in a single stranded or double stranded form. In the single stranded form, a 3 OH (or 5-phosphate) of the scaffold oligonucleotide is ligated to a 5-phosphate (or 3-OH) of the codon oligonucleotide. In the double stranded form, an oligonucleotide complementing the ends of the scaffold and codon oligonucleotides, respectively, is used and designed so that the ends abuts each other. Optionally, the ligation occurs between two double stranded oligonucleotides, i.e. a double stranded scaffold oligonucleotide with an over hang (sticky end) is ligated to a double stranded codon oligonucleotide provided with a complementing overhang. The type of ligation depends on the selected enzyme. Usually, the double stranded ligation is preferred because the reaction is faster due to the guiding effect of the oligonucleotide complementing the ends. The complementing oligonucleotide is also referred to herein as the splint oligonucleotide. Following, preceding, or simultaneously with the ligation of the codon oligonucleotide to the scaffold oligonucleotide a reaction between the free reactant and the scaffold takes place.

(95) Zipper Building Block/Ligase:

(96) A scaffold entity is attached to an oligonucleotide comprising a codon and binding region between the scaffold and the codon. The scaffold oligonucleotide further comprises a priming site to which a codon oligonucleotide is ligated. The ligation is performed by a ligase. The ligation can take place in a single stranded or double stranded form. In the single stranded form, a 3 OH (or 5-phosphate) of the scaffold oligonucleotide is ligated to a 5-phosphate (or 3-OH) of the codon oligonucleotide. In the double stranded form, an oligonucleotide complementing the ends of the scaffold and codon oligonucleotides, respectively, is used and designed so that the ends abuts each other. Optionally, the ligation occurs between two double stranded oligonucleotides, i.e. a double stranded scaffold oligonucleotide with an over hang (sticky end) is ligated to a double stranded codon oligonucleotide provided with a complementing overhang. The type of ligation depends on the selected enzyme. Usually, the double stranded ligation is preferred because the reaction is faster due to the guiding effect of the oligonucleotide complementing the ends. The complementing oligonucleotide is also referred to herein as the splint oligonucleotide. A zipper building block is a functional entity attached to a binding oligonucleotide. The binding oligonucleotide is complementing the binding region of the scaffold oligonucleotide, thus forming a hybridisation product under hybridisation conditions. Following, preceding, or simultaneously with the ligation of the codon oligonucleotide to the scaffold oligonucleotide a reaction between the functional entity and the scaffold takes place. The use of the binding region on the reactant ensures a close proximity between the functional entity and the scaffold.

(97) E2 Building Block/Ligational Encoding:

(98) Initially is provided a nascent bifunctional complex comprising a scaffold attached to an oligonucleotide, said oligonucleotide comprising a codon and a binding region between the scaffold codon and the scaffold codon. The scaffold oligonucleotide also comprises a priming site to which a codon oligonucleotide can be ligated. The scaffold oligonucleotide is hybridised to an E2 building block which carries a double stranded part. The oligonucleotide complementing the anticodon as ligated to the scaffold oligonucleotide using the E2 building block as a template. Before, after or simultaneously with the ligation a reaction takes place between the functional entity and the scaffold.

(99) Loop Building Block/Ligational Encoding:

(100) A bifunctional complex is provided comprising a scaffold attached to an oligonucleotide, wherein the scaffold oligonucleotide comprises a codon flanked by two binding regions. A loop building block is provided which has binding regions complementing the binding regions of the scaffold oligonucleotide. Upon hybridisation, the codon part of the scaffold oligonucleotide loops out. The loop building block also comprises a double stranded codon part. The oligonucleotide complementing the anti-codon part of the loop building block is ligated to the free binding region of the scaffold oligonucleotide. Before, after or simultaneously with the ligation a reaction takes place between the functional entity and the scaffold.

(101) N Building Block/Ligational Encoding:

(102) A nascent bifunctional complex is initially provided in which a scaffold via a suitable linker is attached the codon identifying said scaffold or attached to a binding region connect to the codon. A building block having a functional entity connected to a codon is the ligated to the scaffold oligonucleotide to connect the scaffold oligonucleotide with functional entity oligonucleotide. The ligation may be performed in a single stranded or in a double stranded state, depending on the particular enzyme selected for the ligation. Subsequently, the functional entity is reacted with the scaffold. In the alternative, the functional entity and the scaffold are reacted prior to ligation of the respective oligonucleotides.

(103) When a round, i.e. a reaction with and a tagging of the nascent bifunctional complex, has been completed in accordance with any of the above encoding methods, a new round maybe in initialized according to any of the above reaction/encoding methods. Thus, the encoding and reaction in a first round may be the same or different in a subsequent second or further round. A single bifunctional complex or a library of complexes may be generated. When a library is contemplated, one-pot-synthesis can be conducted with the building blocks in which a covalent link between the functional entity and the codon/anti-codon is used, i.e. the columns of E2 building block, loop building block, and N building block. Split and mix synthesis can be performed, when no covalent link between the functional entity/reactant and the codon/anti-codon is present, i.e. in the columns indicating the free reactant and the zipper building block.

(104) FIG. 16 shows a double encoding method, i.e. a method for encoding two or more reactants in one go. In certain embodiments, the multiple encoding methods allow for multi reaction between reactants and scaffold. Initially, a scaffold connected to an oligonucleotide comprising a hybridisation region, a scaffold codon and a binding region is annealed to an E2 building block. Subsequently, an extension is performed in which the anti-codon of the building block is transferred to the identifier. Several polymerases form an overhang of one or more single stranded nucleotides. This overhang is used in the present invention to attach an anti-codon oligo and allow the polymerase to further extent the identifier oligonucleotide over the anti-codon region of the anti-codon oligonucleotide. The transfer of the information of the anti-codon oligonucleotide allows for encoding a third free reactant C. The annealing between the oligonucleotide carrying A and the oligonucleotide carrying B provide for a close proximity between A and B and thus a high local concentration. Thus, when the free reactant C is added a reaction between the three components is favoured. One advantage of double encoding is that it is possible to exchange solvent, such that the reaction not necessarily must take place in the same solvent as the extension occurs.

(105) To the right is illustrated an example, in which the above method is applied on 100 different scaffold oligonucleotides and 100 building blocks. The hybridisation product between the scaffold oligonucleotides and the building block oligonucleotides is divided into 100 different wells. In each of the wells the extension, addition of anti-codon oligonucleotide and reaction with specific free reactant is allowed. In total 10.sup.6 different bifunctional molecules are generated.

(106) FIG. 17 discloses various methods for performing double encoding. In all the examples, the encoding is shown to occur prior to reaction, but it will be within the ambit of the skilled person to perform the reaction first and then the encoding. When a library is contemplated, it is possible to conduct the reaction in a single container (one-pot synthesis) using the N building blocks in combination with any of the encoding methods. For the remaining reactants it is necessary to conduct one or more split-and-mix step. In the combination of the zipper building block, E2 building block, and the loop building block with any of the encoding methods a single split-and-mix step is necessary, whereas two split-and-mix steps are necessary for the free reactant in combination with any encoding method. The scheme makes it possible for the skilled person to select a reaction/encoding method which is useful for a specific reaction. If triple-, quadro-, or multi encoding is contemplated, it is possible to perform such encoding using an embodiment of the double encoding scheme in combination with an embodiment of the single encoding scheme of FIG. 15 one or more times to arrive at an encoding/reaction method that suits the need for a specific chemical reaction.

(107) FIG. 21 discloses a triple encoding method. Initially, a scaffold attached to a scaffold oligonucleotide is provided. The scaffold is attached to a binding region the scaffold oligonucleotide, and the scaffold oligonucleotide is further provided with a codon. The two building blocks of the E2 type is annealed to the scaffold oligonucleotide, thereby bringing the functional entities BB1 and BB2 into close proximity with the scaffold. Simultaneously, prior or subsequent to the addition the building blocks a codon oligonucleotide coding for a third reactant (BB3) is provided which comprises a part complementing a nucleotide sequence of the first building block. The components of the system are allowed to hybridise to each other and a polymerase and a ligase is provided. The polymerase performs an extension where possible and the ligase couples the extended oligonucleotides together so as to form a double stranded product. Following the encoding process, the third reactant is added and conditions are provided which promote a reaction between the scaffold and the reactants. Finally, a selection is used to select reaction products that perform a certain function towards a target. The identifying oligonucleotides of the selected bifunctional complexes are amplified by PCR and identified.

(108) To the right a particular embodiment for carrying out the present invention is indicated. Accordingly, each codon is 5 nucleotides in length and the binding regions flanking the scaffold are 20 nucleotides each. The building blocks designed to hybridise to the binding regions of the scaffold comprises a 20 nucleotide complementing sequence as well as a 5 nucleotide codon.

(109) An embodiment of the enrichment method of the present invention is shown on FIG. 24. Initially, each chemical entity (denoted by letters A, B, C, . . . ) in a library is attached to a unique identifier tag (denoted a, b, c, . . . ). The identifier tag comprises information about that particular compound or group of compounds with respect to e.g. structure, mass, composition, spatial position, etc. In a second step, tagged chemical compounds are combined with a set of anti-tag sequences (denoted a, b, c, . . . ). Each anti-tag sequence carries a handle, like biotin, for purification purposes. The anti-tag sequences comprise a segment which is complementary to a sequence of the identifier sequence. The combination of anti-tag sequences and identifier sequences are allowed to form hybridisation products. Optionally, there may be tagged chemical entities present which have not been recognized by an anti-tag. In a third step, the sequences carrying a handle are removed, i.e. the tagged chemical compounds are left in the media while the matter comprising a handle is transferred to a second media. In the event, the handle is biotin it may be transferred to a second media using immobilized streptavidin.

(110) The purified matter may comprise anti-tag sequences not hybridised to a cognate sequence. As these anti-tag sequences are not coupled to a chemical compound to be selected for, the enrichment sequences may remain in the media. However, in some applications it may be preferably to make the excess anti-tag sequences double stranded, as illustrated in FIG. 25, because the double helix normally is inert relative to the selection procedure. The excess anti-tag sequences may be transformed into the double helix state by the use of a primer together with a suitable polymerase and nucleotide triphosphates.

(111) The purified fraction is in step 4 is subjected to a selection process. The selection comprises probing for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its purification handle.

(112) In step 5 isolated anti-tags are optionally amplified through the use of PCR or RTPCR.

(113) In step 6, the initial library of tagged entities produced in step 1, may undergo further rounds of complexation and screening, i.e. the anti-tags from step 5 may be added the library of tagged entities of step 1 and then be submitted to step 3, step 4 and step 5. Step 6 may be repeated.

(114) In step 7, the isolated anti-tags of step 5 may be cloned and their identity be revealed. E.g. in the case of DNA, sequencing may be applied whereby the identity of specific entities with selected properties in the library of tagged entities will be revealed.

(115) The embodiment shown in FIG. 26 resembles that of FIG. 24 except that the non-complexed components are rendered inert, e.g. if the tags and/or anti-tags are composed of single stranded DNA or RNA, they may be transformed into double stranded DNA, RNA or a hybrid thereof. This may be accomplished by use of a primer, nucleotide triphosphates and a polymerase or transcriptase. Furthermore, the sequence of purification (by use of the purification handle on anti-tags) and probing for properties is changed compared to the method of FIG. 24.

(116) In FIG. 27, step 1, a number of entities (denoted by letters A, B, C . . . ), being it mixtures or single compounds are attached to a unique tag more specifically a DNA or RNA sequence or a derivative thereof, holding information on that compound or mixture, such as e.g. structure, mass, composition, spatial information etc.

(117) In step 2, all tags of tagged entities are made double stranded by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of nucleases.

(118) The mixture, is probed for a set of properties in step 3, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @-handle.

(119) Isolated anti-tags may optionally be amplified in step 4 through the use of PCR or RTPCR.

(120) In step 5, the library of tagged entities of step 1, may undergo complexation to the isolated and optionally amplified anti-tags of step 3 and 4.

(121) Single stranded components are being digested in step 6 by use of e.g. nucleases. The remaining double stranded subset of the library is optionally subjected to a renewed enrichment of the library according to step 3-6. Steps 3-6 may be repeated as sufficient number of times to obtain an appropriate chemical entity having the desired property.

(122) In step 7, the isolated anti-tags of step 4 can be cloned and their identity be revealed, e.g. in the case of DNA, sequencing may be applied, whereby the identity of specific entities in the library of tagged entities is revealed.

(123) FIG. 28 relates to a method involving a digestion of single stranded oligonucleotides. In a first step a number of entities (denoted by letters A, B, C . . . ), being it mixtures or single compounds, are attached to a unique tag, holding information on that compound or mixture, such as e.g. structure, mass, composition, spatial information etc.

(124) In step 2, mixtures of tagged entities are combined with a set of complementary anti-tags. Anti-tags may be, but is not limited to nucleotide derivatives. Anti-tags may optionally carry a @-handle. The tag and the anti-tags are allowed to form a complex. The complexation may be, but is not limited to hybridization. Some anti-tags will not form a complex with a tagged entity and some tagged entities will not form a complex with an anti-tag.

(125) Non-complexed components is digested in step 3 using e.g. nucleases when the tags and/or anti-tags are composed of DNA or RNA or hybrids thereof.

(126) The mixture of step 3, is probed for a set of properties in step 4, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @handle. Step 4 may be repeated one or more times.

(127) Isolated anti-tags may optionally be amplified through the use of PCR or RTPCR as illustrated in step 5. Anti-tags may then also be used as described in FIGS. 24-27.

(128) The isolated anti-tags may be cloned and their identity be revealed in step 6, e.g. in the case of DNA, sequencing may be applied, whereby the identity of specific entities in the library of tagged entities will be revealed.

(129) According to FIG. 29, step 1, a number of entities (denoted by letters A, B, C . . . ), being it mixtures or single compounds, are attached to a unique tag more specifically a DNA or RNA sequence or a derivative thereof, holding information on that compound or mixture, such as e.g. structure, mass, composition, spatial information etc.

(130) All tags of tagged entities are made double stranded in step 2 by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of e.g. nucleases.

(131) In step 3, the mixture is probed for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Anti-tags complexed to tags having appended entities binding to the specific protein may be recovered/be isolated through e.g. the use of its @-handle. Step 3 may be repeated one or more times.

(132) According to step 4, isolated anti-tags may optionally be amplified through the use of PCR or RTPCR. Anti-tags may then also be used as described in FIGS. 24-27.

(133) The isolated anti-tags may be cloned in step 5 and their identity be revealed, e.g. in the case of DNA, sequencing may be applied. Whereby, the identity of specific entities in the library of tagged entities will be revealed.

(134) FIG. 30, step 1, produces a number of entities (denoted by letters A, B, C . . . ), being it mixtures or single compounds which are attached to a unique tag more specifically a DNA or RNA sequence or a derivative thereof, holding information on that compound or mixture, such as e.g. structure, mass, composition, spatial information etc.

(135) In step 2, the mixture is probed for a set of properties, e.g. but not limited to affinity for a specific protein. In such a case, entities which do not bind to the specific protein will be eliminated. Step 2 may be repeated.

(136) All tags of tagged entities are made double stranded in step 3 by use of a primer (optionally carrying a @-handle such as e.g. biotin), nucleotide triphosphates and a polymerase or transcriptase. Remaining single stranded DNA or RNA may optionally be digested by use of e.g. nucleases.

(137) Anti-tags complexed to tags of entities binding to the specific protein may be recovered/be isolated in step 4 through e.g. the use of its @-handle. Anti-tags may optionally be amplified through the use of PCR or RTPCR. Anti-tags may then also be used as described in FIGS. 24-27.

(138) The isolated anti-tags may be cloned in step 5 and their identity be revealed, e.g. in the case of DNA, sequencing may be applied, whereby, the identity of specific entities in the library of tagged entities is revealed.

(139) FIG. 31A shows the result of polyacrylamide gel electrophoresis of the sample created in example 13, the result of annealing identifier oligo E57 with zipper building block E32 and anti-codon oligonucleotide CD-M-8-01720001 (with anti-codon sequence Anti-Codon 1) (lane 2) and of annealing the same identifier oligo with E32 and anti-codon oligo E60 (with anti-codon sequence Anti-codon X) (lane 3).

(140) FIG. 31B shows the result of polyacrylamide gel electrophoresis of the sample in which E58 is annealed to zipper building block CX-1 and anti-codon oligo CD-M-8-0172-0001, and E58 to E32 and E60. This time a reactant on the zipper building block was cross linked to the display molecule in the identifier oligonucleotide.

(141) FIGS. 32 and 33 are more fully discussed in Example 14.

(142) FIGS. 34-47 are more fully discussed in examples 1-5, 7-9.

EXAMPLES

Example 1: Loading of a Scaffold Onto Identifier Molecules

(143) An amino-modifier C6 5-labeled identifier oligo (5-X-TCGTAACGACTGAATGACGT-3, (SEQ ID NO: 5) wherein X may be obtained from Glen research, cat. #10-1039-90) was loaded with a peptide scaffold (Cys-Phe-Phe-Lys-Lys-Lys, CFFKKK, SEQ ID NO: 6) using SPDP activation (see below). The SPDP-activation of amino-oligo was performed using 160 l of 10 nmol oligo in 100 mM Hepes-KOH, pH=7.5, and 40 l 20 mM SPDP and incubation for 2 h at 30 C. The activated amino-oligo was extracted 3 times with 500 l EtOAc, dried for 10 min in a speed-vac and purified using micro bio-spin column equilibrated with 100 mM Hepes-KOH. The loading of scaffold was then performed by adding 10 l of 100 mM attachment entity and incubating overnight at 30 C.

(144) The loaded identifier oligo was precipitated with 2 M NH.sub.4OAc and 2 volume 96% ethanol for 15 min at 80 C. and then centrifuged for 15 min at 4 C. and 15.000 g. The pellet was re-suspended in water and the precipitation was repeated. Wash of the oligo-pellet was done by adding 100 l of 70% ethanol and then briefly centrifuged. The oligo was re-dissolved in 50 l H.sub.2O and analysed by MS. The MS analysis was performed after 100 pmol oligo in 10 l water was treated with 10 l of ion exchanger resin and incubated minimum 2 h at 25 C. on a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed using a mass spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass, as can be seen in FIG. 34, was 7244.93 Da, which correspond well with the calculated mass, 7244.00 Da. This experimental data exemplify the possibility to load scaffolds onto identifier oligonucleotides. This loaded identifier molecule can be used to receive functional entities from building blocks. This particular scaffold harbours three identical reactive groups, i.e. the amine group of the lycin side chain, and can therefore be transferred with one, two, or three functional entities, which is capable of reacting with the amine groups.

(145) ##STR00040##

(146) In the above figure, the DNA sequence is SEQ ID NO:5 and the peptide sequence (which, in the figure, is in reverse order, C-terminal to N-terminal is SEQ ID NO:6.

Example 2: Loading of Functional Entities Onto Building Blocks

(147) Loading of functional entities onto building block molecules can be done using a thiol-oligo (see below). An Biotin 5 labeled and thio-modifier C6 SS (obtainable from Glen Research, cat #10-1936-90) 3-labeled building block oligo (5-BTGCAGACGTCATTCAGTCGTTACGA-3 SEQ ID NO: 7) was converted to an NHS-oligo using NHM.

(148) 10 nmol oligo was dried in speed-vac, re-dissolved in 50 l 100 mM DTT, 100 mM sodium-phosphate pH 8.0 and incubated at 37 C. for 1 hour. The thiol-oligo was then purified using micro bio-spin column equilibrated with 100 mM Hepes-KOH, pH 7.5. The thiol-oligo was converted to NHS-oligo by adding 100 mM NHM in 100 mM Hepes-KOH pH. 7.5. The sample was incubated at 25 C. over night. The NHS-oligo was then purified using bio-spin column equilibrated with MS-grade H.sub.2O.

(149) ##STR00041##

(150) In the above figure, the DNA sequence is SEQ ID NO:7.

(151) The MS analysis was performed after 100 pmol oligo in 10 l water was treated with 10 l of ion exchanger resin and incubated minimum 2 h at 25 C. on a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed using a mass spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass as can be seen in FIG. 35 was 8369.32, which correspond well with the calculated mass, 8372.1. The experimental data exemplify the possibility to convert the attachment entity on building block oligonucleotides. This product can later be used to attach transferable functional entities.

(152) The NHS-oligo was then used to load functional entities. EDC activation of the functional entity (4-pentynoic acid) was performed mixing 50 l of 200 mM functional entity in DMF with 50 l of 200 mM EDC in DMF and incubated for 30 min at 25 C. on a shaker. The loading was then performed using 1 nmol NHS-oligo lyophilized in a speed-vac and 10 l of the activated building block (see below). This was incubated at 25 C. for 5 min and then mixed with 30 l 100 mM MES pH. 6.0. The loaded NHS-oligo was purified using bio-spin column equilibrated with 100 mM MES pH 6.0. The loaded building block oligo is then used immediately for the transfer reaction without any MS analysis. This is due to the unstable structure of the functional entity during the conditions used for the MS measurements.

(153) ##STR00042##

(154) In the above Figure, the DNA sequence is SEQ ID NO:7.

(155) This experiment exemplifies a complete loading of a functional entity onto a building block molecule ready for transfer to an recipient reactive group when annealed to the complementary identifier molecule.

(156) Another example of a functional entity that can be loaded as described above onto a building block is a 5-hexynoic acid as shown below. Again, no MS analysis was performed on this compound due to the unstable structure of the functional entity in the conditions used in the MS measurements.

(157) ##STR00043##

(158) In the above figure, the DNA sequence is SEQ ID NO:7

Example 3: Transfer of Functional Entities from the Building Block to the Identifier Molecule

(159) The attachment entity (AE) in the following experiments are either a scaffold, e.g. the peptide, CFFKKK (SEQ ID NO: 134), loaded on an identifier as prepared in Example 1 or a recipient reactive group exemplified by an amino modified oligonucleotide used as starting material in Example 1. These attachment entities allow transfer of three or one functional entities, respectively.

(160) The identifier used in this experiment is an identifier oligonucleotide loaded with CFFKKK as described in Example 1. The functional entity (FE) in this experiment is the 4-Pentynoic acid, the loading of which was described in Example 2. The identifier molecule loaded with the scaffold is annealed to the loaded building block molecule to bring the attachment entity and the functional entity in close proximity. The annealing is directed by the identifier region in the identifier molecule and the complementary sequence in the building block molecule.

(161) ##STR00044##

(162) After the annealing step between the identifier and building block molecules, the transfer reaction takes place where the functional entity is transferred to the identifier molecule.

(163) The annealing was performed using 600 pmol of the building block and 400 pmol identifier molecules in 0.1 M MES buffer at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transferred to the one of the amino group on the attachment entity on the identifier molecule during the annealing (see below). After annealing the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 in a shaker. After incubation the resin was centrifuged down and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile. The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass (see FIG. 36) was 7323.45 Da, which correspond well with the calculated mass, 7324.00 Da. Thus, the MS spectrum of the identifier molecule after the transfer reaction shows a mass corresponding to the transferred functional entity on the identifier molecule.

(164) ##STR00045##

(165) In the above figure, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.

(166) Another example of transfer of functional entity is shown below using the amino oligo directly as the AE on the identifier molecule. The functional entity on the building block molecule used in this experiment was 4-pentynoic acid, as disclosed in example 2.

(167) The annealing was performed using 500 pmol of the building block and the identifier molecules in 0.1 M MES buffer and incubating the mixture at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transfer to the amino group on the identifier molecule during the annealing (see below). After annealing and transfer the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 in a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile.

(168) ##STR00046##

(169) In the above figure, in both the starting materials and the products, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.

(170) The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass was 6398.04 Da, which correspond well with the calculated mass, 6400.00 Da. Thus, the MS spectra of the identifier molecule after transfer of the functional entity show a mass corresponding to the transferred functional entity on the identifier molecule. This example shows that functional entities can be transferred using this setup of a building block molecule and an identifier molecule.

(171) Another example of transfer of functional entity is shown below using the amino oligo directly as the identifier molecule. The functional entity used in this experiment was 5-Hexpoic acid, prepared as shown in example 2.

(172) The annealing was performed using 500 pmol of the building block and 500 pmol of the identifier molecules in 0.1 M MES buffer incubated at 25 C. in a shaker for 2 hours. The reactive part (functional entity) of the building block was transferred to the amino group on the identifier molecule (see below). After annealing and transfer the sample was purified by micro-spin gel filtration and analyzed by MS. The sample was prepared for MS analysis using equal amount of sample (about 100 pmol) and ion exchanger resin and incubated minimum 2 h at 25 C. in a shaker. After incubation the resin was removed by centrifugion and 15 l of the supernatant was added 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 ul acetonitrile.

(173) ##STR00047##

(174) In the above figure, in both the starting materials and the products, the first sequence is SEQ ID NO:5 and the second is SEQ ID NO:7.

(175) The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus). The observed mass was 6411.96 Da, which correspond well with the calculated mass, 6414 Da. Thus, the MS spectra of the identifier molecule after transfer of the functional entity show a mass corresponding to the transferred functional entity onto the identifier molecule. This example shows that functional entities can be transferred using this setup of a building block molecule and an identifier molecule.

Example 4: Extension of the Identifier Molecule to Transfer Unique Codons

(176) After the transfer of the functional entity (FE) to the attachment entity (AE) on the identifier molecule, the identifier molecule is extended in order to transfer the unique codon, that identifies the transferred functional entity, to the identifier molecule. This is accomplished by adding a suitable polymerase and a polymerase buffer containing the wild type nucleotides (dATP, dTTP, dCTP, dGTP). This will extend the identifier molecule in the 3-end towards the end of the 5-end of the building block molecule. The extension of the identifier molecule to transfer the unique anticodon(s) is preferably performed after the transfer of the FE as shown below.

(177) ##STR00048##

(178) The extension was performed using 15 units Taq polymerase in a buffer containing 0.4 mM of each nucleotide in an extension buffer (20 mM HEPES-KOH, 40 mM KCl, 8 mM MgCl.sub.2, pH=7.4). After the extension reaction the sample was analyzed using MS. The MS analysis was performed using about 100 pmol purified extension mixture in a half volume of ion exchanger resin and incubated minimum 2 h at 25 C. in a shaker. After incubation the resin was removed by centrifugation and 15 l of the supernatant was mixed with 7 l of water, 2 l of piperidine and imidazole (each 625 mM) and 24 l acetonitrile. The sample was analysed on a Mass Spectroscopy instrument (Bruker Daltonics, Esquire 3000plus).

(179) The MS data for extension on the identifier molecule with a transferred 4-Pentynoic acid is shown FIG. 39.

(180) The observed mass was 7922.53 Da, which correspond well with the calculated mass, 7924.00 Da. The MS spectra of the identifier molecule after the transfer reaction of the functional entity and extension reaction of the encoding region (the unique codon) showed a mass corresponding to the transferred functional entity and the extension on the identifier molecule. This example shows that functional entities can be transferred using this setup with a longer building block molecule than the identifier molecule and that the identifier molecule can be extended using a polymerase after the transfer process. This shows the possibility to transfer both the functional entity and the unique codon from the same building block to an identifier molecule.

(181) Another example showing transfer and extension is for the building block with the functional entity 5-Hexynoic acid. The MS data for extension on the identifier molecule with a transferred 5-Hexynoic acid is shown in FIG. 40.

(182) The observed mass was 7936.99 Da, which correspond well with the calculated mass, 7938.00 Da. The MS spectra of the identifier molecule after transfer reaction of the functional entity and extension reaction of the encoding region (the unique codon) showed a mass corresponding to the transferred functional entity and the extension on the identifier molecule. This example also shows that functional entities can be transferred using this setup with a longer building block molecule than the identifier molecule and the identifier molecule can be extended using a polymerase after the transfer process. This exemplifies the possibility to transfer both the functional entity and the unique codon from one building block molecule to one identifier molecule.

Example 5: Library Design

(183) The identifier molecule can be designed to operate optimal under various conditions. However, it should contain a few elements that are vital for the function. The identifier molecule should comprise of a sequence that can anneal to the building block and an attachment entity that can accommodate various functional entities. Below is an example on how an identifier molecule can be designed in the extension region. The region that becomes extended during each step of transfer and encoding can be designed using various approaches. Importantly, there must be a base-pair match between the building block and the identifier to allow efficient extension using a polymerase. This can be accomplished using either a region that is constant, the binding region as described in FIG. 3 (A), or a region that allow binding to any given sequence, also shown in FIG. 3 (B). A combination of these to approaches can also be used.

(184) The first step in the extension process needs no special binding region due to the match of the identifier and the building block molecules (step 1 shown below). However, the subsequently steps needs a binding region sufficient complementary to the identifier molecule to allow for hybridisation because the enzyme, preferably a polymerase must be able to bind to the douplex and perform an extension. The example below shows four steps in the encoding procedure. This process of extension can be continued to obtain the suitable number of transfer of building blocks. The binding region in this example contains 6 nucleotides, but this can be varied dependent on the design of the building blocks.

(185) A possibility to accommodate the possible mismatches in the previous anticodon is to use universal nucleobases, i.e. a nucleobases with the ability to base pair with more than one of the natural nucleobases. A possible base is inosine which can form base pairs with cytidine, thymidine, and adenosine (although the inosine:adenosine pairing presumably does not fit quite correctly in double stranded DNA, so there may be an energetic penalty to pay when the helix bulges out at this purine:purine pairing). In principle, any design that allows extension of the unique codons is possible to use.

Method for the synthesis of a bifunctional complex

Assignee

Inventors

Cpc classification

Classification Explorer

C12N15/1065

CHEMISTRY; METALLURGY

Classification Explorer

C12Q2521/101

CHEMISTRY; METALLURGY

Classification Explorer

C40B50/16

CHEMISTRY; METALLURGY

Classification Explorer

C40B50/10

CHEMISTRY; METALLURGY

Classification Explorer

C12N15/1068

CHEMISTRY; METALLURGY

Classification Explorer

C12Q2521/501

CHEMISTRY; METALLURGY

Classification Explorer

C12Q1/6806

CHEMISTRY; METALLURGY

International classification

Classification Explorer

C12N15/10

CHEMISTRY; METALLURGY

Classification Explorer

C40B50/10

CHEMISTRY; METALLURGY

Classification Explorer

C12Q1/68

CHEMISTRY; METALLURGY

Classification Explorer

C40B50/16

CHEMISTRY; METALLURGY

Abstract

Claims

Description