G11C13/0019

DNA CANVAS FOR INFORMATION STORAGE AND NANOFABRICATION
20210350879 · 2021-11-11 ·

A DNA canvas comprising a plurality of uniquely-coded polymer strands immobilized on a substrate can be used to provide a reference map comprising a set of reference association polymers having a dual-barcode generated by nondestructively associating spatially-adjacent polymers on the DNA canvas, encoding digital information on the DNA canvas to provide a patterned DNA canvas by disabling a pattern of selected plurality of polymers strands to provide a set of data association polymers having a single bar code that corresponds to a single bit in the bitmap. The digital information capable of being retrieved by sequencing the set of reference and data association polymers, computationally recovering spatial locations of each of the selected polymer strands that were disabled and recovering the bitmap encoded in the pattern of disabled polymer strands by comparison of the set of reference association polymer sequences to the set of data association polymer sequences.

FLOW CELL WITH ONE OR MORE BARRIER FEATURES

An apparatus includes a flow cell body, a plurality of electrodes, an imaging assembly, and one or more barrier features. The flow cell body defines one or more flow channels and a plurality of wells defined as recesses in the floor of each flow channel. Each well is fluidically coupled with the corresponding flow channel. The flow cell body further defines interstitial surfaces between adjacent wells. Each well defines a corresponding depth. Each electrode is positioned in a corresponding well of the plurality of wells. The electrodes are to effect writing of polynucleotides in the wells. The imaging assembly is to capture images of polynucleotides written in the wells. The one or more barrier features are positioned in the wells, between the wells, or above the wells. The one or more barrier features contain reactions in each well, reduce diffusion between the wells, or reduce optical cross-talk between the wells.

FLOW CELL WITH SELECTIVE DEPOSITION OR ACTIVATION OF NUCLEOTIDES

An apparatus includes a flow cell body, a plurality of electrodes, an integrated circuit, and an imaging assembly. The flow cell body defines one or more flow channels and a plurality of wells. Each flow channel is configured to receive a flow of fluid. Each well is fluidically coupled with the corresponding flow channel. Each well is configured to contain at least one polynucleotide. Each electrode is positioned in a corresponding well of the plurality of wells. The electrodes are operable to effect writing of polynucleotides in the corresponding wells. The integrated circuit is operable to drive selective deposition or activation of selected nucleotides to attach to polynucleotides in the wells to thereby generate polynucleotides representing machine-written data in the wells. The imaging assembly is operable to capture images indicative of one or more nucleotides in a polynucleotide.

Enzymatic DNA repair

One or more enzymes are used to repair damage in synthetic DNA molecules that encode digital information. The enzymes are included in a repair mixture containing one or more of DNA polymerase, DNA ligase, T4 Endonuclease, Endonuclease IV, Endonuclease VIII, and uracil glycosylase. The repair mixture may also contain one or more of a buffering solution, oxidized nicotinamide adenine dinucleotide (NAD.sup.+), and deoxyribose nucleoside triphosphates (dNTPs). The synthetic DNA molecules are incubated with the repair mixture for approximately four hours. Use of the repair solution allows recovery of the digital information from damaged DNA molecules.

Sequencing Data-Encoded Peptides from Tandem Mass Spectra

Peptide sequencing is important in decoding data stored in a data-encoded peptide. Tandem mass spectrometry (MS/MS) is particularly useful for peptide sequencing. In a computer-implemented method for sequencing the data-encoded peptide from an experimental spectrum, raw data of the experimental spectrum are first preprocessed to remove uninterpretable peaks to yield preprocessed data. A first set of one or more candidate sequences contending for a peptide sequence of the peptide is identified from a spectrum graph. The spectrum graph is formed according to the preprocessed data rather than the raw data for generating a fewer number of candidate sequences to thereby reduce a time cost in sequencing. The first candidate-sequence set is then processed to estimate the peptide sequence to thereby obtain a set of peptide-sequence estimate(s). Each estimate is verified whether it is invalid. The set of peptide-sequence estimate(s) is purged to remove any invalid estimate.

Integrated system for nucleic acid-based storage and retrieval of digital data using keys

In some embodiments, systems and methods for storing and/or retrieving digital information in a nucleic acid library are provided. In some embodiments, an integrated system comprising a nucleic acid synthesis device, a nucleic acid sequencing device, a computing device, and a nucleic acid library is provided. In some embodiments, a write request that associates a value with a key is received by the system, the system synthesizes nucleic acid molecules associated with the request, and stores the nucleic acid molecules in the nucleic acid library. In some embodiments, a read request for a key is received by the system, and the system sequences nucleic acid molecules from the nucleic acid library that are associated with the key.

SYSTEMS AND METHODS FOR WRITING BY SEQUENCING OF NUCLEIC ACIDS
20230317164 · 2023-10-05 ·

The systems, devices, and methods described herein provide nucleic acid digital data storage encoding and retrieving methods that are less costly and easier to commercially implement than existing methods. The systems, devices, and methods described herein provide scalable methods for writing data to and reading data from nucleic acid molecules. The present disclosure covers five primary areas of interest: (1) writing digital information into nucleic acid molecules, (2) accurately and quickly reading information stored in nucleic acid molecules, (3) partitioning data to efficiently encode data in nucleic acid molecules, (4) error protection and correction when encoding data in nucleic acid molecules, and (5) data structures to provide efficient access to information stored in nucleic acid molecules.

NUCLEIC ACID BASED DATA STORAGE USING ENZYMATIC BIOENCRYPTION
20230153452 · 2023-05-18 ·

Provided herein are compositions, devices, systems and methods for the generation and use of secured biomolecule-based information for storage. Further described herein are compositions, devices, systems and methods for bioencryption or biodecryption of information. Conversion of a digital sequence to a nucleic based sequence includes a step of selection of one or more bioencryption methods.

OLIGONUCLEOTIDE ASSEMBLY USING HAIRPINS AND INVADING STRANDS
20230257788 · 2023-08-17 ·

Sequential assembly of oligonucleotide hairpins is used to create oligonucleotides that encode a specific sequence of arbitrary information. Each oligonucleotide hairpin includes a payload region in the loop region of the hairpin. The payload region encodes arbitrary information such as a binary digit. Overhang regions on the oligonucleotide hairpins hybridize to anchor strands attached to a substrate. The hybridized oligonucleotide hairpins are covalently attached to the anchor strands by ligase. Invading strands are used to open the hairpin structures by also hybridizing to the anchor strand and displacing the double-stranded stem region of the hairpin. This process is repeated with another oligonucleotide hairpin that hybridizes to the end of the previously added oligonucleotide hairpin. A microelectrode array may be used to control the location of hybridization and create multiple different oligonucleotides in parallel. Fully assembled oligonucleotides may be separated from the substrate and stored or otherwise processed.

CHEMICAL METHODS FOR NUCLEIC ACID-BASED DATA STORAGE

The present disclosure discloses methods and systems for encoding digital information in nucleic acid (e.g., deoxyribonucleic acid) molecules without base-by-base synthesis, by encoding bit-value information in the presence or absence of unique nucleic acid sequences within a pool, comprising specifying each bit location in a bit-stream with a unique nucleic sequence and specifying the bit value at that location by the presence or absence of the corresponding unique nucleic acid sequence in the pool. Also disclosed are chemical methods for generating unique nucleic acid sequences using combinatorial genomic strategies (e.g., assembly of multiple nucleic acid sequences or enzymatic-based editing of nucleic acid sequences).