G06F19/22

A BIOMOLECULE BASED DATA STORAGE SYSTEM
20170249345 · 2017-08-31 ·

The present invention describes a biomolecule based storage system for converting, storing the data in DNA coded form and retrieving data using pointer file approach. User input data is converted into 4base DNA sequence, called Nibble, which is further mapped onto the DNA sequence of an organism. The first position of each converted nibble is then obtained and stored in a pointer file. By mapping the positions of pointer file onto the DNA sequence of the organism, the data can be retrieved.

GENOME METHYLATION ANALYSIS
20170247748 · 2017-08-31 ·

Provided herein are methods for identifying sites and regions within a gene or genome that are amenable to analysis of methylation. The methods disclosed herein allow the efficient identification on a genome-wide scale of target restriction sites and fragments that provide targets for subsequent analysis.

Methods for accurate sequence data and modified base position determination

Disclosed herein are methods of determining the sequence and/or positions of modified bases in a nucleic acid sample present in a circular molecule with a nucleic acid insert of known sequence comprising obtaining sequence data of at least two insert-sample units. In some embodiments, the methods comprise obtaining sequence data using circular pair-locked molecules. In some embodiments, the methods comprise calculating scores of sequences of the nucleic acid inserts by comparing the sequences to the known sequence of the nucleic acid insert, and accepting or rejecting repeats of the sequence of the nucleic acid sample according to the scores of one or both of the sequences of the inserts immediately upstream or downstream of the repeats of the sequence of the nucleic acid sample.

Adaptive processing for sequence alignment

Disclosed are various embodiments for adaptive processing for sequence alignment. In one embodiment, among others, a method includes obtaining a query sequence and a plurality of database sequences. A first portion of the plurality of database sequences is distributed to a central processing unit (CPU) and a second portion of the plurality of database sequences is distributed to a graphical processing unit (GPU) based upon a predetermined splitting ratio associated with the plurality of database sequences, where the database sequences of the first portion are shorter than the database sequences of the second portion. A first alignment score for the query sequence is determined with the CPU based upon the first portion of the plurality of database sequences and a second alignment score for the query sequence is determined with the GPU based upon the second portion of the plurality of database sequences.

Methods and systems for genetic analysis

This disclosure provides systems and methods for sample processing and data analysis. Sample processing may include nucleic acid sample processing and subsequent sequencing. Some or all of a nucleic acid sample may be sequenced to provide sequence information, which may be stored or otherwise maintained in an electronic storage location. The sequence information may be analyzed with the aid of a computer processor, and the analyzed sequence information may be stored in an electronic storage location that may include a pool or collection of sequence information and analyzed sequence information generated from the nucleic acid sample. Methods and systems of the present disclosure can be used, for example, for the analysis of a nucleic acid sample, for producing one or more libraries, and for producing biomedical reports. Methods and systems of the disclosure can aid in the diagnosis, monitoring, treatment, and prevention of one or more diseases and conditions.

Discovering Population Structure from Patterns of Identity-By-Descent
20170242957 · 2017-08-24 ·

Described are techniques for determining population structure from identity-by-descent (IBD) of individuals. The techniques may be used to predict that an individual belongs to zero, one or more of a number of communities identified within an IBD network. Additional data may be used to annotate the communities with birth location, surname, and ethnicity information. In turn, these data may be used to provide to an individual a prediction of membership to zero, one or more communities, accompanied by a summary of the information annotated to those communities.

METHODS AND APPLICATIONS OF GENE FUSION DETECTION IN CELL-FREE DNA ANALYSIS
20170240972 · 2017-08-24 ·

Systems and methods are disclosed for determining gene fusion by determining a fused read containing sequencing data of a portion of a fused chromosome DNA molecule; determining a predetermined point on the genome with least one mapped portion of the fused read clipped at the predetermined point (a breakpoint); identifying two mapped read portions from two breakpoints (breakpoint pair) as a potential fusion candidate; creating one or more fusion sets based on breakpoint pairs and clustering the fusion sets into one or more fusion clusters; and identifying each fusion cluster meeting a predetermined criterion as a gene fusion.

METHODS TO DETERMINE TUMOR GENE COPY NUMBER BY ANALYSIS OF CELL-FREE DNA
20170240973 · 2017-08-24 ·

Methods are provided herein to improve automatic detection of copy number variation in nucleic acid samples. These methods provide improved approaches for determining baseline copy number of genetic loci within a sample, reduce variation due to features of genetic loci, sample preparation, and probe exhaustion.

SYSTEMS AND METHODS FOR TRACKING AND IDENTIFYING INFECTION TRANSMISSION

The present disclosure describes systems and methods for determining sources of infection transmission. Phylogenetic methods are used for determining the evolutionary history and replication rates of infection isolates. The evolutionary distance and/or replication rate of an infection isolate maybe compared to other isolates. Based on a comparison of the evolutionary distance and/or replication rate, a determination of the source of infection transmission is made.

ANALYSIS OF A POLYMER

Analysis Of A Polymer A biochemical analysis system analyses polymers by taking measurements of a polymer from a sensor element comprising a nanopore during translocation of the polymer through the nanopore. When a polymer has partially translocated, the series of measurements is analysed using reference data derived from a reference sequence to provide a measure of similarity. Responsive to the measure of similarity, the sensor element may be selectively operated to eject the polymer and thereby make the nanopore available to receive a further polymer. Where the biochemical analysis system comprises an array of sensor elements and is takes measurements from sensor elements selected in a multiplexed manner, responsive to the measure of similarity, the biochemical analysis system ceases taking measurements from the currently selected sensor element and to starts taking measurements from a newly selected sensor element.