G06F19/14

Discovering population structure from patterns of identity-by-descent

Described are techniques for determining population structure from identity-by-descent (IBD) of individuals. The techniques may be used to predict that an individual belongs to zero, one or more of a number of communities identified within an IBD network. Additional data may be used to annotate the communities with birth location, surname, and ethnicity information. In turn, these data may be used to provide to an individual a prediction of membership to zero, one or more communities, accompanied by a summary of the information annotated to those communities.

Ultra-low coverage genome sequencing and uses thereof

Methods are provided for analyzing one or more genetic samples, comprising procuring one or more genetic samples comprising genetic material from one or more individuals and sequencing the genetic material using non-targeted, ultra-low coverage sequencing to obtain genetic information for individual associated with the one or more genetic samples. Personal and genetic information associated with the individuals is stored in a database for retrieval and manipulation.

SYSTEMS AND METHODS FOR NUCLEIC ACID-BASED IDENTIFICATION
20180018422 · 2018-01-18 ·

Systems and methods for calculating a predictive index of identity of a nucleic acid sample using polymorphic genetic marker data are provided. In one embodiment, a predictive index of identity of the nucleic acid sample is calculated using a value from a second set of data from a polymorphic genetic marker that is not linked to a polymorphic genetic marker used to produce a first set of data. In another embodiment, the predictive index of identity is calculated using a value from a second set of data from a polymorphic genetic marker that is linked to a polymorphic genetic marker used to produce the first set of data. Systems and methods for generating an identifier for a biological sample and for verifying a relationship between a biological sample and an identifier are also provided. The identifier is an encoding of a set of values for polymorphic genetic markers.

MICROBIOME BASED SYSTEMS, APPARATUS AND METHODS FOR THE EXPLORATION AND PRODUCTION OF HYDROCARBONS

There are provided methods, systems and processes for the utilization of microbial and related genetic information for use in the exploration, determination, production and recovery of natural resources, including energy sources, and the monitoring, control and analysis of processes and activities.

ANCESTRY FINDER

Inferring a characteristic of an individual is disclosed. An indication that a first user and a second user have at least one shared chromosomal segment is received. Information about the second user is obtained. A characteristic of the first user is inferred based at least in part on the information about the second user.

GENETIC ANALYSIS SYSTEMS AND METHODS
20170199959 · 2017-07-13 ·

Genomes of different species may be embodied as a graph in which conserved parts of multiple genomes are stored at a fixed location in memory and accessed via spatial addressing. The graph branches into plural paths, each defined by pointers to other fixed locations in the memory, where the genomes diverge due to either divergent homology or non-homologous portions. The graph can represent whole genomic information for multiple species with the natural relationships among parts of the genomes being represented by the structure of the graph. Newly obtained sequences such as output from NGS instruments can be mapped onto the graph for assembly or identification.

MICROBIOME BASED SYSTEMS, APPARATUS AND METHODS FOR THE EXPLORATION AND PRODUCTION OF HYDROCARBONS

There are provided methods, systems and processes for the utilization of microbial and related genetic information for use in the exploration, determination, production and recovery of natural resources, including energy sources, and the monitoring, control and analysis of processes and activities.

Ancestral-specific reference genomes and uses in determining prognosis
09639658 · 2017-05-02 · ·

Ancestry has a significant impact on the major and minor alleles found in each nucleotide position within the genome. Due to mechanisms of inheritance, ancestral-specific information contained within the genome is conserved within members of an ancestry. For this reason, individuals within a specific ancestry are more likely to share alleles in their genomes with other members of the same ancestry. Functionally, the combination of alleles at all positions within a group of individuals defines that group as having a common ancestry. Moreover, the aggregation of differences between alleles at all positions distinguishes one ancestry from another. The genomic similarities and differences between ancestries provides a mechanism to generate reference genomes that are specific for each ancestry. Reference genomes that are specific to an ancestry can be used to increase the accuracy of whole genome sequencing, DNA-based diagnostics and therapeutic marker discovery and in a variety of real-world DNA-based applications. Provided herein is a method for determining a prognosis for a genetic disease or disorder comprising the step of comparing a DNA sequence of or derived from the whole genome of a patient with any one or combination of two or more ancestral-specific reference genomes of an ancestral-specific reference genome database described herein to determine the level of severity of the genetic disease or disorder.

Ancestral-specific reference genomes and uses in identifying a candidate for a clinical trial
09639659 · 2017-05-02 · ·

Ancestry has a significant impact on the major and minor alleles found in each nucleotide position within the genome. Due to mechanisms of inheritance, ancestral-specific information contained within the genome is conserved within members of an ancestry. For this reason, individuals within a specific ancestry are more likely to share alleles in their genomes with other members of the same ancestry. Functionally, the combination of alleles at all positions within a group of individuals defines that group as having a common ancestry. Moreover, the aggregation of differences between alleles at all positions distinguishes one ancestry from another. The genomic similarities and differences between ancestries provides a mechanism to generate reference genomes that are specific for each ancestry. Reference genomes that are specific to an ancestry can be used to increase the accuracy of whole genome sequencing, DNA-based diagnostics and therapeutic marker discovery and in a variety of real-world DNA-based applications. Provided herein is a method for identifying a candidate individual for participation in a clinical trial, comprising the step of comparing a DNA sequence of the whole genome of a patient with any one or more of the ancestral-specific reference genomes of an ancestral-specific reference genome database described herein, wherein the presence or absence of a clinically relevant genetic marker indicates that the individual is candidate for participation in the clinical trial.

Algorithm for constructing hypothetical evolutionary trees using common mutations similarity matrices
20170076034 · 2017-03-16 ·

The present invention permits constructing hypothetical evolutionary trees for a set of genetically related DNA strings or a set of proteins within a protein family. The main novelty of the invention compared to other hypothetical evolutionary tree construction methods is the use of a common mutations similarity matrix.