Patent classifications
G16B20/10
Methods and systems for genetic analysis
This disclosure provides systems and methods for sample processing and data analysis. Sample processing may include nucleic acid sample processing and subsequent sequencing. Some or all of a nucleic acid sample may be sequenced to provide sequence information, which may be stored or otherwise maintained in an electronic storage location. The sequence information may be analyzed with the aid of a computer processor, and the analyzed sequence information may be stored in an electronic storage location that may include a pool or collection of sequence information and analyzed sequence information generated from the nucleic acid sample. Methods and systems of the present disclosure can be used, for example, for the analysis of a nucleic acid sample, for producing one or more libraries, and for producing biomedical reports. Methods and systems of the disclosure can aid in the diagnosis, monitoring, treatment, and prevention of one or more diseases and conditions.
Methods and systems for genetic analysis
This disclosure provides systems and methods for sample processing and data analysis. Sample processing may include nucleic acid sample processing and subsequent sequencing. Some or all of a nucleic acid sample may be sequenced to provide sequence information, which may be stored or otherwise maintained in an electronic storage location. The sequence information may be analyzed with the aid of a computer processor, and the analyzed sequence information may be stored in an electronic storage location that may include a pool or collection of sequence information and analyzed sequence information generated from the nucleic acid sample. Methods and systems of the present disclosure can be used, for example, for the analysis of a nucleic acid sample, for producing one or more libraries, and for producing biomedical reports. Methods and systems of the disclosure can aid in the diagnosis, monitoring, treatment, and prevention of one or more diseases and conditions.
METHODS FOR NON-INVASIVE PRENATAL PLOIDY CALLING
The present disclosure provides methods for determining the ploidy status of a chromosome in a gestating fetus from genotypic data measured from a mixed sample of DNA comprising DNA from both the mother of the fetus and from the fetus, and optionally from genotypic data from the mother and father. The ploidy state is determined by using a joint distribution model to create a plurality of expected allele distributions for different possible fetal ploidy states given the parental genotypic data, and comparing the expected allelic distributions to the pattern of measured allelic distributions measured in the mixed sample, and choosing the ploidy state whose expected allelic distribution pattern most closely matches the observed allelic distribution pattern. The mixed sample of DNA may be preferentially enriched at a plurality of polymorphic loci in a way that minimizes the allelic bias, for example using massively multiplexed targeted PCR.
METHODS FOR NON-INVASIVE PRENATAL PLOIDY CALLING
The present disclosure provides methods for determining the ploidy status of a chromosome in a gestating fetus from genotypic data measured from a mixed sample of DNA comprising DNA from both the mother of the fetus and from the fetus, and optionally from genotypic data from the mother and father. The ploidy state is determined by using a joint distribution model to create a plurality of expected allele distributions for different possible fetal ploidy states given the parental genotypic data, and comparing the expected allelic distributions to the pattern of measured allelic distributions measured in the mixed sample, and choosing the ploidy state whose expected allelic distribution pattern most closely matches the observed allelic distribution pattern. The mixed sample of DNA may be preferentially enriched at a plurality of polymorphic loci in a way that minimizes the allelic bias, for example using massively multiplexed targeted PCR.
METHODS AND SYSTEMS FOR IDENTIFYING RECOMBINANT VARIANTS
Disclosed herein include systems, devices, and methods for identifying recombinant variants (e.g., gene conversion variants) of genes such as GBA gene and CYP21A2 gene, the copy numbers of recombinant variants, and gene variant status (e.g., carrier, compound heterozygous, or homozygous).
METHODS AND SYSTEMS FOR IDENTIFYING RECOMBINANT VARIANTS
Disclosed herein include systems, devices, and methods for identifying recombinant variants (e.g., gene conversion variants) of genes such as GBA gene and CYP21A2 gene, the copy numbers of recombinant variants, and gene variant status (e.g., carrier, compound heterozygous, or homozygous).
SOMATIC VARIANT COOCCURRENCE WITH ABNORMALLY METHYLATED FRAGMENTS
Systems and methods for identifying variant alleles as somatic or germline are provided. Reference and variant alleles for a genomic position are identified. Methylation states and sequences of nucleic acid fragment sequences that map to the genomic position are obtained from a sample of a subject. Using the sequences of nucleic acid fragment sequences, each nucleic acid fragment sequence that has the reference allele is assigned to a reference subset, and each nucleic acid fragment sequence that has the variant allele is assigned to a variant subset. One or more indications of the methylation states across the nucleic acid fragment sequences in the variant subset and an indication of the number of nucleic acid fragment sequences in the reference subset versus the variant subset are applied to a trained binary classifier. An identification of the variant allele at the genomic position as somatic or germline is obtained from the classifier.
SOMATIC VARIANT COOCCURRENCE WITH ABNORMALLY METHYLATED FRAGMENTS
Systems and methods for identifying variant alleles as somatic or germline are provided. Reference and variant alleles for a genomic position are identified. Methylation states and sequences of nucleic acid fragment sequences that map to the genomic position are obtained from a sample of a subject. Using the sequences of nucleic acid fragment sequences, each nucleic acid fragment sequence that has the reference allele is assigned to a reference subset, and each nucleic acid fragment sequence that has the variant allele is assigned to a variant subset. One or more indications of the methylation states across the nucleic acid fragment sequences in the variant subset and an indication of the number of nucleic acid fragment sequences in the reference subset versus the variant subset are applied to a trained binary classifier. An identification of the variant allele at the genomic position as somatic or germline is obtained from the classifier.
METHOD FOR DETERMINING FETAL NUCLEIC ACID CONCENTRATION AND FETAL GENOTYPING METHOD
Provided in the present disclosure are a method for determining fetal nucleic acid concentration and a fetal genotyping method. According to the embodiments of the present disclosure, the method for determining cell-free fetal nucleic acid concentration includes: (1) acquiring sequencing data of a first nucleic acid sample of a pregnant woman and a reference genome sequence, the first nucleic acid sample of the pregnant woman containing cell-free fetal nucleic acids, and the sequencing data being composed of a plurality of sequencing reads; (2) selecting a predetermined region on the reference genome sequence and determining, based on the sequencing data of the first nucleic acid sample of the pregnant woman, mutation information in the predetermined region; and (3) determining the concentration of cell-free fetal nucleic acids corresponding to the predetermined region based on the mutation information in the predetermined region.
METHOD FOR DETERMINING FETAL NUCLEIC ACID CONCENTRATION AND FETAL GENOTYPING METHOD
Provided in the present disclosure are a method for determining fetal nucleic acid concentration and a fetal genotyping method. According to the embodiments of the present disclosure, the method for determining cell-free fetal nucleic acid concentration includes: (1) acquiring sequencing data of a first nucleic acid sample of a pregnant woman and a reference genome sequence, the first nucleic acid sample of the pregnant woman containing cell-free fetal nucleic acids, and the sequencing data being composed of a plurality of sequencing reads; (2) selecting a predetermined region on the reference genome sequence and determining, based on the sequencing data of the first nucleic acid sample of the pregnant woman, mutation information in the predetermined region; and (3) determining the concentration of cell-free fetal nucleic acids corresponding to the predetermined region based on the mutation information in the predetermined region.