G16B40/00

DEEP NEURAL NETWORK-BASED SEQUENCING

A system, a method and a non-transitory computer readable storage medium for base calling are described. The base calling method includes processing through a neural network first image data comprising images of clusters and their surrounding background captured by a sequencing system for one or more sequencing cycles of a sequencing run. The base calling method further includes producing a base call for one or more of the clusters of the one or more sequencing cycles of the sequencing run.

Scalable pipeline for local ancestry inference

Ancestry deconvolution includes obtaining unphased genotype data of an individual; phasing, using one or more processors, the unphased genotype data to generate phased haplotype data; using a learning machine to classify portions of the phased haplotype data as corresponding to specific ancestries respectively and generate initial classification results; and correcting errors in the initial classification results to generate modified classification results.

Scalable pipeline for local ancestry inference

Ancestry deconvolution includes obtaining unphased genotype data of an individual; phasing, using one or more processors, the unphased genotype data to generate phased haplotype data; using a learning machine to classify portions of the phased haplotype data as corresponding to specific ancestries respectively and generate initial classification results; and correcting errors in the initial classification results to generate modified classification results.

Methods and systems for copy number variant detection

Methods and systems for determining copy number variants are disclosed. An example method can comprise applying a sample grouping technique to select reference coverage data, normalizing sample coverage data comprising a plurality of genomic regions, and fitting a mixture model to the normalized sample coverage data based on the selected reference coverage data. An example method can comprise identifying one or more copy number variants (CNVs) according to a Hidden Markov Model (HMM) based on the normalized sample coverage data and the fitted mixture model. An example method can comprise outputting the one or more copy number variants.

Methods and systems for copy number variant detection

Methods and systems for determining copy number variants are disclosed. An example method can comprise applying a sample grouping technique to select reference coverage data, normalizing sample coverage data comprising a plurality of genomic regions, and fitting a mixture model to the normalized sample coverage data based on the selected reference coverage data. An example method can comprise identifying one or more copy number variants (CNVs) according to a Hidden Markov Model (HMM) based on the normalized sample coverage data and the fitted mixture model. An example method can comprise outputting the one or more copy number variants.

Neural network architectures for scoring and visualizing biological sequence variations using molecular phenotype, and systems and methods therefor
11568960 · 2023-01-31 · ·

Systems and methods for scoring and visualizing the effects of variants in biological sequences. Variants may include substitutions, insertions and deletions. The method comprises encoding biological sequences as vector sequences and then operating a neural network in the forward-propagation mode and possibly in the back-propagation mode to compute variant scores. Variant scores are determined by normalizing the gradients. Variant scores may be used to select a subset of variants, which are then used to produce modified vector sequences which are analyzed by the neural network operating in forward-propagation mode, to determine improved variant scores. The variant scores may be visualized using black and white, greyscale or colored elements that are arranged in blocks with dimensions corresponding to different possible symbols and the length of the sequence. These blocks are aligned with the biological sequence, which is illustrated by a symbol sequence arranged in a line.

Neural network architectures for scoring and visualizing biological sequence variations using molecular phenotype, and systems and methods therefor
11568960 · 2023-01-31 · ·

Systems and methods for scoring and visualizing the effects of variants in biological sequences. Variants may include substitutions, insertions and deletions. The method comprises encoding biological sequences as vector sequences and then operating a neural network in the forward-propagation mode and possibly in the back-propagation mode to compute variant scores. Variant scores are determined by normalizing the gradients. Variant scores may be used to select a subset of variants, which are then used to produce modified vector sequences which are analyzed by the neural network operating in forward-propagation mode, to determine improved variant scores. The variant scores may be visualized using black and white, greyscale or colored elements that are arranged in blocks with dimensions corresponding to different possible symbols and the length of the sequence. These blocks are aligned with the biological sequence, which is illustrated by a symbol sequence arranged in a line.

Process for creating reference data for predicting concentrations of quality attributes

A process and system for efficiently producing reference data that can be fed into a predictive model for predicting quality attribute concentrations in cell culture processes. A perfusion bioreactor is operated at pseudo-steady-state conditions and one or more attribute influencing parameters are manipulated and changed over time. As the one or more attribute influencing parameters are manipulated, one or more quality attributes are monitored and measured. In one embodiment, multiple quality attributes are monitored and measured in parallel. The quality attribute information is recorded in conjunction with the changes in the attribute influencing parameters. This information is then fed to the predictive model for propagating cell cultures in commercial processes and maintaining the cell cultures within desired preset limits.

Process for creating reference data for predicting concentrations of quality attributes

A process and system for efficiently producing reference data that can be fed into a predictive model for predicting quality attribute concentrations in cell culture processes. A perfusion bioreactor is operated at pseudo-steady-state conditions and one or more attribute influencing parameters are manipulated and changed over time. As the one or more attribute influencing parameters are manipulated, one or more quality attributes are monitored and measured. In one embodiment, multiple quality attributes are monitored and measured in parallel. The quality attribute information is recorded in conjunction with the changes in the attribute influencing parameters. This information is then fed to the predictive model for propagating cell cultures in commercial processes and maintaining the cell cultures within desired preset limits.

IMMUNOME WIDE ASSOCIATION STUDIES TO IDENTIFY CONDITION-SPECIFIC ANTIGENS
20230024898 · 2023-01-26 ·

The present invention provides compositions and methods that can be used to identify an antigen or epitope region of an antigen specific for a disease or other condition. Such methods incorporate k-mer binding statistics to serum antibody from condition and control cohort samples to predict the suitability of antigen sequences identified as relevant to the disease or condition as antigen markers. Also disclosed herein are systems for performing the same.