Patent classifications
G06F19/28
METHOD AND SYSTEM FOR COMPRESSING GENOME SEQUENCES USING GRAPHIC PROCESSING UNITS
The present invention provides a method for compressing genome sequences readers using GPU processing unit. The method comprising the steps of: identifying position of each given genome reader characters string in the sequence of a reference genome, determining alignment of each reader string within the reference genome, comparing each reader characters string to corresponding reference genome sequence based on determined alignment, filtering characters in each reader by GPU processor by eliminating similar characters and extracting only characters differences in association to their position in the genome sequence and recording filtered data of each reader in association to its alignment in genome reference at the genome compressed database.
SYSTEMS AND METHODS FOR PROCESSING SEQUENCE DATA FOR VARIANT DETECTION AND ANALYSIS
Systems and methods for processing sequence data are disclosed herein. In an embodiment, the system is comprised of a computing device that is configured for receiving, storing, and processing sequence data utilizing object-oriented functions. Sequencing is disclosed herein which provides for the customization of sequencing and analysis processing for next generation sequence processing and analysis. The system may be characterized as a bioinformatics system, which uses object oriented functions to process and store sequencing data efficiently and without the need for extensive programming knowledge. Object instances configured as part of the system may be manipulated, transformed, probed, and shared in memory, yet still saved to the disk. Due to the nature of sequence representation within the system, the required disk space needed is much less than existing bioinformatics programs. In another embodiment, MATLAB is utilized as part of the configuration of the system. Due to its object-oriented approach it may be adapted to more complex development functions and processing. This provides for much needed flexibility and ease of use.
INFORMATION ITEM RETRIEVAL
The invention provides a method and system for enabling a user (100) to identify one or more information items which the user (100) or another party has previously accessed, the method comprising the steps of: recording in a computer readable storage medium concurrent attributes (101) concerning one or more events or computer system states occurring concurrently with the previous access of the information items by the user (100) or other party; receiving a search request specification (102) from the user (100) seeking to find one of the previously accessed information items, the search request specification comprising one or more specified concurrent attributes (30,40,50) including at least one unrelated concurrent attribute (50) which bears no relation, other than concurrence, to the previously accessed information item being sought or to the previous access thereof; accessing the recorded concurrent attributes and identifying to the user one or more of the previously accessed information items which satisfy the search request specification.
GENOME ANALYSIS DEVICE AND GENOME VISUALIZATION METHOD
There is provided a genome analysis device configured to analyze genome data including a large quantity of fragmented genome base sequences, and transmit output data concerning the genome data in response to an output request from a client device connected via a network, the genome analysis device including; a storage unit for storing data for visualization of multiple different layers, for the genome data; a request receiving unit for receiving an output request from the client device; and an output data generating unit for selecting data for visualization of a layer corresponding to the output request from the storage when the request receiving unit receives the output request, and generating output data based on the data for visualization of the selected layer.
SYSTEM AND METHOD FOR MELTING CURVE NORMALIZATION
The present invention relates to methods for the analysis of nucleic acids present in biological samples, and more specifically to normalize a high resolution melt curve to assist in the identification of one or more properties of the nucleic acids. The present invention provides methods and systems that incorporate a background identification algorithm according to invention principles using raw melt curve data to identify reactions that are unrelated actual DNA melt reactions. Furthermore, a web-based application for analyzing experimental data is provided. The raw experimental data obtained from a variety of instruments is processed and analyzed on a server and presented to a user through a user interface (UI).
BACTERIAL EPIGENOMIC ANALYSIS
Provided herein are systems and methods for determining the epigenetic sequences and signatures of bacteria, methods of characterizing bacteria based thereon, and methods of use thereof.
SYSTEMS AND METHODS FOR GENOMIC MANIPULATIONS AND ANALYSIS
Method and systems for preparing and analyzing a DNA sample from a subject are provided herein. Also provided are methods and systems for obtaining, analyzing, and manipulating genomic and proteomic sequence data. In particular, methods and systems provided herein involve transformation of raw genetic or proteomic sequence into a compressed data set and transmission of the compressed data set using a fixed binary encoding scheme capable of compressing the data by up to 75%. An interface is in communication with the compression module and configured to display transmitted genomic and proteomic sequence data.
SYSTEMS AND METHODS FOR AUTOMATED ANNOTATION AND SCREENING OF BIOLOGICAL SEQUENCES
The present disclosure describes software tools for effective biosecurity based on community knowledge and participation. Annotation tools described herein provide assistance to the synthetic biology community to track emerging science on the link between individual proteins and negative outcomes. Screening tools described herein enables the community to broaden both interest and effective practice of biosecurity so that practitioners and biological sequence or construct providers are empowered to evaluate the safety of order requests rather than waiting until synthesis or even expression. In addition, screening tools described herein provide for screening of polynucleotides across the same or multiple orders for sequences associated with harmful biological sequences from a reference database.
Method for development of a peptide building block useful for de novo protein design
The present invention relates to a top-down symmetric deconstruction approach which provides a novel alternative means to successfully identify a useful polypeptide “building block” for subsequent “bottom-up” de novo design of target protein architecture. The present invention also pertains to a novel peptides isolated by top-down symmetric deconstruction which may be useful for design or directed evolution of novel proteins with novel functionalities.
Approximation of datastore storing indexed data entries
The technology can compute an approximation of a datastore storing a multiplicity of indexed data. An example method can a template including programming logic that, when executed, calculates output(s) based on input(s) and undetermined parameter(s). The undetermined parameter(s) are input into a machine learning framework. Data entries reflecting one or more inputs are retrieved from a datastore and input into the machine learning framework, which determines value(s) for the undetermined parameter(s), respectively (making them determined parameters). The example method generates an approximation of the datastore using the determined parameter(s) and the input(s).