INTERACTIVE PRECISION MEDICINE EXPLORER FOR GENOMIC ABBERATIONS AND TREATMENT OPTIONS
20180314795 ยท 2018-11-01
Inventors
- Yee Him CHEUNG (Boston, MA, US)
- Nevenka Dimitrova (Pelham Manor, NY, US)
- JOHANNA MARIA DE BONT (EINDHOVEN, NL)
Cpc classification
G16B20/20
PHYSICS
G16B45/00
PHYSICS
G16H50/20
PHYSICS
G16H10/60
PHYSICS
G06F3/0481
PHYSICS
G16B20/00
PHYSICS
International classification
G06F3/0481
PHYSICS
G16H10/60
PHYSICS
Abstract
A data-driven integrative visualization system and method for summarizing and presenting genomic aberrations, their drug responses and multi-omic data of a patient, is disclosed. Specifically, a method for displaying genomic aberrations and multi-omic data of a patient in an interactive tool which allows the medical practitioner to access underlying supporting biologic and scientific evidence from relevant knowledge bases through a set of graphical interactions, is described. The method comprises the steps of obtaining and inputting multi-omic data of a patient or cohorts, identifying genomic aberrations and their drug responses, and displaying this information in a first level interactive classical/circular ideogram in one or multiple layers on a GUI, from which the user can access and view further information on the gene and molecular levels. The system provides an improved process of integrative analysis on a patient's multi-omic data for effective treatment planning.
Claims
1. A computer-implemented method for summarizing and presenting patient-specific multi-omic data in a multilayered format, the method comprising: a computing device with a graphical user interface, determining a dataset of files containing patient information by obtaining genomic aberration and other omics data from a patient and storing said data on a non-transitory computer readable storage medium; determining selection criteria based on the patient dataset; inputting patient-specific data, by a user interface, onto a processor configured to receive said patient-specific data, selecting a cohort of samples based on user-defined demographic and phenotypic criteria from a repository of patient or healthy samples, and inputting said demographic and phenotype criteria into said computing device through said graphical user interface; extracting said cohort genomic aberration and omics data for comparison with the patient of interest based on said demographic and phenotype criteria and inputting said cohort genomic aberration and omics data, by a user interface, onto a processor configured to receive said cohort genomic aberration and omics data; annotating said patient-specific genomic aberration and omics data in a first layer of said multilayered format, using internal/external knowledge bases, which include information such as mutation impact, population allele frequency, disease association with model of inheritance and drug response; filtering said patient-specific genomic aberrations and omics data based on user-defined criteria, such as chromosome regions, genes and variant type/function/impact/population allele frequency; and displaying said patient-specific genomic aberration and omics data in said interactive multi-level format, wherein said multilayer format comprises; said first layer, said first layer comprising an interactive chromosomal view that summarizes all the clinically relevant or actionable genomic aberrations of said patient by marking them on the genome coordinates, including known drug responses associated with a particular mutation/gene marked next to the mutation/gene accordingly, said first layer further comprising; a first sub-layer comprising an ideogram view where chromosomes are arranged in a circular format; a second sub-layer comprising an ideogram view where each chromosome in said first sub-layer is separately displayed in a schematic; a second layer comprising an interactive intergenic genomic scale where multiple genes are displayed with their expression levels indicated by color; a third level comprising an interactive genic scale, depicting the structure and functional blocks within a gene, omics data such as methylation levels and gene/exon expression, the 3D protein structure (ribbon plot), with mutations marked and including general information about said gene; and a fourth level, comprising a molecular scale displaying the molecular sequence and its detailed annotations, such as the nucleotide sequence of the reference genome, the corresponding amino acid sequence in the protein-coding regions, nucleotide/amino acid changes caused by the mutations, exon/gene expression and methylation levels of CpG sites, ChIP-Seq data for histone modification.
2. The method of claim 1, wherein the mulilayered format is a circular or linear multilayered format.
3. The method of claim 1, wherein said obtaining genomic aberration and other omics data from a patient comprises the collection of tissue and blood samples from said patient, performing next-generation sample preparation and DNA/RNA seqeuncing, read alignment and culling of variants and gene expressions.
4. The method of claim 1, wherein said second layer further comprises additional data tracks to add more details, such as methylation, chromatin immunoprecipitation sequencing and assay data which may improve the functional view of genomic aberrations.
5. A non-transitory computer readable storage medium tangibly encoded with computer-executable instructions, that when executed by a processor associated with computing device having a graphical user interface, cause the device to carry out the steps of the method as defined in claim 1.
6. A computer program product, comprising a computer-readable code to be executed by one or more processors when retrieved from a non-transitory computer-readable medium, the computer-readable program code including instructions to: determine a dataset of files containing patient information by obtaining genomic aberration and other omics data from a patient and storing said data on a non-transitory computer readable storage medium; receive selection criteria by a user through a graphical user interface, said selection criteria determined by said user based on said patient dataset, and input said patient-specific data, onto a processor configured to receive said patient-specific data, select a cohort of samples based on user-defined demographic and phenotypic criteria from a repository of patient or healthy samples, and input said demographic and phenotype criteria into said computing device through said graphical user interface; extract said cohort genomic aberration and omics data for comparison with the patient of interest based on said demographic and phenotype criteria and inputting said cohort genomic aberration and omics data, by a user interface, onto a processor configured to receive said cohort genomic aberration and omics data; annotate said patient-specific genomic aberration and omics data, using internal/external knowledge bases, which include information such as mutation impact, population allele frequency, disease association with model of inheritance and drug response; filter said patient-specific genomic aberrations and omics data based on user-defined criteria, such as chromosome regions, genes and variant type/function/impact/population allele frequency; and display said patient-specific genomic aberration and omics data in said interactive multi-level format, wherein said multilayer format comprises; a first layer comprising an interactive chromosomal view that summarizes all the clinically relevant or actionable genomic aberrations of said patient by marking them on the genome coordinates, including known drug responses associated with a particular mutation/gene marked next to the mutation/gene accordingly, said first layer further comprising; a first sub-layer comprising an ideogram view where chromosomes are arranged in a circular format; a second sub-layer comprising an ideogram view where each chromosome in said first sub-layer is separately displayed in a schematic; a second layer comprising an interactive intergenic genomic scale where multiple genes are displayed with their expression levels indicated by color; a third level comprising an interactive genic scale, depicting the structure and functional blocks within a gene, omics data such as methylation levels and gene/exon expression, the 3D protein structure (ribbon plot), with mutations marked and including general information about said gene; and a fourth level, comprising a molecular scale displaying the molecular sequence and its detailed annotations, such as the nucleotide sequence of the reference genome, the corresponding amino acid sequence in the protein-coding regions, nucleotide/amino acid changes caused by the mutations, exon/gene expression and methylation levels of CpG sites, ChIP-Seq data for histone modification.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0023] The methods according to the invention will now be described in more detail with regard to the accompanying figures. The figures showing ways of implementing the present invention and are not to be construed as being limiting to other possible embodiments falling within the scope of the attached claims.
[0024] The methods according to the invention will now be described in more detail with regard to the accompanying figures. The figures showing ways of implementing the present invention and are not to be construed as being limiting to other possible embodiments falling within the scope of the attached claims.
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
DETAILED DESCRIPTION OF THE INVENTION
[0035] The present invention provides a system and method for summarizing and presenting genomic aberrations, their drug responses and multi-omic data of a patient, by displaying genomic aberrations and multi-omic data of the patient in an interactive classical/circular ideogram format which allows the medical practitioner to access underlying supporting biologic and scientific evidence from relevant knowledge bases through a set of graphical interactions. The present invention is described in further detail below with reference made to
[0036] With a computing device having a graphical user interface, the genomic aberration and omics data are then displayed in an interactive multi-level format. At Level 1 of the method and system for displaying patient-specific genomic data and genomic aberrations, all the clinically relevant or actionable aberrations of a patient are summarized by marking them on the genome coordinates (see
[0037]
[0038] By selecting a specific gene at Level 2, the user is directed to Level 3 of this embodiment, as shown in
[0039] Similarly, the user accesses Level 4, as seen in
Mutations and Drug Response
[0040] To enhance data presentation, the invention employs different symbols to represent different types of aberrations and drug/clinical trial associations with their levels of significance indicated by properties such as color and size, as can be seen in
Interactions
[0063] To enable the seamless navigation to the patient's multi-omic data at different levels of details and quick access to relevant information from different knowledge bases, the Precision Medicine tool of this invention is highly interactive and user friendly. The set of supported user interactions include, but are not limited to, the following: [0064] 1. Toggle between the classical ideogram, circos and horizontal (linear) views of the genome [0065] 2. Zoom in/out to different data levels by using a zoom slider, selecting a region on the genomic scale or directly specifying a gene, locus or the start and end chromosome positions [0066] 3. Rearrange the layout of chromosomes in the ideogram, rotate the circular ideogram or navigate to nearby regions by swiping [0067] 4. Select the inclusion/exclusion criteria for aberrations to be displayed, e.g., by specifying the types of mutations and the chromosome regions or gene subsets [0068] 5. Import and display additional tracks of data and annotation, e.g., mutational density [0069] 6. Select and display the omic data of one or more individual patients and cohorts in multiple layers [0070] 7. Hover on any color-scaled data, such as gene expression and methylation levels, and display the actual numerical values [0071] 8. Select a nucleotide, amino acid or mutation and their locations will be marked on the corresponding gene and 3D protein structure (see
Comparison of Multiple Samples and Cohorts
[0082] In a further embodiment, users can choose to display the omic data of a patient or a cohort of patients in multiple layers of the visual representation in the Precision Medicine Explorer for side-by-side comparison. See
Presentation Filters for Genomics Aberrations
[0083] In genomics, it is customary to offer multiple filtering options to the user for each of the types of genomic aberrations. Within this embodiment, the goal is to associate the genomic aberrations to key evidence for treatment planning. In any embodiment of this invention, users can determine what data is to be presented in one or multiple layers of ideogram by applying a combination of filters that include but are not limited to the following: [0084] 1. Chromosome regions, e.g., chr1:1000000-5000000, chrX, etc. [0085] 2. Genes [0086] (a) List of specific genes [0087] (b) Biological concepts or terms that are associated with gene subsets, e.g., oncogene, suppressor, transcription factor, signaling pathways such as ER, PR, Wnt, PI3K, MAPK, etc. [0088] (c) Significantly mutated genes (SMGs)users can select the methods for computing the SMGs and their parameters [0089] (d) Mutation burdenusers can specify the number and types of mutations that a gene needs to carry to be included for display [0090] (e) Genes which have associated drug response information: [0091] 3. Variant Type: single nucleotide variants (SNVs), short insertions/deletions (indels), copy number variations (CNVs), gene fusions, over expression, under expression, etc. [0092] 4. Variant Function: synonymous, missense, nonsense, nonsense mediated decay (NMD), frameshift, splice site, promoter, etc. [0093] 5. Variant Impact [0094] (a) Therapeutic/Pharmacogeneticvariants with available drug options. Genomic aberrations have associated drug response information: 1) resistance association that depicts that the mutation is associated with resistance within a certain indication and 2) response association that depicts that the mutation is associated with likely response to the drug within a certain indication (e.g., response to First generation Tyrosine kinase inhibitor) [0095] (b) Classificationcan be based on the ACMG guidelines, i.e., Classes 1-5 for somatic mutations, and for germline mutations pathogenic, likely pathogenic, uncertain significance, likely benign or benign [0096] (c) Pathogenicity predictionusers can choose a combination of algorithms and their thresholds, which are joined together by and/or operators [0097] 6. Variant Frequency in Ethnic Groupsminor allele frequency thresholds in one or more ethnic groups (white/black/Asian/all), with the conditions joined by and/or operators [0098] 7. Variant Frequency in Samples/Cohortsfor each sample/cohort, users can specify the range of the number/frequency of a variant or their carriers, with the conditions joined by and/or operators
Depending on the purpose of the application, e.g. diagnostic, therapy selection or research, different default filter settings can be applied so that only the relevant information is shown.
Search by Keywords with Autocomplete Suggestions
[0099] Users can show the genes or other information associated with a keyword on the ideogram by typing the keyword in a search box with autocomplete functionality. The search term can be a gene symbol, signaling pathway, disease, drug, or biological concept such as oncogene/suppressor, etc. Users can also search for a combination of these terms concatenated by logical operators, such as ,/OR, &/AND, etc. Once the data related to the search term(s) are retrieved from the databases, they are displayed on the same or a separate ideogram (see
[0100] Referring to
[0101] To make the zoom-in or zoom-out transition look continuous and smooth, and enhance the navigation and user experience, our Precision Medicine Explorer includes a 3D option that enables users to view the chromosome layouts from different visual perspectives (see
Association with Evidence for Key Findings
[0102] One essential functionality of our Precision Medicine Explorer is to display the drugs/treatments with their known predicted/experimental/clinical responses (increased/decreased) or clinical trial options associated with patient-specific data, such as genomic aberrations, up/down-regulated gene expressions, abnormal methylation levels or other omics anomalies with supporting evidence, which can be further explored through user interactions. For example, the gene mutation BRAF V600E is known for increased sensitivity to Vemurafenib in Melanoma, and the gene mutation EGFR T790M for resistance to tyrosine kinase inhibitors. Such associations can be looked up from local/external knowledge bases such as the Catalogue Of Somatic Mutations In Cancer (COSMIC) Database, the Mutations and Drugs Portal (MDP), the Cancer Drug Resistance Database (CancerDR), the Drug Gene Interaction Database (DGIdb) and ClinicalTrials.gov. Additional information on the drugs, such as the side effects, toxicity, mechanism of action, interactions with other drugs and the supporting scientific evidence can be accessed for display. Gathering, summarizing and presenting such information in one single tool can facilitate the design of combinatorial therapy and caution the potential threats of certain drug combinations that should be avoided.
Example
[0103] As a use case example, our Precision Medicine Explorer is used for examining the omic data of an ER+ breast cancer patient. From the top-level view, the oncologist gets a genomic overview of the clinically relevant mutations carried by the patient and the available drug options. As expected, an overexpression of the ESR1 gene was reported with a list of drug options consisting of ER inhibitors. If the oncologist wants to further examine the expression levels of the genes in the ER pathway, she would then add a track for gene expression and filter for a pre-defined panel of ER pathway genes. After inspecting the expression values, she confirmed whether the patient has a hyperactive ER pathway, which could be effectively suppressed by ER inhibitors. She also noticed that the patient carries a known pathogenic mutation in the PIK3CA gene. She clicks on the mutation and checks the allele frequency, function, pathogenicity, call quality, related publications, among other details, and confirmed that the mutation served as a good prognostic biomarker for favorable therapeutic response of PIK3CA inhibitors. After comparing the clinical evidence and possible side effects of the drug options, she decided to administer the two inhibitors with the strongest clinical evidence respectively for suppressing the activities of ER and PIK3CA in combination for treating the patient. Our Precision Medicine Explorer significantly improved the workflow of an oncologist in performing integrative analysis on a patient's omic data for treatment planning.