Identifying cancer driver genes in tumor genome sequencing studies

Wholegenome sequencing and comprehensive molecular. As most chromatin regulators are normally associated with thousands of target genes and loci throughout the genome, their disruption is likely to have profound effects on global gene expression and contribute to various forms of malignancies. The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, socalled driver mutations. Sep 05, 20 a major challenge facing the field of cancer genome sequencing is to identify cancer associated genes with mutations that drive the cancer phenotype. Sequencing has identified millions of somatic mutations in human cancers, but distinguishing cancer driver genes remains a major challenge. Numerous methods have been developed to identify driver genes, but evaluation of the performance of these methods is hindered by the lack of a gold standard, that is, bona fide driver gene mutations. The pan cancer project explored the remaining 99 percent of the genome, which includes regions that regulate the activity of genes. Integration of multiomics data of cancer can help people to explore cancers comprehensively. Cancer genome sequencing studies have identified driver mutations in chromatin regulatory proteins, which cause deregulation of histone modifications, dna methylation, nucleosome remodeling, and highorder chromatin organization. Cancer genome sequencing an overview sciencedirect topics. Aug 25, 2017 a major aim of cancer genomics is to pinpoint which somatically mutated genes are involved in tumor initiation and progression. We performed wholegenome sequencing in 100 tumornormal pairs, along with dna copy number, gene expression and methylation profiling, for integrative genomic analysis.

Big step toward identifying all cancercausing genetic. Li ding, phd, led a team that analyzed dna from more than 11,000 tumors across 33 cancer types. Accurate identification of driver genes and driver mutations is critical for. Massive cancer genome study reveals how dna errors drive tumor growth. After the sequencing of the human reference genome, increasingly efficient sequencing techniques have made it possible to generate dna profiles of human cancers in large patient cohorts, spanning all human. It has been noted that the distinct features of driver gene alterations in different subgroups are based on clinical characteristics. Advanced dna sequencing technologies have accelerated the discovery of cancer genes by cataloging the genetic aberrations in cancerous cells 2, 3 and consortia such as the cancer genome atlas tcga and the international cancer genome consortium icgc have undertaken the systematic profiling of genomic alterations in many cancer types.

Our statistical approaches and several auxiliary bioinformatics tools have been incorporated into a computational tool, drgap, for cancer genomics research. Many advanced approaches, including targetregion sequencing, arraybased gene expression, copy number variation cnv, and whole genome sequencing of tumor samples, have been taken to portray the genomic landscape in pca 5 x 5 tomlins, s. However, the resulting sequencing datasets are large and complex, obscuring the clinically important mutations in a background of errors, noise, and random mutations. A much smaller fraction of driver mutations are important for cancer development, with current estimates ranging from 10 to 20 driver mutations per tumor. Identifying the drivers in the coding region and beyond part 2 of a series status updates and predictions 10 years after the first draft of the human genome sequence tuma, rabiya s. Gastric cancer is a heterogeneous disease with diverse molecular and histological subtypes. We performed whole genome sequencing in 100 tumor normal pairs, along with dna copy number, gene. The largest ever study to analyze entire tumor genomes has provided the most. Cancer specific highthroughput annotation of somatic mutations. Identifying cancer driver genes in tumor genome sequencing. Exome sequencing of oral squamous cell carcinoma in users of. It is a biochemical laboratory method for the characterization and identification of the dna or rna sequences of cancer cells. Differential analysis between somatic mutation and germline. Pdf identifying driver mutations in sequenced cancer.

In section 3, we will evaluate the new method using lung tumor genome sequences. Identifying cancer driver genes in tumor genome sequencing studies. These genes are involved in pathways known to contribute to cancer pathogenesis, but before this study most would not have been candidates for targeted gene therapy. Cancer sequencing using nextgeneration sequencing ngs methods provides more information in less time compared to traditional singlegene and arraybased approaches. Tumor suppressor or lossoffunction driver genes were discovered mainly by genetic studies of individuals with inherited cancer syndromes. Recurrent fusion of tmprss2 and ets transcription factor genes in pca. Wholegenome sequencing and comprehensive molecular profiling. Highthroughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. In summary, we have developed a powerful and flexible statistical framework for identifying driver genes and pathways in cancer genome sequencing studies. However, driver gene discovery is a very challenging task because we are not only dealing with huge amount of data. The ctd 2 network focuses on identifying and understanding 1 pathways that influence cancer phenotypes including understanding the function of the genes target that are essential during cancer initiation, progression and maintenance.

New technologies and the knowledge gained from previous genomic studies could. Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. Computational tools that prioritize cancer driver genes are needed. However, with a large volume of different omics and functional data being generated, there is a major challenge to distinguish functional driver genes from a sea of inconsequential passenger genes that accrue stochastically but do not contribute to cancer development. The clonal theory of cancer posits that all cancerous cells in a tumor descended from a single cell in which the first driver mutation occurred, and that subsequent clonal expansions and selective sweeps lead to a tumor with a dominant majority population of cancerous cells containing early driver events. Computational approaches to enable precision medicine. Tumor dna sequencing also called genetic profiling or genetic testing is a test to identify dna changes in a patients cancer. Cancer wholegenome sequencing tumornormal comparisons to. Cancer is a genetic disease with somatically acquired genomic aberrations. Wholegenome and transcriptome sequencing of prostate cancer.

Carter h, chen s, isik l, tyekucheva s, velculescu ve, et al. Identification of cancer driver genes from a custom set of. Cancer sequencing methods understanding genetic changes in. Previously, we have developed a database, driverdb, to integrate all public cancer sequencing data and to identify cancer driver genes according to bioinformatics tools. We introduce a new framework for uncovering cancer genes, differential mutation analysis, which compares the mutational profiles of genes across cancer genomes with their natural germline variation across healthy individuals. Genome sequencing to identify driver pathways in cancer. Cancers free fulltext identifying cancer driver genes. Jan 31, 2019 identifying cancer driver genes cdg is a crucial step in cancer genomic toward the advancement of precision medicine. This is the first study that has used whole genome sequencing to evaluate a. In some cases, this information can help determine a treatment plan.

With ngs, researchers can perform whole genome studies, targeted gene profiling, tumor normal comparisons, and more. Major tumor sequencing projects have been conducted in the past few years to identify genes that contain driver somatic mutations in tumor samples. Highthroughput dna sequencing is revolutionizing the study of cancer and enabling the measurement of the somatic mutations that drive cancer development. Previous cancer genome studies focused on the 1 percent of the genome that codes for proteins, known as the exome. Cancer genome sequencing is the whole genome sequencing of a single, homogeneous or heterogeneous group of cancer cells. Dec 23, 2014 whole genome sequencing can be used to identify patients risk for hereditary cancer, researchers have demonstrated. Ontologybased prediction of cancer driver genes scientific. In this article, we propose and evaluate a new method for identifying driver genes. Wholegenome sequencing can successfully identify cancer. Driver mutations are required for the cancer phenotype, whereas passenger mutations are irrelevant to tumor development and accumulate through dna replication. Identifying and distinguishing cancer driver genes among thousands of candidate mutations remains a major challenge.

Here, we describe a powerful and flexible statistical framework for identifying driver genes and driver signaling pathways in cancer genome sequencing studies. Comparison of different functional prediction scores using a. Although a large number of genetic alterations that drive the development and progression of many types of cancer have been identified through largescale research studies, some tumor types have not been deeply characterized. A major challenge in cancer genome sequencing is to identify cancer associated genes with mutations that drive the cancer phenotype 3,48. Rachel karchin, phd, is a professor of biomedical engineering, oncology, and computer science, with joint appointments at the whiting school of engineering and school of medicine at johns hopkins university in baltimore. This analysis validated the approach of whole cancer genome sequencing in identifying somatic mutations and the importance of parallel sequencing of normal and tumor cell genomes. Several major cancer sequencing projects, such as the cancer genome atlas tcga, the international cancer genome consortium icgc and the. Identifying cancerdriving gene mutations cancer network. Identifying driver mutations in sequenced cancer genomes.

May 11, 2014 gastric cancer is a heterogeneous disease with diverse molecular and histological subtypes. Identifying cancer driver genes in tumor genome sequencing studies article pdf available in bioinformatics 272. Over the decade, many computational algorithms have been developed to predict the effects of. Research wholeexome sequencing combined with functional. These genes have been defined as those for which the nonsilent mutation rate is significantly greater than a background mutation rate estimated from silent mutations.

In section 2, we will define pvalues for testing whether a gene is a driver gene. Discussion of identifying cancer driver genes in tumor. Because cancer cells have a large variety of relatively rare mutations, genome wide studies for identifying cancer driver mutations require sequencing numerous patients. The project is part of the cancer genome atlas, a national program aimed at understanding the genetic roots of cancer photo credit. Tumor normal comparisons are crucial for identifying the somatic variants that act as driver mutations in cancer progression. Identifying driver mutations in a patients tumor cells is a central task in the era of precision cancer medicine. A major challenge facing the field of cancer genome sequencing is to identify cancer associated genes with mutations that drive the cancer phenotype.

She is a core member of the institute for computational medicine. The pan cancer project also improved and developed new methods for analyzing cancer genomes. Massive cancer genome study reveals how dna errors drive. Nov 22, 2019 identifying and distinguishing cancer driver genes among thousands of candidate mutations remains a major challenge. Earlier genetic studies identified a number of genes involved in hnscc including tp53, cdkn2a, pik3ca, ccnd1, egfr, pten and hras. Research wholeexome sequencing combined with functional genomics reveals novel candidate driver cancer genes in endometrial cancer han liang,1,9,11 lydia w.

252 1284 1165 75 775 933 1173 1271 122 830 1004 517 1066 538 204 821 214 1171 180 710 1092 207 523 904 586 1314 322 326 505 208 20 784 1483 7 853 1013 838 262 1241