An advanced software tool for analyzing DNA sequences from tumor samples has likely discovered new cancer-causing genes in a study led by researchers at Weill Cornell Medicine.
In the study, published Sept. 26 in Nature Communications, researchers designed software known as CSVDriver to map and analyze the locations of large mutations, known as structural variants (SVs), in tumor DNA datasets. They then applied the tool to a data set of 2,382 genomes from 32 different cancer types and analyzed the cancer genomes from different organ systems separately. The results confirmed the likely cancer-causing role of 47 genes, tentatively linked several of them to certain cancers for the first time, and pointed to 26 other genes as likely cancer-causing genes, although they had never been linked to cancer before.
“Our results show that CSVDriver could be of great benefit to the cancer research community by providing new insights into cancer development as well as potential new targets,” said the study’s senior author, Dr. Ekta Khurana, associate professor of physiology and biophysics and co-director of the Cancer Genetics and Epigenetics Program at Meyer Cancer Center at Weill Cornell Medicine.
The first author of the study was Dr. Alexander Martinez-Fundichely, associate professor of physiology and biophysics at Weill Cornell Medicine and member of the Khurana Laboratory.
Cancers typically arise and progress to greater malignancy when DNA mutations occur in a single cell, effectively eliminating or overriding the normal brakes on cell division. Cancer biologists have cataloged hundreds of these cancer-causing mutations over the past few decades, and many are now the target of drug treatments. But the discovery of carcinogenic mutations is far from over.
The vast majority of mutations in cancer cells are not driver mutations. They are so-called passenger or background mutations that do not promote tumor growth or survival. These passenger mutations are spread across the genome, and it can be difficult to distinguish driver mutations amidst all this “background noise.” Researchers have made significant progress in sorting out drivers and passengers in the simplest class of DNA mutations, point mutations, also known as single nucleotide variants. But they’ve made less progress on SVs, which are larger, more complex mutations, including deletions and extra copies of sometimes long segments of DNA.
In the new study, researchers developed CSVDriver to analyze datasets of SVs in cancer genomes to uncover likely cancer drivers.
The general idea was to model the distribution of background mutations that we would expect for a given cancer type, and then identify as potential driver sites regions where mutations occur more frequently than expected in a large proportion of patients.
dr Alexander Martinez-Fundichely, Associate Professor of Physiology and Biophysics, Weill Cornell Medicine
CSVDriver represents an advance on previous efforts in this area because it models the expected SV background in a way that accounts for tissue-specific factors that can affect that background, such as: B. the three-dimensional folding of DNA.
Overall, the analysis identified as putative cancer drivers within the large SV dataset 53 protein-coding genes, three DNA segments encoding regulatory RNAs, and 24 sites known as “enhancers” because they attract transcription factor proteins that affect the activity of other genes. Many of these suspects were already known to be cancer drivers from previous studies, so the results validated the algorithm in this respect.
However, CSVDriver also proved its value as a discovery tool by uncovering some known cancer-related genes as likely drivers of cancers with which they had not previously been associated, for example the gene DMD in esophageal cancer and NF1 in ovarian cancer. In addition, the results also highlighted 26 genes not previously associated with cancer as likely cancer drivers.
“These are results that can be followed up with further wet lab and animal model studies to study the effects of mutations in these genes, which in turn could lead to the development of new cancer treatments that target these mutations,” said Dr. Khurana, who is also a WorldQuant Foundation Research Fellow at Weill Cornell Medicine.
Most of the genomes analyzed in the study were from primary cancers, but Dr. Khurana and Martinez-Fundichely and their colleagues now plan to use CSVDriver to uncover drivers of advanced, metastatic cancer that carries the worst prognosis and has few effective treatments.
Source:
Magazine reference:
Martinez-Fundichely, A., et al. (2022) Modeling the tissue-specific breakpoint proximity of structural variations from whole genomes to identify cancer drivers. nature communication. doi.org/10.1038/s41467-022-32945-2.