RaScALL: Rapid screening of RNA-seq in acute lymphoblastic leukaemia
RNA-sequencing (RNA-seq) efforts in acute lymphoblastic leukaemia (ALL) have identified numerous prognostically significant genomic alterations which can guide diagnostic risk stratification and treatment choices when detected early. However, a full RNA-seq Bioinformatics workflow is time-consuming and costly in a clinical setting where rapid detection and accurate reporting of clinically relevant alterations are essential. To accelerate the identification of ALL-specific alterations (including gene fusions, single nucleotide variants and focal gene deletions), we developed the rapid screening tool RaScALL, capable of identifying more than 100 prognostically significant lesions directly from raw sequencing reads. RaScALL uses the k-mer based targeted detection tool km and known ALL variant information to achieve a high degree of accuracy for reporting subtype defining genomic alterations compared to standard alignment-based pipelines. Gene fusions, including difficult to detect fusions involving EPOR and DUX4, were accurately identified in 98% (164 samples) of reported cases in a 180-patient Australian study cohort and 95% (n=63) of samples in a North American validation cohort. Pathogenic sequence variants were correctly identified in 75% of tested samples, including all cases involving subtype defining variants PAX5 p.P80R (n=12) and IKZF1 p.N159Y (n=4). Accurate detection of intragenic IKZF1 deletions resulting in aberrant transcript isoforms was also detectable with 98% accuracy. Importantly, the median analysis time for detection of all targeted alterations averaged 22 minutes per sample, significantly shorter than standard alignment-based approaches, ensuring accerelated risk-stratification and therapeutic triage.
Study
EGAS00001006460
Exploration of coding and non-coding variants in cancer using GenomePaint.
GenomePaint (https://proteinpaint.stjude.org/genomepaint) is a dynamic visualization platform for whole-genome, whole-exome, transcriptome, and epigenomic data, featuring a novel design that captures the inter-relatedness between DNA variations and RNA expression. Regulatory non-coding variants can be inspected and discovered along with coding variants, and their functional impact further explored by examining 3D genome and/or ChIP-seq data generated from cancer cell lines. Further, GenomePaint correlates mutation and expression patterns with patient outcomes, and can display external data such as adult cancer datasets and user-provided custom tracks. We used GenomePaint to analyze multi-omics data from 3,652 pediatric cancers representing 16 histotypes, and demonstrate the visualization features through examples, including two that led to new insights into oncogenic mechanisms in pediatric cancer. The first is the discovery of a new class of pathogenic recurrent variants that cause aberrant splicing, disrupting the RING domain of CREBBP, a driver gene frequently mutated in relapsed pediatric leukemia. The second is the cis-activation of the MYC oncogene in a subset of B-lineage acute lymphoblastic leukemia (B-ALL) via duplication of the NOTCH1-MYC enhancer (N-ME), previously discovered only in T-lineage ALL. The regulatory impact of N-ME enhancer amplification was initially confirmed by allelic imbalance in published gene expression and ChIP-seq data and verified by additional Capture-C and fluorescence in situ hybridization data generated by follow-up experiments. These examples demonstrate the power of GenomePaint in enabling not only data visualization but also integrative genomic analysis that can lead to novel biological insight for follow-up experimental validation.
Study
EGAS00001004669
Cancer and germline exomes, and cancer RNA-seq consisiting of FASTQ paired-end reads from melanoma, lung and colon cancer samples
Discovery of patient-specific tumor antigens usually requires in vitro-expanded autologous tumor infiltrating lymphocytes (TILs), among which tumor antigen-specific T cells are however often rare, thus limiting sensitivity. We designed in vitro culture conditions to improve the identification of rare tumor antigen-specific CD8 TILs. This innovative yet accessible pipeline allows highly-sensitive identification of tumor antigens and cognate T cell receptors (TCRs), greatly improving the selection of candidates for personalized cancer vaccines and TCR-based cellular immunotherapies.
Study
EGAS00001005513
Molecular analysis of circulating tumour cells identifies distinct profiles in chemosensitive and chemorefractory patients with small cell lung cancer
In a search for profiles that distinguish chemosensitive versus chemorefractory SCLC disease, we examined copy number aberrations (CNA) in circulating tumour cells (CTCs) from pre-treatment SCLC blood samples. A CNA classifier was generated from 88 CTCs isolated from 13 patients (training set) then verified in 18 additional patients (112 CTC samples) (testing set) and in six SCLC patient-derived CTC explants explant tumours. These data highlight the potential utility of CTCs for molecular profiling and stratification of SCLC patients in future clinical trials.
Study
EGAS00001001951
Lung Adenocarcinoma Promotion by Air Pollutants
A complete understanding of how environmental carcinogenic exposures promote cancer formation is lacking. Over 70 years ago, tumour formation was proposed to occur in a two step process: an initiating step which induces mutations in normal tissue, followed by a promoter step which triggers cancer development. Recent evidence has revealed healthy human tissue contains a patchwork of clones harbouring oncogenic mutations.This led us to hypothesise that environmental particulate matter measuring PM2.5, known to be associated with lung cancer risk, might promote lung cancer by acting on pre-existing cells harbouring oncogenic mutations in normal lung tissue. Here we use a combination of WGS and RNA-seq of mouse tumours from pollution-exposed mice to examine the impact of particulate matter on mutagenesis and gene expression respectively.
Study
EGAS00001006951
Whole exome sequencing of small cell neuroendocrine cancer of the cervix
In order to improve treatment selection for high grade neuroendocrine carcinomas of the cervix (NECC), we performed a comparative genomic analysis between this rare tumor type and other cervical cancer types, as well as extra-cervical neuroendocrine small cell carcinomas of the lung and bladder. We performed whole exome sequencing on fresh-frozen tissue from 15 NECCs and matched normal tissue. We then identified mutations and copy number variants using standard analysis pipelines. Published mutation tables from cervical cancers and extra-cervical small cell carcinomas were used for comparative analysis. Descriptive statistical methods were used and a two-sided threshold of P < .05 was used for significance. In the NECC cohort, we detected a median of 1.7 somatic mutations per megabase (range 1.0-20.9). PIK3CA p.E545K mutations were the most frequency observed oncogenic mutation (4/15 tumors, 27%). Activating MAPK pathway mutations in KRAS (p.G12D) and GNAS (p.R201C) co-occurred in two tumors (13%). In total we identified PI3-kinase or MAPK pathway activating mutations in 67% of NECC. When compared to NECC, lung and bladder small cell carcinomas exhibited a statistically significant higher rate of coding mutations (P < .001 for lung; P = .001 for bladder). Mutation of TP53 was uncommon in NECC (13%) and was more frequent in both lung (103 of 110 tumors [94%], P < .001) and bladder (18 of 19 tumors [95%], P < .001) small cell carcinoma. These comparative genomics data suggest that NECC may be genetically more similar to common cervical cancer subtypes than to extra-cervical small cell neuroendocrine carcinomas of the lung and bladder. These results may have implications for the selection of cytotoxic and targeted therapy regimens for this rare disease.
Study
EGAS00001003142
an integrated molecular study of clear cell renal cell carcinoma (ccRCC) including whole-genome/exome and RNA sequencing as well as array-based gene expression/copy-number/methylation analyses
Study
EGAS00001000509
Pediatric B-cell precursor acute lymphoblastic leukemia RNA sequencing
This dataset included 33 childhood B-cell precursor acute lymphoblastic leukemia RNA sequencing samples. All samples were subjected to Illumina pair-end sequencing.
Study
EGAS50000000763
ATACseq - Notch Signaling Maintains a Progenitor-Like Subclass of Hepatocellular Carcinoma
To assess how Notch inhibition impacts chromatin accessibility and how chromatin changes relate to HNF4A and CEBPA activities, we performed Assay-for-Transposase-Accessible-Chromatin (ATAC) sequencing of LIV78 tumors, 72 hours after treatment with NOTCH2 blocking or control antibodies. Our TF activity and chromatin analyses thus lead to a model in which Notch inhibition leads to increased expression of CEBPA, thus enabling CEBPA to partner with HNF4A to jointly drive transcriptional programs and the underlying chromatin rearrangements that promote differentiation of progenitor-like tumor cells to a mature hepatocyte fate incompatible with tumor growth and maintenance.
Study
EGAS50000000516
A comprehensive characterization of the cell-free transcriptome reveals tissue- and subtype-specific biomarkers for cancer detection
Cell-free RNA (cfRNA) is a promising analyte for cancer detection. However, a comprehensive assessment of cfRNA in individuals with and without cancer has not been conducted. We performed the first transcriptome-wide characterization of cfRNA in cancer (stage III breast [n=46], lung [n=30]) and non-cancer (n=89) participants from the Circulating Cell-free Genome Atlas (NCT02889978). Of 57,820 annotated genes, 39,564 (68%) were not detected in cfRNA from non-cancer individuals. Within these low-noise regions, we identified tissue- and cancer-specific genes, defined as "dark channel biomarker" (DCB) genes, that were recurrently detected in individuals with cancer. DCB levels in plasma were correlated with tumor shedding rate and RNA expression in matched tissue, suggesting that DCBs with high expression in tumor tissue could enhance cancer detection in patients with low levels of circulating tumor DNA. Overall, cfRNA provides a unique opportunity to detect cancer, predict the tumor tissue of origin, and determine the cancer subtype.
Study
EGAS00001004704