HPV integration disrupts host genomic structure and expression, but whether these alterations promote cancer development remains unclear. Multiple genomic analyses of oropharyngeal cancers identified several host gene fusions, including recurrent FGFR3-TACC3 fusions, expressed from rearranged genomic loci adjacent to HPV integration sites. Evolutionary modeling implicated integration of virus concatemers into the host genome as a common initiating event in fusion formation. Co-expression of HPV16 E6/E7 and FGFR3-TACC3, but neither alone, was sufficient for tumor development in both xenograft and syngeneic mouse models and led to unique transcriptional programs implicated in carcinogenesis. FGFR3-TACC3 expression decreased the ubiquitination and degradation of E6 and E7, thereby increasing oncoprotein abundance. We conclude that expression of HPV16 oncoproteins and host gene fusions generated from HPV integration sites can be sufficient for cancer development.
ONT whole-genome sequencing data for "HPV integration induces gene fusions" . We sequenced five HPV-positive head and neck cancer samples using Oxford Nanopore platform. The sequences was submitted in fastq format.
RNA-seq data for "HPV integration induces gene fusions" We performed RNA-seq analysis of five HPV+ head and neck cancer samples using Illumina short reads. Sequenced are submitted in bam format. We also sequenced one samples with pacBio long reads, and the reads are submitted in fastq format.
pacBio whole-genome sequencing data for "HPV integration induces gene fusions" We performed long read whole-genome sequencing on four HPV+ head and neck cancer samples using pacBio HiFi. The sequence reads were submitted in fastq format.
Illumina whole-genome sequencing data for "HPV integration induces gene fusions" We performed short read whole-genome sequencing of five HPV+ head and neck cancer samples using Illumina. The reads are submitted in bam or fastq file format.
Predicting resistance to chemotherapy using chromosomal instability signatures Joe Sneath Thompson1,2,*, Laura Madrid2,*, Barbara Hernando1,*, Carolin M. Sauer3, Maria Vias3, Maria Escobar-Rey1,2, Wing-Kit Leung2,3, Diego Garcia-Lopez2, Jamie Huckstep3, Magdalena Sekowska3, Karen Hosking4,5, Mercedes Jimenez-Linan5,6, Marika A. V. Reinius3,5,6, Abhipsa Roy2, Omar Abdulle2, Justina Pangonyte3, Harry Dobson2, Amy Cullen2,3, Dilrini De Silva2, David Gómez-Sánchez1,7, Marina Torres1, Ángel Fernández-Sanromán1, Deborah Sanders3, Filipe Correia Martins3,5,6, Ionut-Gabriel Funingana3,4,5, Giovanni Codacci-Pisanelli3,4,8, Miguel Quintela-Fandino1, Florian Markowetz2,3,4, Jason Yip2, James D. Brenton2,3,4,5,6, Anna M. Piskorz#,2,3, Geoff Macintyre#,1,2 1 Spanish National Cancer Research Centre (CNIO), Madrid, Spain 2 Tailor Bio Ltd, Cambridge, UK 3 Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK 4 Department of Oncology, University of Cambridge, Cambridge, UK 5 Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK 6 Cancer Research UK Major Centre - Cambridge, University of Cambridge, Cambridge, UK 7 H12O-CNIO Lung Cancer Clinical Research Unit, Health Research Institute Hospital 12 de Octubre (imas12), Madrid, Spain 8 University of Rome "la Sapienza", Rome, Italy
This dataset consists of 8 ovarian cancer DNA sample libraries prepared using the Illumina TruSight Oncology 500 Kit (Illumina, cat no. 20076480) following the manufacturer's protocol. Library quality and quantity were assessed with High Sensitivity/D5000 Screentape Assay (Agilent Technologies, cat no. 5067-5592) on 4150/4200 Tapestation Quant-IT/Qubit dsDNA HS (Qiagen, cat no. Q32851) assay according to the supplier’s recommendations. Libraries were then pooled together in equal ratios and sequenced using PE-150bp mode on NovaSeq S1 flow cell 300 cycles kit (Illumina, 20028317) or S4 flow cell 300 cycles kit (Illumina, 20028312) aiming for 100 million reads per sample. The dataset contains sequence reads in fastq file format.
This dataset will include Spatial Transcriptomics, Single-Cell RNA-Seq, Bulk RNA-Seq, Clinical data, WES, and H&E data from 15 Muscle-invasive Bladder Cancer patients, treated with upfront cystectomy. Researchers from private or public institutions outside the MOSAIC Consortium will be able to apply to access this data and, pending approval, use the data for their research.
Determining response to therapy for patients with pancreatic cancer can be challenging. We evaluated methods for assessing therapeutic response using cell-free DNA (cfDNA) in plasma samples from 40 patients with metastatic pancreatic cancer as part of the CheckPAC trial (NCT02866383). Patients were evaluated before and after initiation of therapy using tumor-informed plasma whole-genome sequencing (WGMAF), and genome-wide cfDNA fragmentation profiles and repeat landscapes (ARTEMIS-DELFI). Of those assessed with WGMAF, molecular responders had a median overall survival (OS) of 319 days compared to 126 days for non-responders (HR=0.29, 95% CI=0.11–0.79, p=0.011). For ARTEMIS-DELFI, patients with low scores after therapy initiation had a longer median OS than patients with high scores (233 days versus 172 days, HR=0.12, 95% CI=0.046-0.31, p<0.0001). We validated ARTEMIS-DELFI in a separate cohort of 40 patients with pancreatic cancer who were part of the PACTO trial (NCT02767557). These analyses suggest that non-invasive mutation and fragmentation-based cfDNA approaches can identify therapeutic response of individuals with pancreatic cancer.
Genome and transcriptome sequencing of cancer of unknown primary tumours was used to determine yield of clinical biomarkers for a molecular guided trial or for resolving cancer type of origin. This study includes profiling of germline DNA and tumour DNA by whole genome sequencing of tissue and cfDNA biopsies, as well as targeted genome sequencing using two panels, including Comprehensive cancer panel (CCP) and TruSight Oncology 500 (TSO500). This study also includes profiling of tumour RNA by whole transcriptome sequencing.