Shallow Whole Genome Sequencing of Patient Derived Xenografts
Chemotherapy is the standard-of-care treatment for metastatic colorectal cancer (mCRC) and benefits some patients, but what distinguishes responders from non-responders is unclear. In this study, we leveraged a comprehensive collection of 27 molecularly annotated patient-derived xenografts to uncover functional predictors of response to 5-FU and irinotecan combination therapy (FOLFIRI) in mCRC. Genetic analyses revealed that treatment sensitivity was marked by genomic scars indicative of BRCAness, suggesting homologous recombination (HR) deficiency as a key determinant. Accordingly, we surveyed a manually curated panel of 44 genes with a documented role in HR for the potential presence of pathogenic mutations. We did not observe a specific enrichment of HR gene mutations based on response to FOLFIRI. This result, combined with the absence of widespread biallelic inactivation of the analyzed genes and the predominance of mutations categorized as variants of unknown significance, suggests that FOLFIRI sensitivity is not primarily governed by underlying mutations in HR genes responsible for mitigating the genotoxic effects of this therapeutic regimen.
Study
EGAS50000000191
Efficacy and safety of entrectinib in patients with ROS1-positive advanced/metastatic non-small cell lung cancer (NSCLC) from the Blood First Assay Screening Trial (BFAST)
BFAST is a global, open-label, multicohort trial that evaluates the efficacy and safety of multiple therapies in patients with advanced/metastatic NSCLC and targetable alterations, identified by blood-based molecular testing. We present data from Cohort D (ROS1-positive). Patients ≥18 years old with stage IIIB/IV, ROS1-positive NSCLC detected by blood-based testing, received entrectinib 600 mg daily. At data cut-off (November 2021), 55 patients were enrolled and 54 had measurable disease. Cohort D met its primary endpoint; the confirmed objective response rate (ORR) by investigator was 81.5% which was consistent with the ORR from the integrated analysis of entrectinib (investigator-assessed ORR: 73.4%, data cut-off May 2019, ≥12 months follow-up). The safety profile of entrectinib was consistent with prior reports. These results demonstrate consistency with those from the integrated analysis of entrectinib in patients with ROS1-positive NSCLC identified by tissue-based testing and support the clinical value of blood-based testing to inform clinical decision-making.
Study
EGAS50000000105
Clonal hematopoiesis is associated with adverse outcomes in patients with COVID-19
Clonal hematopoiesis of indeterminate potential (CHIP) is defined as the occurrence of an expanded proportion of mature blood cells derived from a mutant hematopoietic precursor without evidence of hematological malignancies. The principle behind this is that the somatic mutation confers a fitness advantage to the cell in which it arose. Different clinical consequences are linked with this expansion. Early evidence of an association with higher mortality risk was provided. This was not related to higher rates of cancer but was associated in particular with increased cardiovascular mortality. Mechanistically, inflammatory processes are not only related to the development of clonal hematopoiesis, but in turn it is also a driver of inflammation. Besides pulmonary symptoms, COVID-19 evokes complex extra-pulmonary manifestations driving the pathophysiology. Among them, both inflammatory and cardiac-associated mechanisms have been deciphered. With the aim of assessing the impact of clonal hematopoiesis on the pathophysiology of COVID-19, hospitalized patients with severe or critical course were evaluated for the presence of CHIP driver mutations and, more importantly, the association with the clinical picture.
Study
EGAS00001006218
Scalable whole-genome single-cell library preparation without pre-amplification
Single-cell genomics is critical for understanding cellular heterogeneity in cancer, but existing library preparation methods are expensive, require sample preamplification and introduce coverage bias. Here we describe direct library preparation (DLP), a robust, scalable, and high-fidelity method that uses nanoliter-volume transposition reactions for single-cell whole-genome library preparation without preamplification. We examined 782 cells from cell lines and triple-negative breast xenograft tumors. Low-depth sequencing, compared with existing methods, revealed greater coverage uniformity and more reliable detection of copy-number alterations. Using phylogenetic analysis, we found minor xenograft subpopulations that were undetectable by bulk sequencing, as well as dynamic clonal expansion and diversification between passages. Merging single-cell genomes in silico, we generated "bulk-equivalent" genomes with high depth and uniform coverage. Thus, low-depth sequencing of DLP libraries may provide an attractive replacement for conventional bulk sequencing methods, permitting analysis of copy number at the cell level and of other genomic variants at the population level.
Study
EGAS00001002170
Sequencing_component_for_the_whole_genome_methylation_analysis_in_PBMCs_and_cell_subsets__pilot_study_
DNA methylation has been shown to play a major role in determining cellular phenotype by regulating gene expression. Moreover, dysregulation of differentially methylated genes has been implicated in disease pathogenesis of various conditions including cancer development as well as autoimmune diseases such as systemic Lupus erythematosus and rheumatoid arthritis. Evidence is rapidly accumulating for a role of DNA methylation in regulating immune responses in health and disease. However, the exact mechanisms remain unknown. The overall aim of the project is to investigate the role of epigenetic mechanisms in regulating immunity and their impact on autoimmune disease pathogenesis.The aim of this pilot study is to perform whole genome methylation analysis in peripheral blood mononuclear cells (PBMCs) and cell subsets (CD4, CD8, CD14, CD19, CD16 and whole PBMCs) obtained from 6 healthy volunteers. Whole genome methylation analysis will be performed using two methodological approaches, the Infinium Methylation Bead Array K450 (Illumina) and MeDIP-seq. mRNA expression arrays will also be performed in order to correlate DNA methylation with gene expression as well as genotyping on the Illumina OmniExpress chip
Study
EGAS00001000490
Kalirin-RAC controls nucleokinetic migration in ADRN-type neuroblastoma
The migrational propensity of neuroblastoma is affected by cell identity, but the mechanisms behind the divergence remain unknown. Using RNAi and time-lapse imaging, we show that ADRN-type NB cells exhibit RAC1- and kalirin-dependent nucleokinetic (NUC) migration that relies on several integral components of neuronal migration. Inhibition of NUC migration by RAC1 and kalirin-GEF1 inhibitors occurs without hampering cell proliferation and ADRN identity. Using three clinically-relevant expression dichotomies, we reveal that most of up-regulated mRNAs in RAC1- and kalirin-GEF1-suppressed ADRN-type NB cells are associated with low-risk characteristics. The computational analysis shows that, in a context of overall gene set poverty, the upregulomes in RAC1- and kalirin-GEF1-suppressed ADRN-type cells are a batch of AU-rich elements (ARE)-containing mRNAs, which suggests a link between NUC migration and mRNA stability. GSEA-based search for vulnerabilities reveals prospective weak points in RAC1- and kalirin-GEF1-suppressed ADRN-type NB cells, including activities of H3K27- and DNA methyltransferases. Altogether, these data support the introduction of NUC inhibitors into cancer treatment research.
Study
EGAS00001005023
Somatic_mutation_profiling_of_intestinal_crypts_from_IBD
In this project we want to investigate the role that somatic mutations in intestinal stem cells play in IBD. We will use laser capture microdissection to ascertain epithelial cells from an individual crypts/vilus. We will extract DNA from these and use a new low-input material library prep methods to make accurate libraries for sequencing. In this pilot phase of the project we will perform 15X whole-genome sequencing of a small number of crypts biopsied from four patients. Biopsies have been taken from inflammed and noninflammed patches to compare mutation rates and mutational signatures. We will also test if crypts from inflammed patches of a single patient are clonal (either within a given inflammed patch or across inflammed patches). We will quantify the rate of cancer driver mutations in inflammed and non-inflammed crypts.
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Study
EGAS00001002896
Inherited damaging mutations in immune-related genes favour the development of genetically heterogeneous synchronous colorectal cancer.
Synchronous colorectal cancers are physically separated tumours that develop simultaneously in the intestine. To understand how common genetic and environmental backgrounds influence the development of multiple tumours, we conduct a genome-wide comparative analysis of somatic alterations of 20 syCRCs from 10 patients. We show that syCRCs have independent genetic origins, acquire discordant somatic alterations, and follow different clonal developments.
Study
EGAS00001001461
Pseudotime_ordering_of_cell_cycle_state
We will be using G&T method to sequence single cell genome and transcriptome derived from FS13B iPSCs cell line. The cell cycle state of each of the single cells is known. Hence, we will be analysing the genome and transcriptome of single cells from each of the cell cycle state to generate a copy number profile and transcriptome profile per given cell cycle stage: G1, S, G2, S.
Study
EGAS00001003293
Correlative Genomic Analysis of Durvalumab plus Pazopanib Combination in Patients with Advanced Soft Tissue Sarcomas
We conducted an open-label, phase 2 study to determine the activity of the anti-VEGF receptor tyrosine-kinase inhibitor, pazopanib, combined with the anti-PD-L1 immune checkpoint inhibitor, durvalumab, in unselected advanced sarcomas. In addition, we conducted whole exome and transcriptomic sequencing with pre-treatment tissue biopsy to correlate clinical outcomes with molecular and genomic biomarkers to identify patients who would most likely benefit from the combination treatment.
Study
EGAS50000000082
Clonal_selection_after_gene_therapy_in_SCD___Duplex_sequencing
This study includes the targeted duplex sequencing data relating to this study. A custom targeted panel was used including 29 AML-associated genes, and a set of mutations from driver-containing clones that were sequenced during the WGS tree-building phase of the study. Targeted sequencing is performed on each of 6 individuals included at the study at at least 3 time points per individual: one prior to gene therapy, and two post-therapy time points.
Study
EGAS00001007253
The_genomic_architecture_of_mesothelioma_
The genome-wide landscape of somatically acquired mutations in mesothelioma has not been deeply characterised to date, but advances in DNA sequencing technology now allow this to be addressed comprehensively. Harnessing massively parallel DNA sequencing platforms, we will identify somatically acquired point mutations in all coding regions of the genome from patients with mesothelioma. In addition, using paired-end sequencing, we will map copy number changes and genomic rearrangements from the same patients.
Study
EGAS00001000353
Mutational_Signatures_of_relapse_in_rectal_cancer_FFPE_samples_in_the_CR07_clinical_trial
The CRO7 clinical trial recruited patients with clinically operable rectal adenocarcinoma. Patients were randomized to either pre-operative short course surgery followed by chemo-radiotherapy only in those patients at high risk of local relapse. Patients in both arms the received standard %-FU based adjuvant chemotherapy as per local policy. We intend to use FFPE derived DNA from the primary tumours to identify patterns of mutations or copy number alterations that are predictive of local or distant relapse.
Study
EGAS00001000651
Colorectal_Adenoma_Gene_Screen
A pilot to establish the feasability of using a custom Agilent targeted pulldown of 110 genes implicated in colorectal tumourigensis to sequence for driver mutations in a set of 30 FFPE colorectal adenomas. If successful, we propose to sequence an additional 350 adenomas as part of a MRC research study in order to define the pattern of driver mutations across the spectrum of pathological subtypes including coventional adenomas, serrated adenomas and hyperplastic polyps
Study
EGAS00001001261
Exploration_of__mutational_processes_in_human_cancer_cell_lines__Exome
Approximately 25 cell lines representative of 11 mutation signatures have been selected and will be grown from a single cell, referred to as parental clone, for a period of 3 months to allow accumulation of mutations. Following that period, we will obtain 2 subclones from each of the parental clone for sequencing analysis. In total we will be sequencing 2 parental clones and 4 subclones per cell line for approximately 25 cell lines.
Study
EGAS00001000790
Distinct portrayal of lesions in synchronous multifocal lung adenocarcinoma revealed by genome sequencing
Distinguishing multiple primary lung cancers in the synchronous multifocal intrapulmonary lesions has important significance on clinical staging and therapeutic decision. To investigate genomic aberration profiles, we applied whole genome and whole exome sequencing, and microarray-based comparative genomic hybridization on 15 intrapulmonary tumors derived from six patients with synchronous multifocal lung cancers having similar histological diagnosis. Any pair of intrapulmonary tumors in a single patient, which shared the identical genetic background and environment, showed an extinctive heterogeneity between each other. Phylogenetic relationship analysis indicated an independently branched evolution among all the tumors, suggesting they were multiple primary lung cancers. EGFR or KRAS mutations were found in 7 or 3 out of the 15 tumors, from 3 or 2 patients, respectively. Somatic mutational heterogeneity of these two genes in a single patient was also observed. Our analysis indicates genomic aberration profiling is valuable for identification of multiple primary lung cancer, especially when high histopathological concordance was observed between lesions. We also suggest a thoroughly molecular diagnosis against therapeutic target genes should be taken for each accessible nodule before making a plan for adjuvant therapy.
Study
EGAS00001001572
SmMIP-tools:a computational toolset for processing and analysis of single-molecule molecular inversion probes derived data
Single-molecule molecular inversion probes (smMIPs) provides a modular and cost-effective platform for high-multiplex targeted next-generation sequencing (NGS). Nevertheless, translating the raw smMIP-derived sequencing data into accurate and meaningful information currently requires proficient computational skills and a large amount of computational work, prohibiting wide-scale adoption of smMIP-based technologies. To enable easy, efficient, and accurate interrogation of smMIP-derived data, we developed SmMIP-tools, a computational toolset that combines the critical analytic steps for smMIP data interpretation into a single computational pipeline. Here, we describe in detail two major components of the software. The first is a read processing tool that performs quality control steps, generates read-smMIP linkages and retrieves molecular tags. The second is an error-aware variant caller capable of detecting single nucleotide variants (SNVs) and short insertions and deletions (indels). Using a cell-line DNA dilution series and a cohort of blood cancer patients, we benchmarked SmMIP-tools and evaluated its performance against clinical sequencing reports. We anticipate that SmMIP-tools will increase accessibility to smMIP-technology, enabling cost-effective genetic research to push personalized medicine forward.
Study
EGAS00001005359
Target gene sequencing for human normal endometrial glands
To elucidate the timing and mechanism of the clonal expansion of somatic mutations in cancer-associated genes in the normal endometrium, we conducted target sequencing of 112 genes for 1,298 endometrial glands and matched blood samples from 36 women. By collecting endometrial glands from different parts of the endometrium, we showed that multiple glands with the same somatic mutations occupied substantial areas of the endometrium. The 112 genes are as follows: ABCC1, ACRC, ANK3, ARHGAP35, ARID1A, ARID5B, ATCAY, ATM, ATR, BARD1, BCOR, BRCA1, BRCA2, BRD4, BRIP1, CAMTA1, CDC23, CDYL, CFAP54, CHD4, CHEK1, CHEK2, CTCF, CTNNB1, CUX1, DGKA, DISP2, DYNC2H1, EMSY, FAAP24, FAM135B, FAM175A, FAM65C, FANCA, FANCB, FANCC, FANCD2, FANCE, FANCF, FANCG, FANCI, FANCL, FANCM, FAT1, FAT3, FBN2, FBXW7, FGFR2, FRG1, GPR50, HEATR1, HIST1H4B, HNRNPCL1, HOOK3, KIAA1109, KIF26A, KMT2B, KMT2C, KRAS, LAMA2, LRP1B, MLH1, MON2, MRE11A, MSH2, MSH6, MTOR, NBN, PALB2, PHEX, PIK3CA, PIK3R1, PLXNB2, PLXND1, PMS2, POLE, POLR3B, PPP2R1A, PTEN, PTPN13, RAD50, RAD51, RAD51B, RAD51C, RAD51D, RAD52, RAD54B, RAD54L, RICTOR, SACS, SIGLEC9, SLC19A1, SLX4, SPEG, STT3A, TAF1, TAF2, TAS2R31, TFAP2C, TNC, TONSL, TP53, TTC6, UBA7, VNN1, WT1, XIRP2, ZBED6, ZC3H13, ZFHX3, ZFHX4, ZMYM4.
Study
EGAS00001005914
The chemotherapeutic CX-5461 is extremely mutagenic and may increase cancer risk
The chemotherapeutic agent CX-5461 or pidnarulex has been fast-tracked by the US FDA for treatment of BRCA1-, BRCA2-, and PALB2-mutated cancers. It is under investigation in Phase I/II clinical trials. Here we find that although CX-5461 exhibits synthetic lethality in BRCA1-/BRCA2-deficient cells, it also causes extensive, non-selective, collateral mutagenesis in all cells, to magnitudes that exceed known environmental carcinogens, raising public health concerns regarding its potential for promoting secondary cancers.
Study
EGAS50000000023
BE_screens_of_WRN_gene_in_MSI_models
We will take advantage of a base-editing screening strategy to engineer mutations in the WRN gene by deep mutagenesis. Two MSI-H cell lines sensitive to WRNi will be genetically modified to express doxycycline-inducible ABE and CBE base editors through a knock-in strategy. Moreover, we will design a pooled library of ~4000 gRNAs targeting the WRN gene plus appropriate controls gRNAs. Library-transduced cells will be selected with WRNi to identify WRN mutations able to interfere with the activity of WRNi.
Study
EGAS00001006872
Expression quantitative trait loci influence DNA damage-induced apoptosis in cancer
The analysis of e2QTL allows for the identification of context-specific eQTL effects (Kim-Hellmuth et al. (2017), PMID: 28814792). To evaluate how inter-individual genetic variability influences the regulation of DNA damage-induced apoptosis, we performed e2QTL analysis of CD8+ T cells from 461 healthy European participants stimulated with high doses of 5 different carcinogens. These include Methyl-methanesulfonate (MMS), tert-butyl-hydroperoxide (TBOOH), benzo(a)pyrene-7,8-diol-9,10-epoxide (BPDE), 4-hydroxycyclophosphamide (HC) and UVC radiation.
Study
EGAS50000000666
RNAseq___Discovery_of_resistance_mechanisms_to_the_BRAF_inhibitor_vemurafenib_in_metastatic_BRAF_mutant_melanoma
We propose to biopsy 20 consented BRAF mutant melanoma patients at Addenbrooke's Hospital pre-treatment with vemurafenib and also upon the development of resistant disease, with the aim of using exome sequence and SNP6 data to identify novel sequence variants and copy number alterations that can be used to validate observed resistance mechanisms in our cell line models and also to use these models to inform as to likely candidate small molecule inhibitors to overcome resistance and that could be tested in the clinical trial setting.
Study
EGAS00001000813
Discovery_of_resistance_mechanisms_to_the_BRAF_inhibitor_vemurafenib_in_metastatic_BRAF_mutant_melanoma
We propose to biopsy 20 consented BRAF mutant melanoma patients at Addenbrooke's Hospital pre-treatment with vemurafenib and also upon the development of resistant disease, with the aim of using exome sequence and SNP6 data to identify novel sequence variants and copy number alterations that can be used to validate observed resistance mechanisms in our cell line models and also to use these models to inform as to likely candidate small molecule inhibitors to overcome resistance and that could be tested in the clinical trial setting.
Study
EGAS00001000812
Amplicon_based_sequencing_of_drug_resistant_organoids
A BRAF V600E colorectal organoid which is sensitive to MAP kinase inhibition was mutagenised with the chemical mutagen ENU and then drug selected using a combination of Trametinib, Dabrafenib and Cetuximab. Single cell derived organoids were then manually picked and expanded in drug. Resistance was confirmed in a 14 day assay and DNA was collected. These then underwent targeted amplicon-based sequencing to confirm candidate resistance effectors from a screen in 2 2D BRAF V600E colorectal cell lines. Pools of resistant clones were also sequenced.
Study
EGAS00001001639
ENU_LS_411N_TripleTherapy
BRAF V600E colorectal cancers do not respond to the only currently FDA approved targeted therapy for CRC. There is currently a trial underway in the UK recruiting V600E CRC patients for treatment with a triple therapy combination of Cetuximab, Trametinib and Dabrafenib. We have mutagenized a pool of V600E CRC cell lines and treated with this triple therapy to select out drug resistant clones. We will now sequence these drug resistant clones with the aim of identifying common point mutations engendering resistance to this new therapy.
Study
EGAS00001001777
The WID-CIN test identifies women with, and at risk of, cervical intraepithelial neoplasia grade 3 and invasive cervical cancer (CIN3+)
We analysed DNA methylation at >850,000 CpG sites across the genome using the Illumina EPIC array in a total of 1,254 cervical liquid-based cytology samples from cases of screen-detected histologically verified CIN1-3+ (98% hrHPV-positive) and population-based control women free from any cervical disease (100% hrHPV-positive). We developed the WID-CIN-test which is a DNA methylation signature consisting of 5,000 CpG sites.
Study
EGAS00001005078
Allele Balance Bias Identifies Systematic Genotyping Errors and False Disease Associations
In recent years, Next Generation Sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although random sequencing errors can be modeled statistically and deep sequencing minimizes their impact, systematic errors remain a problem even at high depth of coverage. Understanding their source is crucial to increase precision of clinical NGS applications. In this work, we studied the relation between recurrent biases in allele balance (AB), systematic errors and false positive variant calls across a large cohort of human samples analyzed by whole exome sequencing (WES). We have modeled the allele balance distribution for biallelic genotypes in 987 WES samples in order to identify positions recurrently deviating significantly from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we have developed a genotype callability score based on ABB for all positions of the human exome, which detects false positive variant calls that passed state-of-the-art filters. Finally, we demonstrate the use of ABB for detection of false associations proposed by rare variant association studies (RVAS).
Study
EGAS00001003027
Stratifying and Targeting Pediatric Medulloblastoma through Genomics
In this project, genomic analyses of pediatric medulloblastoma samples, obtained through the international medulloblastoma consortium, will be performed. RNA and miRNA expression profiles of 1000 samples, representing all four subgroups (Wnt, Shh, Group C, and D), will be studied to identify novel subtypes within each subgroup. The resulting subtype-specific expression profiles will support the development of reliable and robust biomarkers to more accurately and reliably classify medulloblastomas for treatment in clinical trials. For that purpose, two assays will be developed: an antibody-based immunohistochemical assay and an orthogonal nucleic acid-based hybridization assay.
Additional genomic DNA analysis of the 300 high risk subgroup cases will support the discovery of subgroup specific somatic mutations in order to inform current clinical trials of targeted therapies, and to identify genes and pathways already targeted in other diseases. Such therapies could be rapidly transitioned to Phase II trials in medulloblastoma. Furthermore, the discovery of somatic mutations could be used for developing as well as validating specific biomarkers.
The project team will also try to identify risk factors that predispose children to this type of cancer.
Study
EGAS00001000273
Host whole genome variations are associated with neurocognitive outcome in survivors of pediatric medulloblastoma
Host whole genome analysis is a promising source of predictive information for long-term morbidity in cancer survivors. However, studies on genetic predictors of long-term outcome, particularly neurocognitive function following chemoradiation in pediatric oncology are limited. In the present study, we evaluated variations in host whole genome single nucleotide polymorphisms (SNPs) and their association with cognitive outcome. Whole-genome SNP analysis of host peripheral blood was conducted on 22 medulloblastoma long-term survivors, of whom 18 completed neuropsychological testing. First, unsupervised consensus clustering of the most variable SNPs within 409 genes involved in DNA repair was performed. Discrete variant groups were identified, although they were not associated with cognitive outcome, suggesting that variations in genes corresponding to a single functional group may be insufficient to predict long-term outcome alone. In support of this interpretation, unsupervised hierarchical clustering analysis using disease-associated gene variants by cognitive impairment status yielded two distinct variant clusters comprised of 36 variants, 34 of which were in noncoding regions. These findings illustrate for the first time that cognitively impaired survivors have a distinct variant profile compared to other medulloblastoma survivors. Future research in larger cohorts is needed to validate host genome predictors of cognitive impairment that may impact clinical management.
Study
EGAS00001002996
The PEMDAC phase 2 study of pembrolizumab and entinostat in patients with metastatic uveal melanoma
Patients with metastatic uveal melanoma (UM) have an abysmal prognosis. Preclinical studies have suggested that epigenetic therapy could enhance immunogenicity of cancer cells. Here we test if epigenetic therapy would enhance PD-1 immunotherapy in patients with metastatic UM. We report the results of the PEMDAC phase 2 clinical trial (n=29; NCT02697630) where the HDAC inhibitor entinostat was combined with the PD-1 inhibitor pembrolizumab in patients with metastatic UM. The primary endpoint was objective response rate (ORR), and was met with an ORR of 14%. The clinical benefit rate at 18 weeks was 28%, median progression free survival was 2.1 months and the median overall survival was 13.4 months. Toxicities were manageable, and there were no treatment-related deaths. Extensive genomics studies were performed using DNA/RNA and single cell sequencing and flow cytometry. Objective responses and prolonged survival were seen in three patients with BAP1 wildtype tumors, and in one patient with an iris melanoma that exhibited a UV signature. Longer survival also correlated with low baseline ctDNA levels or LDH. In conclusion, HDAC inhibition and anti-PD1 immunotherapy results in durable responses in a subset of patients with metastatic UM. Further exploration of combined immunotherapy and epigenetic therapy in metastatic UM is warranted.
Study
EGAS00001005478
Assessing gene expression profiling from FFPE Patient Samples: A Comparison of Two Library Preparation Approaches and Recommendations
Next-Generation Sequencing (NGS) has transformed cancer research and clinical practice,
with Whole Exome Sequencing (WES) driving advances in mutational profiling and
personalized oncology. Yet, transcriptomic signatures remain essential for understanding
disease mechanisms, including therapy resistance pathways. RNA sequencing (RNA-seq),
however, faces unique challenges when dealing with low-input or degraded RNA, as often
found in archival formalin-fixed paraffin-embedded (FFPE) tissues. Although previous
studies have compared library preparation protocols, rapidly evolving technologies call for
updated evaluations. Here, we present a direct comparison of two FFPE-compatible stranded
RNA-seq library preparation kits: TaKaRa™ SMARTer® Stranded Total RNA-Seq Kit v2
(Kit A) and Illumina Stranded Total RNA Prep Ligation with Ribo-Zero Plus (Kit B). Both
kits generated high-quality RNA-seq data, yet important differences emerged. Notably, Kit A
achieved comparable gene expression quantification to Kit B while requiring 20-fold less
RNA input, a crucial advantage for limited samples, albeit with increased sequencing depth.
We critically discuss these results in relation to RNA availability, technical performance,
cost-effectiveness, processing time, and automation potential, offering practical guidance for
selecting optimal RNA-seq strategies in clinical and translational research settings
Study
EGAS50000001066
Single-cell targeted DNA-sequencing and protein sequencing
We performed single-cell targeted DNA sequencing (MissionBio) on 13 high hyperdiploid B-ALL patients at diagnosis to study the genetic heterogeneity. For a number of patients single-cell surface protein antibody sequencing was added (DAB-seq) to better characterize the different cell types. Furthermore, additional samples during treatment were collected and analysed for some patients. In this way we want to study the clonal heterogeneity of these B-ALL samples, based both on mutations and CNVs, and identify resistant clones that could be targeted after relapse.
Study
EGAS50000000580
Clonal evolution of metastatic colorectal cancer under anticancer therapies
To investigate the clonal evolution of metastatic CRC at the single-cell level, we performed WGS in 58 clonal organoids and 18 fresh-frozen (FT) bulk-tissue samples from surgically resected primary and metastatic tumors before and after anticancer therapies in 6 patients. This approach enabled detailed phylogenetic reconstruction of individual clones. We discovered the timing and burden of treatment-related mutations as well as the heterogeneous evolution in driver mutations and genomic rearrangements in late-stage clonal evolution under anticancer therapies in metastatic CRC.
Study
EGAS50000001023
The_Causes_of_Clonal_Blood_Cell_Disorders_Study___SCOR_Custom
We took a bone marrow aspirate and peripheral blood samples from a healthy patient aged around 60, and use flow cytometry to isolate 100 HSCs, 50 MEPs, and 50 GMPs. We grew these up into colonies, then whole genome sequenced each colony. Somatic mutations act as a unique barcode for each clone. We have designed a panel for targeted resequencing of the mutations that we find. We are now looking for these mutations in the peripheral blood, to see the dynamics of how HSCs contribute to the peripheral blood in health.
Study
EGAS00001002257
Structural variant analysis of homologous recombination-deficient genomes
Homologous recombination (HR) deficiency causes DNA breaks and cytogenetic aberrations. Paradoxically, the types of DNA rearrangements specifically associated with HR-deficient cancers only minimally impact chromosomal structure. Addressing this, we combined a genome graph analysis of short-read whole genome sequencing (WGS) profiles across thousands of tumors with deep linked-read (LR) WGS of 46 BRCA1 or BRCA2 mutant breast cancers to discover a distinct class of HR deficiency-enriched rearrangements called reciprocal pairs. LR WGS showed that reciprocal pairs with identical rearrangement orientations gave rise to one of two distinct chromosomal outcomes, distinguishable only with long molecule data. While one (cis) outcome corresponded to the copy and pasting of a small segment to a distant site, a second (trans) outcome was a quasi-balanced translocation or multi-megabase inversion with substantial (10kb) duplications at each junction. The full spectrum of reciprocal pair outcomes could be explained by an HR-independent replication restart repair mechanism. LR WGS additionally identified single-strand annealing (SSA) as a BRCA2-deficiency specific repair pathway in human cancers. Replication restart- and SSA-associated SVs improved BRCA1- vs. BRCA2- deficiency classification and identified metastatic cancer cases with favorable chemotherapy responses. Our data reveal classes of BRCA1- and BRCA2-deficiency specific rearrangements as drivers of cytogenetic aberrations in HR deficient cells.
Study
EGAS00001007186
Molecular analysis of post-colonoscopy CRC (PCCRC)
Background: Post-colonoscopy colorectal cancers (PCCRCs, i.e. CRC diagnosed after a colonoscopy that excluded cancer) pose a challenge in clinical practice. In a retrospective cohort study, we compared clinical and molecular features of PCCRCs and prevalent CRCs.
Methods: PCCRCs were defined according to the WEO classification, as cancers occurring within 10 years after a complete index colonoscopy, which excluded CRC. Whole genome chromosomal copy number changes and mutation status of genes commonly affected in CRC (APC, KRAS, BRAF, FBXW7, PIK3CA, NRAS, SMAD4 and TP53) were examined by low-coverage WGS and targeted sequencing, respectively. MSI and CIMP status were also determined.
Results: In total, 122 PCCRCs and 98 prevalent CRCs were examined. PCCRCs are more often located proximally in the colon (p<0.001), non-polypoid appearing (p=0.004), early stage (p=0.009), and poorly differentiated (p=0.006) compared to prevalent CRCs. PCCRCs contain less frequently 18q deletions (p<0.001) than prevalent CRCs. MSI (p=0.029) and CIMP (p=0.014) are more frequently observed in PCCRCs.
Conclusion: Although no PCCRC specific pathway could be defined, PCCRCs’ molecular features are more associated with MSI and hypermethylation pathways than canonical CIN pathway. The clinical and molecular features observed in PCCRCs support the hypothesis that sessile serrated lesions and non-polypoid CRNs are contributors to the development of these cancers."
Study
EGAS00001004686
Whole-Genome and Epigenomic Landscapes of Etiologically Distinct Subtypes of Cholangiocarcinoma
Cholangiocarcinoma (CCA) is a hepatobiliary malignancy exhibiting high incidence in countries with endemic liver-fluke infection. We analyzed 489 CCAs from 10 countries, combining whole-genome (71 cases), targeted/exome, copy-number, gene expression, and DNA methylation information. Integrative clustering defined 4 CCA clusters-fluke-positive CCAs (clusters 1/2) are enriched in ERBB2 amplifications and TP53 mutations; conversely, fluke-negative CCAs (clusters 3/4) exhibit high copy-number alterations and PD-1/PD-L2 expression, or epigenetic mutations (IDH1/2, BAP1) and FGFR/PRKA-related gene rearrangements. Whole-genome analysis highlighted FGFR2 3' untranslated region deletion as a mechanism of FGFR2 upregulation. Integration of noncoding promoter mutations with protein-DNA binding profiles demonstrates pervasive modulation of H3K27me3-associated sites in CCA. Clusters 1 and 4 exhibit distinct DNA hypermethylation patterns targeting either CpG islands or shores-mutation signature and subclonality analysis suggests that these reflect different mutational pathways. Our results exemplify how genetics, epigenetics, and environmental carcinogens can interplay across different geographies to generate distinct molecular subtypes of cancer.SIGNIFICANCE: Integrated whole-genome and epigenomic analysis of CCA on an international scale identifies new CCA driver genes, noncoding promoter mutations, and structural variants. CCA molecular landscapes differ radically by etiology, underscoring how distinct cancer subtypes in the same organ may arise through different extrinsic and intrinsic carcinogenic processes.
Study
EGAS00001001653
Analysis_of_genomic_integrity_of_disease_corrected_human_induced_pluripotent_stem_cells_by_exome_sequencing
Induced pluripotent stem (iPS) cells hold great promise for autologous cell transplantation. In order to apply this therapy to monogenic disorder, disease-causing mutations must be corrected prior to transplantation. We generated iPS cells from patients with alpha-1 antitrypsin deficiency, which is caused by homozygous non-synonymous mutation in the SERPINA1 gene. In vitro differentiation of patient-derived iPS cells to hepatocytes showed recapitulation of the disease phenotype. We corrected the mutations by gene targeting using zinc finger nuclease (ZFN) and piggyBac transposon technologies. Our mutation correction is highly accurate. There is no exogenous sequence left behind at the targeted site. The corrected iPS cell-derived hepatocytes were indistinguishable from healthy hepatocyte, suggesting that the corrected gene was fully functional. The genomic integrity was also analyzed by CGH and SNP array, showing no genomic alterations. These are important points to ensure clinical safety as well as functional restoration of the corrected gene. One remaining question is if these corrected iPS cells are clinically safe at the single base-pair level. Recent publications suggested that cancer-related mutations are significantly accumulated during iPS cell propagation. We aim to assess the clinical safety of the corrected iPS cells by using exome sequencing. Outcome from this project would bring iPS cell technology one step closer to clinical applications.
Study
EGAS00001000055
Genomic gain of EBV's LMP-1 in NKTCL
The study found a high recurrence of genomic gain of LMP1 locus within EBV-associated NKTCL. The pilot data set and extended dataset of 77 WGS NKTCLs found 18/77 tumoral genomes to harbour genomic gain of LMP-1 locus within the EBV viral genomes within NKTCL tumoral sequencing data. The study also found 1/10 NKTCL cell lines to be positive for LMP-1 locus gain within EBV genomes too. This dataset includes solely the confirmatory WGS data of LMP-1 gain from NKYS cell lines using Oxford Nanopore long-read sequencing technology.
Study
EGAS50000000260
Targeted capture, whole genome sequencing, and RNAseq to identify rearrangements in B-cell lymphomas
This study contains whole genome and custom targeted capture sequencing of mature B-cell lymphomas (DLBCL, FL, BL, HGBCL-DH-BCL2, HGBCL-DH-BCL6) to identify translocation breakpoints of common oncogene rearrangements (MYC, BCL2, BCL6). It is complemented by RNAseq data where available. Complete details are available in the publication Hilton et al, 2024, Blood.
Capture sequencing: 357 samples; 364 unique libraries; cram file format aligned to hg38
Whole genome sequencing: 12 samples; 12 unique libraries; cram file format aligned to grch37
RNAseq: 257 samples; 257 unique libraries; fastq file format
Study
EGAS50000000328
The_Causes_of_Clonal_Blood_Cell_Disorders_Study___SCOR
We will take a bone marrow aspirate and peripheral blood samples from a healthy patient aged around 60, and use flow cytometry to isolate 100 HSCs, 50 MEPs, and 50 GMPs. We will grow these up into colonies, then whole genome sequence each colony. Somatic mutations will act as a unique barcode for each clone. We will then design a panel for targeted resequencing of the mutations that we find. It will then be possible to look for these mutations in the peripheral blood over several years, to see the dynamics of how HSCs contribute to the peripheral blood in health.
Study
EGAS00001002132
WXS of 147 lung cancer patients treated with immunotherapy
We first attempted to predict MHC-binding neoantigens at high accuracy with convolutional neural networks. This prediction model outperformed previous methods in > 70% of test cases. Importantly, our method remarkably increased the predictive value of neoantigen load especially in combination with known resistance parameters. We then developed a classifier that can predict resistance from point mutations that are deleterious to protein function. Notably, genes involved in the adaptive immune response, cytokine signaling,and EGFR signaling held high explanatory power. Furthermore, when integrated with our neoantigen profiling, these anti-immunogenic mutations revealed significantly higher predictive power than known resistance factors.
Study
EGAS00001003781
Unraveling metastatic progression of breast cancer
To reconstruct the trajectories of BC progression, we performed exome sequencing (Illumina HiSeq2000, Truseq DNA sample preparation kit v2 and Exome Enrichment Kit v2; alignement done with BWA, substitutions and indels called with GATK and DINDEL respectively) coupled to validation by Sequenom and deep re-sequencing (ongoing), as well as copy number (CN) analyses (Oncoscan, Affymetrix) on DNA from matched primary (n=1-6/pt), axillary lymph node (ALN, n=1-6 for 4 pts), local recurrence (for 1 pt) and distant metastatic (n=1-5/pt) FFPE samples obtained from an autopsy series of 10 BC pts.
Study
EGAS00001000760
NRG1 Fusions in KRAS Wild-type Pancreatic Cancer (H021)
We used whole-genome and transcriptome sequencing to identify clinically actionable gene fusions in young adults with KRAS wild-type (KRASwt) pancreatic ductal adenocarcinoma (PDAC). These alterations included recurrent NRG1 rearrangements predicted to drive PDAC development through aberrant ERBB receptor-mediated signaling, and pharmacologic ERBB inhibition resulted in clinical improvement and remission of liver metastases in two patients with NRG1-rearranged tumors that had proved resistant to standard treatment. Our findings demonstrate that systematic screening of KRASwt tumors for oncogenic fusion genes will substantially improve the therapeutic prospects for a sizeable fraction of PDAC patients.
Study
EGAS00001002759
NanoSring of PBMC from bladder cancer and RCC patients
PBMCs from patients were isolated using 50 mL Leucosep™ tubes (Greiner Bio-One International, Germany) and Ficoll-Paque™ PLUS (GE Healthcare, Sweden). Whole blood drawn into sodium heparin blood collection tubes were diluted 3x with phosphate-buffered saline (PBS) without calcium or magnesium (Lonza, Walkersville, MD). Diluted cell suspensions were carefully layered on Leucosep tubes and centrifuged for 15 minutes at 800 x g at room temperature (RT). Interphase containing PBMCs were harvested and washed with PBS and subsequently centrifuged for 10 minutes at 250 x g at RT before further processing.
Study
EGAS00001004229
The Medical Genome Reference Bank: a whole genome data resource of 4,000 healthy elderly individuals.
Allele frequency data from human reference populations is of increasing value for filtering and assignment of pathogenicity to genetic variants. Aged and healthy populations are more likely to be selectively depleted of pathogenic alleles, and therefore particularly suitable as a reference populations for the major diseases of clinical and public health importance. However, reference studies of the healthy elderly have remained under represented in human genetics. We have developed the Medical Genome Reference Bank (MGRB), a large scale comprehensive whole genome dataset of confirmed healthy elderly individuals, to provide a publicly accessible resource for health related research, and for clinical genetics. It also represents a useful resource for studying the genetics of healthy aging. The MGRB comprises 4,000 healthy, older individuals with no reported history of cancer, cardiovascular disease or dementia, recruited from two Australian community based cohorts. DNA derived from blood samples will be subject to whole genome sequencing. The MGRB will measure genome wide genetic variation in 4,000 individuals, mostly of European decent, aged 60 to 95 years (mean age 75 years). The MGRB has committed to a policy of data sharing, employing a hierarchical data management system to maintain participant privacy and confidentiality, whilst maximizing research and clinical usage of the database. The MGRB will represent a dataset of international significance, broadly accessible to the clinical and genetic research community.
Study
EGAS00001003511
Preclinical Pediatric Molecular Analysis for Therapy Choice (MATCH)
Purpose: Pediatric solid tumors arise from diverse tissues during development and exhibit a wide range of molecular, cellular and genetic features. This diversity, combined with the low incidence of pediatric cancer makes it increasingly difficult to personalize therapy for individual patients based on the unique features of their tumors. Therefore, well-credentialed preclinical models that capture the diversity and heterogeneity of pediatric solid tumors are essential for identifying molecular targeted therapeutics for precision medicine.
Experimental Design: Here, we report 281 orthotopic patient derived xenografts (O-PDXs) from 224 patients representing 24 different types of pediatric solid tumors. We have performed genomic characterization of the O-PDXs and compared them to their corresponding patient tumors. To demonstrate the feasibility and utility of using such a diverse collection of O-PDXs in preclinical studies, we performed a preclinical pediatric precision medicine trial based on the NCI-COG Pediatric MATCH trial enrollment criteria. We also tested molecular targeted therapy for a novel oncogenic fusion recently reported in pediatric melanoma and precision drug delivery using nano-liposomal irinotecan.
Results: In the preclinical match study, we identified one specific single agent response to LY3023414 in an osteosarcoma model with a TSC2 variant of unknown significance. Combinations of nano-liposomal irinotecan were effective in treating both Ewing sarcoma and rhabdomyosarcoma xenografts.
Conclusion: Our studies demonstrate the value of large, well-credentialed preclinical models for future precision medicine in pediatric oncology using single agents, drug combinations and novel drug formulations.
Study
EGAS00001008011
HSC_population_dynamics___KX007_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001004193
HSC_population_dynamics_CBD_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001003091
HSC_population_dynamics___PX001_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001004146
HSC_population_dynamics___KX009_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001004580
HSC_population_dynamics___KX010_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001004581
WES in muscle-invasive bladder cancer (MIBC) treated with durvalumab plus olaparib in the neoadjuvant setting: NEODURVARIB trial
This submission contains the metadata derived from the whole exome sequencing of 70 samples from 26 patients who developed advanced urothelial carcinomas. The patients were enrolled in the Neodurvarib clinical trial, which compares the molecular profiles at diagnosis and after neoadjuvant treatment with Durvalumab and Olaparib. In this regard, the analyzed samples include: normal tissue and tumoral tissue obtained from the transurethral resection of the bladder (TURBT, a procedure performed prior to neoadjuvant therapy) from the 26 patients (52 samples), as well as tissue obtained from the radical cystectomy (a procedure performed after treatment with the aforementioned drugs; 18 samples).
Study
EGAS50000000791
Diversity of U1 small nuclear RNAs and Evaluation of Diagnostic Methods for their Mutations
U1 small nuclear RNA (snRNA) mutations are recurrent non-coding alterations found in various malignancies, including medulloblastoma, yet their identification has proven challenging due to their repetitive nature. To establish a robust DNA-based, and RNA-based method for detecting U1 snRNA mutations, we performed targeted DNA sequencing using our original gene panel for medulloblastoma and RNA-sequencing. As an RNA-based method, we developed a random forest classifier employing splicing patterns which were massively altered as a result of U1 snRNA mutations. By using both methods, we successfully detected U1 snRNA mutation in five DNA samples and four RNA samples.
Study
EGAS50000000693
HSC_population_dynamics_KSP_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001002762
WGS___Exploration_of__mutational_processes_in_human_cancer_cell_lines
Approximately 25 cell lines representative of 11 mutation signatures have been selected and will be grown from a single cell, referred to as parental clone, for a period of 3 months to allow accumulation of mutations. Following that period, we will obtain 2 subclones from each of the parental clone for analysis by exome-sequencing . In total we will therefore be sequencing 2 parental clones and 4 subclones per cell line for approximately 25 cell lines. Once the data has been analysed and the potential ongoing mutation processes determined, we will perform whole-genome sequencing of the parental clones and subclones of the candidate cell lines.
Study
EGAS00001002680
Exome-wide analysis identifies three low-frequency missense variants associated with pancreatic cancer risk in Chinese populations
Germline coding variants have not been systematically investigated for pancreatic ductal adenocarcinoma (PDAC). Here, we report an exome-wide investigation using the Illumina Human Exome Beadchip with 943 PDAC cases and 3,908 controls in the Chinese population, followed by two independent replicate samples including 2,142 cases and 4,697 controls. We identify three low-frequency missense variants associated with the PDAC risk: rs34309238 in PKN1 (OR = 1.77, 95% CI: 1.48-2.12, P = 5.35 X 10-10), rs2242241 in DOK2 (OR = 1.85, 95% CI: 1.50-2.27, P = 4.34 X 10-9) and rs183117027 in APOB (OR = 2.34, 95% CI: 1.72-3.16, P = 4.21 X 10-8).
Study
EGAS00001003040
HSC_population_dynamics___CB001_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001003397
HSC_population_dynamics___LEUK4_samples
My research project aims to use the clonal dynamics of spontaneously occurring somatic mutations to answer fundamental questions about human haematopoietic stem cell (HSC) biology.
The four major questions I will address are:
1. How do age and aging affect normal human HSC dynamics in vivo?
2. How do in vivo perturbations, particularly chemotherapy and increased levels of reactive oxygen species, affect HSC population dynamics?
3. Is response to in vitro perturbation heritable and/or correlated with other features such as age of individual and contribution of the lineage to peripheral blood?
4. How are HSC dynamics altered in people with early driver mutations (clonal haematopoiesis)?
Study
EGAS00001004247
Diet driven microbial ecology underpins associations between cancer immunotherapy outcomes and the gut microbiome
This study profiled baseline gut (fecal) microbiota signatures and dietary patterns of 103 trial patients from Australia and the Netherlands treated with neoadjuvant ICIs for high risk resectable metastatic melanoma and performed an integrated analysis with data from 115 patients with melanoma treated with ICIs in the United States. We observed geographically distinct microbial signatures of response and immune-related adverse events. Overall, response rates were higher in Ruminococcaceae-dominated microbiomes than in Bacteroidaceae-dominated microbiomes. Poor response was associated with lower fiber and omega 3 fatty acid consumption and elevated levels of C-reactive protein in the peripheral circulation at baseline.
Study
EGAS00001006982
Quantification of chromosomal copy number aberrations by shallow whole-genome sequencing
Detection of DNA copy number aberrations by shallow whole-genome sequencing (WGS) faces many challenges including lack of completion and errors in the human reference genome, repetitive sequences, polymorphisms, variable sample quality, and biases in the sequencing procedures.Formalin-fixed paraffin-embedded (FFPE) archival material, the analysis of which is important for studies of cancer, presents particular analytical difficulties due to degradation of the DNA and frequent lack of matched reference samples. We present a robust, cost-effective WGS method for DNA copy number analysis that addresses these challenges more successfully than currently available procedures. In practice very useful profiles can be obtained with 0.1 fold genome coverage. We improve on previous methods by; first, implementing a combined correction for sequence mappability and GC content, and second, applying this procedure to sequence data from the 1000 Genomes Project in order to develop a blacklist of problematic genome regions. A small subset of these blacklisted regions were previously identified by ENCODE, but the vast majority are novel unappreciated problematic regions. Our procedures are implemented in a pipeline called QDNAseq. We have analyzed over 1,000 samples, most of which were obtained from the fixed tissue archives of over 25 institutions.We demonstrate that for most samples our sequencing and analysis procedures yield genome profiles with noise levels near the statistical limit imposed by read counting. The described procedures also provide better correction of artifacts introduced by low DNA quality than prior approaches, and better copy number data than high-resolution microarrays at substantially lower cost.
Study
EGAS00001000642
Healthspan and lifespan extension by fecal microbiota transplantation in progeroid mice
The gut microbiome is emerging as a key regulator of several metabolic, immune and neuroendocrine pathways. Gut microbiome deregulation has been implicated in major conditions such as obesity, type 2 diabetes, cardiovascular disease, non-alcoholic fatty acid liver disease or cancer, but its precise role in aging remains to be elucidated. Here, we characterize the gut microbiome profile of accelerated aging and show that its external modulation is sufficient to extend healthspan and lifespan. We found that two different mouse models of progeria present with intestinal dysbiosis, which is characterized, among other alterations, by an increment in proteobacteria, cyanobacteria and a loss in verrucomicrobia, whereas long-lived humans (i.e., centenarians) exhibit a remarkable increase in verrucomicrobia and a reduction in proteobacteria. Fecal microbiota transplantation proved to be effective to enhance healthspan and lifespan in both progeroid mouse models and, more importantly, the solely transplantation with the verrucomicrobia Akkermansia muciniphila was sufficient to exert beneficial effects. Our results demonstrate that intestinal dysbiosis is a phenotype associated with accelerated aging and its correction provides health benefits. Moreover, metabolomic analysis of ileal content points to the restoration of secondary bile acids as a possible mechanism for the beneficial outcome of reestablishing a healthy microbiome. Our results suggest the existence of a link between gut aging and the microbiota, and can help to gain insight into the rationale for microbiome-based interventions against age-related diseases. Mouse sequence data have been deposited in ENA (https://www.ebi.ac.uk/ena) under accession number PRJEB34214
Study
EGAS00001003656
Human genomic and phenotypic synthetic data for the study of rare diseases
The purpose of this dataset is to facilitate development of technical implementations for rare disease data integration, analysis, discovery, and federated access.
This synthetic dataset includes clinical and genomic data from 6 rare disease cases. It consists of 18 whole genomes (6 index cases with their parents) which have genetic background based on public human data sequenced in the context of the Illumina Platinum initiative (Eberle, MA et al. (2017)) and made available by the HapMap project (https://www.genome.gov/10001688/international-hapmap-project). In each of the cases, real causative variants correlating with the phenotypic data provided were spiked-in.
The cases included in this synthetic dataset correspond to the following type of disorders:
CASE 1- Congenital myasthenic syndrome (Autosomal Dominant -de novo variant)
CASE 2- Macular dystrophy (Autosomal Dominant)
CASE 3- Muscular dystrophy (Autosomal Recessive-compound heterozygous variants)
CASE 4- Mitochondrial disorder (Autosomal Recessive-consanguineous case - homozygous variant)
CASE 5- Breast cancer (Autosomal Dominant)
CASE 6- Similar as case 1 for patient matchmaking tests: Congenital myasthenic syndrome (Autosomal Dominant-de novo variant)
For each case you will be able to download the following data: clinical information (phenopackets per individual and pedigree per family), raw genomic data (FASTQ and BAMs) and processed genomic data (vcfs).
When using the data, the following should be acknowledged: the RD-Connect GPAP (https://platform.rd-connect.eu/), EC H2020 project EJP-RD (grant # 825575), EC H2020 project B1MG (grant # 951724) and Generalitat de Catalunya VEIS project (grant # 001-P-001647).
Study
EGAS00001005702
Lipomatous tumors with 12q amplification
The project concerns whole genome sequencing (short-read and long-read) and RNA-sequencing of lipomatous tumors with 12q-amplification. A subset of lipomatous tumors is driven by amplification of genes mapping to chromosome arm 12q, including the MDM2 gene. The goals of the study were to compare expression levels of genes mapping to 12q in tumors with amplification in rod-shaped or circularized chromosomes as well as to assess and compare the structural variants in those tumors. In total, 20 samples were analyzed, and the data were correlated with genomic data on bulk and single cell DNA from the same tumors. The fastq files from the tumors were uploaded to EGA.
Study
EGAS50000000062
Integrated Single-Nucleus and Spatial Transcriptomics Elucidate Heterogeneity and Hypoxia-Driven Organization of Supratentorial Ependymoma
Supratentorial ependymomas are often life-threatening brain tumours that are characterised by significant molecular heterogeneity. In this study, single-nucleus (n = 63) and spatial transcriptomics (n = 30) was performed to delineate the cellular and spatial landscape of supratentorial ependymoma. This study characterises the transcriptional consensus programs of this entity and connects them to clinical outcome. Two previously undescribed programs – associated with the remodelling of the extracellular matrix and the ZFTA fusion identity – are shown to be related to highly unfavourable outcome and to be exclusive to ZFTA fusion-positive tumours. Also, Hypoxia is proposed as the potential driver of spatial organisation, which may affect treatment success.
Study
EGAS50000001318
Guardians_of_the_genome__protecting_DNA_from_endogenous_sources_of_damage_
Analysis of mutational signatures caused by DNA repair defects in human induced pluripotent stem (iPS) cells. A reference human iPS cell line will be used for genetic manipulation to introduce homozygous knockouts of 100 genes known to be involved in or connected to DNA repair or DNA editing. Following a defined period of growth after homozygous knockout of each gene, sub clones will be generated and sequenced. The progenitor “parental” IPS cell line will be used to generate reference sequence data, in order to determine the mutational signature acquired due to the gene knockout. This is a pilot study to investigate the effects of oxygen conditions and growth period on mutations acquired
Study
EGAS00001000874
Genomic and transcriptomic analysis of thymic epithelial tumors
Thymic epithelial tumors (TETs), comprising various histologic types of thymomas and thymic carcinomas, originate from thymic epithelial cells. Each histologic type is typically associated with a distinct immune cell composition and clinical manifestation. A better understanding of the cellular origins and molecular pathways underlying this heterogeneity is needed to improve patient stratification and treatment. Here, we conducted an integrated genomic and transcriptomic analysis of 124 thymomas and 13 thymic carcinomas, including 20 newly sequenced cases combined with 117 cases from publicly available datasets. This approach stratified TETs into three subgroups with different possible origins: GTF2I-mutant (GTF2I-type) thymomas, copy number-altered (CN-type) thymomas, and thymic carcinomas.
Study
EGAS00001004227
Molecular_diagnosis_of_albinism
Albinism is genetically heterogeneous rare genetic condition affecting 1:17000 in the Western world (but more frequent in Africa) whose main feature is a profound visual impairment, characterised by foveal hypoplasia, abnormal chiasmatic connections, nystagmus and photofobia. All these features result in severly altered visual acuity (<0,1), absent depth perception and poor night vision. People with albinism are primarily visually handicapped. In addition, for some types of albinism, the visual phenotype can be presented with partial or total hypopigmentation, hence resulting in a secondary phenotype which can lead to skin cancer if skin is not adequately protected. Recently a new syndrome has been described, FHONDA, with the same visual abnormalities of albinism but without pigment alteration. The traditional classification differentiates Oculoculatenous albinism (OCA), where hypopigmentation involves hair, skin and eyes versus Ocular Albinism (OA), where hypopigmentation only affects the eyes. These are non-sydrimic types of albinism. Some syndromic forms (Hermansky-Pudlak=HPS, Chediak-Higashi=CHS) affect cells beyond pigment cells, present in the lungs, immune system, platelets and intestines, resulting in more severe phenotypes that can be fatal. Mutations in at least 19 genes are assocaited with the corresponding types of albinism. Most hospitals will only diagnose the most frequent cases using traditional Sanger, MLPA approaches. Some will use CGH arrays. We aim to diagnose all cases of albinism through the Albinochip proposal, which combines a Sequenom first step of known mutations combined with subsequent NGS approaches. In some cases we fail to find a second mutation, these are good candidates for further full exome analyses.
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Study
EGAS00001002068
Plasma DNA aberrations in systemic lupus erythematosus revealed by genomic and methylomic sequencing
We performed a high-resolution analysis of the biological characteristics of plasma DNA in systemic lupus erythematosus (SLE) patients using massively parallel genomic and methylomic sequencing. A number of plasma DNA abnormalities were found. First, aberrations in measured genomic representations (MGRs) were identified in the plasma DNA of SLE patients. The extent of the aberrations in MGRs correlated with anti-double–stranded DNA (anti-dsDNA) antibody level. Second, the plasma DNA of active SLE patients exhibited skewed molecular size-distribution profiles with a significantly increased proportion of short DNA fragments. The extent of plasma DNA shortening in SLE patients correlated with the SLE disease activity index (SLEDAI) and anti-dsDNA antibody level. Third, the plasma DNA of active SLE patients showed decreased methylation densities. The extent of hypomethylation correlated with SLEDAI and anti-dsDNA antibody level. To explore the impact of anti-dsDNA antibody on plasma DNA in SLE, a column-based protein G capture approach was used to fractionate the IgG-bound and non–IgG-bound DNA in plasma. Compared with healthy individuals, SLE patients had higher concentrations of IgG-bound DNA in plasma. More IgG binding occurs at genomic locations showing increased MGRs. Furthermore, the IgG-bound plasma DNA was shorter in size and more hypomethylated than the non–IgG-bound plasma DNA. These observations have enhanced our understanding of the spectrum of plasma DNA aberrations in SLE and may provide new molecular markers for SLE. Our results also suggest that caution should be exercised when interpreting plasma DNA-based noninvasive prenatal testing and cancer testing conducted for SLE patients.
Study
EGAS00001000962
Ischemic stroke in a Swedish case-control study.
Background. Genetic risk scores (GRS), summing up the total effect of several single nucleotide polymorphisms (SNPs) in genes associated with either coronary risk or cardiovascular risk factors, have been tested for association with ischemic stroke with conflicting results. Recently an association was found between a GRS, based on 29 SNPs discovered by genome-wide association studies (GWAS) and hypertension. The aim of our study was to investigate the possible association of the same GRS with ischemic stroke on top of other “traditional risk factors”, also testing its potential improvement in indices of discrimination and reclassification, in a Swedish case-control study. Methods and results. Twenty-nine SNPs were genotyped in 3,677 stroke cases and 2,415 controls included in the Lund Stroke Register (LSR), the Malmö Diet and Cancer (MDC) study and the Sahlgrenska Academy Study on Ischemic Stroke (SAHLSIS). The analysis was conducted in the entire material, and separately for the three samples. After adjustment for hypertension, diabetes mellitus and smoking habits, the GRS was associated with ischemic stroke in the entire material [OR (95% CI) 1.086 (1.029-1.147) per SD increase in the GRS p=0.003] with similar trends in all three samples; LSR [1.050 (0.967-1.140)]; p=0.25], MDC [1.168 (1.060-1.288); p=0.002] and SAHLSIS [1.124 (0.997-1.267); P=0.055]. Measures of discrimination and reclassification improved marginally using the GRS. Conclusions. A blood pressure GRS is independently associated with ischemic stroke risk in three Swedish case-control studies, however, the effect size is low and adds marginally to prediction of stroke on top of traditional risk factors including hypertension.
Study
EGAS00001000936
Analysis of error profiles in deep next-generation sequencing data
BackgroundSequencing errors are key confounding factors for detecting low-frequency genetic variants that are important for cancer molecular diagnosis, treatment, and surveillance using deep next-generation sequencing (NGS). However, there is a lack of comprehensive understanding of errors introduced at various steps of a conventional NGS workflow, such as sample handling, library preparation, PCR enrichment, and sequencing. In this study, we use current NGS technology to systematically investigate these questions.ResultsBy evaluating read-specific error distributions, we discover that the substitution error rate can be computationally suppressed to 10-5 to 10-4, which is 10- to 100-fold lower than generally considered achievable (10-3) in the current literature. We then quantify substitution errors attributable to sample handling, library preparation, enrichment PCR, and sequencing by using multiple deep sequencing datasets. We find that error rates differ by nucleotide substitution types, ranging from 10-5 for A>C/T>G, C>A/G>T, and C>G/G>C changes to 10-4 for A>G/T>C changes. Furthermore, C>T/G>A errors exhibit strong sequence context dependency, sample-specific effects dominate elevated C>A/G>T errors, and target enrichment PCR led to aprox 6-fold increase of overall error rate. We also find that more than 70% of hotspot variants can be detected at 0.1% - 0.01% frequency with the current NGS technology by applying in-silico error suppression.ConclusionsWe present the first comprehensive analysis of sequencing error sources in conventional NGS workflows. The error profiles revealed by our study highlight new directions for further improving NGS analysis accuracy both experimentally and computationally, ultimately enhancing the precision of deep sequencing.
Study
EGAS00001003444
Recording physiological history of cells with chemical labeling.
Recordings of the physiological history of cells provide insights into biological processes, yet obtaining such recordings is a challenge. To address this, we introduce a method to record transient cellular events for later analysis. We designed proteins that become labeled in the presence of both a specific cellular activity and a fluorescent substrate. The recording period is set by the presence of the substrate, whereas the cellular activity controls the degree of the labeling. The use of distinguishable substrates enabled the recording of successive periods of activity. We recorded protein-protein interactions, G-protein-coupled receptor activation and elevations in intracellular calcium. Recordings of elevated calcium levels allowed selections of cells from heterogenous populations for transcriptomic analysis and tracking of neuronal activities in flies and zebrafish.
Study
EGAS50000000056
mRNA capture sequencing and RT-qPCR for the detection of pathognomonic, novel and secondary fusion transcripts in formalin-fixed paraffin-embedded tissue: a sarcoma showcase
RNA sequencing is widely used to study gene abundance patterns, but less frequently to assess structural alterations such as fusion transcripts. In this study, we assess the performance of messenger RNA (mRNA) capture sequencing to identify fusion transcripts in formalin-fixed paraffin-embedded (FFPE) tissue of different types of sarcoma, followed by orthogonal confirmation using RT-qPCR. To validate our workflow, we used the TruSight RNA Pan-Cancer Panel to analyze a first cohort of 6 positive control tumors for which the diagnostic workup demonstrated the presence of a specific chromosomal rearrangement. Fusion transcript calling by FusionCatcher confirmed all known aberrations and enabled the identification of both fusion gene partners and breakpoints. Next, whole transcriptome TruSeq RNA Exome sequencing was applied to a second cohort of 17 patients, diagnosed with fusion gene-negative alveolar rhabdomyosarcoma (ARMS) or undifferentiated round cell sarcoma (URCS), for whom fluorescence in situ hybridization (FISH) did not identify the classical pathognomonic rearrangements. Remarkably, for 6 out of 17 patients, mRNA capture sequencing readily detected a pathognomonic fusion transcript, i.e. PAX3-FOXO1 in 2 ARMS patients, and EWSR1-FLI1, EWSR1-ERG or EWSR1-NFATC2 in 4 URCS patients. These data show that RNA capture sequencing may enhance the detection rate of pathognomonic fusion genes in sarcoma. For the 11 patients with no known pathognomonic fusion transcript, 11 newly identified fusion transcripts were confirmed by RT-qPCR, including COPS3-TOM1L2, NCOA1-DTNB, WWTR1-LINC01986, PLAA-MOB3B, AP1B1-CHEK2 and BRD4-LEUTX fusion transcripts in ARMS patients. Additionally, recurrently detected secondary fusion transcripts in two patients diagnosed with EWSR1-NFATC2-positive sarcoma were confirmed (COPS4-TBC1D9, PICALM-SYTL2, SMG6-VPS53 and UBE2F-ALS2). Our results provide new insights into the underlying genetic causes of these malignancies.
Study
EGAS00001005202
A Comprehensive Catalogue of Somatic Mutations from a Human Cancer Genome
We have carried out complete sequencing of the genome of the human male maligant melanoma cell line COLO-829 using the Illumina Genome Analyzer II. We generated a sequencing library with a median insert size of ~200 bp following random fragmentation and gel fractionation of the genomic DNA. We sequenced 75 bases from both ends of these templates to cover the COLO-829 genome to an average depth of more than 40x. We have carried out purity-filtering (PF) to remove mixed reads, where two or more different template molecules are close enough on the surface of the flow-cell to form a mixed or overlapping cluster. No other filtering of the data has been carried out prior to submission. We have also submitted sequence data for a lymphoblastoid control cell line COLO-829BL from the same individual.
Study
EGAS00000000052
H3K27ac ChIP-seq in TMPRSS2:ERG positive and negative prostate cancer tissue samples
The TMPRSS2:ERG (T2E) structural rearrangements typifies ~50% of prostate tumors and results in overexpression of the ERG transcription factor. Using chromatin data collected in T2E and non-T2E primary prostate tumors, we show a distinct cis-regulatory landscape between T2E and non-T2E, inclusive of Cluster Of Regulatory Elements (COREs). This is mediated by ERG co-option of HOXB13 and FOXA1 implementing a T2E-specific transcriptional profile. We also report a T2E-specific CORE on the structurally rearranged ERG locus arising from spreading of the TMPRSS2 locus pre-existing CORE, assisting in its overexpression. Finally, we show that the T2E-specific cis-regulatory landscape reveals a vulnerability against the NOTCH pathway. Taken together, our work shows that overexpressed ERG co-opts master transcription factors to deploy a unique cis-regulatory landscape inducing a druggable dependency on NOTCH signaling in T2E prostate tumors.
Study
EGAS00001002496
Investigation_of_mutational_signatures_associated_with_DNMT3A_deficiency_
Mutational signatures have been shown to be attributable to specific genetic contexts, such as mutations in DNA repair genes. DNMT3A is a DNA methyltransferase that helps maintain the DNA methylation pattern in a site-specific manner and may participate in DNA repair or the stress response. We have identified an adult individual who is a germline mosaic for a DNMT3A mutation. We have obtained clonal lymphoblastoid cells (LCLs) from the subject representing both WT and mutant lines grown in the same individual for >50 years. These clones represent a unique opportunity to examine the mutational impact of the DNMT3A mutation in a well-controlled setting. Our goal is to perform WGS on whole blood, representing the pool, as well as several WT and several mutant clones, in order to investigate the contribution of DNMT3A to mutation rates and signatures.
Study
EGAS00001002329
Sequencing of pancreatic cancer primary tumors and metastases
The extent of heterogeneity of driver gene mutations present in naturally occurring metastases is largely unknown, i.e. treatment-naïve metastatic disease. To address this issue, 60x whole genome sequencing of 26 metastases from 4 patients was carried out. We found that the identical driver gene mutations were present in every metastatic lesion of each patient studied. Passenger gene mutations not known or predicted to have functional consequences accounted for all intratumoral heterogeneity. Even with respect to these passenger gene mutations, the genetic similarity among the founding cells of metastases was markedly higher than that expected for any two cells randomly taken from a normal tissue. The uniformity of driver gene mutations among metastases in the same patient has critical, encouraging implications for the success of future targeted therapies in advanced stage disease.
Study
EGAS00001002186
Autozygosity_pilot___Pakistani_from_Birmingham
The offspring of first cousin marriages have ~6% of their genome autozygous, i.e. homozygous identical by descent, or even more if there was further consanguinity in their ancestry. In the UK there are large populations with very high first cousin marriage rates of 50-80%. Sequencing the exomes of a sample of these individuals has the potential both to support genetic health programmes in these populations, and to provide genetic research information about rare loss of function mutations. This pilot study based on existing British-Pakistani cohort samples from Birmingham will identify homozygous individuals for almost all variants down to an allele frequency around 1%, plus individuals carrying hundreds of new homozygous rare loss-of-function variants, and will support development of community relations and ethics for a wider study currently being designed. The data deposited in the EGA consist of low coverage whole exome sequencing on these samples.
Study
EGAS00001000511
Single cell RNAseq of PBMC from bladder cancer patients
Frozen PBMC samples containing at least 1 million cells were thawed for 1 minute at 37C and washed twice with RPMI complete media (10% FBS with glutamate and Pen/Strep). All of the samples had >80% viable cells. Sample processing for single-cell RNA-seq was done using Chromium Single Cell 3’ Library and Gel bead kit v2 (PN-120237) following manufacturer’s user guide (CG00052, 10x Genomics, Pleasanton, CA). The total cell density was used to impute the volume of single cell suspension needed in the reverse transcription (RT) master mix, aiming to achieve ~ 6,000 cells per sample. cDNAs and libraries were prepared following manufacturer’s user guide (10x Genomics). cDNA amplification and indexed libraries were prepared using 12 and 14 cycles of PCR, respectively. Libraries were profiled, quantified, and sequenced as 5’ single-cell gene expression libraries.
Study
EGAS00001004008
Genomic and transcriptomic determinants of therapy resistance and immune landscape evolution during anti-EGFR treatment in colorectal cancer
Despite biomarker stratification, the anti-EGFR antibody cetuximab is only effective against a subgroup of colorectal cancers (CRC). This genomic and transcriptomic analysis of the cetuximab resistance landscape in 35 RAS wild-type CRCs identified associations of NF-1 and non-canonical RAS/RAF-aberrations with primary resistance and validated transcriptomic CRC-subtypes as non-genetic predictors of benefit. 64% of biopsies with acquired resistance harbored no genetic resistance drivers. Most of these had switched from a cetuximab-sensitive transcriptomic subtype pre-treatment to a fibroblast- and growth factor-rich subtype at progression. Fibroblast-supernatant conferred cetuximab resistance in vitro, confirming a major role for non-genetic resistance through stromal remodeling. Cetuximab treatment increased cytotoxic immune infiltrates and PDL1 and LAG3 immune-checkpoint expression, potentially providing opportunities to treat cetuximab-resistant CRCs with immunotherapy.
Study
EGAS00001003367
Integrated genomic analysis identifies driver genes and cisplatin-resistant progenitor phenotype in pediatric liver cancer
Pediatric liver cancers (PLCs) comprise diverse diseases affecting infants, children and adolescents. Despite overall good prognosis, PLCs display heterogeneous response to chemotherapy. Integrated genomic analysis of 126 pediatric liver tumors showed a continuum of driver mechanisms associated with patient age, including new targetable oncogenes. In 10% of hepatoblastoma patients, all before 3 years old, we identified a mosaic premalignant clonal expansion of cells altered at the 11p15.5 locus. Analysis of spatial and longitudinal heterogeneity revealed an important plasticity between ‘Hepatocytic’, ‘Liver Progenitor’ and ‘Mesenchymal’ molecular subgroups of hepatoblastoma. We showed that during chemotherapy, ‘Liver Progenitor’ cells accumulated massive loads of cisplatin-induced mutations with a specific mutational signature, leading to the development of heavily mutated relapses and metastases. Drug screening in PLC cell lines identified promising targets for cisplatin-resistant progenitor cells, validated in mouse xenograft experiments. These data provide new insights into cisplatin resistance mechanisms in PLC and suggest alternative therapies.
Study
EGAS00001005108
Phase 1 CX-5461 Trial (Canadian Cancer Trials Group Trial IND.231)
CX-5461 is a G-quadruplex stabilizer that exhibits synthetic lethality in homologous recombination (HR) deficient models. In this multicentre Phase Ib trial in patients with solid tumors, 40 patients were treated across 10 dose levels (50 - 650 mg/m2) to determine the recommended phase II dose (RP2D), and evaluate safety, tolerability, pharmacokinetics. Defective HR was explored as a predictive biomarker. CX-5461 was generally well tolerated, with a RP2D of 475 mg/m2 d1, 8 and 15 every 4 weeks, and dose limiting toxicities of phototoxicity. Responses were observed in 10% of patients, primarily in patients with defective HR (ORR 14%). Reversion mutations in PALB2 and BRCA2 were detected on progression following initial response in germline carriers, confirming the underlying synthetic lethal mechanism. In vitro characterization of UV sensitization showed this toxicity is related to the CX-5461 chemotype, independent of G4 synthetic lethality. These results establish clinical proof-of-concept for this first-in class G4 stabilizer.
Study
EGAS00001006173
Tissue and plasma RNA from esophageal cancer and precursor lesions
In the past decades, the incidence of esophageal adenocarcinoma has increased dramatically in Western populations. Better understanding of disease etiology along with the identification of novel prognostic and predictive biomarkers are urgently needed to improve the dismal survival probabilities. Here, we performed comprehensive RNA (both coding and non-coding) profiling in various samples from 17 patients diagnosed with esophageal adenocarcinoma, high-grade dysplastic or non-dysplastic Barrett’s esophagus. Per patient, a blood plasma sample, and a healthy esophageal and disease tissue sample were included. In total, this comprehensive dataset consists of 102 RNA-seq libraries from 51 samples. This unique resource allows for discovery of novel biomarkers and disease mechanisms, comparison of tissue and liquid biopsy profiles, integration of coding and non-coding RNA patterns, and can serve as a validation dataset in other RNA landscaping studies. Moreover, structural RNA differences can be identified in this dataset, including protein coding mutations fusion genes, and circular RNAs.
Study
EGAS00001004939
Mult-omics Palbociclib Resistance Study in HR+/HER2– Metastatic Breast Cancer
Resistance to CDK4/6 inhibitors and endocrine therapy (ET) are common and poorly understood since they have been real game changer in patients with HR-positive and HER2-negative MBC. A comprehensive genomic and transcriptomic analysis of pre-treatment and post-treatment tumors from patients treated with palbociclib plus ET identified novel markers associated with poor prognosis such as genomic scar features caused by homologous repair deficiency (HRD), estrogen response signatures, and four unique prognostic clusters with distinct molecular features. Tumors with TP53 mutations co-occurring with a unique HRD-high cluster responded poorly. By comparing genomic and transciptomic profiles of paired samples, tumors were found to be further enriched in HRD genomic scars and many had switched to aggressive molecular subtypes. Furthermore, we identified high frequencies of acquired genomic alterations upon disease progression in RB1, ESR1, PTEN, and KMT2C. Our findings provide new insight into potential predictive biomarkers that could be targeted to overcome resistance. (NCT03401359)
Study
EGAS00001005736
CNA differences between RNA-based subtypes of PDAC
Pancreatic ductal adenocarcinomas (PDAC) have a dismal prognosis with a 5-year survival of <5%. Chemoresistance and recurrences, seen in approximately 70% of PDAC patients, may partly be explained by intratumor heterogeneity (ITH) through which tumors may develop ‘escape routes’. In the Pancreas Multibiopsy (PM)-project we have obtained multiple fragments from pancreatic and other peri-ampullary carcinomas in order to reveal potential ITH and identify genetic aberrations that predict outcome after surgery and response to chemotherapy. CNA-profiles were used to complement the characterization of liabilities in PDAC that we identified previously using RNAseq in the form of epithelial, secretory, compound pancreatic, and mesenchymal subgroups that showed survival outcomes. Comparison of CNA-profiles between these transcriptomics-based subgroups (with 8, 3, 6, and 6 tumors respectively per group) showed differences, ranging from ‘quiet’(i.e. hardly any CNA) to ‘wild’(i.e. lots of CNA), but these were not statistically significant (Chi2 0.09). When differentiating between the classical and basal-like molecular subtypes of PDAC differences in CNA (high or low) were even less statistically significant (Chi2 0.292).
Study
EGAS50000001218
Mechanisms_of_patient_response_to_Dabrafenib_in_Melanoma
Samples will be from the BRF113683 (BREAK-3) study which is a Phase III Randomized, Open-label Study Comparing GSK2118436 to Dacarbazine (DTIC) in Previously Untreated Subjects With BRAF Mutation Positive Advanced (Stage III) or Metastatic (Stage IV) Melanoma (n=250 enrolled)•NGS [Agilent capture (Sanger V2 panel): 360 genes and 20 gene fusions; Illumina HiSEQ Sequencing]•CNV: [via NGS or Affy SNP 6.0 or Illumina Omni (TBD)]Bioinformatics: Analysis will be performed using core Sanger informatics pipelines similar to those previously described (Papaemmanuil E et al. (2013) Blood. 22:3616 -3627). Briefly, copy number analysis will be performed using the ASCAT algorithm, and base substitutions, small insertions and deletions using the CAVEMAN and Pindel algorithms, respectively. Statistical approaches including generalized linear models will be used to predict clinical variables such as maximum clinical response and duration of response using genetic data. Sanger and EBI to conduct analysis; Raw data and correlation with clinical endpoints to be analyzed by both EBI/Sanger and GSK (unique pipeline analyses to increase call confidence)
Study
EGAS00001000946
Pipeline Olympics: Continuous benchmarking of computational workflows for DNA methylation sequencing data
DNA methylation is a widely studied epigenetic mark and a powerful biomarker of cell type, age, environmental exposure, and disease. Genome-wide DNA methylation profiling is usually performed using whole-genome sequencing following selective conversion of unmethylated cytosines into thymines via bisulfite treatment or enzymatic methods. Numerous software tools facilitate processing of raw sequencing data for DNA methylation profiling, but a comprehensive benchmarking has been lacking. In this study, we systematically compared methods and tools for processing DNA methylation sequencing data. We established a dedicated benchmarking dataset comprising genome-wide DNA methylation profiling data for four reference samples using five experimental protocols each. The results of data processing were compared to highly quantitative locus-specific DNA methylation measurements available for 46 genomic loci. Based on this gold-standard reference dataset, we identified workflows that consistently demonstrated high performance. We present our results in an interactive application that provides a “living benchmark” ready for continuous updating as new workflows are proposed and require benchmarking against existing methods. In summary, our study provides guidance for any laboratory studying DNA methylation on a genome-wide scale.
Study
EGAS50000000541
Identification_of_synthetic_lethal_genes_by_CRISPR_Cas9_library
Sequencing is performed on PCR products from the amplification of gRNA segments introduced into cell lines (some are commercial, some are derived by collaborators, as described in HMDMC 13/076). Cellular DNA or RNA are not sequenced. This data is defined as "Other Biological/Biochemical Assay Data" under the WTSI datasharing guidelines and consequently will be released at the time of publication and/or through faculty websites.
SEQUENCING DETAILS: in October 2016 3 DNA samples will be submitted for PCR amplification and purification through CGP pipeline using Thompson's custom primer set and PCR settings and sequencing with Hiseq using a custom protocol (with 3 forward reads to read sgRNA1, sgRNA2 and the index, optimized with Illumina Bespoke Sanger team); read length 50bp. The following set of samples will be submitted in later 2016 and ealry 2017 as cell pellet to be processed through CGP Cas9 pipeline (Thompson's oligos and protocols) and sequences on Hiseq (same custom sequencing protocol described above)
We are utilizing a custom CRISPR/Cas9 library to identify synthetic lethal gene pairs. Cells will be transduced with LV expressing Cas9, then infected with LV expressign the library of 42'000 sgRNA pairs. Cell pellet will be collected from the cell population at different timepoints. The samples will be processed by CGP pipeline customized for the specific protocol and then sequenced to identify the 19bp of the sgRNA and quantify their representation.
Study
EGAS00001002117
Germline variants collaborate with somatic mutations and initiate and/or drive disease in primary myelodysplastic syndrome (MDS) and therapy-related myeloid neoplasms (t-MN)
Therapy-related myeloid neoplasms (T-MN) are poorly characterized secondary hematological malignancies following chemotherapy/radiotherapy exposure. We compared the clinical and mutational characteristics of T-MN (n=129) and primary myelodysplastic syndrome (P-MDS, n=108) patients. Although the somatic mutation frequency was similar between T-MN and P-MDS patients (93% in both groups), the pattern was distinct. TP53 mutations were more frequent in T-MN (29.5% versus 7%), while spliceosomal complex mutations were more common in P-MDS (56.5% versus 25.6%).In contrast to P-MDS, the ring sideroblasts (RS) phenotype was not associated with better survival in T-MN, most probably due to genetic association with TP53 mutations. SF3B1 was mutated in 96% of P-MDS with ≥15% RS, but in only 32% T-MN. TP53 mutations were detected in 92% T-MN with ≥15% RS and SF3B1 wild-type cases. Interestingly, T-MN and P-MDS patients with “Very low” or “Low” Revised International Prognostic Scoring System (IPSS-R) showed similar biological and clinical characteristics. In a Cox-regression analysis, TP53 mutation was a poor prognostic factor in T-MN, independent of IPSS-R cytogenetic, disease-modifying therapy and NRAS mutation.Our data have direct implications for T-MN management and provide evidence that, in addition to conventional disease parameters, mutational analysis should be incorporated in T-MN risk stratification.
Study
EGAS00001003547
Proteom characterization in primary colorectal cancer and corresponding liver metastasis
Colorectal adenocarcinomas (CRC) are one of the most commonly diagnosed tumors worldwide. Colorectal adenocarcinomas primarily metastasize into the liver and (less often) into the peritoneum. Patients suffering from CRC-liver metastasis (CRC-LM) typically present with a dismal overall survival compared to non-metastasized CRC patients. The metastasis process and metastasis promoting factors in patients with CRC are under intensive debate. However, CRC studies investigating the proteome biology are lacking. Formalin-fixed paraffin-embedded (FFPE) tissue specimens provide a valuable resource for comprehensive proteomic studies of a broad variety of clinical malignancies. The presented pilot study compares the proteome of primary CRC and patient-matched CRC-LM. The applied protocol allows a reproducible and straightforward identification and quantification of over 2,600 proteins within the dissected tumorous tissue. Subsequent unsupervised clustering reveals distinct proteome biologies of the primary CRC and the corresponding CRC-LM. Statistical analysis yields multiple differentially abundant proteins in either primary CRC or their corresponding liver metastases. A more detailed analysis of dysregulated biological processes suggests an active immune response in the liver metastases, including several proteins of the complement system. Proteins with structural roles, e.g. cytoskeleton organization or cell junction assembly appear to be less prominent in liver metastases as compared to primary CRC. Immunohistochemistry corroborates proteomic high expression levels of metabolic proteins in CRC-LM. We further assessed how the in vitro inhibition of two in CRC-LM enriched metabolic proteins affected cell proliferation and chemosensitivity. The presented proteomic investigation in a small clinical cohort promotes a more comprehensive understanding of the distinct proteome biology of primary CRC and their corresponding liver metastases.
Study
EGAS00001005641
HCA_Thymus_Disease_DiGeorge_Transplant_RNA
Single cell transcriptomics study of thymic transplant biopsies
Allogeneic thymus transplantation is the only curative therapy for complete DiGeorge Syndrome (cDGS), a rare severe primary immunodeficiency characterised by athymia. GOSH is one of only two centres worldwide to offer this treatment. Despite a lack of major histocompatibility complex (MHC)-matching between donor and host, transplanted thymus becomes repopulated by recipient bone marrow derived precursor cells and supports development of functional T-cells. The mechanisms underlying thymopoiesis in this context are poorly understood, but over time we observe reconstitution of T-cell immunity, with the ability to produce host naïve T-cells showing a broad T-cell receptor (TCR) repertoire and to generate MHC-restricted T-cell proliferative responses. Although lifesaving, the achieved immunological reconstitution is typically not complete with circulating T-cell numbers usually remaining below the age related normal ranges. Additionally, we observe persistence of donor-derived T-cells of unknown clinical significance. To gain more insight into the mechanisms by which MHC-mismatched transplanted thymus supports T-cell development with self-tolerance, as well as into the basis of suboptimal T-cell immunity, we now aim to investigate immune reconstitution after thymus transplantation in further detail by using single-cell transcriptomics, applied to thymic transplant biopsies and peripheral blood samples collected during standard post-transplant patient care. By identifying which lineages of host- derived cells repopulate the thymic tissue after transplantation, we will be able to address the role of MHC in positive and negative T-cell selection during T-cell differentiation. We will also be able to clarify the exact ontogeny of the persistent donor T-cells, as well as their possible role. Understanding the mechanisms of action of HLA-mismatched transplanted thymus will contribute to treatment optimisation. Additionally, our research provides a unique opportunity to further investigate key immunological concepts, such as tolerance and autoimmunity, challenging existing paradigms in thymus immunology.
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Study
EGAS00001004025