Background: Lung carcinoma-in-situ (CIS) lesions are the pre-invasive precursor to lung squamous cell carcinoma. However, only half progress to invasive cancer in three years, while a third spontaneously regress. Whether modern molecular profiling techniques can identify those pre-invasive lesions that will subsequently progress and distinguish them from those that will regress is unknown. Methods: Progressive and regressive CIS lesions were laser-captured and their genome, epigenome and transcriptome interrogated. We analysed 83 progressive lesions, 41 regressive and 33 normal epithelial control samples. DNA methylation and gene expression profiles were further validated using publicly available lung cancer data. Results: Somatic mutation burden was higher in progressive lesions than regressive CIS lesions, across base substitutions, rearrangements, and copy number changes. Driver mutations were present in both progressive and regressive CIS lesions, but were more numerous in progressive cases. Progressive and regressive CIS lesions had distinct epigenomic and transcriptional profiles, with a strong chromosomal instability signature. Gene expression, methylation and copy number profiles can all predict accurately which CIS lesions will progress to lung cancer. Conclusion: Pre-invasive CIS lesions that will subsequently progress to invasive lung cancer can be distinguished from those that will regress using molecular profiling. Progression is associated with a strong chromosomal instability signature. These findings inform the development of novel therapeutic targets.
...
The study was conducted under the auspices of the Transdisciplinary Research In Cancer of the Lung (TRICL) Research Team, which is a part of the Genetic Associations and MEchanisms in ONcology (GAME-ON) consortium, and associated with the International Lung Cancer Consortium (ILCCO). Ethics All participants provided written informed consent. All studies were reviewed and approved by institutional ethics review committees at the involved institutions. Sequencing data are derived from four sub-studies. The sub-studies that contributed include Harvard, Liverpool, Toronto, and IARC. The IARC and Toronto studies are described above. A description of the Harvard and Liverpool studies is provided below. Liverpool Lung Project: The Liverpool Lung Project (LLP)1 is a case control and cohort study, which has over 11,500 individuals, with detailed epidemiological, clinical and outcome data with associated specimens (i.e. tumour tissue, blood, plasma, sputum, bronchial lavage, EBUS and oral brushings). The participants have completed a detailed lifestyle questionnaire and updated data on clinical outcome and hospital events are collected through the Office of National Statistics, Cancer Registry and from Health Episode Statistics. The project is registered on the UK National Institute for Health Research (NIHR) lung cancer portfolio and has all the required ethical approvals and sponsorship arrangements in place. The LLP has detailed standard operating procedures (SOP) for all aspects of the recruitment, data, specimen collection as well as the data storage. The LLP Cohort study has 8,224 participants with blood and 7,761 with plasma samples. The LLP case-control samples have been incorporated into in a large number of international GWAS and molecular studies 2,3, methylation 4-7, microRNA 8and next generation studies 9-11, resulting in high ranking publications, as well as forming the basis for the LLP risk prediction model 12-14 which has been utilised in the UK lung cancer screening trial (UKLS) 15-17 Patient and control DNAs were derived from EDTA-venous blood samples. Harvard Samples. David Christiani at the Harvard University School of Public Health has been directing research studies to investigate etiological factors influencing lung cancer development since 1983 and has amassed a collection of 2000 controls and 5055 lung cancer cases. He has been actively collecting and storing snap frozen tumor samples since 1992. Around 1500 tumor samples have been collected and the average wet tumor yield is about 30 grams of tumor, of which 631 cases have completely annotated clinical and survival information. Pathology confirmation is provided by two pathologists. At the time of surgery, a minimum of 30 grams of wet lung tumor tissue and 30 grams of non-involved tissue from the same lobe is sectioned, flash frozen and sent to Dr. Christiani's lab for logging and storage. A blood sample for DNA and serum is collected. A structured interview by trained research staff is conducted on each case, and clinical outcomes and treatments is extracted and entered into the molecular epidemiology data base at Harvard. Fresh frozen samples have been collected from 1451 lung cancer and are available for study. Samples from this collaborative study have played key roles in major studies, including the initial finding describing EGFR mutations in lung cancer 22. Participants in this study are patients, > 18 years of age, with newly diagnosed histologically confirmed lung cancer. Samples that are included in the analysis have the following histologies: Adenocarcinoma: 8140/3, 8250/3, 8260/3, 8310/3, 8480/3 8560/3; LCC: 8012/3, 8031/3; squamous carcinoma: 8070/3, 8071/3, 8072/3, 8074/3; and other NSCLC: 8010/3, 8020/3, 8021/3, 8032/3, 8230/3. The Toronto Study: The Toronto study was conducted in the Great Toronto Area between 1997 and 2014. Cases were recruited at the hospitals in the network of University of Toronto and Lunenfeld- Tanenbaum Research Institute. At the time of recruitment in the clinical setting, provisional diagnoses of lung carcinoma were first assigned based on clinical criteria. Diagnoses for all cases included were histologically confirmed by the reference pathologist who is a specialist in pulmonary pathology, based on review of pathology reports from surgery, biopsy or cytology samples in 100% of cases. Diagnostic classification was done initially according to ICD-9, ICD-10, and ICD for oncology-2, and subsequently converted to ICD-O-3. Tumors were grouped into the major categories included in this analysis according to primary cancer type based on the ICD-3 definitions. Controls were randomly selected from individual visiting family medicine clinics and Ministry of Finance Municipal Tax Tapes. All subjects were interviewed using a standard questionnaire and information on lifestyle risk factors, occupational history, medical and family history was collected. Blood samples were collected from more than 85% of the subjects. IARC: The IARC data are derived from case-control studies conducted in Russia and include samples that have available tissue samples. Patient and control DNAs were derived from EDTA-venous blood samples. The lung cancer patients were classified according to ICD-O-3; SQ: 8070/3, 8071/3, 8072/3, 8074/3; AD: 8140/3, 8250/3, 8260/3, 8310/3, 8480/3, 8560/3, 8251/3, 8490/3, 8570/3, 8574/3; with tumous with overlapping histologies classified as mixed. The Lung Cancer Transdisciplinary Research Cohort is utilized in the following dbGaP sub-studies. To view genotypes, other molecular data, and derived variables collected in these sub-studies, please click on the following sub-studies below or in the "Sub-studies" section of this top-level study page phs000876 Lung Cancer Transdisciplinary Research Cohort. phs000877 Meta Analysis phs000878 CIDR Lung Cancer phs001681 Affy Axiom Array
In the setting of localized colon cancer (CC), circulating tumor DNA (ctDNA) monitoring in plasma has shown potential for detecting minimal residual disease (MRD) and predicting higher risk of recurrence. With the tumor-only sequencing approach, however, germline variants may be misidentified as somatic variations, precluding the possibility of tracking in up to 11% of patients due to a lack of known somatic mutations. In this study comprising 148 prospectively recruited localized CC patients, a custom 29-gene panel was utilized to sequence both tumor tissue and matched white blood cells (WBCs) to enhance the accuracy of sequencing results. Performing targeted sequencing of paired tumor tissue and WBCs samples detected additional somatic mutations and increased the number of patients eligible for MRD tracking in plasma, although MRD detection sensitivity was not increased. Furthermore, the germline testing approach revealed the presence of pathogenic germline variants, thereby helping identify patients at elevated risk of hereditary cancer syndromes.
Mismatch repair (MMR) deficient cancer evolves through the continuous erosion of coding homopolymers in target genes. Curiously, the MMR genes MSH6 and MSH3 also contain coding homopolymers and these are frequent mutational targets in MMR-deficient cancers. The impact of incremental MMR mutations on MMR-deficient colorectal cancer evolution is unknown. We show that microsatellite instability modulates DNA repair by toggling hypermutable mononucleotide homopolymer runs in MSH6 and MSH3 through stochastic frameshift switching. Spontaneous mutation and reversion modulates subclonal mutation rate, mutation bias, and clonal HLA and neoantigen diversity. Patient-derived organoids corroborate these observations and show that in the absence of immune selection MMR homopolymer sequences drift back into reading frame, suggesting a fitness cost of elevated mutation rates. Combined experimental and simulation studies demonstrate that subclonal immune selection favours incremental MMR mutations. Overall, our data demonstrate that MMR-deficient colorectal cancers fuel intratumour heterogeneity by adapting subclonal mutation rate and diversity to immune selection.
Peritoneal metastases (PM) in gastric cancer (GC) portend a poor prognosis, yet our understanding of tumor microenvironmental (TME) characteristics associated with GCPM remain limited. Here, we analyzed intrinsic genomic alterations and transcriptomic programs predictive of GCPM in a prospective cohort of 248 patients, identifying CDH1, PIGR, and ELF3 mutations as predictors. By inspecting the spatial dynamics of the TME, we find that tumor compartment infiltration of pro-tumorigenic cell types such as inflammatory cancer-associated fibroblasts (CAFs) predict peritoneal recurrence. Next, in a cross-sectional study of 205 samples from 55 patients, distinct pathways and immune compositions in GCPM relative to liver metastases highlight the TME's significance in transcoelomic metastases. Notably, several putative therapeutic targets exhibited distinct expression patterns between PTs and PMs. Our findings highlight transcriptomic variations and niche reprogramming in the GCPM peritoneal environment, revealing roles of myeloid dendritic cells, effector memory CD8+ T cells, and CAFs in metastatic progression.