UK10K COHORT ALSPAC
The UK10K project proposes a series of complementary genetic approaches to find new low-frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome-wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein-coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will directly analyse quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches.
The Avon Longitudinal Study of Parents and Children (ALSPAC) is a two-generation prospective study. Pregnant women living in one of three health districts in the former county of Avon with an expected delivery date between April 1991 and December 1992 were eligible to be enrolled in the study, and this formed the initial point of contact for the development of a large, family based resource.
Information has been collected on the children and the mothers through retrieval of biological materials (e.g. antenatal blood samples, placentas), biological sampling (e.g. collection of cord blood, umbilical cord, milk teeth, hair, toenails, blood and urine), self-administered questionnaires, data extraction from medical notes, linkage to routine information systems and at repeat research clinics.
Study
EGAS00001000090
UK10K_NEURO_ASD_FI
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches.These samples are a subset of a nationwide collection of Finnish autism spectrum disorder (ASD) samples. The samples have been collected from Central Hospitals across Finland in collaboration with the University of Helsinki. The samples consist of 93 individuals with a diagnosis of autistic disorder or Asperger syndrome from 36 families with at least two affected individuals. Of these individuals, 16 can be genealogically connected to form two large pedigrees originating from Central Finland, suggesting possible genetic risk factors shared identical by descent within the pedigrees. All diagnoses are based on ICD-10 and DSM-IV diagnostic criteria for ASDs. Additional phenotypic data is available for a subset of the individuals.For further information with regard to this cohort please contact Aarno Palotie (Aarno.palotie@helsinki.fi).
Study
EGAS00001000110
WTCCC case-control study for Type 1 Diabetes - Combined Controls
WTCCC genome-wide case-control association study for Type 1 Diabetes (T1D) using six disease collections together with the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
Study
EGAS00000000015
WTCCC case-control study for Type 2 Diabetes - Combined Controls
WTCCC genome-wide case-control association study for Type 2 Diabetes (T2D) using six disease collections together with the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
Study
EGAS00000000017
Saliva microbiota in Finnish children
The human intestinal microbiota may play a role in the development of overweight and obesity. However, associations between saliva microbiota and body mass index (BMI) have been sparsely studied, although the oral cavity is the major gateway for microbes into the body. The aim of this study was to identify associations between the saliva microbiota and BMI categories in Finnish children aged 9-14 years.
Study
EGAS00001003039
GoT2D: Genetics of Type 2 Diabetes, a study of the the genetic architecture of type 2 diabetes using low pass whole genome sequencing and high density SNP genotyping in 2,657 individuals.
The Genetics of Type 2 Diabetes Consortium (GoT2D) is a collaboration between the University of Michigan, the Broad Institute and the Wellcome Trust Centre for Human Genetics. The overall aim is to extend upon recent efforts, such as genome-wide association studies (GWAS) and large scale meta-analyses. While they have proved successful at mapping genomic loci that influence human diseases, like type 2 diabetes, much of the heritability remains unexplained. In this study, we use next generation sequencing and genotyping technologies to query for lower frequency variants in the human genome. Thereby, allowing a deeper characterization of the spectrum of alleles associated with type 2 diabetes risk, and a better assessment of the genes that play a role in the etiology of type 2 diabetes development. We studied 1,326 T2D cases and 1,331 normoglycemic controls from Northern and Central Europe (Sweden, Finland, UK, and Germany). To efficiently characterize the entire genome sequence of each individual, we performed low-coverage (~5x) whole-genome sequencing, augmented by deep coverage (~100x) sequencing of the exome, and dense (2.5M) single nucleotide polymorphism (SNP) genotyping using the HumanOmni2.5 array. The data deposited in EGA will include all the Swedish, Finnish, UK, and German samples.
Study
EGAS00001001459
Sequencing_probands_and_families_with_severe_insulin_resistance_syndromes
This is an ongoing project and continuation to all the sequencing we have been doing over the last few years. We have some additional families and probands with syndromes of insulin resistance not previously sequenced within uk10k or other core funded projects. We would like to complete the sequencing in all of the good quality families and probands we have, this would require another ~50 samples to be WES sequenced. This cohort has already proven to be a rich source of interesting findings with papers in Science and Nature genetics.
Study
EGAS00001000488
T2D-GENES: Exome sequencing
The Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples (T2D-GENES) is a large collaboration effort between the University of Michigan, Broad Institute, the Wellcome Trust Centre for Human Genetics, University of Chicago, NIDDK, Texas Biomedical Research Institute, University of Texas Health Science Center, University of North Carolina, Blood Systems Research Institute, and University of Mississippi Medical Center. The goal of this study is to identify genetic variants for type 2 diabetes through multiethnic sequencing studies. In this study, we use next generation sequencing to assess the role of common, low frequency, and rare variants on type 2 diabetes risk. We preformed exome sequencing in 6,546 T2D cases and 6,457 control individuals of multiethnic origin, including individuals of African American, East Asian, South Asian, Hispanic, and European descent.
Study
EGAS00001001460
UK10K NEURO IOP COLLIER
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. The Genetics and Psychosis (GAP) set consists of samples from subjects with schizophrenia, ascertained as a new-onset sample. This set is of UK origin, with data on cognition, brain imaging and other endophenotypes. The Maudsley twin series consists of probands ascertained from the Maudsley Twin Register, defined as patients of multiple birth who had suffered psychotic symptoms. This set is of UK origin, with data on cognition, brain imaging and other endophenotypes, with DNA available from an MZ or DZ affected or unaffected co-twin. The Maudsley family study (MFS) consists of over 250 families who have a history of schizophrenia or bipolar disorder Within the Maudsley Family Study, biological markers of psychosis include neuropsychological tests, Evoked Response Potentials Tests (ERPs), MRI scans, dermatoglyphics and eye tracking. Early risk factors for psychosis and clinical symptoms are also investigated. This set is of UK origin, with DNA available from both affected and unaffected relatives in many of the probands. For further information on this study please contact David Collier (david.collier@kcl.ac.uk).
Study
EGAS00001000121
UK10K NEURO GURLING
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. This sample set consists of DNA from multiply affected schizophrenia families. The families have been diagnosed using the SADS-L clinical instrument which gives diagnoses at the probable level of the research diagnostic criteria (RDC). In addition all diagnoses are available using DSMIIIR criteria. These criteria are widely accepted as being valid and reliable for the diagnosis of schizophrenia. All families have been collected to ensure that they are uni-lineal for transmission of schizophrenia, i.e. they have only one affected parent with schizophrenia, or a relative of only one transmitting or obligate carrier parent with schizophrenia. Families with bi-lineal transmission of schizophrenia (i.e. with both parents being affected) were not sampled for this study. All families have multiple cases of schizophrenia and related disorders. The families have been selected to ensure there are no cases of bipolar disorder within them and that they do not contain bipolar disorder in any relatives on either side of the family.For further information on this cohort please contact Hugh Gurling (h.gurling@ucl.ac.uk).
Study
EGAS00001000225
Molecular Phenotyping to Accelerate Genomic Epidemiology (MolPAGE)
MolPAGE is an integrated EU project that aims to develop methods and tools for discovery of biomarkers associated with diabetes (see www.ocdem.ox.ac.uk for MolOBB and www.twinsuk.ac.uk for MolTWIN).
Study
EGAS00000000102
Human liver NPCs single cell project
Independent of their inflammatory phenotype, macrophages are key orchestrators of hepatic metabolism. Non-alcoholic fatty liver disease (NAFLD) often occurs in obese individuals and is among the most common causes of cirrhosis, the terminal chronic liver disease that may necessitate liver transplantation. While multiple populations of macrophages have been described in the human liver, their function and turnover in obese patients at high risk of developing NAFLD and cirrhosis is currently unknown. Herein we identified a specific human population of resident liver myeloid cells that protects against the metabolic impairment associated with obesity. By studying the turnover of liver myeloid cells in individuals undergoing liver transplantation using markers of donor-recipient mismatch, we made the novel discovery that liver myeloid cell turnover differs between humans and mice. Using single cell techniques and flow cytometry we determined that the proportion of the protective resident liver myeloid cells, denoted liver myeloid cells 2 (LM2), decreases during obesity. Functional validation approaches using human 2D and 3D cultures revealed that the presence of LM2 ameliorates the oxidative stress associated with obese conditions. Our study indicates that resident myeloid cells could be a therapeutic target to decrease the oxidative stress associated with NAFLD.
Study
EGAS00001007194
HipSci HumanExome BeadChip analysis - monogenic diabetes
The HipSci project brings together diverse constituents in genomics, proteomics, cell biology and clinical genetics to create a UK national iPS cell resource and use it to carry out cellular genetic studies. In this sub-study we performed Genotyping analysis using the Infinium HumanExome BeadChip on iPS cells generated from skin biopsies or blood samples from rare disease patients diagnosed with monogenic diabetes.
Study
EGAS00001001273
HipSci Illumina 450K Methylation analysis - monogenic diabetes
The HipSci project brings together diverse constituents in genomics, proteomics, cell biology and clinical genetics to create a UK national iPS cell resource and use it to carry out cellular genetic studies. In this sub-study we performed Methylation analysis using the Infinium HumanMethylation450 BeadChip on iPS cells generated from skin biopsies or blood samples from rare disease patients diagnosed with monogenic diabetes.
Study
EGAS00001001275
T1DGC GWAS 1958 British Birth Cohort controls
T1DGC genome-wide case-control association study for Type 1 Diabetes (T1D) using the 1958 British Birth Cohort as controls. The cases for this study are made available from the NCBI dbGAP.
Study
EGAS00000000038
HipSci HumanHT 12v4 Expression BeadChip analysis - monogenic diabetes
The HipSci project brings together diverse constituents in genomics, proteomics, cell biology and clinical genetics to create a UK national iPS cell resource and use it to carry out cellular genetic studies. In this sub-study we performed Expression analysis using the Illumina HumanHT -12 Expression BeadChip on iPS cells generated from skin biopsies or blood samples from rare disease patients diagnosed with monogenic diabetes.
Study
EGAS00001001277
Heritable pulmonary arterial hypertension in a large Iberian family
Data from a study of a large Iberian family (n=65 subjects, 5 generations) affected by pulmonary arterial hypertension (PAH) and segregating with the BMPR2 missense mutation p.Arg491Gln (rs137852749, c.1472G>A). PAH is a rare disease characterized by an abnormal rise in mean pulmonary arterial pressure (> or equal to 25 mmHg at rest), which, in turn, leads to a progressive increase in pulmonary vascular resistance and ultimately to death, due to right ventricular failure. Heritable PAH has an overall prevalence below 1 case per million adults and is defined by either the presence of a known genetic defect linked to the disease or a positive family history. Heritable PAH is inherited as an autosomal dominant disease. However, not all BMPR2 mutation carriers develop the disease, highlighting the presence of reduced penetrance. In this family, there are 22 mutation carriers from which 8 were diagnosed with heritable PAH and the other 14 were healthy at the time of examination.
Study
EGAS00001003123
RNA seq before and after cold pressor test
The cold pressor test (CPT) is a widely used pain provocation test to investigate both pain tolerance and cardiovascular responses. Twenty-two females were phenotypically assessed before and after a CPT, and blood samples were taken for RNA-sequencing.
Study
EGAS00001006690
Profiling of 27 type 2 diabetes GWAS loci using next-generation (NG) capture C in a human beta-cell model
Most type 2 diabetes association signals are due to variants that impact gene regulation but identifying the genes they impact is challenging. We generated interaction profiles at 27 T2D GWAS loci using next-generation (NG) capture C in a human beta-cell model ( EndoC-betaH1) and contrasted our maps with public Hi-C/promoter capture Hi-C maps in EndoC-betaH1 cells or human islets.
Study
EGAS00001006105
InsPIRE islets
We explored the relationship between genetic variants influencing predisposition to type 2 diabetes (T2D) and related glycemic traits, and human pancreatic islet transcription using RNA-Seq and genotyping data from 420 islet donors and 26 FAC sorted beta-cells.
Study
EGAS00001003997
UK10K RARE COLOBOMA
In the UK10K project we propose a series of complementary genetic approaches to find new low frequency/rare variants contributing to disease phenotypes. These will be based on obtaining the genome wide sequence of 4000 samples from the TwinsUK and ALSPAC cohorts (at 6x sequence coverage), and the exome sequence (protein coding regions and related conserved sequence) of 6000 samples selected for extreme phenotypes. Our studies will focus primarily on cardiovascular-related quantitative traits, obesity and related metabolic traits, neurodevelopmental disorders and a limited number of extreme clinical phenotypes that will provide proof-of-concept for future familial trait sequencing. We will analyse directly quantitative traits in the cohorts and the selected traits in the extreme samples, and also use imputation down to 0.1% allele frequency to extend the analyses to further sample sets with genome wide genotype data. In each case we will investigate indels and larger structural variants as well as SNPs, and use statistical methods that combine rare variants in a locus or pathway as well as single-variant approaches. Ocular coloboma is the most common significant developmental eye defect with an incidence of ~1 in 5,000 live births. It results from failure of optic fissure closure during embryogenesis. The position and extent of the fusion failure dictates the clinical appearance and functional effect. ~30% of coloboma cases are associated with other systemic malformations. These UK10K samples will mostly comprise isolated coloboma cases without systemic involvement (aka non-syndromal coloboma). There is strong evidence from family studies that coloboma has a major genetic component with autosomal dominance being the most common pattern of inheritance. However, many cases are isolated or show complex patterns of familial clustering. The genes responsible for isolated coloboma are largely unknown, but in a small number of families mutations in SHH, CHX10, and PAX6 have been identified indicating marked genetic heterogeneity. Thus in addition to the clinical benefits of achieving a molecular diagnosis there are also major scientific advantages to identifying coloboma genes, as these are likely to provide insights into the complex process of optic fissure closure, that is critical to normal eye development. In the longer term, understanding the molecular basis of the disease may provide clues to therapeutic strategies.For further information with regard to this cohort please contact David Fitzpatrick (david.fitzpatrick@ed.ac.uk).
Study
EGAS00001000127
ORCADES_WGA
The ORCADES Whole Genome Sequence Study plans to sequence the genomes of 400 subjects from the Orkney Complex Disease Study (ORCADES) at low coverage, each with data on up to 300 quantitative traits and other risk factors associated with cardiovascular, metabolic and other complex diseases
Study
EGAS00001000068
ESGI___Whole_Genome_Sequencing_of_samples_from_the_ORCADES_cohort__X10__
Deep whole genome sequencing of sampels from the Orkney Complex Disease Study (ORCADES), each with data on up to 300 quantitative traits and other risk factors associated with cardiovascular, metabolic and other complex diseases. The samples are sequenced using the Illumina HiSeq X Ten system.
Study
EGAS00001001125
Evolving epigenomics of immune cells in type 1 diabetes at single nuclei resolution
The appearance of diabetes-associated autoantibodies is the first detectable sign of the disease process leading to type 1 diabetes (T1D). Evidence suggests that T1D is a heterogenous disease, where the type of antibodies first formed imply subtypes. Here, we followed 49 children, who subsequently presented with T1D and 49 matched controls, profiling single-cell epigenomics at different time points of disease development. Quantitation of cell and nuclei populations as well as transcriptome and open-chromatin states indicated robust, early, replicable monocyte lineage differences between cases and controls, suggesting heightened pro-inflammatory cytokine secretion early among cases. The order of autoantibody emergence in cases showed variation across lymphoid and myeloid cells, potentially indicating cellular immune response divergence. The strong monocytic lineage representation in peripheral blood immune cells before seroconversion and the weaker differential coordination of these gene networks close to clinical diagnosis emphasizes the importance of early life as a critical phase in T1D development.
Study
EGAS50000000863
Platelet_collagen_defect
As part of the Bloodomics collaboration we have several categories of pedigrees with diseases/syndromes relevant to cardiovascular diseases (CVD). One such pedigree, is from families who have a platelet collagen defect. Exome sequencing has been performed as part of a discovery program to ascertain potential causative variants of the clinical phenotype.
Study
EGAS00001000105
RFX6-mediated dysregulation defines human β cell dysfunction in early type 2 diabetes
Study
EGAS00001006273
Somatic_mutation_and_clonal_evolution_in_premalignant_lung_disease___WGS
Cancer is a genetic disease caused by an accumulations of mutations, however many of these mutations have been identified in pathologically normal tissue. We aim to use laser-capture microscopy (LCM) to sample individual clones from the lung tissue of individuals with a variety of lung diseases (COPD, UIP, IPF, Emphysema, pulmonary hypertension). This will allow us to identify whether cancer-associated mutations appear in this normal tissue, assess the mutational burden present, and identify the mutational processes causing these mutations. Smoking is a large risk factor for developing many of these lung diseases so we are particularly keen to determining whether there is evidence of a smoking signature in these patients.
Study
EGAS00001002747
Mutational_landscape_in_haemochromatosis__WGS_
Somatic mutation has recently been shown to play a role in disease other than cancer. Driver mutations associated with lipid metabolism and insulin pathway have been identified in non-alcoholic fatty liver disease (NAFLD). It is plausible that driver mutations seen in NAFLD are a consequence of selection pressure exerted on hepatocytes in the context of excess fat accumulation. It has not been investigated whether selection pressure and driver landscape varies between chronic liver diseases of different aetiologies. This study aims to investigate mutational landscape in haemochromatosis, which is a disorder caused by a build-up of iron in the liver. We aim to use data generated by whole-genome sequencing of laser-dissected microbiopsies from diseased livers to characterize driver landscape and mutational signatures in haemochromatosis patients.
Study
EGAS00001005157
Mutational_landscape_in_haemochromatosis__exome_
Somatic mutation has recently been shown to play a role in disease other than cancer. Driver mutations associated with lipid metabolism and insulin pathway have been identified in non-alcoholic fatty liver disease (NAFLD). It is plausible that driver mutations seen in NAFLD are a consequence of selection pressure exerted on hepatocytes in the context of excess fat accumulation. It has not been investigated whether selection pressure and driver landscape varies between chronic liver diseases of different aetiologies. This study aims to investigate mutational landscape in haemochromatosis, which is a disorder caused by a build-up of iron in the liver. We aim to use data generated by whole-exome sequencing of laser-dissected microbiopsies from diseased livers to characterize driver landscape and mutational signatures in haemochromatosis patients.
Study
EGAS00001005158
WTCCC case-control study for Coronary Artery Disease
WTCCC genome-wide case-control association study for Bipolar disorder (CAD) using the 1958 British Birth Cohort and the UK National Blood Service collections as controls.
Study
EGAS00000000003
Epigenetic and metabolomic data from type 2 diabetes adolescents
Objective: Rates of type 2 diabetes (T2D) among adolescents are on the rise. Epigenetic changes could be associated with the
metabolic alterations in adolescents with T2D.
Methods: We performed a cross-sectional integrated analysis of DNA methylation data from peripheral blood mononuclear cells
with serum metabolomic data from First Nation adolescents with T2D and controls participating in the Improving Renal
Complications in Adolescents with type 2 diabetes through REsearch (iCARE) cohort study, to explore the molecular changes in
adolescents with T2D.
Results: Our analysis showed that 43 serum metabolites and 36 differentially methylated regions (DMR) were associated with T2D. Several DMRs were located near the transcriptional start site of genes with established roles in metabolic disease and associated with altered serum metabolites (e.g. glucose, leucine, and gamma-glutamylisoleucine). These included the free fatty acid receptor-1 (FFAR1), upstream transcription factor-2 (USF2), and tumor necrosis factor-related protein-9 (C1QTNF9), among others.
Conclusions: We identified DMRs and metabolites that merit further investigation to determine their significance in controlling
gene expression and metabolism which could define T2D risk in adolescents.
Study
EGAS00001003816
Hyperfibrinolysis
As part of the Bloodomics collaboration we have several categories of pedigrees with diseases/syndromes relevant to cardiovascular diseases (CVD). One such pedigree, hyperfibrinolysis is associated with increased fibrinolytic activity, resulting in increased bleeding. Exome sequencing has been performed as part of a discovery program to ascertain potential causative variants of the clinical phenotype.
Study
EGAS00001000104
Bleeding
As part of the Bloodomics collaboration we have several categories of pedigrees with diseases/syndromes relevant to cardiovascular diseases (CVD). One such group, is from individuals who exhibit severe bleeding with an unknown cause. Exome sequencing has been performed as part of a discovery program to ascertain potential causative variants of the clinical phenotype.
Study
EGAS00001000106
Various_Platelet_Disorders
As part of the Bloodomics collaboration we have several categories of pedigrees with diseases/syndromes relevant to cardiovascular diseases (CVD). One such group, is from individuals who have a platelet disorder with an unknown cause. Exome sequencing has been performed as part of a discovery program to ascertain potential causative variants of the clinical phenotype.
Study
EGAS00001000107
300-Obese: clinical cohort of obese individuals, Nijmegen, the Netherlands
300-Obese cohort was recruited at the Radboud University Medical Center (RUMC), Nijmegen, the Netherlands. The cohort comprises 377 participants included by the following criteria: age>55 years, BMI>27 kg/m2. The cohort data includes gut microbiome, NMR serum metabolomics, deep cardiovascular phenotyping and broad range of phenotypic information.
Study
EGAS00001003508
Grey_Platelet_Syndrome__GPS_
As part of the Bloodomics collaboration we have several categories of pedigrees with diseases/syndromes relevant to cardiovascular diseases (CVD). One such pedigree, Grey Platelet Syndrome (GPS) is a rare congenital bleeding disorder caused by a reduction or absence of alpha granules in platelets. Exome sequencing has been performed as part of a discovery program to ascertain potential causative variants of the clinical phenotype.
Study
EGAS00001000091
Cardiogenics_re_sequencing
The Cardiogenics re-sequencing study will consist of three parts: Eight pools of 25 individuals will be sequenced using a Nimblegen hybrid-capture solution specific to miRNA sequences, 80 pools of 25 individuals will be sequenced using a custom Agilent SureSelect array covering genes associated with coronary artery disease (CAD) and myocardial infarction (MI), 10 individuals from families with a history of CAD/MI will be exome sequenced using the Sanger exome array. The experiment will use the early onset patients from the German MI cohort and the UK BHF CAD/MI cohort both of which have strong family history. For controls we will consider individuals from the UKBS and KORA cohorts.
Study
EGAS00001000079
Investigating_low_frequency_variants_in_CAD_MI_cases__controls_and_pedigrees_using_whole_exome_sequencing_and_custom_pulldowns
The Cardiogenics re-sequencing study will consist of three parts: Eight pools of 25 individuals will be sequenced using a Nimblegen hybrid-capture solution specific to miRNA sequences, 80 pools of 25 individuals will be sequenced using a custom Agilent SureSelect array covering genes associated with coronary artery disease (CAD) and myocardial infarction (MI), 10 individuals from families with a history of CAD/MI will be exome sequenced using the Sanger exome array. The experiment will use the early onset patients from the German MI cohort and the UK BHF CAD/MI cohort both of which have strong family history. For controls we will consider individuals from the UKBS and KORA cohorts.
Study
EGAS00001000043
Investigating_low_frequency_variants_in_CAD_MI_cases__controls_and_pedigrees_using_whole_exome_sequencing_and_custom_pulldowns
The Cardiogenics re-sequencing study will consist of three parts: Eight pools of 25 individuals will be sequenced using a Nimblegen hybrid-capture solution specific to miRNA sequences, 80 pools of 25 individuals will be sequenced using a custom Agilent SureSelect array covering genes associated with coronary artery disease (CAD) and myocardial infarction (MI), 10 individuals from families with a history of CAD/MI will be exome sequenced using the Sanger exome array. The experiment will use the early onset patients from the German MI cohort and the UK BHF CAD/MI cohort both of which have strong family history. For controls we will consider individuals from the UKBS and KORA cohorts.
Study
EGAS00001000050
Human Pancreatic Beta Cell lncRNAs Control Cell-Specific Regulatory Networks
Recent studies have uncovered thousands of long non-coding RNAs (lncRNAs) in human pancreatic β cells. β cell lncRNAs are often cell type specific and exhibit dynamic regulation during differentiation or upon changing glucose concentrations. Although these features hint at a role of lncRNAs in β cell gene regulation and diabetes, the function of β cell lncRNAs remains largely unknown. In this study, we investigated the function of β cell-specific lncRNAs and transcription factors using transcript knockdowns and co-expression network analysis. This revealed lncRNAs that function in concert with transcription factors to regulate β cell-specific transcriptional networks. We further demonstrate that the lncRNA PLUTO affects local 3D chromatin structure and transcription of PDX1, encoding a key β cell transcription factor, and that both PLUTO and PDX1 are downregulated in islets from donors with type 2 diabetes or impaired glucose tolerance. These results implicate lncRNAs in the regulation of β cell-specific transcription factor networks.
Study
EGAS00001002865
Fecal microbiome predicts treatment response after the initiation of semaglutide or empagliflozin uptake
To investigate the effect of semaglutide and empagliflozin initiation on the gut microbiome of type 2 diabetes patients. In addition, to analyze whether the pre-treatment gut microbiome can predict the treatment efficacy.
Gut microbiome fecal samples donated at four timepoints (Baseline, Month 1, Month 3; Month 12) were studied using 16S ribosomal RNA gene sequencing and analysis.
Study
EGAS50000000531
The_International_1q_type_2_diabetes_consortium
The aim of this project is to identify rare variants in the 1q region associated with type 2 diabetes. To this end 651 case samples and 651 control samples from six populations have been pooled (pool sizes range from 27-33 individuals), and are being sequenced. The hybridization solution being used captures the exons and UTRs of genes in the 1q region.
Study
EGAS00001000062
Gene_Characterization_in_Carbohydrate_metabolic_alterations__neonatel_diabetes___congenital_hyperinsulinemic__in_early_childhood
Whole Exome Sequencing of trios (proband + parents) or probands only with Neonatal Diabetes Mellitus (NDM) or Congenital Hyperinsulinism of Infancy (CHI) of unknown genetic origin.
This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Study
EGAS00001002074
Extensive patient-to-patient single nuclei transcriptome heterogeneity in pheochromocytomas and paragangliomas
Pheochromocytomas (PC) and paragangliomas (PG) are rare neuroendocrine tumors of varied genetic makeup, associated with high cardiovascular morbidity and a variable risk of malignancy. We focused on PCPG tumors with germline SDHB and RET mutations, representing distinct prognostic groups with worse or better prognoses, respectively. We applied single-nuclei RNA sequencing (snRNA-seq) on tissue samples from 11 patients.
Study
EGAS00001006230
HipSci___Whole_Exome_sequencing___Monogenic_Diabetes
The HipSci project brings together diverse constituents in genomics, proteomics, cell biology and clinical genetics to create a UK national iPS cell resource and use it to carry out cellular genetic studies. In this sub-study we perform whole exome sequencing using Agilent whole exome pulldown method on iPS cells generated from skin biopsies or blood samples from rare disease patients diagnosed with Monogenic Diabetes.
Study
EGAS00001001140
Genes associated with pancreas development and function maintain open chromatin in iPSCs generated from human pancreatic beta cells
Current in-vitro islet differentiation protocols suffer from heterogeneity and low efficiency. Induced-pluripotent stem cells (iPSCs) derived from pancreatic beta-cells (BiPSCs) preferentially differentiate towards endocrine pancreas-like cells versus those from fibroblasts (FiPSCs). We interrogated genome-wide open chromatin in BiPSCs and FiPSCs via ATAC-seq, and identified ~8.3k significant, differential open chromatin sites (DOCS) between the two iPSC subtypes (FDR<0.05). DOCS where chromatin was more accessible in BiPSCs (Bi-DOCS) were significantly enriched for known regulators of endodermal development, including bivalent and weak enhancers, and FOXA2 binding sites (FDR<0.05). Bi-DOCS were associated with genes related to pancreas development and beta-cell function, including transcription factors mutated in monogenic diabetes (PDX1, NKX2-2, HNF1A; FDR<0.05). Moreover, Bi-DOCS correlated with enhanced gene expression in BiPSC-derived definitive endoderm and pancreatic progenitor cells. Bi-DOCS therefore highlight genes and pathways governing islet-lineage commitment, which can be exploited for differentiation protocol optimisation, diabetes disease modelling, and therapeutic purposes.
Study
EGAS00001002591
RODAM cohort
The RODAM study assesses the health and wellbeing of Ghanaian residents in Ghana and Europe and follows them up over time. With this unique approach the RODAM study attempts to unravel the causes of cardiovascular disease and its risk factors among African migrants in Europe and non-migrants sub-Saharan Africa. Fecal samples from RODAM participants in rural, urban Ghana and Amsterdam, the Netherlands were sequenced using 16S rRNA sequencing to assess gut microbiome differences between research sites.
Study
EGAS50000000805
RNA-seq study of longitudinal blood cell samples from children at risk of type 1 diabetes
The appearance of type 1 diabetes (T1D)-associated autoantibodies is the first and only measurable parameter to predict progression toward T1D in genetically susceptible individuals. However, autoantibodies indicate an active autoimmune reaction, wherein the immune tolerance is already broken. Therefore, there is a clear and urgent need for new biomarkers that predict the onset of the autoimmune reaction preceding auto-antibody positivity or reflect progressive b-cell destruction. Here we report the mRNA sequencing–based analysis of 306 samples including fractionated samples of CD4+ and CD8+ T cells as well as CD4, CD8 cell fractions and unfractionated PBMC samples longitudinally collected from seven children who developed beta-cell autoimmunity (Case subjects) at a young age and matched control subjects.
Study
EGAS00001004071
single-cell RNA-seq Case-Control study of children progressing to Type1 diabetes
Single-cell RNA sequencing was carried out on four selected PBMC samples of Finnish children at risk of developing Type 1 diabetes and their gender age and HLA matched control children using 10X genomic platform. All four Case children were positive for multiple islet specific autoantibodies and two of them also progressed to clinical disease during the follow up whereas the control children remain negative for all autoantibodies. Age at seroconversion for Case 2, 3, 5, and 9 children is 12, 18, 24, and 18 months, respectively. Age at sampling, in months, for the pairs was as follows (Case/Control): pair 2:24/24; pair 3: 12/18; pair 5: 12/12; pair 9:24/18.
Study
EGAS00001004070
Comparison of 3 protocols for deriving pancreatic progenitors from hPSC with RNA-seq and ATAC-seq
Several distinct differentiation protocols for deriving pancreatic progenitors (PP) from human pluripotent stem cells have been described but it remains to be shown how similar the PP are across protocols, and how well they resemble their in vivo counterparts. Here we evaluated three differentiation protocols; performed RNA and ATAC-seq on isolated PP derived with these, and compared them to fetal human pancreas populations. This enabled us to define a shared transcriptional and epigenomic signature of the PP, including several genes not previously implicated in pancreas development. Furthermore, we identified a significant and previously unappreciated cross-protocol variation of the PP through multi-omics analysis and demonstrate how such information can be applied to refine differentiation protocols for derivation of insulin-producing beta-like cells. Together, our study highlights the importance of a detailed characterization of defined cell populations derived from distinct differentiation protocols, and provides a valuable resource for exploring human pancreatic development.
Study
EGAS00001003513