Emirati Phased Diploid T2T Trio-Assembly of a Female Individual

This dataset contains a telomere-to-telomere (T2T), trio-based genome assembly in FASTA format for a single female Emirati individual (proband) generated using parental information for accurate phasing. Sequencing data comprised PacBio HiFi (>60X per parent, >120X offspring), ONT ultra-long reads (>110X offspring), and Illumina short-read WGS (>100X for all three individuals). The offspring genome was assembled in a trio framework and finished with NTLink (initial scaffolding), RagTag (reference-guided refinement), and Quartett (gap filling). The resulting contiguous assembly serves as a high-quality, population-relevant reference suitable for downstream variant discovery and integration into the pangenome.
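Since the description stresses contiguity, a reader with access may want to sanity-check a downloaded haplotype FASTA. The following is a minimal sketch, not part of the dataset's own tooling: the function names `read_fasta` and `assembly_stats` are hypothetical, and it assumes an uncompressed FASTA in which gaps are represented by `N` bases. It reports the number of sequences, total bases, N50, and remaining gap bases.

```python
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            parts = line[1:].split()
            name = parts[0] if parts else ""  # keep only the ID before any description
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def assembly_stats(records: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Summarize sequence count, total length, N50, and N (gap) bases."""
    lengths = []
    gap_bases = 0
    for _, seq in records:
        lengths.append(len(seq))
        gap_bases += seq.count("N") + seq.count("n")
    lengths.sort(reverse=True)
    total = sum(lengths)
    n50 = 0
    running = 0
    for length in lengths:
        running += length
        if running * 2 >= total:  # first length at which half the assembly is covered
            n50 = length
            break
    return {
        "sequences": len(lengths),
        "total_bases": total,
        "n50": n50,
        "gap_bases": gap_bases,
    }


# Example with an in-memory toy FASTA. For a real file, pass an open file handle,
# e.g. open("hap1.fasta") where "hap1.fasta" is a hypothetical local path.
toy = ">chrA\nACGT\nNN\n>chrB toy record\nAC\n".splitlines()
stats = assembly_stats(read_fasta(toy))
```

For a gap-free T2T haplotype one would expect `gap_bases` to be zero and N50 to be on the order of a chromosome length, but the actual values for these files are not stated on this page.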

Request Access

Data access policy for UAE Genomes data.

Data access is granted upon written request for non-commercial research use only. Bona fide academic users can apply for data access under the following conditions:
1. Data is provided for non-clinical, non-commercial research purposes only.
2. Data is not distributed to any other individual or entity without the UAE Genomes Data Access Committee's permission.
3. The data is experimental in nature and must not be used to make any clinical decisions.
4. The data is provided with no warranties, expressed or implied, and employees or agents of Khalifa University of Science and Technology have no liability in connection with its use.
If you agree with the conditions above, please apply for access by email indicating your consent.

Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.

Study ID Study Title Study Type
EGAS50000001234 Population Genomics
EGAS50000001235 Population Genomics

This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.

ID File Type Size Quality Report Located in
EGAF50000425210 Hap2 3.1 GB
EGAF50000425211 Hap1 3.1 GB
2 Files (6.1 GB)