Have a personal or library account? Click to login
Building Infrastructure for African Human Genomic Data Management Cover

Building Infrastructure for African Human Genomic Data Management

Open Access
|Sep 2019

Figures & Tables

Table 1

Description of data types for submission.

Exome/Whole Genome Sequence16S rRNA Microbiome studiesGenome Wide Association studies/genotyping arrays
Study type and descriptionStudy type and descriptionStudy type and description
Sequencing platform and technology usedSequencing platform and technology usedGenotyping array model/name and description of the software and version used for calling the genotypes
FASTQ files linked with de-identified participant ID (minus technical reads such as adapters, linkers, barcodes)FASTQ files linked with de-identified participant ID (minus technical reads such as adapters, linkers, barcodes)Raw intensity files linked with de-identified participant IDs (IDATs, CELs)
Binary Alignment files (BAMs, de-multiplexed) – linked with participant de-identified IDManifest file describing SNP or probe content on the genotyping array
Associated phenotypic data collectedAssociated phenotypic data collectedAssociated phenotypic data collected
Variant calling files (VCFs)Final analyses BIOM files (at minimum must contain OTUs)Final reports and analysis files generated
Mapping file indicating the relationship between the submitted filesMapping file indicating the relationship between the submitted filesMapping file indicating the relationship between the submitted files (completed Array Format template)
dsj-18-951-g1.png
Figure 1

Timeline for submission of data to public repositories, extracted from the H3Africa Data Sharing, Access and release policy.

dsj-18-951-g2.png
Figure 2

Diagram showing the process for submission of data to the Archive and EGA.

Table 2

Example speeds for time for moving data within and between the Archive and EGA.

FromToAverage Mbp/s (Megabits)Average Mb/s (Megabytes)Time to transfer (days)Size
1VaultLanding Area1201568.9 TB
2Other local serverEGA (Aspera)162908.9 TB
3VaultHard Drive (directly into port)2002538.9 TB
4Landing AreaEGA (Aspera)2403028.9 TB
Language: English
Submitted on: Jan 31, 2019
|
Accepted on: Sep 12, 2019
|
Published on: Sep 26, 2019
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2019 Ziyaad Parker, Suresh Maslamoney, Ayton Meintjes, Gerrit Botha, Sumir Panji, Scott Hazelhurst, Nicola Mulder, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.