Dataset 1: Genotyping results for 111 single nucleotide polymorphisms (SNPs) typed in 2486 Plasmodium falciparum samples collected from primary school children during a parasitological survey in western Kenya in 2009 and 2010.
datasetposted on 07.04.2017 by Irene Omedo, Polycarp Mogeni, Kirk Rockett
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The columns contain the following information: sample_id, unique sample identifier; admin1, provincial location of school; district_name, district location of school; date_visit, date of sample collection; assay_code, name of assay; allele1 and allele2, alternative alleles at a specific SNP position; result, genotype call after processing; allele_ratio1, proportion of allele 1; allele_ratio2, proportion of allele 2; pass_fail, coding of SNP based on availability of valid genotype (pass=1) or lack of a valid genotype (fail=0). Geospatial data for individual school locations is considered sensitive data and therefore cannot be made open access. However, it can be accessed through a request to our data governance committee at email@example.com. The criteria for such access is specified in detail in the data sharing guidelines under which the DGC operates, and relates to a) addressing health research, b)operating within the bounds of informed consent, c)complying with confidentiality procedures, d) mitigating potential harm to participants in research.