Constructing a Genome-Wide LD Map of Wild A. gambiae Using Next-Generation Sequencing Article

Wang, X, Afrane, YA, Yan, G et al. (2015). Constructing a Genome-Wide LD Map of Wild A. gambiae Using Next-Generation Sequencing . BIOMED RESEARCH INTERNATIONAL, 2015 10.1155/2015/238139

cited authors

  • Wang, X; Afrane, YA; Yan, G; Li, J

abstract

  • Anopheles gambiae is the major malaria vector in Africa. Examining the molecular basis of A. gambiae traits requires knowledge of both genetic variation and genome-wide linkage disequilibrium (LD) map of wild A. gambiae populations from malaria-endemic areas. We sequenced the genomes of nine wild A. gambiae mosquitoes individually using next-generation sequencing technologies and detected 2,219,815 common single nucleotide polymorphisms (SNPs), 88% of which are novel. SNPs are not evenly distributed across A. gambiae chromosomes. The low SNP-frequency regions overlay heterochromatin and chromosome inversion domains, consistent with the lower recombinant rates at these regions. Nearly one million SNPs that were genotyped correctly in all individual mosquitoes with 99.6% confidence were extracted from these high-throughput sequencing data. Based on these SNP genotypes, we constructed a genome-wide LD map for wild A. gambiae from malaria-endemic areas in Kenya and made it available through a public Website. The average size of LD blocks is less than 40 bp, and several large LD blocks were also discovered clustered around the para gene, which is consistent with the effect of insecticide selective sweeps. The SNPs and the LD map will be valuable resources for scientific communities to dissect the A. gambiae genome.

publication date

  • January 1, 2015

published in

Digital Object Identifier (DOI)

volume

  • 2015