Authors
Xiuwen Zheng, David Levine, Jess Shen, Stephanie M Gogarten, Cathy Laurie, Bruce S Weir
Publication date
2012/12
Journal
Bioinformatics
Volume
28
Issue
24
Pages
3326-3328
Publisher
Oxford University Press
Description
Summary: Genome-wide association studies are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed gdsfmt and SNPRelate (R packages for multi-core symmetric multiprocessing computer architectures) to accelerate two key computations on SNP data: principal component analysis (PCA) and relatedness analysis using identity-by-descent measures. The kernels of our algorithms are written in C/C++ and highly optimized. Benchmarks show the uniprocessor implementations of PCA and identity-by-descent are ∼8–50 times faster than the implementations provided in the popular EIGENSTRAT (v3.0) and PLINK (v1.07) programs, respectively, and can be sped up to 30–300-fold by using eight cores. SNPRelate can analyse tens of thousands of samples with millions of SNPs. For example, our package was used to perform PCA …
Total citations
2013201420152016201720182019202020212022202320248324285130193240265337376318212
Scholar articles