Fast and flexible simulation of DNA sequence data
Gary K. ChenDepartment of Preventive Medicine, University of Southern California, Los Angeles, California 90033, USAPaul MarjoramDepartment of Preventive Medicine, University of Southern California, Los Angeles, California 90033, USA;Jeffrey D. WallInstitute for Human Genetics and Department of Epidemiology and Biostatistics, University of California, San Francisco, California
94143, USA
2008en
ABI
Abstract
Simulation of genomic sequences under the coalescent with recombination has conventionally been impractical for regions beyond tens of megabases. This work presents an algorithm, implemented as the program MaCS (Markovian Coalescent Simulator), that can efficiently simulate haplotypes under any arbitrary model of population history. We present several metrics comparing the performance of MaCS with other available simulation programs. Practical usage of MaCS is demonstrated through a comparison of measures of linkage disequilibrium between generated program output and real genotype data from populations considered to be structured.
Identifiers
Citations and references
Cited by 20 references