Text:
 
				Increase font size
 
				Decrease font size
 
                
  
  Simulation Details underlying AbCD
  
Results presented by AbCD were generated using the following simulation protocol:
(1)	We first randomly simulated 10 1Mb regions using the 
cosi bestfit models. 
Table 1 in the 
cosi 
accompanying publication (Schaffner et al 2005 Genome Research) shows the parameters calibrated for the bestfit models which mimic the level of sequence variation, 
pattern of linkage disequilibrium, recombination rates and demographical history of four major populations: AA (African American), AF (African), AN (Asian), and EU (European). For 
each region, we simulated 450,000 chromosomes. 
(2)	Within each region, we then randomly picked 2
n chromosomes from the population of 450,000 to form 
ndiploid individuals, where 
n is referred to as 
sample size or number of individuals sequenced in the subsequent text. 
(3)	From the chromosomes picked in (2), we used 
ShotGun to generate short reads mimicking those from the 
Illumina Solexa technologies for 10 pre-specified sequencing depths (d = 0.5X, 2X, 4X, 6X, 8X, 10X, 15X, 20X, 25X and 30X).
(4)	We then performed LD-based genotyping calling using 
thunder on the short reads 
generated in (3). 
(5)	Finally, for each design (one set of
n, d, and ethnicity), we summarized several key statistics by taking an average across the ten simulated regions for each of the 
following seven MAF categories: (i) 0-0.1%; (ii) 0.1-0.2%; (iii) 0.2-0.5%; (iv) 0.5-1%; (v) 1-2%; (vi) 2-5%; (vii) 5-50%. The MAF-specific statistics summarized are: (a) Number of 
polymorphisms in the population of 450,000 chromosomes; (b) Number of variants segregating in the sample of 
nsequenced individuals; (c) Percent of 
all variants (that 
is, (a)) detected which is upper bounded by (b) divided by (a); (d) Average information content which is measured by dosage r2 the squared Pearson correlation between imputed dosages 
and their corresponding true genotypes; and (e) Effective sample size which is the multiplication of 
n and average information content. 
Notes:
   (*) The same simulation protocol was adopted in 
our published thunder paper.
   (*) The above steps (2)-(5) are implemented in our DesignPlanner C-shell script wrapper, which, together with all the software used and 10 regions each of 100Kb length 
simulated by cosi, can be downloaded via 
the ShotGun Download Page.