DOI: 10.2741/882

Frontiers in Bioscience-Landmark (FBL) is published by IMR Press from Volume 26 Issue 5 (2021). Previous articles were published by another publisher on a subscription basis, and they are hosted by IMR Press as a courtesy and upon agreement with Frontiers in Bioscience.

Designing an optimum genetic association study using dense SNP markers and family-based sample
1 Division of Biostatistics, Washington University School of Medicine, St. Louis, MO 63110, USA
2 Departments of Genetics and Psychiatry, Washington University School of Medicine, St. Louis, MO 63110

Academic Editor: Heping Zhang

Front. Biosci. (Landmark Ed) 2003, 8(6), 68–80
Published: 1 January 2003
(This article belongs to the Special Issue Statistics and bioinformatics in medicine)

Genetic association analysis using thousands of single nucleotide polymorphism (SNP) markers has become a promising alternative to genome-wide linkage scans. Analysis based on linkage disequilibrium (LD) is more efficient than linkage analysis because it exploits meiotic information from past generations. However, in addition to the physical distance between the disease locus and a marker locus, numerous other factors such as admixture, genetic drift, and multiple mutations can affect the observed value of LD. The effects of these factors in a genomic LD association study must be carefully analyzed to obtain an efficient study design. In this review, we consider studies using family-based data and examine the effects of some of these important design factors, including the sample size, the frequency of SNP markers, and marker density. For example, we conclude that (1) for reasonably frequent SNP markers, a moderately large sample of 500 families is appropriate for a moderately stringent significance level (α = 0.00009); (2) to maintain a power of 80%, the maximal difference in allele frequencies that can be tolerated between the disease gene and a SNP marker ranges from 0.1 (under an additive model) to 0.5 (under a multiplicative model); (3) a map density of 10 cM is appropriate only under an ideal scenario (moderately large sample size, equal trait/marker allele frequencies, maximum LD strength, etc.). The results shown here should have practical implications for designing efficient LD association studies using dense SNP markers.
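As a rough illustration of why the match between trait and marker allele frequencies matters, the sketch below computes the standard pairwise LD measures D, |D'|, and r² for two biallelic loci, and the largest r² attainable for a given pair of allele frequencies. It is not the power calculation used in the article. The function names and the example frequencies are hypothetical, and `max_r2` assumes the two rarer alleles sit on the same haplotype (D > 0).

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a:  frequency of allele A (e.g. at the disease locus)
    p_b:  frequency of allele B (e.g. at the SNP marker)
    """
    d = p_ab - p_a * p_b
    # The largest |D| compatible with the allele frequencies depends on the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


def max_r2(p_a, p_b):
    """Upper bound on r^2 when |D'| = 1 and D > 0 (assumption of this sketch)."""
    lo, hi = sorted((p_a, p_b))
    return lo * (1 - hi) / (hi * (1 - lo))


if __name__ == "__main__":
    # Hypothetical disease allele frequency 0.2 paired with markers of varying frequency.
    for p_marker in (0.2, 0.3, 0.5, 0.7):
        print(p_marker, round(max_r2(0.2, p_marker), 3))
```

Even with complete LD (|D'| = 1), r² reaches 1 only when the two allele frequencies are equal and falls as they diverge, which is one way to see why the tolerable frequency difference is a design factor.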
