University of Michigan Center for Statistical 
Genetics
Search Liming's website
 
 

 
 

Genotype-Based Matching to Correct for Population Stratification in
Large-Scale Case-Control Genetic Association Studies

Case-control association tests are generally more powerful than family-based association tests but population stratification can lead to spurious disease-marker association or mask a true association. We propose a similarity score matching approach that matches cases with controls and perform association test condition on the matched set so as to adjust for underlying population structures and potentially increase power. The genetic similarity score matching analysis consists of three steps:

      1) Similarity score calculation for each pair of case and control
      2) Optimal full matching to match cases with controls
      3) Conditional logistic regression (additive or 2 d.f. test)

Comments and suggestions are appreciated! Please email me: lliang@hsph.harvard.edu and Weihua Guan wguan@umn.edu

Precompiled binaries for Windows, example data and instructions.
Mac OS
If you are using Mac OS, please read this.
Precompiled binaries for Linux (64bit system), example data and instructions.
Precompiled binaries for Linux (32bit system), example data and instructions.

Instruction

Input data format, meaning of parameters and output interpretation

Update history

We start to add new features and document all updates and bug fixes

Citation:

Weihua Guan*, Liming Liang*, Michael Boehnke, Gonçalo R. Abecasis (2009). Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies. Genet Epidemiol DOI:10.1002/gepi.20403

* These authors contributed equally to this work.


 
 

University of Michigan | School of Public Health