LRTae: improving statistical power for genetic association with case/control data when phenotype and/or genotype misclassification errors are present

S Barral, C Haynes, M Stone, D Gordon

Research output: Contribution to journalArticle

  • 10 Citations

Abstract

Background: In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors. We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors ( LRTae) and is freely available via software download. Results: We applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors. The phenotype considered is Ankylosing Spondylitis ( AS). The LRTae method power was always greater than LRTstd power for the significance levels considered ( 5%, 1%, 0.1%, 0.01%). Power gains for the LRTae method over the LRTstd method increased as the significance level became more stringent. Multi-locus genotype frequency estimates using LRTae method were more accurate than estimates using LRTstd method. Conclusion: The LRTae method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present. Furthermore, the LRTae method provides asymptotically unbiased estimates of case and control genotype frequencies, as well as estimates of phenotype and/or genotype misclassification rates.
LanguageEnglish
JournalBmc Genetics
Volume7
DOIs
StatusPublished - 2006

Fingerprint

Genotype
Phenotype
Haplotypes
Case-Control Studies
Ankylosing Spondylitis
Software
Population

Cite this

LRTae: improving statistical power for genetic association with case/control data when phenotype and/or genotype misclassification errors are present. / Barral, S; Haynes, C; Stone, M; Gordon, D.

In: Bmc Genetics, Vol. 7, 2006.

Research output: Contribution to journalArticle

@article{498017cfa6be487f8d24b00e15f937e9,
title = "LRTae: improving statistical power for genetic association with case/control data when phenotype and/or genotype misclassification errors are present",
abstract = "Background: In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors. We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors ( LRTae) and is freely available via software download. Results: We applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors. The phenotype considered is Ankylosing Spondylitis ( AS). The LRTae method power was always greater than LRTstd power for the significance levels considered ( 5{\%}, 1{\%}, 0.1{\%}, 0.01{\%}). Power gains for the LRTae method over the LRTstd method increased as the significance level became more stringent. Multi-locus genotype frequency estimates using LRTae method were more accurate than estimates using LRTstd method. Conclusion: The LRTae method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present. Furthermore, the LRTae method provides asymptotically unbiased estimates of case and control genotype frequencies, as well as estimates of phenotype and/or genotype misclassification rates.",
author = "S Barral and C Haynes and M Stone and D Gordon",
note = "ID number: ISI:000237846300001",
year = "2006",
doi = "10.1186/1471-2156-7-24",
language = "English",
volume = "7",
journal = "Bmc Genetics",
issn = "1471-2156",
publisher = "BioMed Central",

}

TY - JOUR

T1 - LRTae: improving statistical power for genetic association with case/control data when phenotype and/or genotype misclassification errors are present

AU - Barral,S

AU - Haynes,C

AU - Stone,M

AU - Gordon,D

N1 - ID number: ISI:000237846300001

PY - 2006

Y1 - 2006

N2 - Background: In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors. We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors ( LRTae) and is freely available via software download. Results: We applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors. The phenotype considered is Ankylosing Spondylitis ( AS). The LRTae method power was always greater than LRTstd power for the significance levels considered ( 5%, 1%, 0.1%, 0.01%). Power gains for the LRTae method over the LRTstd method increased as the significance level became more stringent. Multi-locus genotype frequency estimates using LRTae method were more accurate than estimates using LRTstd method. Conclusion: The LRTae method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present. Furthermore, the LRTae method provides asymptotically unbiased estimates of case and control genotype frequencies, as well as estimates of phenotype and/or genotype misclassification rates.

AB - Background: In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors. We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors ( LRTae) and is freely available via software download. Results: We applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors. The phenotype considered is Ankylosing Spondylitis ( AS). The LRTae method power was always greater than LRTstd power for the significance levels considered ( 5%, 1%, 0.1%, 0.01%). Power gains for the LRTae method over the LRTstd method increased as the significance level became more stringent. Multi-locus genotype frequency estimates using LRTae method were more accurate than estimates using LRTstd method. Conclusion: The LRTae method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present. Furthermore, the LRTae method provides asymptotically unbiased estimates of case and control genotype frequencies, as well as estimates of phenotype and/or genotype misclassification rates.

U2 - 10.1186/1471-2156-7-24

DO - 10.1186/1471-2156-7-24

M3 - Article

VL - 7

JO - Bmc Genetics

T2 - Bmc Genetics

JF - Bmc Genetics

SN - 1471-2156

ER -