Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression

Duolao Wang, Wenyang Zhang, Ameet Bakhai

Research output: Contribution to journalArticlepeer-review

107 Citations (SciVal)


Logistic regression is the standard method for assessing predictors of diseases. In logistic regression analyses, a stepwise strategy is often adopted to choose a subset of variables. Inference about the predictors is then made based on the chosen model constructed of only those variables retained in that model. This method subsequently ignores both the variables not selected by the procedure, and the uncertainty due to the variable selection procedure. This limitation may be addressed by adopting a Bayesian model averaging approach, which selects a number of all possible such models, and uses the posterior probabilities of these models to perforrn all inferences and predictions. This study compares the Bayesian model averaging approach with the stepwise procedures for selection of predictor variables in logistic regression using simulated data sets and the Framingham Heart Study data. The results show that in most cases Bayesian model averaging selects the correct model and out-performs stepwise approaches at predicting an event of interest. Copyright (C) 2004 John Wiley Sons, Ltd.
Original languageEnglish
Pages (from-to)3451-3467
Number of pages17
JournalStatistics in medicine
Issue number22
Publication statusPublished - 30 Nov 2004


Dive into the research topics of 'Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression'. Together they form a unique fingerprint.

Cite this