Abstract
Logistic regression is the standard method for assessing predictors of diseases. In logistic regression analyses, a stepwise strategy is often adopted to choose a subset of variables. Inference about the predictors is then made based on the chosen model constructed of only those variables retained in that model. This method subsequently ignores both the variables not selected by the procedure, and the uncertainty due to the variable selection procedure. This limitation may be addressed by adopting a Bayesian model averaging approach, which selects a number of all possible such models, and uses the posterior probabilities of these models to perforrn all inferences and predictions. This study compares the Bayesian model averaging approach with the stepwise procedures for selection of predictor variables in logistic regression using simulated data sets and the Framingham Heart Study data. The results show that in most cases Bayesian model averaging selects the correct model and out-performs stepwise approaches at predicting an event of interest. Copyright (C) 2004 John Wiley Sons, Ltd.
Original language | English |
---|---|
Pages (from-to) | 3451-3467 |
Number of pages | 17 |
Journal | Statistics in Medicine |
Volume | 23 |
Issue number | 22 |
DOIs | |
Publication status | Published - 30 Nov 2004 |