Incorporating high-dimensional exposure modelling into studies of air pollution and health

Yi Liu, Gavin Shaddick, James V. Zidek

Research output: Contribution to journalArticle

Abstract

Performing studies on the risks of environmental hazards on human health requires accurate estimates of exposures that might be experienced by the populations at risk. Often there will be missing data and in many epidemiological studies, the locations and times of exposure measurements and health data do not match. To a large extent this will be due to the health and exposure data having arisen from completely different data sources and not as the result of a carefully designed study, leading to problems of both ‘change of support’ and ‘misaligned data’. In such cases, a direct comparison of the exposure and health outcome is often not possible without an underlying model to align the two in the spatial and temporal domains. The Bayesian approach provides the natural framework for such models; however, the large amounts of data that can arise from environmental networks means that inference using Markov Chain Monte Carlo might not be computationally feasible in this setting. Here we adapt the integrated nested Laplace approximation to implement spatio–temporal exposure models. We also propose methods for the integration of large-scale exposure models and health analyses. It is important that any model structure allows the correct propagation of uncertainty from the predictions of the exposure model through to the estimates of risk and associated confidence intervals. The methods are demonstrated using a case study of the levels of black smoke in the UK, measured over several decades, and respiratory mortality.

Original languageEnglish
Pages (from-to)559–581
Number of pages23
JournalStatistics in Biosciences
Volume9
Issue number2
Early online date13 Jun 2016
DOIs
Publication statusPublished - 1 Dec 2017

Fingerprint

Air Pollution
Air pollution
Health
High-dimensional
Modeling
Markov Chains
Bayes Theorem
Information Storage and Retrieval
Model structures
Model
Laplace Approximation
Smoke
Markov processes
Uncertainty
Epidemiologic Studies
Hazards
Markov Chain Monte Carlo
Missing Data
Confidence Intervals
Bayesian Approach

Keywords

  • Air pollution
  • Bayesian modelling
  • Health risks
  • INLA
  • Spatio–temporal models

Cite this

Incorporating high-dimensional exposure modelling into studies of air pollution and health. / Liu, Yi; Shaddick, Gavin; Zidek, James V.

In: Statistics in Biosciences, Vol. 9, No. 2, 01.12.2017, p. 559–581.

Research output: Contribution to journalArticle

@article{603e262a2ff04bfba487b518d53fdda1,
title = "Incorporating high-dimensional exposure modelling into studies of air pollution and health",
abstract = "Performing studies on the risks of environmental hazards on human health requires accurate estimates of exposures that might be experienced by the populations at risk. Often there will be missing data and in many epidemiological studies, the locations and times of exposure measurements and health data do not match. To a large extent this will be due to the health and exposure data having arisen from completely different data sources and not as the result of a carefully designed study, leading to problems of both ‘change of support’ and ‘misaligned data’. In such cases, a direct comparison of the exposure and health outcome is often not possible without an underlying model to align the two in the spatial and temporal domains. The Bayesian approach provides the natural framework for such models; however, the large amounts of data that can arise from environmental networks means that inference using Markov Chain Monte Carlo might not be computationally feasible in this setting. Here we adapt the integrated nested Laplace approximation to implement spatio–temporal exposure models. We also propose methods for the integration of large-scale exposure models and health analyses. It is important that any model structure allows the correct propagation of uncertainty from the predictions of the exposure model through to the estimates of risk and associated confidence intervals. The methods are demonstrated using a case study of the levels of black smoke in the UK, measured over several decades, and respiratory mortality.",
keywords = "Air pollution, Bayesian modelling, Health risks, INLA, Spatio–temporal models",
author = "Yi Liu and Gavin Shaddick and Zidek, {James V.}",
year = "2017",
month = "12",
day = "1",
doi = "10.1007/s12561-016-9150-3",
language = "English",
volume = "9",
pages = "559–581",
journal = "Statistics in Biosciences",
issn = "1867-1764",
publisher = "Springer Verlag",
number = "2",

}

TY - JOUR

T1 - Incorporating high-dimensional exposure modelling into studies of air pollution and health

AU - Liu, Yi

AU - Shaddick, Gavin

AU - Zidek, James V.

PY - 2017/12/1

Y1 - 2017/12/1

N2 - Performing studies on the risks of environmental hazards on human health requires accurate estimates of exposures that might be experienced by the populations at risk. Often there will be missing data and in many epidemiological studies, the locations and times of exposure measurements and health data do not match. To a large extent this will be due to the health and exposure data having arisen from completely different data sources and not as the result of a carefully designed study, leading to problems of both ‘change of support’ and ‘misaligned data’. In such cases, a direct comparison of the exposure and health outcome is often not possible without an underlying model to align the two in the spatial and temporal domains. The Bayesian approach provides the natural framework for such models; however, the large amounts of data that can arise from environmental networks means that inference using Markov Chain Monte Carlo might not be computationally feasible in this setting. Here we adapt the integrated nested Laplace approximation to implement spatio–temporal exposure models. We also propose methods for the integration of large-scale exposure models and health analyses. It is important that any model structure allows the correct propagation of uncertainty from the predictions of the exposure model through to the estimates of risk and associated confidence intervals. The methods are demonstrated using a case study of the levels of black smoke in the UK, measured over several decades, and respiratory mortality.

AB - Performing studies on the risks of environmental hazards on human health requires accurate estimates of exposures that might be experienced by the populations at risk. Often there will be missing data and in many epidemiological studies, the locations and times of exposure measurements and health data do not match. To a large extent this will be due to the health and exposure data having arisen from completely different data sources and not as the result of a carefully designed study, leading to problems of both ‘change of support’ and ‘misaligned data’. In such cases, a direct comparison of the exposure and health outcome is often not possible without an underlying model to align the two in the spatial and temporal domains. The Bayesian approach provides the natural framework for such models; however, the large amounts of data that can arise from environmental networks means that inference using Markov Chain Monte Carlo might not be computationally feasible in this setting. Here we adapt the integrated nested Laplace approximation to implement spatio–temporal exposure models. We also propose methods for the integration of large-scale exposure models and health analyses. It is important that any model structure allows the correct propagation of uncertainty from the predictions of the exposure model through to the estimates of risk and associated confidence intervals. The methods are demonstrated using a case study of the levels of black smoke in the UK, measured over several decades, and respiratory mortality.

KW - Air pollution

KW - Bayesian modelling

KW - Health risks

KW - INLA

KW - Spatio–temporal models

UR - http://www.scopus.com/inward/record.url?scp=84974823989&partnerID=8YFLogxK

UR - http://dx.doi.org/10.1007/s12561-016-9150-3

U2 - 10.1007/s12561-016-9150-3

DO - 10.1007/s12561-016-9150-3

M3 - Article

VL - 9

SP - 559

EP - 581

JO - Statistics in Biosciences

JF - Statistics in Biosciences

SN - 1867-1764

IS - 2

ER -