Abstract
We introduce a deep learning accelerated methodology to solve PDE-based Bayesian inverse problems with guaranteed accuracy. This is motivated by solving the ill-posed problem of inferring a spatio-temporal heat-flux parameter known as the Biot number in a PDE model given temperature data; however, the methodology is generalizable to other settings. To achieve accelerated Bayesian inference we develop a novel training scheme that uses data to adaptively train a neural-network surrogate simulating the parametric forward model. By simultaneously identifying an approximate posterior distribution over the Biot number and weighting a physics-informed training loss according to this, our approach approximates a forward and inverse solution together without any need for external solves. Using a random Chebyshev series, we outline how to approximate a Gaussian process prior, and using the surrogate we apply Hamiltonian Monte Carlo (HMC) to sample from the posterior distribution. We derive convergence of the surrogate posterior to the true posterior distribution in the Hellinger metric as our adaptive loss approaches zero. Additionally, we describe how this surrogate-accelerated HMC approach can be combined with traditional PDE solvers in a delayed-acceptance scheme to a priori control the posterior accuracy. This overcomes a major limitation of deep learning-based surrogate approaches, which do not achieve guaranteed accuracy a priori due to their nonconvex training. Biot number calculations are involved in turbo-machinery design, which is safety critical and highly regulated, and therefore it is important that our results have such mathematical guarantees. Our approach achieves fast mixing in high-dimensional parameter spaces, while retaining the convergence guarantees of a traditional PDE solver, and without the burden of evaluating this solver for proposals that are likely to be rejected. A range of numerical results is given using real and simulated data that compare adaptive and general training regimes and various gradient-based Markov chain Monte Carlo sampling methods.
Original language | English |
---|---|
Pages (from-to) | 970-995 |
Number of pages | 26 |
Journal | SIAM/ASA Journal on Uncertainty Quantification |
Volume | 11 |
Issue number | 3 |
Early online date | 28 Aug 2023 |
DOIs | |
Publication status | Published - 1 Sept 2023 |
Bibliographical note
Funding Information:Last Received by the editors August 8, 2022; accepted for publication (in revised form) May 1, 2023; published electronically August 28, 2023. https://doi.org/10.1137/22M1513113 Funding: Support was provided by EPSRC CDT in Statistical Applied Mathematics at Bath (SAMBa), under the project EP/L015684/1. \dagger Department of Mathematical Sciences, University of Bath, Bath, UK ([email protected], [email protected], [email protected]).
Publisher Copyright:
© 2023 SIAM and ASA. Published by SIAM and ASA under the terms of the Creative Commons 4.0 license.
Funding
\ast Received by the editors August 8, 2022; accepted for publication (in revised form) May 1, 2023; published electronically August 28, 2023. https://doi.org/10.1137/22M1513113 Funding: Support was provided by EPSRC CDT in Statistical Applied Mathematics at Bath (SAMBa), under the project EP/L015684/1. \dagger Department of Mathematical Sciences, University of Bath, Bath, UK ([email protected], [email protected], [email protected]).
Keywords
- Bayesian inverse problems
- deep learning
- delayed-acceptance HMC
- surrogate models
ASJC Scopus subject areas
- Statistics and Probability
- Modelling and Simulation
- Statistics, Probability and Uncertainty
- Discrete Mathematics and Combinatorics
- Applied Mathematics