Accurate Estimation of Diffusion Coefficients and their Uncertainties from Computer Simulation

Andrew R. McCluskey, Samuel W. Coles, Benjamin Morgan

Research output: Contribution to journalArticlepeer-review

2 Citations (SciVal)

Abstract

Self-diffusion coefficients, D*, are routinely estimated from molecular dynamics simulations by fitting a linear model to the observed mean squared displacements (MSDs) of mobile species. MSDs derived from simulations exhibit statistical noise that causes uncertainty in the resulting estimate of D*. An optimal scheme for estimating D* minimizes this uncertainty, i.e., it will have high statistical efficiency, and also gives an accurate estimate of the uncertainty itself. We present a scheme for estimating D* from a single simulation trajectory with a high statistical efficiency and accurately estimating the uncertainty in the predicted value. The statistical distribution of MSDs observable from a given simulation is modeled as a multivariate normal distribution using an analytical covariance matrix for an equivalent system of freely diffusing particles, which we parametrize from the available simulation data. We use Bayesian regression to sample the distribution of linear models that are compatible with this multivariate normal distribution to obtain a statistically efficient estimate of D* and an accurate estimate of the associated statistical uncertainty.

Original languageEnglish
Pages (from-to)79-87
Number of pages9
JournalJournal of Chemical Theory and Computation
Volume21
Issue number1
Early online date30 Dec 2024
DOIs
Publication statusPublished - 14 Jan 2025

Data Availability Statement

Additional supporting information is available at ref 51 underan MIT license, including a complete set of analysis/plottingscripts allowing for a fully reproducible and automated analysisworkflow, using SHOWYOURWORK.52 The LLZO raw simulationtrajectories are available on Zenodo shared under a CC BY-SA4.0 license.53 The method outlined in this work isimplemented in the open-source Python package KINISI,22which is available under an MIT license.

Acknowledgements

The authors thank Jacob M. Dean and Tim Rogers for their valuable input in checking the mathematical derivations that make up the appendices, the beta-testers for the kinisi package, and Rodrigo Luger and Daniel Foreman-Mackey for their help using showyourwork.

The authors acknowledge the University of Bath’s Research Computing Group for their support in running the LLZO molecular dynamics simulations. Other simulations and analyses were carried out using the Data Management and Software Centre computing cluster at the European Spallation Source ERIC.

Funding

This work used the Isambard 2 UK National Tier-2 HPC Service (http://gw4.ac.uk/isambard/) operated by GW4 and the UK Met Office and funded by EPSRC (EP/T022078/1). S.W.C. and B.J.M. acknowledge the support of the Faraday Institution through the CATMAT Project (Grant FIRG016). B.J.M. acknowledges support from the Royal Society (UF130329 and URF\R\191006).

FundersFunder number
Engineering and Physical Sciences Research CouncilEP/T022078/1
The Royal SocietyUF130329, URF\R\191006
Faraday Institution CATMATFIRG016

Fingerprint

Dive into the research topics of 'Accurate Estimation of Diffusion Coefficients and their Uncertainties from Computer Simulation'. Together they form a unique fingerprint.

Cite this