Abstract
Self-diffusion coefficients, D*, are routinely estimated from molecular dynamics simulations by fitting a linear model to the observed mean squared displacements (MSDs) of mobile species. MSDs derived from simulations exhibit statistical noise that causes uncertainty in the resulting estimate of D*. An optimal scheme for estimating D* minimizes this uncertainty, i.e., it will have high statistical efficiency, and also gives an accurate estimate of the uncertainty itself. We present a scheme for estimating D* from a single simulation trajectory with a high statistical efficiency and accurately estimating the uncertainty in the predicted value. The statistical distribution of MSDs observable from a given simulation is modeled as a multivariate normal distribution using an analytical covariance matrix for an equivalent system of freely diffusing particles, which we parametrize from the available simulation data. We use Bayesian regression to sample the distribution of linear models that are compatible with this multivariate normal distribution to obtain a statistically efficient estimate of D* and an accurate estimate of the associated statistical uncertainty.
Original language | English |
---|---|
Pages (from-to) | 79-87 |
Number of pages | 9 |
Journal | Journal of Chemical Theory and Computation |
Volume | 21 |
Issue number | 1 |
Early online date | 30 Dec 2024 |
DOIs | |
Publication status | Published - 14 Jan 2025 |
Data Availability Statement
Additional supporting information is available at ref 51 underan MIT license, including a complete set of analysis/plottingscripts allowing for a fully reproducible and automated analysisworkflow, using SHOWYOURWORK.52 The LLZO raw simulationtrajectories are available on Zenodo shared under a CC BY-SA4.0 license.53 The method outlined in this work isimplemented in the open-source Python package KINISI,22which is available under an MIT license.Acknowledgements
The authors thank Jacob M. Dean and Tim Rogers for their valuable input in checking the mathematical derivations that make up the appendices, the beta-testers for the kinisi package, and Rodrigo Luger and Daniel Foreman-Mackey for their help using showyourwork.The authors acknowledge the University of Bath’s Research Computing Group for their support in running the LLZO molecular dynamics simulations. Other simulations and analyses were carried out using the Data Management and Software Centre computing cluster at the European Spallation Source ERIC.
Funding
This work used the Isambard 2 UK National Tier-2 HPC Service (http://gw4.ac.uk/isambard/) operated by GW4 and the UK Met Office and funded by EPSRC (EP/T022078/1). S.W.C. and B.J.M. acknowledge the support of the Faraday Institution through the CATMAT Project (Grant FIRG016). B.J.M. acknowledges support from the Royal Society (UF130329 and URF\R\191006).
Funders | Funder number |
---|---|
Engineering and Physical Sciences Research Council | EP/T022078/1 |
The Royal Society | UF130329, URF\R\191006 |
Faraday Institution CATMAT | FIRG016 |