Abstract
Correctly modeling the relationships between correlated, uncertain input data is crucial for producing accurate uncertainty estimates of model results. This requires both an uncertainty analysis that accounts for correlations and the appropriate communication of the results, so that other analysts can correctly interpret the reported uncertainties. However, neither is common practice in industrial ecology modeling. A typical case for correlated results is the disaggregation of a total value into uncertain shares, for which we present a practical yet robust approach to model the uncertainty. Our approach is based on two standard and two generalized Dirichlet distributions, and it uses the maximum entropy principle to choose minimally biased distribution parameters in the absence of specific known values. We discuss how correlation should be communicated to preserve accurate uncertainty information and provide examples to quantify the difference it makes to the results when the correlation is simplified or completely neglected. The proposed procedure will improve the accuracy of uncertainty quantification in Material Flow Analysis (e.g. where allocation coefficients split flows to sectors), Input Output Analysis (e.g. where aggregated environmental impact data has to be disaggregated to detailed economic sectors), and some instances in Life Cycle Assessment (e.g. where market shares are uncertain). Last but not least, to lower the technical barrier to applying these approaches, we provide easy-to-use Python and R packages which automate the approach.
| Original language | English |
|---|---|
| Journal | Journal of Industrial Ecology |
| Early online date | 20 Mar 2026 |
| DOIs | |
| Publication status | E-pub ahead of print - 20 Mar 2026 |
Data Availability Statement
The data and code that support the findings of this study are all openly available: The Python version of the package is available via PyPi and Anaconda. The source code is on GitHub: https://github.com/jakobsarthur/maxent_disaggregation. It is also archived on Zenodo (DOI: https://doi.org/10.5281/zenodo.17650501), and the package documentation is available at: https://maxent-disaggregation.readthedocs.io/en/latest/ . The R package created and used for this paper is available on GitHub: https://github.com/simschul/MaxentDisaggregation and archived on Zenodo (DOI: https://doi.org/10.5281/zenodo.17709080). The R code to reproduce all figures and the results from the case study is available on GitHub: https://github.com/simschul/uncertainty_disaggregation . The data needed to reproduce the case study is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.13806019) . The data behind Figs. 2, 4, 7 and 8 is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.18610842)Acknowledgements
We thank three anonymous reviewers for their valuable feedback on the paper and our sampling approach, in particular for pointing out the problems related to truncated distributions in an earlier draft. Moreover, we thank Stefan Pauliuk for the constructive discussion of the research idea.UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
-
SDG 12 Responsible Consumption and Production
Keywords
- Dirichlet
- Industrial ecology
- IO
- LCA
- Maximum entropy
- MFA
ASJC Scopus subject areas
- General Environmental Science
- General Social Sciences
Fingerprint
Dive into the research topics of 'When correlation matters: a practical guide to dealing with uncertainty in the case of data disaggregation'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS