Skip to main navigation Skip to search Skip to main content

Abstract

Correctly modeling the relationships between correlated, uncertain input data is crucial for producing accurate uncertainty estimates of model results. This requires both an uncertainty analysis that accounts for correlations and the appropriate communication of the results, so that other analysts can correctly interpret the reported uncertainties. However, neither is common practice in industrial ecology modeling. A typical case for correlated results is the disaggregation of a total value into uncertain shares, for which we present a practical yet robust approach to model the uncertainty. Our approach is based on two standard and two generalized Dirichlet distributions, and it uses the maximum entropy principle to choose minimally biased distribution parameters in the absence of specific known values. We discuss how correlation should be communicated to preserve accurate uncertainty information and provide examples to quantify the difference it makes to the results when the correlation is simplified or completely neglected. The proposed procedure will improve the accuracy of uncertainty quantification in Material Flow Analysis (e.g. where allocation coefficients split flows to sectors), Input Output Analysis (e.g. where aggregated environmental impact data has to be disaggregated to detailed economic sectors), and some instances in Life Cycle Assessment (e.g. where market shares are uncertain). Last but not least, to lower the technical barrier to applying these approaches, we provide easy-to-use Python and R packages which automate the approach.

Original languageEnglish
JournalJournal of Industrial Ecology
Early online date20 Mar 2026
DOIs
Publication statusE-pub ahead of print - 20 Mar 2026

Data Availability Statement

The data and code that support the findings of this study are all openly available: The Python version of the package is available via PyPi and Anaconda. The source code is on GitHub: https://github.com/jakobsarthur/maxent_disaggregation. It is also archived on Zenodo (DOI: https://doi.org/10.5281/zenodo.17650501), and the package documentation is available at: https://maxent-disaggregation.readthedocs.io/en/latest/ . The R package created and used for this paper is available on GitHub: https://github.com/simschul/MaxentDisaggregation and archived on Zenodo (DOI: https://doi.org/10.5281/zenodo.17709080). The R code to reproduce all figures and the results from the case study is available on GitHub: https://github.com/simschul/uncertainty_disaggregation . The data needed to reproduce the case study is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.13806019) . The data behind Figs. 2, 4, 7 and 8 is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.18610842)

Acknowledgements

We thank three anonymous reviewers for their valuable feedback on the paper and our sampling approach, in particular for pointing out the problems related to truncated distributions in an earlier draft. Moreover, we thank Stefan Pauliuk for the constructive discussion of the research idea.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy
  2. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production

Keywords

  • Dirichlet
  • Industrial ecology
  • IO
  • LCA
  • Maximum entropy
  • MFA

ASJC Scopus subject areas

  • General Environmental Science
  • General Social Sciences

Fingerprint

Dive into the research topics of 'When correlation matters: a practical guide to dealing with uncertainty in the case of data disaggregation'. Together they form a unique fingerprint.

Cite this