Hou Ping, Cai Jiarui, Qu Shen, Xu Ming
School for Environment and Sustainability , University of Michigan , Ann Arbor , Michigan 48109 , United States.
Michigan Institute for Computational Discovery and Engineering , University of Michigan , Ann Arbor , Michigan 48104 , United States.
Environ Sci Technol. 2018 May 1;52(9):5259-5267. doi: 10.1021/acs.est.7b05366. Epub 2018 Apr 6.
In life cycle assessment (LCA), collecting unit process data from the empirical sources (i.e., meter readings, operation logs/journals) is often costly and time-consuming. We propose a new computational approach to estimate missing unit process data solely relying on limited known data based on a similarity-based link prediction method. The intuition is that similar processes in a unit process network tend to have similar material/energy inputs and waste/emission outputs. We use the ecoinvent 3.1 unit process data sets to test our method in four steps: (1) dividing the data sets into a training set and a test set; (2) randomly removing certain numbers of data in the test set indicated as missing; (3) using similarity-weighted means of various numbers of most similar processes in the training set to estimate the missing data in the test set; and (4) comparing estimated data with the original values to determine the performance of the estimation. The results show that missing data can be accurately estimated when less than 5% data are missing in one process. The estimation performance decreases as the percentage of missing data increases. This study provides a new approach to compile unit process data and demonstrates a promising potential of using computational approaches for LCA data compilation.
在生命周期评估(LCA)中,从实证来源(即仪表读数、操作日志/记录)收集单元过程数据通常成本高昂且耗时。我们提出了一种新的计算方法,仅基于基于相似性的链接预测方法,依靠有限的已知数据来估计缺失的单元过程数据。其直观依据是,单元过程网络中的相似过程往往具有相似的材料/能量输入和废物/排放输出。我们使用ecoinvent 3.1单元过程数据集,通过四个步骤来测试我们的方法:(1)将数据集划分为训练集和测试集;(2)随机删除测试集中指定为缺失的一定数量的数据;(3)使用训练集中不同数量的最相似过程的相似性加权平均值来估计测试集中的缺失数据;(4)将估计数据与原始值进行比较,以确定估计的性能。结果表明,当一个过程中缺失的数据少于5%时,可以准确估计缺失数据。随着缺失数据百分比的增加,估计性能会下降。本研究提供了一种编制单元过程数据的新方法,并展示了使用计算方法进行LCA数据编制的广阔潜力。