Are all data useful? Inferring causality to predict flows across sewer and drainage systems using directed information and boosted regression trees.

Affiliations

Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, United States.

School for Environment and Sustainability, University of Michigan, Ann Arbor, United States.

Publication information

Water Res. 2018 Nov 15;145:697-706. doi: 10.1016/j.watres.2018.09.009. Epub 2018 Sep 4.

DOI: 10.1016/j.watres.2018.09.009
PMID: 30216864

Abstract

As more sensor data become available across urban water systems, it is often unclear which of these new measurements are actually useful and how they can be efficiently ingested to improve predictions. We present a data-driven approach for modeling and predicting flows across combined sewer and drainage systems, which fuses sensor measurements with output of a large numerical simulation model. Rather than adjusting the structure and parameters of the numerical model, as is commonly done when new data become available, our approach instead learns causal relationships between the numerically-modeled outputs, distributed rainfall measurements, and measured flows. By treating an existing numerical model - even one that may be outdated - as just another data stream, we illustrate how to automatically select and combine features that best explain flows for any given location. This allows for new sensor measurements to be rapidly fused with existing knowledge of the system without requiring recalibration of the underlying physics. Our approach, based on Directed Information (DI) and Boosted Regression Trees (BRT), is evaluated by fusing measurements across nearly 30 rain gages, 15 flow locations, and the outputs of a numerical sewer model in the city of Detroit, Michigan: one of the largest combined sewer systems in the world. The results illustrate that the Boosted Regression Trees provide skillful predictions of flow, especially when compared to an existing numerical model. The innovation of this paper is the use of the Directed Information step, which selects only those inputs that are causal with measurements at locations of interest. Better predictions are achieved when the Directed Information step is used because it reduces overfitting during the training phase of the predictive algorithm. 
In the age of "big water data", this finding highlights the importance of screening all available data sources before using them as inputs to data-driven models, since more may not always be better. We discuss the generalizability of the case study and the requirements of transferring the approach to other systems.
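
The screening idea described in the abstract (keep only inputs that carry causal information about a measured flow, then train the predictor on that reduced set) can be illustrated with a small sketch. The code below is not the authors' estimator: it substitutes a lag-1 transfer entropy, I(X_{t-1}; Y_t | Y_{t-1}), computed with a simple plug-in histogram estimate, as a simplified stand-in for Directed Information. The function names, the bin count, the 0.05-bit threshold, and the synthetic "gauge" and "flow" series are all assumptions made for illustration only.

```python
import math
import random
from collections import Counter


def discretize(series, bins=4):
    """Map a real-valued series to integer labels using equal-width bins."""
    lo, hi = min(series), max(series)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 for a flat series
    return [min(int((v - lo) / width), bins - 1) for v in series]


def lagged_cmi(x, y, bins=4):
    """Plug-in estimate of I(X_{t-1}; Y_t | Y_{t-1}) in bits.

    Assumption: this lag-1 transfer entropy is only a simplified proxy for
    the Directed Information from x to y, not the paper's estimator.
    """
    xd, yd = discretize(x, bins), discretize(y, bins)
    triples = list(zip(xd[:-1], yd[:-1], yd[1:]))  # (x_prev, y_prev, y_now)
    n = len(triples)
    c_xyz = Counter(triples)
    c_xy = Counter((a, b) for a, b, _ in triples)  # counts of (x_prev, y_prev)
    c_yz = Counter((b, c) for _, b, c in triples)  # counts of (y_prev, y_now)
    c_y = Counter(b for _, b, _ in triples)        # counts of y_prev
    total = 0.0
    for (a, b, c), k in c_xyz.items():
        ratio = k * c_y[b] / (c_xy[(a, b)] * c_yz[(b, c)])
        total += (k / n) * math.log2(ratio)
    return total


def screen_inputs(candidates, target, threshold=0.05):
    """Return the candidate names scoring above a (hypothetical) threshold."""
    scores = {name: lagged_cmi(s, target) for name, s in candidates.items()}
    kept = [name for name, score in scores.items() if score > threshold]
    return kept, scores


# Synthetic demonstration: gauge_a drives the flow, gauge_b is unrelated.
random.seed(0)
n = 2000
gauge_a = [random.random() for _ in range(n)]
gauge_b = [random.random() for _ in range(n)]
flow = [0.0]
for t in range(1, n):
    flow.append(0.5 * flow[-1] + gauge_a[t - 1] + random.gauss(0, 0.05))

kept, scores = screen_inputs({"gauge_a": gauge_a, "gauge_b": gauge_b}, flow)
print("kept inputs:", kept)
print("scores (bits):", scores)
```

In a fuller pipeline, only the retained series would be passed on as features to a boosted regression tree model (for example scikit-learn's `GradientBoostingRegressor`, one possible choice rather than the tool named in the paper). Plug-in estimates of this kind are biased upward on short records, so a real application would likely need a significance test or a more careful estimator rather than a fixed threshold.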

Similar articles

1
Are all data useful? Inferring causality to predict flows across sewer and drainage systems using directed information and boosted regression trees.
Water Res. 2018 Nov 15;145:697-706. doi: 10.1016/j.watres.2018.09.009. Epub 2018 Sep 4.
2
What can we learn from a 500-year event? Experiences from urban drainage in Austria.
Water Sci Technol. 2018 May;77(7-8):2146-2154. doi: 10.2166/wst.2018.138.
3
An automated toolchain for the data-driven and dynamical modeling of combined sewer systems.
Water Res. 2017 Dec 1;126:88-100. doi: 10.1016/j.watres.2017.08.065. Epub 2017 Sep 3.
4
Overland flow computations in urban and industrial catchments from direct precipitation data using a two-dimensional shallow water model.
Water Sci Technol. 2010;62(9):1998-2008. doi: 10.2166/wst.2010.746.
5
Comparison of short-term rainfall forecasts for model-based flow prediction in urban drainage systems.
Water Sci Technol. 2013;68(2):472-8. doi: 10.2166/wst.2013.274.
6
One-dimensional modelling of the interactions between heavy rainfall-runoff in an urban area and flooding flows from sewer networks and rivers.
Water Sci Technol. 2009;60(4):927-34. doi: 10.2166/wst.2009.431.
7
Optimise inlet condition and design parameters of a new sewer overflow screening device using numerical model.
Water Sci Technol. 2014;70(11):1880-7. doi: 10.2166/wst.2014.422.
8
Nowcasting of rainfall and of combined sewage flow in urban drainage systems.
Water Sci Technol. 2009;59(6):1145-51. doi: 10.2166/wst.2009.098.
9
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
10
Predicting combined sewer overflows chamber depth using artificial neural networks with rainfall radar data.
Water Sci Technol. 2014;69(6):1326-33. doi: 10.2166/wst.2014.024.