
Developing reliable hourly electricity demand data through screening and imputation.

Affiliations

Carnegie Institution for Science, Stanford, United States.

University of California, Irvine, Irvine, United States.

Publication information

Sci Data. 2020 May 26;7(1):155. doi: 10.1038/s41597-020-0483-x.

DOI:10.1038/s41597-020-0483-x
PMID:32457368
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7250876/
Abstract

Electricity usage (demand) data are used by utilities, governments, and academics to model electric grids for a variety of planning (e.g., capacity expansion and system operation) purposes. The U.S. Energy Information Administration collects hourly demand data from all balancing authorities (BAs) in the contiguous United States. As of September 2019, we find 2.2% of the demand data in their database are missing. Additionally, 0.5% of reported quantities are either negative values or are otherwise identified as outliers. With the goal of attaining non-missing, continuous, and physically plausible demand data to facilitate analysis, we developed a screening process to identify anomalous values. We then applied a Multiple Imputation by Chained Equations (MICE) technique to impute replacements for missing and anomalous values. We conduct cross-validation on the MICE technique by marking subsets of plausible data as missing, and using the remaining data to predict this "missing" data. The mean absolute percentage error of imputed values is 3.5% across all BAs. The cleaned data are published and available open access: https://doi.org/10.5281/zenodo.3690240.
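The workflow described in the abstract (screen, impute, then validate by hiding plausible values) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the screening rule (a ratio test against a local median), the function names, the thresholds, and the synthetic data are all assumptions made for illustration, and plain linear interpolation stands in for MICE so that the example needs only the standard library.

```python
import math
import random
import statistics


def screen(demand, window=24, max_ratio=3.0):
    """Return a copy with missing, negative, or outlying values set to None.

    The ratio-to-local-median rule is a hypothetical stand-in for the
    paper's screening process, which the abstract does not detail.
    """
    screened = []
    for i, value in enumerate(demand):
        if value is None or value < 0:
            screened.append(None)
            continue
        lo, hi = max(0, i - window), min(len(demand), i + window + 1)
        # Median of nearby reported, non-negative values (includes this one).
        nearby = [v for v in demand[lo:hi] if v is not None and v >= 0]
        median = statistics.median(nearby)
        if value > max_ratio * median or value < median / max_ratio:
            screened.append(None)
        else:
            screened.append(value)
    return screened


def interpolate(series):
    """Fill None entries by linear interpolation (a simple stand-in for MICE)."""
    known = [i for i, v in enumerate(series) if v is not None]
    if not known:
        raise ValueError("no observed values to interpolate from")
    filled = list(series)
    for i, v in enumerate(series):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = series[right]   # gap at the start: copy nearest value
        elif right is None:
            filled[i] = series[left]    # gap at the end: copy nearest value
        else:
            weight = (i - left) / (right - left)
            filled[i] = series[left] + weight * (series[right] - series[left])
    return filled


def holdout_mape(series, fraction=0.1, seed=0):
    """Hide a random subset of observed values, impute them, return MAPE in %."""
    rng = random.Random(seed)
    observed = [i for i, v in enumerate(series) if v is not None and v != 0]
    hidden = rng.sample(observed, max(1, int(fraction * len(observed))))
    masked = list(series)
    for i in hidden:
        masked[i] = None
    imputed = interpolate(masked)
    errors = [abs(imputed[i] - series[i]) / abs(series[i]) for i in hidden]
    return 100.0 * sum(errors) / len(errors)


# Synthetic week of hourly demand with three injected problems.
demand = [1000 + 200 * math.sin(2 * math.pi * h / 24) for h in range(24 * 7)]
demand[10] = -50.0    # negative value
demand[40] = 9000.0   # implausible spike
demand[75] = None     # missing report

screened = screen(demand)
cleaned = interpolate(screened)
mape = holdout_mape(screened)
print(f"hold-out MAPE: {mape:.2f}%")
```

The hold-out step mirrors the cross-validation idea in the abstract: values known to be plausible are marked as missing, predicted from the rest, and scored with the mean absolute percentage error. A chained-equations imputer would replace `interpolate` without changing the scoring logic.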
