Scientific Data Management in the Age of Big Data: An Approach Supporting a Resilience Index Development Effort.

Authors

Harwell Linda C, Vivian Deborah N, McLaughlin Michelle D, Hafner Stephen F

Affiliations

Gulf Ecology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Gulf Breeze, Florida, USA.

Student Services Contractor, Oak Ridge Associated Universities, Oak Ridge, Tennessee, USA.

Publication information

Front Environ Sci. 2019 Jun 4;7(Article 72):1-13. doi: 10.3389/fenvs.2019.00072.

DOI:10.3389/fenvs.2019.00072
PMID:33123540
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7592716/
Abstract

The increased availability of publicly available data is, in many ways, changing our approach to conducting research. Not only are cloud-based information resources providing supplementary data to bolster traditional scientific activities (e.g., field studies, laboratory experiments), they also serve as the foundation for secondary data research projects such as indicator development. Indicators and indices are a convenient way to synthesize disparate information to address complex scientific questions that are difficult to measure directly (e.g., resilience, sustainability, well-being). In the current literature, there is no shortage of indicator or index examples derived from secondary data with a growing number that are scientifically focused. However, little information is provided describing the management approaches and best practices used to govern the data underpinnings supporting these efforts. From acquisition to storage and maintenance, secondary data research products rely on the availability of relevant, high-quality data, repeatable data handling methods and a multi-faceted data flow process to promote and sustain research transparency and integrity. The U.S. Environmental Protection Agency recently published a report describing the development of a climate resilience screening index which used over one million data points to calculate the final index. The pool of data was derived exclusively from secondary sources such as the U.S. Census Bureau, Bureau of Labor Statistics, Postal Service, Housing and Urban Development, Forestry Services and others. Available data were presented in various forms including portable document format (PDF), delimited ASCII and proprietary format (e.g., Microsoft Excel, ESRI ArcGIS). The strategy employed for managing these data in an indicator research and development effort represented a blend of business practices, information science, and the scientific method. This paper describes the approach, highlighting key points unique for managing the data assets of a smaller scale research project in an era of "big data."
