• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大数据策展变得简单:来自科学界的观点。

Curating Big Data Made Simple: Perspectives from Scientific Communities.

机构信息

Information Services Platform Laboratory, Universal Communication Research Institute, National Institute of Information and Communications Technology , Kyoto, Japan.

出版信息

Big Data. 2014 Mar;2(1):23-33. doi: 10.1089/big.2013.0046. Epub 2014 Feb 19.

DOI:10.1089/big.2013.0046
PMID:27447308
Abstract

The digital universe is exponentially producing an unprecedented volume of data that has brought benefits as well as fundamental challenges for enterprises and scientific communities alike. This trend is inherently exciting for the development and deployment of cloud platforms to support scientific communities curating big data. The excitement stems from the fact that scientists can now access and extract value from the big data corpus, establish relationships between bits and pieces of information from many types of data, and collaborate with a diverse community of researchers from various domains. However, despite these perceived benefits, to date, little attention is focused on the people or communities who are both beneficiaries and, at the same time, producers of big data. The technical challenges posed by big data are as big as understanding the dynamics of communities working with big data, whether scientific or otherwise. Furthermore, the big data era also means that big data platforms for data-intensive research must be designed in such a way that research scientists can easily search and find data for their research, upload and download datasets for onsite/offsite use, perform computations and analysis, share their findings and research experience, and seamlessly collaborate with their colleagues. In this article, we present the architecture and design of a cloud platform that meets some of these requirements, and a big data curation model that describes how a community of earth and environmental scientists is using the platform to curate data. Motivation for developing the platform, lessons learnt in overcoming some challenges associated with supporting scientists to curate big data, and future research directions are also presented.

摘要

数字宇宙呈指数级增长,产生了前所未有的大量数据,这给企业和科学界都带来了好处和根本性的挑战。这种趋势对于开发和部署云平台以支持管理大数据的科学界来说,具有内在的吸引力。这种兴奋源于这样一个事实,即科学家现在可以访问和提取大数据语料库中的价值,建立来自多种类型数据的信息片段之间的关系,并与来自不同领域的多样化研究人员社区进行合作。然而,尽管有这些预期的好处,但迄今为止,人们很少关注既是大数据受益者,同时又是大数据生产者的个人或社区。大数据带来的技术挑战与理解使用大数据的社区的动态一样大,无论是科学领域还是其他领域。此外,大数据时代还意味着,用于数据密集型研究的大数据平台必须设计成研究科学家可以轻松搜索和查找研究数据、上传和下载数据集以供现场/场外使用、进行计算和分析、共享他们的发现和研究经验,并与同事无缝协作的方式。在本文中,我们介绍了满足这些要求的云平台架构和设计,以及大数据管理模型,描述了地球和环境科学家社区如何使用该平台来管理数据。还介绍了开发该平台的动机、克服支持科学家管理大数据所面临的一些挑战的经验教训,以及未来的研究方向。

相似文献

1
Curating Big Data Made Simple: Perspectives from Scientific Communities.大数据策展变得简单:来自科学界的观点。
Big Data. 2014 Mar;2(1):23-33. doi: 10.1089/big.2013.0046. Epub 2014 Feb 19.
2
CADRE: A Collaborative, Cloud-Based Solution for Big Bibliographic Data Research in Academic Libraries.CADRE:学术图书馆中用于大型书目数据研究的基于云的协作解决方案。
Front Big Data. 2020 Nov 20;3:556282. doi: 10.3389/fdata.2020.556282. eCollection 2020.
3
Cognitive IT-systems for big data analysis in medicine.用于医学大数据分析的认知信息技术系统。
Int J Risk Saf Med. 2015;27 Suppl 1:S108-9. doi: 10.3233/JRS-150711.
4
Benchmarking Big Data Systems and the BigData Top100 List.基准测试大数据系统和 BigData Top100 列表。
Big Data. 2013 Mar;1(1):60-4. doi: 10.1089/big.2013.1509.
5
Rethinking Giftedness and Gifted Education: A Proposed Direction Forward Based on Psychological Science.重新思考天赋和英才教育:基于心理科学的前进方向建议。
Psychol Sci Public Interest. 2011 Jan;12(1):3-54. doi: 10.1177/1529100611418056.
6
How has the impact of 'care pathway technologies' on service integration in stroke care been measured and what is the strength of the evidence to support their effectiveness in this respect?“护理路径技术”对卒中护理服务整合的影响是如何衡量的,以及有哪些证据支持其在这方面的有效性?
Int J Evid Based Healthc. 2008 Mar;6(1):78-110. doi: 10.1111/j.1744-1609.2007.00098.x.
7
Proceedings of the Second Workshop on Theory meets Industry (Erwin-Schrödinger-Institute (ESI), Vienna, Austria, 12-14 June 2007).第二届理论与产业研讨会会议录(2007年6月12日至14日,奥地利维也纳埃尔温·薛定谔研究所)
J Phys Condens Matter. 2008 Feb 13;20(6):060301. doi: 10.1088/0953-8984/20/06/060301. Epub 2008 Jan 24.
8
The C-BIG Repository: an Institution-Level Open Science Platform.C-BIG 知识库:机构级开放科学平台。
Neuroinformatics. 2022 Jan;20(1):139-153. doi: 10.1007/s12021-021-09516-9. Epub 2021 May 18.
9
Finding the traces of behavioral and cognitive processes in big data and naturally occurring datasets.在大数据和自然发生的数据集里寻找行为和认知过程的痕迹。
Behav Res Methods. 2017 Oct;49(5):1630-1638. doi: 10.3758/s13428-017-0874-x.
10
Curating Scientific Workflows for Biomolecular Nuclear Magnetic Resonance Spectroscopy.为生物分子核磁共振光谱整理科学工作流程。
Int J Digit Curation. 2018;13(1):286-293. doi: 10.2218/ijdc.v13i1.657. Epub 2019 Apr 19.