
Data management practices for collaborative research.

Affiliation

Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Publication information

Front Psychiatry. 2011 Jul 22;2:47. doi: 10.3389/fpsyt.2011.00047. eCollection 2011.

Abstract

The success of research in maternal-infant health, as in any scientific field, relies on the adoption of best practices for data and knowledge management. Prior work by our group and others has identified evidence-based solutions to many existing data management challenges, including cost-effective practices for ensuring high-quality data entry and for properly constructing and maintaining data standards and ontologies. Quality assurance practices for data entry and processing are necessary to ensure that data are not degraded during processing, but these practices have not been widely adopted in psychology and biology. Furthermore, collaborative research is becoming more common. It often involves multiple laboratories, different scientific disciplines, numerous data sources, large data sets, and data sets from public and commercial sources. These factors present new challenges for data and knowledge management. Data security and privacy concerns increase because data may be accessed by investigators affiliated with different institutions. Collaborative groups must address the challenges of federating data access between the data-collecting sites and a centralized data management site. Merging ontologies across data sets can become a formidable task, especially in fields with evolving ontologies. The increased use of automated data acquisition can yield more data, but it can also increase the risk of introducing errors or systematic biases into the data. In addition, integrating data collected from different assay types often requires the development of new analysis tools. All of these challenges increase the cost and time spent on data management for a given project, and they increase the likelihood that data quality will suffer. In this paper, we review these issues and discuss theoretical and practical approaches for addressing them.
