Suppr超能文献

psHarmonize:助力在R语言中实现可重复的大规模统计前数据协调与记录。

psHarmonize: Facilitating reproducible large-scale pre-statistical data harmonization and documentation in R.

作者信息

Stephen John J, Carolan Padraig, Krefman Amy E, Sedaghat Sanaz, Mansolf Maxwell, Allen Norrina B, Scholtens Denise M

机构信息

Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA.

Division of Epidemiology and Community Health, University of Minnesota School of Public Health, Minneapolis, MN 55455, USA.

出版信息

Patterns (N Y). 2024 Jun 14;5(8):101003. doi: 10.1016/j.patter.2024.101003. eCollection 2024 Aug 9.

Abstract

Combining pertinent data from multiple studies can increase the robustness of epidemiological investigations. Effective "pre-statistical" data harmonization is paramount to the streamlined conduct of collective, multi-study analysis. Harmonizing data and documenting decisions about the transformations of variables to a common set of categorical values and measurement scales are time consuming and can be error prone, particularly for numerous studies with large quantities of variables. The R package facilitates harmonization by combining multiple datasets, applying data transformation functions, and creating long and wide harmonized datasets. The user provides transformation instructions in a "harmonization sheet" that includes dataset names, variable names, and coding instructions and centrally tracks all decisions. The package performs harmonization, generates error logs as necessary, and creates summary reports of harmonized data. is poised to serve as a central feature of data preparation for the joint analysis of multiple studies.

摘要

整合来自多项研究的相关数据可以提高流行病学调查的稳健性。有效的“统计前”数据协调对于简化集体多研究分析的开展至关重要。将数据协调并记录关于将变量转换为一组共同分类值和测量尺度的决策既耗时又容易出错,尤其是对于有大量变量的众多研究而言。R包通过合并多个数据集、应用数据转换函数以及创建长格式和宽格式的协调数据集来促进协调。用户在“协调表”中提供转换说明,该表包括数据集名称、变量名称和编码说明,并集中跟踪所有决策。该包执行协调,必要时生成错误日志,并创建协调数据的总结报告。它有望成为多项研究联合分析数据准备的核心功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a4/11368672/81ce4b501656/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验