Suppr超能文献

2019冠状病毒病测序研究中报告的患者相关元数据:一项范围综述和文献计量分析方案

Patient-Related Metadata Reported in Sequencing Studies of SARS-CoV-2: Protocol for a Scoping Review and Bibliometric Analysis.

作者信息

O'Connor Karen, Weissenbacher Davy, Elyaderani Amir, Lautenbach Ebbing, Scotch Matthew, Gonzalez-Hernandez Graciela

机构信息

Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

Department of Computational Biomedicine, Cedars-Sinai Medical Center, West Hollywood, CA, USA.

出版信息

medRxiv. 2024 Mar 5:2023.07.14.23292681. doi: 10.1101/2023.07.14.23292681.

Abstract

BACKGROUND

There has been an unprecedented effort to sequence the SARS-CoV-2 virus and examine its molecular evolution. This has been facilitated by the availability of publicly accessible databases, the Global Initiative on Sharing All Influenza Data (GISAID) and GenBank, which collectively hold millions of SARS-CoV-2 sequence records. Genomic epidemiology, however, seeks to go beyond phylogenetic analysis by linking genetic information to patient characteristics and disease outcomes, enabling a comprehensive understanding of transmission dynamics and disease impact.While these repositories include fields reflecting patient-related metadata for a given sequence, inclusion of these demographic and clinical details is scarce. The extent to which patient-related metadata is reported in published sequencing studies and its quality remains largely unexplored.

METHODS

The NIH's LitCovid collection will be used for automated classification of articles reporting having deposited SARS-CoV-2 sequences in public repositories, while an independent search will be conducted in PubMed for validation. Data extraction will be conducted using Covidence. The extracted data will be synthesized and summarized to quantify the availability of patient metadata in the published literature of SARS-CoV-2 sequencing studies. For the bibliometric analysis, relevant data points, such as author affiliations and citation metrics will be extracted.

DISCUSSION

This scoping review will report on the extent and types of patient-related metadata reported in genomic viral sequencing studies of SARS-CoV-2, identify gaps in this reporting, and make recommendations for improving the quality and consistency of reporting in this area. The bibliometric analysis will uncover trends and patterns in the reporting of patient-related metadata, including differences in reporting based on study types or geographic regions. Co-occurrence networks of author keywords will also be presented. The insights gained from this study may help improve the quality and consistency of reporting patient metadata, enhancing the utility of sequence metadata and facilitating future research on infectious diseases.

摘要

背景

对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒进行测序并研究其分子进化的工作力度空前。可公开访问的数据库——全球共享流感数据倡议组织(GISAID)和基因库(GenBank)提供了便利,它们总共保存了数百万条SARS-CoV-2序列记录。然而,基因组流行病学旨在通过将遗传信息与患者特征和疾病结果相联系,超越系统发育分析,从而全面了解传播动态和疾病影响。虽然这些数据库包含反映给定序列患者相关元数据的字段,但这些人口统计学和临床细节的纳入情况很少。在已发表的测序研究中报告患者相关元数据的程度及其质量在很大程度上仍未得到探索。

方法

美国国立卫生研究院(NIH)的LitCovid数据集将用于对报告已在公共数据库中存入SARS-CoV-2序列的文章进行自动分类,同时将在PubMed中进行独立检索以进行验证。将使用Covidence进行数据提取。提取的数据将进行综合和总结,以量化SARS-CoV-2测序研究已发表文献中患者元数据的可用性。对于文献计量分析,将提取相关数据点,如作者单位和引用指标。

讨论

本范围综述将报告在SARS-CoV-2基因组病毒测序研究中报告的患者相关元数据的范围和类型,确定该报告中的差距,并就提高该领域报告的质量和一致性提出建议。文献计量分析将揭示患者相关元数据报告中的趋势和模式,包括基于研究类型或地理区域的报告差异。还将展示作者关键词的共现网络。本研究获得的见解可能有助于提高报告患者元数据的质量和一致性,增强序列元数据的实用性,并促进未来传染病研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9588/10926763/2fc864779e70/nihpp-2023.07.14.23292681v2-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验