Rajczewski Andrew T, Mehta Subina, Nguyen Dinh Duy An, Grüning Björn A, Johnson James E, McGowan Thomas, Griffin Timothy J, Jagtap Pratik D
Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
Department of Computer Science, University of Freiburg, Freiburg, Germany.
medRxiv. 2021 Mar 1:2021.02.09.21251427. doi: 10.1101/2021.02.09.21251427.
The Coronavirus Disease 2019 (COVID-19) global pandemic has had a profound, lasting impact on the world's population. A key aspect to providing care for those with COVID-19 and checking its further spread is early and accurate diagnosis of infection, which has been generally done via methods for amplifying and detecting viral RNA molecules. Detection and quantitation of peptides using targeted mass spectrometry-based strategies has been proposed as an alternative diagnostic tool due to direct detection of molecular indicators from non-invasively collected samples as well as the potential for high-throughput analysis in a clinical setting; many studies have revealed the presence of viral peptides within easily accessed patient samples. However, evidence suggests that some viral peptides could serve as better indicators of COVID-19 infection status than others, due to potential misidentification of peptides derived from human host proteins, poor spectral quality, high limits of detection etc. In this study we have compiled a list of 639 peptides identified from Sudden Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) samples, including from in vitro and clinical sources. These datasets were rigorously analyzed using automated, Galaxy-based workflows containing tools such as PepQuery, BLAST-P, and the Multi-omic Visualization Platform as well as the open-source tools MetaTryp and Proteomics Data Viewer (PDV). Using PepQuery for confirming peptide spectrum matches, we were able to narrow down the 639 peptide possibilities to 87 peptides which were most robustly detected and specific to the SARS-CoV-2 virus. The specificity of these sequences to coronavirus taxa was confirmed using Unipept and BLAST-P. Applying stringent statistical scoring thresholds, combined with manual verification of peptide spectrum match quality, 4 peptides derived from the nucleocapsid phosphoprotein and membrane protein were found to be most robustly detected across all cell culture and clinical samples, including those collected non-invasively. We propose that these peptides would be of the most value for clinical proteomics applications seeking to detect COVID-19 from a variety of sample types. We also contend that samples taken from the upper respiratory tract and oral cavity have the highest potential for diagnosis of SARS-CoV-2 infection from easily collected patient samples using mass spectrometry-based proteomics assays.
2019年冠状病毒病(COVID-19)全球大流行对世界人口产生了深远而持久的影响。为COVID-19患者提供护理并遏制其进一步传播的一个关键方面是对感染进行早期准确诊断,这通常是通过扩增和检测病毒RNA分子的方法来完成的。由于可以直接从非侵入性采集的样本中检测分子指标,并且具有在临床环境中进行高通量分析的潜力,因此已提出使用基于靶向质谱的策略来检测和定量肽作为一种替代诊断工具;许多研究已经揭示了在易于获取的患者样本中存在病毒肽。然而,有证据表明,由于可能误识源自人类宿主蛋白的肽、光谱质量差、检测限高等原因,一些病毒肽作为COVID-19感染状态的指标可能比其他肽更好。在本研究中,我们汇编了一份从严重急性呼吸综合征冠状病毒2(SARS-CoV-2)样本中鉴定出的639种肽的列表,这些样本包括体外和临床来源。使用包含PepQuery、BLAST-P和多组学可视化平台等工具的基于Galaxy的自动化工作流程以及开源工具MetaTryp和蛋白质组学数据查看器(PDV)对这些数据集进行了严格分析。使用PepQuery确认肽谱匹配,我们能够将639种肽的可能性缩小到87种肽,这些肽被最可靠地检测到并且对SARS-CoV-2病毒具有特异性。使用Unipept和BLAST-P确认了这些序列对冠状病毒分类群的特异性。应用严格的统计评分阈值,并结合对肽谱匹配质量的人工验证,发现源自核衣壳磷蛋白和膜蛋白的4种肽在所有细胞培养和临床样本(包括非侵入性采集的样本)中被最可靠地检测到。我们认为,这些肽对于试图从各种样本类型中检测COVID-19的临床蛋白质组学应用最有价值。我们还认为,使用基于质谱的蛋白质组学分析,从上呼吸道和口腔采集的样本在通过易于采集的患者样本诊断SARS-CoV-2感染方面具有最高的潜力。