Suppr超能文献

使用模拟数据库对基于质谱的蛋白质组学算法进行基准测试。

Benchmarking mass spectrometry based proteomics algorithms using a simulated database.

作者信息

Awan Muaaz Gul, Awan Abdullah Gul, Saeed Fahad

机构信息

Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Al-Khwarizmi Institute of Computer Science (KICS), University of Engineering & Technology (UET), Lahore, Pakistan.

出版信息

Netw Model Anal Health Inform Bioinform. 2021;10. doi: 10.1007/s13721-021-00298-3. Epub 2021 Mar 26.

Abstract

Protein sequencing algorithms process data from a variety of instruments that has been generated under diverse experimental conditions. Currently there is no way to predict the accuracy of an algorithm for a given data set. Most of the published algorithms and associated software has been evaluated on limited number of experimental data sets. However, these performance evaluations do not cover the complete search space the algorithmand the software might encounter in real-world. To this end, we present a database of simulated spectra that can be used to benchmark any spectra to peptide search engine. We demonstrate the usability of this database by bench marking two popular peptide sequencing engines. We show wide variation in the accuracy of peptide deductions and a complete quality profile of a given algorithm can be useful for practitioners and algorithm developers. All benchmarking data is available at https://users.cs.fiu.edu/~fsaeed/Benchmark.html.

摘要

蛋白质测序算法处理来自各种仪器的数据,这些数据是在不同实验条件下生成的。目前,对于给定的数据集,没有办法预测算法的准确性。大多数已发表的算法和相关软件仅在有限数量的实验数据集上进行了评估。然而,这些性能评估并未涵盖算法和软件在现实世界中可能遇到的完整搜索空间。为此,我们提出了一个模拟光谱数据库,可用于对任何光谱与肽搜索引擎进行基准测试。我们通过对两个流行的肽测序引擎进行基准测试来证明该数据库的可用性。我们展示了肽推导准确性的广泛差异,并且给定算法的完整质量概况对从业者和算法开发者可能有用。所有基准测试数据可在https://users.cs.fiu.edu/~fsaeed/Benchmark.html获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8fc/8130870/840ba6dd6bc1/nihms-1698506-f0001.jpg

相似文献

6
7
8
Benchmark on Indexing Algorithms for Accelerating Molecular Similarity Search.用于加速分子相似性搜索的索引算法基准测试。
J Chem Inf Model. 2020 Dec 28;60(12):6167-6184. doi: 10.1021/acs.jcim.0c00393. Epub 2020 Oct 23.

本文引用的文献

3
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.
7
The Proteome of Primary Prostate Cancer.原发性前列腺癌的蛋白质组。
Eur Urol. 2016 May;69(5):942-52. doi: 10.1016/j.eururo.2015.10.053. Epub 2015 Dec 2.
8
Novor: real-time peptide de novo sequencing software.Novor:实时肽从头测序软件。
J Am Soc Mass Spectrom. 2015 Nov;26(11):1885-94. doi: 10.1007/s13361-015-1204-0. Epub 2015 Jun 30.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验