Chi Hao, Liu Chao, Yang Hao, Zeng Wen-Feng, Wu Long, Zhou Wen-Jing, Wang Rui-Min, Niu Xiu-Nan, Ding Yue-He, Zhang Yao, Wang Zhao-Wei, Chen Zhen-Lin, Sun Rui-Xiang, Liu Tao, Tan Guang-Ming, Dong Meng-Qiu, Xu Ping, Zhang Pei-Heng, He Si-Min
Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China.
University of Chinese Academy of Sciences, Beijing, China.
Nat Biotechnol. 2018 Oct 8. doi: 10.1038/nbt.4236.
We present a sequence-tag-based search engine, Open-pFind, to identify peptides in an ultra-large search space that includes coeluting peptides, unexpected modifications and digestions. Our method detects peptides with higher precision and speed than seven other search engines. Open-pFind identified 70-85% of the tandem mass spectra in four large-scale datasets and 14,064 proteins, each supported by at least two protein-unique peptides, in a human proteome dataset.
我们提出了一种基于序列标签的搜索引擎Open-pFind,用于在超大规模搜索空间中识别肽段,该空间包括共洗脱肽段、意外修饰和消化产物。我们的方法在检测肽段时,比其他七种搜索引擎具有更高的精度和速度。Open-pFind在四个大规模数据集中识别出70%-85%的串联质谱,并在一个人类蛋白质组数据集中识别出14064种蛋白质,每种蛋白质至少由两个蛋白质特异性肽段支持。