Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA.
J Proteome Res. 2010 May 7;9(5):2508-15. doi: 10.1021/pr9011816.
We developed an informatic method to identify tandem mass spectra composed of chemically cross-linked peptides from those of linear peptides and to assign sequence to each of the two unique peptide sequences. For a given set of proteins the key software tool, xComb, combs through all theoretically feasible cross-linked peptides to create a database consisting of a subset of all combinations represented as peptide FASTA files. The xComb library of select theoretical cross-linked peptides may then be used as a database that is examined by a standard proteomic search engine to match tandem mass spectral data sets to identify cross-linked peptides. The database search may be conducted against as many as 50 proteins with a number of common proteomic search engines, e.g. Phenyx, Sequest, OMSSA, Mascot and X!Tandem. By searching against a peptide library of linearized, cross-linked peptides, rather than a linearized protein library, search times are decreased and the process is decoupled from any specific search engine. A further benefit of decoupling from the search engine is that protein cross-linking studies may be conducted with readily available informatics tools for which scoring routines already exist within the proteomic community.
我们开发了一种信息学方法,用于从线性肽的串联质谱中识别化学交联肽的串联质谱,并为两个独特肽序列中的每一个分配序列。对于给定的一组蛋白质,关键的软件工具 xComb 会遍历所有理论上可行的交联肽,创建一个由代表肽 FASTA 文件的所有组合的子集组成的数据库。然后可以将选择的理论交联肽的 xComb 库用作数据库,由标准蛋白质组学搜索引擎检查,以将串联质谱数据集与鉴定交联肽相匹配。可以使用许多常见的蛋白质组学搜索引擎(例如 Phenyx、Sequest、OMSSA、Mascot 和 X!Tandem)对多达 50 个蛋白质进行数据库搜索。通过针对线性化交联肽的肽库而不是线性化蛋白质库进行搜索,可以减少搜索时间,并将该过程与任何特定的搜索引擎解耦。与搜索引擎解耦的另一个好处是,蛋白质交联研究可以使用现成的信息学工具进行,这些工具已经在蛋白质组学社区中存在评分程序。