Interrogating the human genome using uninterpreted mass spectrometry data.

Authors

Choudhary J S, Blackstock W P, Creasy D M, Cottrell J S

Affiliation

Cell Mapping Project, Glaxo Wellcome R&D, Stevenage, Hertfordshire, UK.

Publication information

Proteomics. 2001 May;1(5):651-67. doi: 10.1002/1615-9861(200104)1:5<651::AID-PROT651>3.0.CO;2-N.

Abstract

The public availability of a draft assembly of the human genome has enabled us to demonstrate, for the first time, the feasibility of searching a complete, unmasked eukaryotic genome using uninterpreted mass spectrometry data. A complex LC-MS/MS data set, containing peptides from at least 22 human proteins, was searched against a comprehensive, nonidentical protein database, an expressed sequence tag (EST) database, and the International Human Genome Project draft assembly of the human genome. The results from the three searches are compared in detail, and the merits of the different databases for this application are discussed. In the case of the EST database, the UniGene index provided a method of simplifying and summarising the search results. In the case of the genomic DNA, the presence of introns prevented matching of roughly one quarter of the spectra, but the technique can provide primary experimental verification of predicted coding sequences, and has the potential to identify novel coding sequences.
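The intron effect noted in the abstract can be illustrated with a toy sketch. It assumes the genomic sequence is translated in all six reading frames before matching (the abstract does not state how the nucleotide databases were translated), and it substitutes plain substring matching for real MS/MS scoring. All sequences and names below (`EXON1`, `INTRON`, `EXON2`, `peptide_in_dna`) are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna ('*' marks a stop)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def six_frames(dna: str) -> list[str]:
    """Translate both strands at offsets 0, 1 and 2."""
    return [
        translate(strand[offset:])
        for strand in (dna, reverse_complement(dna))
        for offset in range(3)
    ]


def peptide_in_dna(peptide: str, dna: str) -> bool:
    """Simplification: substring match stands in for spectrum-to-sequence scoring."""
    return any(peptide in frame for frame in six_frames(dna))


# Hypothetical toy gene: two exons separated by a short intron.
EXON1 = "ATGAAAACCGCGTATATTGCGAAA"   # encodes MKTAYIAK
INTRON = "GTAAGTTTTCAG"              # made-up intron, GT...AG
EXON2 = "CAGCGTCAGATTAGCTTTGTGAAA"   # encodes QRQISFVK
GENOMIC = EXON1 + INTRON + EXON2
MRNA = EXON1 + EXON2                 # spliced transcript, as an EST might capture

within_exon = "TAYIAK"      # lies entirely inside exon 1
spans_junction = "IAKQRQ"   # crosses the exon 1 / exon 2 boundary

print(peptide_in_dna(within_exon, GENOMIC))     # peptide inside one exon
print(peptide_in_dna(spans_junction, MRNA))     # junction peptide vs. spliced sequence
print(peptide_in_dna(spans_junction, GENOMIC))  # junction peptide vs. genomic DNA
```

In this sketch, the peptide contained in a single exon is found in the genomic translation, while the peptide spanning the splice junction is found only in the spliced sequence. That is the same kind of loss the abstract attributes to introns when searching genomic DNA directly.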
