• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过文本挖掘和归纳矩阵补全对G蛋白偶联受体进行功能预测

Function Prediction for G Protein-Coupled Receptors through Text Mining and Induction Matrix Completion.

作者信息

Wu Jiansheng, Yin Qin, Zhang Chengxin, Geng Jingjing, Wu Hongjie, Hu Haifeng, Ke Xiaoyan, Zhang Yang

机构信息

School of Geographic and Biological Information and School of Telecommunication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.

Department of Computational Medicine and Bioinformatics and Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States.

出版信息

ACS Omega. 2019 Feb 12;4(2):3045-3054. doi: 10.1021/acsomega.8b02454. eCollection 2019 Feb 28.

DOI:10.1021/acsomega.8b02454
PMID:31459527
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6649004/
Abstract

G protein-coupled receptors (GPCRs) constitute the key component of cellular signal transduction. Accurately annotating the biological functions of GPCR proteins is vital to the understanding of the physiological processes they involve in. With the rapid development of text mining technologies and the exponential growth of biomedical literature, it becomes urgent to explore biological functional information from various literature for systematically and reliably annotating these known GPCRs. We design a novel three-stage approach, TM-IMC, using text mining and inductive matrix completion, for automated prediction of the gene ontology (GO) terms of the GPCR proteins. Large-scale benchmark tests show that inductive matrix completion models contribute to GPCR-GO association prediction for both molecular function and biological process aspects. Moreover, our detailed data analysis shows that information extracted from GPCR-associated literature indeed contributes to the prediction of GPCR-GO associations. The study demonstrated a new avenue to enhance the accuracy of GPCR function annotation through the combination of text mining and induction matrix completion over baseline methods in critical assessment of protein function annotation algorithms and literature-based GO annotation methods. Source codes of TM-IMC and the involved datasets can be freely downloaded from https://zhanglab.ccmb.med.umich.edu/TM-IMC for academic purposes.

摘要

G蛋白偶联受体(GPCRs)是细胞信号转导的关键组成部分。准确注释GPCR蛋白的生物学功能对于理解它们所涉及的生理过程至关重要。随着文本挖掘技术的迅速发展以及生物医学文献的指数级增长,从各种文献中探索生物学功能信息以系统且可靠地注释这些已知GPCR变得迫在眉睫。我们设计了一种新颖的三阶段方法TM-IMC,利用文本挖掘和归纳矩阵补全来自动预测GPCR蛋白的基因本体(GO)术语。大规模基准测试表明,归纳矩阵补全模型有助于在分子功能和生物学过程方面进行GPCR-GO关联预测。此外,我们详细的数据分析表明,从与GPCR相关的文献中提取的信息确实有助于GPCR-GO关联的预测。该研究展示了一条新途径,即在蛋白质功能注释算法的关键评估和基于文献的GO注释方法中,通过将文本挖掘与归纳矩阵补全相结合,相对于基线方法提高GPCR功能注释的准确性。TM-IMC的源代码及相关数据集可从https://zhanglab.ccmb.med.umich.edu/TM-IMC免费下载用于学术目的。

相似文献

1
Function Prediction for G Protein-Coupled Receptors through Text Mining and Induction Matrix Completion.通过文本挖掘和归纳矩阵补全对G蛋白偶联受体进行功能预测
ACS Omega. 2019 Feb 12;4(2):3045-3054. doi: 10.1021/acsomega.8b02454. eCollection 2019 Feb 28.
2
GLASS: a comprehensive database for experimentally validated GPCR-ligand associations.GLASS:一个用于实验验证的GPCR-配体关联的综合数据库。
Bioinformatics. 2015 Sep 15;31(18):3035-42. doi: 10.1093/bioinformatics/btv302. Epub 2015 May 13.
3
GPCRRD: G protein-coupled receptor spatial restraint database for 3D structure modeling and function annotation.GPCRRD:用于 3D 结构建模和功能注释的 G 蛋白偶联受体空间约束数据库。
Bioinformatics. 2010 Dec 1;26(23):3004-5. doi: 10.1093/bioinformatics/btq563. Epub 2010 Oct 5.
4
Finding Gene Associations by Text Mining and Annotating it with Gene Ontology.通过文本挖掘发现基因关联,并使用基因本体论对其进行注释。
Methods Mol Biol. 2022;2496:71-90. doi: 10.1007/978-1-0716-2305-3_4.
5
WDL-RF: predicting bioactivities of ligand molecules acting with G protein-coupled receptors by combining weighted deep learning and random forest.WDL-RF:通过结合加权深度学习和随机森林预测与 G 蛋白偶联受体相互作用的配体分子的生物活性。
Bioinformatics. 2018 Jul 1;34(13):2271-2282. doi: 10.1093/bioinformatics/bty070.
6
Evaluation of BioCreAtIvE assessment of task 2.生物创意任务2评估的评价
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S16. doi: 10.1186/1471-2105-6-S1-S16. Epub 2005 May 24.
7
Prediction of GPCR-G protein coupling specificity using features of sequences and biological functions.利用序列特征和生物学功能预测GPCR-G蛋白偶联特异性
Genomics Proteomics Bioinformatics. 2006 Nov;4(4):238-44. doi: 10.1016/S1672-0229(07)60004-7.
8
GPCR-PEnDB: a database of protein sequences and derived features to facilitate prediction and classification of G protein-coupled receptors.GPCR-PEnDB:一个蛋白质序列和衍生特征数据库,用于促进 G 蛋白偶联受体的预测和分类。
Database (Oxford). 2020 Nov 20;2020. doi: 10.1093/database/baaa087.
9
Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors.精确建模和解释靶向 G 蛋白偶联受体的配体的生物活性。
Bioinformatics. 2019 Jul 15;35(14):i324-i332. doi: 10.1093/bioinformatics/btz336.
10
How to link ontologies and protein-protein interactions to literature: text-mining approaches and the BioCreative experience.如何将本体和蛋白质-蛋白质相互作用与文献联系起来:文本挖掘方法和 BioCreative 的经验。
Database (Oxford). 2012 Mar 21;2012:bas017. doi: 10.1093/database/bas017. Print 2012.

引用本文的文献

1
Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.使用 2018-2019 年 neXtProt 的新增内容和 CAFA3 挑战赛,通过 I-TASSER/COFACTOR 管道对 uPE1 蛋白的功能注释进行盲测。
J Proteome Res. 2019 Dec 6;18(12):4154-4166. doi: 10.1021/acs.jproteome.9b00537. Epub 2019 Oct 18.

本文引用的文献

1
DeepText2GO: Improving large-scale protein function prediction with deep semantic text representation.DeepText2GO:利用深度语义文本表示提高大规模蛋白质功能预测。
Methods. 2018 Aug 1;145:82-90. doi: 10.1016/j.ymeth.2018.05.026. Epub 2018 Jun 6.
2
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.GOLabeler:通过学习排序提高基于序列的大规模蛋白质功能预测。
Bioinformatics. 2018 Jul 15;34(14):2465-2473. doi: 10.1093/bioinformatics/bty130.
3
ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network.
ProLanGO:基于循环神经网络的神经机器翻译在蛋白质功能预测中的应用。
Molecules. 2017 Oct 17;22(10):1732. doi: 10.3390/molecules22101732.
4
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.DeepGO:使用深度本体感知分类器从序列和相互作用预测蛋白质功能。
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.
5
Protein Function Prediction.蛋白质功能预测
Methods Mol Biol. 2017;1654:55-75. doi: 10.1007/978-1-4939-7231-9_5.
6
COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information.协同因子:通过结合结构、序列和蛋白质-蛋白质相互作用信息来改进蛋白质功能预测。
Nucleic Acids Res. 2017 Jul 3;45(W1):W291-W299. doi: 10.1093/nar/gkx366.
7
The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation.博洛尼亚注释资源(BAR 3.0):改进蛋白质功能注释。
Nucleic Acids Res. 2017 Jul 3;45(W1):W285-W290. doi: 10.1093/nar/gkx330.
8
The ProFunc Function Prediction Server.ProFunc功能预测服务器。
Methods Mol Biol. 2017;1611:75-95. doi: 10.1007/978-1-4939-7015-5_7.
9
An expanded evaluation of protein function prediction methods shows an improvement in accuracy.对蛋白质功能预测方法的扩展评估显示准确性有所提高。
Genome Biol. 2016 Sep 7;17(1):184. doi: 10.1186/s13059-016-1037-6.
10
Scalable Algorithms for Multi-Instance Learning.可扩展的多实例学习算法。
IEEE Trans Neural Netw Learn Syst. 2017 Apr;28(4):975-987. doi: 10.1109/TNNLS.2016.2519102. Epub 2016 Feb 3.