• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TransportTP:一种用于膜转运蛋白预测和特征分析的两阶段分类方法。

TransportTP: a two-phase classification approach for membrane transporter prediction and characterization.

机构信息

Plant Biology Division, The Samuel Roberts Noble Foundation, Inc, Ardmore, OK 73401, USA.

出版信息

BMC Bioinformatics. 2009 Dec 14;10:418. doi: 10.1186/1471-2105-10-418.

DOI:10.1186/1471-2105-10-418
PMID:20003433
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3087344/
Abstract

BACKGROUND

Membrane transporters play crucial roles in living cells. Experimental characterization of transporters is costly and time-consuming. Current computational methods for transporter characterization still require extensive curation efforts, especially for eukaryotic organisms. We developed a novel genome-scale transporter prediction and characterization system called TransportTP that combined homology-based and machine learning methods in a two-phase classification approach. First, traditional homology methods were employed to predict novel transporters based on sequence similarity to known classified proteins in the Transporter Classification Database (TCDB). Second, machine learning methods were used to integrate a variety of features to refine the initial predictions. A set of rules based on transporter features was developed by machine learning using well-curated proteomes as guides.

RESULTS

In a cross-validation using the yeast proteome for training and the proteomes of ten other organisms for testing, TransportTP achieved an equivalent recall and precision of 81.8%, based on TransportDB, a manually annotated transporter database. In an independent test using the Arabidopsis proteome for training and four recently sequenced plant proteomes for testing, it achieved a recall of 74.6% and a precision of 73.4%, according to our manual curation.

CONCLUSIONS

TransportTP is the most effective tool for eukaryotic transporter characterization up to date.

摘要

背景

膜转运蛋白在活细胞中起着至关重要的作用。转运蛋白的实验特征具有成本高、耗时长的特点。目前用于转运蛋白特征描述的计算方法仍然需要大量的编辑工作,特别是对于真核生物。我们开发了一种新的基于基因组规模的转运蛋白预测和特征描述系统,称为 TransportTP,它结合了同源性和机器学习方法,采用两阶段分类方法。首先,传统的同源方法用于基于与 Transporter Classification Database(TCDB)中已知分类蛋白的序列相似性来预测新的转运蛋白。其次,使用机器学习方法来整合多种特征来细化初始预测。一组基于转运蛋白特征的规则是通过机器学习使用精心编辑的蛋白质组作为指导来开发的。

结果

在使用酵母蛋白质组进行训练和其他十个生物体的蛋白质组进行测试的交叉验证中,TransportTP 在基于手动注释的转运蛋白数据库 TransportDB 上实现了召回率和精度均为 81.8%的等效值。在使用拟南芥蛋白质组进行训练和四个最近测序的植物蛋白质组进行测试的独立测试中,根据我们的手动编辑,它实现了 74.6%的召回率和 73.4%的精度。

结论

TransportTP 是迄今为止最有效的真核转运蛋白特征描述工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/e01b5fc52cfa/1471-2105-10-418-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/3fb0ae36c7f2/1471-2105-10-418-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/a49393cdad37/1471-2105-10-418-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/9a349a7f5277/1471-2105-10-418-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/e01b5fc52cfa/1471-2105-10-418-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/3fb0ae36c7f2/1471-2105-10-418-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/a49393cdad37/1471-2105-10-418-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/9a349a7f5277/1471-2105-10-418-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7cc/3087344/e01b5fc52cfa/1471-2105-10-418-4.jpg

相似文献

1
TransportTP: a two-phase classification approach for membrane transporter prediction and characterization.TransportTP:一种用于膜转运蛋白预测和特征分析的两阶段分类方法。
BMC Bioinformatics. 2009 Dec 14;10:418. doi: 10.1186/1471-2105-10-418.
2
The Transporter Classification Database: recent advances.转运蛋白分类数据库:最新进展
Nucleic Acids Res. 2009 Jan;37(Database issue):D274-8. doi: 10.1093/nar/gkn862. Epub 2008 Nov 20.
3
A nearest neighbor approach for automated transporter prediction and categorization from protein sequences.一种基于最近邻方法从蛋白质序列进行自动转运蛋白预测和分类的方法。
Bioinformatics. 2008 May 1;24(9):1129-36. doi: 10.1093/bioinformatics/btn099. Epub 2008 Mar 12.
4
TCDB: the Transporter Classification Database for membrane transport protein analyses and information.TCDB:用于膜转运蛋白分析和信息查询的转运蛋白分类数据库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D181-6. doi: 10.1093/nar/gkj001.
5
The Transporter Classification Database (TCDB): recent advances.转运蛋白分类数据库(TCDB):最新进展
Nucleic Acids Res. 2016 Jan 4;44(D1):D372-9. doi: 10.1093/nar/gkv1103. Epub 2015 Nov 5.
6
The transporter classification database.转运蛋白分类数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D251-8. doi: 10.1093/nar/gkt1097. Epub 2013 Nov 12.
7
Prediction of membrane transport proteins and their substrate specificities using primary sequence information.利用一级序列信息预测膜转运蛋白及其底物特异性。
PLoS One. 2014 Jun 26;9(6):e100278. doi: 10.1371/journal.pone.0100278. eCollection 2014.
8
The Transporter Classification Database (TCDB): 2021 update.《转运蛋白分类数据库(TCDB):2021 年更新》。
Nucleic Acids Res. 2021 Jan 8;49(D1):D461-D467. doi: 10.1093/nar/gkaa1004.
9
Discrimination of membrane transporter protein types using K-nearest neighbor method derived from the similarity distance of total diversity measure.基于总多样性测度的相似性距离,采用K近邻法鉴别膜转运蛋白类型。
Mol Biosyst. 2015 Mar;11(3):950-7. doi: 10.1039/c4mb00681j. Epub 2015 Jan 21.
10
TransportDB 2.0: a database for exploring membrane transporters in sequenced genomes from all domains of life.TransportDB 2.0:一个用于探索来自生命所有领域已测序基因组中膜转运蛋白的数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D320-D324. doi: 10.1093/nar/gkw1068. Epub 2016 Nov 28.

引用本文的文献

1
Transporter annotations are holding up progress in metabolic modeling.转运蛋白注释阻碍了代谢建模的进展。
Front Syst Biol. 2024 Jul 24;4:1394084. doi: 10.3389/fsysb.2024.1394084. eCollection 2024.
2
Genome evolution of a nonparasitic secondary heterotroph, the diatom .一种非寄生性次生异养生物——硅藻的基因组进化
Sci Adv. 2022 Apr 29;8(17):eabi5075. doi: 10.1126/sciadv.abi5075.
3
Utilizing Plant Synthetic Biology to Improve Human Health and Wellness.利用植物合成生物学改善人类健康与福祉。

本文引用的文献

1
The Transporter Classification Database: recent advances.转运蛋白分类数据库:最新进展
Nucleic Acids Res. 2009 Jan;37(Database issue):D274-8. doi: 10.1093/nar/gkn862. Epub 2008 Nov 20.
2
A nearest neighbor approach for automated transporter prediction and categorization from protein sequences.一种基于最近邻方法从蛋白质序列进行自动转运蛋白预测和分类的方法。
Bioinformatics. 2008 May 1;24(9):1129-36. doi: 10.1093/bioinformatics/btn099. Epub 2008 Mar 12.
3
Functional discrimination of membrane proteins using machine learning techniques.
Front Plant Sci. 2021 Aug 24;12:691462. doi: 10.3389/fpls.2021.691462. eCollection 2021.
4
Addressing uncertainty in genome-scale metabolic model reconstruction and analysis.解决基因组规模代谢模型重建与分析中的不确定性问题。
Genome Biol. 2021 Feb 18;22(1):64. doi: 10.1186/s13059-021-02289-z.
5
Transportome-wide engineering of Saccharomyces cerevisiae.对酿酒酵母的转运组进行全工程改造。
Metab Eng. 2021 Mar;64:52-63. doi: 10.1016/j.ymben.2021.01.007. Epub 2021 Jan 16.
6
Bodo saltans (Kinetoplastida) is dependent on a novel Paracaedibacter-like endosymbiont that possesses multiple putative toxin-antitoxin systems.跳跃巴通体(动基体目)依赖于一种新型拟杆菌样内共生体,该内共生体拥有多个假定的毒素-抗毒素系统。
ISME J. 2021 Jun;15(6):1680-1694. doi: 10.1038/s41396-020-00879-6. Epub 2021 Jan 15.
7
Deciphering the Infectious Process of in Lupin through Transcriptomic and Proteomic Analysis.通过转录组学和蛋白质组学分析解析羽扇豆中的感染过程
Microorganisms. 2020 Oct 21;8(10):1621. doi: 10.3390/microorganisms8101621.
8
TranCEP: Predicting the substrate class of transmembrane transport proteins using compositional, evolutionary, and positional information.TranCEP:使用组成、进化和位置信息预测跨膜转运蛋白的底物类别。
PLoS One. 2020 Jan 14;15(1):e0227683. doi: 10.1371/journal.pone.0227683. eCollection 2020.
9
The lichen symbiosis re-viewed through the genomes of Cladonia grayi and its algal partner Asterochloris glomerata.通过对 Cladonia grayi 及其藻类伙伴 Asterococcus glomerata 的基因组进行重新审视,发现地衣共生现象。
BMC Genomics. 2019 Jul 23;20(1):605. doi: 10.1186/s12864-019-5629-x.
10
Genome-Wide Identification and Comparative Analysis for OPT Family Genes in and Eleven Flowering Plants.在 和 11 种开花植物中进行 OPT 家族基因的全基因组鉴定和比较分析。
Molecules. 2018 Dec 20;24(1):15. doi: 10.3390/molecules24010015.
利用机器学习技术对膜蛋白进行功能区分。
BMC Bioinformatics. 2008 Mar 3;9:135. doi: 10.1186/1471-2105-9-135.
4
Predicting and improving the protein sequence alignment quality by support vector regression.通过支持向量回归预测并提高蛋白质序列比对质量。
BMC Bioinformatics. 2007 Dec 3;8:471. doi: 10.1186/1471-2105-8-471.
5
TransportDB: a comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channels.TransportDB:一个关于细胞质膜转运系统和外膜通道的综合数据库资源。
Nucleic Acids Res. 2007 Jan;35(Database issue):D274-9. doi: 10.1093/nar/gkl925. Epub 2006 Nov 28.
6
Computational recognition of potassium channel sequences.钾通道序列的计算识别
Bioinformatics. 2006 Jul 1;22(13):1562-8. doi: 10.1093/bioinformatics/btl132. Epub 2006 Apr 4.
7
TCDB: the Transporter Classification Database for membrane transport protein analyses and information.TCDB:用于膜转运蛋白分析和信息查询的转运蛋白分类数据库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D181-6. doi: 10.1093/nar/gkj001.
8
Prediction of transporter family from protein sequence by support vector machine approach.通过支持向量机方法从蛋白质序列预测转运蛋白家族。
Proteins. 2006 Jan 1;62(1):218-31. doi: 10.1002/prot.20605.
9
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.NCBI参考序列(RefSeq):一个经过整理的基因组、转录本和蛋白质的非冗余序列数据库。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. doi: 10.1093/nar/gki025.
10
Structure of a glutamate transporter homologue from Pyrococcus horikoshii.来自嗜热栖热菌的谷氨酸转运体同源物的结构。
Nature. 2004 Oct 14;431(7010):811-8. doi: 10.1038/nature03018.