Automatic transcription factor classifier based on functional domain composition.

Author information

Qian Ziliang, Cai Yu-Dong, Li Yixue

Affiliation

Bioinformatics Center, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.

Publication information

Biochem Biophys Res Commun. 2006 Aug 18;347(1):141-4. doi: 10.1016/j.bbrc.2006.06.060. Epub 2006 Jun 21.

Abstract

To understand transcriptional regulatory mechanisms, it is indispensable to identify transcription factors (TFs) across the whole genome and to classify them into different classes. New computational approaches have been developed to distinguish TFs from non-TFs, and furthermore to classify TFs into four different classes, based on protein functional domain composition [K.C. Chou, Y.D. Cai, Using functional domain composition and support vector machines for prediction of protein subcellular location, J. Biol. Chem. 277 (2002) 45765-45769]. We trained and tested our method on a non-redundant dataset consisting of 74 transcription factors collected from TRANSFAC v7.0 [V. Matys, O.V. Kel-Margoulis, E. Fricke, I. Liebich, S. Land, A. Barre-Dirrie, I. Reuter, D. Chekmenev, M. Krull, K. Hornischer, N. Voss, P. Stegmaier, B. Lewicki-Potapov, H. Saxel, A.E. Kel, E. Wingender, TRANSFAC(R) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes, Nucleic Acids Res. 34 (2006) D108-D110] and 1558 non-transcription factors from UniProtKB/Swiss-Prot Release 49.3 of 21-Mar-2006. The overall success rates of jackknife cross-validation tests reached 98.4% for TF/non-TF identification and 97.2% for classification of TFs into four classes: basic domains, zinc-coordinating DNA-binding domains, helix-turn-helix, and beta-scaffold factors.
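The two ideas the abstract relies on, functional domain composition and jackknife (leave-one-out) testing, can be illustrated with a minimal sketch. The abstract does not state which domain database or classifier was used, so everything below is an assumption made for illustration: each protein is encoded as a binary domain-presence vector, and the predictor is a simple nearest-neighbour rule with cosine similarity. The protein names and domain sets are invented toy data, not taken from TRANSFAC or Swiss-Prot.

```python
from typing import List, Set


def domain_vector(domains: Set[str], vocabulary: List[str]) -> List[int]:
    """Encode a protein as a binary vector: 1 if it contains the domain, else 0."""
    return [1 if d in domains else 0 for d in vocabulary]


def cosine_similarity(a: List[int], b: List[int]) -> float:
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sum(x * x for x in a) ** 0.5
    norm_b = sum(y * y for y in b) ** 0.5
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jackknife_accuracy(vectors: List[List[int]], labels: List[str]) -> float:
    """Leave-one-out test: predict each sample from all the others.

    Assumption: a nearest-neighbour predictor stands in for whatever
    classifier the authors actually used.
    """
    correct = 0
    for i, query in enumerate(vectors):
        best_label, best_score = None, -1.0
        for j, other in enumerate(vectors):
            if j == i:
                continue  # the held-out sample never votes for itself
            score = cosine_similarity(query, other)
            if score > best_score:
                best_score, best_label = score, labels[j]
        if best_label == labels[i]:
            correct += 1
    return correct / len(vectors)


# Invented toy data (hypothetical proteins and domain sets).
toy_proteins = {
    "tf_a": ({"bZIP", "Leucine_zipper"}, "TF"),
    "tf_b": ({"bZIP"}, "TF"),
    "tf_c": ({"zf-C2H2"}, "TF"),
    "tf_d": ({"zf-C2H2", "KRAB"}, "TF"),
    "non_a": ({"Kinase"}, "non-TF"),
    "non_b": ({"Kinase", "SH2"}, "non-TF"),
}

# The vocabulary is the sorted union of all domains seen in the dataset.
vocabulary = sorted({d for domains, _ in toy_proteins.values() for d in domains})
vectors = [domain_vector(domains, vocabulary) for domains, _ in toy_proteins.values()]
labels = [label for _, label in toy_proteins.values()]

accuracy = jackknife_accuracy(vectors, labels)
print(f"Jackknife success rate on toy data: {accuracy:.1%}")
```

The same loop applies unchanged to the four-class problem by replacing the TF/non-TF labels with class labels; only the label list differs.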

