• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Effect of positional dependence and alignment strategy on modeling transcription factor binding sites.

作者信息

Quader Saad, Huang Chun-Hsi

机构信息

Department of Computer Science & Engineering, University of Connecticut, Storrs, 06269-2155, USA.

出版信息

BMC Res Notes. 2012 Jul 2;5:340. doi: 10.1186/1756-0500-5-340.

DOI:10.1186/1756-0500-5-340
PMID:22748199
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3465234/
Abstract

BACKGROUND

Many consensus-based and Position Weight Matrix-based methods for recognizing transcription factor binding sites (TFBS) are not well suited to the variability in the lengths of binding sites. Besides, many methods discard known binding sites while building the model. Moreover, the impact of Information Content (IC) and the positional dependence of nucleotides within an aligned set of TFBS has not been well researched for modeling variable-length binding sites. In this paper, we propose ML-Consensus (Mixed-Length Consensus): a consensus model for variable-length TFBS which does not exclude any reported binding sites.

METHODS

We consider Pairwise Score (PS) as a measure of positional dependence of nucleotides within an alignment of TFBS. We investigate how the prediction accuracy of ML-Consensus is affected by the incorporation of IC and PS with a particular binding site alignment strategy. We perform cross-validations for datasets of six species from the TRANSFAC public database, and analyze the results using ROC curves and the Wilcoxon matched-pair signed-ranks test.

RESULTS

We observe that the incorporation of IC and PS in ML-Consensus results in statistically significant improvement in the prediction accuracy of the model. Moreover, the existence of a core region among the known binding sites (of any length) is witnessed by the pairwise coexistence of nucleotides within the core length.

CONCLUSIONS

These observations suggest the possibility of an efficient multiple sequence alignment algorithm for aligning TFBS, accommodating known binding sites of any length, for optimal (or near-optimal) TFBS prediction. However, designing such an algorithm is a matter of further investigation.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/a5751068a277/1756-0500-5-340-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/f289b0df42e6/1756-0500-5-340-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/1b77f625ddb6/1756-0500-5-340-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/76bd2f29292c/1756-0500-5-340-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/d5b71deb6b03/1756-0500-5-340-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/a5751068a277/1756-0500-5-340-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/f289b0df42e6/1756-0500-5-340-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/1b77f625ddb6/1756-0500-5-340-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/76bd2f29292c/1756-0500-5-340-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/d5b71deb6b03/1756-0500-5-340-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/923f/3465234/a5751068a277/1756-0500-5-340-5.jpg

相似文献

1
Effect of positional dependence and alignment strategy on modeling transcription factor binding sites.
BMC Res Notes. 2012 Jul 2;5:340. doi: 10.1186/1756-0500-5-340.
2
LASAGNA: a novel algorithm for transcription factor binding site alignment.LASAGNA:一种用于转录因子结合位点比对的新算法。
BMC Bioinformatics. 2013 Mar 24;14:108. doi: 10.1186/1471-2105-14-108.
3
Incorporating evolution of transcription factor binding sites into annotated alignments.将转录因子结合位点的进化纳入注释比对中。
J Biosci. 2007 Aug;32(5):841-50. doi: 10.1007/s12038-007-0084-2.
4
Improvement of TRANSFAC matrices using multiple local alignment of transcription factor binding site sequences.利用转录因子结合位点序列的多重局部比对改进TRANSFAC矩阵。
Genome Inform. 2005;16(1):68-72.
5
Integrating genomic data to predict transcription factor binding.整合基因组数据以预测转录因子结合
Genome Inform. 2005;16(1):83-94.
6
A mixture model-based discriminate analysis for identifying ordered transcription factor binding site pairs in gene promoters directly regulated by estrogen receptor-alpha.基于混合模型的判别分析,用于识别由雌激素受体α直接调控的基因启动子中的有序转录因子结合位点对。
Bioinformatics. 2006 Sep 15;22(18):2210-6. doi: 10.1093/bioinformatics/btl329. Epub 2006 Jun 29.
7
Finding sequence motifs with Bayesian models incorporating positional information: an application to transcription factor binding sites.使用结合位置信息的贝叶斯模型寻找序列基序:在转录因子结合位点上的应用
BMC Bioinformatics. 2008 Jun 4;9:262. doi: 10.1186/1471-2105-9-262.
8
Effective transcription factor binding site prediction using a combination of optimization, a genetic algorithm and discriminant analysis to capture distant interactions.结合优化、遗传算法和判别分析以捕捉远距离相互作用来进行有效的转录因子结合位点预测。
BMC Bioinformatics. 2007 Dec 19;8:481. doi: 10.1186/1471-2105-8-481.
9
Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix.二核苷酸权重矩阵用于预测转录因子结合位点:位置权重矩阵的推广。
PLoS One. 2010 Mar 22;5(3):e9722. doi: 10.1371/journal.pone.0009722.
10
Identifying functional transcription factor binding sites in yeast by considering their positional preference in the promoters.通过考虑启动子中位置偏好来识别酵母中的功能转录因子结合位点。
PLoS One. 2013 Dec 26;8(12):e83791. doi: 10.1371/journal.pone.0083791. eCollection 2013.

本文引用的文献

1
SiTaR: a novel tool for transcription factor binding site prediction.SiTaR:一种用于转录因子结合位点预测的新型工具。
Bioinformatics. 2011 Oct 15;27(20):2806-11. doi: 10.1093/bioinformatics/btr492. Epub 2011 Sep 4.
2
Variable structure motifs for transcription factor binding sites.转录因子结合位点的变构基序。
BMC Genomics. 2010 Jan 14;11:30. doi: 10.1186/1471-2164-11-30.
3
Diversity and complexity in DNA recognition by transcription factors.转录因子对DNA识别的多样性与复杂性
Science. 2009 Jun 26;324(5935):1720-3. doi: 10.1126/science.1162327. Epub 2009 May 14.
4
ROC analysis: applications to the classification of biological sequences and 3D structures.ROC分析:在生物序列和三维结构分类中的应用
Brief Bioinform. 2008 May;9(3):198-209. doi: 10.1093/bib/bbm064. Epub 2008 Jan 11.
5
A survey of DNA motif finding algorithms.DNA基序查找算法综述。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S21. doi: 10.1186/1471-2105-8-S7-S21.
6
Clustal W and Clustal X version 2.0.Clustal W和Clustal X 2.0版本
Bioinformatics. 2007 Nov 1;23(21):2947-8. doi: 10.1093/bioinformatics/btm404. Epub 2007 Sep 10.
7
Recent evolutions of multiple sequence alignment algorithms.多重序列比对算法的最新进展。
PLoS Comput Biol. 2007 Aug;3(8):e123. doi: 10.1371/journal.pcbi.0030123.
8
P-Match: transcription factor binding site search by combining patterns and weight matrices.P-Match:通过结合模式和权重矩阵进行转录因子结合位点搜索。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W432-7. doi: 10.1093/nar/gki441.
9
MatInspector and beyond: promoter analysis based on transcription factor binding sites.MatInspector及其他:基于转录因子结合位点的启动子分析
Bioinformatics. 2005 Jul 1;21(13):2933-42. doi: 10.1093/bioinformatics/bti473. Epub 2005 Apr 28.
10
Comparative analysis of methods for representing and searching for transcription factor binding sites.转录因子结合位点的表示与搜索方法的比较分析
Bioinformatics. 2004 Dec 12;20(18):3516-25. doi: 10.1093/bioinformatics/bth438. Epub 2004 Aug 5.