• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SiTaR:一种用于转录因子结合位点预测的新型工具。

SiTaR: a novel tool for transcription factor binding site prediction.

机构信息

Research Group Systems Biology/Bioinformatics, Leibniz Institute for Natural Product Research and Infection Biology, Hans Knöll Institute, 07745 Jena, Germany.

出版信息

Bioinformatics. 2011 Oct 15;27(20):2806-11. doi: 10.1093/bioinformatics/btr492. Epub 2011 Sep 4.

DOI:10.1093/bioinformatics/btr492
PMID:21893518
Abstract

MOTIVATION

Prediction of transcription factor binding sites (TFBSs) is crucial for promoter modeling and network inference. Quality of the predictions is spoiled by numerous false positives, which persist as the main problem for all presently available TFBS search methods.

RESULTS

We suggest a novel approach, which is alternative to widely used position weight matrices (PWMs) and Hidden Markov Models. Each motif of the input set is used as a search template to scan a query sequence. Found motifs are assigned scores depending on the non-randomness of the motif's occurrence, the number of matching searching motifs and the number of mismatches. The non-randomness is estimated by comparison of observed numbers of matching motifs with those predicted to occur by chance. The latter can be calculated given the base compositions of the motif and the query sequence. The method does not require preliminary alignment of the input motifs, hence avoiding uncertainties introduced by the alignment procedure. In comparison with PWM-based tools, our method demonstrates higher precision by the same sensitivity and specificity. It also tends to outperform methods combining pattern and PWM search. Most important, it allows reducing the number of false positive predictions significantly.

AVAILABILITY

The method is implemented in a tool called SiTaR (Site Tracking and Recognition) and is available at http://sbi.hki-jena.de/sitar/index.php.

CONTACT

ekaterina.shelest@hki-jena.de

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

转录因子结合位点 (TFBS) 的预测对于启动子建模和网络推断至关重要。由于存在大量的假阳性,预测的质量受到了影响,而这些假阳性一直是所有现有 TFBS 搜索方法的主要问题。

结果

我们提出了一种新的方法,它与广泛使用的位置权重矩阵 (PWMs) 和隐马尔可夫模型不同。输入集中的每个基序都用作搜索模板来扫描查询序列。找到的基序根据基序出现的非随机性、匹配搜索基序的数量和不匹配的数量来分配分数。非随机性通过将观察到的匹配基序数量与随机发生的基序数量进行比较来估计。后者可以根据基序和查询序列的碱基组成来计算。该方法不需要对输入基序进行初步对齐,从而避免了对齐过程中引入的不确定性。与基于 PWM 的工具相比,我们的方法在相同的灵敏度和特异性下表现出更高的精度。它也倾向于优于结合模式和 PWM 搜索的方法。最重要的是,它可以显著减少假阳性预测的数量。

可用性

该方法在称为 SiTaR(Site Tracking and Recognition)的工具中实现,并可在 http://sbi.hki-jena.de/sitar/index.php 上获得。

联系方式

ekaterina.shelest@hki-jena.de

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
SiTaR: a novel tool for transcription factor binding site prediction.SiTaR:一种用于转录因子结合位点预测的新型工具。
Bioinformatics. 2011 Oct 15;27(20):2806-11. doi: 10.1093/bioinformatics/btr492. Epub 2011 Sep 4.
2
DistanceScan: a tool for promoter modeling.DistanceScan:一种启动子建模工具。
Bioinformatics. 2010 Jun 1;26(11):1460-2. doi: 10.1093/bioinformatics/btq132. Epub 2010 Mar 25.
3
A DNA shape-based regulatory score improves position-weight matrix-based recognition of transcription factor binding sites.一种基于DNA形状的调控评分提高了基于位置权重矩阵对转录因子结合位点的识别。
Bioinformatics. 2015 Nov 1;31(21):3445-50. doi: 10.1093/bioinformatics/btv391. Epub 2015 Jun 30.
4
A novel method for improved accuracy of transcription factor binding site prediction.一种提高转录因子结合位点预测准确性的新方法。
Nucleic Acids Res. 2018 Jul 6;46(12):e72. doi: 10.1093/nar/gky237.
5
From binding motifs in ChIP-Seq data to improved models of transcription factor binding sites.从染色质免疫沉淀测序(ChIP-Seq)数据中的结合基序到转录因子结合位点的改进模型
J Bioinform Comput Biol. 2013 Feb;11(1):1340004. doi: 10.1142/S0219720013400040. Epub 2013 Jan 16.
6
LASAGNA-Search: an integrated web tool for transcription factor binding site search and visualization.LASAGNA-Search:一个用于转录因子结合位点搜索和可视化的集成网络工具。
Biotechniques. 2013 Mar;54(3):141-53. doi: 10.2144/000113999.
7
DISCOVER: a feature-based discriminative method for motif search in complex genomes.DISCOVER:一种用于在复杂基因组中进行基序搜索的基于特征的判别方法。
Bioinformatics. 2009 Jun 15;25(12):i321-9. doi: 10.1093/bioinformatics/btp230.
8
An integrated toolkit for accurate prediction and analysis of cis-regulatory motifs at a genome scale.一个基因组范围内精确预测和分析顺式调控基序的综合工具包。
Bioinformatics. 2013 Sep 15;29(18):2261-8. doi: 10.1093/bioinformatics/btt397. Epub 2013 Jul 10.
9
TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.拖网生物:下一代测序数据集的在线从头基序发现工具。
BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0.
10
Tree-based position weight matrix approach to model transcription factor binding site profiles.基于树的位置权重矩阵方法来模拟转录因子结合位点图谱。
PLoS One. 2011;6(9):e24210. doi: 10.1371/journal.pone.0024210. Epub 2011 Sep 2.

引用本文的文献

1
Scoring Targets of Transcription in Bacteria Rather than Focusing on Individual Binding Sites.细菌转录的评分目标,而非专注于单个结合位点。
Front Microbiol. 2017 Nov 22;8:2314. doi: 10.3389/fmicb.2017.02314. eCollection 2017.
2
Most of the tight positional conservation of transcription factor binding sites near the transcription start site reflects their co-localization within regulatory modules.转录起始位点附近转录因子结合位点的大多数紧密位置保守性反映了它们在调控模块内的共定位。
BMC Bioinformatics. 2016 Nov 21;17(1):479. doi: 10.1186/s12859-016-1354-5.
3
Evaluating tools for transcription factor binding site prediction.
评估转录因子结合位点预测工具。
BMC Bioinformatics. 2016 Nov 2;17(1):547. doi: 10.1186/s12859-016-1298-9.
4
Analysis of Genomic Sequence Motifs for Deciphering Transcription Factor Binding and Transcriptional Regulation in Eukaryotic Cells.用于解析真核细胞中转录因子结合和转录调控的基因组序列基序分析
Front Genet. 2016 Feb 23;7:24. doi: 10.3389/fgene.2016.00024. eCollection 2016.
5
Computational prediction of molecular pathogen-host interactions based on dual transcriptome data.基于双转录组数据的分子病原体-宿主相互作用的计算预测
Front Microbiol. 2015 Feb 6;6:65. doi: 10.3389/fmicb.2015.00065. eCollection 2015.
6
Microevolution of Candida albicans in macrophages restores filamentation in a nonfilamentous mutant.白色念珠菌在巨噬细胞中的微进化可恢复非丝状突变体中的丝状化。
PLoS Genet. 2014 Dec 4;10(12):e1004824. doi: 10.1371/journal.pgen.1004824. eCollection 2014 Dec.
7
Interactive exploration of integrated biological datasets using context-sensitive workflows.使用上下文敏感工作流交互式探索集成生物数据集。
Front Genet. 2014 Feb 20;5:21. doi: 10.3389/fgene.2014.00021. eCollection 2014.
8
Information theory applications for biological sequence analysis.信息论在生物序列分析中的应用。
Brief Bioinform. 2014 May;15(3):376-89. doi: 10.1093/bib/bbt068. Epub 2013 Sep 20.
9
LASAGNA: a novel algorithm for transcription factor binding site alignment.LASAGNA:一种用于转录因子结合位点比对的新算法。
BMC Bioinformatics. 2013 Mar 24;14:108. doi: 10.1186/1471-2105-14-108.
10
Effect of positional dependence and alignment strategy on modeling transcription factor binding sites.
BMC Res Notes. 2012 Jul 2;5:340. doi: 10.1186/1756-0500-5-340.