• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KIRMES:基于核的常染色质序列调控模块识别。

KIRMES: kernel-based identification of regulatory modules in euchromatic sequences.

机构信息

Friedrich Miescher Laboratory of the Max Planck Society, and Max Planck Institute for Developmental Biology, Tübingen, Germany.

出版信息

Bioinformatics. 2009 Aug 15;25(16):2126-33. doi: 10.1093/bioinformatics/btp278. Epub 2009 Apr 23.

DOI:10.1093/bioinformatics/btp278
PMID:19389732
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2722996/
Abstract

MOTIVATION

Understanding transcriptional regulation is one of the main challenges in computational biology. An important problem is the identification of transcription factor (TF) binding sites in promoter regions of potential TF target genes. It is typically approached by position weight matrix-based motif identification algorithms using Gibbs sampling, or heuristics to extend seed oligos. Such algorithms succeed in identifying single, relatively well-conserved binding sites, but tend to fail when it comes to the identification of combinations of several degenerate binding sites, as those often found in cis-regulatory modules.

RESULTS

We propose a new algorithm that combines the benefits of existing motif finding with the ones of support vector machines (SVMs) to find degenerate motifs in order to improve the modeling of regulatory modules. In experiments on microarray data from Arabidopsis thaliana, we were able to show that the newly developed strategy significantly improves the recognition of TF targets.

AVAILABILITY

The python source code (open source-licensed under GPL), the data for the experiments and a Galaxy-based web service are available at http://www.fml.mpg.de/raetsch/suppl/kirmes/.

摘要

动机

理解转录调控是计算生物学中的主要挑战之一。一个重要的问题是识别潜在转录因子(TF)靶基因启动子区域中的 TF 结合位点。通常通过基于位置权重矩阵的 motif 识别算法使用 Gibbs 采样或启发式方法来扩展种子寡核苷酸来解决此问题。这些算法成功地识别了单个相对保守的结合位点,但在识别几个简并结合位点的组合时往往会失败,因为这些组合通常存在于顺式调控模块中。

结果

我们提出了一种新算法,该算法结合了现有 motif 发现算法和支持向量机(SVM)的优势,以找到简并 motif,从而改进调控模块的建模。在对拟南芥的 microarray 数据进行的实验中,我们能够证明新开发的策略显著提高了 TF 靶标的识别能力。

可用性

python 源代码(在 GPL 下开源)、实验数据以及基于 Galaxy 的网络服务可在 http://www.fml.mpg.de/raetsch/suppl/kirmes/ 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/655a2b8e343b/btp278f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/0fbe2266411c/btp278f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/628ffded1d72/btp278f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/353bd4cdc0d9/btp278f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/549b47d302db/btp278f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/655a2b8e343b/btp278f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/0fbe2266411c/btp278f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/628ffded1d72/btp278f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/353bd4cdc0d9/btp278f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/549b47d302db/btp278f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a5d/2722996/655a2b8e343b/btp278f5.jpg

相似文献

1
KIRMES: kernel-based identification of regulatory modules in euchromatic sequences.KIRMES:基于核的常染色质序列调控模块识别。
Bioinformatics. 2009 Aug 15;25(16):2126-33. doi: 10.1093/bioinformatics/btp278. Epub 2009 Apr 23.
2
Kernel-based identification of regulatory modules.基于内核的调控模块识别
Methods Mol Biol. 2010;674:213-23. doi: 10.1007/978-1-60761-854-6_13.
3
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
4
Comparative promoter region analysis powered by CORG.由CORG驱动的启动子区域比较分析。
BMC Genomics. 2005 Feb 21;6:24. doi: 10.1186/1471-2164-6-24.
5
POWRS: position-sensitive motif discovery.POWRS:位置敏感基序发现。
PLoS One. 2012;7(7):e40373. doi: 10.1371/journal.pone.0040373. Epub 2012 Jul 5.
6
Computational identification of transcription factor binding sites via a transcription-factor-centric clustering (TFCC) algorithm.通过以转录因子为中心的聚类(TFCC)算法对转录因子结合位点进行计算识别。
J Mol Biol. 2002 Apr 19;318(1):71-81. doi: 10.1016/S0022-2836(02)00026-8.
7
RASE: recognition of alternatively spliced exons in C.elegans.RASE:秀丽隐杆线虫中可变剪接外显子的识别
Bioinformatics. 2005 Jun;21 Suppl 1:i369-77. doi: 10.1093/bioinformatics/bti1053.
8
MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes.MAPPER:一种用于在多个基因组中通过计算识别潜在转录因子结合位点的搜索引擎。
BMC Bioinformatics. 2005 Mar 30;6:79. doi: 10.1186/1471-2105-6-79.
9
Seeder: discriminative seeding DNA motif discovery.播种器:鉴别性播种DNA基序发现
Bioinformatics. 2008 Oct 15;24(20):2303-7. doi: 10.1093/bioinformatics/btn444. Epub 2008 Aug 21.
10
SCOPE: a web server for practical de novo motif discovery.SCOPE:用于实际从头基序发现的网络服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W259-64. doi: 10.1093/nar/gkm310. Epub 2007 May 7.

引用本文的文献

1
Discriminating between HuR and TTP binding sites using the k-spectrum kernel method.使用k谱核方法区分HuR和TTP结合位点。
PLoS One. 2017 Mar 23;12(3):e0174052. doi: 10.1371/journal.pone.0174052. eCollection 2017.
2
Identifying transcriptional cis-regulatory modules in animal genomes.识别动物基因组中的转录顺式调控模块。
Wiley Interdiscip Rev Dev Biol. 2015 Mar-Apr;4(2):59-84. doi: 10.1002/wdev.168. Epub 2014 Dec 29.
3
Statistical significance of combinatorial regulations.组合调控的统计显著性。

本文引用的文献

1
Support vector machines and kernels for computational biology.用于计算生物学的支持向量机和核函数。
PLoS Comput Biol. 2008 Oct;4(10):e1000173. doi: 10.1371/journal.pcbi.1000173. Epub 2008 Oct 31.
2
POIMs: positional oligomer importance matrices--understanding support vector machine-based signal detectors.位置寡聚物重要性矩阵(POIMs):理解基于支持向量机的信号检测器
Bioinformatics. 2008 Jul 1;24(13):i6-14. doi: 10.1093/bioinformatics/btn170.
3
Discovering sequence motifs with arbitrary insertions and deletions.发现带有任意插入和缺失的序列基序。
Proc Natl Acad Sci U S A. 2013 Aug 6;110(32):12996-3001. doi: 10.1073/pnas.1302233110. Epub 2013 Jul 23.
4
A histone arginine methylation localizes to nucleosomes in satellite II and III DNA sequences in the human genome.一个组蛋白精氨酸甲基化定位在人类基因组中卫星 II 和 III DNA 序列的核小体上。
BMC Genomics. 2012 Nov 15;13:630. doi: 10.1186/1471-2164-13-630.
5
Sequence-based classification using discriminatory motif feature selection.基于序列的分类使用判别基序特征选择。
PLoS One. 2011;6(11):e27382. doi: 10.1371/journal.pone.0027382. Epub 2011 Nov 10.
6
Discriminative prediction of mammalian enhancers from DNA sequence.从 DNA 序列中鉴别预测哺乳动物增强子。
Genome Res. 2011 Dec;21(12):2167-80. doi: 10.1101/gr.121905.111. Epub 2011 Aug 29.
7
Exploiting physico-chemical properties in string kernels.利用字符串核中的物理化学性质。
BMC Bioinformatics. 2010 Oct 26;11 Suppl 8(Suppl 8):S7. doi: 10.1186/1471-2105-11-S8-S7.
8
Abstracts from the 5th International Society for Computational Biology Student Council Symposium at the 17th Annual International Conference on Intelligent Systems for Molecular Biology and the 8th European Conference on Computational Biology. June 27, 2009. Stockholm, Sweden.第五届国际计算生物学学会学生委员会研讨会摘要,该研讨会于第十七届分子生物学智能系统年度国际会议及第八届欧洲计算生物学会议期间举行。2009年6月27日,瑞典斯德哥尔摩。
BMC Bioinformatics. 2009;10 Suppl 13(Suppl 13):I1, O1-7, P1-7. doi: 10.1186/1471-2105-10-s13-o1.
9
Highlights from the 5th International Society for Computational Biology Student Council Symposium at the 17th Annual International Conference on Intelligent Systems for Molecular Biology and the 8th European Conference on Computational Biology.在第17届分子生物学智能系统年度国际会议和第8届欧洲计算生物学会议上举行的第5届国际计算生物学学会学生委员会研讨会亮点。
BMC Bioinformatics. 2009 Oct 19;10 Suppl 13(Suppl 13):I1. doi: 10.1186/1471-2105-10-S13-I1.
PLoS Comput Biol. 2008 May 9;4(4):e1000071. doi: 10.1371/journal.pcbi.1000071.
4
Accurate splice site prediction using support vector machines.使用支持向量机进行精确的剪接位点预测。
BMC Bioinformatics. 2007;8 Suppl 10(Suppl 10):S7. doi: 10.1186/1471-2105-8-S10-S7.
5
An introduction to kernel-based learning algorithms.基于核的学习算法介绍。
IEEE Trans Neural Netw. 2001;12(2):181-201. doi: 10.1109/72.914517.
6
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.拟南芥信息资源库(TAIR):基因结构与功能注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D1009-14. doi: 10.1093/nar/gkm965. Epub 2007 Nov 5.
7
Evolution of motif variants and positional bias of the cyclic-AMP response element.环磷酸腺苷反应元件的基序变体演变及位置偏向性
BMC Evol Biol. 2007 Feb 8;7 Suppl 1(Suppl 1):S15. doi: 10.1186/1471-2148-7-S1-S15.
8
What is a support vector machine?什么是支持向量机?
Nat Biotechnol. 2006 Dec;24(12):1565-7. doi: 10.1038/nbt1206-1565.
9
WUSCHEL controls meristem function by direct regulation of cytokinin-inducible response regulators.WUSCHEL通过直接调控细胞分裂素诱导的反应调节因子来控制分生组织功能。
Nature. 2005 Dec 22;438(7071):1172-5. doi: 10.1038/nature04270.
10
Galaxy: a platform for interactive large-scale genome analysis.银河:一个用于交互式大规模基因组分析的平台。
Genome Res. 2005 Oct;15(10):1451-5. doi: 10.1101/gr.4086505. Epub 2005 Sep 16.