• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用马尔可夫转移矩阵检测真核生物启动子。

Detection of eukaryotic promoters using Markov transition matrices.

作者信息

Audic S, Claverie J M

机构信息

Institute of Structural Biology and Microbiology, CNRS, Marseille, France.

出版信息

Comput Chem. 1997;21(4):223-7. doi: 10.1016/s0097-8485(96)00040-x.

DOI:10.1016/s0097-8485(96)00040-x
PMID:9440929
Abstract

Eukaryotic promoters are among the most important functional domains yet to be characterized in a satisfactory manner in genomic sequences. Most current detection methods rely on the recognition of individual transcription elements using position-weight matrices (PWM) or consensus sequences. Here, we study a simple promoter detection algorithm based on Markov transition matrices built from sequences upward from proven transcription initiation sites. The performances have been evaluated on the training set and on a test set of promoter-containing sequences. The results on the training set are surprisingly good, given that the algorithm does not incorporate any specific knowledge about promoters. Yet, the program exhibits the pathological behaviour typical of all training set-based methods: a significant decline in performance when confronted with previously unseen sequences. Thus, the Markov algorithm, like the others presently available, does not truly capture the essence of eukaryotic promoters. A detection program based on a Markov model is likely to be blind to categories of promoters without close representatives in the training set.

摘要

真核生物启动子是基因组序列中尚未得到充分表征的最重要功能域之一。目前大多数检测方法依赖于使用位置权重矩阵(PWM)或共有序列来识别单个转录元件。在此,我们研究一种基于马尔可夫转移矩阵的简单启动子检测算法,该矩阵由已证实的转录起始位点向上构建的序列生成。已在训练集和一组含启动子序列的测试集上评估了该算法的性能。鉴于该算法未纳入任何关于启动子的特定知识,训练集上的结果出奇地好。然而,该程序表现出所有基于训练集的方法典型的病态行为:面对以前未见过的序列时性能显著下降。因此,马尔可夫算法与目前其他可用算法一样,并未真正抓住真核生物启动子的本质。基于马尔可夫模型的检测程序可能会对训练集中没有相近代表的启动子类别视而不见。

相似文献

1
Detection of eukaryotic promoters using Markov transition matrices.使用马尔可夫转移矩阵检测真核生物启动子。
Comput Chem. 1997;21(4):223-7. doi: 10.1016/s0097-8485(96)00040-x.
2
Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences.从502个不相关的启动子序列中得出的四种真核生物RNA聚合酶II启动子元件的权重矩阵描述。
J Mol Biol. 1990 Apr 20;212(4):563-78. doi: 10.1016/0022-2836(90)90223-9.
3
Analysis of eukaryotic promoter sequences reveals a systematically occurring CT-signal.对真核生物启动子序列的分析揭示了一种系统出现的CT信号。
Nucleic Acids Res. 1995 Apr 11;23(7):1223-30. doi: 10.1093/nar/23.7.1223.
4
Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters.果蝇TFIID与许多缺乏TATA框的启动子中存在的保守下游基础启动子元件结合。
Genes Dev. 1996 Mar 15;10(6):711-24. doi: 10.1101/gad.10.6.711.
5
PromFD 1.0: a computer program that predicts eukaryotic pol II promoters using strings and IMD matrices.PromFD 1.0:一个使用字符串和IMD矩阵预测真核生物聚合酶II启动子的计算机程序。
Comput Appl Biosci. 1997 Feb;13(1):29-35. doi: 10.1093/bioinformatics/13.1.29.
6
Eukaryotic promoter recognition by binding sites for transcription factors.通过转录因子结合位点进行真核生物启动子识别
Comput Appl Biosci. 1995 Oct;11(5):477-88. doi: 10.1093/bioinformatics/11.5.477.
7
Interpolated markov chains for eukaryotic promoter recognition.用于真核生物启动子识别的内插马尔可夫链
Bioinformatics. 1999 May;15(5):362-9. doi: 10.1093/bioinformatics/15.5.362.
8
RpoD promoters in Campylobacter jejuni exhibit a strong periodic signal instead of a -35 box.空肠弯曲菌中的RpoD启动子呈现出强烈的周期性信号而非-35框。
J Mol Biol. 2003 Mar 7;326(5):1361-72. doi: 10.1016/s0022-2836(03)00034-2.
9
Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters.人类和酵母基因中起始子相对于TATA框的发生率以及人类无TATA核心启动子中富集的DNA基序的鉴定。
Gene. 2007 Mar 1;389(1):52-65. doi: 10.1016/j.gene.2006.09.029. Epub 2006 Oct 10.
10
Visualizing the competitive recognition of TATA-boxes in vertebrate promoters.
Trends Genet. 1998 Jan;14(1):10-1. doi: 10.1016/S0168-9525(97)01323-1.

引用本文的文献

1
Distribution rules of 8-mer spectra and characterization of evolution state in animal genome sequences.8 元组谱的分布规律及动物基因组序列进化状态的特征。
BMC Genomics. 2024 Sep 12;25(1):855. doi: 10.1186/s12864-024-10786-1.
2
iPro70-FMWin: identifying Sigma70 promoters using multiple windowing and minimal features.iPro70-FMWin:使用多重窗口和最小特征识别 Sigma70 启动子。
Mol Genet Genomics. 2019 Feb;294(1):69-84. doi: 10.1007/s00438-018-1487-5. Epub 2018 Sep 5.
3
Osteopontin alters DNA methylation through up-regulating DNMT1 and sensitizes CD133+/CD44+ cancer stem cells to 5 azacytidine in hepatocellular carcinoma.
骨桥蛋白通过上调 DNMT1 改变 DNA 甲基化,使肝癌中 CD133+/CD44+ 肿瘤干细胞对 5-氮杂胞苷敏感。
J Exp Clin Cancer Res. 2018 Jul 31;37(1):179. doi: 10.1186/s13046-018-0832-1.
4
De novo assembly and characterization of Muscovy duck liver transcriptome and analysis of differentially regulated genes in response to heat stress.番鸭肝脏转录组的从头组装与表征以及热应激响应中差异调控基因的分析。
Cell Stress Chaperones. 2015 May;20(3):483-93. doi: 10.1007/s12192-015-0573-4. Epub 2015 Feb 9.
5
De novo characterization of the Anthurium transcriptome and analysis of its digital gene expression under cold stress.红掌转录组的从头表征及其在冷胁迫下的数字基因表达分析。
BMC Genomics. 2013 Nov 25;14(1):827. doi: 10.1186/1471-2164-14-827.
6
Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction.果蝇核心启动子模块的鉴定及其在准确转录起始位点预测中的应用。
Nucleic Acids Res. 2006;34(20):5943-50. doi: 10.1093/nar/gkl608. Epub 2006 Oct 26.
7
Consensus promoter identification in the human genome utilizing expressed gene markers and gene modeling.利用表达基因标记和基因建模在人类基因组中鉴定一致性启动子。
Genome Res. 2002 Mar;12(3):462-9. doi: 10.1101/gr.198002.
8
Computational detection and location of transcription start sites in mammalian genomic DNA.哺乳动物基因组DNA中转录起始位点的计算检测与定位
Genome Res. 2002 Mar;12(3):458-61. doi: 10.1101/gr.216102.
9
Promoter prediction on a genomic scale--the Adh experience.基因组尺度上的启动子预测——乙醇脱氢酶的经验
Genome Res. 2000 Apr;10(4):539-42. doi: 10.1101/gr.10.4.539.
10
Self-identification of protein-coding regions in microbial genomes.微生物基因组中蛋白质编码区域的自我识别。
Proc Natl Acad Sci U S A. 1998 Aug 18;95(17):10026-31. doi: 10.1073/pnas.95.17.10026.