• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 DNA 自由能的启动子预测及拟南芥和水稻基因组的比较分析。

DNA free energy-based promoter prediction and comparative analysis of Arabidopsis and rice genomes.

机构信息

Indian Institute of Science, Bangalore 560 012, India.

出版信息

Plant Physiol. 2011 Jul;156(3):1300-15. doi: 10.1104/pp.110.167809. Epub 2011 Apr 29.

DOI:10.1104/pp.110.167809
PMID:21531900
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3135951/
Abstract

The cis-regulatory regions on DNA serve as binding sites for proteins such as transcription factors and RNA polymerase. The combinatorial interaction of these proteins plays a crucial role in transcription initiation, which is an important point of control in the regulation of gene expression. We present here an analysis of the performance of an in silico method for predicting cis-regulatory regions in the plant genomes of Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) on the basis of free energy of DNA melting. For protein-coding genes, we achieve recall and precision of 96% and 42% for Arabidopsis and 97% and 31% for rice, respectively. For noncoding RNA genes, the program gives recall and precision of 94% and 75% for Arabidopsis and 95% and 90% for rice, respectively. Moreover, 96% of the false-positive predictions were located in noncoding regions of primary transcripts, out of which 20% were found in the first intron alone, indicating possible regulatory roles. The predictions for orthologous genes from the two genomes showed a good correlation with respect to prediction scores and promoter organization. Comparison of our results with an existing program for promoter prediction in plant genomes indicates that our method shows improved prediction capability.

摘要

DNA 上的顺式调控区充当蛋白质(如转录因子和 RNA 聚合酶)的结合位点。这些蛋白质的组合相互作用在转录起始中起着至关重要的作用,这是基因表达调控的一个重要控制点。我们在此提出了一种基于 DNA 融解自由能的算法,用于预测拟南芥(Arabidopsis thaliana)和水稻(Oryza sativa)植物基因组中顺式调控区的分析。对于编码蛋白质的基因,我们分别实现了 96%和 42%的召回率和 42%的精度,以及 97%和 31%的召回率和 31%的精度。对于非编码 RNA 基因,该程序分别给出了 94%和 75%的召回率和 90%的精度,以及 95%和 90%的精度。此外,96%的假阳性预测位于初级转录物的非编码区,其中 20%仅位于第一个内含子中,表明可能具有调节作用。来自两个基因组的同源基因的预测在预测分数和启动子组织方面具有很好的相关性。将我们的结果与现有的植物基因组启动子预测程序进行比较表明,我们的方法显示出了改进的预测能力。

相似文献

1
DNA free energy-based promoter prediction and comparative analysis of Arabidopsis and rice genomes.基于 DNA 自由能的启动子预测及拟南芥和水稻基因组的比较分析。
Plant Physiol. 2011 Jul;156(3):1300-15. doi: 10.1104/pp.110.167809. Epub 2011 Apr 29.
2
Evolutionary expansion, gene structure, and expression of the rice wall-associated kinase gene family.水稻细胞壁相关激酶基因家族的进化扩张、基因结构及表达
Plant Physiol. 2005 Nov;139(3):1107-24. doi: 10.1104/pp.105.069005.
3
Distribution of short interstitial telomere motifs in two plant genomes: putative origin and function.短串联重复序列在两个植物基因组中的分布:可能的起源和功能。
BMC Plant Biol. 2010 Dec 20;10:283. doi: 10.1186/1471-2229-10-283.
4
Systematic analysis of alternative first exons in plant genomes.植物基因组中可变首个外显子的系统分析。
BMC Plant Biol. 2007 Oct 17;7:55. doi: 10.1186/1471-2229-7-55.
5
Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots.跨单子叶植物和双子叶植物核心启动子元件的全基因组计算预测与分析
PLoS One. 2013 Oct 29;8(10):e79011. doi: 10.1371/journal.pone.0079011. eCollection 2013.
6
Analyses of phylogeny, evolution, conserved sequences and genome-wide expression of the ICK/KRP family of plant CDK inhibitors.植物 CDK 抑制剂 ICK/KRP 家族的系统发育、进化、保守序列和全基因组表达分析。
Ann Bot. 2011 May;107(7):1141-57. doi: 10.1093/aob/mcr034. Epub 2011 Mar 7.
7
Genome-wide analysis of LIM gene family in Populus trichocarpa, Arabidopsis thaliana, and Oryza sativa.毛果杨、拟南芥和水稻中LIM基因家族的全基因组分析。
DNA Res. 2007 Jun 30;14(3):103-16. doi: 10.1093/dnares/dsm013. Epub 2007 Jun 15.
8
Features of Arabidopsis genes and genome discovered using full-length cDNAs.利用全长cDNA发现的拟南芥基因和基因组特征。
Plant Mol Biol. 2006 Jan;60(1):69-85. doi: 10.1007/s11103-005-2564-9.
9
Comparative analysis of microRNA promoters in Arabidopsis and rice.拟南芥和水稻中 microRNA 启动子的比较分析。
Genomics Proteomics Bioinformatics. 2013 Feb;11(1):56-60. doi: 10.1016/j.gpb.2012.12.004. Epub 2013 Jan 11.
10
In silico analysis of promoter regions from cold-induced genes in rice (Oryza sativa L.) and Arabidopsis thaliana reveals the importance of combinatorial control.对水稻(Oryza sativa L.)和拟南芥中冷诱导基因启动子区域的计算机模拟分析揭示了组合调控的重要性。
Bioinformatics. 2009 Jun 1;25(11):1345-8. doi: 10.1093/bioinformatics/btp172. Epub 2009 Mar 25.

引用本文的文献

1
endogenous promoter AMGT1P33 enhances salt tolerance in Arabidopsis by regulating exogenous salt-tolerance genes.内源性启动子AMGT1P33通过调控外源耐盐基因增强拟南芥的耐盐性。
Front Plant Sci. 2025 Mar 14;16:1541465. doi: 10.3389/fpls.2025.1541465. eCollection 2025.
2
Biological and Molecular Components for Genetically Engineering Biosensors in Plants.用于植物基因工程生物传感器的生物和分子组件。
Biodes Res. 2022 Nov 9;2022:9863496. doi: 10.34133/2022/9863496. eCollection 2022.
3
Transcriptome-based Mining of the Constitutive Promoters for Tuning Gene Expression in Aspergillus oryzae.基于转录组的 Aspergillus oryzae 组成型启动子挖掘,用于调控基因表达。
J Microbiol. 2023 Feb;61(2):199-210. doi: 10.1007/s12275-023-00020-0. Epub 2023 Feb 6.
4
Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.原核生物和真核生物启动子预测的计算工具的批判性评估。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab551.
5
TSSFinder-fast and accurate ab initio prediction of the core promoter in eukaryotic genomes.TSSFinder——真核基因组中核心启动子的快速、准确从头预测。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab198.
6
Delving into Eukaryotic Origins of Replication Using DNA Structural Features.利用DNA结构特征深入探究真核生物复制起点
ACS Omega. 2020 Jun 1;5(23):13601-13611. doi: 10.1021/acsomega.0c00441. eCollection 2020 Jun 16.
7
Identification of Regulatory SNPs Associated with Vicine and Convicine Content of Based on Genotyping by Sequencing Data Using Deep Learning.基于深度学习的测序数据基因分型鉴定与野麻蚕野蚕丝素和杂蛋白含量相关的调控 SNP。
Genes (Basel). 2020 Jun 5;11(6):614. doi: 10.3390/genes11060614.
8
Natural Sequence Variations and Combinations of GNP1 and NAL1 Determine the Grain Number per Panicle in Rice.GNP1和NAL1的自然序列变异及组合决定水稻每穗粒数
Rice (N Y). 2020 Feb 28;13(1):14. doi: 10.1186/s12284-020-00374-8.
9
Variation of gene expression in plants is influenced by gene architecture and structural properties of promoters.植物基因表达的变化受基因结构和启动子结构特性的影响。
PLoS One. 2019 Mar 25;14(3):e0212678. doi: 10.1371/journal.pone.0212678. eCollection 2019.
10
ShapeGTB: the role of local DNA shape in prioritization of functional variants in human promoters with machine learning.ShapeGTB:局部DNA形状在利用机器学习对人类启动子中的功能变异进行优先级排序中的作用。
PeerJ. 2018 Nov 29;6:e5742. doi: 10.7717/peerj.5742. eCollection 2018.

本文引用的文献

1
AtMHX is an auxin and ABA-regulated transporter whose expression pattern suggests a role in metal homeostasis in tissues with photosynthetic potential.AtMHX是一种受生长素和脱落酸调节的转运蛋白,其表达模式表明它在具有光合潜力的组织中的金属稳态中发挥作用。
Funct Plant Biol. 2006 Jul;33(7):661-672. doi: 10.1071/FP05295.
2
High-quality annotation of promoter regions for 913 bacterial genomes.对 913 个细菌基因组的启动子区域进行高质量注释。
Bioinformatics. 2010 Dec 15;26(24):3043-50. doi: 10.1093/bioinformatics/btq577. Epub 2010 Oct 17.
3
A novel role for minimal introns: routing mRNAs to the cytosol.内含子最小化的新作用:将 mRNA 导向细胞质。
PLoS One. 2010 Apr 12;5(4):e10144. doi: 10.1371/journal.pone.0010144.
4
Genome sequencing and analysis of the model grass Brachypodium distachyon.拟南芥基因组测序和分析。
Nature. 2010 Feb 11;463(7282):763-8. doi: 10.1038/nature08747.
5
Schizosaccharomyces pombe genome-wide nucleosome mapping reveals positioning mechanisms distinct from those of Saccharomyces cerevisiae.裂殖酵母全基因组核小体作图揭示了与酿酒酵母不同的定位机制。
Nat Struct Mol Biol. 2010 Feb;17(2):251-7. doi: 10.1038/nsmb.1741. Epub 2010 Jan 31.
6
Genome sequence of the palaeopolyploid soybean.古多倍体大豆基因组序列。
Nature. 2010 Jan 14;463(7278):178-83. doi: 10.1038/nature08670.
7
In plants, expression breadth and expression level distinctly and non-linearly correlate with gene structure.在植物中,表达广度和表达水平与基因结构明显呈非线性相关。
Biol Direct. 2009 Nov 21;4:45; discussion 45. doi: 10.1186/1745-6150-4-45.
8
Finding human promoter groups based on DNA physical properties.基于DNA物理特性寻找人类启动子组。
Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Oct;80(4 Pt 1):041917. doi: 10.1103/PhysRevE.80.041917. Epub 2009 Oct 13.
9
Ensembl Genomes: extending Ensembl across the taxonomic space.Ensembl Genomes:在分类学空间中扩展 Ensembl。
Nucleic Acids Res. 2010 Jan;38(Database issue):D563-9. doi: 10.1093/nar/gkp871. Epub 2009 Nov 1.
10
The word landscape of the non-coding segments of the Arabidopsis thaliana genome.拟南芥基因组非编码区段的词汇景观。
BMC Genomics. 2009 Oct 8;10:463. doi: 10.1186/1471-2164-10-463.