• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data.基于人工神经网络-遗传算法模型,利用基因芯片数据对拟南芥启动子进行预测
Bioinformation. 2011;6(6):240-3. doi: 10.6026/97320630006240. Epub 2011 Jun 6.
2
Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks.使用卷积深度学习神经网络识别原核生物和真核生物启动子。
PLoS One. 2017 Feb 3;12(2):e0171410. doi: 10.1371/journal.pone.0171410. eCollection 2017.
3
Identification of TATA and TATA-less promoters in plant genomes by integrating diversity measure, GC-Skew and DNA geometric flexibility.通过整合多样性度量、GC-偏斜和 DNA 几何弹性来鉴定植物基因组中的 TATA 和无 TATA 启动子。
Genomics. 2011 Feb;97(2):112-20. doi: 10.1016/j.ygeno.2010.11.002. Epub 2010 Nov 26.
4
Identification of putative promoters in 48 eukaryotic genomes on the basis of DNA free energy.基于 DNA 自由能鉴定 48 种真核生物基因组中的假定启动子。
Sci Rep. 2018 Mar 14;8(1):4520. doi: 10.1038/s41598-018-22129-8.
5
Annotation of gene promoters by integrative data-mining of ChIP-seq Pol-II enrichment data.通过整合 ChIP-seq Pol-II 富集数据的数据挖掘对基因启动子进行注释。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S65. doi: 10.1186/1471-2105-11-S1-S65.
6
TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation.植物基因中 TATA 盒预期位置的 TC 基序:一类新的参与转录调控的基序。
BMC Genomics. 2010 Mar 12;11:166. doi: 10.1186/1471-2164-11-166.
7
Heterogeneity of Arabidopsis core promoters revealed by high-density TSS analysis.通过高密度转录起始位点分析揭示拟南芥核心启动子的异质性
Plant J. 2009 Oct;60(2):350-62. doi: 10.1111/j.1365-313X.2009.03958.x. Epub 2009 Jun 29.
8
Artificial neural networks for prediction of mycobacterial promoter sequences.用于预测分枝杆菌启动子序列的人工神经网络
Comput Biol Chem. 2003 Dec;27(6):555-64. doi: 10.1016/j.compbiolchem.2003.09.004.
9
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae.种子储存蛋白基因启动子在十字花科、豆科和禾本科植物中含有保守的DNA基序。
BMC Plant Biol. 2009 Oct 20;9:126. doi: 10.1186/1471-2229-9-126.
10
A TILLING resource for functional genomics in Arabidopsis thaliana accession C24.拟南芥C24生态型功能基因组学的一个定向诱导基因组局部突变资源。
Genes Genet Syst. 2012;87(5):291-7. doi: 10.1266/ggs.87.291.

引用本文的文献

1
Global nucleosome positioning regulates salicylic acid mediated transcription in Arabidopsis thaliana.全局核小体定位调控拟南芥中水杨酸介导的转录。
BMC Plant Biol. 2015 Jan 21;15:13. doi: 10.1186/s12870-014-0404-2.
2
Preeclampsia: a bioinformatics approach through protein-protein interaction networks analysis.子痫前期:一种通过蛋白质-蛋白质相互作用网络分析的生物信息学方法。
BMC Syst Biol. 2012 Aug 8;6:97. doi: 10.1186/1752-0509-6-97.

本文引用的文献

1
Position-dependent motif characterization using non-negative matrix factorization.使用非负矩阵分解进行位置依赖基序表征
Bioinformatics. 2008 Dec 1;24(23):2684-90. doi: 10.1093/bioinformatics/btn526. Epub 2008 Oct 13.
2
Knowledge-guided multi-scale independent component analysis for biomarker identification.用于生物标志物识别的知识引导多尺度独立成分分析
BMC Bioinformatics. 2008 Oct 6;9:416. doi: 10.1186/1471-2105-9-416.
3
Generic eukaryotic core promoter prediction using structural features of DNA.利用DNA结构特征进行通用真核生物核心启动子预测。
Genome Res. 2008 Feb;18(2):310-23. doi: 10.1101/gr.6991408. Epub 2007 Dec 20.
4
Computational analyses of eukaryotic promoters.真核生物启动子的计算分析。
BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S3. doi: 10.1186/1471-2105-8-S6-S3.
5
An efficient algorithm for the identification of structured motifs in DNA promoter sequences.一种识别DNA启动子序列中结构化基序的高效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):126-40. doi: 10.1109/TCBB.2006.16.
6
Extracting relations between promoter sequences and their strengths from microarray data.从微阵列数据中提取启动子序列与其强度之间的关系。
Bioinformatics. 2005 Apr 1;21(7):1062-8. doi: 10.1093/bioinformatics/bti094. Epub 2004 Oct 28.
7
DNA dynamically directs its own transcription initiation.DNA动态地指导其自身的转录起始。
Nucleic Acids Res. 2004 Mar 5;32(4):1584-90. doi: 10.1093/nar/gkh335. Print 2004.
8
Dragon gene start finder: an advanced system for finding approximate locations of the start of gene transcriptional units.龙基因起始位点查找器:一种用于查找基因转录单元起始大致位置的先进系统。
Genome Res. 2003 Aug;13(8):1923-9. doi: 10.1101/gr.869803. Epub 2003 Jul 17.
9
PromH: Promoters identification using orthologous genomic sequences.PromH:利用直系同源基因组序列进行启动子识别。
Nucleic Acids Res. 2003 Jul 1;31(13):3540-5. doi: 10.1093/nar/gkg525.
10
The RNA polymerase II core promoter.RNA聚合酶II核心启动子。
Annu Rev Biochem. 2003;72:449-79. doi: 10.1146/annurev.biochem.72.121801.161520. Epub 2003 Mar 19.

基于人工神经网络-遗传算法模型,利用基因芯片数据对拟南芥启动子进行预测

An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data.

作者信息

Mishra Hrishikesh, Singh Nitya, Misra Krishna, Lahiri Tapobrata

机构信息

Division of Applied Sciences and Indo-Russian Centre for Biotechnology, Indian Institute of Information Technology, Allahabad, India.

出版信息

Bioinformation. 2011;6(6):240-3. doi: 10.6026/97320630006240. Epub 2011 Jun 6.

DOI:10.6026/97320630006240
PMID:21887014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3159145/
Abstract

Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to predict the promoter regions in eukaryotic genomes.

摘要

启动子区域的识别是基因注释的重要组成部分。真核生物中启动子的识别很重要,因为启动子可调节各种代谢功能和细胞应激反应。在这项工作中,一种利用模式真核植物拟南芥的耕作微阵列数据强度值的新方法,被用于从非启动子区域中确定启动子区域。采用由遗传算法支持的前馈反向传播神经网络模型,以41的窗口大小预测数据类别。使用了一个数据集,该数据集由2992个代表启动子和非启动子区域的数据向量组成,这些数据向量是从通过耕作微阵列技术生成的拟南芥全基因组探针强度向量中随机选择的。分类器模型在训练集和验证集上的预测准确率分别为69.73%和65.36%。此外,基于距离的类成员概念被用于验证分类器的可靠性,结果很有前景。该研究表明微阵列探针强度可用于预测真核生物基因组中的启动子区域。