• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用模式生物对新测序物种的调控序列进行注释的统计方法。

Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

机构信息

Computer Laboratory, University of Cambridge, Cambridge, United Kingdom.

出版信息

PLoS One. 2012;7(9):e42489. doi: 10.1371/journal.pone.0042489. Epub 2012 Sep 11.

DOI:10.1371/journal.pone.0042489
PMID:22984403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3439465/
Abstract

A major goal of bioinformatics is the characterization of transcription factors and the transcriptional programs they regulate. Given the speed of genome sequencing, we would like to quickly annotate regulatory sequences in newly-sequenced genomes. In such cases, it would be helpful to predict sequence motifs by using experimental data from closely related model organism. Here we present a general algorithm that allow to identify transcription factor binding sites in one newly sequenced species by performing Bayesian regression on the annotated species. First we set the rationale of our method by applying it within the same species, then we extend it to use data available in closely related species. Finally, we generalise the method to handle the case when a certain number of experiments, from several species close to the species on which to make inference, are available. In order to show the performance of the method, we analyse three functionally related networks in the Ascomycota. Two gene network case studies are related to the G2/M phase of the Ascomycota cell cycle; the third is related to morphogenesis. We also compared the method with MatrixReduce and discuss other types of validation and tests. The first network is well known and provides a biological validation test of the method. The two cell cycle case studies, where the gene network size is conserved, demonstrate an effective utility in annotating new species sequences using all the available replicas from model species. The third case, where the gene network size varies among species, shows that the combination of information is less powerful but is still informative. Our methodology is quite general and could be extended to integrate other high-throughput data from model organisms.

摘要

生物信息学的一个主要目标是对转录因子及其调控的转录程序进行特征描述。考虑到基因组测序的速度,我们希望能够快速注释新测序基因组中的调控序列。在这种情况下,如果能够利用来自密切相关的模式生物的实验数据来预测序列基序,将会很有帮助。在这里,我们提出了一种通用算法,通过在已注释的物种上进行贝叶斯回归,来识别新测序物种中的转录因子结合位点。首先,我们通过将该方法应用于同一物种内部来阐明其原理,然后将其扩展到使用密切相关物种中的可用数据。最后,我们将该方法推广到处理来自几个与推断物种接近的物种的一定数量的实验的情况。为了展示该方法的性能,我们分析了子囊菌门中的三个功能相关网络。两个基因网络案例研究与子囊菌细胞周期的 G2/M 期有关;第三个与形态发生有关。我们还将该方法与 MatrixReduce 进行了比较,并讨论了其他类型的验证和测试。第一个网络是众所周知的,为该方法提供了生物学验证测试。两个细胞周期案例研究中,基因网络的大小是保守的,这证明了使用来自模式物种的所有可用副本注释新物种序列的有效效用。在基因网络大小在物种之间变化的第三个案例中,信息的组合虽然不那么强大,但仍然具有信息性。我们的方法非常通用,可以扩展到整合来自模式生物的其他高通量数据。

相似文献

1
Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.利用模式生物对新测序物种的调控序列进行注释的统计方法。
PLoS One. 2012;7(9):e42489. doi: 10.1371/journal.pone.0042489. Epub 2012 Sep 11.
2
A functional selection model explains evolutionary robustness despite plasticity in regulatory networks.功能选择模型解释了尽管调控网络具有可塑性,但仍能保持进化稳健性。
Mol Syst Biol. 2012;8:619. doi: 10.1038/msb.2012.50.
3
A map of the cis-regulatory sequences in the mouse genome.小鼠基因组中顺式调控序列的图谱。
Nature. 2012 Aug 2;488(7409):116-20. doi: 10.1038/nature11243.
4
Inference of transcriptional networks in Arabidopsis through conserved noncoding sequence analysis.通过保守非编码序列分析推断拟南芥转录网络
Plant Cell. 2014 Jul;26(7):2729-45. doi: 10.1105/tpc.114.127001. Epub 2014 Jul 2.
5
6
From elements to modules: regulatory evolution in Ascomycota fungi.从元件到模块:子囊菌真菌中的调控进化。
Curr Opin Genet Dev. 2009 Dec;19(6):571-8. doi: 10.1016/j.gde.2009.09.007. Epub 2009 Oct 29.
7
Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.基序分析揭示了叶绿体基因和编码叶绿体蛋白的核基因的可能协同调控。
Plant Mol Biol. 2012 Sep;80(2):177-87. doi: 10.1007/s11103-012-9938-6. Epub 2012 Jun 26.
8
Context specific transcription factor prediction.上下文特异性转录因子预测
Ann Biomed Eng. 2007 Jun;35(6):1053-67. doi: 10.1007/s10439-007-9268-z. Epub 2007 Mar 22.
9
A probabilistic method to detect regulatory modules.一种检测调控模块的概率方法。
Bioinformatics. 2003;19 Suppl 1:i292-301. doi: 10.1093/bioinformatics/btg1040.
10
Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.利用系统发育信息注释(PIA)在非模式生物的转录组中搜索光相互作用基因。
BMC Bioinformatics. 2014 Nov 19;15(1):350. doi: 10.1186/s12859-014-0350-x.

引用本文的文献

1
Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems.通过整合 ChIP-seq 和 RNA-seq 数据来理解基因调控机制:解决生物学问题的统计解决方案。
Front Cell Dev Biol. 2014 Sep 17;2:51. doi: 10.3389/fcell.2014.00051. eCollection 2014.

本文引用的文献

1
Genetic analysis of variation in transcription factor binding in yeast.酵母中转录因子结合变化的遗传分析。
Nature. 2010 Apr 22;464(7292):1187-91. doi: 10.1038/nature08934. Epub 2010 Mar 17.
2
Conserved expression without conserved regulatory sequence: the more things change, the more they stay the same.保守的表达没有保守的调控序列:事物变化越多,它们保持不变的就越多。
Trends Genet. 2010 Feb;26(2):66-74. doi: 10.1016/j.tig.2009.12.002. Epub 2010 Jan 18.
3
Measuring similarity between gene expression profiles: a Bayesian approach.
测量基因表达谱之间的相似性:贝叶斯方法。
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2164-10-S3-S14.
4
Transcriptional analysis of the Candida albicans cell cycle.白色念珠菌细胞周期的转录分析
Mol Biol Cell. 2009 Jul;20(14):3363-73. doi: 10.1091/mbc.e09-03-0210. Epub 2009 May 28.
5
Candida parapsilosis, an emerging fungal pathogen.近平滑念珠菌,一种新出现的真菌病原体。
Clin Microbiol Rev. 2008 Oct;21(4):606-25. doi: 10.1128/CMR.00013-08.
6
Role of the RAM network in cell polarity and hyphal morphogenesis in Candida albicans.RAM网络在白色念珠菌细胞极性和菌丝形态发生中的作用。
Mol Biol Cell. 2008 Dec;19(12):5456-77. doi: 10.1091/mbc.e08-03-0272. Epub 2008 Oct 8.
7
Computational approaches to finding and analyzing cis-regulatory elements.寻找和分析顺式调控元件的计算方法。
Methods Cell Biol. 2008;87:337-65. doi: 10.1016/S0091-679X(08)00218-5.
8
The Schizosaccharomyces pombe endo-1,3-beta-glucanase Eng1 contains a novel carbohydrate binding module required for septum localization.粟酒裂殖酵母内切-1,3-β-葡聚糖酶Eng1含有一个隔膜定位所需的新型碳水化合物结合模块。
Mol Microbiol. 2008 Jul;69(1):188-200. doi: 10.1111/j.1365-2958.2008.06275.x. Epub 2008 May 5.
9
Natural history and evolutionary principles of gene duplication in fungi.真菌中基因复制的自然史与进化原理
Nature. 2007 Sep 6;449(7158):54-61. doi: 10.1038/nature06107.
10
Experimental design criteria in phylogenetics: where to add taxa.系统发育学中的实验设计标准:添加分类单元的位置。
Syst Biol. 2007 Aug;56(4):609-22. doi: 10.1080/10635150701499563.