• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

草履虫基因组学的改进方法与资源:转录单元、基因注释与基因表达

Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

作者信息

Arnaiz Olivier, Van Dijk Erwin, Bétermier Mireille, Lhuillier-Akakpo Maoussi, de Vanssay Augustin, Duharcourt Sandra, Sallet Erika, Gouzy Jérôme, Sperling Linda

机构信息

Institute for Integrative Biology of the Cell (I2BC), CNRS, CEA, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette CEDEX, France.

Institut Jacques Monod, CNRS, UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, F-75205, Paris, France.

出版信息

BMC Genomics. 2017 Jun 26;18(1):483. doi: 10.1186/s12864-017-3887-z.

DOI:10.1186/s12864-017-3887-z
PMID:28651633
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5485702/
Abstract

BACKGROUND

The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage.

RESULTS

We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource.

CONCLUSIONS

We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB ( http://paramecium.i2bc.paris-saclay.fr ). TrUC software is freely distributed under a GNU GPL v3 licence ( https://github.com/oarnaiz/TrUC ).

摘要

背景

草履虫隐秘种复合体的15个姊妹种是在数千万年前发生的一次全基因组复制之后出现的。鉴于上个世纪积累的关于草履虫遗传学和表观遗传学的广泛知识,这个物种复合体为研究单细胞真核生物全基因组复制的后果以及驱动物种形成的遗传和表观遗传机制提供了一个独特而强大的系统。高质量的草履虫基因模型对于使用该系统的研究很重要。本文报道的工作的主要目的是为草履虫谱系构建一个改进的基因注释流程。

结果

我们在模式物种四膜虫的自体受精有性过程中生成了定向RNA-Seq转录组数据。我们首次在纤毛虫中使用改进的Cap-Seq协议确定了候选的四膜虫转录起始位点。我们开发了TrUC,这是一个多线程的Perl软件,它与将RNA-Seq数据映射到参考基因组的TopHat相结合,为注释流程预测转录单元。我们使用EuGene软件来整合注释证据。为四膜虫获得的高质量基因结构注释被用作改进其他3种草履虫物种已发表注释的证据。RNA-Seq数据还用于差异基因表达分析,提供了一个比先前建立的微阵列资源更敏感的基因表达图谱。

结论

我们开发了一个针对草履虫物种紧凑基因组和微小内含子量身定制的基因注释流程。这个流程的一个新组件TrUC使用Cap-Seq和定向RNA-Seq数据预测转录单元。TrUC可能在草履虫之外也有用,特别是在基因密度高的情况下。对3'和5'非翻译区的准确预测对于基因表达研究(如核小体定位、顺式调控基序的鉴定)将特别有价值。通过草履虫数据库(http://paramecium.i2bc.paris-saclay.fr)可获得改进的四膜虫转录组资源、四膜虫、双小核草履虫、六小核草履虫和尾草履虫的基因注释以及经过草履虫训练的EuGene配置。TrUC软件根据GNU GPL v3许可免费分发(https://github.com/oarnaiz/TrUC)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/9795ab0e1934/12864_2017_3887_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/441480a29b6f/12864_2017_3887_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/545e36437065/12864_2017_3887_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/9795ab0e1934/12864_2017_3887_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/441480a29b6f/12864_2017_3887_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/545e36437065/12864_2017_3887_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d3d/5485702/9795ab0e1934/12864_2017_3887_Fig3_HTML.jpg

相似文献

1
Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.草履虫基因组学的改进方法与资源:转录单元、基因注释与基因表达
BMC Genomics. 2017 Jun 26;18(1):483. doi: 10.1186/s12864-017-3887-z.
2
ParameciumDB 2019: integrating genomic data across the genus for functional and evolutionary biology.ParameciumDB 2019:整合属内基因组数据以用于功能和进化生物学。
Nucleic Acids Res. 2020 Jan 8;48(D1):D599-D605. doi: 10.1093/nar/gkz948.
3
CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts.CodingQuarry:利用RNA测序转录本对真菌基因组进行高精度隐马尔可夫模型基因预测。
BMC Genomics. 2015 Mar 11;16(1):170. doi: 10.1186/s12864-015-1344-4.
4
Population Genomics of Paramecium Species.种群体基因组学研究。
Mol Biol Evol. 2017 May 1;34(5):1194-1216. doi: 10.1093/molbev/msx074.
5
EuGene-PP: a next-generation automated annotation pipeline for prokaryotic genomes.EuGene-PP:用于原核基因组的下一代自动化注释管道。
Bioinformatics. 2014 Sep 15;30(18):2659-61. doi: 10.1093/bioinformatics/btu366. Epub 2014 May 30.
6
Gene expression in a paleopolyploid: a transcriptome resource for the ciliate Paramecium tetraurelia.古多倍体中的基因表达:纤毛虫四膜虫的转录组资源。
BMC Genomics. 2010 Oct 8;11:547. doi: 10.1186/1471-2164-11-547.
7
A robust (re-)annotation approach to generate unbiased mapping references for RNA-seq-based analyses of differential expression across closely related species.一种强大的(重新)注释方法,用于为基于RNA测序的密切相关物种间差异表达分析生成无偏映射参考。
BMC Genomics. 2016 May 24;17:392. doi: 10.1186/s12864-016-2646-x.
8
ParameciumDB in 2011: new tools and new data for functional and comparative genomics of the model ciliate Paramecium tetraurelia.2011年的草履虫数据库:用于模式纤毛虫四膜虫功能和比较基因组学的新工具与新数据。
Nucleic Acids Res. 2011 Jan;39(Database issue):D632-6. doi: 10.1093/nar/gkq918. Epub 2010 Oct 14.
9
Insights into three whole-genome duplications gleaned from the Paramecium caudatum genome sequence.从尾草履虫基因组序列中获得的关于三次全基因组复制的见解。
Genetics. 2014 Aug;197(4):1417-28. doi: 10.1534/genetics.114.163287. Epub 2014 May 19.
10
The mitochondrial genome sequence of the ciliate Paramecium caudatum reveals a shift in nucleotide composition and codon usage within the genus Paramecium.纤毛虫 Paramecium caudatum 的线粒体基因组序列揭示了 Paramecium 属内核苷酸组成和密码子使用的转变。
BMC Genomics. 2011 May 31;12:272. doi: 10.1186/1471-2164-12-272.

引用本文的文献

1
Fidelity, specialization, and evolution of Paramecium PolX DNA polymerases involved in programmed double-strand break DNA repair.参与程序性双链断裂DNA修复的草履虫PolX DNA聚合酶的保真度、特异性及进化
Nucleic Acids Res. 2025 Aug 11;53(15). doi: 10.1093/nar/gkaf786.
2
The linker region of a development-specific DNA polymerase X ensures efficient repair of programmed DNA double-strand breaks in Parameciumtetraurelia.一种发育特异性DNA聚合酶X的连接区确保了草履虫中程序性DNA双链断裂的有效修复。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf286.
3
Comprehensive genome annotation of the model ciliate Tetrahymena thermophila by in-depth epigenetic and transcriptomic profiling.

本文引用的文献

1
Transcriptomics technologies.转录组学技术
PLoS Comput Biol. 2017 May 18;13(5):e1005457. doi: 10.1371/journal.pcbi.1005457. eCollection 2017 May.
2
The Macronuclear Genome of Stentor coeruleus Reveals Tiny Introns in a Giant Cell.《Stentor coeruleus 的巨核基因组揭示了巨型细胞中的微小内含子》
Curr Biol. 2017 Feb 20;27(4):569-575. doi: 10.1016/j.cub.2016.12.057. Epub 2017 Feb 9.
3
A meiosis-specific Spt5 homolog involved in non-coding transcription.一种参与非编码转录的减数分裂特异性Spt5同源物。
通过深入的表观遗传学和转录组分析对模式纤毛虫嗜热四膜虫进行全面的基因组注释。
Nucleic Acids Res. 2025 Jan 11;53(2). doi: 10.1093/nar/gkae1177.
4
Heterochromatin-dependent transcription links the PRC2 complex to small RNA-mediated DNA elimination.异染色质依赖性转录将PRC2复合物与小RNA介导的DNA消除联系起来。
EMBO Rep. 2025 Jan;26(1):273-296. doi: 10.1038/s44319-024-00332-1. Epub 2024 Nov 29.
5
The PIWI-interacting protein Gtsf1 controls the selective degradation of small RNAs in Paramecium.PIWI相互作用蛋白Gtsf1控制草履虫中小RNA的选择性降解。
Nucleic Acids Res. 2025 Jan 7;53(1). doi: 10.1093/nar/gkae1055.
6
Two paralogous PHD finger proteins participate in natural genome editing in Paramecium tetraurelia.两个旁系同源的 PHD 手指蛋白参与了 Paramecium tetraurelia 的自然基因组编辑。
J Cell Sci. 2024 Aug 15;137(16). doi: 10.1242/jcs.261979. Epub 2024 Aug 30.
7
Structural maintenance of chromosomes (SMC) proteins are required for DNA elimination in .结构维持染色体 (SMC) 蛋白对于. 中的 DNA 消除是必需的。
Life Sci Alliance. 2023 Dec 6;7(2). doi: 10.26508/lsa.202302281. Print 2024 Feb.
8
Genetics, Genomics, and Evolution.遗传学、基因组学与进化。
Annu Rev Genet. 2023 Nov 27;57:391-410. doi: 10.1146/annurev-genet-071819-104035.
9
Programmed chromosome fragmentation in ciliated protozoa: multiple means to chromosome ends.纤毛原生动物中的程序性染色体碎裂:到达染色体末端的多种方式。
Microbiol Mol Biol Rev. 2023 Dec 20;87(4):e0018422. doi: 10.1128/mmbr.00184-22. Epub 2023 Nov 27.
10
Inter-generational nuclear crosstalk links the control of gene expression to programmed genome rearrangement during the Paramecium sexual cycle.代际核串扰将基因表达的控制与草履虫有性周期中的程序性基因组重排联系起来。
Nucleic Acids Res. 2023 Dec 11;51(22):12337-12351. doi: 10.1093/nar/gkad1006.
Nucleic Acids Res. 2017 May 5;45(8):4722-4732. doi: 10.1093/nar/gkw1318.
4
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements.依赖于TFIIS的非编码转录调控发育过程中的基因组重排。
PLoS Genet. 2015 Jul 15;11(7):e1005383. doi: 10.1371/journal.pgen.1005383. eCollection 2015 Jul.
5
Programmed Rearrangement in Ciliates: Paramecium.纤毛类生物中的程序性重排:草履虫。
Microbiol Spectr. 2014 Dec;2(6). doi: 10.1128/microbiolspec.MDNA3-0035-2014.
6
Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.剂量亚功能化导致重复基因的维持与丢失
Mol Biol Evol. 2015 Aug;32(8):2141-8. doi: 10.1093/molbev/msv095. Epub 2015 Apr 22.
7
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.使用DESeq2对RNA测序数据的倍数变化和离散度进行适度估计。
Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8.
8
Ku-mediated coupling of DNA cleavage and repair during programmed genome rearrangements in the ciliate Paramecium tetraurelia.在草履虫四膜虫程序性基因组重排过程中,Ku介导的DNA切割与修复的偶联。
PLoS Genet. 2014 Aug 28;10(8):e1004552. doi: 10.1371/journal.pgen.1004552. eCollection 2014 Aug.
9
Differential retention and divergent resolution of duplicate genes following whole-genome duplication.全基因组复制后重复基因的差异保留和趋异分化
Genome Res. 2014 Oct;24(10):1665-75. doi: 10.1101/gr.173740.114. Epub 2014 Aug 1.
10
A forward genetic screen reveals essential and non-essential RNAi factors in Paramecium tetraurelia.一项正向遗传学筛选揭示了四膜虫中必需和非必需的RNA干扰因子。
Nucleic Acids Res. 2014 Jun;42(11):7268-80. doi: 10.1093/nar/gku223. Epub 2014 May 23.