• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RepetDB:转座元件参考的统一资源。

RepetDB: a unified resource for transposable element references.

作者信息

Amselem Joëlle, Cornut Guillaume, Choisne Nathalie, Alaux Michael, Alfama-Depauw Françoise, Jamilloux Véronique, Maumus Florian, Letellier Thomas, Luyten Isabelle, Pommier Cyril, Adam-Blondon Anne-Françoise, Quesneville Hadi

机构信息

URGI, INRA, Université Paris-Saclay, 78026 Versailles, France.

出版信息

Mob DNA. 2019 Jan 29;10:6. doi: 10.1186/s13100-019-0150-y. eCollection 2019.

DOI:10.1186/s13100-019-0150-y
PMID:30719103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6350395/
Abstract

BACKGROUND

Thanks to their ability to move around and replicate within genomes, transposable elements (TEs) are perhaps the most important contributors to genome plasticity and evolution. Their detection and annotation are considered essential in any genome sequencing project. The number of fully sequenced genomes is rapidly increasing with improvements in high-throughput sequencing technologies. A fully automated de novo annotation process for TEs is therefore required to cope with the deluge of sequence data.However, all automated procedures are error-prone, and an automated procedure for TE identification and classification would be no exception. It is therefore crucial to provide not only the TE reference sequences, but also evidence justifying their classification, at the scale of the whole genome. A few TE databases already exist, but none provides evidence to justify TE classification. Moreover, biological information about the sequences remains globally poor.

RESULTS

We present here the RepetDB database developed in the framework of GnpIS, a genetic and genomic information system. RepetDB is designed to store and retrieve detected, classified and annotated TEs in a standardized manner. RepetDB is an implementation with extensions of InterMine, an open-source data warehouse framework used here to store, search, browse, analyze and compare all the data recorded for each TE reference sequence. InterMine can display diverse information for each sequence and allows simple to very complex queries. Finally, TE data are displayed via a worldwide data discovery portal. RepetDB is accessible at urgi.versailles.inra.fr/repetdb.

CONCLUSIONS

RepetDB is designed to be a TE knowledge base populated with full de novo TE annotations of complete (or near-complete) genome sequences. Indeed, the description and classification of TEs facilitates the exploration of specific TE families, superfamilies or orders across a large range of species. It also makes possible cross-species searches and comparisons of TE family content between genomes.

摘要

背景

由于转座元件(TEs)能够在基因组内移动和复制,它们可能是基因组可塑性和进化的最重要贡献者。在任何基因组测序项目中,对它们的检测和注释都被认为是必不可少的。随着高通量测序技术的改进,全基因组测序的数量正在迅速增加。因此,需要一个完全自动化的TEs从头注释流程来应对海量的序列数据。然而,所有自动化程序都容易出错,TEs识别和分类的自动化程序也不例外。因此,不仅要提供TEs参考序列,还要在全基因组范围内提供证明其分类合理性的证据,这一点至关重要。已经存在一些TEs数据库,但没有一个能提供证明TEs分类合理性的证据。此外,关于这些序列的生物学信息总体上仍然很少。

结果

我们在此展示了在GnpIS(一个遗传和基因组信息系统)框架下开发的RepetDB数据库。RepetDB旨在以标准化方式存储和检索检测到的、分类的和注释的TEs。RepetDB是InterMine的一个扩展实现,InterMine是一个开源数据仓库框架,在此用于存储、搜索、浏览、分析和比较为每个TEs参考序列记录的所有数据。InterMine可以为每个序列显示各种信息,并允许进行简单到非常复杂的查询。最后,TEs数据通过一个全球数据发现门户进行展示。可通过urgi.versailles.inra.fr/repetdb访问RepetDB。

结论

RepetDB旨在成为一个TEs知识库,其中填充了完整(或接近完整)基因组序列的全从头TEs注释。实际上,TEs的描述和分类有助于在广泛的物种范围内探索特定的TE家族、超家族或目。它还使跨物种搜索和比较基因组之间的TE家族内容成为可能。

相似文献

1
RepetDB: a unified resource for transposable element references.RepetDB:转座元件参考的统一资源。
Mob DNA. 2019 Jan 29;10:6. doi: 10.1186/s13100-019-0150-y. eCollection 2019.
2
A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.转座元件注释流水线和表达分析揭示微藻新月菱形藻中潜在活跃的元件。
BMC Genomics. 2018 May 22;19(1):378. doi: 10.1186/s12864-018-4763-1.
3
Considering transposable element diversification in de novo annotation approaches.考虑从头注释方法中转座元件的多样化。
PLoS One. 2011 Jan 31;6(1):e16526. doi: 10.1371/journal.pone.0016526.
4
Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies.准确的转座元件注释对于分析新的基因组组装至关重要。
Genome Biol Evol. 2016 Jan 21;8(2):403-10. doi: 10.1093/gbe/evw009.
5
TE-Tracker: systematic identification of transposition events through whole-genome resequencing.TE-Tracker:通过全基因组重测序对转座事件进行系统鉴定。
BMC Bioinformatics. 2014 Nov 19;15(1):377. doi: 10.1186/s12859-014-0377-z.
6
Diversification of Transposable Elements in Arthropods and Its Impact on Genome Evolution.转座元件在节肢动物中的多样化及其对基因组进化的影响。
Genes (Basel). 2019 May 6;10(5):338. doi: 10.3390/genes10050338.
7
Characterization of transposable elements within the Bemisia tabaci species complex.烟粉虱复合种内转座元件的特征分析
Mob DNA. 2022 Apr 19;13(1):12. doi: 10.1186/s13100-022-00270-6.
8
Orthoptera-TElib: a library of Orthoptera transposable elements for TE annotation.直翅目转座元件文库(Orthoptera-TElib):用于转座元件注释的直翅目转座元件文库。
Mob DNA. 2024 Mar 15;15(1):5. doi: 10.1186/s13100-024-00316-x.
9
Identification of transposons near predicted lncRNA and mRNA pools of Prunus mume using an integrative transposable element database constructed from Rosaceae plant genomes.利用蔷薇科植物基因组构建的整合转座子数据库鉴定李属近预测 lncRNA 和 mRNA 库的转座子。
Mol Genet Genomics. 2018 Oct;293(5):1301-1316. doi: 10.1007/s00438-018-1449-y. Epub 2018 May 26.
10
Combined evidence annotation of transposable elements in genome sequences.基因组序列中转座元件的联合证据注释
PLoS Comput Biol. 2005 Jul;1(2):166-75. doi: 10.1371/journal.pcbi.0010022. Epub 2005 Jul 29.

引用本文的文献

1
Resolving a century-old enigma: potato 'Bolters' originate from instability of the StCDF1.3 allele.解开一个百年谜团:马铃薯“早抽薹”现象源于StCDF1.3等位基因的不稳定性。
Theor Appl Genet. 2025 Sep 9;138(10):245. doi: 10.1007/s00122-025-05030-7.
2
Horizontal transmission of functionally diverse transposons is a major source of new introns.功能多样的转座子的水平转移是新内含子的主要来源。
Proc Natl Acad Sci U S A. 2025 May 27;122(21):e2414761122. doi: 10.1073/pnas.2414761122. Epub 2025 May 22.
3
A Near Complete Genome Assembly of the Oshima Cherry Cerasus speciosa.

本文引用的文献

1
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.炭疽菌的无缝基因组组装揭示了染色体结构以及转座元件与次生代谢物基因簇的关联。
BMC Genomics. 2017 Aug 29;18(1):667. doi: 10.1186/s12864-017-4083-x.
2
transPLANT Resources for Triticeae Genomic Data.禾本科基因组数据的 transPLANT 资源。
Plant Genome. 2016 Mar;9(1). doi: 10.3835/plantgenome2015.06.0038.
3
Coming of age: ten years of next-generation sequencing technologies.
大岛樱(Cerasus speciosa)的近乎完整基因组组装
Sci Data. 2025 Feb 4;12(1):162. doi: 10.1038/s41597-025-04388-z.
4
MCHelper automatically curates transposable element libraries across eukaryotic species.MCHelper可自动整理真核生物物种中的转座因子文库。
Genome Res. 2024 Dec 23;34(12):2256-2268. doi: 10.1101/gr.278821.123.
5
Identification of transposable element families from pangenome polymorphisms.从泛基因组多态性中鉴定转座元件家族。
Mob DNA. 2024 Jun 26;15(1):13. doi: 10.1186/s13100-024-00323-y.
6
Comparative Evolutionary Genomics in Insects.昆虫比较进化基因组学。
Methods Mol Biol. 2024;2802:473-514. doi: 10.1007/978-1-0716-3838-5_16.
7
Genome comparisons reveal accessory genes crucial for the evolution of apple Glomerella leaf spot pathogenicity in Colletotrichum fungi.基因组比较揭示了对炭疽菌属真菌中苹果炭疽叶枯病致病性进化至关重要的辅助基因。
Mol Plant Pathol. 2024 Apr;25(4):e13454. doi: 10.1111/mpp.13454.
8
Identification of two mutant JASON-RELATED genes associated with unreduced pollen production in potato.鉴定与马铃薯未减数花粉产生相关的两个突变的JASON相关基因。
Theor Appl Genet. 2024 Mar 12;137(4):79. doi: 10.1007/s00122-024-04563-7.
9
Crossover shortage in potato is caused by StMSH4 mutant alleles and leads to either highly uniform unreduced pollen or sterility.马铃薯的杂交不足是由 StMSH4 突变等位基因引起的,导致高度均匀的未减数花粉或不育。
Genetics. 2024 Jan 3;226(1). doi: 10.1093/genetics/iyad194.
10
Conserved structured domains in plant non-coding RNA enod40, their evolution and recruitment of sequences from transposable elements.植物非编码RNA enod40中的保守结构域、其进化以及来自转座元件的序列招募
NAR Genom Bioinform. 2023 Oct 16;5(4):lqad091. doi: 10.1093/nargab/lqad091. eCollection 2023 Dec.
成年:下一代测序技术的十年
Nat Rev Genet. 2016 May 17;17(6):333-51. doi: 10.1038/nrg.2016.49.
4
JBrowse: a dynamic web platform for genome visualization and analysis.JBrowse:一个用于基因组可视化和分析的动态网络平台。
Genome Biol. 2016 Apr 12;17:66. doi: 10.1186/s13059-016-0924-1.
5
The FAIR Guiding Principles for scientific data management and stewardship.科学数据管理和保存的 FAIR 指导原则。
Sci Data. 2016 Mar 15;3:160018. doi: 10.1038/sdata.2016.18.
6
The Dfam database of repetitive DNA families.重复DNA家族的Dfam数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D81-9. doi: 10.1093/nar/gkv1272. Epub 2015 Nov 26.
7
Repbase Update, a database of repetitive elements in eukaryotic genomes.Repbase Update,一个真核生物基因组中重复元件的数据库。
Mob DNA. 2015 Jun 2;6:11. doi: 10.1186/s13100-015-0041-9. eCollection 2015.
8
Whole genome comparative analysis of transposable elements provides new insight into mechanisms of their inactivation in fungal genomes.转座元件的全基因组比较分析为真菌基因组中转座元件失活机制提供了新见解。
BMC Genomics. 2015 Feb 28;16(1):141. doi: 10.1186/s12864-015-1347-1.
9
PASTEC: an automatic transposable element classification tool.PASTEC:一种自动转座元件分类工具。
PLoS One. 2014 May 2;9(5):e91929. doi: 10.1371/journal.pone.0091929. eCollection 2014.
10
InterMine: extensive web services for modern biology.InterMine:现代生物学的广泛网络服务。
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W468-72. doi: 10.1093/nar/gku301. Epub 2014 Apr 21.