Ritsch Muriel, Brait Nadja, Harvey Erin, Marz Manja, Lequime Sebastian
RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, Leutragraben 1, Jena 07743, Germany.
European Virus Bioinformatics Center, Leutragraben 1, Jena 07743, Germany.
Virus Evol. 2024 Nov 23;10(1):veae099. doi: 10.1093/ve/veae099. eCollection 2024.
Endogenous viral elements (EVEs) are remnants of viral genetic material endogenized into the host genome. They have, in the last decades, attracted attention for their role as potential contributors to pathogenesis, drivers of selective advantage for the host, and genomic remnants of ancient viruses. EVEs have a nuanced and complex influence on both host health and evolution, and can offer insights on the deep evolutionary history of viruses. As an emerging field of research, several factors limit a comprehensive understanding of EVEs: they are currently underestimated and periodically overlooked in studies of the host genome, transcriptome, and virome. The absence of standardized guidelines for ensuring EVE-related data availability and accessibility following the FAIR ('findable, accessible, interoperable, and reusable') principles obstructs our ability to gather and connect information. Here, we discuss challenges to the availability and accessibility of EVE-related data and propose potential solutions. We identified the biological and research focus imbalance between different types of EVEs, and their overall biological complexity as genomic loci with viral ancestry, as potential challenges that can be addressed with the development of a user-oriented identification tool. In addition, reports of EVE identification are scattered between different subfields under different keywords, and EVE sequences and associated data are not properly gathered in databases. While developing an open and dedicated database might be ideal, targeted improvements of generalist databases might provide a pragmatic solution to EVE data and metadata accessibility. The implementation of these solutions, as well as the collective effort by the EVE scientific community in discussing and setting guidelines, is now drastically needed to lead the development of EVE research and offer insights into host-virus interactions and their evolutionary history.
内源性病毒元件(EVEs)是内源性整合到宿主基因组中的病毒遗传物质残余。在过去几十年中,它们因其作为发病机制的潜在促成因素、宿主选择优势的驱动因素以及古代病毒的基因组残余物的作用而受到关注。EVEs对宿主健康和进化具有细微而复杂的影响,并能为病毒的深层进化史提供见解。作为一个新兴的研究领域,有几个因素限制了对EVEs的全面理解:在宿主基因组、转录组和病毒组研究中,它们目前被低估且经常被忽视。缺乏遵循FAIR(“可查找、可访问、可互操作和可重用”)原则确保与EVE相关的数据可用性和可访问性的标准化指南,阻碍了我们收集和连接信息的能力。在这里,我们讨论了与EVE相关数据的可用性和可访问性面临的挑战,并提出了潜在的解决方案。我们确定了不同类型的EVEs之间生物学和研究重点的不平衡,以及它们作为具有病毒祖先的基因组位点的整体生物学复杂性,这是可以通过开发面向用户的识别工具来解决的潜在挑战。此外,EVE识别报告分散在不同关键词下的不同子领域之间,并且EVE序列和相关数据没有在数据库中得到妥善收集。虽然开发一个开放且专门的数据库可能是理想的,但对通用数据库进行有针对性的改进可能为EVE数据和元数据的可访问性提供一个务实的解决方案。现在迫切需要实施这些解决方案,以及EVE科学界在讨论和制定指南方面的集体努力,以引领EVE研究的发展,并深入了解宿主-病毒相互作用及其进化史。