Suppr超能文献

基于自组织映射的人类内源性逆转录病毒序列组的发现与可视化

Self-organizing map-based discovery and visualization of human endogenous retroviral sequence groups.

作者信息

Oja Merja, Sperber Göran O, Blomberg Jonas, Kaski Samuel

机构信息

Department of Computer Science, University of Helsinki, P.O. Box 68, FI-00014 University of Helsinki, Finland.

出版信息

Int J Neural Syst. 2005 Jun;15(3):163-79. doi: 10.1142/S0129065705000177.

Abstract

About 8 per cent of the human genome consists of human endogenous retroviral sequences (HERVs), which are remains from ancient infections. The HERVs may give rise to transcripts or affect the expression of human genes. The first step in understanding HERV function is to classify HERVs into families. In this work we study the relationships of existing HERV families and detect potentially new HERV families. A Median Self-Organizing Map (SOM), a SOM for non-vectorial data, is used to group and visualize a collection of 3661 HERVs. The SOM-based analysis is complemented with estimates of the reliability of the results. A novel trustworthiness visualization method is used to estimate which parts of the SOM visualization are reliable and which not. The reliability of extracted interesting HERV groups is verified by a bootstrap procedure suitable for SOM visualization-based analysis. The SOM detects a group of epsilonretroviral sequences and a group of ERV9, HERVW, and HUERSP3 sequences which suggests that ERV9 and HERVW sequences may have a common origin.

摘要

约8%的人类基因组由人类内源性逆转录病毒序列(HERVs)组成,这些序列是古代感染的遗迹。HERVs可能产生转录本或影响人类基因的表达。理解HERV功能的第一步是将HERVs分类为不同家族。在这项工作中,我们研究了现有HERV家族之间的关系,并检测潜在的新HERV家族。一种中位数自组织映射(SOM),即一种用于非向量数据的SOM,被用于对3661个HERVs的集合进行分组和可视化。基于SOM的分析辅以结果可靠性的估计。一种新颖的可信度可视化方法被用于估计SOM可视化的哪些部分是可靠的,哪些不是。通过适用于基于SOM可视化分析的自助程序,验证了提取的有趣HERV组的可靠性。SOM检测到一组ε逆转录病毒序列以及一组ERV9、HERVW和HUERSP3序列,这表明ERV9和HERVW序列可能有共同的起源。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验