Suppr超能文献

基于小波域的基因组相似度评估。

Evaluation of genome similarities using a wavelet-domain approach.

机构信息

Programa de Pós-Graduação Stricto Sensu em Estatística e Experimentação Agropecuária, Universidade Federal de Lavras, Lavras, MG, Brasil.

Departamento de Estatística, Universidade Federal de Lavras, Lavras, MG, Brasil.

出版信息

Rev Soc Bras Med Trop. 2020;53:e20190470. doi: 10.1590/0037-8682-0470-2019. Epub 2020 May 18.

Abstract

INTRODUCTION

Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent.

METHODS

The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent.

RESULTS

The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains.

CONCLUSION

The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis.

摘要

简介

结核病是全球十大死因之一。导致这种疾病的耐药菌株被认为是造成公共卫生紧急情况和健康安全威胁的原因。世界卫生组织(WHO)表示,截至目前,全球估计有 55.8 万例不同的耐利福平(最有效的一线药物)病例。因此,为了使用结核分枝杆菌(MTB)的基因组来检测耐药菌株,我们提出了一种新的基因组相似性分析方法,该方法将基因组分解的不同层次(离散非抽取小波变换)和赫斯特指数联系起来。

方法

通过评估 GC 含量获得了十个分析序列的相应信号,然后使用离散非抽取小波变换和具有四个零矩的 Daubechies 小波在五个分解水平上对这些信号进行分解。在每个分解水平上使用五种不同的方法计算赫斯特指数。使用赫斯特指数的结果进行聚类分析。

结果

聚合方差、差分聚合方差和聚合绝对值方法形成了三个组,而 Peng 和 R/S 方法形成了两个组。聚合方差方法在相似菌株之间的分组形成方面表现出最佳的结果。

结论

与离散非抽取小波变换相关的赫斯特指数的评估可作为基因组序列之间相似性的度量,从而改进分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/24a7/7269520/18b695ac65dd/1678-9849-rsbmt-53-e20190470-gf1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验