• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于信息的相似性指数进行基因组分类:在严重急性呼吸综合征冠状病毒中的应用。

Genomic classification using an information-based similarity index: application to the SARS coronavirus.

作者信息

Yang Albert C-C, Goldberger Ary L, Peng C-K

机构信息

Cardiovascular Division and Margret and H.A. Rey Institute for Nonlinear Dynamics in Medicine, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, Massachusetts 02215, USA.

出版信息

J Comput Biol. 2005 Oct;12(8):1103-16. doi: 10.1089/cmb.2005.12.1103.

DOI:10.1089/cmb.2005.12.1103
PMID:16241900
Abstract

Measures of genetic distance based on alignment methods are confined to studying sequences that are conserved and identifiable in all organisms under study. A number of alignment-free techniques based on either statistical linguistics or information theory have been developed to overcome the limitations of alignment methods. We present a novel alignment-free approach to measuring the similarity among genetic sequences that incorporates elements from both word rank order-frequency statistics and information theory. We first validate this method on the human influenza A viral genomes as well as on the human mitochondrial DNA database. We then apply the method to study the origin of the SARS coronavirus. We find that the majority of the SARS genome is most closely related to group 1 coronaviruses, with smaller regions of matches to sequences from groups 2 and 3. The information based similarity index provides a new tool to measure the similarity between datasets based on their information content and may have a wide range of applications in the large-scale analysis of genomic databases.

摘要

基于比对方法的遗传距离测量方法仅限于研究在所研究的所有生物体中保守且可识别的序列。已经开发了许多基于统计语言学或信息论的无比对技术来克服比对方法的局限性。我们提出了一种新颖的无比对方法来测量遗传序列之间的相似性,该方法融合了词序频率统计和信息论的元素。我们首先在人类甲型流感病毒基因组以及人类线粒体DNA数据库上验证了该方法。然后我们应用该方法研究严重急性呼吸综合征冠状病毒的起源。我们发现,严重急性呼吸综合征基因组的大部分与第1组冠状病毒关系最为密切,与第2组和第3组序列匹配的区域较小。基于信息的相似性指数提供了一种基于数据集的信息内容来测量数据集之间相似性的新工具,并且可能在基因组数据库的大规模分析中有广泛的应用。

相似文献

1
Genomic classification using an information-based similarity index: application to the SARS coronavirus.使用基于信息的相似性指数进行基因组分类:在严重急性呼吸综合征冠状病毒中的应用。
J Comput Biol. 2005 Oct;12(8):1103-16. doi: 10.1089/cmb.2005.12.1103.
2
Phylogenetic analysis of the full-length SARS-CoV sequences: evidence for phylogenetic discordance in three genomic regions.严重急性呼吸综合征冠状病毒全长序列的系统发育分析:三个基因组区域中系统发育不一致的证据
J Med Virol. 2004 Nov;74(3):369-72. doi: 10.1002/jmv.20187.
3
Severe Acute Respiratory Syndrome (SARS) Coronavirus ORF8 Protein Is Acquired from SARS-Related Coronavirus from Greater Horseshoe Bats through Recombination.严重急性呼吸综合征(SARS)冠状病毒的ORF8蛋白是通过重组从中华菊头蝠的SARS相关冠状病毒中获得的。
J Virol. 2015 Oct;89(20):10532-47. doi: 10.1128/JVI.01048-15. Epub 2015 Aug 12.
4
Comprehensive comparative genomic and microsatellite analysis of SARS, MERS, BAT-SARS, and COVID-19 coronaviruses.全面比较 SARS、MERS、BAT-SARS 和 COVID-19 冠状病毒的基因组和微卫星分析。
J Med Virol. 2021 Jul;93(7):4382-4391. doi: 10.1002/jmv.26974. Epub 2021 Apr 8.
5
The phylogeny of SARS coronavirus.严重急性呼吸综合征冠状病毒的系统发育
Arch Virol. 2004 Mar;149(3):621-4. doi: 10.1007/s00705-003-0244-0. Epub 2004 Jan 5.
6
Molecular evolution and multilocus sequence typing of 145 strains of SARS-CoV.145株严重急性呼吸综合征冠状病毒的分子进化与多位点序列分型
FEBS Lett. 2005 Sep 12;579(22):4928-36. doi: 10.1016/j.febslet.2005.07.075.
7
Understanding SARS with Wolfram approach.用Wolfram方法理解严重急性呼吸综合征。
Acta Biochim Biophys Sin (Shanghai). 2004 Jan;36(1):1-10. doi: 10.1093/abbs/36.1.1.
8
Evidence from the evolutionary analysis of nucleotide sequences for a recombinant history of SARS-CoV.来自SARS-CoV重组历史的核苷酸序列进化分析的证据。
Infect Genet Evol. 2004 Mar;4(1):15-9. doi: 10.1016/j.meegid.2003.10.001.
9
The complete genome sequence of severe acute respiratory syndrome coronavirus strain HKU-39849 (HK-39).严重急性呼吸综合征冠状病毒株HKU - 39849(HK - 39)的全基因组序列。
Exp Biol Med (Maywood). 2003 Jul;228(7):866-73. doi: 10.1177/15353702-0322807-13.
10
A new method to cluster DNA sequences using Fourier power spectrum.一种使用傅里叶功率谱对DNA序列进行聚类的新方法。
J Theor Biol. 2015 May 7;372:135-45. doi: 10.1016/j.jtbi.2015.02.026. Epub 2015 Mar 5.

引用本文的文献

1
Beyond Frequency Bands: Complementary-Ensemble-Empirical-Mode-Decomposition-Enhanced Microstate Sequence Non-Randomness Analysis for Aiding Diagnosis and Cognitive Prediction of Dementia.超越频段:互补总体经验模态分解增强的微状态序列非随机性分析辅助痴呆诊断与认知预测
Brain Sci. 2024 May 11;14(5):487. doi: 10.3390/brainsci14050487.
2
Exploring morphological similarity and randomness in Alzheimer's disease using adjacent grey matter voxel-based structural analysis.使用基于相邻灰质体素的结构分析探索阿尔茨海默病中的形态相似性和随机性。
Alzheimers Res Ther. 2024 Apr 23;16(1):88. doi: 10.1186/s13195-024-01448-1.
3
Choice of Metric Divergence in Genome Sequence Comparison.
基因组序列比较中的度量散度选择。
Protein J. 2024 Apr;43(2):259-273. doi: 10.1007/s10930-024-10189-x. Epub 2024 Mar 16.
4
Research on Cross-Contrast Neural Network Based Intelligent Painting: Taking Oil Painting Language Classification as an Example.基于交叉对比神经网络的智能绘画研究:以油画语言分类为例。
Comput Intell Neurosci. 2022 Jun 6;2022:7827587. doi: 10.1155/2022/7827587. eCollection 2022.
5
The Relationship between Postural Stability and Lower-Limb Muscle Activity Using an Entropy-Based Similarity Index.使用基于熵的相似性指数研究姿势稳定性与下肢肌肉活动之间的关系。
Entropy (Basel). 2018 Apr 26;20(5):320. doi: 10.3390/e20050320.
6
ACE2 enhance viral infection or viral infection aggravate the underlying diseases.血管紧张素转换酶2增强病毒感染或病毒感染加重基础疾病。
Comput Struct Biotechnol J. 2020 Aug 6;18:2100-2106. doi: 10.1016/j.csbj.2020.08.002. eCollection 2020.
7
Weighted multifractal cross-correlation analysis based on Shannon entropy.基于香农熵的加权多重分形交叉相关性分析
Commun Nonlinear Sci Numer Simul. 2016 Jan;30(1):268-283. doi: 10.1016/j.cnsns.2015.06.029. Epub 2015 Jul 3.
8
Financial time series analysis based on information categorization method.基于信息分类方法的金融时间序列分析
Physica A. 2014 Dec 15;416:183-191. doi: 10.1016/j.physa.2014.08.055. Epub 2014 Aug 30.
9
The similarity analysis of financial stocks based on information clustering.基于信息聚类的金融股相似性分析
Nonlinear Dyn. 2016;85(4):2635-2652. doi: 10.1007/s11071-016-2851-9. Epub 2016 May 26.
10
Finding and identifying the viral needle in the metagenomic haystack: trends and challenges.在宏基因组的干草堆中寻找并识别病毒刺突蛋白:趋势与挑战
Front Microbiol. 2015 Jan 7;5:739. doi: 10.3389/fmicb.2014.00739. eCollection 2014.