• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对于各种计算机生成的模型系统,一种不需要序列比对的差异度量的平均值是需要序列比对的传统错配计数平均值的两倍。

Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

作者信息

Blaisdell B E

机构信息

Department of Mathematics, Stanford University, CA 94305.

出版信息

J Mol Evol. 1991 Jun;32(6):521-8. doi: 10.1007/BF02102654.

DOI:10.1007/BF02102654
PMID:1908023
Abstract

A measure of sequence similarity, dt, not requiring prior sequence alignment gave correct results for a variety of computer-generated model sequences without and with gaps for all degrees of substitution, s. Measure d was the squared Euclidean distance between vectors of counts of t-tuplets of characters in the two sequences. In models without gaps and without Needleman-Wunsch alignment, average d was very closely equal to twice average conventional mismatch counts, m. In these models one of each of the conditions on the Jukes-Cantor model was violated in turn: (1) both descendant lineages receive the same number of substitutions, (2) all sites are equally likely to be substituted, (3) all different replacement characters are equally likely to be chosen, and (4) all original characters are equally likely to be substituted. In Jukes-Cantor models with gaps Needleman-Wunsch alignment was necessarily performed, a procedure that generally produced incorrect values of m. For these models average d was found to be very closely equal to twice the average m estimated from the known value of s using the inverted Jukes-Cantor formula.

摘要

一种序列相似性度量dt,无需事先进行序列比对,对于各种计算机生成的模型序列,无论有无空位,在所有替换程度s下都能给出正确结果。度量d是两个序列中字符t联体计数向量之间的欧几里得距离平方。在没有空位且没有Needleman-Wunsch比对的模型中,平均d非常接近于平均传统错配计数m的两倍。在这些模型中,Jukes-Cantor模型的每个条件依次被违反:(1) 两个后代谱系接受相同数量的替换;(2) 所有位点被替换的可能性相同;(3) 所有不同的替换字符被选择的可能性相同;(4) 所有原始字符被替换的可能性相同。在有空位的Jukes-Cantor模型中,必须进行Needleman-Wunsch比对,该过程通常会产生错误的m值。对于这些模型,发现平均d非常接近于使用倒置的Jukes-Cantor公式从已知的s值估计的平均m的两倍。

相似文献

1
Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.对于各种计算机生成的模型系统,一种不需要序列比对的差异度量的平均值是需要序列比对的传统错配计数平均值的两倍。
J Mol Evol. 1991 Jun;32(6):521-8. doi: 10.1007/BF02102654.
2
Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.
J Mol Evol. 1989 Dec;29(6):538-47. doi: 10.1007/BF02602925.
3
Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.
J Mol Evol. 1989 Dec;29(6):526-37. doi: 10.1007/BF02602924.
4
Comparative biosequence metrics.比较生物序列度量
J Mol Evol. 1981;18(1):38-46. doi: 10.1007/BF01733210.
5
Homology assessment and molecular sequence alignment.同源性评估与分子序列比对。
J Biomed Inform. 2006 Feb;39(1):18-33. doi: 10.1016/j.jbi.2005.11.005. Epub 2005 Dec 9.
6
A probabilistic measure for alignment-free sequence comparison.一种用于无比对序列比较的概率测度。
Bioinformatics. 2004 Dec 12;20(18):3455-61. doi: 10.1093/bioinformatics/bth426. Epub 2004 Jul 22.
7
A method for multiple sequence alignment with gaps.一种带空位的多序列比对方法。
J Mol Biol. 1989 Oct 20;209(4):539-48. doi: 10.1016/0022-2836(89)90592-5.
8
Theoretical foundation to estimate the relative efficiencies of the Jukes-Cantor+gamma model and the Jukes-Cantor model in obtaining the correct phylogenetic tree.用于估计Jukes-Cantor+伽马模型和Jukes-Cantor模型在获得正确系统发育树方面相对效率的理论基础。
Gene. 2006 Dec 30;385:103-10. doi: 10.1016/j.gene.2006.03.027. Epub 2006 Aug 11.
9
Statistical measures of DNA sequence dissimilarity under Markov chain models of base composition.基于碱基组成马尔可夫链模型的DNA序列差异的统计度量。
Biometrics. 2001 Jun;57(2):441-8. doi: 10.1111/j.0006-341x.2001.00441.x.
10
A method for detecting distant evolutionary relationships between protein or nucleic acid sequences in the presence of deletions or insertions.一种在存在缺失或插入的情况下检测蛋白质或核酸序列之间远距离进化关系的方法。
J Mol Evol. 1978 Jun 20;11(2):143-61. doi: 10.1007/BF01733890.

引用本文的文献

1
Alignment-free method for DNA sequence clustering using Fuzzy integral similarity.基于模糊积分相似度的无比对 DNA 序列聚类方法。
Sci Rep. 2019 Mar 6;9(1):3753. doi: 10.1038/s41598-019-40452-6.
2
Similar cases retrieval from the database of laboratory test results.
J Med Syst. 2003 Jun;27(3):271-82. doi: 10.1023/a:1022527528856.
3
Protein sequence randomness and sequence/structure correlations.蛋白质序列随机性与序列/结构相关性。
Biophys J. 1995 Apr;68(4):1531-9. doi: 10.1016/S0006-3495(95)80325-5.

本文引用的文献

1
A general method applicable to the search for similarities in the amino acid sequence of two proteins.一种适用于寻找两种蛋白质氨基酸序列相似性的通用方法。
J Mol Biol. 1970 Mar;48(3):443-53. doi: 10.1016/0022-2836(70)90057-4.
2
Sequence analysis of a cDNA clone encoding the liver cell adhesion molecule, L-CAM.编码肝细胞粘附分子L-CAM的cDNA克隆的序列分析。
Proc Natl Acad Sci U S A. 1987 May;84(9):2808-12. doi: 10.1073/pnas.84.9.2808.
3
Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.
J Mol Evol. 1989 Dec;29(6):538-47. doi: 10.1007/BF02602925.
4
Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.
J Mol Evol. 1989 Dec;29(6):526-37. doi: 10.1007/BF02602924.