• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A unified statistical framework for sequence comparison and structure comparison.用于序列比较和结构比较的统一统计框架。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):5913-20. doi: 10.1073/pnas.95.11.5913.
2
Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.评估基因组学中的注释转移:通过传统分数和概率分数量化蛋白质序列、结构与功能之间的关系。
J Mol Biol. 2000 Mar 17;297(1):233-49. doi: 10.1006/jmbi.2000.3550.
3
Effective protein sequence comparison.有效的蛋白质序列比较。
Methods Enzymol. 1996;266:227-58. doi: 10.1016/s0076-6879(96)66017-0.
4
Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.利用可靠的结构鉴定远距离进化关系评估序列比较方法。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):6073-8. doi: 10.1073/pnas.95.11.6073.
5
Use of a database of structural alignments and phylogenetic trees in investigating the relationship between sequence and structural variability among homologous proteins.利用结构比对和系统发育树数据库研究同源蛋白质序列与结构变异性之间的关系。
Protein Eng. 2001 Apr;14(4):219-26. doi: 10.1093/protein/14.4.219.
6
A model for statistical significance of local similarities in structure.一种用于结构局部相似性统计显著性的模型。
J Mol Biol. 2003 Mar 7;326(5):1307-16. doi: 10.1016/s0022-2836(03)00045-7.
7
Making sense of score statistics for sequence alignments.理解序列比对的得分统计。
Brief Bioinform. 2001 Mar;2(1):51-67. doi: 10.1093/bib/2.1.51.
8
An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance.一种用于蛋白质序列和结构分析与建模的综合方法。I. 蛋白质结构比对及蛋白质结构距离的定量度量。
J Mol Biol. 2000 Aug 18;301(3):665-78. doi: 10.1006/jmbi.2000.3973.
9
Protein structure comparison using the markov transition model of evolution.使用马尔可夫进化转移模型进行蛋白质结构比较。
Proteins. 2000 Oct 1;41(1):108-22.
10
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.

引用本文的文献

1
How the technologies behind self-driving cars, social networks, ChatGPT, and DALL-E2 are changing structural biology.自动驾驶汽车、社交网络、ChatGPT和DALL-E2背后的技术如何正在改变结构生物学。
Bioessays. 2025 Jan;47(1):e2400155. doi: 10.1002/bies.202400155. Epub 2024 Oct 15.
2
LoCoHD: a metric for comparing local environments of proteins.LoCoHD:一种用于比较蛋白质局部环境的度量。
Nat Commun. 2024 May 13;15(1):4029. doi: 10.1038/s41467-024-48225-0.
3
Sequence-structure-function relationships in the microbial protein universe.微生物蛋白质宇宙中的序列-结构-功能关系。
Nat Commun. 2023 Apr 26;14(1):2351. doi: 10.1038/s41467-023-37896-w.
4
InterPepRank: Assessment of Docked Peptide Conformations by a Deep Graph Network.InterPepRank:通过深度图网络评估对接肽构象
Front Bioinform. 2021 Oct 25;1:763102. doi: 10.3389/fbinf.2021.763102. eCollection 2021.
5
Estimating the Similarity between Protein Pockets.估算蛋白质口袋之间的相似性。
Int J Mol Sci. 2022 Oct 18;23(20):12462. doi: 10.3390/ijms232012462.
6
Computational Study on the Dynamics of Mycobacterium Tuberculosis RNA Polymerase Assembly.结核分枝杆菌 RNA 聚合酶组装动力学的计算研究。
Methods Mol Biol. 2022;2516:61-79. doi: 10.1007/978-1-0716-2413-5_5.
7
SIMILE enables alignment of tandem mass spectra with statistical significance.SIMILE 可实现串联质谱数据的对齐,并具有统计学意义。
Nat Commun. 2022 May 6;13(1):2510. doi: 10.1038/s41467-022-30118-9.
8
Fast protein structure comparison through effective representation learning with contrastive graph neural networks.通过对比图神经网络的有效表示学习进行快速蛋白质结构比较。
PLoS Comput Biol. 2022 Mar 24;18(3):e1009986. doi: 10.1371/journal.pcbi.1009986. eCollection 2022 Mar.
9
Structural modeling of Omicron spike protein and its complex with human ACE-2 receptor: Molecular basis for high transmissibility of the virus.奥密克戎刺突蛋白及其与人 ACE-2 受体复合物的结构建模:病毒高传染性的分子基础。
Biochem Biophys Res Commun. 2022 Feb 12;592:51-53. doi: 10.1016/j.bbrc.2021.12.082. Epub 2022 Jan 7.
10
Structural analysis of COVID-19 spike protein in recognizing the ACE2 receptor of different mammalian species and its susceptibility to viral infection.新冠病毒刺突蛋白识别不同哺乳动物物种血管紧张素转换酶2(ACE2)受体的结构分析及其对病毒感染的易感性
3 Biotech. 2021 Feb;11(2):109. doi: 10.1007/s13205-020-02599-2. Epub 2021 Feb 1.

本文引用的文献

1
Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core.噬菌体阻遏物与珠蛋白核心的DNA结合结构域的结构相似性。
Curr Biol. 1993 Mar;3(3):141-8. doi: 10.1016/0960-9822(93)90255-m.
2
Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.利用可靠的结构鉴定远距离进化关系评估序列比较方法。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):6073-8. doi: 10.1073/pnas.95.11.6073.
3
Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins.针对手动标准(蛋白质的scop分类)对自动结构比对进行全面评估。
Protein Sci. 1998 Feb;7(2):445-56. doi: 10.1002/pro.5560070226.
4
Protein Data Bank archives of three-dimensional macromolecular structures.蛋白质数据库存档三维大分子结构。
Methods Enzymol. 1997;277:556-71. doi: 10.1016/s0076-6879(97)77031-9.
5
Identifying distantly related protein sequences.识别远缘相关的蛋白质序列。
Comput Appl Biosci. 1997 Aug;13(4):325-32. doi: 10.1093/bioinformatics/13.4.325.
6
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
7
SCOP: a structural classification of proteins database.SCOP:蛋白质数据库的结构分类
Nucleic Acids Res. 1997 Jan 1;25(1):236-9. doi: 10.1093/nar/25.1.236.
8
Using a measure of structural variation to define a core for the globins.使用结构变异的度量来定义珠蛋白的核心。
Comput Appl Biosci. 1995 Dec;11(6):633-44. doi: 10.1093/bioinformatics/11.6.633.
9
Surprising similarities in structure comparison.结构比较中惊人的相似之处。
Curr Opin Struct Biol. 1996 Jun;6(3):377-85. doi: 10.1016/s0959-440x(96)80058-3.
10
Understanding protein structure: using scop for fold interpretation.理解蛋白质结构:利用SCOP进行折叠解读。
Methods Enzymol. 1996;266:635-43. doi: 10.1016/s0076-6879(96)66039-x.

用于序列比较和结构比较的统一统计框架。

A unified statistical framework for sequence comparison and structure comparison.

作者信息

Levitt M, Gerstein M

机构信息

Department of Structural Biology, Stanford University, Stanford, CA 94305, USA.

出版信息

Proc Natl Acad Sci U S A. 1998 May 26;95(11):5913-20. doi: 10.1073/pnas.95.11.5913.

DOI:10.1073/pnas.95.11.5913
PMID:9600892
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC34495/
Abstract

We present an approach for assessing the significance of sequence and structure comparisons by using nearly identical statistical formalisms for both sequence and structure. Doing so involves an all-vs.-all comparison of protein domains [taken here from the Structural Classification of Proteins (scop) database] and then fitting a simple distribution function to the observed scores. By using this distribution, we can attach a statistical significance to each comparison score in the form of a P value, the probability that a better score would occur by chance. As expected, we find that the scores for sequence matching follow an extreme-value distribution. The agreement, moreover, between the P values that we derive from this distribution and those reported by standard programs (e.g., BLAST and FASTA validates our approach. Structure comparison scores also follow an extreme-value distribution when the statistics are expressed in terms of a structural alignment score (essentially the sum of reciprocated distances between aligned atoms minus gap penalties). We find that the traditional metric of structural similarity, the rms deviation in atom positions after fitting aligned atoms, follows a different distribution of scores and does not perform as well as the structural alignment score. Comparison of the sequence and structure statistics for pairs of proteins known to be related distantly shows that structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. The comparison also indicates that there are very few pairs with significant similarity in terms of sequence but not structure whereas many pairs have significant similarity in terms of structure but not sequence.

摘要

我们提出了一种方法,通过对序列和结构使用几乎相同的统计形式来评估序列和结构比较的显著性。这样做涉及对蛋白质结构域进行全对全比较(此处取自蛋白质结构分类数据库),然后将一个简单的分布函数拟合到观察到的分数上。通过使用这种分布,我们可以以P值的形式为每个比较分数赋予统计显著性,即偶然获得更好分数的概率。正如预期的那样,我们发现序列匹配的分数遵循极值分布。此外,我们从该分布得出的P值与标准程序(如BLAST和FASTA)报告的P值之间的一致性验证了我们的方法。当统计以结构比对分数表示时(本质上是比对原子之间的倒数距离之和减去空位罚分),结构比较分数也遵循极值分布。我们发现,传统的结构相似性度量,即拟合比对原子后原子位置的均方根偏差,遵循不同的分数分布,并且不如结构比对分数表现好。对已知远缘相关的蛋白质对的序列和结构统计进行比较表明,在相同错误率下,结构比较能够检测到的远缘关系数量大约是序列比较的两倍。该比较还表明,在序列方面有显著相似性但在结构方面没有的蛋白质对非常少,而在结构方面有显著相似性但在序列方面没有的蛋白质对有很多。