• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

畴尺寸分布可以预测畴界。

Domain size distributions can predict domain boundaries.

作者信息

Wheelan S J, Marchler-Bauer A, Bryant S H

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.

出版信息

Bioinformatics. 2000 Jul;16(7):613-8. doi: 10.1093/bioinformatics/16.7.613.

DOI:10.1093/bioinformatics/16.7.613
PMID:11038331
Abstract

MOTIVATION

The sizes of protein domains observed in the 3D-structure database follow a surprisingly narrow distribution. Structural domains are furthermore formed from a single-chain continuous segment in over 80% of instances. These observations imply that some choices of domain boundaries on an otherwise uncharacterized sequence are more likely than others, based solely on the size and segment number of predicted domains. This property might be used to guess the locations of protein domain boundaries.

RESULTS

To test this possibility we enumerate putative domain boundaries and calculate their relative likelihood under a probability model that considers only the size and segment number of predicted domains. We ask, in a cross-validated test using sequences with known 3D structure, whether the most likely guesses agree with the observed domain structure. We find that domain boundary predictions are surprisingly successful for sequences up to 400 residues long and that guessing domain boundaries in this way can improve the sensitivity of threading analysis.

摘要

动机

在三维结构数据库中观察到的蛋白质结构域大小呈现出惊人的狭窄分布。此外,超过80%的情况下,结构域由单链连续片段形成。这些观察结果表明,在一个原本未表征的序列上,仅基于预测结构域的大小和片段数量,某些结构域边界的选择比其他选择更有可能。这一特性可用于猜测蛋白质结构域边界的位置。

结果

为了测试这种可能性,我们列举了假定的结构域边界,并在一个仅考虑预测结构域大小和片段数量的概率模型下计算它们的相对可能性。在使用具有已知三维结构的序列进行的交叉验证测试中,我们询问最有可能的猜测是否与观察到的结构域结构一致。我们发现,对于长度达400个残基的序列,结构域边界预测出奇地成功,并且以这种方式猜测结构域边界可以提高穿线分析的灵敏度。

相似文献

1
Domain size distributions can predict domain boundaries.畴尺寸分布可以预测畴界。
Bioinformatics. 2000 Jul;16(7):613-8. doi: 10.1093/bioinformatics/16.7.613.
2
Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments.利用协方差分析从多个蛋白质序列比对预测结构域边界。
Protein Eng. 2002 Feb;15(2):65-77. doi: 10.1093/protein/15.2.65.
3
[Prediction of protein domain boundaries based on statistics of appearance of amino acid residues].基于氨基酸残基出现统计的蛋白质结构域边界预测
Mol Biol (Mosk). 2006 Jan-Feb;40(1):111-21.
4
Fast prediction of protein domain boundaries using conserved local patterns.利用保守局部模式快速预测蛋白质结构域边界
J Mol Model. 2006 Sep;12(6):943-52. doi: 10.1007/s00894-006-0116-0. Epub 2006 Apr 29.
5
SnapDRAGON: a method to delineate protein structural domains from sequence data.SnapDRAGON:一种从序列数据中描绘蛋白质结构域的方法。
J Mol Biol. 2002 Feb 22;316(3):839-51. doi: 10.1006/jmbi.2001.5387.
6
Delineation of modular proteins: domain boundary prediction from sequence information.模块化蛋白质的描绘:基于序列信息的结构域边界预测
Brief Bioinform. 2004 Jun;5(2):179-92. doi: 10.1093/bib/5.2.179.
7
Armadillo: domain boundary prediction by amino acid composition.犰狳:基于氨基酸组成的结构域边界预测
J Mol Biol. 2005 Jul 29;350(5):1061-73. doi: 10.1016/j.jmb.2005.05.037.
8
Inferring boundary information of discontinuous-domain proteins.推断不连续结构域蛋白质的边界信息。
IEEE Trans Nanobioscience. 2008 Sep;7(3):200-5. doi: 10.1109/TNB.2008.2002283.
9
Domain boundary prediction based on profile domain linker propensity index.基于序列轮廓结构域连接子倾向指数的结构域边界预测
Comput Biol Chem. 2006 Apr;30(2):127-33. doi: 10.1016/j.compbiolchem.2006.01.001. Epub 2006 Mar 13.
10
Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps.基于多头注意力的 U-Net 模型,利用 1D 序列特征和 2D 距离图预测蛋白质结构域边界。
BMC Bioinformatics. 2022 Jul 19;23(1):283. doi: 10.1186/s12859-022-04829-1.

引用本文的文献

1
Exploring Large Protein Sequence Space through Homology- and Representation-based Hierarchical Clustering.通过基于同源性和表示的层次聚类探索大型蛋白质序列空间。
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf136.
2
Positions of cysteine residues reveal local clusters and hidden relationships to Sequons and Transmembrane domains in Human proteins.半胱氨酸残基的位置揭示了人类蛋白质中局部簇和与顺反子及跨膜结构域的隐藏关系。
Sci Rep. 2024 Oct 29;14(1):25886. doi: 10.1038/s41598-024-77056-8.
3
Chemical Synthesis of Human Proteoforms and Application in Biomedicine.
人蛋白质异构体的化学合成及其在生物医学中的应用
ACS Cent Sci. 2024 Jul 22;10(8):1442-1459. doi: 10.1021/acscentsci.4c00642. eCollection 2024 Aug 28.
4
Merizo: a rapid and accurate protein domain segmentation method using invariant point attention.Merizo:一种使用不变点注意力的快速准确的蛋白质结构域分割方法。
Nat Commun. 2023 Dec 19;14(1):8445. doi: 10.1038/s41467-023-43934-4.
5
Bioinformatic and literature assessment of toxicity and allergenicity of a CRISPR-Cas9 engineered gene drive to control Anopheles gambiae the mosquito vector of human malaria.CRISPR-Cas9 基因驱动工程对控制人类疟疾传播媒介按蚊的毒性和致敏性的生物信息学和文献评估。
Malar J. 2023 Aug 14;22(1):234. doi: 10.1186/s12936-023-04665-5.
6
Protein length distribution is remarkably uniform across the tree of life.蛋白质长度分布在整个生命之树上都非常均匀。
Genome Biol. 2023 Jun 8;24(1):135. doi: 10.1186/s13059-023-02973-2.
7
Sub-region analysis of DMD gene in cases with idiopathic generalized epilepsy.特发性全面性癫痫病例中 DMD 基因的亚区分析。
Neurogenetics. 2023 Jul;24(3):161-169. doi: 10.1007/s10048-023-00715-x. Epub 2023 Apr 6.
8
A structural biology community assessment of AlphaFold2 applications.AlphaFold2 应用的结构生物学社区评估。
Nat Struct Mol Biol. 2022 Nov;29(11):1056-1067. doi: 10.1038/s41594-022-00849-w. Epub 2022 Nov 7.
9
Redox-Controlled Chemical Protein Synthesis: Sundry Shades of Latency.氧化还原控制的化学蛋白质合成:潜伏的种种变化。
Acc Chem Res. 2022 Sep 20;55(18):2685-2697. doi: 10.1021/acs.accounts.2c00436. Epub 2022 Sep 9.
10
Phyletic Distribution and Diversification of the Phage Shock Protein Stress Response System in Bacteria and Archaea.细菌和古菌中噬菌体休克蛋白应激反应系统的系统发生分布和多样化。
mSystems. 2022 Jun 28;7(3):e0134821. doi: 10.1128/msystems.01348-21. Epub 2022 May 23.