• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

构建源自具有偏向性氨基酸分布的蛋白质组的非对称替换矩阵。

Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions.

作者信息

Bastien Olivier, Roy Sylvaine, Maréchal Eric

机构信息

Laboratoire de Physiologie Cellulaire Végétale, Département Réponse et Dynamique Cellulaire, UMR 5019, CNRS-CEA-INRA-Université Joseph-Fourier, CEA Grenoble, 17, rue des Martyrs, 38054 Grenoble, France.

出版信息

C R Biol. 2005 May;328(5):445-53. doi: 10.1016/j.crvi.2005.02.002.

DOI:10.1016/j.crvi.2005.02.002
PMID:15948633
Abstract

Automatic comparison of compositionally biased genomes, such as that of the malarial causative agent Plasmodium falciparum (82% adenosine + thymidine), with genomes of average composition, is currently limited. Indeed, popular tools such as BLAST require that amino acid distributions be similar in aligned sequences. However, the P. falciparum genome is so biased that six amino acids account for more than 50% of the protein composition. One reason for the comparison methods failure lies in the compositional difference between the query and the subject proteomes, which is not taken into account in the amino acid substitution matrices. This paper introduces a method to derive substitution matrices, in particular BLOSUM 62, in the frame of the information theory. It allows the construction of non-symmetrical matrices, taking into account the non-symmetric amino acid distributions. The dirAtPf family of matrices allowing the comparison of P. falciparum and A. thaliana is given as an example. This paper further provides an analysis of the obtained matrices in the frame of the information theory, supporting the discrimination advantage they bring.

摘要

目前,将组成存在偏差的基因组(如疟疾病原体恶性疟原虫的基因组,其腺苷+胸腺嘧啶含量为82%)与平均组成的基因组进行自动比较存在局限性。实际上,像BLAST这样的常用工具要求比对序列中的氨基酸分布相似。然而,恶性疟原虫的基因组偏差极大,六种氨基酸占蛋白质组成的比例超过50%。比较方法失败的一个原因在于查询蛋白质组和目标蛋白质组之间的组成差异,而氨基酸替换矩阵并未考虑这一点。本文介绍了一种在信息论框架下推导替换矩阵,特别是BLOSUM 62矩阵的方法。它允许构建非对称矩阵,同时考虑到非对称的氨基酸分布。以允许比较恶性疟原虫和拟南芥的dirAtPf矩阵家族为例。本文还在信息论框架下对所得矩阵进行了分析,证明了它们所带来的区分优势。

相似文献

1
Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions.构建源自具有偏向性氨基酸分布的蛋白质组的非对称替换矩阵。
C R Biol. 2005 May;328(5):445-53. doi: 10.1016/j.crvi.2005.02.002.
2
A novel series of compositionally biased substitution matrices for comparing Plasmodium proteins.用于比较疟原虫蛋白质的一系列新型成分偏倚替换矩阵。
BMC Bioinformatics. 2008 May 16;9:236. doi: 10.1186/1471-2105-9-236.
3
Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome.基因组偏向性影响氨基酸选择:对氨基酸替换的分析以及对AT偏向性基因组特有的替换矩阵的重新编译。
Nucleic Acids Res. 2008 Dec;36(21):6664-75. doi: 10.1093/nar/gkn635. Epub 2008 Oct 23.
4
Analysis of the compositional biases in Plasmodium falciparum genome and proteome using Arabidopsis thaliana as a reference.以拟南芥为参照对恶性疟原虫基因组和蛋白质组中的组成性偏差进行分析。
Gene. 2004 Jul 21;336(2):163-73. doi: 10.1016/j.gene.2004.04.029.
5
The compositional adjustment of amino acid substitution matrices.氨基酸替换矩阵的组成调整。
Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15688-93. doi: 10.1073/pnas.2533904100. Epub 2003 Dec 8.
6
Proteome composition in Plasmodium falciparum: higher usage of GC-rich nonsynonymous codons in highly expressed genes.恶性疟原虫的蛋白质组组成:高表达基因中富含GC的非同义密码子的使用频率更高。
J Mol Evol. 2005 Oct;61(4):513-23. doi: 10.1007/s00239-005-0023-5. Epub 2005 Jul 21.
7
The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions.用于比较具有非标准组成的蛋白质的氨基酸替换矩阵的构建。
Bioinformatics. 2005 Apr 1;21(7):902-11. doi: 10.1093/bioinformatics/bti070. Epub 2004 Oct 27.
8
Systematic analysis of proteomes with emphasis on insertions in malaria parasite Plasmodium falciparum.
Protein Pept Lett. 2013 Oct;20(10):1088-97. doi: 10.2174/0929866511320100002.
9
An analysis of mobile genetic elements in three Plasmodium species and their potential impact on the nucleotide composition of the P. falciparum genome.对三种疟原虫属物种中移动遗传元件的分析及其对恶性疟原虫基因组核苷酸组成的潜在影响。
BMC Genomics. 2006 Nov 4;7:282. doi: 10.1186/1471-2164-7-282.
10
Analysis of differences in amino acid substitution patterns, using multilevel G-tests.使用多级G检验分析氨基酸替代模式的差异。
C R Biol. 2005 Jul;328(7):632-41. doi: 10.1016/j.crvi.2005.03.003. Epub 2005 Apr 7.

引用本文的文献

1
Analysis of SARS-CoV-2 nucleocapsid phosphoprotein N variations in the binding site to human 14-3-3 proteins.分析 SARS-CoV-2 核衣壳磷蛋白 N 与人 14-3-3 蛋白结合位点的变异。
Biochem Biophys Res Commun. 2021 Sep 10;569:154-160. doi: 10.1016/j.bbrc.2021.06.100. Epub 2021 Jul 2.
2
ALP & FALP: C++ libraries for pairwise local alignment E-values.ALP和FALP:用于成对局部比对E值的C++库。
Bioinformatics. 2016 Jan 15;32(2):304-5. doi: 10.1093/bioinformatics/btv575. Epub 2015 Oct 1.
3
The worst case complexity of maximum parsimony.
最大简约法的最坏情况复杂度。
J Comput Biol. 2014 Nov;21(11):799-808. doi: 10.1089/cmb.2014.0128. Epub 2014 Oct 10.
4
A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships.一种新的替代矩阵,拟合 Mollicutes 中的组成偏倚,可提高同源关系的预测。
BMC Bioinformatics. 2011 Nov 24;12:457. doi: 10.1186/1471-2105-12-457.
5
Where does the alignment score distribution shape come from?对齐得分分布的形状来自哪里?
Evol Bioinform Online. 2010 Dec 12;6:159-87. doi: 10.4137/EBO.S5875.
6
Assessing functional annotation transfers with inter-species conserved coexpression: application to Plasmodium falciparum.评估物种间保守共表达的功能注释转移:在疟原虫中的应用。
BMC Genomics. 2010 Jan 15;11:35. doi: 10.1186/1471-2164-11-35.
7
Revisiting the Plasmodium falciparum RIFIN family: from comparative genomics to 3D-model prediction.重新审视恶性疟原虫RIFIN家族:从比较基因组学到三维模型预测
BMC Genomics. 2009 Sep 21;10:445. doi: 10.1186/1471-2164-10-445.
8
A simple derivation of the distribution of pairwise local protein sequence alignment scores.一种简单的蛋白质序列两两比对局部评分分布推导。
Evol Bioinform Online. 2008 Feb 14;4:41-5.
9
Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome.基因组偏向性影响氨基酸选择:对氨基酸替换的分析以及对AT偏向性基因组特有的替换矩阵的重新编译。
Nucleic Acids Res. 2008 Dec;36(21):6664-75. doi: 10.1093/nar/gkn635. Epub 2008 Oct 23.
10
PlasmoDraft: a database of Plasmodium falciparum gene function predictions based on postgenomic data.疟原虫草稿:一个基于后基因组数据的恶性疟原虫基因功能预测数据库。
BMC Bioinformatics. 2008 Oct 16;9:440. doi: 10.1186/1471-2105-9-440.