• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向用于多序列比对的可靠目标函数。

Towards a reliable objective function for multiple sequence alignments.

作者信息

Thompson J D, Plewniak F, Ripp R, Thierry J C, Poch O

机构信息

Laboratoire de Biologie et Génomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire, (CNRS/INSERM/ULP), Illkirch Cedex, 67404, France.

出版信息

J Mol Biol. 2001 Dec 7;314(4):937-51. doi: 10.1006/jmbi.2001.5187.

DOI:10.1006/jmbi.2001.5187
PMID:11734009
Abstract

Multiple sequence alignment is a fundamental tool in a number of different domains in modern molecular biology, including functional and evolutionary studies of a protein family. Multiple alignments also play an essential role in the new integrated systems for genome annotation and analysis. Thus, the development of new multiple alignment scores and statistics is essential, in the spirit of the work dedicated to the evaluation of pairwise sequence alignments for database searching techniques. We present here norMD, a new objective scoring function for multiple sequence alignments. NorMD combines the advantages of the column-scoring techniques with the sensitivity of methods incorporating residue similarity scores. In addition, norMD incorporates ab initio sequence information, such as the number, length and similarity of the sequences to be aligned. The sensitivity and reliability of the norMD objective function is demonstrated using structural alignments in the SCOP and BAliBASE databases. The norMD scores are then applied to the multiple alignments of the complete sequences (MACS) detected by BlastP with E-value<10, for a set of 734 hypothetical proteins encoded by the Vibrio cholerae genome. Unrelated or badly aligned sequences were automatically removed from the MACS, leaving a high-quality multiple alignment which could be reliably exploited in a subsequent functional and/or structural annotation process. After removal of unreliable sequences, 176 (24 %) of the alignments contained at least one sequence with a functional annotation. 103 of these new matches were supported by significant hits to the Interpro domain and motif database.

摘要

多序列比对是现代分子生物学许多不同领域中的一项基本工具,包括蛋白质家族的功能和进化研究。多序列比对在基因组注释和分析的新集成系统中也起着至关重要的作用。因此,本着致力于评估数据库搜索技术中成对序列比对的工作精神,开发新的多序列比对评分和统计方法至关重要。我们在此介绍norMD,一种用于多序列比对的新的客观评分函数。norMD结合了列评分技术的优点以及纳入残基相似性评分的方法的敏感性。此外,norMD纳入了从头算序列信息,例如要比对的序列的数量、长度和相似性。使用SCOP和BAliBASE数据库中的结构比对证明了norMD目标函数的敏感性和可靠性。然后将norMD评分应用于由BlastP检测到的E值<10的完整序列(MACS)的多序列比对,这些序列来自霍乱弧菌基因组编码的一组734个假设蛋白质。不相关或比对不佳的序列会自动从MACS中去除,留下高质量的多序列比对,可在后续的功能和/或结构注释过程中可靠地利用。去除不可靠序列后,176个(24%)比对中至少包含一个具有功能注释的序列。其中103个新匹配得到了Interpro结构域和基序数据库的显著匹配支持。

相似文献

1
Towards a reliable objective function for multiple sequence alignments.迈向用于多序列比对的可靠目标函数。
J Mol Biol. 2001 Dec 7;314(4):937-51. doi: 10.1006/jmbi.2001.5187.
2
Local weighting schemes for protein multiple sequence alignment.蛋白质多序列比对的局部加权方案。
Comput Chem. 2002 Jul;26(5):459-77. doi: 10.1016/s0097-8485(02)00008-6.
3
SeqTools: visual tools for manual analysis of sequence alignments.SeqTools:用于手动分析序列比对的可视化工具。
BMC Res Notes. 2016 Jan 22;9:39. doi: 10.1186/s13104-016-1847-3.
4
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
5
Assessing the discordance of multiple sequence alignments.评估多重序列比对的不协调性。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):542-51. doi: 10.1109/TCBB.2007.70271.
6
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
7
Model-based prediction of sequence alignment quality.基于模型的序列比对质量预测。
Bioinformatics. 2008 Oct 1;24(19):2165-71. doi: 10.1093/bioinformatics/btn414. Epub 2008 Aug 4.
8
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
9
Identifying bacterial and archaeal homologs of pentameric ligand-gated ion channel (pLGIC) family using domain-based and alignment-based approaches.使用基于结构域和基于比对的方法鉴定五聚体配体门控离子通道(pLGIC)家族的细菌和古菌同源物。
Channels (Austin). 2011 Jul-Aug;5(4):325-43. doi: 10.4161/chan.5.4.16822.
10
Enhanced genome annotation using structural profiles in the program 3D-PSSM.在程序3D-PSSM中使用结构概况增强基因组注释。
J Mol Biol. 2000 Jun 2;299(2):499-520. doi: 10.1006/jmbi.2000.3741.

引用本文的文献

1
ReAlign-N: an integrated realignment approach for multiple nucleic acid sequence alignment, combining global and local realignments.ReAlign-N:一种用于多核酸序列比对的综合重排方法,结合了全局和局部重排。
NAR Genom Bioinform. 2024 Dec 18;6(4):lqae170. doi: 10.1093/nargab/lqae170. eCollection 2024 Dec.
2
Caecilian Genomes Reveal the Molecular Basis of Adaptation and Convergent Evolution of Limblessness in Snakes and Caecilians.蚓螈基因组揭示了蛇和蚓螈无肢状态适应和趋同进化的分子基础。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad102.
3
Pipeline for transferring annotations between proteins beyond globular domains.
球状结构域之外的蛋白质间注释转移流水线。
Protein Sci. 2023 Jul;32(7):e4655. doi: 10.1002/pro.4655.
4
A burst of genomic innovation at the origin of placental mammals mediated embryo implantation.胎盘哺乳动物起源时基因组创新的爆发介导了胚胎着床。
Commun Biol. 2023 Apr 26;6(1):459. doi: 10.1038/s42003-023-04809-y.
5
Improving Orthologous Signal and Model Fit in Datasets Addressing the Root of the Animal Phylogeny.提高解决动物系统发育根源问题的数据集的直系同源信号和模型拟合度。
Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac276.
6
The Evolution of Glycoside Hydrolase Family 1 in Insects Related to Their Adaptation to Plant Utilization.与昆虫适应植物利用相关的糖苷水解酶家族1的进化
Insects. 2022 Aug 30;13(9):786. doi: 10.3390/insects13090786.
7
Estimating the Divergence Times of Alphaproteobacteria Based on Mitochondrial Endosymbiosis and Eukaryotic Fossils.基于线粒体内共生和真核生物化石估算α变形菌的分歧时间。
Methods Mol Biol. 2022;2569:95-116. doi: 10.1007/978-1-0716-2691-7_5.
8
Nuclear genome of Bulinus truncatus, an intermediate host of the carcinogenic human blood fluke Schistosoma haematobium.非洲大蜗牛的核基因组,一种致癌性人类血吸虫(埃及血吸虫)的中间宿主。
Nat Commun. 2022 Feb 21;13(1):977. doi: 10.1038/s41467-022-28634-9.
9
Ribosome heterogeneity in Drosophila melanogaster gonads through paralog-switching.果蝇性腺中通过基因重排导致核糖体异质性。
Nucleic Acids Res. 2022 Feb 28;50(4):2240-2257. doi: 10.1093/nar/gkab606.
10
A phylogenetic approach for weighting genetic sequences.一种用于遗传序列加权的系统发育方法。
BMC Bioinformatics. 2021 May 28;22(1):285. doi: 10.1186/s12859-021-04183-8.