JCoDA：一种用于检测进化选择的工具。

JCoDA: a tool for detecting evolutionary selection.

机构信息

Department of Biology, The College of New Jersey, 2000 Pennington Road, Ewing, NJ 08628, USA.

出版信息

BMC Bioinformatics. 2010 May 27;11:284. doi: 10.1186/1471-2105-11-284.

DOI:10.1186/1471-2105-11-284

PMID:20507581

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2887424/

Abstract

BACKGROUND

The incorporation of annotated sequence information from multiple related species in commonly used databases (Ensembl, Flybase, Saccharomyces Genome Database, Wormbase, etc.) has increased dramatically over the last few years. This influx of information has provided a considerable amount of raw material for evaluation of evolutionary relationships. To aid in the process, we have developed JCoDA (Java Codon Delimited Alignment) as a simple-to-use visualization tool for the detection of site specific and regional positive/negative evolutionary selection amongst homologous coding sequences.

RESULTS

JCoDA accepts user-inputted unaligned or pre-aligned coding sequences, performs a codon-delimited alignment using ClustalW, and determines the dN/dS calculations using PAML (Phylogenetic Analysis Using Maximum Likelihood, yn00 and codeml) in order to identify regions and sites under evolutionary selection. The JCoDA package includes a graphical interface for Phylip (Phylogeny Inference Package) to generate phylogenetic trees, manages formatting of all required file types, and streamlines passage of information between underlying programs. The raw data are output to user configurable graphs with sliding window options for straightforward visualization of pairwise or gene family comparisons. Additionally, codon-delimited alignments are output in a variety of common formats and all dN/dS calculations can be output in comma-separated value (CSV) format for downstream analysis. To illustrate the types of analyses that are facilitated by JCoDA, we have taken advantage of the well studied sex determination pathway in nematodes as well as the extensive sequence information available to identify genes under positive selection, examples of regional positive selection, and differences in selection based on the role of genes in the sex determination pathway.

CONCLUSIONS

JCoDA is a configurable, open source, user-friendly visualization tool for performing evolutionary analysis on homologous coding sequences. JCoDA can be used to rapidly screen for genes and regions of genes under selection using PAML. It can be freely downloaded at http://www.tcnj.edu/~nayaklab/jcoda.

摘要

背景

近年来，在常用数据库（Ensembl、Flybase、Saccharomyces Genome Database、Wormbase 等）中整合来自多个相关物种的带注释序列信息的工作有了显著增加。这些信息的涌入为评估进化关系提供了大量的原始材料。为了帮助这一过程，我们开发了 JCoDA（Java Codon Delimited Alignment），它是一种简单易用的可视化工具，用于检测同源编码序列中特定位置和区域的正/负进化选择。

结果

JCoDA 接受用户输入的未对齐或预对齐的编码序列，使用 ClustalW 进行密码子分隔对齐，并使用 PAML（最大似然法的系统发育分析，yn00 和 codeml）进行 dN/dS 计算，以识别进化选择下的区域和位点。JCoDA 包包括一个用于 Phylip（系统发育推断包）的图形界面，用于生成系统发育树，管理所有必需文件类型的格式设置，并简化底层程序之间的信息传递。原始数据输出到用户可配置的图表，带有滑动窗口选项，可直观地显示成对或基因家族比较。此外，密码子分隔对齐以多种常见格式输出，所有 dN/dS 计算都可以以逗号分隔值（CSV）格式输出，以便进行下游分析。为了说明 JCoDA 促进的分析类型，我们利用了线虫中研究得很好的性别决定途径以及可用的广泛序列信息来识别正选择下的基因、区域正选择的例子以及基于基因在性别决定途径中的作用的选择差异。

结论

JCoDA 是一种可配置的、开源的、用户友好的可视化工具，用于对同源编码序列进行进化分析。JCoDA 可用于使用 PAML 快速筛选选择下的基因和基因区域。它可以在 http://www.tcnj.edu/~nayaklab/jcoda 上免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47f1/2887424/d478b42a29de/1471-2105-11-284-1.jpg

相似文献

JCoDA: a tool for detecting evolutionary selection.

BMC Bioinformatics. 2010 May 27;11:284. doi: 10.1186/1471-2105-11-284.

LMAP: Lightweight Multigene Analyses in PAML.

BMC Bioinformatics. 2016 Sep 6;17(1):354. doi: 10.1186/s12859-016-1204-5.

PoSE: visualization of patterns of sequence evolution using PAML and MATLAB.

BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):364. doi: 10.1186/s12859-018-2335-7.

EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A.

Database (Oxford). 2015 Jul 2;2015:bav065. doi: 10.1093/database/bav065. Print 2015.

A beginners guide to estimating the non-synonymous to synonymous rate ratio of all protein-coding genes in a genome.

Methods Mol Biol. 2015;1201:65-90. doi: 10.1007/978-1-4939-1438-8_4.

IDEA: Interactive Display for Evolutionary Analyses.

BMC Bioinformatics. 2008 Dec 8;9:524. doi: 10.1186/1471-2105-9-524.

webPRANK: a phylogeny-aware multiple sequence aligner with interactive alignment browser.

BMC Bioinformatics. 2010 Nov 26;11:579. doi: 10.1186/1471-2105-11-579.

Human PAML browser: a database of positive selection on human genes using phylogenetic methods.

Nucleic Acids Res. 2008 Jan;36(Database issue):D800-8. doi: 10.1093/nar/gkm764. Epub 2007 Oct 25.

Beginner's Guide on the Use of PAML to Detect Positive Selection.

Mol Biol Evol. 2023 Apr 4;40(4). doi: 10.1093/molbev/msad041.

ABC: software for interactive browsing of genomic multiple sequence alignment data.

BMC Bioinformatics. 2004 Dec 8;5:192. doi: 10.1186/1471-2105-5-192.

引用本文的文献

AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis.

Genome Biol Evol. 2023 Oct 6;15(10). doi: 10.1093/gbe/evad187.

Analysis of gene duplication within the Arabidopsis NUCLEAR FACTOR Y, subunit B (NF-YB) protein family reveals domains under both purifying and diversifying selection.

PLoS One. 2023 Aug 2;18(8):e0289332. doi: 10.1371/journal.pone.0289332. eCollection 2023.

FREEDA: An automated computational pipeline guides experimental testing of protein innovation.

J Cell Biol. 2023 Sep 4;222(9). doi: 10.1083/jcb.202212084. Epub 2023 Jun 26.

Duplication of NRAMP3 gene in poplars generated two homologous transporters with distinct functions.

Mol Biol Evol. 2022 Jun 14;39(6). doi: 10.1093/molbev/msac129.

GWideCodeML: A Python Package for Testing Evolutionary Hypotheses at the Genome-Wide Level.

G3 (Bethesda). 2020 Dec 3;10(12):4369-4372. doi: 10.1534/g3.120.401874.

DGINN, an automated and highly-flexible pipeline for the detection of genetic innovations on protein-coding genes.

Nucleic Acids Res. 2020 Oct 9;48(18):e103. doi: 10.1093/nar/gkaa680.

Abundance, Functional, and Evolutionary Analysis of Oxalyl-Coenzyme A Decarboxylase in Human Microbiota.

Front Microbiol. 2020 Apr 23;11:672. doi: 10.3389/fmicb.2020.00672. eCollection 2020.

Under-the-Radar Dengue Virus Infections in Natural Populations of Aedes aegypti Mosquitoes.

mSphere. 2020 Apr 29;5(2):e00316-20. doi: 10.1128/mSphere.00316-20.

Comprehensive genome-wide identification of angiosperm upstream ORFs with peptide sequences conserved in various taxonomic ranges using a novel pipeline, ESUCA.

BMC Genomics. 2020 Mar 30;21(1):260. doi: 10.1186/s12864-020-6662-5.

Extensive survey of the ycf4 plastid gene throughout the IRLC legumes: Robust evidence of its locus and lineage specific accelerated rate of evolution, pseudogenization and gene loss in the tribe Fabeae.

PLoS One. 2020 Mar 5;15(3):e0229846. doi: 10.1371/journal.pone.0229846. eCollection 2020.

本文引用的文献

IDEA: Interactive Display for Evolutionary Analyses.

BMC Bioinformatics. 2008 Dec 8;9:524. doi: 10.1186/1471-2105-9-524.

The trouble with sliding windows and the selective pressure in BRCA1.

PLoS One. 2008;3(11):e3746. doi: 10.1371/journal.pone.0003746. Epub 2008 Nov 18.

BioJava: an open-source framework for bioinformatics.

Bioinformatics. 2008 Sep 15;24(18):2096-7. doi: 10.1093/bioinformatics/btn397. Epub 2008 Aug 8.

Multiple sequence alignment.

Methods Mol Biol. 2008;452:143-61. doi: 10.1007/978-1-60327-159-2_7.

OCPAT: an online codon-preserved alignment tool for evolutionary genomic analysis of protein coding sequences.

Source Code Biol Med. 2007 Sep 18;2:5. doi: 10.1186/1751-0473-2-5.

Clustal W and Clustal X version 2.0.

Bioinformatics. 2007 Nov 1;23(21):2947-8. doi: 10.1093/bioinformatics/btm404. Epub 2007 Sep 10.

PAML 4: phylogenetic analysis by maximum likelihood.

Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.

The HIV positive selection mutation database.

Nucleic Acids Res. 2007 Jan;35(Database issue):D371-5. doi: 10.1093/nar/gkl855. Epub 2006 Nov 15.

PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W609-12. doi: 10.1093/nar/gkl315.

SWAKK: a web server for detecting positive selection in proteins using a sliding window substitution rate analysis.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W382-4. doi: 10.1093/nar/gkl272.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

JCoDA：一种用于检测进化选择的工具。

JCoDA: a tool for detecting evolutionary selection.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献