• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

orthofisher:一种广泛适用的自动化基因识别和检索工具。

orthofisher: a broadly applicable tool for automated gene identification and retrieval.

机构信息

Department of Biological Sciences, Vanderbilt University , Nashville, TN 37235, USA.

出版信息

G3 (Bethesda). 2021 Sep 6;11(9). doi: 10.1093/g3journal/jkab250.

DOI:10.1093/g3journal/jkab250
PMID:34544141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8496211/
Abstract

Identification and retrieval of genes of interest from genomic data are an essential step for many bioinformatic applications. We present orthofisher, a command-line tool for automated identification and retrieval of genes with high sequence similarity to a query profile Hidden Markov Model sequence alignment across a set of proteomes. Performance assessment of orthofisher revealed high accuracy and precision during single-copy orthologous gene identification. orthofisher may be useful for assessing gene annotation quality, identifying single-copy orthologous genes for phylogenomic analyses, estimating gene copy number, and other evolutionary analyses that rely on identification and retrieval of homologous genes from genomic data. orthofisher comes complete with comprehensive documentation (https://jlsteenwyk.com/orthofisher/), is freely available under the MIT license, and is available for download from GitHub (https://github.com/JLSteenwyk/orthofisher), PyPi (https://pypi.org/project/orthofisher/), and the Anaconda Cloud (https://anaconda.org/jlsteenwyk/orthofisher).

摘要

从基因组数据中识别和提取感兴趣的基因是许多生物信息学应用的一个基本步骤。我们介绍了 orthofisher,这是一个命令行工具,用于自动识别和检索与查询轮廓 Hidden Markov Model 序列比对具有高度序列相似性的基因,这些比对跨越了一组蛋白质组。orthofisher 的性能评估显示,在单拷贝直系同源基因识别中具有很高的准确性和精度。orthofisher 可用于评估基因注释质量、识别用于系统发育分析的单拷贝直系同源基因、估计基因拷贝数以及其他依赖于从基因组数据中识别和检索同源基因的进化分析。orthofisher 随附有全面的文档(https://jlsteenwyk.com/orthofisher/),根据 MIT 许可证免费提供,并可从 GitHub(https://github.com/JLSteenwyk/orthofisher/)、PyPi(https://pypi.org/project/orthofisher/)和 Anaconda Cloud(https://anaconda.org/jlsteenwyk/orthofisher/)下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eae/8496211/137c4c05f442/jkab250f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eae/8496211/137c4c05f442/jkab250f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eae/8496211/137c4c05f442/jkab250f1.jpg

相似文献

1
orthofisher: a broadly applicable tool for automated gene identification and retrieval.orthofisher:一种广泛适用的自动化基因识别和检索工具。
G3 (Bethesda). 2021 Sep 6;11(9). doi: 10.1093/g3journal/jkab250.
2
BioKIT: a versatile toolkit for processing and analyzing diverse types of sequence data.BioKIT:一个用于处理和分析多种类型序列数据的多功能工具包。
Genetics. 2022 Jul 4;221(3). doi: 10.1093/genetics/iyac079.
3
PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data.PhyKIT:一个广泛适用的用于处理和分析系统发育基因组数据的UNIX shell工具包。
Bioinformatics. 2021 Aug 25;37(16):2325-2331. doi: 10.1093/bioinformatics/btab096.
4
GOThresher: a program to remove annotation biases from protein function annotation datasets.GOThresher:一个用于去除蛋白质功能注释数据集中注释偏差的程序。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad048.
5
Trackplot: A flexible toolkit for combinatorial analysis of genomic data.轨迹图:用于基因组数据组合分析的灵活工具包。
PLoS Comput Biol. 2023 Sep 5;19(9):e1011477. doi: 10.1371/journal.pcbi.1011477. eCollection 2023 Sep.
6
genomepy: genes and genomes at your fingertips.genomepy:指尖上的基因和基因组。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad119.
7
JustOrthologs: a fast, accurate and user-friendly ortholog identification algorithm.JustOrthologs:一种快速、准确且用户友好的直系同源基因识别算法。
Bioinformatics. 2019 Feb 15;35(4):546-552. doi: 10.1093/bioinformatics/bty669.
8
Efficient population-scale variant analysis and prioritization with VAPr.利用 VAPr 进行高效的群体规模变异分析和优先级排序。
Bioinformatics. 2018 Aug 15;34(16):2843-2845. doi: 10.1093/bioinformatics/bty192.
9
PyHMMER: a Python library binding to HMMER for efficient sequence analysis.PyHMMER:一个绑定到 HMMER 的 Python 库,用于高效的序列分析。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad214.
10
Patchwork: Alignment-Based Retrieval and Concatenation of Phylogenetic Markers from Genomic Data.拼接:基于比对的基因组数据中系统发育标记的检索和拼接。
Genome Biol Evol. 2023 Dec 1;15(12). doi: 10.1093/gbe/evad227.

引用本文的文献

1
Stable hypermutators revealed by the genomic landscape of DNA repair genes among yeast species.通过酵母物种中DNA修复基因的基因组格局揭示的稳定超突变体
bioRxiv. 2025 Mar 17:2025.03.15.643480. doi: 10.1101/2025.03.15.643480.
2
Convergent reductive evolution in bee-associated lactic acid bacteria.蜂源乳酸菌的趋同进化。
Appl Environ Microbiol. 2024 Nov 20;90(11):e0125724. doi: 10.1128/aem.01257-24. Epub 2024 Oct 23.
3
A taxon-rich and genome-scale phylogeny of Opisthokonta.后生动物的一个富含分类群和基因组规模的系统发育关系。

本文引用的文献

1
PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data.PhyKIT:一个广泛适用的用于处理和分析系统发育基因组数据的UNIX shell工具包。
Bioinformatics. 2021 Aug 25;37(16):2325-2331. doi: 10.1093/bioinformatics/btab096.
2
ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference.ClipKIT:一种用于准确系统发育推断的多重序列比对修剪软件。
PLoS Biol. 2020 Dec 2;18(12):e3001007. doi: 10.1371/journal.pbio.3001007. eCollection 2020 Dec.
3
OrthoFinder: phylogenetic orthology inference for comparative genomics.
PLoS Biol. 2024 Sep 16;22(9):e3002794. doi: 10.1371/journal.pbio.3002794. eCollection 2024 Sep.
4
Inferences on the evolution of the ascorbic acid synthesis pathway in insects using Phylogenetic Tree Collapser (PTC), a tool for the automated collapsing of phylogenetic trees using taxonomic information.利用 Phylogenetic Tree Collapser(PTC)推断昆虫抗坏血酸合成途径的进化,PTC 是一种使用分类学信息自动折叠系统发育树的工具。
J Integr Bioinform. 2024 Jul 24;21(2). doi: 10.1515/jib-2023-0051. eCollection 2024 Jun 1.
5
Convergent reductive evolution in bee-associated lactic acid bacteria.蜜蜂相关乳酸菌中的趋同还原进化
bioRxiv. 2024 Jul 2:2024.06.28.601270. doi: 10.1101/2024.06.28.601270.
6
Extensive remodeling of sugar metabolism through gene loss and horizontal gene transfer in a eukaryotic lineage.在一个真核生物谱系中,通过基因丢失和水平基因转移对糖代谢进行广泛改造。
BMC Biol. 2024 May 30;22(1):128. doi: 10.1186/s12915-024-01929-7.
7
Genomic factors shape carbon and nitrogen metabolic niche breadth across Saccharomycotina yeasts.基因组因素塑造了子囊菌酵母中碳和氮代谢生态位宽度。
Science. 2024 Apr 26;384(6694):eadj4503. doi: 10.1126/science.adj4503.
8
Fossil-calibrated molecular clock data enable reconstruction of steps leading to differentiated multicellularity and anisogamy in the Volvocine algae.化石校准的分子钟数据使得重建导致轮藻门藻类中分化的多细胞性和异形配子的步骤成为可能。
BMC Biol. 2024 Apr 10;22(1):79. doi: 10.1186/s12915-024-01878-1.
9
Phylogenomics reveals extensive misidentification of fungal strains from the genus .系统发育基因组学揭示了属真菌菌株的广泛错误鉴定。
Microbiol Spectr. 2024 Apr 2;12(4):e0398023. doi: 10.1128/spectrum.03980-23. Epub 2024 Mar 6.
10
Amino Acid Chirality: Stereospecific Conversion and Physiological Implications.氨基酸手性:立体特异性转化及其生理意义
ACS Omega. 2024 Jan 26;9(5):5084-5099. doi: 10.1021/acsomega.3c08305. eCollection 2024 Feb 6.
OrthoFinder:用于比较基因组学的系统发育直系同源推断。
Genome Biol. 2019 Nov 14;20(1):238. doi: 10.1186/s13059-019-1832-y.
4
Challenges and recommendations to improve the installability and archival stability of omics computational tools.提高组学计算工具可安装性和档案稳定性的挑战和建议。
PLoS Biol. 2019 Jun 20;17(6):e3000333. doi: 10.1371/journal.pbio.3000333. eCollection 2019 Jun.
5
Extensive loss of cell-cycle and DNA repair genes in an ancient lineage of bipolar budding yeasts.在古老的双相出芽酵母谱系中,细胞周期和 DNA 修复基因广泛缺失。
PLoS Biol. 2019 May 21;17(5):e3000255. doi: 10.1371/journal.pbio.3000255. eCollection 2019 May.
6
The State of Software for Evolutionary Biology.进化生物学软件现状
Mol Biol Evol. 2018 May 1;35(5):1037-1046. doi: 10.1093/molbev/msy014.
7
BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics.BUSCO的应用:从质量评估到基因预测和系统发育基因组学
Mol Biol Evol. 2018 Mar 1;35(3):543-548. doi: 10.1093/molbev/msx319.
8
Orthologous Matrix (OMA) algorithm 2.0: more robust to asymmetric evolutionary rates and more scalable hierarchical orthologous group inference.直系同源矩阵(OMA)算法2.0:对不对称进化速率更具鲁棒性,且在分层直系同源组推断方面更具扩展性。
Bioinformatics. 2017 Jul 15;33(14):i75-i82. doi: 10.1093/bioinformatics/btx229.
9
Fast and sensitive protein alignment using DIAMOND.使用 DIAMOND 进行快速灵敏的蛋白质比对。
Nat Methods. 2015 Jan;12(1):59-60. doi: 10.1038/nmeth.3176. Epub 2014 Nov 17.
10
Functional and evolutionary implications of gene orthology.基因直系同源的功能和进化意义。
Nat Rev Genet. 2013 May;14(5):360-6. doi: 10.1038/nrg3456. Epub 2013 Apr 4.