• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WGDdetector:一个使用基因组或转录组注释来检测全基因组复制事件的流水线。

WGDdetector: a pipeline for detecting whole genome duplication events using the genome or transcriptome annotations.

机构信息

CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, 666303, Yunnan, China.

State Key Laboratory of Grassland Agro-Ecosystem, College of Life Sciences, Lanzhou University, Lanzhou, China.

出版信息

BMC Bioinformatics. 2019 Feb 13;20(1):75. doi: 10.1186/s12859-019-2670-3.

DOI:10.1186/s12859-019-2670-3
PMID:30760221
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6375192/
Abstract

BACKGROUND

With the availability of well-assembled genomes of a growing number of organisms, identifying the bioinformatic basis of whole genome duplication (WGD) is a growing field of genomics. The most extant software for detecting footprints of WGDs has been restricted to a well-assembled genome. However, the massive poor quality genomes and the more accessible transcriptomes have been largely ignored, and in theoretically they are also likely to contribute to detect WGD using dS based method. Here, to resolve these problems, we have designed a universal and simple technical tool WGDdetector for detecting WGDs using either genome or transcriptome annotations in different organisms based on the widely used dS based method.

RESULTS

We have constructed WGDdetector pipeline that integrates all analyses including gene family constructing, dS estimating and phasing, and outputting the dS values of each paralogs pairs processed with only one command. We further chose four species (Arabidopsis thaliana, Juglans regia, Populus trichocarpa and Xenopus laevis) representing herb, wood and animal, to test its practicability. Our final results showed a high degree of accuracy with the previous studies using both genome and transcriptome data.

CONCLUSION

WGDdetector is not only reliable and stable for genome data, but also a new way to using the transcriptome data to obtain the correct dS distribution for detecting WGD. The source code is freely available, and is implemented in Windows and Linux operation system.

摘要

背景

随着越来越多生物体的基因组组装变得可行,识别全基因组复制(WGD)的生物信息学基础是基因组学日益增长的领域。最先进的用于检测 WGD 足迹的软件一直局限于组装良好的基因组。然而,大量质量较差的基因组和更容易获得的转录组在很大程度上被忽视了,从理论上讲,它们也可能有助于使用基于 dS 的方法检测 WGD。在这里,为了解决这些问题,我们设计了一个通用且简单的技术工具 WGDdetector,用于根据广泛使用的基于 dS 的方法,使用不同生物体的基因组或转录组注释来检测 WGD。

结果

我们构建了 WGDdetector 流水线,该流水线集成了所有分析,包括基因家族构建、dS 估计和分相,并通过仅一条命令处理每个同源基因对的 dS 值。我们进一步选择了四个物种(拟南芥、核桃、杨树和非洲爪蟾),分别代表草本、木本和动物,以测试其实用性。我们的最终结果显示,使用基因组和转录组数据的先前研究具有高度准确性。

结论

WGDdetector 不仅对基因组数据可靠且稳定,而且是一种使用转录组数据获得正确 dS 分布以检测 WGD 的新方法。源代码是免费的,并在 Windows 和 Linux 操作系统中实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b238/6375192/0cac3658e660/12859_2019_2670_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b238/6375192/84d1186366cb/12859_2019_2670_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b238/6375192/0cac3658e660/12859_2019_2670_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b238/6375192/84d1186366cb/12859_2019_2670_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b238/6375192/0cac3658e660/12859_2019_2670_Fig2_HTML.jpg

相似文献

1
WGDdetector: a pipeline for detecting whole genome duplication events using the genome or transcriptome annotations.WGDdetector:一个使用基因组或转录组注释来检测全基因组复制事件的流水线。
BMC Bioinformatics. 2019 Feb 13;20(1):75. doi: 10.1186/s12859-019-2670-3.
2
Comparative genomics of six Juglans species reveals disease-associated gene family contractions.六组核桃属物种的比较基因组学揭示了与疾病相关的基因家族收缩。
Plant J. 2020 Apr;102(2):410-423. doi: 10.1111/tpj.14630. Epub 2020 Jan 16.
3
Phylogenomics reveals an extensive history of genome duplication in diatoms (Bacillariophyta).系统发生基因组学揭示了硅藻(Bacillariophyta)中广泛的基因组复制历史。
Am J Bot. 2018 Mar;105(3):330-347. doi: 10.1002/ajb2.1056. Epub 2018 Apr 17.
4
wgd-simple command line tools for the analysis of ancient whole-genome duplications.wgd-simple 命令行工具,用于分析古代全基因组复制。
Bioinformatics. 2019 Jun 1;35(12):2153-2155. doi: 10.1093/bioinformatics/bty915.
5
De novo Transcriptome Assemblies of Rana (Lithobates) catesbeiana and Xenopus laevis Tadpole Livers for Comparative Genomics without Reference Genomes.无参考基因组情况下用于比较基因组学的牛蛙和非洲爪蟾蝌蚪肝脏的从头转录组组装
PLoS One. 2015 Jun 29;10(6):e0130720. doi: 10.1371/journal.pone.0130720. eCollection 2015.
6
Impact of whole-genome duplication events on diversification rates in angiosperms.全基因组加倍事件对被子植物多样化速率的影响。
Am J Bot. 2018 Mar;105(3):348-363. doi: 10.1002/ajb2.1060. Epub 2018 May 2.
7
Evolutionary history and functional divergence of the cytochrome P450 gene superfamily between Arabidopsis thaliana and Brassica species uncover effects of whole genome and tandem duplications.拟南芥和芸苔属物种之间细胞色素P450基因超家族的进化历史和功能分化揭示了全基因组和串联重复的影响。
BMC Genomics. 2017 Sep 18;18(1):733. doi: 10.1186/s12864-017-4094-7.
8
Evaluating and Characterizing Ancient Whole-Genome Duplications in Plants with Gene Count Data.利用基因计数数据评估和表征植物中的古代全基因组复制
Genome Biol Evol. 2016 Apr 11;8(4):1023-37. doi: 10.1093/gbe/evw058.
9
Genomics: cracking the mysteries of walnuts.基因组学:破解核桃的奥秘。
J Genet. 2019 Jun;98(2).
10
A De-Novo Genome Analysis Pipeline (DeNoGAP) for large-scale comparative prokaryotic genomics studies.一种用于大规模比较原核生物基因组学研究的从头基因组分析流程(DeNoGAP)。
BMC Bioinformatics. 2016 Jun 30;17(1):260. doi: 10.1186/s12859-016-1142-2.

引用本文的文献

1
Metabolome and transcriptome association study reveals biosynthesis of specialized benzylisoquinoline alkaloids in .代谢组学与转录组学关联研究揭示了……中特殊苄基异喹啉生物碱的生物合成。
Chin Herb Med. 2024 Nov 9;17(1):178-188. doi: 10.1016/j.chmed.2024.11.003. eCollection 2025 Jan.
2
Extreme overall mushroom genome expansion in Mycena s.s. irrespective of plant hosts or substrate specializations.无论植物宿主或基质特化如何,蕈类属中的蕈类总体基因组都经历了极端扩张。
Cell Genom. 2024 Jul 10;4(7):100586. doi: 10.1016/j.xgen.2024.100586. Epub 2024 Jun 27.
3
Polyploidization of Indotyphlops braminus: evidence from isoform-sequencing.

本文引用的文献

1
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets.MMseqs2支持进行灵敏的蛋白质序列搜索,以分析海量数据集。
Nat Biotechnol. 2017 Nov;35(11):1026-1028. doi: 10.1038/nbt.3988. Epub 2017 Oct 16.
2
The evolutionary significance of polyploidy.多倍体的进化意义。
Nat Rev Genet. 2017 Jul;18(7):411-424. doi: 10.1038/nrg.2017.26. Epub 2017 May 15.
3
The walnut (Juglans regia) genome sequence reveals diversity in genes coding for the biosynthesis of non-structural polyphenols.核桃(胡桃)基因组序列揭示了非结构性多酚生物合成相关编码基因的多样性。
横纹钝头蛇的多倍体化:同工酶测序的证据。
BMC Genom Data. 2024 Feb 26;25(1):23. doi: 10.1186/s12863-024-01208-y.
4
PacBio Full-Length Transcriptome of a Tetraploid Provides Insights into the Evolution of Cavefish.四倍体洞穴鱼的PacBio全长转录组为洞穴鱼的进化提供了见解。
Animals (Basel). 2023 Nov 2;13(21):3399. doi: 10.3390/ani13213399.
5
Systematic dissection of genomic features determining the vast diversity of conotoxins.系统剖析决定芋螺毒素巨大多样性的基因组特征。
BMC Genomics. 2023 Oct 9;24(1):598. doi: 10.1186/s12864-023-09689-4.
6
The chromosome-scale genome assembly of cluster bean provides molecular insight into edible gum (galactomannan) biosynthesis family genes.菜豆染色体水平基因组组装为食用胶(半乳甘露聚糖)生物合成家族基因提供了分子见解。
Sci Rep. 2023 Jun 19;13(1):9941. doi: 10.1038/s41598-023-33762-3.
7
Chromosome-length genome assemblies and cytogenomic analyses of pangolins reveal remarkable chromosome counts and plasticity.穿山甲的染色体水平基因组组装和细胞基因组分析揭示了显著的染色体数目和可塑性。
Chromosome Res. 2023 Apr 12;31(2):13. doi: 10.1007/s10577-023-09722-y.
8
De Novo Assembly and Characterization of the Transcriptome of an Omnivorous Camel Cricket ().从头组装和鉴定杂食性驼螽转录组()。
Int J Mol Sci. 2023 Feb 16;24(4):4005. doi: 10.3390/ijms24044005.
9
Genome-partitioning strategy, plastid and nuclear phylogenomic discordance, and its evolutionary implications of (Ranunculaceae).毛茛科的基因组划分策略、质体与核系统发育基因组不一致性及其进化意义
Front Plant Sci. 2022 Nov 14;13:1059379. doi: 10.3389/fpls.2022.1059379. eCollection 2022.
10
High-quality genome assembly and pan-genome studies facilitate genetic discovery in mung bean and its improvement.高质量的基因组组装和泛基因组研究促进了绿豆的遗传发现及其改良。
Plant Commun. 2022 Nov 14;3(6):100352. doi: 10.1016/j.xplc.2022.100352. Epub 2022 Jun 26.
Plant J. 2016 Sep;87(5):507-32. doi: 10.1111/tpj.13207. Epub 2016 Jul 18.
4
Evaluating and Characterizing Ancient Whole-Genome Duplications in Plants with Gene Count Data.利用基因计数数据评估和表征植物中的古代全基因组复制
Genome Biol Evol. 2016 Apr 11;8(4):1023-37. doi: 10.1093/gbe/evw058.
5
Early genome duplications in conifers and other seed plants.松柏类植物和其他种子植物的早期基因组加倍。
Sci Adv. 2015 Nov 20;1(10):e1501084. doi: 10.1126/sciadv.1501084. eCollection 2015 Nov.
6
The wondrous cycles of polyploidy in plants.植物中多倍体的奇妙循环。
Am J Bot. 2015 Nov;102(11):1753-6. doi: 10.3732/ajb.1500320. Epub 2015 Oct 8.
7
Whole genome duplications in plants: an overview from Arabidopsis.植物中的全基因组加倍:以拟南芥为例的概述。
J Exp Bot. 2015 Dec;66(22):6991-7003. doi: 10.1093/jxb/erv432. Epub 2015 Sep 28.
8
Evaluation of BLAST-based edge-weighting metrics used for homology inference with the Markov Clustering algorithm.用于通过马尔可夫聚类算法进行同源性推断的基于BLAST的边加权指标评估。
BMC Bioinformatics. 2015 Jul 10;16:218. doi: 10.1186/s12859-015-0625-x.
9
Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.剂量亚功能化导致重复基因的维持与丢失
Mol Biol Evol. 2015 Aug;32(8):2141-8. doi: 10.1093/molbev/msv095. Epub 2015 Apr 22.
10
Comparative genomics as a time machine: how relative gene dosage and metabolic requirements shaped the time-dependent resolution of yeast polyploidy.比较基因组学作为时间机器:相对基因剂量和代谢需求如何塑造了酵母多倍体时间依赖性分辨率。
Mol Biol Evol. 2014 Dec;31(12):3184-93. doi: 10.1093/molbev/msu250. Epub 2014 Aug 25.