• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

geneHummus:一个用于定义豆科植物及其以外的基因家族及其表达的 R 包。

geneHummus: an R package to define gene families and their expression in legumes and beyond.

机构信息

Department of Genetics ETSIAM, University of Córdoba, Córdoba, Spain.

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20894, USA.

出版信息

BMC Genomics. 2019 Jul 18;20(1):591. doi: 10.1186/s12864-019-5952-2.

DOI:10.1186/s12864-019-5952-2
PMID:31319791
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6639926/
Abstract

BACKGROUND

During the last decade, plant biotechnological laboratories have sparked a monumental revolution with the rapid development of next sequencing technologies at affordable prices. Soon, these sequencing technologies and assembling of whole genomes will extend beyond the plant computational biologists and become commonplace within the plant biology disciplines. The current availability of large-scale genomic resources for non-traditional plant model systems (the so-called 'orphan crops') is enabling the construction of high-density integrated physical and genetic linkage maps with potential applications in plant breeding. The newly available fully sequenced plant genomes represent an incredible opportunity for comparative analyses that may reveal new aspects of genome biology and evolution. The analysis of the expansion and evolution of gene families across species is a common approach to infer biological functions. To date, the extent and role of gene families in plants has only been partially addressed and many gene families remain to be investigated. Manual identification of gene families is highly time-consuming and laborious, requiring an iterative process of manual and computational analysis to identify members of a given family, typically combining numerous BLAST searches and manually cleaning data. Due to the increasing abundance of genome sequences and the agronomical interest in plant gene families, the field needs a clear, automated annotation tool.

RESULTS

Here, we present the geneHummus package, an R-based pipeline for the identification and characterization of plant gene families. The impact of this pipeline comes from a reduction in hands-on annotation time combined with high specificity and sensitivity in extracting only proteins from the RefSeq database and providing the conserved domain architectures based on SPARCLE. As a case study we focused on the auxin receptor factors gene (ARF) family in Cicer arietinum (chickpea) and other legumes.

CONCLUSION

We anticipate that our pipeline should be suitable for any taxonomic plant family, and likely other gene families, vastly improving the speed and ease of genomic data processing.

摘要

背景

在过去的十年中,随着价格合理的下一代测序技术的快速发展,植物生物技术实验室引发了一场巨大的革命。很快,这些测序技术和整个基因组的组装将超越植物计算生物学家的范畴,在植物生物学领域变得司空见惯。目前,非传统植物模式系统(所谓的“孤儿作物”)的大规模基因组资源的可用性正在使构建具有高密度整合物理和遗传连锁图谱成为可能,这些图谱具有在植物育种中的潜在应用。新获得的全序列植物基因组为比较分析提供了一个极好的机会,这可能揭示基因组生物学和进化的新方面。跨物种基因家族的扩张和进化分析是推断生物学功能的常用方法。迄今为止,植物中基因家族的范围和作用仅部分得到解决,许多基因家族仍有待研究。基因家族的手动鉴定非常耗时且费力,需要手动和计算分析的迭代过程来识别给定家族的成员,通常需要结合多次 BLAST 搜索并手动清理数据。由于基因组序列的大量增加以及植物基因家族在农业方面的兴趣,该领域需要一个清晰、自动化的注释工具。

结果

在这里,我们介绍了 geneHummus 包,这是一种基于 R 的植物基因家族鉴定和特征分析的管道。该管道的影响来自于手动注释时间的减少,同时在从 RefSeq 数据库中提取仅蛋白质方面具有高度的特异性和敏感性,并基于 SPARCLE 提供保守结构域架构。作为一个案例研究,我们专注于鹰嘴豆(鹰嘴豆)和其他豆科植物中的生长素受体因子(ARF)基因家族。

结论

我们预计我们的管道应该适用于任何分类植物家族,并且可能适用于其他基因家族,极大地提高了基因组数据处理的速度和易用性。

相似文献

1
geneHummus: an R package to define gene families and their expression in legumes and beyond.geneHummus:一个用于定义豆科植物及其以外的基因家族及其表达的 R 包。
BMC Genomics. 2019 Jul 18;20(1):591. doi: 10.1186/s12864-019-5952-2.
2
Genome-wide identification of the auxin response factor gene family in Cicer arietinum.鹰嘴豆中生长素反应因子基因家族的全基因组鉴定。
BMC Genomics. 2018 Apr 27;19(1):301. doi: 10.1186/s12864-018-4695-9.
3
The chickpea genomic web resource: visualization and analysis of the desi-type Cicer arietinum nuclear genome for comparative exploration of legumes.鹰嘴豆基因组网络资源:用于豆类比较探索的栽培型鹰嘴豆核基因组可视化与分析
BMC Plant Biol. 2014 Dec 18;14:315. doi: 10.1186/s12870-014-0315-2.
4
5
The CarERF genes in chickpea (Cicer arietinum L.) and the identification of CarERF116 as abiotic stress responsive transcription factor.鹰嘴豆(Cicer arietinum L.)中的 CarERF 基因及其作为非生物胁迫响应转录因子的鉴定。
Funct Integr Genomics. 2015 Jan;15(1):27-46. doi: 10.1007/s10142-014-0399-7. Epub 2014 Oct 2.
6
Genome-wide identification and expression analyses of WRKY transcription factor family members from chickpea (Cicer arietinum L.) reveal their role in abiotic stress-responses.鹰嘴豆(Cicer arietinum L.)WRKY转录因子家族成员的全基因组鉴定与表达分析揭示了它们在非生物胁迫响应中的作用。
Genes Genomics. 2019 Apr;41(4):467-481. doi: 10.1007/s13258-018-00780-9. Epub 2019 Jan 12.
7
Genome-wide analysis of the basic leucine zipper (bZIP) transcription factor gene family in six legume genomes.六种豆科植物基因组中碱性亮氨酸拉链(bZIP)转录因子基因家族的全基因组分析。
BMC Genomics. 2015 Dec 10;16:1053. doi: 10.1186/s12864-015-2258-x.
8
Comprehensive transcriptome assembly of Chickpea (Cicer arietinum L.) using sanger and next generation sequencing platforms: development and applications.利用桑格测序和新一代测序平台对鹰嘴豆(Cicer arietinum L.)进行转录组全面组装:开发与应用
PLoS One. 2014 Jan 23;9(1):e86039. doi: 10.1371/journal.pone.0086039. eCollection 2014.
9
CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics.CTDB:一个用于功能基因组学和应用基因组学的综合鹰嘴豆转录组数据库。
PLoS One. 2015 Aug 31;10(8):e0136880. doi: 10.1371/journal.pone.0136880. eCollection 2015.
10
Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.豆科植物中同源异型框基因家族的全基因组分析:鉴定、基因复制及表达谱分析
PLoS One. 2015 Mar 6;10(3):e0119198. doi: 10.1371/journal.pone.0119198. eCollection 2015.

引用本文的文献

1
Efficient and easy gene expression and genetic variation data analysis and visualization using exvar.使用exvar进行高效便捷的基因表达和遗传变异数据分析及可视化。
Sci Rep. 2025 Apr 10;15(1):12264. doi: 10.1038/s41598-025-93067-5.
2
Targeting using natural African compounds as promising agents for managing diabetic complications.以天然非洲化合物为靶向,作为治疗糖尿病并发症的有前景药物。
Front Bioinform. 2025 Feb 6;5:1499255. doi: 10.3389/fbinf.2025.1499255. eCollection 2025.
3
Machine learning and molecular docking prediction of potential inhibitors against dengue virus.

本文引用的文献

1
Magic-BLAST, an accurate RNA-seq aligner for long and short reads.Magic-BLAST,一款适用于长读长和短读长的精确 RNA-seq 比对工具。
BMC Bioinformatics. 2019 Jul 25;20(1):405. doi: 10.1186/s12859-019-2996-x.
2
Genome-wide identification of the auxin response factor gene family in Cicer arietinum.鹰嘴豆中生长素反应因子基因家族的全基因组鉴定。
BMC Genomics. 2018 Apr 27;19(1):301. doi: 10.1186/s12864-018-4695-9.
3
Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.Ensembl Genomes 2018:一个用于非脊椎动物物种的综合组学基础设施。
登革病毒潜在抑制剂的机器学习与分子对接预测
Front Chem. 2024 Dec 24;12:1510029. doi: 10.3389/fchem.2024.1510029. eCollection 2024.
4
Prostruc: an open-source tool for 3D structure prediction using homology modeling.Prostruc:一种使用同源建模进行三维结构预测的开源工具。
Front Chem. 2024 Nov 29;12:1509407. doi: 10.3389/fchem.2024.1509407. eCollection 2024.
5
NeuroVar: an open-source tool for the visualization of gene expression and variation data for biomarkers of neurological diseases.NeuroVar:一种用于可视化神经疾病生物标志物基因表达和变异数据的开源工具。
GigaByte. 2024 Nov 25;2024:gigabyte143. doi: 10.46471/gigabyte.143. eCollection 2024.
6
Phenotypic and genetic characterization of a near-isogenic line pair: insights into flowering time in chickpea.表型和遗传特征分析近等基因系:菜豆开花时间的研究。
BMC Plant Biol. 2024 Jul 25;24(1):709. doi: 10.1186/s12870-024-05411-y.
7
Investigating antimicrobial resistance genes in Kenya, Uganda and Tanzania cattle using metagenomics.利用宏基因组学研究肯尼亚、乌干达和坦桑尼亚牛中的抗菌药物耐药基因。
PeerJ. 2024 Apr 22;12:e17181. doi: 10.7717/peerj.17181. eCollection 2024.
Nucleic Acids Res. 2018 Jan 4;46(D1):D802-D808. doi: 10.1093/nar/gkx1011.
4
MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.MAFFT 在线服务:多序列比对、交互式序列选择和可视化。
Brief Bioinform. 2019 Jul 19;20(4):1160-1166. doi: 10.1093/bib/bbx108.
5
Auxin Response Factor Genes Repertoire in Mulberry: Identification, and Structural, Functional and Evolutionary Analyses.桑树中的生长素响应因子基因库:鉴定、结构、功能及进化分析
Genes (Basel). 2017 Aug 25;8(9):202. doi: 10.3390/genes8090202.
6
Manipulation of Auxin Response Factor 19 affects seed size in the woody perennial Jatropha curcas.生长素反应因子 19 的操纵影响木本多年生植物麻疯树的种子大小。
Sci Rep. 2017 Jan 19;7:40844. doi: 10.1038/srep40844.
7
CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.CDD/SPARCLE:通过亚家族结构域架构对蛋白质进行功能分类
Nucleic Acids Res. 2017 Jan 4;45(D1):D200-D203. doi: 10.1093/nar/gkw1129. Epub 2016 Nov 29.
8
A comprehensive draft genome sequence for lupin (Lupinus angustifolius), an emerging health food: insights into plant-microbe interactions and legume evolution.羽扇豆(窄叶羽扇豆)——一种新兴健康食品的全基因组序列草图:对植物-微生物相互作用和豆科植物进化的见解
Plant Biotechnol J. 2017 Mar;15(3):318-330. doi: 10.1111/pbi.12615. Epub 2016 Sep 23.
9
Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza.药用模式植物丹参生长素响应因子基因家族成员的全基因组分析。
Biol Open. 2016 Jun 15;5(6):848-57. doi: 10.1242/bio.017178.
10
The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut.栽培花生的二倍体祖先——刺山柑和安第斯花生的基因组序列。
Nat Genet. 2016 Apr;48(4):438-46. doi: 10.1038/ng.3517. Epub 2016 Feb 22.