• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过微生物全基因组序列将基因组特征映射到功能性状。

Mapping genomic features to functional traits through microbial whole genome sequences.

作者信息

Zhang Wei, Zeng Erliang, Liu Dan, Jones Stuart E, Emrich Scott

机构信息

Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA.

Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA; Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA.

出版信息

Int J Bioinform Res Appl. 2014;10(4-5):461-78. doi: 10.1504/IJBRA.2014.062995.

DOI:10.1504/IJBRA.2014.062995
PMID:24989863
Abstract

Recently, the utility of trait-based approaches for microbial communities has been identified. Increasing availability of whole genome sequences provide the opportunity to explore the genetic foundations of a variety of functional traits. We proposed a machine learning framework to quantitatively link the genomic features with functional traits. Genes from bacteria genomes belonging to different functional traits were grouped to Cluster of Orthologs (COGs), and were used as features. Then, TF-IDF technique from the text mining domain was applied to transform the data to accommodate the abundance and importance of each COG. After TF-IDF processing, COGs were ranked using feature selection methods to identify their relevance to the functional trait of interest. Extensive experimental results demonstrated that functional trait related genes can be detected using our method. Further, the method has the potential to provide novel biological insights.

摘要

最近,基于特征的微生物群落研究方法的实用性已得到确认。全基因组序列可用性的增加为探索各种功能特征的遗传基础提供了机会。我们提出了一个机器学习框架,以定量地将基因组特征与功能特征联系起来。属于不同功能特征的细菌基因组中的基因被分组到直系同源簇(COG)中,并用作特征。然后,应用文本挖掘领域的TF-IDF技术对数据进行转换,以适应每个COG的丰度和重要性。经过TF-IDF处理后,使用特征选择方法对COG进行排序,以确定它们与感兴趣的功能特征的相关性。大量实验结果表明,使用我们的方法可以检测到与功能特征相关的基因。此外,该方法有可能提供新的生物学见解。

相似文献

1
Mapping genomic features to functional traits through microbial whole genome sequences.通过微生物全基因组序列将基因组特征映射到功能性状。
Int J Bioinform Res Appl. 2014;10(4-5):461-78. doi: 10.1504/IJBRA.2014.062995.
2
Prediction of microbial phenotypes based on comparative genomics.基于比较基因组学的微生物表型预测
BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S1. doi: 10.1186/1471-2105-16-S14-S1. Epub 2015 Oct 2.
3
Quantitatively Partitioning Microbial Genomic Traits among Taxonomic Ranks across the Microbial Tree of Life.定量划分生命之树上的微生物分类等级中的微生物基因组特征。
mSphere. 2019 Aug 28;4(4):e00446-19. doi: 10.1128/mSphere.00446-19.
4
Expanded microbial genome coverage and improved protein family annotation in the COG database.COG数据库中微生物基因组覆盖范围的扩大及蛋白质家族注释的改进。
Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26.
5
Insights from genome-wide approaches to identify variants associated to phenotypes at pan-genome scale: Application to L. monocytogenes' ability to grow in cold conditions.从全基因组方法中获得的关于表型相关变体的见解,在泛基因组范围内:应用于单核细胞增生李斯特菌在低温条件下生长的能力。
Int J Food Microbiol. 2019 Feb 16;291:181-188. doi: 10.1016/j.ijfoodmicro.2018.11.028. Epub 2018 Nov 29.
6
Pan-Genome Storage and Analysis Techniques.泛基因组存储与分析技术
Methods Mol Biol. 2018;1704:29-53. doi: 10.1007/978-1-4939-7463-4_2.
7
Information-theoretic approaches to SVM feature selection for metagenome read classification.基于信息论的支持向量机特征选择方法在宏基因组读分类中的应用。
Comput Biol Chem. 2011 Jun;35(3):199-209. doi: 10.1016/j.compbiolchem.2011.04.007. Epub 2011 May 13.
8
Millstone: software for multiplex microbial genome analysis and engineering.磨盘:用于多重微生物基因组分析与工程的软件。
Genome Biol. 2017 May 25;18(1):101. doi: 10.1186/s13059-017-1223-1.
9
An investigation into inter- and intragenomic variations of graphic genomic signatures.对图形基因组特征的基因组间和基因组内变异的调查。
BMC Bioinformatics. 2015 Aug 7;16:246. doi: 10.1186/s12859-015-0655-4.
10
GATA: a graphic alignment tool for comparative sequence analysis.GATA:一种用于比较序列分析的图形比对工具。
BMC Bioinformatics. 2005 Jan 17;6:9. doi: 10.1186/1471-2105-6-9.

引用本文的文献

1
A Protocol for Weighted Gene Co-expression Network Analysis With Module Preservation and Functional Enrichment Analysis for Tumor and Normal Transcriptomic Data.一种用于肿瘤和正常转录组数据的具有模块保留和功能富集分析的加权基因共表达网络分析方案。
Bio Protoc. 2025 Sep 20;15(18):e5447. doi: 10.21769/BioProtoc.5447.
2
Systematic review and meta-analysis of oral squamous cell carcinoma associated oral microbiome.口腔鳞状细胞癌相关口腔微生物群的系统评价和荟萃分析。
Front Microbiol. 2022 Oct 20;13:968304. doi: 10.3389/fmicb.2022.968304. eCollection 2022.
3
Interaction networks for identifying coupled molecular processes in microbial communities.
用于识别微生物群落中耦合分子过程的相互作用网络。
BioData Min. 2015 Jul 15;8:21. doi: 10.1186/s13040-015-0054-4. eCollection 2015.