• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基因组和文献挖掘实现基因与表型的系统关联。

Systematic association of genes to phenotypes by genome and literature mining.

作者信息

Korbel Jan O, Doerks Tobias, Jensen Lars J, Perez-Iratxeta Carolina, Kaczanowski Szymon, Hooper Sean D, Andrade Miguel A, Bork Peer

机构信息

European Molecular Biology Laboratory, Heidelberg, Germany.

出版信息

PLoS Biol. 2005 May;3(5):e134. doi: 10.1371/journal.pbio.0030134. Epub 2005 Apr 5.

DOI:10.1371/journal.pbio.0030134
PMID:15799710
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1073694/
Abstract

One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene-phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.

摘要

功能基因组学的主要挑战之一是揭示基因型与表型之间的联系。到目前为止,还没有全局分析尝试根据自然界中观察到的巨大表型变异性来探索这些联系。在这里,我们使用一种无监督的系统方法来关联基因和表型特征,该方法将文献挖掘与比较基因组分析相结合。我们首先在MEDLINE文献数据库中挖掘反映物种表型相似性的术语。随后,我们预测可能的基因组决定因素:各自基因组中特有的基因。在一项涉及92个原核生物基因组的全局分析中,我们检索到323个聚类,总共包含2700个显著的基因-表型关联。一些聚类大多包含已知的关系,例如参与运动或植物降解的基因,通常还伴有与这些表型相关的其他假设蛋白质。其他聚类包含意想不到的关联;例如,一组与食物和腐败相关的术语与预计参与细菌性食物中毒的基因相关联。在这些聚类中,我们观察到与致病性相关的关联富集,这表明该方法揭示了许多可能在传染病中起作用的新基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/b2d622b845f4/pbio.0030134.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/3ce5e74d558f/pbio.0030134.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/838e3d88c7f9/pbio.0030134.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/6c5c69ef0175/pbio.0030134.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/b2d622b845f4/pbio.0030134.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/3ce5e74d558f/pbio.0030134.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/838e3d88c7f9/pbio.0030134.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/6c5c69ef0175/pbio.0030134.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74b7/1110903/b2d622b845f4/pbio.0030134.g004.jpg

相似文献

1
Systematic association of genes to phenotypes by genome and literature mining.通过基因组和文献挖掘实现基因与表型的系统关联。
PLoS Biol. 2005 May;3(5):e134. doi: 10.1371/journal.pbio.0030134. Epub 2005 Apr 5.
2
From genotype to phenotype in bovine functional genomics.从牛功能基因组学到表型。
Brief Funct Genomics. 2011 May;10(3):165-71. doi: 10.1093/bfgp/elr019.
3
Phenotype mining for functional genomics and gene discovery.用于功能基因组学和基因发现的表型挖掘
Methods Mol Biol. 2011;760:159-73. doi: 10.1007/978-1-61779-176-5_10.
4
The landscape of microbial phenotypic traits and associated genes.微生物表型特征及相关基因图谱。
Nucleic Acids Res. 2016 Dec 1;44(21):10074-10090. doi: 10.1093/nar/gkw964. Epub 2016 Oct 24.
5
Association of genes to genetically inherited diseases using data mining.利用数据挖掘技术将基因与遗传疾病相关联。
Nat Genet. 2002 Jul;31(3):316-9. doi: 10.1038/ng895. Epub 2002 May 13.
6
A text-mining analysis of the human phenome.人类表型组的文本挖掘分析
Eur J Hum Genet. 2006 May;14(5):535-42. doi: 10.1038/sj.ejhg.5201585.
7
Extending the Bacillus cereus group genomics to putative food-borne pathogens of different toxicity.将蜡样芽孢杆菌群基因组学扩展至不同毒性的潜在食源性病原体。
Chem Biol Interact. 2008 Jan 30;171(2):236-49. doi: 10.1016/j.cbi.2007.03.003. Epub 2007 Mar 12.
8
From genome mining to phenotypic microarrays: Planctomycetes as source for novel bioactive molecules.从基因组挖掘到表型微阵列:研究真细菌作为新型生物活性分子的来源。
Antonie Van Leeuwenhoek. 2013 Oct;104(4):551-67. doi: 10.1007/s10482-013-0007-1. Epub 2013 Aug 27.
9
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
10
Strepto-DB, a database for comparative genomics of group A (GAS) and B (GBS) streptococci, implemented with the novel database platform 'Open Genome Resource' (OGeR).链球菌数据库(Strepto-DB),用于A群(GAS)和B群(GBS)链球菌比较基因组学的数据库,采用新型数据库平台“开放基因组资源”(OGeR)实现。
Nucleic Acids Res. 2009 Jan;37(Database issue):D494-8. doi: 10.1093/nar/gkn674. Epub 2008 Oct 14.

引用本文的文献

1
Leveraging strain competition to address antimicrobial resistance with microbiota therapies.利用菌株竞争通过微生物群疗法解决抗生素耐药性问题。
Gut Microbes. 2025 Dec;17(1):2488046. doi: 10.1080/19490976.2025.2488046. Epub 2025 Apr 7.
2
Learning genotype-phenotype associations from gaps in multi-species sequence alignments.从多物种序列比对的缺口处学习基因型-表型关联。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf022.
3
From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry.

本文引用的文献

1
Ranking the whole MEDLINE database according to a large training set using text indexing.使用文本索引根据一个大型训练集对整个MEDLINE数据库进行排名。
BMC Bioinformatics. 2005 Mar 24;6:75. doi: 10.1186/1471-2105-6-75.
2
Chemotaxis-guided movements in bacteria.细菌中的趋化性引导运动。
Crit Rev Oral Biol Med. 2004 Jul 1;15(4):207-20. doi: 10.1177/154411130401500404.
3
A global role for Fis in the transcriptional control of metabolism and type III secretion in Salmonella enterica serovar Typhimurium.Fis在肠炎沙门氏菌鼠伤寒血清型代谢转录调控及III型分泌中的全局作用。
从基因型到表型:推断与食品工业相关的微生物特性的计算方法。
FEMS Microbiol Rev. 2023 Jul 5;47(4). doi: 10.1093/femsre/fuad030.
4
Polyamine and Ethanolamine Metabolism in Bacteria as an Important Component of Nitrogen Assimilation for Survival and Pathogenicity.细菌中的多胺和乙醇胺代谢作为生存和致病性氮同化的重要组成部分。
Med Sci (Basel). 2022 Jul 29;10(3):40. doi: 10.3390/medsci10030040.
5
Mechanism of the Gut Microbiota Colonization Resistance and Enteric Pathogen Infection.肠道微生物定植抵抗力与肠道病原体感染的机制。
Front Cell Infect Microbiol. 2021 Dec 23;11:716299. doi: 10.3389/fcimb.2021.716299. eCollection 2021.
6
Deep semi-supervised learning ensemble framework for classifying co-mentions of human proteins and phenotypes.深度半监督学习集成框架,用于分类人类蛋白质和表型的共提及。
BMC Bioinformatics. 2021 Oct 16;22(1):500. doi: 10.1186/s12859-021-04421-z.
7
Bacterial Microcompartment-Dependent 1,2-Propanediol Utilization of .细菌微区室依赖性1,2-丙二醇的利用 。 (你提供的原文似乎不完整,翻译可能不太能准确反映完整意思,你可以补充完整原文以便更准确翻译 。)
Front Microbiol. 2021 May 12;12:679827. doi: 10.3389/fmicb.2021.679827. eCollection 2021.
8
The support of genetic evidence for cardiovascular risk induced by antineoplastic drugs.抗肿瘤药物引起心血管风险的遗传证据支持。
Sci Adv. 2020 Oct 14;6(42). doi: 10.1126/sciadv.abb8543. Print 2020 Oct.
9
Comparative Genomic Analysis Reveals Novel Microcompartment-Associated Metabolic Pathways in the Human Gut Microbiome.比较基因组分析揭示了人类肠道微生物群中与新型微区室相关的代谢途径。
Front Genet. 2019 Jul 4;10:636. doi: 10.3389/fgene.2019.00636. eCollection 2019.
10
Ontology based text mining of gene-phenotype associations: application to candidate gene prediction.基于本体论的基因-表型关联的文本挖掘:在候选基因预测中的应用。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz019.
Microbiology (Reading). 2004 Jul;150(Pt 7):2037-2053. doi: 10.1099/mic.0.27209-0.
4
Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs.基因组背景分析:从保守的双向转录基因对预测功能关联。
Nat Biotechnol. 2004 Jul;22(7):911-7. doi: 10.1038/nbt988.
5
Role of Clostridium perfringens phospholipase C in the pathogenesis of gas gangrene.产气荚膜梭菌磷脂酶C在气性坏疽发病机制中的作用。
Toxicon. 2003 Dec 15;42(8):979-86. doi: 10.1016/j.toxicon.2003.11.013.
6
A cross-genomic approach for systematic mapping of phenotypic traits to genes.一种用于将表型特征系统映射到基因的跨基因组方法。
Genome Res. 2004 Jan;14(1):109-15. doi: 10.1101/gr.1586704.
7
Global regulation by CsrA in Salmonella typhimurium.鼠伤寒沙门氏菌中CsrA的全局调控
Mol Microbiol. 2003 Jun;48(6):1633-45. doi: 10.1046/j.1365-2958.2003.03535.x.
8
Amidase domains from bacterial and phage autolysins define a family of gamma-D,L-glutamate-specific amidohydrolases.来自细菌和噬菌体自溶素的酰胺酶结构域定义了一个γ-D,L-谷氨酸特异性酰胺水解酶家族。
Trends Biochem Sci. 2003 May;28(5):230-4. doi: 10.1016/s0968-0004(03)00062-8.
9
Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T.福氏志贺菌2a血清型2457T菌株的全基因组序列及比较基因组学
Infect Immun. 2003 May;71(5):2775-86. doi: 10.1128/IAI.71.5.2775-2786.2003.
10
Potential genomic determinants of hyperthermophily.嗜热菌的潜在基因组决定因素。
Trends Genet. 2003 Apr;19(4):172-6. doi: 10.1016/S0168-9525(03)00047-7.