表型组-基因组网络的构建及其意义

Creation and implications of a phenome-genome network.

作者信息

Butte Atul J, Kohane Isaac S

机构信息

Stanford Medical Informatics, Department of Medicine, Stanford University School of Medicine, 251 Campus Drive, Room X-215, Stanford, California 94305-5479, USA.

出版信息

Nat Biotechnol. 2006 Jan;24(1):55-62. doi: 10.1038/nbt1150.

DOI:10.1038/nbt1150

PMID:16404398

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2716377/

Abstract

Although gene and protein measurements are increasing in quantity and comprehensiveness, they do not characterize a sample's entire phenotype in an environmental or experimental context. Here we comprehensively consider associations between components of phenotype, genotype and environment to identify genes that may govern phenotype and responses to the environment. Context from the annotations of gene expression data sets in the Gene Expression Omnibus is represented using the Unified Medical Language System, a compendium of biomedical vocabularies with nearly 1-million concepts. After showing how data sets can be clustered by annotative concepts, we find a network of relations between phenotypic, disease, environmental and experimental contexts as well as genes with differential expression associated with these concepts. We identify novel genes related to concepts such as aging. Comprehensively identifying genes related to phenotype and environment is a step toward the Human Phenome Project.

摘要

尽管基因和蛋白质测量在数量和全面性上不断增加，但它们无法在环境或实验背景下完整地表征样本的整个表型。在此，我们全面考虑表型、基因型和环境各组成部分之间的关联，以识别可能控制表型及对环境反应的基因。基因表达综合数据库中基因表达数据集注释的背景信息通过统一医学语言系统来呈现，该系统是一个包含近100万个概念的生物医学词汇汇编。在展示了如何根据注释概念对数据集进行聚类后，我们发现了表型、疾病、环境和实验背景之间的关系网络，以及与这些概念相关的差异表达基因。我们识别出了与衰老等概念相关的新基因。全面识别与表型和环境相关的基因是迈向人类表型组计划的一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ceb/2716377/ebc31e0c440e/nihms-7545-f0001.jpg

相似文献

Creation and implications of a phenome-genome network.表型组-基因组网络的构建及其意义

Nat Biotechnol. 2006 Jan;24(1):55-62. doi: 10.1038/nbt1150.

Supervised inference of gene-regulatory networks.基因调控网络的监督推理

BMC Bioinformatics. 2008 Jan 4;9:2. doi: 10.1186/1471-2105-9-2.

An unsupervised approach to predict functional relations between genes based on expression data.一种基于表达数据预测基因间功能关系的无监督方法。

Biomed Res Int. 2014;2014:154594. doi: 10.1155/2014/154594. Epub 2014 Mar 31.

Extraction of biological interaction networks from scientific literature.从科学文献中提取生物相互作用网络。

Brief Bioinform. 2005 Sep;6(3):263-76. doi: 10.1093/bib/6.3.263.

A Unified Medical Language System (UMLS) based system for Literature-Based Discovery in medicine.一种基于统一医学语言系统（UMLS）的医学文献发现系统。

Stud Health Technol Inform. 2013;192:412-6.

Automatic reconstruction of a bacterial regulatory network using Natural Language Processing.使用自然语言处理自动重建细菌调控网络。

BMC Bioinformatics. 2007 Aug 7;8:293. doi: 10.1186/1471-2105-8-293.

Data mining and predictive modeling of biomolecular network from biomedical literature databases.从生物医学文献数据库对生物分子网络进行数据挖掘和预测建模。

IEEE/ACM Trans Comput Biol Bioinform. 2007 Apr-Jun;4(2):251-63. doi: 10.1109/TCBB.2007.070211.

Reverse engineering gene regulatory networks.基因调控网络的逆向工程

Nat Biotechnol. 2005 May;23(5):554-5. doi: 10.1038/nbt0505-554.

Joint learning of gene functions--a Bayesian network model approach.基因功能的联合学习——一种贝叶斯网络模型方法。

J Bioinform Comput Biol. 2006 Apr;4(2):217-39. doi: 10.1142/s0219720006001928.

Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks.基因本体注释的自动提取及其与蛋白质网络中聚类的相关性。

BMC Bioinformatics. 2007 Jul 10;8:243. doi: 10.1186/1471-2105-8-243.

引用本文的文献

Bioinformatics approach to identify the influences of SARS-COV2 infections on atherosclerosis.利用生物信息学方法确定SARS-CoV-2感染对动脉粥样硬化的影响。

Front Cardiovasc Med. 2022 Aug 18;9:907665. doi: 10.3389/fcvm.2022.907665. eCollection 2022.

Automated Phenotyping Tool for Identifying Developmental Language Disorder Cases in Health Systems Data (APT-DLD): A New Research Algorithm for Deployment in Large-Scale Electronic Health Record Systems.用于在卫生系统数据中识别发育性语言障碍病例的自动化表型分析工具（APT-DLD）：一种用于在大规模电子健康记录系统中部署的新研究算法。

J Speech Lang Hear Res. 2020 Sep 15;63(9):3019-3035. doi: 10.1044/2020_JSLHR-19-00397. Epub 2020 Aug 11.

Medical Information Extraction Model for User-generated Content.用于用户生成内容的医学信息提取模型

Acta Inform Med. 2019 Sep;27(3):192-198. doi: 10.5455/aim.2019.27.192-198.

Finding relevant biomedical datasets: the UC San Diego solution for the bioCADDIE Retrieval Challenge.寻找相关的生物医学数据集：加州大学圣地亚哥分校为 bioCADDIE 检索挑战赛提供的解决方案。

Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay017.

DataMed - an open source discovery index for finding biomedical datasets.DataMed——一个用于查找生物医学数据集的开源发现索引。

J Am Med Inform Assoc. 2018 Mar 1;25(3):300-308. doi: 10.1093/jamia/ocx121.

Circadian hormone control in a human-on-a-chip: In vitro biology's ignored component?芯片上的人体昼夜节律激素调控：体外生物学中被忽视的部分？

Exp Biol Med (Maywood). 2017 Nov;242(17):1714-1731. doi: 10.1177/1535370217732766.

Synthesizer: Expediting synthesis studies from context-free data with information retrieval techniques.合成器：运用信息检索技术从无上下文数据中加速合成研究。

PLoS One. 2017 Apr 24;12(4):e0175860. doi: 10.1371/journal.pone.0175860. eCollection 2017.

Big Data Analytics for Genomic Medicine.基因组医学中的大数据分析

Int J Mol Sci. 2017 Feb 15;18(2):412. doi: 10.3390/ijms18020412.

Biomechanisms of Comorbidity: Reviewing Integrative Analyses of Multi-omics Datasets and Electronic Health Records.共病的生物机制：多组学数据集与电子健康记录的综合分析综述

Yearb Med Inform. 2016 Nov 10(1):194-206. doi: 10.15265/IY-2016-040.

Multi-OMICs and Genome Editing Perspectives on Liver Cancer Signaling Networks.肝癌信号网络的多组学与基因组编辑视角

Biomed Res Int. 2016;2016:6186281. doi: 10.1155/2016/6186281. Epub 2016 Jun 14.

本文引用的文献

The use of genome annotation data and its impact on biological conclusions.基因组注释数据的使用及其对生物学结论的影响。

Nat Genet. 2004 Oct;36(10):1028-9. doi: 10.1038/ng1004-1028b.

A module map showing conditional activity of expression modules in cancer.一张显示癌症中表达模块条件活性的模块图。

Nat Genet. 2004 Oct;36(10):1090-8. doi: 10.1038/ng1434. Epub 2004 Sep 26.

Influence of aging on rat liver enzymes involved in glutathione synthesis and degradation.衰老对大鼠肝脏中参与谷胱甘肽合成与降解的酶的影响。

Arch Gerontol Geriatr. 1991 Nov-Dec;13(3):263-70. doi: 10.1016/0167-4943(91)90048-u.

Coexpression analysis of human genes across many microarray data sets.跨多个微阵列数据集的人类基因共表达分析。

Genome Res. 2004 Jun;14(6):1085-94. doi: 10.1101/gr.1910904.

Early evolution of selenium status and oxidative stress parameters in rat models of thermal injury.热损伤大鼠模型中硒状态和氧化应激参数的早期演变

J Trace Elem Med Biol. 2004;17(4):313-8. doi: 10.1016/s0946-672x(04)80034-8.

A gene atlas of the mouse and human protein-encoding transcriptomes.小鼠和人类蛋白质编码转录组的基因图谱。

Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7. doi: 10.1073/pnas.0400782101. Epub 2004 Apr 9.

Gene expression in human embryonic stem cell lines: unique molecular signature.人类胚胎干细胞系中的基因表达：独特的分子特征。

Blood. 2004 Apr 15;103(8):2956-64. doi: 10.1182/blood-2003-09-3314. Epub 2003 Dec 30.

Pharmacogenomic identification of targets for adjuvant therapy with the topoisomerase poison camptothecin.拓扑异构酶抑制剂喜树碱辅助治疗靶点的药物基因组学鉴定

Cancer Res. 2004 Mar 15;64(6):2096-104. doi: 10.1158/0008-5472.can-03-2029.

The Unified Medical Language System (UMLS): integrating biomedical terminology.统一医学语言系统（UMLS）：整合生物医学术语。

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061.

Database resources of the National Center for Biotechnology Information: update.美国国立生物技术信息中心的数据库资源：更新

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D35-40. doi: 10.1093/nar/gkh073.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验