• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用表达值和文本信息进行跨物种的匹配实验。

Matching experiments across species using expression values and textual information.

机构信息

Lane Center for Computational Biology, Carnegie Mellon University Pittsburgh, PA 15213, USA.

出版信息

Bioinformatics. 2012 Jun 15;28(12):i258-64. doi: 10.1093/bioinformatics/bts205.

DOI:10.1093/bioinformatics/bts205
PMID:22689770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3371837/
Abstract

MOTIVATION

With the vast increase in the number of gene expression datasets deposited in public databases, novel techniques are required to analyze and mine this wealth of data. Similar to the way BLAST enables cross-species comparison of sequence data, tools that enable cross-species expression comparison will allow us to better utilize these datasets: cross-species expression comparison enables us to address questions in evolution and development, and further allows the identification of disease-related genes and pathways that play similar roles in humans and model organisms. Unlike sequence, which is static, expression data changes over time and under different conditions. Thus, a prerequisite for performing cross-species analysis is the ability to match experiments across species.

RESULTS

To enable better cross-species comparisons, we developed methods for automatically identifying pairs of similar expression datasets across species. Our method uses a co-training algorithm to combine a model of expression similarity with a model of the text which accompanies the expression experiments. The co-training method outperforms previous methods based on expression similarity alone. Using expert analysis, we show that the new matches identified by our method indeed capture biological similarities across species. We then use the matched expression pairs between human and mouse to recover known and novel cycling genes as well as to identify genes with possible involvement in diabetes. By providing the ability to identify novel candidate genes in model organisms, our method opens the door to new models for studying diseases.

AVAILABILITY

Source code and supplementary information is available at: www.andrew.cmu.edu/user/aaronwis/cotrain12.

摘要

动机

随着越来越多的基因表达数据集存入公共数据库,我们需要新的技术来分析和挖掘这些丰富的数据。类似于 BLAST 使序列数据在不同物种之间的比较成为可能,使表达数据在不同物种之间进行比较的工具将使我们能够更好地利用这些数据集:在进化和发育方面的比较使我们能够提出问题,进一步确定在人类和模式生物中发挥类似作用的与疾病相关的基因和途径。与序列不同,表达数据是动态的,会随时间和条件的变化而变化。因此,进行跨物种分析的前提是能够在不同物种之间匹配实验。

结果

为了实现更好的跨物种比较,我们开发了方法来自动识别跨物种的相似表达数据集对。我们的方法使用协同训练算法将表达相似性模型与表达实验所伴随的文本模型结合起来。协同训练方法优于以前仅基于表达相似性的方法。通过专家分析,我们表明,我们的方法新识别的匹配确实捕捉到了不同物种之间的生物学相似性。然后,我们使用人类和小鼠之间的匹配表达对来恢复已知和新的循环基因,并识别可能与糖尿病有关的基因。通过提供在模式生物中识别新候选基因的能力,我们的方法为研究疾病的新模型打开了大门。

可用性

源代码和补充信息可在:www.andrew.cmu.edu/user/aaronwis/cotrain12 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/2c7a026928ea/bts205f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/7deea2dae788/bts205f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/3854b3161df0/bts205f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/7a5a6974bfc3/bts205f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/2c7a026928ea/bts205f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/7deea2dae788/bts205f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/3854b3161df0/bts205f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/7a5a6974bfc3/bts205f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f425/3371837/2c7a026928ea/bts205f4.jpg

相似文献

1
Matching experiments across species using expression values and textual information.使用表达值和文本信息进行跨物种的匹配实验。
Bioinformatics. 2012 Jun 15;28(12):i258-64. doi: 10.1093/bioinformatics/bts205.
2
Cross-species queries of large gene expression databases.跨物种查询大型基因表达数据库。
Bioinformatics. 2010 Oct 1;26(19):2416-23. doi: 10.1093/bioinformatics/btq451. Epub 2010 Aug 11.
3
Identification of genes involved in the same pathways using a Hidden Markov Model-based approach.基于隐马尔可夫模型的方法识别参与相同途径的基因。
Bioinformatics. 2009 Nov 15;25(22):2945-54. doi: 10.1093/bioinformatics/btp521. Epub 2009 Aug 31.
4
Inter-species pathway perturbation prediction via data-driven detection of functional homology.通过数据驱动的功能同源性检测进行种间通路扰动预测。
Bioinformatics. 2015 Feb 15;31(4):501-8. doi: 10.1093/bioinformatics/btu570. Epub 2014 Aug 22.
5
Pathway analysis using random forests with bivariate node-split for survival outcomes.使用随机森林进行生存结局的双变量节点分裂的通路分析。
Bioinformatics. 2010 Jan 15;26(2):250-8. doi: 10.1093/bioinformatics/btp640. Epub 2009 Nov 18.
6
Clustering short time series gene expression data.聚类短时间序列基因表达数据。
Bioinformatics. 2005 Jun;21 Suppl 1:i159-68. doi: 10.1093/bioinformatics/bti1022.
7
Identifying cycling genes by combining sequence homology and expression data.通过结合序列同源性和表达数据来识别周期性基因。
Bioinformatics. 2006 Jul 15;22(14):e314-22. doi: 10.1093/bioinformatics/btl229.
8
Inferring pairwise regulatory relationships from multiple time series datasets.从多个时间序列数据集中推断成对的调控关系。
Bioinformatics. 2007 Mar 15;23(6):755-63. doi: 10.1093/bioinformatics/btl676. Epub 2007 Jan 19.
9
svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification.svdPPCS:一种基于有效奇异值分解的方法,用于鉴定保守和分歧的共表达基因模块。
BMC Bioinformatics. 2010 Jun 22;11:338. doi: 10.1186/1471-2105-11-338.
10
SpeCond: a method to detect condition-specific gene expression.SpeCond:一种检测条件特异性基因表达的方法。
Genome Biol. 2011 Oct 18;12(10):R101. doi: 10.1186/gb-2011-12-10-r101.

引用本文的文献

1
Learning a genome-wide score of human-mouse conservation at the functional genomics level.在功能基因组学层面了解全基因组范围的人类与小鼠保守性评分。
Nat Commun. 2021 May 3;12(1):2495. doi: 10.1038/s41467-021-22653-8.
2
Comparison of Gene Coexpression Profiles and Construction of Conserved Gene Networks to Find Functional Modules.基因共表达谱的比较及保守基因网络的构建以寻找功能模块
PLoS One. 2015 Jul 6;10(7):e0132039. doi: 10.1371/journal.pone.0132039. eCollection 2015.
3
Toward computational cumulative biology by combining models of biological datasets.

本文引用的文献

1
Large scale comparison of innate responses to viral and bacterial pathogens in mouse and macaque.大规模比较小鼠和猕猴对病毒和细菌病原体的先天反应。
PLoS One. 2011;6(7):e22401. doi: 10.1371/journal.pone.0022401. Epub 2011 Jul 18.
2
Cross-species queries of large gene expression databases.跨物种查询大型基因表达数据库。
Bioinformatics. 2010 Oct 1;26(19):2416-23. doi: 10.1093/bioinformatics/btq451. Epub 2010 Aug 11.
3
Evolutionary divergence in the fungal response to fluconazole revealed by soft clustering.软聚类揭示了真菌对氟康唑的反应的进化分歧。
通过整合生物数据集模型迈向计算累积生物学。
PLoS One. 2014 Nov 26;9(11):e113053. doi: 10.1371/journal.pone.0113053. eCollection 2014.
Genome Biol. 2010;11(7):R77. doi: 10.1186/gb-2010-11-7-r77. Epub 2010 Jul 23.
4
Next generation software for functional trend analysis.下一代功能趋势分析软件。
Bioinformatics. 2009 Nov 15;25(22):3043-4. doi: 10.1093/bioinformatics/btp498. Epub 2009 Aug 28.
5
Cross species analysis of microarray expression data.微阵列表达数据的跨物种分析。
Bioinformatics. 2009 Jun 15;25(12):1476-83. doi: 10.1093/bioinformatics/btp247. Epub 2009 Apr 8.
6
Innate immunity, insulin resistance and type 2 diabetes.先天性免疫、胰岛素抵抗与2型糖尿病
Trends Endocrinol Metab. 2008 Jan;19(1):10-6. doi: 10.1016/j.tem.2007.10.004. Epub 2007 Dec 21.
7
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.利用进化特征在12个果蝇基因组中发现功能元件。
Nature. 2007 Nov 8;450(7167):219-32. doi: 10.1038/nature06340.
8
Metagene projection for cross-platform, cross-species characterization of global transcriptional states.用于跨平台、跨物种全局转录状态表征的元基因投影。
Proc Natl Acad Sci U S A. 2007 Apr 3;104(14):5959-64. doi: 10.1073/pnas.0701068104. Epub 2007 Mar 27.
9
NFkappaB and its inhibitor IkappaB in relation to type 2 diabetes and its microvascular and atherosclerotic complications.核因子κB及其抑制剂κB抑制蛋白与2型糖尿病及其微血管和动脉粥样硬化并发症的关系
Hum Immunol. 2006 Sep;67(9):706-13. doi: 10.1016/j.humimm.2006.05.006. Epub 2006 Jun 22.
10
Deficiency of interleukin-18 in mice leads to hyperphagia, obesity and insulin resistance.小鼠体内白细胞介素-18缺乏会导致食欲亢进、肥胖和胰岛素抵抗。
Nat Med. 2006 Jun;12(6):650-6. doi: 10.1038/nm1415. Epub 2006 May 28.