• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因表达数据的整合基因选择:为传统方法提供生物学背景。

Integrative Gene Selection on Gene Expression Data: Providing Biological Context to Traditional Approaches.

作者信息

Perscheid Cindy, Grasnick Bastien, Uflacker Matthias

机构信息

Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Potsdam, Germany.

出版信息

J Integr Bioinform. 2018 Dec 22;16(1):20180064. doi: 10.1515/jib-2018-0064.

DOI:10.1515/jib-2018-0064
PMID:30785707
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6798862/
Abstract

The advance of high-throughput RNA-Sequencing techniques enables researchers to analyze the complete gene activity in particular cells. From the insights of such analyses, researchers can identify disease-specific expression profiles, thus understand complex diseases like cancer, and eventually develop effective measures for diagnosis and treatment. The high dimensionality of gene expression data poses challenges to its computational analysis, which is addressed with measures of gene selection. Traditional gene selection approaches base their findings on statistical analyses of the actual expression levels, which implies several drawbacks when it comes to accurately identifying the underlying biological processes. In turn, integrative approaches include curated information on biological processes from external knowledge bases during gene selection, which promises to lead to better interpretability and improved predictive performance. Our work compares the performance of traditional and integrative gene selection approaches. Moreover, we propose a straightforward approach to integrate external knowledge with traditional gene selection approaches. We introduce a framework enabling the automatic external knowledge integration, gene selection, and evaluation. Evaluation results prove our framework to be a useful tool for evaluation and show that integration of external knowledge improves overall analysis results.

摘要

高通量RNA测序技术的进步使研究人员能够分析特定细胞中的完整基因活性。从这些分析的见解中,研究人员可以识别疾病特异性表达谱,从而了解像癌症这样的复杂疾病,并最终开发出有效的诊断和治疗措施。基因表达数据的高维度对其计算分析提出了挑战,这可通过基因选择措施来解决。传统的基因选择方法基于对实际表达水平的统计分析来得出结果,这在准确识别潜在生物学过程方面存在若干缺点。反过来,综合方法在基因选择过程中纳入了来自外部知识库的有关生物学过程的精心策划的信息,这有望带来更好的可解释性和改进的预测性能。我们的工作比较了传统和综合基因选择方法的性能。此外,我们提出了一种将外部知识与传统基因选择方法相结合的直接方法。我们引入了一个框架,能够实现自动外部知识整合、基因选择和评估。评估结果证明我们的框架是一个有用的评估工具,并表明外部知识的整合改善了整体分析结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/76413bbd25e6/jib-16-20180064-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/50cfdeb1be4f/jib-16-20180064-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/fe065e3122fc/jib-16-20180064-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/eaa68fe15142/jib-16-20180064-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/a9f95dfa6f31/jib-16-20180064-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/15038e26e6c5/jib-16-20180064-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/3b7e45d99126/jib-16-20180064-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/98e95535b332/jib-16-20180064-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/498da17d15ca/jib-16-20180064-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/e3a58baefe28/jib-16-20180064-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/76413bbd25e6/jib-16-20180064-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/50cfdeb1be4f/jib-16-20180064-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/fe065e3122fc/jib-16-20180064-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/eaa68fe15142/jib-16-20180064-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/a9f95dfa6f31/jib-16-20180064-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/15038e26e6c5/jib-16-20180064-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/3b7e45d99126/jib-16-20180064-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/98e95535b332/jib-16-20180064-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/498da17d15ca/jib-16-20180064-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/e3a58baefe28/jib-16-20180064-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e19/6798862/76413bbd25e6/jib-16-20180064-g010.jpg

相似文献

1
Integrative Gene Selection on Gene Expression Data: Providing Biological Context to Traditional Approaches.基因表达数据的整合基因选择:为传统方法提供生物学背景。
J Integr Bioinform. 2018 Dec 22;16(1):20180064. doi: 10.1515/jib-2018-0064.
2
Integrative biomarker detection on high-dimensional gene expression data sets: a survey on prior knowledge approaches.高维基因表达数据集的综合生物标志物检测:先验知识方法综述。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa151.
3
Towards precise classification of cancers based on robust gene functional expression profiles.基于稳健的基因功能表达谱实现癌症的精准分类
BMC Bioinformatics. 2005 Mar 17;6:58. doi: 10.1186/1471-2105-6-58.
4
Feature selection of gene expression data for Cancer classification using double RBF-kernels.基于双 RBF 核的癌症分类基因表达数据特征选择。
BMC Bioinformatics. 2018 Oct 29;19(1):396. doi: 10.1186/s12859-018-2400-2.
5
Predicting miRNA Targets by Integrating Gene Regulatory Knowledge with Expression Profiles.通过整合基因调控知识与表达谱预测微小RNA靶标
PLoS One. 2016 Apr 11;11(4):e0152860. doi: 10.1371/journal.pone.0152860. eCollection 2016.
6
Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类
BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.
7
Double Selection Based Semi-Supervised Clustering Ensemble for Tumor Clustering from Gene Expression Profiles.基于双重选择的半监督聚类集成用于从基因表达谱中进行肿瘤聚类
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):727-40. doi: 10.1109/TCBB.2014.2315996.
8
Biologically inspired survival analysis based on integrating gene expression as mediator with genomic variants.基于整合基因表达作为基因组变异中介的生物启发式生存分析。
Comput Biol Med. 2016 Oct 1;77:231-9. doi: 10.1016/j.compbiomed.2016.08.020. Epub 2016 Aug 31.
9
A co-expression modules based gene selection for cancer recognition.一种基于共表达模块的癌症识别基因选择方法。
J Theor Biol. 2014 Dec 7;362:75-82. doi: 10.1016/j.jtbi.2014.01.005. Epub 2014 Jan 15.
10
Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity.整合多组学数据进行建模以识别癌症驱动因素并推断患者特异性基因活性。
BMC Syst Biol. 2016 Feb 11;10:16. doi: 10.1186/s12918-016-0260-9.

引用本文的文献

1
GeNetOntology: identifying affected gene ontology terms via grouping, scoring, and modeling of gene expression data utilizing biological knowledge-based machine learning.基因本体论:通过利用基于生物知识的机器学习对基因表达数据进行分组、评分和建模来识别受影响的基因本体术语。
Front Genet. 2023 Aug 21;14:1139082. doi: 10.3389/fgene.2023.1139082. eCollection 2023.
2
Review of feature selection approaches based on grouping of features.基于特征分组的特征选择方法综述。
PeerJ. 2023 Jul 17;11:e15666. doi: 10.7717/peerj.15666. eCollection 2023.
3
Characterization of Arabian Peninsula whole exomes: Contributing to the catalogue of human diversity.

本文引用的文献

1
Integrated Theory- and Data-driven Feature Selection in Gene Expression Data Analysis.基因表达数据分析中基于理论与数据驱动的综合特征选择
Proc Int Conf Data Eng. 2017 Apr;2017:1525-1532. doi: 10.1109/ICDE.2017.223. Epub 2017 May 18.
2
An Ensemble Framework Coping with Instability in the Gene Selection Process.一种应对基因选择过程中不稳定性的集成框架。
Interdiscip Sci. 2018 Mar;10(1):12-23. doi: 10.1007/s12539-017-0274-z. Epub 2018 Jan 8.
3
Unsupervised gene selection using biological knowledge : application in sample clustering.
阿拉伯半岛全外显子组特征分析:助力人类多样性目录构建
iScience. 2022 Oct 12;25(11):105336. doi: 10.1016/j.isci.2022.105336. eCollection 2022 Nov 18.
4
Comprior: facilitating the implementation and automated benchmarking of prior knowledge-based feature selection approaches on gene expression data sets.Comprior:促进基于先验知识的特征选择方法在基因表达数据集上的实施和自动化基准测试。
BMC Bioinformatics. 2021 Aug 12;22(1):401. doi: 10.1186/s12859-021-04308-z.
5
CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis.CogNet:基于面向排名活性子网的KEGG通路富集分析的基因表达数据分类
PeerJ Comput Sci. 2021 Feb 22;7:e336. doi: 10.7717/peerj-cs.336. eCollection 2021.
6
Application of Biological Domain Knowledge Based Feature Selection on Gene Expression Data.基于生物领域知识的特征选择在基因表达数据中的应用。
Entropy (Basel). 2020 Dec 22;23(1):2. doi: 10.3390/e23010002.
利用生物学知识进行无监督基因选择:在样本聚类中的应用
BMC Bioinformatics. 2017 Nov 22;18(1):513. doi: 10.1186/s12859-017-1933-0.
4
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.
5
Expansion of the Gene Ontology knowledgebase and resources.基因本体知识库及资源的扩展。
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338. doi: 10.1093/nar/gkw1108. Epub 2016 Nov 29.
6
The Comparative Toxicogenomics Database: update 2017.比较毒理基因组学数据库:2017年更新版
Nucleic Acids Res. 2017 Jan 4;45(D1):D972-D978. doi: 10.1093/nar/gkw838. Epub 2016 Sep 19.
7
LPRP: A Gene-Gene Interaction Network Construction Algorithm and Its Application in Breast Cancer Data Analysis.LPRP:一种基因-基因交互网络构建算法及其在乳腺癌数据分析中的应用。
Interdiscip Sci. 2018 Mar;10(1):131-142. doi: 10.1007/s12539-016-0185-4. Epub 2016 Sep 17.
8
Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.监督式、无监督式和半监督式特征选择:基因选择综述
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):971-989. doi: 10.1109/TCBB.2015.2478454. Epub 2015 Sep 14.
9
A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data.应用于微阵列数据的特征选择与特征提取方法综述
Adv Bioinformatics. 2015;2015:198363. doi: 10.1155/2015/198363. Epub 2015 Jun 11.
10
DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes.DisGeNET:一个用于动态探索人类疾病及其基因的发现平台。
Database (Oxford). 2015 Apr 15;2015:bav028. doi: 10.1093/database/bav028. Print 2015.