Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures.

Affiliations

Division of Clinical Research, The First Hospital of Jilin University, 71 Xinmin Street, Changchun, Jilin 130021, China.

Department of Biostatistics, Markey Cancer Center, The University of Kentucky, 800 Rose St., Lexington, KY 40536, USA.

Publication information

Biomed Res Int. 2019 Apr 3;2019:2497509. doi: 10.1155/2019/2497509. eCollection 2019.

DOI:10.1155/2019/2497509
PMID:31073522
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6470448/
Abstract

To analyze gene expression data with sophisticated grouping structures and to extract hidden patterns from such data, feature selection is of critical importance. It is well known that genes do not function in isolation but work together within various metabolic, regulatory, and signaling pathways. A method that takes the biological knowledge contained in these pathways into account is called a pathway-based algorithm. Studies have demonstrated that a pathway-based method usually outperforms its gene-based counterpart, which considers no biological knowledge. In this article, pathway-based feature selection is first divided into three major categories: pathway-level selection, bilevel selection, and pathway-guided gene selection. Treating bilevel selection methods as a special case of pathway-guided gene selection, we discuss pathway-guided gene selection methods in detail, along with the importance of penalization in such methods. Finally, we point out potential uses of pathway-guided gene selection in one active research avenue, namely the analysis of longitudinal gene expression data. We believe this article provides valuable insights for computational biologists and biostatisticians so that they can make biology more computable.
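
To make the distinction between pathway-level and bilevel selection concrete, the following is a minimal sketch of how a penalty can act on a coefficient vector. It is not the authors' method: it uses the standard group-lasso and sparse-group-lasso shrinkage steps as stand-ins, assumes non-overlapping pathways, and omits the usual group-size weights. The coefficient values, pathway names, and penalty levels are all hypothetical.

```python
import numpy as np

def soft_threshold(beta, lam):
    """Gene-level shrinkage: element-wise soft-thresholding (lasso step)."""
    return np.sign(beta) * np.maximum(np.abs(beta) - lam, 0.0)

def group_soft_threshold(beta, groups, lam):
    """Pathway-level shrinkage: each pathway's block is kept or dropped as a unit."""
    out = np.zeros_like(beta, dtype=float)
    for idx in groups.values():
        block = beta[idx]
        norm = np.linalg.norm(block)
        if norm > lam:  # pathway survives; shrink its whole block toward zero
            out[idx] = (1.0 - lam / norm) * block
    return out

def bilevel_threshold(beta, groups, lam_gene, lam_pathway):
    """Bilevel shrinkage: gene-level step followed by pathway-level step."""
    return group_soft_threshold(soft_threshold(beta, lam_gene), groups, lam_pathway)

# Hypothetical coefficients for six genes in three non-overlapping pathways.
beta = np.array([3.0, 0.2, 0.3, 0.1, 4.0, 0.5])
pathways = {"pathway_A": [0, 1], "pathway_B": [2, 3], "pathway_C": [4, 5]}

pathway_level = group_soft_threshold(beta, pathways, 1.0)
bilevel = bilevel_threshold(beta, pathways, 0.4, 1.0)

print("pathway-level:", np.round(pathway_level, 3))
print("bilevel:      ", np.round(bilevel, 3))
```

In this toy setting, the pathway-level step removes pathway_B entirely but keeps every gene in the surviving pathways, whereas the bilevel step also zeroes individual weak genes inside a surviving pathway.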

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376d/6470448/1c5474895f58/BMRI2019-2497509.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376d/6470448/004b6bb7d70c/BMRI2019-2497509.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376d/6470448/7535de9d70cd/BMRI2019-2497509.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376d/6470448/40125b53c984/BMRI2019-2497509.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/376d/6470448/a9fbb341c8fa/BMRI2019-2497509.005.jpg
