• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的微阵列分析表明,低表达基因可能共同影响 PAH 疾病。

Machine learning-based microarray analyses indicate low-expression genes might collectively influence PAH disease.

机构信息

College of Agronomy, Gansu Agricultural University, Lanzhou, Gansu, China.

School of Agriculture, Middle Tennessee State University, Murfreesboro, Tennessee, United States of America.

出版信息

PLoS Comput Biol. 2019 Aug 12;15(8):e1007264. doi: 10.1371/journal.pcbi.1007264. eCollection 2019 Aug.

DOI:10.1371/journal.pcbi.1007264
PMID:31404060
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6705875/
Abstract

Accurately predicting and testing the types of Pulmonary arterial hypertension (PAH) of each patient using cost-effective microarray-based expression data and machine learning algorithms could greatly help either identifying the most targeting medicine or adopting other therapeutic measures that could correct/restore defective genetic signaling at the early stage. Furthermore, the prediction model construction processes can also help identifying highly informative genes controlling PAH, leading to enhanced understanding of the disease etiology and molecular pathways. In this study, we used several different gene filtering methods based on microarray expression data obtained from a high-quality patient PAH dataset. Following that, we proposed a novel feature selection and refinement algorithm in conjunction with well-known machine learning methods to identify a small set of highly informative genes. Results indicated that clusters of small-expression genes could be extremely informative at predicting and differentiating different forms of PAH. Additionally, our proposed novel feature refinement algorithm could lead to significant enhancement in model performance. To summarize, integrated with state-of-the-art machine learning and novel feature refining algorithms, the most accurate models could provide near-perfect classification accuracies using very few (close to ten) low-expression genes.

摘要

利用具有成本效益的基于微阵列的表达数据和机器学习算法准确预测和测试每位患者的肺动脉高压 (PAH) 类型,这将极大地帮助识别最靶向药物或采用其他治疗措施,从而在早期纠正/恢复有缺陷的遗传信号。此外,预测模型构建过程还可以帮助识别控制 PAH 的高信息量基因,从而增强对疾病病因和分子途径的理解。在这项研究中,我们使用了几种不同的基因过滤方法,这些方法基于从高质量患者 PAH 数据集获得的微阵列表达数据。之后,我们提出了一种新的特征选择和细化算法,结合了著名的机器学习方法,以识别一小组高信息量基因。结果表明,小表达基因簇在预测和区分不同形式的 PAH 方面极具信息量。此外,我们提出的新特征细化算法可以显著提高模型性能。总之,结合最先进的机器学习和新颖的特征细化算法,最准确的模型可以使用非常少的(接近十个)低表达基因提供近乎完美的分类准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/2f20f76e4455/pcbi.1007264.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/ba81095a56b3/pcbi.1007264.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/1bde177c781e/pcbi.1007264.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/db970adef6cb/pcbi.1007264.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/961b107768f3/pcbi.1007264.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/091848af6b8c/pcbi.1007264.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/dadcf7b8818c/pcbi.1007264.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/2f20f76e4455/pcbi.1007264.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/ba81095a56b3/pcbi.1007264.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/1bde177c781e/pcbi.1007264.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/db970adef6cb/pcbi.1007264.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/961b107768f3/pcbi.1007264.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/091848af6b8c/pcbi.1007264.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/dadcf7b8818c/pcbi.1007264.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59e8/6705875/2f20f76e4455/pcbi.1007264.g007.jpg

相似文献

1
Machine learning-based microarray analyses indicate low-expression genes might collectively influence PAH disease.基于机器学习的微阵列分析表明,低表达基因可能共同影响 PAH 疾病。
PLoS Comput Biol. 2019 Aug 12;15(8):e1007264. doi: 10.1371/journal.pcbi.1007264. eCollection 2019 Aug.
2
Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification.用于癌症微阵列数据分类的分层基因选择与遗传模糊系统
PLoS One. 2015 Mar 30;10(3):e0120364. doi: 10.1371/journal.pone.0120364. eCollection 2015.
3
A DSRPCL-SVM approach to informative gene analysis.一种用于信息基因分析的DSRPCL-SVM方法。
Genomics Proteomics Bioinformatics. 2008 Jun;6(2):83-90. doi: 10.1016/S1672-0229(08)60023-6.
4
Active learning using rough fuzzy classifier for cancer prediction from microarray gene expression data.基于粗糙模糊分类器的主动学习在微阵列基因表达数据癌症预测中的应用。
J Biomed Inform. 2019 Apr;92:103136. doi: 10.1016/j.jbi.2019.103136. Epub 2019 Feb 22.
5
Parsimonious selection of useful genes in microarray gene expression data.微阵列基因表达数据中有用基因的简约选择。
Adv Exp Med Biol. 2011;696:45-55. doi: 10.1007/978-1-4419-7046-6_5.
6
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
7
Meta-analysis of blood genome-wide expression profiling studies in pulmonary arterial hypertension.肺高血压全基因组血液表达谱研究的荟萃分析。
Am J Physiol Lung Cell Mol Physiol. 2020 Jan 1;318(1):L98-L111. doi: 10.1152/ajplung.00252.2019. Epub 2019 Oct 16.
8
Blood-based gene-expression biomarkers identification for the non-invasive diagnosis of Parkinson's disease using two-layer hybrid feature selection.基于血液的基因表达生物标志物的鉴定,用于使用双层混合特征选择进行帕金森病的非侵入性诊断。
Gene. 2022 May 20;823:146366. doi: 10.1016/j.gene.2022.146366. Epub 2022 Feb 22.
9
Prediction of core cancer genes using a hybrid of feature selection and machine learning methods.使用特征选择和机器学习方法相结合的方式预测核心癌症基因。
Genet Mol Res. 2015 Aug 3;14(3):8871-82. doi: 10.4238/2015.August.3.10.
10
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.混合遗传算法-神经网络:未预处理微阵列数据的特征提取。
Artif Intell Med. 2011 Sep;53(1):47-56. doi: 10.1016/j.artmed.2011.06.008. Epub 2011 Jul 19.

引用本文的文献

1
Blood biomarker-based classification study for neurodegenerative diseases.基于血液生物标志物的神经退行性疾病分类研究。
Sci Rep. 2023 Oct 11;13(1):17191. doi: 10.1038/s41598-023-43956-4.
2
Intrinsic and Extrinsic Transcriptional Profiles That Affect the Clinical Response to PD-1 Inhibitors in Patients with Non-Small Cell Lung Cancer.影响非小细胞肺癌患者对PD-1抑制剂临床反应的内在和外在转录谱
Cancers (Basel). 2022 Dec 29;15(1):197. doi: 10.3390/cancers15010197.
3
A feature selection-based framework to identify biomarkers for cancer diagnosis: A focus on lung adenocarcinoma.

本文引用的文献

1
Bone Marrow-Derived Proangiogenic Cells Mediate Pulmonary Arteriole Stiffening via Serotonin 2B Receptor Dependent Mechanism.骨髓源促血管生成细胞通过 5-羟色胺 2B 受体依赖的机制介导肺小动脉僵硬。
Circ Res. 2018 Dec 7;123(12):e51-e64. doi: 10.1161/CIRCRESAHA.118.313397.
2
MiR-557 works as a tumor suppressor in human lung cancers by negatively regulating LEF1 expression.微小RNA-557通过负向调控淋巴样增强因子1(LEF1)的表达,在人类肺癌中发挥肿瘤抑制作用。
Tumour Biol. 2017 Jun;39(6):1010428317709467. doi: 10.1177/1010428317709467.
3
Solving the quantum many-body problem with artificial neural networks.
基于特征选择的癌症诊断生物标志物识别框架:以肺腺癌为例。
PLoS One. 2022 Sep 6;17(9):e0269126. doi: 10.1371/journal.pone.0269126. eCollection 2022.
4
Artificial Intelligence and Cardiovascular Genetics.人工智能与心血管遗传学
Life (Basel). 2022 Feb 14;12(2):279. doi: 10.3390/life12020279.
5
Integrating Statistical and Machine-Learning Approach for Meta-Analysis of Bisphenol A-Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival.整合统计和机器学习方法对双酚 A 暴露数据集进行荟萃分析,揭示了其对细胞凋亡和细胞存活途径中基因表达的影响。
Int J Mol Sci. 2021 Oct 5;22(19):10785. doi: 10.3390/ijms221910785.
6
Comparative Transcriptional Analysis of Pulmonary Arterial Hypertension Associated With Three Different Diseases.与三种不同疾病相关的肺动脉高压的比较转录分析
Front Cell Dev Biol. 2021 Jul 15;9:672159. doi: 10.3389/fcell.2021.672159. eCollection 2021.
用人工神经网络解决量子多体问题。
Science. 2017 Feb 10;355(6325):602-606. doi: 10.1126/science.aag2302.
4
Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.用于从大型公共数据库推断基因组规模网络的微阵列数据处理技术
Microarrays (Basel). 2016 Sep 19;5(3):23. doi: 10.3390/microarrays5030023.
5
Identification of a novel partner gene, KIAA1217, fused to RET: Functional characterization and inhibitor sensitivity of two isoforms in lung adenocarcinoma.一种与RET融合的新伙伴基因KIAA1217的鉴定:肺腺癌中两种异构体的功能特性及抑制剂敏感性
Oncotarget. 2016 Jun 14;7(24):36101-36114. doi: 10.18632/oncotarget.9137.
6
Microarray analysis in pulmonary hypertension.肺动脉高压中的基因芯片分析。
Eur Respir J. 2016 Jul;48(1):229-41. doi: 10.1183/13993003.02030-2015. Epub 2016 Apr 13.
7
CYP21A2 expression is localized in the developing distal epithelium of the human perinatal lung and is compatible with in situ production and intracrine actions of active glucocorticoids.CYP21A2表达定位于人类围产期肺脏发育中的远端上皮,与活性糖皮质激素的原位产生和内分泌作用相符。
J Steroid Biochem Mol Biol. 2016 Oct;163:12-9. doi: 10.1016/j.jsbmb.2016.03.024. Epub 2016 Mar 19.
8
Critical Genomic Networks and Vasoreactive Variants in Idiopathic Pulmonary Arterial Hypertension.特发性肺动脉高压中的关键基因组网络与血管反应性变异体
Am J Respir Crit Care Med. 2016 Aug 15;194(4):464-75. doi: 10.1164/rccm.201508-1678OC.
9
Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data.低表达基因过滤对RNA测序数据中差异表达基因检测的影响
Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:6461-4. doi: 10.1109/EMBC.2015.7319872.
10
Bone Marrow-derived Cells Contribute to the Pathogenesis of Pulmonary Arterial Hypertension.骨髓来源的细胞促成肺动脉高压的发病机制。
Am J Respir Crit Care Med. 2016 Apr 15;193(8):898-909. doi: 10.1164/rccm.201502-0407OC.