• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在转录组水平解读肺癌健康差异

Interpreting Lung Cancer Health Disparity at Transcriptome Level.

作者信息

Sobhan Masrur, Islam Md Mezbahul, Mondal Ananda Mohan

机构信息

Knight Foundation School of Computing and Information Sciences Florida International University Miami, USA.

出版信息

bioRxiv. 2025 Jan 13:2025.01.09.632292. doi: 10.1101/2025.01.09.632292.

DOI:10.1101/2025.01.09.632292
PMID:39868239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11761490/
Abstract

Lung cancer is a leading cause of cancer-related mortality, with disparities in incidence and outcomes observed across different racial and sex groups. Understanding the genetic factors of these disparities is critical for developing targeted treatment therapies. This study aims to identify both patient-specific and cohort-specific biomarker genes that contribute to lung cancer health disparities among African American males (AAMs), European American males (EAMs), African American females (AAFs), and European American females (EAFs). The real-world data is highly imbalanced with respect to race, and the lung cancer dataset is no exception. So, classification with race labels will generate highly biased results toward the larger cohort. We developed a computational framework by designing the classification problems with disease conditions instead of races and leveraging the local interpretability of explainable AI, SHAP (SHapley Additive exPlanations). This study used three disease conditions of lung cancer, including Lung Adenocarcinoma (LUAD), Lung Squamous Cell Carcinoma (LUSC), and Healthy samples (HEALTHY) to design four classification tasks: one 3-class problem (LUAD-LUSC-HEALTHY) and three 2-class problems (LUAD-LUSC, LUAD-HEALTHY, and LUSC-HEALTHY). This multiple-classification approach allows a LUAD patient to be interrogated via three classification problems, namely LUAD-LUSC-HEALTHY, LUAD-LUSC, and LUAD-HEALTHY, thus providing a robust approach of retrieving disparity information for individual patients through the local interpretation of SHAP. The proposed method successfully discovered the sets of genes and pathways related to health disparities in lung cancer between two cohorts, including AAMs vs. EAMs, AAFs vs. EAFs, AAMs vs. AAFs, and EAMs vs. EAFs. The discovered list of genes and pathways provide a short list for biological scientists to conduct wet lab experiment.

摘要

肺癌是癌症相关死亡的主要原因,在不同种族和性别群体中,肺癌的发病率和治疗结果存在差异。了解这些差异的遗传因素对于开发靶向治疗方法至关重要。本研究旨在识别导致非裔美国男性(AAM)、欧裔美国男性(EAM)、非裔美国女性(AAF)和欧裔美国女性(EAF)肺癌健康差异的患者特异性和队列特异性生物标志物基因。真实世界的数据在种族方面高度不平衡,肺癌数据集也不例外。因此,按种族标签进行分类会对较大的队列产生高度偏差的结果。我们通过设计基于疾病状况而非种族的分类问题,并利用可解释人工智能SHAP(SHapley Additive exPlanations)的局部可解释性,开发了一个计算框架。本研究使用肺癌三种疾病状况,包括肺腺癌(LUAD)、肺鳞状细胞癌(LUSC)和健康样本(HEALTHY)来设计四个分类任务:一个3分类问题(LUAD-LUSC-HEALTHY)和三个2分类问题(LUAD-LUSC、LUAD-HEALTHY和LUSC-HEALTHY)。这种多分类方法允许通过三个分类问题对LUAD患者进行询问,即LUAD-LUSC-HEALTHY、LUAD-LUSC和LUAD-HEALTHY,从而通过SHAP的局部解释为个体患者检索差异信息提供了一种强大的方法。所提出的方法成功发现了两个队列之间肺癌健康差异相关的基因和通路集,包括AAM与EAM、AAF与EAF、AAM与AAF以及EAM与EAF。发现的基因和通路列表为生物科学家进行湿实验室实验提供了一个简短列表。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/cd508e986a00/nihpp-2025.01.09.632292v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/6ce706497362/nihpp-2025.01.09.632292v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/f4237e4212a8/nihpp-2025.01.09.632292v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/650a19057cc2/nihpp-2025.01.09.632292v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/c3d043cdcf7a/nihpp-2025.01.09.632292v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/cd508e986a00/nihpp-2025.01.09.632292v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/6ce706497362/nihpp-2025.01.09.632292v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/f4237e4212a8/nihpp-2025.01.09.632292v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/650a19057cc2/nihpp-2025.01.09.632292v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/c3d043cdcf7a/nihpp-2025.01.09.632292v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e8/11761490/cd508e986a00/nihpp-2025.01.09.632292v1-f0005.jpg

相似文献

1
Interpreting Lung Cancer Health Disparity at Transcriptome Level.在转录组水平解读肺癌健康差异
bioRxiv. 2025 Jan 13:2025.01.09.632292. doi: 10.1101/2025.01.09.632292.
2
Interpreting Lung Cancer Health Disparity between African American Males and European American Males.解读非裔美国男性与欧美裔美国男性之间的肺癌健康差异。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:7141-7143. doi: 10.1109/bibm62325.2024.10822014.
3
Comparative investigation of lung adenocarcinoma and squamous cell carcinoma transcriptome to reveal potential candidate biomarkers: An explainable AI approach.肺腺癌和鳞状细胞癌转录组的比较研究以揭示潜在候选生物标志物:一种可解释的人工智能方法。
Comput Biol Chem. 2025 Apr;115:108333. doi: 10.1016/j.compbiolchem.2024.108333. Epub 2024 Dec 27.
4
Archaea Microbiome Dysregulated Genes and Pathways as Molecular Targets for Lung Adenocarcinoma and Squamous Cell Carcinoma.古菌微生物组失调基因和途径作为肺腺癌和鳞状细胞癌的分子靶点。
Int J Mol Sci. 2022 Sep 30;23(19):11566. doi: 10.3390/ijms231911566.
5
Clinically relevant immune subtypes based on alternative splicing landscape of immune-related genes for lung cancer advanced PPPM approach.基于肺癌免疫相关基因可变剪接图谱的临床相关免疫亚型:先进的PPPM方法
EPMA J. 2024 May 24;15(2):345-373. doi: 10.1007/s13167-024-00366-4. eCollection 2024 Jun.
6
[Identification of differentially expressed genes between lung adenocarcinoma and squamous cell carcinoma using transcriber signature analysis].[利用转录特征分析鉴定肺腺癌与肺鳞癌之间的差异表达基因]
Nan Fang Yi Ke Da Xue Xue Bao. 2019 Jun 30;39(6):641-649. doi: 10.12122/j.issn.1673-4254.2019.06.03.
7
Comprehensive Profiling of Genomic and Transcriptomic Differences between Risk Groups of Lung Adenocarcinoma and Lung Squamous Cell Carcinoma.肺腺癌和肺鳞状细胞癌风险组之间基因组和转录组差异的综合分析
J Pers Med. 2021 Feb 23;11(2):154. doi: 10.3390/jpm11020154.
8
Cancer Stemness-Based Prognostic Immune-Related Gene Signatures in Lung Adenocarcinoma and Lung Squamous Cell Carcinoma.基于癌症干性的肺腺癌和肺鳞癌预后免疫相关基因特征。
Front Endocrinol (Lausanne). 2021 Oct 21;12:755805. doi: 10.3389/fendo.2021.755805. eCollection 2021.
9
Exploring and comparing of the gene expression and methylation differences between lung adenocarcinoma and squamous cell carcinoma.探索并比较肺腺癌和肺鳞癌之间的基因表达和甲基化差异。
J Cell Physiol. 2019 Apr;234(4):4454-4459. doi: 10.1002/jcp.27240. Epub 2018 Oct 14.
10
Network module function enrichment analysis of lung squamous cell carcinoma and lung adenocarcinoma.肺鳞癌和肺腺癌的网络模块功能富集分析。
Medicine (Baltimore). 2022 Nov 25;101(47):e31798. doi: 10.1097/MD.0000000000031798.

本文引用的文献

1
Cancer statistics, 2023.癌症统计数据,2023 年。
CA Cancer J Clin. 2023 Jan;73(1):17-48. doi: 10.3322/caac.21763.
2
Multi-Run Concrete Autoencoder to Identify Prognostic lncRNAs for 12 Cancers.多轮混凝土自动编码器识别 12 种癌症的预后 lncRNAs。
Int J Mol Sci. 2021 Nov 3;22(21):11919. doi: 10.3390/ijms222111919.
3
Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons' Data.前后对比:遗留 TCGA 基因组数据公共数据库与统一 TCGA 基因组数据公共数据库的数据比较。
Cell Syst. 2019 Jul 24;9(1):24-34.e10. doi: 10.1016/j.cels.2019.06.006.
4
Unifying cancer and normal RNA sequencing data from different sources.整合来自不同来源的癌症和正常 RNA 测序数据。
Sci Data. 2018 Apr 17;5:180061. doi: 10.1038/sdata.2018.61.
5
Comparative Transcriptome Profiling Reveals Coding and Noncoding RNA Differences in NSCLC from African Americans and European Americans.比较转录组谱分析揭示非小细胞肺癌中非裔美国人和欧洲裔美国人的编码和非编码 RNA 差异。
Clin Cancer Res. 2017 Dec 1;23(23):7412-7425. doi: 10.1158/1078-0432.CCR-17-0527.
6
Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States.美国可改变的潜在风险因素导致的癌症病例和死亡人数及比例。
CA Cancer J Clin. 2018 Jan;68(1):31-54. doi: 10.3322/caac.21440. Epub 2017 Nov 21.
7
Intratumor and Intertumor Heterogeneity in Melanoma.黑色素瘤的肿瘤内和肿瘤间异质性
Transl Oncol. 2017 Dec;10(6):956-975. doi: 10.1016/j.tranon.2017.09.007. Epub 2017 Oct 24.
8
Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes.大规模关联分析确定了新的肺癌易感位点以及不同组织学亚型遗传易感性的异质性。
Nat Genet. 2017 Jul;49(7):1126-1132. doi: 10.1038/ng.3892. Epub 2017 Jun 12.
9
Genome-wide association study confirms lung cancer susceptibility loci on chromosomes 5p15 and 15q25 in an African-American population.全基因组关联研究证实了非裔美国人中5号染色体短臂15区和15号染色体长臂25区存在肺癌易感基因座。
Lung Cancer. 2016 Aug;98:33-42. doi: 10.1016/j.lungcan.2016.05.008. Epub 2016 May 13.
10
Analysis of Heritability and Shared Heritability Based on Genome-Wide Association Studies for Thirteen Cancer Types.基于全基因组关联研究对13种癌症类型的遗传力和共享遗传力分析
J Natl Cancer Inst. 2015 Oct 12;107(12):djv279. doi: 10.1093/jnci/djv279. Print 2015 Dec.