• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多源数据变分自编码器的全局和局部特征解缠:一种通过多源拉曼光谱融合技术诊断IgA肾病的可解释模型。

Disentangled global and local features of multi-source data variational autoencoder: An interpretable model for diagnosing IgAN via multi-source Raman spectral fusion techniques.

作者信息

Shuai Wei, Tian Xuecong, Zuo Enguang, Zhang Xueqin, Lu Chen, Gu Jin, Chen Chen, Lv Xiaoyi, Chen Cheng

机构信息

College of Software, Xinjiang University, Urumqi 830046, China.

College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.

出版信息

Artif Intell Med. 2025 Feb;160:103053. doi: 10.1016/j.artmed.2024.103053. Epub 2024 Dec 12.

DOI:10.1016/j.artmed.2024.103053
PMID:39701016
Abstract

A single Raman spectrum reflects limited molecular information. Effective fusion of the Raman spectra of serum and urine source domains helps to obtain richer feature information. However, most of the current studies on immunoglobulin A nephropathy (IgAN) based on Raman spectroscopy are based on small sample data and low signal-to-noise ratio. If a multi-source data fusion strategy is directly adopted, it may even reduce the accuracy of disease diagnosis. To this end, this paper proposes a data enhancement and spectral optimization method based on variational autoencoders to obtain reconstructed Raman spectra with doubled sample size and improved signal-to-noise ratio. In the diagnosis of IgAN in multi-source domain Raman spectra, this paper builds a global and local feature decoupled variational autoencoder (DMSGL-VAE) model based on multi-source data. First, the statistical features after spectral segmentation are extracted, and the latent variables obtained by the variational encoder are decoupled through the decoupling module. The global representation and local representation obtained represent the global shared information and local unique information of the serum and urine source domains, respectively. Then, the cross-source reconstruction loss and decoupling loss are used to constrain the decoupling, and the effectiveness of the decoupling is proved quantitatively and qualitatively. Finally, the features of different source domains were integrated to diagnose IgAN, and the results were analyzed for important features using the SHapley Additive exPlanations algorithm. The experimental results showed that the AUC value of the DMSGL-VAE model for diagnosing IgAN on the test set was as high as 0.9958. The SHAP algorithm was used to further prove that proteins, hydroxybutyrate, and guanine are likely to be common biological fingerprint substances for the diagnosis of IgAN by serum and urine Raman spectroscopy. In summary, the DMSGL-VAE model designed based on Raman spectroscopy in this paper can achieve rapid, non-invasive, and accurate screening of IgAN in terms of classification performance. And interpretable analysis may help doctors further understand IgAN and make more efficient diagnostic measures in the future.

摘要

单一拉曼光谱反映的分子信息有限。血清和尿液源域拉曼光谱的有效融合有助于获取更丰富的特征信息。然而,目前大多数基于拉曼光谱的免疫球蛋白A肾病(IgAN)研究都是基于小样本数据且信噪比低。如果直接采用多源数据融合策略,甚至可能降低疾病诊断的准确性。为此,本文提出一种基于变分自编码器的数据增强和光谱优化方法,以获得样本量翻倍且信噪比提高的重建拉曼光谱。在多源域拉曼光谱的IgAN诊断中,本文基于多源数据构建了全局和局部特征解耦的变分自编码器(DMSGL-VAE)模型。首先,提取光谱分割后的统计特征,并通过解耦模块对变分编码器获得的潜在变量进行解耦。得到的全局表示和局部表示分别代表血清和尿液源域的全局共享信息和局部独特信息。然后,利用跨源重建损失和解耦损失来约束解耦,并从定量和定性两方面证明解耦的有效性。最后,整合不同源域的特征来诊断IgAN,并使用SHapley加法解释算法对结果进行重要特征分析。实验结果表明,DMSGL-VAE模型在测试集上诊断IgAN的AUC值高达0.9958。利用SHAP算法进一步证明,蛋白质、羟基丁酸和鸟嘌呤可能是血清和尿液拉曼光谱诊断IgAN的常见生物指纹物质。综上所述,本文基于拉曼光谱设计的DMSGL-VAE模型在分类性能方面能够实现对IgAN的快速、无创和准确筛查。可解释分析可能有助于医生进一步了解IgAN,并在未来制定更有效的诊断措施。

相似文献

1
Disentangled global and local features of multi-source data variational autoencoder: An interpretable model for diagnosing IgAN via multi-source Raman spectral fusion techniques.多源数据变分自编码器的全局和局部特征解缠:一种通过多源拉曼光谱融合技术诊断IgA肾病的可解释模型。
Artif Intell Med. 2025 Feb;160:103053. doi: 10.1016/j.artmed.2024.103053. Epub 2024 Dec 12.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
A multimodal fusion network based on variational autoencoder for distinguishing SCLC brain metastases from NSCLC brain metastases.一种基于变分自编码器的多模态融合网络,用于区分小细胞肺癌脑转移瘤与非小细胞肺癌脑转移瘤。
Med Phys. 2025 Jul;52(7):e17816. doi: 10.1002/mp.17816. Epub 2025 May 2.
4
Diagnostic tests and algorithms used in the investigation of haematuria: systematic reviews and economic evaluation.用于血尿调查的诊断测试和算法:系统评价与经济评估
Health Technol Assess. 2006 Jun;10(18):iii-iv, xi-259. doi: 10.3310/hta10180.
5
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
6
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
7
Short-Term Memory Impairment短期记忆障碍
8
Clinical symptoms, signs and tests for identification of impending and current water-loss dehydration in older people.老年人即将发生和当前失水脱水的识别的临床症状、体征及检查
Cochrane Database Syst Rev. 2015 Apr 30;2015(4):CD009647. doi: 10.1002/14651858.CD009647.pub2.
9
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
10
The clinical effectiveness and cost-effectiveness of enzyme replacement therapy for Gaucher's disease: a systematic review.戈谢病酶替代疗法的临床疗效和成本效益:一项系统评价。
Health Technol Assess. 2006 Jul;10(24):iii-iv, ix-136. doi: 10.3310/hta10240.

引用本文的文献

1
Classification of multi-lead ECG based on multiple scales and hierarchical feature convolutional neural networks.基于多尺度和层次特征卷积神经网络的多导联心电图分类
Sci Rep. 2025 May 12;15(1):16418. doi: 10.1038/s41598-025-94127-6.