• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将信息检索方法扩展至基于个性化基因组的疾病研究。

Extending information retrieval methods to personalized genomic-based studies of disease.

作者信息

Ye Shuyun, Dawson John A, Kendziorski Christina

机构信息

Department of Statistics, University of Wisconsin, Madison, WI, USA.

Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA.

出版信息

Cancer Inform. 2015 Feb 10;13(Suppl 7):85-95. doi: 10.4137/CIN.S16354. eCollection 2014.

DOI:10.4137/CIN.S16354
PMID:25733795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4332045/
Abstract

Genomic-based studies of disease now involve diverse types of data collected on large groups of patients. A major challenge facing statistical scientists is how best to combine the data, extract important features, and comprehensively characterize the ways in which they affect an individual's disease course and likelihood of response to treatment. We have developed a survival-supervised latent Dirichlet allocation (survLDA) modeling framework to address these challenges. Latent Dirichlet allocation (LDA) models have proven extremely effective at identifying themes common across large collections of text, but applications to genomics have been limited. Our framework extends LDA to the genome by considering each patient as a "document" with "text" detailing his/her clinical events and genomic state. We then further extend the framework to allow for supervision by a time-to-event response. The model enables the efficient identification of collections of clinical and genomic features that co-occur within patient subgroups, and then characterizes each patient by those features. An application of survLDA to The Cancer Genome Atlas ovarian project identifies informative patient subgroups showing differential response to treatment, and validation in an independent cohort demonstrates the potential for patient-specific inference.

摘要

基于基因组的疾病研究现在涉及在大量患者群体上收集的各种类型的数据。统计科学家面临的一个主要挑战是如何最好地整合这些数据、提取重要特征,并全面描述它们影响个体疾病进程和治疗反应可能性的方式。我们开发了一种生存监督潜在狄利克雷分配(survLDA)建模框架来应对这些挑战。潜在狄利克雷分配(LDA)模型在识别大量文本集合中常见的主题方面已被证明极其有效,但在基因组学中的应用一直有限。我们的框架通过将每个患者视为一个“文档”,其“文本”详细描述了他/她的临床事件和基因组状态,将LDA扩展到了基因组。然后,我们进一步扩展该框架,以允许通过事件发生时间响应进行监督。该模型能够有效地识别在患者亚组中共同出现的临床和基因组特征集合,然后用这些特征对每个患者进行表征。survLDA在癌症基因组图谱卵巢癌项目中的应用识别出了对治疗有不同反应的信息丰富的患者亚组,在一个独立队列中的验证证明了进行患者特异性推断的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/7144102a9a90/cin-suppl.7-2014-085f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/8059c847b34d/cin-suppl.7-2014-085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/bb40953d6e7e/cin-suppl.7-2014-085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/e539a0728ce8/cin-suppl.7-2014-085f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/7144102a9a90/cin-suppl.7-2014-085f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/8059c847b34d/cin-suppl.7-2014-085f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/bb40953d6e7e/cin-suppl.7-2014-085f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/e539a0728ce8/cin-suppl.7-2014-085f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53fe/4332045/7144102a9a90/cin-suppl.7-2014-085f4.jpg

相似文献

1
Extending information retrieval methods to personalized genomic-based studies of disease.将信息检索方法扩展至基于个性化基因组的疾病研究。
Cancer Inform. 2015 Feb 10;13(Suppl 7):85-95. doi: 10.4137/CIN.S16354. eCollection 2014.
2
Cardiology record multi-label classification using latent Dirichlet allocation.使用潜在狄利克雷分配进行心脏病学记录的多标签分类。
Comput Methods Programs Biomed. 2018 Oct;164:111-119. doi: 10.1016/j.cmpb.2018.07.002. Epub 2018 Jul 17.
3
Leveraging Latent Dirichlet Allocation in processing free-text personal goals among patients undergoing bladder cancer surgery.利用潜在狄利克雷分配在处理膀胱癌手术患者的自由文本个人目标。
Qual Life Res. 2019 Jun;28(6):1441-1455. doi: 10.1007/s11136-019-02132-w. Epub 2019 Feb 23.
4
Latent Dirichlet allocation models for image classification.基于潜在狄利克雷分配模型的图像分类。
IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2665-79. doi: 10.1109/TPAMI.2013.69.
5
Pathway index models for construction of patient-specific risk profiles.用于构建患者特定风险概况的途径指数模型。
Stat Med. 2013 Apr 30;32(9):1524-35. doi: 10.1002/sim.5641. Epub 2012 Oct 16.
6
Exploring the complexity of pathway-drug relationships using latent Dirichlet allocation.使用潜在狄利克雷分配探索通路-药物关系的复杂性。
Comput Biol Chem. 2014 Dec;53 Pt A:144-52. doi: 10.1016/j.compbiolchem.2014.08.019. Epub 2014 Aug 24.
7
Semi-supervised identification of cancer subgroups using survival outcomes and overlapping grouping information.基于生存结局和重叠分组信息的半监督式癌症亚组识别。
Stat Methods Med Res. 2019 Jul;28(7):2137-2149. doi: 10.1177/0962280217752980. Epub 2018 Jan 16.
8
A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.一种深度自回归方法在多模态数据主题建模中的应用。
IEEE Trans Pattern Anal Mach Intell. 2016 Jun;38(6):1056-69. doi: 10.1109/TPAMI.2015.2476802. Epub 2015 Sep 11.
9
UTOPIAN: user-driven topic modeling based on interactive nonnegative matrix factorization.基于交互式非负矩阵分解的用户驱动主题建模。
IEEE Trans Vis Comput Graph. 2013 Dec;19(12):1992-2001. doi: 10.1109/TVCG.2013.212.
10
Learning topic models by belief propagation.通过信念传播学习主题模型。
IEEE Trans Pattern Anal Mach Intell. 2013 May;35(5):1121-34. doi: 10.1109/TPAMI.2012.185.

引用本文的文献

1
Using topic modeling via non-negative matrix factorization to identify relationships between genetic variants and disease phenotypes: A case study of Lipoprotein(a) (LPA).利用非负矩阵分解的主题建模来识别遗传变异与疾病表型之间的关系:脂蛋白(a)(LPA)的案例研究。
PLoS One. 2019 Feb 13;14(2):e0212112. doi: 10.1371/journal.pone.0212112. eCollection 2019.

本文引用的文献

1
Increased Intragenic IGF2 Methylation is Associated with Repression of Insulator Activity and Elevated Expression in Serous Ovarian Carcinoma.基因内 IGF2 甲基化增加与绝缘子活性抑制和浆液性卵巢癌表达升高有关。
Front Oncol. 2013 May 24;3:131. doi: 10.3389/fonc.2013.00131. eCollection 2013.
2
Prognostic significance of serum soluble CD163 level in patients with epithelial ovarian cancer.血清可溶性 CD163 水平在卵巢上皮性癌患者中的预后意义。
Gynecol Obstet Invest. 2013;75(4):263-7. doi: 10.1159/000349892. Epub 2013 Apr 17.
3
HOXA9, ISL1 and ALDH1A3 methylation patterns as prognostic markers for nonmuscle invasive bladder cancer: array-based DNA methylation and expression profiling.
HOXA9、ISL1 和 ALDH1A3 甲基化模式作为非肌肉浸润性膀胱癌的预后标志物:基于阵列的 DNA 甲基化和表达谱分析。
Int J Cancer. 2013 Sep 1;133(5):1135-42. doi: 10.1002/ijc.28121. Epub 2013 Mar 16.
4
Investigation of human cationic antimicrobial protein-18 (hCAP-18), lactoferrin and CD163 as potential biomarkers for ovarian cancer.探讨人阳离子抗菌肽-18(hCAP-18)、乳铁蛋白和 CD163 作为卵巢癌潜在生物标志物的研究。
J Ovarian Res. 2013 Jan 22;6(1):5. doi: 10.1186/1757-2215-6-5.
5
An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.一种基于整合通路的癌症生存预测临床基因组模型。
Stat Probab Lett. 2010 Sep 7;80(17-18):1313-1319. doi: 10.1016/j.spl.2010.04.011.
6
Expression of the transient receptor potential channel c3 correlates with a favorable prognosis in patients with adenocarcinoma of the lung.瞬时受体电位通道 C3 的表达与肺腺癌患者的良好预后相关。
Ann Surg Oncol. 2011 Nov;18(12):3377-83. doi: 10.1245/s10434-011-1798-9. Epub 2011 May 21.
7
Gene expression profile for predicting survival in advanced-stage serous ovarian cancer across two independent datasets.用于预测两个独立数据集的晚期浆液性卵巢癌患者生存情况的基因表达谱。
PLoS One. 2010 Mar 12;5(3):e9615. doi: 10.1371/journal.pone.0009615.
8
Pathway analysis using random forests with bivariate node-split for survival outcomes.使用随机森林进行生存结局的双变量节点分裂的通路分析。
Bioinformatics. 2010 Jan 15;26(2):250-8. doi: 10.1093/bioinformatics/btp640. Epub 2009 Nov 18.
9
Survival prediction and gene identification with penalized global AUC maximization.基于惩罚全局AUC最大化的生存预测与基因识别
J Comput Biol. 2009 Dec;16(12):1661-70. doi: 10.1089/cmb.2008.0188.
10
Integrating biological knowledge with gene expression profiles for survival prediction of cancer.整合生物学知识与基因表达谱以预测癌症患者的生存情况。
J Comput Biol. 2009 Feb;16(2):265-78. doi: 10.1089/cmb.2008.12TT.