• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

USNAP:快速唯一密集区域检测及其在肺癌中的应用。

USNAP: fast unique dense region detection and its application to lung cancer.

机构信息

Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, 60 Leonard Avenue, Toronto, ON M5T 0S8, Canada.

Department of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, United States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad477.

DOI:10.1093/bioinformatics/btad477
PMID:37527019
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10425186/
Abstract

MOTIVATION

Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains.

RESULTS

There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP, to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of O(m+mc log nc+nc log nc), where m is the number of edges, and n is the number of vertices in the dynamic graph; mc is the number of edges, and nc is the number of vertices in the collapsed graph.

AVAILABILITY AND IMPLEMENTATION

The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22.

摘要

动机

许多现实世界的问题都可以建模为有注释的图。从这些数据中提取可操作信息的可扩展图算法需求量很大,因为这些图很大,拓扑结构各异,节点/边注释也多种多样。当这些图随时间变化时,它们会创建动态图,并有可能在不同时间点找到模式。在本文中,我们引入了一种可扩展的算法,该算法可以在动态图中找到随时间变化的独特密集区域。此类算法在许多不同领域都有应用,包括生物、金融和社交领域。

结果

本文有三个重要贡献。首先,我们设计了一种可扩展的算法 USNAP,用于有效地识别给定动态图中特定时间戳的唯一密集子图。重要的是,USNAP 在贪婪算法的每一步都提供了密度度量的下界。其次,通过在真实数据上验证 USNAP 获得的见解和理解表明了其有效性。虽然 USNAP 与领域无关,但我们将其应用于四个非小细胞肺癌基因表达数据集。非小细胞肺癌的各个阶段被建模为动态图,并输入到 USNAP 中。通路富集分析和来自文献的综合解释表明,USNAP 为癌症进展的不同阶段确定了生物学上相关的机制。第三,USNAP 是可扩展的,其时间复杂度为 O(m+mclognc+nc log nc),其中 m 是边的数量,n 是动态图中的顶点数;mc 是边的数量,nc 是折叠图中的顶点数。

可用性和实现

USNAP 的代码可在 https://www.cs.utoronto.ca/~juris/data/USNAP22 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5853/10425186/ba2f1a3c8539/btad477f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5853/10425186/9512c3d784cf/btad477f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5853/10425186/ba2f1a3c8539/btad477f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5853/10425186/9512c3d784cf/btad477f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5853/10425186/ba2f1a3c8539/btad477f2.jpg

相似文献

1
USNAP: fast unique dense region detection and its application to lung cancer.USNAP:快速唯一密集区域检测及其在肺癌中的应用。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad477.
2
Dynamic Graph Stream Algorithms in () Space.()空间中的动态图流算法
Algorithmica. 2019;81(5):1965-1987. doi: 10.1007/s00453-018-0520-8. Epub 2018 Sep 25.
3
FlexGraph: Flexible partitioning and storage for scalable graph mining.FlexGraph:可扩展图挖掘的灵活分区和存储。
PLoS One. 2020 Jan 24;15(1):e0227032. doi: 10.1371/journal.pone.0227032. eCollection 2020.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Modeling tumor progression via the comparison of stage-specific graphs.通过比较阶段特异性图来模拟肿瘤进展。
Methods. 2018 Jan 1;132:34-41. doi: 10.1016/j.ymeth.2017.06.033. Epub 2017 Jul 3.
6
MISAGA: An Algorithm for Mining Interesting Subgraphs in Attributed Graphs.MISAGA:属性图中有趣子图挖掘的算法。
IEEE Trans Cybern. 2018 May;48(5):1369-1382. doi: 10.1109/TCYB.2017.2693558. Epub 2017 Apr 25.
7
Bit-parallel sequence-to-graph alignment.位并行序列到图的对齐。
Bioinformatics. 2019 Oct 1;35(19):3599-3607. doi: 10.1093/bioinformatics/btz162.
8
Subexponential-Time Algorithms for Finding Large Induced Sparse Subgraphs.用于寻找大型诱导稀疏子图的亚指数时间算法。
Algorithmica. 2021;83(8):2634-2650. doi: 10.1007/s00453-020-00745-z. Epub 2020 Jul 31.
9
Clone temporal centrality measures for incomplete sequences of graph snapshots.针对图快照的不完整序列的克隆时间中心性度量。
BMC Bioinformatics. 2017 May 16;18(1):261. doi: 10.1186/s12859-017-1677-x.
10
Visual exploration of complex time-varying graphs.复杂时变图的可视化探索
IEEE Trans Vis Comput Graph. 2006 Sep-Oct;12(5):805-12. doi: 10.1109/TVCG.2006.193.

本文引用的文献

1
A prognostic model for overall survival of patients with early-stage non-small cell lung cancer: a multicentre, retrospective study.早期非小细胞肺癌患者总生存期的预后模型:一项多中心回顾性研究。
Lancet Digit Health. 2020 Nov;2(11):e594-e606. doi: 10.1016/s2589-7500(20)30225-9. Epub 2020 Oct 19.
2
Ion Channels in Lung Cancer.肺癌中的离子通道。
Rev Physiol Biochem Pharmacol. 2021;181:57-79. doi: 10.1007/112_2020_29.
3
The role of mitochondrial ATP synthase in cancer.线粒体 ATP 合酶在癌症中的作用。
Biol Chem. 2020 Oct 25;401(11):1199-1214. doi: 10.1515/hsz-2020-0157.
4
pathDIP 4: an extended pathway annotations and enrichment analysis resource for human, model organisms and domesticated species.pathDIP 4:一个扩展的人类、模式生物和驯化物种通路注释和富集分析资源。
Nucleic Acids Res. 2020 Jan 8;48(D1):D479-D488. doi: 10.1093/nar/gkz989.
5
Bitter Taste Receptors (TAS2Rs) in Human Lung Macrophages: Receptor Expression and Inhibitory Effects of TAS2R Agonists.人类肺巨噬细胞中的苦味受体(TAS2Rs):受体表达及TAS2R激动剂的抑制作用
Front Physiol. 2019 Oct 2;10:1267. doi: 10.3389/fphys.2019.01267. eCollection 2019.
6
The nasal methylome as a biomarker of asthma and airway inflammation in children.鼻腔甲基组作为儿童哮喘和气道炎症的生物标志物。
Nat Commun. 2019 Jul 12;10(1):3095. doi: 10.1038/s41467-019-11058-3.
7
Loss of parkin reduces lung tumor development by blocking p21 degradation.失活 parkin 可通过阻止 p21 降解来抑制肺肿瘤发生。
PLoS One. 2019 May 21;14(5):e0217037. doi: 10.1371/journal.pone.0217037. eCollection 2019.
8
DNA repair in lung cancer: potential not yet reached.肺癌中的DNA修复:潜力尚未实现。
Lung Cancer Manag. 2016 Apr;5(1):5-8. doi: 10.2217/lmt-2016-0004. Epub 2016 Apr 6.
9
HIV-1 Nef promotes cell proliferation and microRNA dysregulation in lung cells.HIV-1 Nef 促进肺细胞的增殖和 microRNA 失调。
Cell Cycle. 2019 Jan;18(2):130-142. doi: 10.1080/15384101.2018.1557487. Epub 2019 Jan 6.
10
Acetylcholine signaling system in progression of lung cancers.乙酰胆碱信号系统在肺癌进展中的作用。
Pharmacol Ther. 2019 Feb;194:222-254. doi: 10.1016/j.pharmthera.2018.10.002. Epub 2018 Oct 3.