• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

并列:一种用于比较共表达网络的基因嵌入方法。

Juxtapose: a gene-embedding approach for comparing co-expression networks.

机构信息

Department of Computer Science, University of Saskatchewan, Saskatoon, S7N 5C9, Canada.

Augmented Intelligence & Precision Health Laboratory (AIPHL), Research Institute of the McGill University Health Centre, Montreal, H4A 3S5, Canada.

出版信息

BMC Bioinformatics. 2021 Mar 16;22(1):125. doi: 10.1186/s12859-021-04055-1.

DOI:10.1186/s12859-021-04055-1
PMID:33726666
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7968242/
Abstract

BACKGROUND

Gene co-expression networks (GCNs) are not easily comparable due to their complex structure. In this paper, we propose a tool, Juxtapose, together with similarity measures that can be utilized for comparative transcriptomics between a set of organisms. While we focus on its application to comparing co-expression networks across species in evolutionary studies, Juxtapose is also generalizable to co-expression network comparisons across tissues or conditions within the same species.

METHODS

A word embedding strategy commonly used in natural language processing was utilized in order to generate gene embeddings based on walks made throughout the GCNs. Juxtapose was evaluated based on its ability to embed the nodes of synthetic structures in the networks consistently while also generating biologically informative results. Evaluation of the techniques proposed in this research utilized RNA-seq datasets from GTEx, a multi-species experiment of prefrontal cortex samples from the Gene Expression Omnibus, as well as synthesized datasets. Biological evaluation was performed using gene set enrichment analysis and known gene relationships in literature.

RESULTS

We show that Juxtapose is capable of globally aligning synthesized networks as well as identifying areas that are conserved in real gene co-expression networks without reliance on external biological information. Furthermore, output from a matching algorithm that uses cosine distance between GCN embeddings is shown to be an informative measure of similarity that reflects the amount of topological similarity between networks.

CONCLUSIONS

Juxtapose can be used to align GCNs without relying on known biological similarities and enables post-hoc analyses using biological parameters, such as orthology of genes, or conserved or variable pathways.

AVAILABILITY

A development version of the software used in this paper is available at https://github.com/klovens/juxtapose.

摘要

背景

由于基因共表达网络(GCN)的结构复杂,它们不容易进行比较。在本文中,我们提出了一个名为 Juxtapose 的工具以及一些相似性度量标准,可用于在一组生物体之间进行比较转录组学。虽然我们专注于将其应用于进化研究中比较物种间的共表达网络,但 Juxtapose 也可推广到同一物种内比较组织或条件的共表达网络。

方法

我们利用自然语言处理中常用的词嵌入策略,根据在 GCN 中进行的游走生成基因嵌入。根据其在一致嵌入网络中合成结构节点的能力以及生成具有生物学意义的结果的能力来评估 Juxtapose。本研究中提出的技术的评估利用了 GTEx 的 RNA-seq 数据集,该数据集来自 Gene Expression Omnibus 的前额叶皮质样本的多物种实验,以及合成数据集。使用基因集富集分析和文献中的已知基因关系进行生物学评估。

结果

我们表明,Juxtapose 能够全局对齐合成网络,并识别真实基因共表达网络中保守的区域,而无需依赖外部生物学信息。此外,使用 GCN 嵌入之间余弦距离的匹配算法的输出被证明是一种信息量的相似性度量,反映了网络之间拓扑相似性的程度。

结论

Juxtapose 可以在不依赖已知生物学相似性的情况下对齐 GCN,并支持使用生物学参数(例如基因的同源性)或保守或可变途径进行事后分析。

可用性

本文中使用的软件的开发版本可在 https://github.com/klovens/juxtapose 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/eb95b7b4ae8b/12859_2021_4055_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/1485ef319a1c/12859_2021_4055_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/986197f55461/12859_2021_4055_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/bec517d455f3/12859_2021_4055_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/f982607fda72/12859_2021_4055_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/c77f5cfe92d8/12859_2021_4055_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/e5479ad3e28f/12859_2021_4055_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/c348b91361b3/12859_2021_4055_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/ceca5fe3b47a/12859_2021_4055_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/eb95b7b4ae8b/12859_2021_4055_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/1485ef319a1c/12859_2021_4055_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/986197f55461/12859_2021_4055_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/bec517d455f3/12859_2021_4055_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/f982607fda72/12859_2021_4055_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/c77f5cfe92d8/12859_2021_4055_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/e5479ad3e28f/12859_2021_4055_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/c348b91361b3/12859_2021_4055_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/ceca5fe3b47a/12859_2021_4055_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c9/7968242/eb95b7b4ae8b/12859_2021_4055_Fig9_HTML.jpg

相似文献

1
Juxtapose: a gene-embedding approach for comparing co-expression networks.并列:一种用于比较共表达网络的基因嵌入方法。
BMC Bioinformatics. 2021 Mar 16;22(1):125. doi: 10.1186/s12859-021-04055-1.
2
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
3
The Performance Comparison of Gene Co-expression Networks of Breast and Prostate Cancer using Different Selection Criteria.基于不同选择标准的乳腺癌和前列腺癌基因共表达网络性能比较。
Interdiscip Sci. 2021 Sep;13(3):500-510. doi: 10.1007/s12539-021-00440-9. Epub 2021 May 18.
4
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.额外的k均值聚类步骤改善了WGCNA基因共表达网络的生物学特征。
BMC Syst Biol. 2017 Apr 12;11(1):47. doi: 10.1186/s12918-017-0420-6.
5
VCNet: vector-based gene co-expression network construction and its application to RNA-seq data.VCNet:基于向量的基因共表达网络构建及其在 RNA-seq 数据中的应用。
Bioinformatics. 2017 Jul 15;33(14):2173-2181. doi: 10.1093/bioinformatics/btx131.
6
Node similarity-based graph convolution for link prediction in biological networks.基于节点相似度的生物网络链路预测图卷积
Bioinformatics. 2021 Dec 7;37(23):4501-4508. doi: 10.1093/bioinformatics/btab464.
7
Co-expression Networks in Predicting Transcriptional Gene Regulation.共表达网络在预测转录基因调控中的应用。
Methods Mol Biol. 2021;2328:1-11. doi: 10.1007/978-1-0716-1534-8_1.
8
Joint embedding of biological networks for cross-species functional alignment.生物网络的联合嵌入用于跨物种功能对齐。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad529.
9
wTO: an R package for computing weighted topological overlap and a consensus network with integrated visualization tool.wTO:一个用于计算加权拓扑重叠和共识网络的 R 包,具有集成的可视化工具。
BMC Bioinformatics. 2018 Oct 24;19(1):392. doi: 10.1186/s12859-018-2351-7.
10
Centralization Within Sub-Experiments Enhances the Biological Relevance of Gene Co-expression Networks: A Plant Mitochondrial Case Study.子实验中的集中化增强了基因共表达网络的生物学相关性:一个植物线粒体案例研究。
Front Plant Sci. 2020 Jun 4;11:524. doi: 10.3389/fpls.2020.00524. eCollection 2020.

引用本文的文献

1
Bridging Large Language Models and Single-Cell Transcriptomics in Dissecting Selective Motor Neuron Vulnerability.在剖析选择性运动神经元易损性方面连接大语言模型与单细胞转录组学
ArXiv. 2025 May 12:arXiv:2505.07896v1.
2
An artificial intelligence-based model for prediction of clonal hematopoiesis variants in cell-free DNA samples.一种基于人工智能的模型,用于预测游离DNA样本中的克隆性造血变异。
NPJ Precis Oncol. 2025 May 20;9(1):147. doi: 10.1038/s41698-025-00921-w.
3
Biclustering data analysis: a comprehensive survey.双聚类数据分析:全面综述。

本文引用的文献

1
ManiNetCluster: a novel manifold learning approach to reveal the functional links between gene networks.ManiNetCluster:一种新的流形学习方法,用于揭示基因网络之间的功能联系。
BMC Genomics. 2019 Dec 30;20(Suppl 12):1003. doi: 10.1186/s12864-019-6329-2.
2
Embryo polarity in moth flies and mosquitoes relies on distinct old genes with localized transcript isoforms.蛾类和蚊子的胚胎极性依赖于具有局部转录异构体的不同旧基因。
Elife. 2019 Oct 8;8:e46711. doi: 10.7554/eLife.46711.
3
Network Medicine in the Age of Biomedical Big Data.生物医学大数据时代的网络医学
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae342.
4
Accurately modeling biased random walks on weighted networks using node2vec.使用 node2vec 准确建模加权网络上有偏随机游走。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad047.
5
Gene Co-Expression Network Tools and Databases for Crop Improvement.用于作物改良的基因共表达网络工具和数据库
Plants (Basel). 2022 Jun 21;11(13):1625. doi: 10.3390/plants11131625.
6
Translational Informatics for Natural Products as Antidepressant Agents.天然产物作为抗抑郁剂的转化信息学
Front Cell Dev Biol. 2022 Jan 20;9:738838. doi: 10.3389/fcell.2021.738838. eCollection 2021.
7
Silver: Forging almost Gold Standard Datasets.银:锻造近乎黄金标准数据集。
Genes (Basel). 2021 Sep 28;12(10):1523. doi: 10.3390/genes12101523.
8
Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of Evolution.基因共表达网络的比较分析:在进化研究中的实现与应用
Front Genet. 2021 Aug 13;12:695399. doi: 10.3389/fgene.2021.695399. eCollection 2021.
Front Genet. 2019 Apr 11;10:294. doi: 10.3389/fgene.2019.00294. eCollection 2019.
4
Functional protein representations from biological networks enable diverse cross-species inference.从生物网络中提取功能蛋白质表示可以实现跨物种的多样化推断。
Nucleic Acids Res. 2019 May 21;47(9):e51. doi: 10.1093/nar/gkz132.
5
Gene2vec: distributed representation of genes based on co-expression.Gene2vec:基于共表达的基因分布式表示。
BMC Genomics. 2019 Feb 4;20(Suppl 1):82. doi: 10.1186/s12864-018-5370-x.
6
Embedding of Genes Using Cancer Gene Expression Data: Biological Relevance and Potential Application on Biomarker Discovery.利用癌症基因表达数据进行基因嵌入:生物学相关性及在生物标志物发现中的潜在应用
Front Genet. 2019 Jan 4;9:682. doi: 10.3389/fgene.2018.00682. eCollection 2018.
7
Multiple model species selection for transcriptomics analysis of non-model organisms.多种模式生物选择用于非模式生物的转录组学分析。
BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):284. doi: 10.1186/s12859-018-2278-z.
8
G2Vec: Distributed gene representations for identification of cancer prognostic genes.G2Vec:用于识别癌症预后基因的分布式基因表示。
Sci Rep. 2018 Sep 13;8(1):13729. doi: 10.1038/s41598-018-32180-0.
9
The last universal common ancestor between ancient Earth chemistry and the onset of genetics.古老地球化学和遗传学起源之间的最后一个普遍共同祖先。
PLoS Genet. 2018 Aug 16;14(8):e1007518. doi: 10.1371/journal.pgen.1007518. eCollection 2018 Aug.
10
Identification of common coexpression modules based on quantitative network comparison.基于定量网络比较的共同共表达模块的鉴定。
BMC Bioinformatics. 2018 Jun 13;19(Suppl 8):213. doi: 10.1186/s12859-018-2193-3.