• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Bipartite tight spectral clustering (BiTSC) algorithm for identifying conserved gene co-clusters in two species.用于在两个物种中识别保守基因共聚类的双部分紧谱聚类(BiTSC)算法。
Bioinformatics. 2021 Jun 9;37(9):1225-1233. doi: 10.1093/bioinformatics/btaa741.
2
A cross-species bi-clustering approach to identifying conserved co-regulated genes.一种用于识别保守共调控基因的跨物种双聚类方法。
Bioinformatics. 2016 Jun 15;32(12):i137-i146. doi: 10.1093/bioinformatics/btw278.
3
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
4
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.TimesVector:一种用于分析来自多种表型的时间序列转录组数据的向量化聚类方法。
Bioinformatics. 2017 Dec 1;33(23):3827-3835. doi: 10.1093/bioinformatics/btw780.
5
OrthoClust: an orthology-based network framework for clustering data across multiple species.OrthoClust:一种基于直系同源关系的网络框架,用于跨多个物种对数据进行聚类。
Genome Biol. 2014 Aug 28;15(8):R100. doi: 10.1186/gb-2014-15-8-r100.
6
Robust clustering of noisy high-dimensional gene expression data for patients subtyping.对噪声高维基因表达数据进行稳健聚类,以对患者进行亚型划分。
Bioinformatics. 2018 Dec 1;34(23):4064-4072. doi: 10.1093/bioinformatics/bty502.
7
A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusterings.一种用于比较和可视化层次化与平面化基因表达数据聚类之间关系的新算法。
Bioinformatics. 2005 Nov 1;21(21):3993-9. doi: 10.1093/bioinformatics/bti644. Epub 2005 Sep 1.
8
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.
9
Multi-task consensus clustering of genome-wide transcriptomes from related biological conditions.来自相关生物学条件的全基因组转录组的多任务共识聚类
Bioinformatics. 2016 May 15;32(10):1509-17. doi: 10.1093/bioinformatics/btw007. Epub 2016 Jan 21.
10
GMHCC: high-throughput analysis of biomolecular data using graph-based multiple hierarchical consensus clustering.GMHCC:基于图的多重层次共识聚类的生物分子数据的高通量分析。
Bioinformatics. 2022 May 26;38(11):3020-3028. doi: 10.1093/bioinformatics/btac290.

引用本文的文献

1
Critical factors influencing live birth rates in fresh embryo transfer for IVF: insights from cluster ensemble algorithms.影响体外受精新鲜胚胎移植活产率的关键因素:来自聚类集成算法的见解
Sci Rep. 2025 Jan 30;15(1):3734. doi: 10.1038/s41598-025-88210-1.
2
MATTE: a pipeline of transcriptome module alignment for anti-noise phenotype-gene-related analysis.MATTE:用于抗噪声表型-基因相关分析的转录组模块比对管道。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad207.
3
PseudotimeDE: inference of differential gene expression along cell pseudotime with well-calibrated p-values from single-cell RNA sequencing data.PseudotimeDE:从单细胞 RNA 测序数据中推断具有良好校准 p 值的细胞伪时间上的差异基因表达。
Genome Biol. 2021 Apr 29;22(1):124. doi: 10.1186/s13059-021-02341-y.

本文引用的文献

1
PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools.PANTHER 版本 14:更多基因组、一个新的 PANTHER GO-slim 和富集分析工具的改进。
Nucleic Acids Res. 2019 Jan 8;47(D1):D419-D426. doi: 10.1093/nar/gky1038.
2
An RNA-Seq atlas of gene expression in mouse and rat normal tissues.鼠正常组织基因表达的 RNA-Seq 图谱。
Sci Data. 2017 Dec 12;4:170185. doi: 10.1038/sdata.2017.185.
3
Cross-Species Analysis of Gene Expression and Function in Prefrontal Cortex, Hippocampus and Striatum.前额叶皮质、海马体和纹状体中基因表达与功能的跨物种分析
PLoS One. 2016 Oct 7;11(10):e0164295. doi: 10.1371/journal.pone.0164295. eCollection 2016.
4
A cross-species bi-clustering approach to identifying conserved co-regulated genes.一种用于识别保守共调控基因的跨物种双聚类方法。
Bioinformatics. 2016 Jun 15;32(12):i137-i146. doi: 10.1093/bioinformatics/btw278.
5
Meta-analysis of RNA-seq expression data across species, tissues and studies.跨物种、组织和研究的RNA测序表达数据的荟萃分析。
Genome Biol. 2015 Dec 22;16:287. doi: 10.1186/s13059-015-0853-4.
6
OrthoClust: an orthology-based network framework for clustering data across multiple species.OrthoClust:一种基于直系同源关系的网络框架,用于跨多个物种对数据进行聚类。
Genome Biol. 2014 Aug 28;15(8):R100. doi: 10.1186/gb-2014-15-8-r100.
7
Comparative analysis of the transcriptome across distant species.跨远缘物种的转录组比较分析。
Nature. 2014 Aug 28;512(7515):445-8. doi: 10.1038/nature13424.
8
Efficiently inferring community structure in bipartite networks.高效推断二分网络中的社区结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Jul;90(1):012805. doi: 10.1103/PhysRevE.90.012805. Epub 2014 Jul 10.
9
MAGNA: Maximizing Accuracy in Global Network Alignment.MAGNA:最大化全球网络对齐的准确性。
Bioinformatics. 2014 Oct 15;30(20):2931-40. doi: 10.1093/bioinformatics/btu409. Epub 2014 Jul 10.
10
Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA-seq data.通过modENCODE RNA测序数据对黑腹果蝇和秀丽隐杆线虫的发育阶段、组织和细胞进行比较。
Genome Res. 2014 Jul;24(7):1086-101. doi: 10.1101/gr.170100.113.

用于在两个物种中识别保守基因共聚类的双部分紧谱聚类(BiTSC)算法。

Bipartite tight spectral clustering (BiTSC) algorithm for identifying conserved gene co-clusters in two species.

机构信息

Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA.

Department of Human Genetics, University of California, Los Angeles, CA 90095-7088, USA.

出版信息

Bioinformatics. 2021 Jun 9;37(9):1225-1233. doi: 10.1093/bioinformatics/btaa741.

DOI:10.1093/bioinformatics/btaa741
PMID:32814973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8599197/
Abstract

MOTIVATION

Gene clustering is a widely used technique that has enabled computational prediction of unknown gene functions within a species. However, it remains a challenge to refine gene function prediction by leveraging evolutionarily conserved genes in another species. This challenge calls for a new computational algorithm to identify gene co-clusters in two species, so that genes in each co-cluster exhibit similar expression levels in each species and strong conservation between the species.

RESULTS

Here, we develop the bipartite tight spectral clustering (BiTSC) algorithm, which identifies gene co-clusters in two species based on gene orthology information and gene expression data. BiTSC novelly implements a formulation that encodes gene orthology as a bipartite network and gene expression data as node covariates. This formulation allows BiTSC to adopt and combine the advantages of multiple unsupervised learning techniques: kernel enhancement, bipartite spectral clustering, consensus clustering, tight clustering and hierarchical clustering. As a result, BiTSC is a flexible and robust algorithm capable of identifying informative gene co-clusters without forcing all genes into co-clusters. Another advantage of BiTSC is that it does not rely on any distributional assumptions. Beyond cross-species gene co-clustering, BiTSC also has wide applications as a general algorithm for identifying tight node co-clusters in any bipartite network with node covariates. We demonstrate the accuracy and robustness of BiTSC through comprehensive simulation studies. In a real data example, we use BiTSC to identify conserved gene co-clusters of Drosophila melanogaster and Caenorhabditis elegans, and we perform a series of downstream analysis to both validate BiTSC and verify the biological significance of the identified co-clusters.

AVAILABILITY AND IMPLEMENTATION

The Python package BiTSC is open-access and available at https://github.com/edensunyidan/BiTSC.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基因聚类是一种广泛使用的技术,它使我们能够在一个物种内计算预测未知基因的功能。然而,利用另一个物种中进化保守的基因来完善基因功能预测仍然是一个挑战。这一挑战需要一种新的计算算法来识别两个物种中的基因共聚类,以使每个共聚类中的基因在两个物种中的表达水平相似,并且在两个物种之间具有很强的保守性。

结果

在这里,我们开发了二分紧谱聚类(BiTSC)算法,该算法基于基因直系同源信息和基因表达数据来识别两个物种中的基因共聚类。BiTSC 新颖地实现了一种将基因直系同源编码为二分网络和节点协变量的公式。这种公式允许 BiTSC 采用和结合多种无监督学习技术的优势:核增强、二分谱聚类、共识聚类、紧聚类和层次聚类。因此,BiTSC 是一种灵活且强大的算法,能够识别信息丰富的基因共聚类,而无需将所有基因强制放入共聚类中。BiTSC 的另一个优势是它不依赖于任何分布假设。除了跨物种基因共聚类之外,BiTSC 还作为一种通用算法,具有广泛的应用,可用于识别具有节点协变量的任何二分网络中的紧节点共聚类。我们通过全面的模拟研究证明了 BiTSC 的准确性和鲁棒性。在一个真实数据的例子中,我们使用 BiTSC 来识别黑腹果蝇和秀丽隐杆线虫的保守基因共聚类,并进行了一系列下游分析,以验证 BiTSC 的准确性和验证所识别的共聚类的生物学意义。

可用性和实现

Python 包 BiTSC 是开放访问的,可在 https://github.com/edensunyidan/BiTSC 上获得。

补充信息

补充数据可在生物信息学在线获得。