• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新的加权拟近邻正则化三因子化单类协同过滤算法:在转录因子靶基因预测中的应用。

A New Weighted Imputed Neighborhood-Regularized Tri-Factorization One-Class Collaborative Filtering Algorithm: Application to Target Gene Prediction of Transcription Factors.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):126-137. doi: 10.1109/TCBB.2020.2968442. Epub 2021 Feb 3.

DOI:10.1109/TCBB.2020.2968442
PMID:31995498
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7382975/
Abstract

Identifying target genes of transcription factors (TFs) is crucial to understand transcriptional regulation. However, our understanding of genome-wide TF targeting profile is limited due to the cost of large-scale experiments and intrinsic complexity of gene regulation. Thus, computational prediction methods are useful to predict unobserved TF-gene associations. Here, we develop a new Weighted Imputed Neighborhood-regularized Tri-Factorization one-class collaborative filtering algorithm, WINTF. It predicts unobserved target genes for TFs using known but noisy, incomplete, and biased TF-gene associations and protein-protein interaction networks. Our benchmark study shows that WINTF significantly outperforms its counterpart matrix factorization-based algorithms and tri-factorization methods that do not include weight, imputation, and neighbor-regularization, for TF-gene association prediction. When evaluated by independent datasets, accuracy is 37.8 percent on the top 495 predicted associations, an enrichment factor of 4.19 compared with random guess. Furthermore, many predicted novel associations are supported by literature evidence. Although we only use canonical TF-gene interaction data, WINTF can directly be applied to tissue-specific data when available. Thus, WINTF provides a potentially useful framework to integrate multiple omics data for further improvement of TF-gene prediction and applications to other sparse and noisy biological data. The benchmark dataset and source code are freely available at https://github.com/XieResearchGroup/WINTF.

摘要

鉴定转录因子(TFs)的靶基因对于理解转录调控至关重要。然而,由于大规模实验的成本和基因调控的内在复杂性,我们对全基因组 TF 靶向谱的理解是有限的。因此,计算预测方法对于预测未观察到的 TF-基因关联是有用的。在这里,我们开发了一种新的加权推断邻域正则化三因子化单类协同过滤算法 WINTF。它使用已知但存在噪声、不完整和有偏差的 TF-基因关联以及蛋白质-蛋白质相互作用网络来预测 TF 未观察到的靶基因。我们的基准研究表明,WINTF 在 TF-基因关联预测方面明显优于其基于矩阵分解的对应算法和不包括权重、推断和邻域正则化的三因子化方法。在独立数据集上进行评估时,前 495 个预测关联的准确率为 37.8%,与随机猜测相比富集因子为 4.19。此外,许多预测的新关联都得到了文献证据的支持。尽管我们仅使用规范的 TF-基因相互作用数据,但当有组织特异性数据可用时,WINTF 可以直接应用于该数据。因此,WINTF 为整合多种组学数据提供了一个潜在有用的框架,以进一步提高 TF-基因预测,并应用于其他稀疏和嘈杂的生物数据。基准数据集和源代码可在 https://github.com/XieResearchGroup/WINTF 上免费获得。

相似文献

1
A New Weighted Imputed Neighborhood-Regularized Tri-Factorization One-Class Collaborative Filtering Algorithm: Application to Target Gene Prediction of Transcription Factors.一种新的加权拟近邻正则化三因子化单类协同过滤算法:在转录因子靶基因预测中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):126-137. doi: 10.1109/TCBB.2020.2968442. Epub 2021 Feb 3.
2
Target Gene Prediction of Transcription Factor Using a New Neighborhood-regularized Tri-factorization One-class Collaborative Filtering Algorithm.基于新型邻域正则化三因子分解单类协同过滤算法的转录因子靶基因预测
ACM BCB. 2018 Aug-Sep;2018:1-10. doi: 10.1145/3233547.3233551.
3
REUNION: transcription factor binding prediction and regulatory association inference from single-cell multi-omics data.REUNION:从单细胞多组学数据中进行转录因子结合预测和调控关联推断。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i567-i575. doi: 10.1093/bioinformatics/btae234.
4
Generalized matrix factorization based on weighted hypergraph learning for microbe-drug association prediction.基于加权超图学习的广义矩阵分解用于微生物-药物关联预测。
Comput Biol Med. 2022 Jun;145:105503. doi: 10.1016/j.compbiomed.2022.105503. Epub 2022 Apr 8.
5
TF-Cluster: a pipeline for identifying functionally coordinated transcription factors via network decomposition of the shared coexpression connectivity matrix (SCCM).TF-Cluster:一种通过共享共表达连接矩阵(SCCM)的网络分解来识别功能协调转录因子的流程。
BMC Syst Biol. 2011 Apr 15;5:53. doi: 10.1186/1752-0509-5-53.
6
Properly defining the targets of a transcription factor significantly improves the computational identification of cooperative transcription factor pairs in yeast.正确定义转录因子的靶标可显著提高酵母中协同转录因子对的计算识别能力。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S10. doi: 10.1186/1471-2164-16-S12-S10. Epub 2015 Dec 9.
7
ANTENNA, a Multi-Rank, Multi-Layered Recommender System for Inferring Reliable Drug-Gene-Disease Associations: Repurposing Diazoxide as a Targeted Anti-Cancer Therapy.ANTENNA,一种用于推断可靠的药物-基因-疾病关联的多层级推荐系统:将二氮嗪重新用作靶向抗癌疗法。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1960-1967. doi: 10.1109/TCBB.2018.2812189. Epub 2018 Mar 16.
8
ChEA3: transcription factor enrichment analysis by orthogonal omics integration.ChEA3:通过正交组学整合进行转录因子富集分析。
Nucleic Acids Res. 2019 Jul 2;47(W1):W212-W224. doi: 10.1093/nar/gkz446.
9
Neighborhood Regularized Logistic Matrix Factorization for Drug-Target Interaction Prediction.用于药物-靶点相互作用预测的邻域正则化逻辑矩阵分解
PLoS Comput Biol. 2016 Feb 12;12(2):e1004760. doi: 10.1371/journal.pcbi.1004760. eCollection 2016 Feb.
10
Trimming of mammalian transcriptional networks using network component analysis.使用网络组件分析修剪哺乳动物转录网络。
BMC Bioinformatics. 2010 Oct 13;11:511. doi: 10.1186/1471-2105-11-511.

引用本文的文献

1
SeEn: Sequential enriched datasets for sequence-aware recommendations.SeEn:用于序列感知推荐的连续丰富数据集。
Sci Data. 2022 Aug 4;9(1):478. doi: 10.1038/s41597-022-01598-7.