• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

easyMF:一个基于矩阵分解的大规模转录组数据基因发现的网络平台。

easyMF: A Web Platform for Matrix Factorization-Based Gene Discovery from Large-scale Transcriptome Data.

机构信息

State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest Agriculture and Forestry University, Yangling, 712100, China.

Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest Agriculture and Forestry University, Yangling, 712100, China.

出版信息

Interdiscip Sci. 2022 Sep;14(3):746-758. doi: 10.1007/s12539-022-00522-2. Epub 2022 May 18.

DOI:10.1007/s12539-022-00522-2
PMID:35585280
Abstract

With the development of high-throughput experimental technologies, large-scale RNA sequencing (RNA-Seq) data have been and continue to be produced, but have led to challenges in extracting relevant biological knowledge hidden in the produced high-dimensional gene expression matrices. Here, we develop easyMF ( https://github.com/cma2015/easyMF ), a web platform that can facilitate functional gene discovery from large-scale transcriptome data using matrix factorization (MF) algorithms. Compared with existing MF-based software packages, easyMF exhibits several promising features, such as greater functionality, flexibility and ease of use. The easyMF platform is equipped using the Big-Data-supported Galaxy system with user-friendly graphic user interfaces, allowing users with little programming experience to streamline transcriptome analysis from raw reads to gene expression, carry out multiple-scenario MF analysis, and perform multiple-way MF-based gene discovery. easyMF is also powered with the advanced packing technology to enhance ease of use under different operating systems and computational environments. We illustrated the application of easyMF for seed gene discovery from temporal, spatial, and integrated RNA-Seq datasets of maize (Zea mays L.), resulting in the identification of 3,167 seed stage-specific, 1,849 seed compartment-specific, and 774 seed-specific genes, respectively. The present results also indicated that easyMF can prioritize seed-related genes with superior prediction performance over the state-of-art network-based gene prioritization system MaizeNet. As a modular, containerized and open-source platform, easyMF can be further customized to satisfy users' specific demands of functional gene discovery and deployed as a web service for broad applications.

摘要

随着高通量实验技术的发展,大规模 RNA 测序 (RNA-Seq) 数据已经并将继续产生,但这也给从产生的高维基因表达矩阵中提取隐藏的相关生物知识带来了挑战。在这里,我们开发了 easyMF(https://github.com/cma2015/easyMF),这是一个使用矩阵分解 (MF) 算法从大规模转录组数据中促进功能基因发现的网络平台。与现有的基于 MF 的软件包相比,easyMF 具有几个有前途的功能,如更大的功能、灵活性和易用性。easyMF 平台配备了支持大数据的 Galaxy 系统,具有用户友好的图形用户界面,允许具有少量编程经验的用户简化从原始读取到基因表达的转录组分析,进行多种情景 MF 分析,并进行基于 MF 的多种方式基因发现。easyMF 还采用了先进的包装技术,增强了在不同操作系统和计算环境下的易用性。我们展示了 easyMF 在玉米(Zea mays L.)的时间、空间和综合 RNA-Seq 数据集的种子基因发现中的应用,分别鉴定了 3167 个种子阶段特异性、1849 个种子区室特异性和 774 个种子特异性基因。目前的结果还表明,easyMF 可以优先考虑与种子相关的基因,其预测性能优于最先进的基于网络的基因优先级系统 MaizeNet。作为一个模块化、容器化和开源平台,easyMF 可以进一步定制以满足用户对功能基因发现的特定需求,并作为网络服务进行广泛应用。

相似文献

1
easyMF: A Web Platform for Matrix Factorization-Based Gene Discovery from Large-scale Transcriptome Data.easyMF:一个基于矩阵分解的大规模转录组数据基因发现的网络平台。
Interdiscip Sci. 2022 Sep;14(3):746-758. doi: 10.1007/s12539-022-00522-2. Epub 2022 May 18.
2
RNA-Seq in Nonmodel Organisms.非模式生物的 RNA-Seq。
Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8.
3
CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.CAFU:一个用于探索未映射RNA测序数据的Galaxy框架。
Brief Bioinform. 2020 Mar 23;21(2):676-686. doi: 10.1093/bib/bbz018.
4
MaizeNet: a co-functional network for network-assisted systems genetics in Zea mays.玉米网络:一个用于玉米网络辅助系统遗传学的共功能网络。
Plant J. 2019 Aug;99(3):571-582. doi: 10.1111/tpj.14341. Epub 2019 May 16.
5
UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。
Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.
6
NASQAR: a web-based platform for high-throughput sequencing data analysis and visualization.NASQAR:一个用于高通量测序数据分析和可视化的基于网络的平台。
BMC Bioinformatics. 2020 Jun 29;21(1):267. doi: 10.1186/s12859-020-03577-4.
7
Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA.转录器:一种用于注释组装读段和识别非编码RNA的自动化计算流程。
PLoS One. 2015 Nov 18;10(11):e0140268. doi: 10.1371/journal.pone.0140268. eCollection 2015.
8
ChimeRScope: a novel alignment-free algorithm for fusion transcript prediction using paired-end RNA-Seq data.ChimeRScope:一种使用双端RNA测序数据进行融合转录本预测的新型无比对算法。
Nucleic Acids Res. 2017 Jul 27;45(13):e120. doi: 10.1093/nar/gkx315.
9
MD-ALL: an integrative platform for molecular diagnosis of B-acute lymphoblastic leukemia.MD-ALL:B 急性淋巴细胞白血病分子诊断的综合平台。
Haematologica. 2024 Jun 1;109(6):1741-1754. doi: 10.3324/haematol.2023.283706.
10
QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.QuickRNASeq将大规模RNA测序数据分析提升到了一个新的自动化和交互式可视化水平。
BMC Genomics. 2016 Jan 8;17:39. doi: 10.1186/s12864-015-2356-9.

引用本文的文献

1
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
2
A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.植物防御中人工智能辅助组学技术综述:当前趋势与未来方向
Front Plant Sci. 2024 Mar 5;15:1292054. doi: 10.3389/fpls.2024.1292054. eCollection 2024.
3
HetFCM: functional co-module discovery by heterogeneous network co-clustering.

本文引用的文献

1
Activation of Neural Modeling-Related Genes in the Heart of Mice after Gamma Irradiation.γ 射线辐照后小鼠心脏中与神经建模相关基因的激活。
Comput Math Methods Med. 2021 Dec 31;2021:8522417. doi: 10.1155/2021/8522417. eCollection 2021.
2
Rab26 suppresses migration and invasion of breast cancer cells through mediating autophagic degradation of phosphorylated Src.Rab26 通过介导磷酸化Src 的自噬降解来抑制乳腺癌细胞的迁移和侵袭。
Cell Death Dis. 2021 Mar 17;12(4):284. doi: 10.1038/s41419-021-03561-7.
3
Exploring transcriptional switches from pairwise, temporal and population RNA-Seq data using deepTS.
HetFCM:基于异质网络共聚类的功能共模块发现。
Nucleic Acids Res. 2024 Feb 9;52(3):e16. doi: 10.1093/nar/gkad1174.
使用 deepTS 从成对、时间和群体 RNA-Seq 数据中探索转录开关。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa137.
4
Multiview learning for understanding functional multiomics.多视图学习理解功能多组学。
PLoS Comput Biol. 2020 Apr 2;16(4):e1007677. doi: 10.1371/journal.pcbi.1007677. eCollection 2020 Apr.
5
Transcriptomics at Maize Embryo/Endosperm Interfaces Identifies a Transcriptionally Distinct Endosperm Subdomain Adjacent to the Embryo Scutellum.转录组学在玉米胚乳/胚珠界面鉴定出一个转录上明显不同的胚乳亚区,与胚盾相邻。
Plant Cell. 2020 Apr;32(4):833-852. doi: 10.1105/tpc.19.00756. Epub 2020 Feb 21.
6
Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets.独立成分分析在癌症组学数据集复杂性研究中的应用
Int J Mol Sci. 2019 Sep 7;20(18):4414. doi: 10.3390/ijms20184414.
7
Maize GO Annotation-Methods, Evaluation, and Review (maize-GAMER).玉米基因本体注释——方法、评估与综述(玉米-GAMER)
Plant Direct. 2018 Apr 11;2(4):e00052. doi: 10.1002/pld3.52. eCollection 2018 Apr.
8
Gene expression across mammalian organ development.哺乳动物器官发育过程中的基因表达。
Nature. 2019 Jul;571(7766):505-509. doi: 10.1038/s41586-019-1338-5. Epub 2019 Jun 26.
9
Developmental dynamics of lncRNAs across mammalian organs and species.长链非编码 RNA 在哺乳动物器官和物种中的发育动态。
Nature. 2019 Jul;571(7766):510-514. doi: 10.1038/s41586-019-1341-x. Epub 2019 Jun 26.
10
pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components.pcaExplorer:一个用于与 RNA-seq 主成分交互的 R/Bioconductor 包。
BMC Bioinformatics. 2019 Jun 13;20(1):331. doi: 10.1186/s12859-019-2879-1.