• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ExperimentSubset: an R package to manage subsets of Bioconductor Experiment objects.ExperimentSubset:一个用于管理 Bioconductor Experiment 对象子集的 R 包。
Bioinformatics. 2021 Sep 29;37(18):3058-3060. doi: 10.1093/bioinformatics/btab179.
2
Genetic association testing using the GENESIS R/Bioconductor package.使用 GENESIS R/Bioconductor 包进行遗传关联测试。
Bioinformatics. 2019 Dec 15;35(24):5346-5348. doi: 10.1093/bioinformatics/btz567.
3
runibic: a Bioconductor package for parallel row-based biclustering of gene expression data.runibic:一个用于基因表达数据的基于行的并行双向聚类的 Bioconductor 包。
Bioinformatics. 2018 Dec 15;34(24):4302-4304. doi: 10.1093/bioinformatics/bty512.
4
GenVisR: Genomic Visualizations in R.GenVisR:R语言中的基因组可视化
Bioinformatics. 2016 Oct 1;32(19):3012-4. doi: 10.1093/bioinformatics/btw325. Epub 2016 Jun 10.
5
MoleculeExperiment enables consistent infrastructure for molecule-resolved spatial omics data in bioconductor.分子实验为生物信息学中的分子分辨率空间组学数据提供了一致的基础设施。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad550.
6
rGREAT: an R/bioconductor package for functional enrichment on genomic regions.rGREAT:一个用于基因组区域功能富集的 R/bioconductor 包。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac745.
7
The Risa R/Bioconductor package: integrative data analysis from experimental metadata and back again.Risa R/Bioconductor 包:从实验元数据到实验结果的综合数据分析。
BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S11. doi: 10.1186/1471-2105-15-S1-S11. Epub 2014 Jan 10.
8
Streamlined analysis of LINCS L1000 data with the slinky package for R.使用 R 语言中的 slinky 包简化 LINCS L1000 数据的分析。
Bioinformatics. 2019 Sep 1;35(17):3176-3177. doi: 10.1093/bioinformatics/btz002.
9
DNAshapeR: an R/Bioconductor package for DNA shape prediction and feature encoding.DNAshapeR:一个用于DNA形状预测和特征编码的R/Bioconductor软件包。
Bioinformatics. 2016 Apr 15;32(8):1211-3. doi: 10.1093/bioinformatics/btv735. Epub 2015 Dec 14.
10
projectR: an R/Bioconductor package for transfer learning via PCA, NMF, correlation and clustering.projectR:一个用于通过 PCA、NMF、相关性和聚类进行迁移学习的 R/Bioconductor 包。
Bioinformatics. 2020 Jun 1;36(11):3592-3593. doi: 10.1093/bioinformatics/btaa183.

引用本文的文献

1
Crosstalk Between Gastric Cancer and Adjacent Mucosa Reveals EDN1-EDNRA-Mediated Regulation of Cancer Stemness and Immunomodulation Networks.胃癌与邻近黏膜之间的串扰揭示了内皮素-1-内皮素受体A介导的癌症干性和免疫调节网络调控
J Cell Mol Med. 2025 Apr;29(8):e70547. doi: 10.1111/jcmm.70547.
2
Identify novel therapeutic targets for type II diabetes and periodontitis: insights from single-cell analysis and Mendelian randomization analysis.鉴定二型糖尿病和牙周炎的新型治疗靶点:单细胞分析和孟德尔随机分析的见解。
Front Endocrinol (Lausanne). 2024 Oct 31;15:1410537. doi: 10.3389/fendo.2024.1410537. eCollection 2024.
3
MAMS: matrix and analysis metadata standards to facilitate harmonization and reproducibility of single-cell data.MAMS:矩阵和分析元数据标准,以促进单细胞数据的协调和可重复性。
Genome Biol. 2024 Aug 1;25(1):205. doi: 10.1186/s13059-024-03349-w.
4
Interactive analysis of single-cell data using flexible workflows with SCTK2.使用SCTK2灵活工作流程对单细胞数据进行交互式分析。
Patterns (N Y). 2023 Aug 3;4(8):100814. doi: 10.1016/j.patter.2023.100814. eCollection 2023 Aug 11.
5
TidyMass an object-oriented reproducible analysis framework for LC-MS data.TidyMass:一种面向对象的、可重现的 LC-MS 数据分析框架。
Nat Commun. 2022 Jul 28;13(1):4365. doi: 10.1038/s41467-022-32155-w.

本文引用的文献

1
TreeSummarizedExperiment: a S4 class for data with hierarchical structure.树状 SummarizedExperiment:具有层次结构的数据的 S4 类。
F1000Res. 2020 Oct 15;9:1246. doi: 10.12688/f1000research.26669.2. eCollection 2020.
2
Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data.单细胞 RNA-seq 数据中高变异基因发现工具的评估。
Brief Bioinform. 2019 Jul 19;20(4):1583-1589. doi: 10.1093/bib/bby011.
3
Software for the Integration of Multiomics Experiments in Bioconductor.用于在生物导体中整合多组学实验的软件。
Cancer Res. 2017 Nov 1;77(21):e39-e42. doi: 10.1158/0008-5472.CAN-17-0344.
4
A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.生物医学研究与临床应用单细胞RNA测序实用指南。
Genome Med. 2017 Aug 18;9(1):75. doi: 10.1186/s13073-017-0467-4.
5
High-throughput sequencing technologies.高通量测序技术
Mol Cell. 2015 May 21;58(4):586-97. doi: 10.1016/j.molcel.2015.05.004.
6
Synthetic spike-in standards for RNA-seq experiments.用于 RNA-seq 实验的合成 Spike-in 标准品。
Genome Res. 2011 Sep;21(9):1543-51. doi: 10.1101/gr.121095.111. Epub 2011 Aug 4.

ExperimentSubset:一个用于管理 Bioconductor Experiment 对象子集的 R 包。

ExperimentSubset: an R package to manage subsets of Bioconductor Experiment objects.

机构信息

Department of Computer Science, National Textile University, Faisalabad 37610, Pakistan.

Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA.

出版信息

Bioinformatics. 2021 Sep 29;37(18):3058-3060. doi: 10.1093/bioinformatics/btab179.

DOI:10.1093/bioinformatics/btab179
PMID:33715007
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9940906/
Abstract

MOTIVATION

R Experiment objects such as the SummarizedExperiment or SingleCellExperiment are data containers for storing one or more matrix-like assays along with associated row and column data. These objects have been used to facilitate the storage and analysis of high-throughput genomic data generated from technologies such as single-cell RNA sequencing. One common computational task in many genomics analysis workflows is to perform subsetting of the data matrix before applying down-stream analytical methods. For example, one may need to subset the columns of the assay matrix to exclude poor-quality samples or subset the rows of the matrix to select the most variable features. Traditionally, a second object is created that contains the desired subset of assay from the original object. However, this approach is inefficient as it requires the creation of an additional object containing a copy of the original assay and leads to challenges with data provenance.

RESULTS

To overcome these challenges, we developed an R package called ExperimentSubset, which is a data container that implements classes for efficient storage and streamlined retrieval of assays that have been subsetted by rows and/or columns. These classes are able to inherently provide data provenance by maintaining the relationship between the subsetted and parent assays. We demonstrate the utility of this package on a single-cell RNA-seq dataset by storing and retrieving subsets at different stages of the analysis while maintaining a lower memory footprint. Overall, the ExperimentSubset is a flexible container for the efficient management of subsets.

AVAILABILITY AND IMPLEMENTATION

ExperimentSubset package is available at Bioconductor: https://bioconductor.org/packages/ExperimentSubset/ and Github: https://github.com/campbio/ExperimentSubset.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

R 实验对象,如 SummarizedExperiment 或 SingleCellExperiment,是用于存储一个或多个类似矩阵的检测结果以及相关的行和列数据的数据容器。这些对象已被用于促进从单细胞 RNA 测序等技术生成的高通量基因组数据的存储和分析。在许多基因组学分析工作流程中,一个常见的计算任务是在应用下游分析方法之前对数据矩阵进行子集划分。例如,可能需要排除检测矩阵列中的劣质样本,或者选择矩阵行中的最具变化的特征。传统上,会创建第二个对象,该对象包含原始对象中所需的检测子集。然而,这种方法效率低下,因为它需要创建一个包含原始检测副本的额外对象,并且会导致数据来源的挑战。

结果

为了克服这些挑战,我们开发了一个名为 ExperimentSubset 的 R 包,它是一个数据容器,实现了用于高效存储和简化检索的类,这些类通过行和/或列对子集进行了子集划分。这些类能够通过维护子集中和父检测之间的关系来固有地提供数据来源。我们在单细胞 RNA-seq 数据集上演示了该包的实用性,通过在分析的不同阶段存储和检索子集,同时保持较低的内存占用。总体而言,ExperimentSubset 是高效管理子集的灵活容器。

可用性和实现

ExperimentSubset 包可在 Bioconductor:https://bioconductor.org/packages/ExperimentSubset/ 和 Github:https://github.com/campbio/ExperimentSubset 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。