• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ReCount:一个可分析的 RNA-seq 基因计数数据集的多实验资源。

ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets.

机构信息

Department of Biostatistics, The Johns Hopkins University Bloomberg School of Public Health, 615 North Wolfe Street, Baltimore, MD 21205, USA.

出版信息

BMC Bioinformatics. 2011 Nov 16;12:449. doi: 10.1186/1471-2105-12-449.

DOI:10.1186/1471-2105-12-449
PMID:22087737
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3229291/
Abstract

BACKGROUND

RNA sequencing is a flexible and powerful new approach for measuring gene, exon, or isoform expression. To maximize the utility of RNA sequencing data, new statistical methods are needed for clustering, differential expression, and other analyses. A major barrier to the development of new statistical methods is the lack of RNA sequencing datasets that can be easily obtained and analyzed in common statistical software packages such as R. To speed up the development process, we have created a resource of analysis-ready RNA-sequencing datasets. 2 DESCRIPTION: ReCount is an online resource of RNA-seq gene count tables and auxilliary data. Tables were built from raw RNA sequencing data from 18 different published studies comprising 475 samples and over 8 billion reads. Using the Myrna package, reads were aligned, overlapped with gene models and tabulated into gene-by-sample count tables that are ready for statistical analysis. Count tables and phenotype data were combined into Bioconductor ExpressionSet objects for ease of analysis. ReCount also contains the Myrna manifest files and R source code used to process the samples, allowing statistical and computational scientists to consider alternative parameter values. 3 CONCLUSIONS: By combining datasets from many studies and providing data that has already been processed from. fastq format into ready-to-use. RData and. txt files, ReCount facilitates analysis and methods development for RNA-seq count data. We anticipate that ReCount will also be useful for investigators who wish to consider cross-study comparisons and alternative normalization strategies for RNA-seq.

摘要

背景

RNA 测序是一种灵活且强大的新方法,可用于测量基因、外显子或异构体的表达。为了最大限度地利用 RNA 测序数据,需要新的统计方法来进行聚类、差异表达和其他分析。开发新统计方法的主要障碍是缺乏可在 R 等常见统计软件包中轻松获取和分析的 RNA 测序数据集。为了加快开发过程,我们创建了一个可分析的 RNA 测序数据集资源。

描述

ReCount 是一个在线 RNA-seq 基因计数表和辅助数据资源。表是从 18 项不同的已发表研究的原始 RNA 测序数据构建的,这些研究共包含 475 个样本和超过 80 亿个reads。使用 Myrna 包,将 reads 进行比对、与基因模型重叠,并将其制表为适合统计分析的基因样本计数表。将计数表和表型数据组合到 Bioconductor ExpressionSet 对象中,以便于分析。ReCount 还包含用于处理样本的 Myrna 清单文件和 R 源代码,允许统计和计算科学家考虑替代参数值。

结论

通过合并来自多个研究的数据集,并提供已从 fastq 格式处理为可用于. RData 和. txt 文件的数据集,ReCount 促进了 RNA-seq 计数数据的分析和方法开发。我们预计 ReCount 对于希望考虑跨研究比较和 RNA-seq 替代标准化策略的研究人员也将非常有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca0d/3229291/ca142211f19f/1471-2105-12-449-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca0d/3229291/157a938a9480/1471-2105-12-449-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca0d/3229291/ca142211f19f/1471-2105-12-449-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca0d/3229291/157a938a9480/1471-2105-12-449-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca0d/3229291/ca142211f19f/1471-2105-12-449-2.jpg

相似文献

1
ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets.ReCount:一个可分析的 RNA-seq 基因计数数据集的多实验资源。
BMC Bioinformatics. 2011 Nov 16;12:449. doi: 10.1186/1471-2105-12-449.
2
Cloud-scale RNA-sequencing differential expression analysis with Myrna.利用 Myrna 进行云规模 RNA-seq 差异表达分析。
Genome Biol. 2010;11(8):R83. doi: 10.1186/gb-2010-11-8-r83. Epub 2010 Aug 11.
3
JingleBells: A Repository of Immune-Related Single-Cell RNA-Sequencing Datasets.《铃儿响叮当》:一个免疫相关单细胞RNA测序数据集的储存库。
J Immunol. 2017 May 1;198(9):3375-3379. doi: 10.4049/jimmunol.1700272.
4
TCC: an R package for comparing tag count data with robust normalization strategies.TCC:一个用于比较标签计数数据的 R 包,带有稳健的归一化策略。
BMC Bioinformatics. 2013 Jul 9;14:219. doi: 10.1186/1471-2105-14-219.
5
OneStopRNAseq: A Web Application for Comprehensive and Efficient Analyses of RNA-Seq Data.OneStopRNAseq:一种用于 RNA-Seq 数据综合高效分析的网络应用程序。
Genes (Basel). 2020 Oct 2;11(10):1165. doi: 10.3390/genes11101165.
6
Polyester: simulating RNA-seq datasets with differential transcript expression.聚酯:模拟具有差异转录本表达的RNA测序数据集。
Bioinformatics. 2015 Sep 1;31(17):2778-84. doi: 10.1093/bioinformatics/btv272. Epub 2015 Apr 28.
7
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
8
recount3: summaries and queries for large-scale RNA-seq expression and splicing.recount3:大规模 RNA-seq 表达和剪接的摘要和查询。
Genome Biol. 2021 Nov 29;22(1):323. doi: 10.1186/s13059-021-02533-6.
9
deGPS is a powerful tool for detecting differential expression in RNA-sequencing studies.deGPS是一种用于在RNA测序研究中检测差异表达的强大工具。
BMC Genomics. 2015 Jun 13;16(1):455. doi: 10.1186/s12864-015-1676-0.
10
compcodeR--an R package for benchmarking differential expression methods for RNA-seq data.compcodeR——一个用于对RNA测序数据差异表达方法进行基准测试的R软件包。
Bioinformatics. 2014 Sep 1;30(17):2517-8. doi: 10.1093/bioinformatics/btu324. Epub 2014 May 9.

引用本文的文献

1
To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Overoptimism.调整还是不调整。利用基因集分析中的灵活性如何导致过度乐观。
Biom J. 2025 Feb;67(1):e70016. doi: 10.1002/bimj.70016.
2
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.
3
Sparse sliced inverse regression for high dimensional data analysis.用于高维数据分析的稀疏切片逆回归

本文引用的文献

1
Sequencing technology does not eliminate biological variability.测序技术并不能消除生物变异性。
Nat Biotechnol. 2011 Jul 11;29(7):572-3. doi: 10.1038/nbt.1910.
2
Evaluating gene expression in C57BL/6J and DBA/2J mouse striatum using RNA-Seq and microarrays.使用 RNA-Seq 和微阵列评估 C57BL/6J 和 DBA/2J 小鼠纹状体中的基因表达。
PLoS One. 2011 Mar 24;6(3):e17820. doi: 10.1371/journal.pone.0017820.
3
The developmental transcriptome of Drosophila melanogaster.黑腹果蝇的发育转录组。
BMC Bioinformatics. 2022 May 7;23(1):168. doi: 10.1186/s12859-022-04700-3.
4
Addressing the mean-correlation relationship in co-expression analysis.解决共表达分析中的均值-相关关系。
PLoS Comput Biol. 2022 Mar 30;18(3):e1009954. doi: 10.1371/journal.pcbi.1009954. eCollection 2022 Mar.
5
Comprehensive analysis of an immune infiltrate-related competitive endogenous RNA network reveals potential prognostic biomarkers for non-small cell lung cancer.免疫浸润相关竞争性内源 RNA 网络的综合分析揭示了非小细胞肺癌潜在的预后生物标志物。
PLoS One. 2021 Dec 2;16(12):e0260720. doi: 10.1371/journal.pone.0260720. eCollection 2021.
6
recount3: summaries and queries for large-scale RNA-seq expression and splicing.recount3:大规模 RNA-seq 表达和剪接的摘要和查询。
Genome Biol. 2021 Nov 29;22(1):323. doi: 10.1186/s13059-021-02533-6.
7
Targeting Tumor-Stromal IL6/STAT3 Signaling through IL1 Receptor Inhibition in Pancreatic Cancer.通过抑制白细胞介素 1 受体靶向胰腺癌中的肿瘤-基质 IL6/STAT3 信号传导。
Mol Cancer Ther. 2021 Nov;20(11):2280-2290. doi: 10.1158/1535-7163.MCT-21-0083. Epub 2021 Sep 13.
8
Direct long-read RNA sequencing identifies a subset of questionable exitrons likely arising from reverse transcription artifacts.直接长读 RNA 测序鉴定出一组可能源自逆转录伪迹的可疑外显子。
Genome Biol. 2021 Jun 28;22(1):190. doi: 10.1186/s13059-021-02411-1.
9
Time trajectories in the transcriptomic response to exercise - a meta-analysis.运动转录组反应的时间轨迹——一项荟萃分析。
Nat Commun. 2021 Jun 9;12(1):3471. doi: 10.1038/s41467-021-23579-x.
10
Inflated false discovery rate due to volcano plots: problem and solutions.由于火山图而导致的 inflated false discovery rate:问题与解决方案。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab053.
Nature. 2011 Mar 24;471(7339):473-9. doi: 10.1038/nature09715. Epub 2010 Dec 22.
4
NCBI GEO: archive for functional genomics data sets--10 years on.美国国立生物技术信息中心基因表达综合数据库:功能基因组数据集存档——十年回顾
Nucleic Acids Res. 2011 Jan;39(Database issue):D1005-10. doi: 10.1093/nar/gkq1184. Epub 2010 Nov 21.
5
The sequence read archive.序列读取存档库。
Nucleic Acids Res. 2011 Jan;39(Database issue):D19-21. doi: 10.1093/nar/gkq1019. Epub 2010 Nov 9.
6
Analysis and design of RNA sequencing experiments for identifying isoform regulation.RNA 测序实验分析与设计,用于鉴定异构体调控
Nat Methods. 2010 Dec;7(12):1009-15. doi: 10.1038/nmeth.1528. Epub 2010 Nov 7.
7
Ensembl 2011.Ensembl 2011年版
Nucleic Acids Res. 2011 Jan;39(Database issue):D800-6. doi: 10.1093/nar/gkq1064. Epub 2010 Nov 2.
8
Polymorphic cis- and trans-regulation of human gene expression.人类基因表达的多态顺式和反式调控。
PLoS Biol. 2010 Sep 14;8(9):e1000480. doi: 10.1371/journal.pbio.1000480.
9
Cloud-scale RNA-sequencing differential expression analysis with Myrna.利用 Myrna 进行云规模 RNA-seq 差异表达分析。
Genome Biol. 2010;11(8):R83. doi: 10.1186/gb-2010-11-8-r83. Epub 2010 Aug 11.
10
mRNA-seq with agnostic splice site discovery for nervous system transcriptomics tested in chronic pain.用于神经系统转录组学的具有未知剪接位点发现能力的 mRNA-seq 在慢性疼痛中进行了测试。
Genome Res. 2010 Jun;20(6):847-60. doi: 10.1101/gr.101204.109. Epub 2010 May 7.