文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

SimSeq: a nonparametric approach to simulation of RNA-sequence datasets.

作者信息

Benidt Sam, Nettleton Dan

机构信息

Department of Statistics, Iowa State University, Ames, IA 50011-1210, USA.

出版信息

Bioinformatics. 2015 Jul 1;31(13):2131-40. doi: 10.1093/bioinformatics/btv124. Epub 2015 Feb 26.


DOI:10.1093/bioinformatics/btv124
PMID:25725090
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4481850/
Abstract

MOTIVATION: RNA sequencing analysis methods are often derived by relying on hypothetical parametric models for read counts that are not likely to be precisely satisfied in practice. Methods are often tested by analyzing data that have been simulated according to the assumed model. This testing strategy can result in an overly optimistic view of the performance of an RNA-seq analysis method. RESULTS: We develop a data-based simulation algorithm for RNA-seq data. The vector of read counts simulated for a given experimental unit has a joint distribution that closely matches the distribution of a source RNA-seq dataset provided by the user. We conduct simulation experiments based on the negative binomial distribution and our proposed nonparametric simulation algorithm. We compare performance between the two simulation experiments over a small subset of statistical methods for RNA-seq analysis available in the literature. We use as a benchmark the ability of a method to control the false discovery rate. Not surprisingly, methods based on parametric modeling assumptions seem to perform better with respect to false discovery rate control when data are simulated from parametric models rather than using our more realistic nonparametric simulation strategy. AVAILABILITY AND IMPLEMENTATION: The nonparametric simulation algorithm developed in this article is implemented in the R package SimSeq, which is freely available under the GNU General Public License (version 2 or later) from the Comprehensive R Archive Network (http://cran.rproject.org/). CONTACT: sgbenidt@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

摘要

相似文献

[1]
SimSeq: a nonparametric approach to simulation of RNA-sequence datasets.

Bioinformatics. 2015-7-1

[2]
Polyester: simulating RNA-seq datasets with differential transcript expression.

Bioinformatics. 2015-9-1

[3]
rSeqNP: a non-parametric approach for detecting differential expression and splicing from RNA-Seq data.

Bioinformatics. 2015-7-1

[4]
NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data.

BMC Bioinformatics. 2013-8-27

[5]
Differential correlation for sequencing data.

BMC Res Notes. 2017-1-19

[6]
Joint estimation of isoform expression and isoform-specific read distribution using multisample RNA-Seq data.

Bioinformatics. 2013-12-3

[7]
PROPER: comprehensive power evaluation for differential expression using RNA-seq.

Bioinformatics. 2015-1-15

[8]
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.

PLoS One. 2017-5-1

[9]
smallWig: parallel compression of RNA-seq WIG files.

Bioinformatics. 2016-1-15

[10]
Guidance for RNA-seq co-expression network construction and analysis: safety in numbers.

Bioinformatics. 2015-7-1

引用本文的文献

[1]
Crafted experiments to evaluate feature selection methods for single-cell RNA-seq data.

NAR Genom Bioinform. 2025-3-19

[2]
A comprehensive review and benchmark of differential analysis tools for Hi-C data.

Brief Bioinform. 2025-3-4

[3]
Accurate assembly of full-length consensus for viral quasispecies.

BMC Bioinformatics. 2025-2-1

[4]
BEERS2: RNA-Seq simulation through high fidelity in silico modeling.

Brief Bioinform. 2024-3-27

[5]
RNC: Uncovering the dynamic and condition-specific RBP-ncRNA circuits from multi-omics data.

Comput Struct Biotechnol J. 2023-3-23

[6]
Systematic benchmarking of statistical methods to assess differential expression of circular RNAs.

Brief Bioinform. 2023-1-19

[7]
A Framework for Comparison and Assessment of Synthetic RNA-Seq Data.

Genes (Basel). 2022-12-14

[8]
Haplotype assignment of longitudinal viral deep sequencing data using covariation of variant frequencies.

Virus Evol. 2022-10-6

[9]
Spatially-enhanced clusterwise inference for testing and localizing intermodal correspondence.

Neuroimage. 2022-12-1

[10]
Impact of adaptive filtering on power and false discovery rate in RNA-seq experiments.

BMC Bioinformatics. 2022-9-24

本文引用的文献

[1]
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Genome Biol. 2014

[2]
Error estimates for the analysis of differential expression from RNA-seq count data.

PeerJ. 2014-9-23

[3]
subSeq: determining appropriate sequencing depth through efficient read subsampling.

Bioinformatics. 2014-12-1

[4]
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.

Genome Biol. 2014-2-3

[5]
Evaluating statistical analysis models for RNA sequencing experiments.

Front Genet. 2013-9-17

[6]
Comprehensive molecular characterization of clear cell renal cell carcinoma.

Nature. 2013-6-23

[7]
A comparison of methods for differential expression analysis of RNA-seq data.

BMC Bioinformatics. 2013-3-9

[8]
Detecting differential expression in RNA-sequence data using quasi-likelihood with shrunken dispersion estimates.

Stat Appl Genet Mol Biol. 2012-10-22

[9]
A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis.

Brief Bioinform. 2012-9-17

[10]
Modelling and simulating generic RNA-Seq experiments with the flux simulator.

Nucleic Acids Res. 2012-9-7

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索