Detecting differentially expressed genes by smoothing effect of gene length on variance estimation.

Authors

Tang Jinyang, Wang Fei

Affiliation

1 Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, P. R. China.

Publication information

J Bioinform Comput Biol. 2015 Dec;13(6):1542004. doi: 10.1142/S0219720015420044. Epub 2015 Oct 11.

DOI: 10.1142/S0219720015420044
PMID: 26608751
Abstract

Next-generation sequencing technologies are widely used in genome research, and RNA sequencing (RNA-Seq) is becoming the main application for gene expression profiling. A large number of computational methods have been developed for analyzing differentially expressed (DE) genes in RNA-Seq data. However, most existing algorithms tend to call long genes as DE, and short DE genes are rarely detected. In this work, we set out to gain insight into the influence of gene length on RNA-Seq data analysis and to determine the effect of gene length on the variance estimation of RNA-Seq read counts, which is important for the statistical tests used to identify DE genes. We propose a balanced method for detecting significant short DE genes by smoothing a gene length factor. Computational experiments indicate that our method performs well. Software available: http://www.iipl.fudan.edu.cn/lenseq/.
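
The length bias described in the abstract can be illustrated with a small calculation. This is a minimal sketch, not the authors' method and not the lenseq software: it assumes a plain Poisson model in which the expected read count is proportional to gene length times expression level, and it uses a simple normal-approximation z-statistic. All function names and parameter values (`expected_count`, `poisson_z`, `z_for_length`, the 10 reads per kb baseline, the two-fold change) are hypothetical choices made for illustration.

```python
import math


def expected_count(length_bp, expr_per_kb, depth_factor=1.0):
    """Expected read count, ASSUMING reads scale with length x expression."""
    return depth_factor * expr_per_kb * length_bp / 1000.0


def poisson_z(mu_a, mu_b):
    """Approximate z-statistic for the difference of two Poisson means.

    Under a Poisson model the variance equals the mean, so the variance
    of the difference is mu_a + mu_b.
    """
    return (mu_b - mu_a) / math.sqrt(mu_a + mu_b)


def z_for_length(length_bp, expr_per_kb=10.0, fold_change=2.0):
    """z-statistic for a fixed fold change in a gene of the given length."""
    mu_a = expected_count(length_bp, expr_per_kb)
    mu_b = expected_count(length_bp, expr_per_kb * fold_change)
    return poisson_z(mu_a, mu_b)


if __name__ == "__main__":
    # Same expression level and same fold change; only gene length differs.
    for length in (500, 2000, 8000):
        print(length, round(z_for_length(length), 2))
```

Under these assumptions the statistic grows with the square root of gene length, so a two-fold change in a 500 bp gene stays below the usual 1.96 threshold while the same change in a 2000 bp gene exceeds it. This is one common explanation of why count-based tests favor long genes; how the paper's method smooths the length factor is not specified in the abstract and is not modeled here.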

Similar articles

1
Detecting differentially expressed genes by smoothing effect of gene length on variance estimation.
J Bioinform Comput Biol. 2015 Dec;13(6):1542004. doi: 10.1142/S0219720015420044. Epub 2015 Oct 11.
2
GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data.
Bioinformatics. 2012 Nov 1;28(21):2782-8. doi: 10.1093/bioinformatics/bts515. Epub 2012 Aug 24.
3
DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.
Bioinformatics. 2010 Jan 1;26(1):136-8. doi: 10.1093/bioinformatics/btp612. Epub 2009 Oct 24.
4
A two-step integrated approach to detect differentially expressed genes in RNA-Seq data.
J Bioinform Comput Biol. 2016 Dec;14(6):1650034. doi: 10.1142/S0219720016500347. Epub 2016 Sep 15.
5
LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data.
BMC Genomics. 2014;15 Suppl 10(Suppl 10):S7. doi: 10.1186/1471-2164-15-S10-S7. Epub 2014 Dec 12.
6
Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates.
Brief Bioinform. 2016 Mar;17(2):243-8. doi: 10.1093/bib/bbv035. Epub 2015 Jun 24.
7
Modifying SAMseq to account for asymmetry in the distribution of effect sizes when identifying differentially expressed genes.
Stat Appl Genet Mol Biol. 2017 Nov 27;16(5-6):291-312. doi: 10.1515/sagmb-2016-0037.
8
DegPack: a web package using a non-parametric and information theoretic algorithm to identify differentially expressed genes in multiclass RNA-seq samples.
Methods. 2014 Oct 1;69(3):306-14. doi: 10.1016/j.ymeth.2014.06.004. Epub 2014 Jun 26.
9
Joint estimation of isoform expression and isoform-specific read distribution using multisample RNA-Seq data.
Bioinformatics. 2014 Feb 15;30(4):506-13. doi: 10.1093/bioinformatics/btt704. Epub 2013 Dec 3.
10
BADGE: a novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data.
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S6. doi: 10.1186/1471-2105-15-S9-S6. Epub 2014 Sep 10.