• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类转录组偏度建模。

Modeling skewness in human transcriptomes.

机构信息

Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, Spain.

出版信息

PLoS One. 2012;7(6):e38919. doi: 10.1371/journal.pone.0038919. Epub 2012 Jun 11.

DOI:10.1371/journal.pone.0038919
PMID:22701729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3372486/
Abstract

Gene expression data are influenced by multiple biological and technological factors leading to a wide range of dispersion scenarios, although skewed patterns are not commonly addressed in microarray analyses. In this study, the distribution pattern of several human transcriptomes has been studied on free-access microarray gene expression data. Our results showed that, even in previously normalized gene expression data, probe and differential expression within probe effects suffer from substantial departures from the commonly assumed symmetric gaussian distribution. We developed a flexible mixed model for non-competitive microarray data analysis that accounted for asymmetric and heavy-tailed (Student's t distribution) dispersion processes. Random effects for gene expression data were modeled under asymmetric Student's t distributions where the asymmetry parameter (λ) took values from perfect symmetry (λ = 0) to right- (λ>0) or left-side (λ>0) over-expression patterns. This approach was applied to four free-access human data sets and revealed clearly better model performance when comparing with standard approaches accounting for traditional symmetric gaussian distribution patterns. Our analyses on human gene expression data revealed a substantial degree of right-hand asymmetry for probe effects, whereas differential gene expression addressed both symmetric and left-hand asymmetric patterns. Although these results cannot be extrapolated to all microarray experiments, they highlighted the incidence of skew dispersion patterns in human transcriptome; moreover, we provided a new analytical approach to appropriately address this biological phenomenon. The source code of the program accommodating these analytical developments and additional information about practical aspects on running the program are freely available by request to the corresponding author of this article.

摘要

基因表达数据受到多种生物和技术因素的影响,导致分布情况多种多样,尽管在微阵列分析中通常不考虑偏态模式。在这项研究中,我们研究了几个人类转录组在免费获取的微阵列基因表达数据中的分布模式。我们的结果表明,即使在先前已经标准化的基因表达数据中,探针和探针内的差异表达也受到严重偏离常见对称高斯分布的影响。我们开发了一种灵活的混合模型,用于非竞争的微阵列数据分析,该模型考虑了不对称和重尾(学生 t 分布)的分布过程。基因表达数据的随机效应在不对称的学生 t 分布下建模,其中不对称参数(λ)取值从完美对称(λ=0)到右(λ>0)或左(λ>0)过表达模式。我们将这种方法应用于四个免费获取的人类数据集,结果表明,与传统的对称高斯分布模式相比,该方法的模型性能明显更好。我们对人类基因表达数据的分析表明,探针效应存在相当大的右偏不对称程度,而差异基因表达则存在对称和左偏不对称模式。虽然这些结果不能推广到所有的微阵列实验,但它们强调了人类转录组中存在偏态分布模式的情况;此外,我们提供了一种新的分析方法来适当地解决这种生物学现象。程序的源代码可以根据需要向本文的通讯作者索取,该程序还包含了关于运行程序的实际方面的更多信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6c/3372486/1e7b96624316/pone.0038919.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6c/3372486/1e7b96624316/pone.0038919.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6c/3372486/1e7b96624316/pone.0038919.g001.jpg

相似文献

1
Modeling skewness in human transcriptomes.人类转录组偏度建模。
PLoS One. 2012;7(6):e38919. doi: 10.1371/journal.pone.0038919. Epub 2012 Jun 11.
2
Bayesian recursive mixed linear model for gene expression analyses with continuous covariates.贝叶斯递归混合线性模型在具有连续协变量的基因表达分析中的应用。
J Anim Sci. 2012 Jan;90(1):67-75. doi: 10.2527/jas.2010-3750. Epub 2011 Sep 9.
3
Use of linear mixed models for genetic evaluation of gestation length and birth weight allowing for heavy-tailed residual effects.利用线性混合模型对胎龄和初生重进行遗传评估,允许重尾残差效应。
Genet Sel Evol. 2010 Jun 30;42(1):26. doi: 10.1186/1297-9686-42-26.
4
Including probe-level measurement error in robust mixture clustering of replicated microarray gene expression.在复制微阵列基因表达的稳健混合聚类中纳入探针水平测量误差。
Stat Appl Genet Mol Biol. 2010;9:Article42. doi: 10.2202/1544-6115.1600. Epub 2010 Dec 9.
5
Meta-analysis of genome-wide expression patterns associated with behavioral maturation in honey bees.与蜜蜂行为成熟相关的全基因组表达模式的荟萃分析。
BMC Genomics. 2008 Oct 24;9:503. doi: 10.1186/1471-2164-9-503.
6
Variability-specific differential gene expression across reproductive stages in sows.母猪繁殖阶段特异性差异基因表达的变异性。
Animal. 2013 Mar;7(3):378-85. doi: 10.1017/S1751731112001747. Epub 2012 Oct 11.
7
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
8
Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment.在cDNA微阵列数据中普遍存在的三参数对数正态分布及其在参数数据处理中的应用。
BMC Bioinformatics. 2004 Jan 13;5:5. doi: 10.1186/1471-2105-5-5.
9
Skew distribution of founder-specific inbreeding depression effects on the longevity of Landrace sows.特定奠基者近亲繁殖衰退对长白母猪寿命影响的偏态分布。
Genet Res (Camb). 2008 Dec;90(6):499-508. doi: 10.1017/S0016672308009907.
10
Biclustering of gene expression data by an extension of mixtures of factor analyzers.通过因子分析器混合模型的扩展对基因表达数据进行双聚类分析。
Int J Biostat. 2008;4(1):Article 3. doi: 10.2202/1557-4679.1078.

引用本文的文献

1
A deep learning method to integrate extracelluar miRNA with mRNA for cancer studies.一种将细胞外 miRNA 与 mRNA 整合进行癌症研究的深度学习方法。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae653.
2
Low protein expression enhances phenotypic evolvability by intensifying selection on folding stability.低蛋白表达通过强化折叠稳定性选择来增强表型可进化性。
Nat Ecol Evol. 2022 Aug;6(8):1155-1164. doi: 10.1038/s41559-022-01797-w. Epub 2022 Jul 7.
3
Investigating skewness to understand gene expression heterogeneity in large patient cohorts.

本文引用的文献

1
Biological assessment of robust noise models in microarray data analysis.生物评估稳健噪声模型在微阵列数据分析中的应用。
Bioinformatics. 2011 Mar 15;27(6):807-14. doi: 10.1093/bioinformatics/btr018. Epub 2011 Jan 19.
2
A note on oligonucleotide expression values not being normally distributed.关于寡核苷酸表达值不呈正态分布的说明。
Biostatistics. 2009 Jul;10(3):446-50. doi: 10.1093/biostatistics/kxp003. Epub 2009 Mar 10.
3
BATS: a Bayesian user-friendly software for analyzing time series microarray experiments.BATS:一款用于分析时间序列微阵列实验的贝叶斯用户友好型软件。
研究偏度以了解大型患者队列中的基因表达异质性。
BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):668. doi: 10.1186/s12859-019-3252-0.
4
The rise of the distributions: why non-normality is important for understanding the transcriptome and beyond.分布的兴起:为何非正态性对于理解转录组及其他方面至关重要。
Biophys Rev. 2019 Feb;11(1):89-94. doi: 10.1007/s12551-018-0494-4. Epub 2019 Jan 7.
5
A null model for Pearson coexpression networks.用于Pearson共表达网络的零模型。
PLoS One. 2015 Jun 1;10(6):e0128115. doi: 10.1371/journal.pone.0128115. eCollection 2015.
6
Non-gaussian distributions affect identification of expression patterns, functional annotation, and prospective classification in human cancer genomes.非高斯分布会影响人类癌症基因组中表达模式、功能注释和前瞻性分类的识别。
PLoS One. 2012;7(10):e46935. doi: 10.1371/journal.pone.0046935. Epub 2012 Oct 31.
BMC Bioinformatics. 2008 Oct 6;9:415. doi: 10.1186/1471-2105-9-415.
4
Impact of normalization and filtering on linkage analysis of gene expression data.标准化和过滤对基因表达数据连锁分析的影响。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S150. doi: 10.1186/1753-6561-1-s1-s150. Epub 2007 Dec 18.
5
Bayesian analysis of quantitative traits using skewed distributions.使用偏态分布对数量性状进行贝叶斯分析。
Genet Res (Camb). 2008 Apr;90(2):179-90. doi: 10.1017/S0016672308009233.
6
GEAMM v.1.4: a versatile program for mixed model analysis of gene expression data.GEAMM v.1.4:一个用于基因表达数据混合模型分析的通用程序。
Anim Genet. 2008 Feb;39(1):89-90. doi: 10.1111/j.1365-2052.2007.01670.x. Epub 2007 Dec 12.
7
Success and failure in human spermatogenesis as revealed by teratozoospermic RNAs.畸形精子症RNA揭示的人类精子发生中的成功与失败
Hum Mol Genet. 2007 Apr 1;16(7):763-73. doi: 10.1093/hmg/ddm012. Epub 2007 Feb 27.
8
Error distribution for gene expression data.基因表达数据的误差分布。
Stat Appl Genet Mol Biol. 2005;4:Article16. doi: 10.2202/1544-6115.1070. Epub 2005 Jul 12.
9
A Laplace mixture model for identification of differential expression in microarray experiments.一种用于识别微阵列实验中差异表达的拉普拉斯混合模型。
Biostatistics. 2006 Oct;7(4):630-41. doi: 10.1093/biostatistics/kxj032. Epub 2006 Mar 24.
10
Bayesian robust inference for differential gene expression in microarrays with multiple samples.用于多个样本微阵列中差异基因表达的贝叶斯稳健推断。
Biometrics. 2006 Mar;62(1):10-8. doi: 10.1111/j.1541-0420.2005.00397.x.