Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes.

Affiliation

School of Mathematics and Computer Science, Gannan Normal University, Ganzhou, China.

Publication information

PLoS One. 2019 Jul 17;14(7):e0219551. doi: 10.1371/journal.pone.0219551. eCollection 2019.

DOI:10.1371/journal.pone.0219551
PMID:31314810
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6636747/
Abstract

Assumptions about the probability density distribution of the data affect the design of a new statistical method in many ways. Based on an analysis of a group of real gene expression profiles, this study reveals that the primary density distributions of the real profiles are normal/log-normal and t distributions, accounting for 80% and 19% of the profiles, respectively. Using these distributions, we generated a series of simulated data sets to assess a novel statistical method, the maximal information coefficient (MIC), more comprehensively. The results show that MIC not only ranks in the top tier for overall performance in identifying differentially expressed genes, but also exhibits better adaptability and excellent noise immunity compared with existing methods.
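The simulation idea in the abstract can be illustrated with a minimal sketch. It draws one gene's expression values for two classes from a normal, log-normal, or t distribution and scores how well a single expression threshold separates the classes, measured as mutual information in bits. Everything specific here is an assumption for illustration: the abstract gives no simulation parameters, so the sample size, the mean shift, the 3 degrees of freedom for the t distribution, and the function names are all hypothetical. The score is a simplified single-threshold stand-in, not the full MIC, which searches over many grid resolutions.

```python
import math
import random


def simulate_gene(n_per_class, dist, shift, rng):
    """Simulate one gene: class 0 is drawn from `dist`, class 1 is the same plus `shift`.

    All parameters are illustrative assumptions, not values from the study.
    """
    def draw():
        if dist == "normal":
            return rng.gauss(0.0, 1.0)
        if dist == "lognormal":
            return rng.lognormvariate(0.0, 1.0)
        if dist == "t":
            df = 3  # assumed degrees of freedom
            z = rng.gauss(0.0, 1.0)
            chi2 = rng.gammavariate(df / 2.0, 2.0)  # chi-square(df) as a gamma variate
            return z / math.sqrt(chi2 / df)
        raise ValueError(f"unknown distribution: {dist}")

    values = [draw() for _ in range(n_per_class)]
    values += [draw() + shift for _ in range(n_per_class)]
    labels = [0] * n_per_class + [1] * n_per_class
    return values, labels


def mutual_information_bits(table):
    """Mutual information (in bits) of a contingency table given as a list of rows."""
    total = sum(sum(row) for row in table)
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]
    mi = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            if count:
                mi += (count / total) * math.log2(
                    count * total / (row_sums[i] * col_sums[j])
                )
    return mi


def best_split_score(values, labels):
    """Highest mutual information between class label and any single threshold on the values.

    Simplified stand-in for MIC: only 2x2 grids are considered.
    """
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    n_class1 = sum(labels)
    best = 0.0
    left1 = 0
    for k in range(1, n):
        left1 += pairs[k - 1][1]
        if pairs[k - 1][0] == pairs[k][0]:
            continue  # a threshold cannot separate tied values
        left0 = k - left1
        table = [[left0, left1], [(n - n_class1) - left0, n_class1 - left1]]
        best = max(best, mutual_information_bits(table))
    return best


# Usage: compare a shifted (differentially expressed) gene with an unshifted one.
rng = random.Random(0)
for dist in ("normal", "lognormal", "t"):
    shifted = best_split_score(*simulate_gene(20, dist, 2.0, rng))
    null = best_split_score(*simulate_gene(20, dist, 0.0, rng))
    print(f"{dist}: shifted={shifted:.3f} null={null:.3f}")
```

With balanced classes the score ranges from 0 (no threshold carries information about the class) to 1 bit (some threshold separates the classes perfectly). Ranking genes by such a score and comparing the ranking under each distribution is one way to probe how sensitive a method is to the distributional assumption.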
