• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OMICfpp:一种用于配对 RNA-Seq 计数的模糊方法。

OMICfpp: a fuzzy approach for paired RNA-Seq counts.

机构信息

Grupo de Investigación Bioinformática y Genómica Funcional. Laboratorio 19. Centro de Investigación del Cáncer (CiC-IBMCC, Universidad de Salamanca-CSIC, Campus Universitario Miguel de Unamuno s/n, Salamanca, 37007, Spain.

Universidad de La Frontera. Centro De Excelencia de Modelación y Computación Científica, C/ Montevideo 740, Temuco, Chile.

出版信息

BMC Genomics. 2019 Apr 2;20(1):259. doi: 10.1186/s12864-019-5496-5.

DOI:10.1186/s12864-019-5496-5
PMID:30940089
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6444640/
Abstract

BACKGROUND

RNA sequencing is a widely used technology for differential expression analysis. However, the RNA-Seq do not provide accurate absolute measurements and the results can be different for each pipeline used. The major problem in statistical analysis of RNA-Seq and in the omics data in general, is the small sample size with respect to the large number of variables. In addition, experimental design must be taken into account and few tools consider it.

RESULTS

We propose OMICfpp, a method for the statistical analysis of RNA-Seq paired design data. First, we obtain a p-value for each case-control pair using a binomial test. These p-values are aggregated using an ordered weighted average (OWA) with a given orness previously chosen. The aggregated p-value from the original data is compared with the aggregated p-value obtained using the same method applied to random pairs. These new pairs are generated using between-pairs and complete randomization distributions. This randomization p-value is used as a raw p-value to test the differential expression of each gene. The OMICfpp method is evaluated using public data sets of 68 sample pairs from patients with colorectal cancer. We validate our results through bibliographic search of the reported genes and using simulated data set. Furthermore, we compared our results with those obtained by the methods edgeR and DESeq2 for paired samples. Finally, we propose new target genes to validate these as gene expression signatures in colorectal cancer. OMICfpp is available at http://www.uv.es/ayala/software/OMICfpp_0.2.tar.gz .

CONCLUSIONS

Our study shows that OMICfpp is an accurate method for differential expression analysis in RNA-Seq data with paired design. In addition, we propose the use of randomized p-values pattern graphic as a powerful and robust method to select the target genes for experimental validation.

摘要

背景

RNA 测序是一种广泛用于差异表达分析的技术。然而,RNA-Seq 不能提供准确的绝对测量值,并且每个使用的管道的结果可能不同。在 RNA-Seq 的统计分析以及一般的组学数据中,主要问题是相对于大量变量而言,样本量较小。此外,必须考虑实验设计,而很少有工具考虑到这一点。

结果

我们提出了 OMICfpp,这是一种用于 RNA-Seq 配对设计数据的统计分析方法。首先,我们使用二项式检验为每个病例对照对获得一个 p 值。这些 p 值使用给定的有序权重平均 (OWA) 进行聚合,该平均权重是之前选择的。原始数据的聚合 p 值与使用相同方法应用于随机对获得的聚合 p 值进行比较。这些新对是使用对间和完全随机化分布生成的。将该随机化 p 值用作测试每个基因差异表达的原始 p 值。使用来自结直肠癌患者的 68 个样本对的公共数据集评估 OMICfpp 方法。我们通过报告基因的文献搜索和使用模拟数据集验证了我们的结果。此外,我们将我们的结果与配对样本的 edgeR 和 DESeq2 方法的结果进行了比较。最后,我们提出了新的靶基因来验证这些作为结直肠癌的基因表达特征。OMICfpp 可在 http://www.uv.es/ayala/software/OMICfpp_0.2.tar.gz 获得。

结论

我们的研究表明,OMICfpp 是一种用于 RNA-Seq 数据中配对设计的差异表达分析的准确方法。此外,我们建议使用随机化 p 值模式图作为选择用于实验验证的靶基因的强大而稳健的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/04b177a70efc/12864_2019_5496_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/ef1376453e2f/12864_2019_5496_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/44d5dad50b9a/12864_2019_5496_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/45bbba43dc6c/12864_2019_5496_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/04b177a70efc/12864_2019_5496_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/ef1376453e2f/12864_2019_5496_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/44d5dad50b9a/12864_2019_5496_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/45bbba43dc6c/12864_2019_5496_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6828/6444640/04b177a70efc/12864_2019_5496_Fig4_HTML.jpg

相似文献

1
OMICfpp: a fuzzy approach for paired RNA-Seq counts.OMICfpp:一种用于配对 RNA-Seq 计数的模糊方法。
BMC Genomics. 2019 Apr 2;20(1):259. doi: 10.1186/s12864-019-5496-5.
2
Power analysis and sample size estimation for RNA-Seq differential expression.RNA测序差异表达的功效分析与样本量估计
RNA. 2014 Nov;20(11):1684-96. doi: 10.1261/rna.046011.114. Epub 2014 Sep 22.
3
iMir: an integrated pipeline for high-throughput analysis of small non-coding RNA data obtained by smallRNA-Seq.iMir:一个集成的小 RNA-Seq 高通量分析小分子非编码 RNA 数据的综合分析平台。
BMC Bioinformatics. 2013 Dec 13;14:362. doi: 10.1186/1471-2105-14-362.
4
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
5
Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试:基于标准化与基于对数比变换的方法。
BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.
6
Power in pairs: assessing the statistical value of paired samples in tests for differential expression.成对的力量:评估差异表达检验中配对样本的统计价值。
BMC Genomics. 2018 Dec 20;19(1):953. doi: 10.1186/s12864-018-5236-2.
7
Sample size calculation while controlling false discovery rate for differential expression analysis with RNA-sequencing experiments.在RNA测序实验的差异表达分析中控制错误发现率时的样本量计算。
BMC Bioinformatics. 2016 Mar 31;17:146. doi: 10.1186/s12859-016-0994-9.
8
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.通过对异构体和外显子特异性读段测序率进行建模来改进RNA测序表达估计。
BMC Bioinformatics. 2015 Oct 16;16:332. doi: 10.1186/s12859-015-0750-6.
9
TAP: a targeted clinical genomics pipeline for detecting transcript variants using RNA-seq data.TAP:一种使用 RNA-seq 数据检测转录变体的靶向临床基因组学管道。
BMC Med Genomics. 2018 Sep 10;11(1):79. doi: 10.1186/s12920-018-0402-6.
10
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.

引用本文的文献

1
Controlling for cellular heterogeneity using single-cell deconvolution of gene expression reveals novel markers of colorectal tumors exhibiting microsatellite instability.利用基因表达的单细胞反卷积来控制细胞异质性,揭示了表现出微卫星不稳定性的结直肠癌的新标志物。
Oncotarget. 2021 Apr 13;12(8):767-782. doi: 10.18632/oncotarget.27935.

本文引用的文献

1
Elimination of CD4HLA-G T cells overcomes castration-resistance in prostate cancer therapy.消除 CD4+HLA-G 细胞可克服前列腺癌治疗中的去势抵抗。
Cell Res. 2018 Nov;28(11):1103-1117. doi: 10.1038/s41422-018-0089-4. Epub 2018 Oct 8.
2
Relationship between folate concentration and expression of folate-associated genes in tissue and plasma after intraoperative administration of leucovorin in patients with colorectal cancer.结直肠癌患者术中给予亚叶酸后组织和血浆中叶酸浓度与叶酸相关基因表达的关系。
Cancer Chemother Pharmacol. 2018 Dec;82(6):987-997. doi: 10.1007/s00280-018-3690-9. Epub 2018 Sep 29.
3
Keratin 80 promotes migration and invasion of colorectal carcinoma by interacting with PRKDC via activating the AKT pathway.
角蛋白 80 通过与 PRKDC 相互作用并激活 AKT 通路促进结直肠癌的迁移和侵袭。
Cell Death Dis. 2018 Sep 27;9(10):1009. doi: 10.1038/s41419-018-1030-y.
4
An integrated study on TFs and miRNAs in colorectal cancer metastasis and evaluation of three co-regulated candidate genes as prognostic markers.结直肠癌转移中 TF 和 miRNA 的综合研究及三个共同调控候选基因作为预后标志物的评估。
Gene. 2018 Dec 30;679:150-159. doi: 10.1016/j.gene.2018.09.003. Epub 2018 Sep 4.
5
TGF-β1-SOX9 axis-inducible COL10A1 promotes invasion and metastasis in gastric cancer via epithelial-to-mesenchymal transition.TGF-β1-SOX9 轴诱导的 COL10A1 通过上皮-间充质转化促进胃癌的侵袭和转移。
Cell Death Dis. 2018 Aug 28;9(9):849. doi: 10.1038/s41419-018-0877-2.
6
High TNFRSF12A level associated with MMP-9 overexpression is linked to poor prognosis in breast cancer: Gene set enrichment analysis and validation in large-scale cohorts.高 TNFRSF12A 水平与 MMP-9 过表达相关,与乳腺癌不良预后相关:大规模队列的基因集富集分析和验证。
PLoS One. 2018 Aug 24;13(8):e0202113. doi: 10.1371/journal.pone.0202113. eCollection 2018.
7
KLK6 protease accelerates skin tumor formation and progression.KLK6 蛋白酶加速皮肤肿瘤的形成和发展。
Carcinogenesis. 2018 Dec 31;39(12):1529-1536. doi: 10.1093/carcin/bgy110.
8
BCMA (TNFRSF17) Induces APRIL and BAFF Mediated Breast Cancer Cell Stemness.BCMA(肿瘤坏死因子受体超家族成员17)诱导增殖诱导配体(APRIL)和B细胞激活因子(BAFF)介导的乳腺癌细胞干性。
Front Oncol. 2018 Aug 7;8:301. doi: 10.3389/fonc.2018.00301. eCollection 2018.
9
CLEC3B is downregulated and inhibits proliferation in clear cell renal cell carcinoma.CLEC3B 在肾透明细胞癌中表达下调并抑制其增殖。
Oncol Rep. 2018 Oct;40(4):2023-2035. doi: 10.3892/or.2018.6590. Epub 2018 Jul 23.
10
Transcriptome-Wide Analysis Reveals the Landscape of Aberrant Alternative Splicing Events in Liver Cancer.转录组全分析揭示肝癌中异常可变剪接事件的全景
Hepatology. 2019 Jan;69(1):359-375. doi: 10.1002/hep.30158. Epub 2018 Dec 18.