Suppr
超能文献

评估用于推进精准医学的个体转录组学解释的单例研究方法。

Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine.

机构信息

The Center for Biomedical Informatics & Biostatistics of the University of Arizona Health Sciences, 1230 N. Cherry Ave, Tucson, AZ, 85721, USA.

The Department of Medicine, College of Medicine Tucson, 1501 N. Campbell Ave, Tucson, AZ, 85724-5035, USA.

出版信息

BMC Med Genomics. 2019 Jul 11;12(Suppl 5):96. doi: 10.1186/s12920-019-0513-8.

DOI:10.1186/s12920-019-0513-8

PMID:31296218

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6624180/

Abstract

BACKGROUND

Gene expression profiling has benefited medicine by providing clinically relevant insights at the molecular candidate and systems levels. However, to adopt a more 'precision' approach that integrates individual variability including 'omics data into risk assessments, diagnoses, and therapeutic decision making, whole transcriptome expression needs to be interpreted meaningfully for single subjects. We propose an "all-against-one" framework that uses biological replicates in isogenic conditions for testing differentially expressed genes (DEGs) in a single subject (ss) in the absence of an appropriate external reference standard or replicates. To evaluate our proposed "all-against-one" framework, we construct reference standards (RSs) with five conventional replicate-anchored analyses (NOISeq, DEGseq, edgeR, DESeq, DESeq2) and the remainder were treated separately as single-subject sample pairs for ss analyses (without replicates).

RESULTS

Eight ss methods (NOISeq, DEGseq, edgeR, mixture model, DESeq, DESeq2, iDEG, and ensemble) for identifying genes with differential expression were compared in Yeast (parental line versus snf2 deletion mutant; n = 42/condition) and a MCF7 breast-cancer cell line (baseline versus stimulated with estradiol; n = 7/condition). Receiver-operator characteristic (ROC) and precision-recall plots were determined for eight ss methods against each of the five RSs in both datasets. Consistent with prior analyses of these data, ~ 50% and ~ 15% DEGs were obtained in Yeast and MCF7 datasets respectively, regardless of the RSs method. NOISeq, edgeR, and DESeq were the most concordant for creating a RS. Single-subject versions of NOISeq, DEGseq, and an ensemble learner achieved the best median ROC-area-under-the-curve to compare two transcriptomes without replicates regardless of the RS method and dataset (> 90% in Yeast, > 0.75 in MCF7). Further, distinct specific single-subject methods perform better according to different proportions of DEGs.

CONCLUSIONS

The "all-against-one" framework provides a honest evaluation framework for single-subject DEG studies since these methods are evaluated, by design, against reference standards produced by unrelated DEG methods. The ss-ensemble method was the only one to reliably produce higher accuracies in all conditions tested in this conservative evaluation framework. However, single-subject methods for identifying DEGs from paired samples need improvement, as no method performed with precision> 90% and obtained moderate levels of recall. http://www.lussiergroup.org/publications/EnsembleBiomarker.

摘要

背景

基因表达谱分析通过在分子候选物和系统水平上提供临床相关的见解，使医学受益。然而，要采用更“精确”的方法，将个体变异（包括“组学”数据）整合到风险评估、诊断和治疗决策中，需要对单个受试者进行有意义的全转录组表达解读。我们提出了一种“一对一”框架，该框架使用同基因条件下的生物学重复来测试单个受试者（ss）中差异表达基因（DEGs），而无需适当的外部参考标准或重复。为了评估我们提出的“一对一”框架，我们使用五种常规重复锚定分析（NOISeq、DEGseq、edgeR、DESeq、DESeq2）构建参考标准（RS），其余的则分别作为单个受试者样本对（无重复）进行 ss 分析。

结果

在酵母（亲本系与 snf2 缺失突变体；n=42/条件）和 MCF7 乳腺癌细胞系（基线与用雌二醇刺激；n=7/条件）中，比较了八种用于识别差异表达基因的 ss 方法（NOISeq、DEGseq、edgeR、混合物模型、DESeq、DESeq2、iDEG 和集成）。在两个数据集的五个 RS 中，为每个 ss 方法确定了接收器工作特征（ROC）和精度-召回曲线。与这些数据的先前分析一致，无论 RS 方法如何，酵母和 MCF7 数据集分别获得了约 50%和 15%的差异表达基因。NOISeq、edgeR 和 DESeq 是创建 RS 最一致的方法。NOISeq、DEGseq 和集成学习者的单主体版本在没有重复的情况下比较两个转录组时，实现了最佳的中位数 ROC-曲线下面积，无论 RS 方法和数据集如何（酵母>90%，MCF7>0.75）。此外，根据差异表达基因的不同比例，不同的特定单主体方法表现更好。

结论

“一对一”框架为单个受试者 DEG 研究提供了一个诚实的评估框架，因为这些方法是通过设计来评估由不相关的 DEG 方法产生的参考标准的。在这个保守的评估框架中，ss-集成方法是唯一能够可靠地在所有测试条件下产生更高准确性的方法。然而，用于从配对样本中识别 DEG 的单个受试者方法需要改进，因为没有一种方法的精度>90%，召回率适中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061e/6624180/502f13da072b/12920_2019_513_Fig1_HTML.jpg

相似文献

Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine.

BMC Med Genomics. 2019 Jul 11;12(Suppl 5):96. doi: 10.1186/s12920-019-0513-8.

Interpretation of 'Omics dynamics in a single subject using local estimates of dispersion between two transcriptomes.

AMIA Annu Symp Proc. 2020 Mar 4;2019:582-591. eCollection 2019.

Personalized beyond Precision: Designing Unbiased Gold Standards to Improve Single-Subject Studies of Personal Genome Dynamics from Gene Products.

J Pers Med. 2020 Dec 31;11(1):24. doi: 10.3390/jpm11010024.

Single subject transcriptome analysis to identify functionally signed gene set or pathway activity.

Pac Symp Biocomput. 2018;23:400-411.

Emergence of pathway-level composite biomarkers from converging gene set signals of heterogeneous transcriptomic responses.

Pac Symp Biocomput. 2018;23:484-495.

Usability of reference-free transcriptome assemblies for detection of differential expression: a case study on Aethionema arabicum dimorphic seeds.

BMC Genomics. 2019 Jan 30;20(1):95. doi: 10.1186/s12864-019-5452-4.

Dynamic changes of RNA-sequencing expression for precision medicine: N-of-1-pathways Mahalanobis distance within pathways of single subjects predicts breast cancer survival.

Bioinformatics. 2015 Jun 15;31(12):i293-302. doi: 10.1093/bioinformatics/btv253.

bestDEG: a web-based application automatically combines various tools to precisely predict differentially expressed genes (DEGs) from RNA-Seq data.

PeerJ. 2022 Nov 10;10:e14344. doi: 10.7717/peerj.14344. eCollection 2022.

Robust identification of differentially expressed genes from RNA-seq data.

Genomics. 2020 Mar;112(2):2000-2010. doi: 10.1016/j.ygeno.2019.11.012. Epub 2019 Nov 20.

A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.

PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.

引用本文的文献

Multiomics2Targets identifies targets from cancer cohorts profiled with transcriptomics, proteomics, and phosphoproteomics.

Cell Rep Methods. 2024 Aug 19;4(8):100839. doi: 10.1016/j.crmeth.2024.100839. Epub 2024 Aug 9.

(Re-)Viewing Role of Intracellular Glucose Beyond Extracellular Regulation of Glucose-Stimulated Insulin Secretion by Pancreatic Cells.

ACS Omega. 2024 Feb 29;9(10):11755-11768. doi: 10.1021/acsomega.3c09171. eCollection 2024 Mar 12.

'Single-subject studies'-derived analyses unveil altered biomechanisms between very small cohorts: implications for rare diseases.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i67-i75. doi: 10.1093/bioinformatics/btab290.

The overexpression of DNA repair genes in invasive ductal and lobular breast carcinomas: Insights on individual variations and precision medicine.

PLoS One. 2021 Mar 4;16(3):e0247837. doi: 10.1371/journal.pone.0247837. eCollection 2021.

Personalized beyond Precision: Designing Unbiased Gold Standards to Improve Single-Subject Studies of Personal Genome Dynamics from Gene Products.

J Pers Med. 2020 Dec 31;11(1):24. doi: 10.3390/jpm11010024.

Interpretation of 'Omics dynamics in a single subject using local estimates of dispersion between two transcriptomes.

AMIA Annu Symp Proc. 2020 Mar 4;2019:582-591. eCollection 2019.

本文引用的文献

The Emerging Potential for Network Analysis to Inform Precision Cancer Medicine.

J Mol Biol. 2018 Sep 14;430(18 Pt A):2875-2899. doi: 10.1016/j.jmb.2018.06.016. Epub 2018 Jun 15.

Developing a 'personalome' for precision medicine: emerging methods that compute interpretable effect sizes from single-subject transcriptomes.

Brief Bioinform. 2019 May 21;20(3):789-805. doi: 10.1093/bib/bbx149.

A genome-by-environment interaction classifier for precision medicine: personal transcriptome response to rhinovirus identifies children prone to asthma exacerbations.

J Am Med Inform Assoc. 2017 Nov 1;24(6):1116-1126. doi: 10.1093/jamia/ocx069.

N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

BMC Med Genomics. 2017 May 24;10(Suppl 1):27. doi: 10.1186/s12920-017-0263-4.

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation.

Stat Methods Med Res. 2018 Dec;27(12):3797-3813. doi: 10.1177/0962280217712271. Epub 2017 May 29.

A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.

PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.

kMEn: Analyzing noisy and bidirectional transcriptional pathway responses in single subjects.

J Biomed Inform. 2017 Feb;66:32-41. doi: 10.1016/j.jbi.2016.12.009. Epub 2016 Dec 19.

RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods.

Nucleic Acids Res. 2017 Mar 17;45(5):e30. doi: 10.1093/nar/gkw1063.

Precrec: fast and accurate precision-recall and ROC curve calculations in R.

Bioinformatics. 2017 Jan 1;33(1):145-147. doi: 10.1093/bioinformatics/btw570. Epub 2016 Sep 1.

Analysis of aggregated cell-cell statistical distances within pathways unveils therapeutic-resistance mechanisms in circulating tumor cells.

Bioinformatics. 2016 Jun 15;32(12):i80-i89. doi: 10.1093/bioinformatics/btw248.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

评估用于推进精准医学的个体转录组学解释的单例研究方法。

Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译