• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种模拟种间 RNA-Seq 数据的系统发育框架。

A Phylogenetic Framework to Simulate Synthetic Interspecies RNA-Seq Data.

机构信息

IMAG, Université de Montpellier, CNRS, Montpellier, France.

Friedrich Miescher Institute for Biomedical Research, 4058 Basel, Switzerland.

出版信息

Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac269.

DOI:10.1093/molbev/msac269
PMID:36508357
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11249980/
Abstract

Interspecies RNA-Seq datasets are increasingly common, and have the potential to answer new questions about the evolution of gene expression. Single-species differential expression analysis is now a well-studied problem that benefits from sound statistical methods. Extensive reviews on biological or synthetic datasets have provided the community with a clear picture on the relative performances of the available methods in various settings. However, synthetic dataset simulation tools are still missing in the interspecies gene expression context. In this work, we develop and implement a new simulation framework. This tool builds on both the RNA-Seq and the phylogenetic comparative methods literatures to generate realistic count datasets, while taking into account the phylogenetic relationships between the samples. We illustrate the usefulness of this new framework through a targeted simulation study, that reproduces the features of a recently published dataset, containing gene expression data in adult eye tissue across blind and sighted freshwater crayfish species. Using our simulated datasets, we perform a fair comparison of several approaches used for differential expression analysis. This benchmark reveals some of the strengths and weaknesses of both the classical and phylogenetic approaches for interspecies differential expression analysis, and allows for a reanalysis of the crayfish dataset. The tool has been integrated in the R package compcodeR, freely available on Bioconductor.

摘要

种间 RNA-Seq 数据集越来越常见,有潜力回答关于基因表达进化的新问题。单物种差异表达分析现在是一个研究充分的问题,得益于合理的统计方法。对生物或合成数据集的广泛综述为社区提供了一个关于各种情况下可用方法相对性能的清晰画面。然而,在种间基因表达的背景下,仍然缺少合成数据集模拟工具。在这项工作中,我们开发并实现了一个新的模拟框架。该工具基于 RNA-Seq 和系统发育比较方法文献,生成真实的计数数据集,同时考虑到样本之间的系统发育关系。我们通过有针对性的模拟研究说明了这个新框架的有用性,该研究再现了最近发表的一个数据集的特征,该数据集包含了盲眼和有眼淡水小龙虾物种的成年眼部组织中的基因表达数据。使用我们的模拟数据集,我们对几种用于差异表达分析的方法进行了公平比较。该基准揭示了经典和系统发育方法在种间差异表达分析中的一些优缺点,并允许对小龙虾数据集进行重新分析。该工具已集成到 R 包 compcodeR 中,可在 Bioconductor 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/8a896257f942/msac269f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/58e80395d5e6/msac269f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/cc5eefc2a13b/msac269f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/9d5661edb4e3/msac269f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/2c11a38d61e3/msac269f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/1f7c61acddd0/msac269f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/c6999ad3f26d/msac269f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/d3b201efe957/msac269f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/8a896257f942/msac269f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/58e80395d5e6/msac269f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/cc5eefc2a13b/msac269f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/9d5661edb4e3/msac269f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/2c11a38d61e3/msac269f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/1f7c61acddd0/msac269f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/c6999ad3f26d/msac269f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/d3b201efe957/msac269f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c6a/11249980/8a896257f942/msac269f8.jpg

相似文献

1
A Phylogenetic Framework to Simulate Synthetic Interspecies RNA-Seq Data.一种模拟种间 RNA-Seq 数据的系统发育框架。
Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac269.
2
compcodeR--an R package for benchmarking differential expression methods for RNA-seq data.compcodeR——一个用于对RNA测序数据差异表达方法进行基准测试的R软件包。
Bioinformatics. 2014 Sep 1;30(17):2517-8. doi: 10.1093/bioinformatics/btu324. Epub 2014 May 9.
3
deGPS is a powerful tool for detecting differential expression in RNA-sequencing studies.deGPS是一种用于在RNA测序研究中检测差异表达的强大工具。
BMC Genomics. 2015 Jun 13;16(1):455. doi: 10.1186/s12864-015-1676-0.
4
Polyester: simulating RNA-seq datasets with differential transcript expression.聚酯:模拟具有差异转录本表达的RNA测序数据集。
Bioinformatics. 2015 Sep 1;31(17):2778-84. doi: 10.1093/bioinformatics/btv272. Epub 2015 Apr 28.
5
Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates.基于 RNA-seq 的差异表达基因的统计检测:从生物学重复到系统发育重复。
Brief Bioinform. 2016 Mar;17(2):243-8. doi: 10.1093/bib/bbv035. Epub 2015 Jun 24.
6
Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试:基于标准化与基于对数比变换的方法。
BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.
7
SimBu: bias-aware simulation of bulk RNA-seq data with variable cell-type composition.SimBu:具有可变细胞类型组成的批量 RNA-seq 数据的偏差感知模拟。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii141-ii147. doi: 10.1093/bioinformatics/btac499.
8
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
9
A Framework for Comparison and Assessment of Synthetic RNA-Seq Data.用于合成 RNA-Seq 数据比较和评估的框架。
Genes (Basel). 2022 Dec 14;13(12):2362. doi: 10.3390/genes13122362.
10
Empirical assessment of analysis workflows for differential expression analysis of human samples using RNA-Seq.使用RNA测序对人类样本进行差异表达分析的分析流程的实证评估。
BMC Bioinformatics. 2017 Jan 17;18(1):38. doi: 10.1186/s12859-016-1457-z.

引用本文的文献

1
A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression.树木过多的故事:系统发育回归的难题
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf032.
2
Stochastic modeling of single-cell gene expression adaptation reveals non-genomic contribution to evolution of tumor subclones.单细胞基因表达适应性的随机建模揭示了肿瘤亚克隆进化的非基因组贡献。
Cell Syst. 2025 Jan 15;16(1):101156. doi: 10.1016/j.cels.2024.11.013. Epub 2024 Dec 18.
3
Stochastic modelling of single-cell gene expression adaptation reveals non-genomic contribution to evolution of tumor subclones.

本文引用的文献

1
Best practices on the differential expression analysis of multi-species RNA-seq.多物种 RNA-seq 差异表达分析的最佳实践。
Genome Biol. 2021 Apr 29;22(1):121. doi: 10.1186/s13059-021-02337-8.
2
The Bgee suite: integrated curated expression atlas and comparative transcriptomics in animals.Bgee 套件:动物中综合的经过审核的表达图谱和比较转录组学。
Nucleic Acids Res. 2021 Jan 8;49(D1):D831-D847. doi: 10.1093/nar/gkaa793.
3
Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution.
单细胞基因表达适应性的随机建模揭示了对肿瘤亚克隆进化的非基因组贡献。
bioRxiv. 2024 Apr 20:2024.04.17.588869. doi: 10.1101/2024.04.17.588869.
4
Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution.评估用于基因表达进化的常用系统发育模型的性能。
Genome Biol Evol. 2023 Dec 1;15(12). doi: 10.1093/gbe/evad211.
5
Dex-Benchmark: datasets and code to evaluate algorithms for transcriptomics data analysis.Dex-Benchmark:用于评估转录组数据分析算法的数据集和代码。
PeerJ. 2023 Nov 8;11:e16351. doi: 10.7717/peerj.16351. eCollection 2023.
6
Facing lethal temperatures: Heat-shock response in desert and temperate ants.面对致命温度:沙漠蚂蚁和温带蚂蚁的热休克反应
Ecol Evol. 2023 Sep 14;13(9):e10438. doi: 10.1002/ece3.10438. eCollection 2023 Sep.
7
Evaluating the Performance of Widely Used Phylogenetic Models for Gene Expression Evolution.评估广泛使用的系统发育模型在基因表达进化方面的性能。
bioRxiv. 2023 Aug 17:2023.02.09.527893. doi: 10.1101/2023.02.09.527893.
合并跨物种转录组揭示了基因表达进化中的器官特异性倾向。
Nat Commun. 2020 Sep 8;11(1):4459. doi: 10.1038/s41467-020-18090-8.
4
Comparative transcriptomics of primary cells in vertebrates.脊椎动物原代细胞的比较转录组学。
Genome Res. 2020 Jul;30(7):951-961. doi: 10.1101/gr.255679.119. Epub 2020 Jul 27.
5
Gene expression of functionally-related genes coevolves across fungal species: detecting coevolution of gene expression using phylogenetic comparative methods.功能相关基因的表达在真菌物种中共同进化:使用系统发育比较方法检测基因表达的共同进化。
BMC Genomics. 2020 May 20;21(1):370. doi: 10.1186/s12864-020-6761-3.
6
A comparison of gene expression and DNA methylation patterns across tissues and species.比较不同组织和物种的基因表达和 DNA 甲基化模式。
Genome Res. 2020 Feb;30(2):250-262. doi: 10.1101/gr.254904.119. Epub 2020 Jan 17.
7
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
8
Drift and Directional Selection Are the Evolutionary Forces Driving Gene Expression Divergence in Eye and Brain Tissue of Butterflies.漂变和定向选择是推动蝴蝶眼和脑组织中基因表达差异的进化力量。
Genetics. 2019 Oct;213(2):581-594. doi: 10.1534/genetics.119.302493. Epub 2019 Aug 29.
9
Estimating the strength of expression conservation from high throughput RNA-seq data.从高通量 RNA-seq 数据估计表达保守性的强度。
Bioinformatics. 2019 Dec 1;35(23):5030-5038. doi: 10.1093/bioinformatics/btz405.
10
A statistical normalization method and differential expression analysis for RNA-seq data between different species.用于不同物种间 RNA-seq 数据的统计标准化方法和差异表达分析。
BMC Bioinformatics. 2019 Mar 29;20(1):163. doi: 10.1186/s12859-019-2745-1.