Analysis of 3'-seq data from multiple studies identifies diverging results sets and raw data characteristics despite similar collection conditions.

Authors

Furumo Quinlan, Meyer Michelle M

Affiliation

Boston College, Department of Biology, Chestnut Hill MA 02467.

Publication information

bioRxiv. 2025 Jun 12:2025.06.12.658996. doi: 10.1101/2025.06.12.658996.

DOI:10.1101/2025.06.12.658996

PMID:40661418

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12259150/

Abstract

3-prime end sequencing (3'-seq) is a high-throughput sequencing technique used to specifically quantify changes in 3'-end formation of transcripts in bacterial cells, and it is increasingly being used to address fundamental questions regarding transcription termination and pausing across a range of bacterial species. However, the growing number of 3'-seq studies is accompanied by an increase in study-specific 3'-seq data analysis approaches. Thus, differences in a number of factors, including experimental design, data collection approaches, analysis methodologies, and interpretation decisions, make it challenging to confidently compare results derived from different studies, even those performed on the same organism. To assess the potential severity of these discrepancies, we used PIPETS, a statistically robust and genome-annotation agnostic 3'-seq analysis package, to study 3'-seq data sets from three different groups collected under similar conditions. By using a consistent analysis and results interpretation approach, we identified large disparities in the characteristics of the raw 3'-seq data between the studies, despite all three studies using the same strain and very similar reported experimental conditions. Additionally, we found strand-specific inconsistencies, with some data sets having reference strand 3'-seq read coverage distributions that differed greatly from the complement strand within the same replicate. Finally, when the 3'-seq distribution profiles of the three studies were compared to studies from four additional bacteria, we identified 3'-seq results clustering patterns that are not explained by phylogenetic similarity between organisms. Given the large differences seen between data sets from the same organism, as well as the inconsistencies seen between replicates from the same data sets, we urge the field to reconsider the assumptions around 3'-seq data homogeneity and move towards consistent analysis approaches and cautious interpretation of the data.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f17/12259150/3c78d7391a6e/nihpp-2025.06.12.658996v1-f0002.jpg

References cited in this article

PIPETS: a statistically informed, gene-annotation agnostic analysis method to study bacterial termination using 3'-end sequencing.

BMC Bioinformatics. 2024 Nov 23;25(1):363. doi: 10.1186/s12859-024-05982-5.

TRS: a method for determining transcript termini from RNAtag-seq sequencing data.

Nat Commun. 2023 Nov 29;14(1):7843. doi: 10.1038/s41467-023-43534-2.

Extensive diversity in RNA termination and regulation revealed by transcriptome mapping for the Lyme pathogen Borrelia burgdorferi.

Nat Commun. 2023 Jul 4;14(1):3931. doi: 10.1038/s41467-023-39576-1.

Premature termination of transcription is shaped by Rho and translated uORFS in .

iScience. 2023 Mar 22;26(4):106465. doi: 10.1016/j.isci.2023.106465. eCollection 2023 Apr 21.

Ubiquitous mRNA decay fragments in E. coli redefine the functional transcriptome.

Nucleic Acids Res. 2022 May 20;50(9):5029-5046. doi: 10.1093/nar/gkac295.

Synthetic 3'-UTR valves for optimal metabolic flux control in Escherichia coli.

Nucleic Acids Res. 2022 Apr 22;50(7):4171-4186. doi: 10.1093/nar/gkac206.

Analysis of mRNA Decay Intermediates in Bacillus subtilis 3' Exoribonuclease and RNA Helicase Mutant Strains.

mBio. 2022 Apr 26;13(2):e0040022. doi: 10.1128/mbio.00400-22. Epub 2022 Mar 21.

System-Level Analysis of Transcriptional and Translational Regulatory Elements in .

Front Bioeng Biotechnol. 2022 Feb 25;10:844200. doi: 10.3389/fbioe.2022.844200. eCollection 2022.

Genome-scale analysis of genetic regulatory elements in Streptomyces avermitilis MA-4680 using transcript boundary information.

BMC Genomics. 2022 Jan 21;23(1):68. doi: 10.1186/s12864-022-08314-0.

Quantitative mapping of mRNA 3' ends in Pseudomonas aeruginosa reveals a pervasive role for premature 3' end formation in response to azithromycin.

PLoS Genet. 2021 Jul 12;17(7):e1009634. doi: 10.1371/journal.pgen.1009634. eCollection 2021 Jul.
