• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

忽略多重比对会导致对 NGS 数据的功能评估产生偏差。

Disregarding multimappers leads to biases in the functional assessment of NGS data.

机构信息

Institute of Biomedical Informatics, Graz University of Technology, Graz, Austria.

出版信息

BMC Genomics. 2024 May 8;25(1):455. doi: 10.1186/s12864-024-10344-9.

DOI:10.1186/s12864-024-10344-9
PMID:38720252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11078754/
Abstract

BACKGROUND

Standard ChIP-seq and RNA-seq processing pipelines typically disregard sequencing reads whose origin is ambiguous ("multimappers"). This usual practice has potentially important consequences for the functional interpretation of the data: genomic elements belonging to clusters composed of highly similar members are left unexplored.

RESULTS

In particular, disregarding multimappers leads to the underrepresentation in epigenetic studies of recently active transposable elements, such as AluYa5, L1HS and SVAs. Furthermore, this common strategy also has implications for transcriptomic analysis: members of repetitive gene families, such the ones including major histocompatibility complex (MHC) class I and II genes, are under-quantified.

CONCLUSION

Revealing inherent biases that permeate routine tasks such as functional enrichment analysis, our results underscore the urgency of broadly adopting multimapper-aware bioinformatic pipelines -currently restricted to specific contexts or communities- to ensure the reliability of genomic and transcriptomic studies.

摘要

背景

标准 ChIP-seq 和 RNA-seq 处理管道通常会忽略其来源不明确的测序读取(“多重映射”)。这种常见的做法可能会对数据的功能解释产生重要影响:属于由高度相似成员组成的聚类的基因组元件未被探索。

结果

特别是,忽略多重映射会导致在表观遗传学研究中,最近活跃的转座元件(如 AluYa5、L1HS 和 SVAs)的代表性不足。此外,这种常见策略也会对转录组分析产生影响:重复基因家族的成员,如主要组织相容性复合体(MHC)I 类和 II 类基因,被低估了。

结论

揭示了普遍存在于功能富集分析等常规任务中的固有偏差,我们的结果强调了广泛采用多重映射感知生物信息学管道的紧迫性-目前仅限于特定的背景或社区-以确保基因组和转录组研究的可靠性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73ff/11080154/12376a206893/12864_2024_10344_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73ff/11080154/cb7493e806e5/12864_2024_10344_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73ff/11080154/12376a206893/12864_2024_10344_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73ff/11080154/cb7493e806e5/12864_2024_10344_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73ff/11080154/12376a206893/12864_2024_10344_Fig2_HTML.jpg

相似文献

1
Disregarding multimappers leads to biases in the functional assessment of NGS data.忽略多重比对会导致对 NGS 数据的功能评估产生偏差。
BMC Genomics. 2024 May 8;25(1):455. doi: 10.1186/s12864-024-10344-9.
2
piPipes: a set of pipelines for piRNA and transposon analysis via small RNA-seq, RNA-seq, degradome- and CAGE-seq, ChIP-seq and genomic DNA sequencing.piPipes:一组通过小RNA测序、RNA测序、降解组和CAGE测序、染色质免疫沉淀测序以及基因组DNA测序进行piRNA和转座子分析的管道。
Bioinformatics. 2015 Feb 15;31(4):593-5. doi: 10.1093/bioinformatics/btu647. Epub 2014 Oct 17.
3
Analysis of ChIP-Seq and RNA-Seq Data with BioWardrobe.使用BioWardrobe分析染色质免疫沉淀测序(ChIP-Seq)和RNA测序(RNA-Seq)数据。
Methods Mol Biol. 2018;1783:343-360. doi: 10.1007/978-1-4939-7834-2_17.
4
Grape RNA-Seq analysis pipeline environment.葡萄 RNA-Seq 分析管道环境。
Bioinformatics. 2013 Mar 1;29(5):614-21. doi: 10.1093/bioinformatics/btt016. Epub 2013 Jan 17.
5
T3E: a tool for characterising the epigenetic profile of transposable elements using ChIP-seq data.T3E:一种利用染色质免疫沉淀测序(ChIP-seq)数据来表征转座元件表观遗传图谱的工具。
Mob DNA. 2022 Nov 30;13(1):29. doi: 10.1186/s13100-022-00285-z.
6
Differential Expression Analysis of Transposable Elements from RNA-seq Data.基于 RNA-seq 数据的转座元件差异表达分析。
Cold Spring Harb Protoc. 2023 Jan 3;2023(1):35-47. doi: 10.1101/pdb.prot107748.
7
Analysis of RNA-Seq Data Using TEtranscripts.使用TEtranscripts分析RNA测序数据。
Methods Mol Biol. 2018;1751:153-167. doi: 10.1007/978-1-4939-7710-9_11.
8
EXPRSS: an Illumina based high-throughput expression-profiling method to reveal transcriptional dynamics.EXPRSS:一种基于Illumina的高通量表达谱分析方法,用于揭示转录动态。
BMC Genomics. 2014 May 6;15(1):341. doi: 10.1186/1471-2164-15-341.
9
RNA-Seq for transcriptome analysis in non-model plants.用于非模式植物转录组分析的RNA测序
Methods Mol Biol. 2013;1069:43-58. doi: 10.1007/978-1-62703-613-9_4.
10
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.

引用本文的文献

1
Long-read RNA sequencing of transposable elements from single cells using CELLO-seq.使用CELLO-seq对单细胞中的转座元件进行长读长RNA测序。
Nat Protoc. 2025 Jul 16. doi: 10.1038/s41596-025-01203-2.
2
Transcriptomic Profiling Provides Insight into the Molecular Basis of Heterosis in Philippine-Reared Hybrids.转录组分析为菲律宾培育的杂交种杂种优势的分子基础提供了见解。
Insects. 2025 Feb 26;16(3):243. doi: 10.3390/insects16030243.

本文引用的文献

1
Extending support for mouse data in the Molecular Signatures Database (MSigDB).扩展对分子特征数据库(MSigDB)中鼠标数据的支持。
Nat Methods. 2023 Nov;20(11):1619-1620. doi: 10.1038/s41592-023-02014-7.
2
T3E: a tool for characterising the epigenetic profile of transposable elements using ChIP-seq data.T3E:一种利用染色质免疫沉淀测序(ChIP-seq)数据来表征转座元件表观遗传图谱的工具。
Mob DNA. 2022 Nov 30;13(1):29. doi: 10.1186/s13100-022-00285-z.
3
clusterProfiler 4.0: A universal enrichment tool for interpreting omics data.clusterProfiler 4.0:用于解释组学数据的通用富集工具。
Innovation (Camb). 2021 Jul 1;2(3):100141. doi: 10.1016/j.xinn.2021.100141. eCollection 2021 Aug 28.
4
Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads.序列深度不变,测序量更少:解决模糊映射读取的贝叶斯方法。
PLoS Comput Biol. 2021 Apr 19;17(4):e1008926. doi: 10.1371/journal.pcbi.1008926. eCollection 2021 Apr.
5
Systematic benchmark of ancient DNA read mapping.系统评估古 DNA 读段映射。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab076.
6
The Dfam community resource of transposable element families, sequence models, and genome annotations.转座元件家族、序列模型和基因组注释的Dfam社区资源。
Mob DNA. 2021 Jan 12;12(1):2. doi: 10.1186/s13100-020-00230-y.
7
Handling multi-mapped reads in RNA-seq.处理RNA测序中的多重比对 reads
Comput Struct Biotechnol J. 2020 Jun 12;18:1569-1576. doi: 10.1016/j.csbj.2020.06.014. eCollection 2020.
8
Measuring and interpreting transposable element expression.测量和解释转座因子的表达。
Nat Rev Genet. 2020 Dec;21(12):721-736. doi: 10.1038/s41576-020-0251-y. Epub 2020 Jun 23.
9
Mobile genomics: tools and techniques for tackling transposons.移动基因组学:解决转座子的工具和技术。
Philos Trans R Soc Lond B Biol Sci. 2020 Mar 30;375(1795):20190345. doi: 10.1098/rstb.2019.0345. Epub 2020 Feb 10.
10
An enrichment method for mapping ambiguous reads to the reference genome for NGS analysis.一种用于将二代测序分析中的模糊读段映射到参考基因组的富集方法。
J Bioinform Comput Biol. 2019 Dec;17(6):1940012. doi: 10.1142/S0219720019400122.