• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

aPEAch:用于表观基因组和转录组数据端到端分析的自动化流程

aPEAch: Automated Pipeline for End-to-End Analysis of Epigenomic and Transcriptomic Data.

作者信息

Xiropotamos Panagiotis, Papageorgiou Foteini, Manousaki Haris, Sinnis Charalampos, Antonatos Charalabos, Vasilopoulos Yiannis, Georgakilas Georgios K

机构信息

Laboratory of Genetics, Section of Genetics, Cell Biology and Development, Department of Biology, University of Patras, 26504 Patras, Greece.

Information Management Systems Institute, ATHENA Research Center, 15125 Marousi, Greece.

出版信息

Biology (Basel). 2024 Jul 2;13(7):492. doi: 10.3390/biology13070492.

DOI:10.3390/biology13070492
PMID:39056686
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11273691/
Abstract

With the advent of next-generation sequencing (NGS), experimental techniques that capture the biological significance of DNA loci or RNA molecules have emerged as fundamental tools for studying the epigenome and transcriptional regulation on a genome-wide scale. The volume of the generated data and the underlying complexity regarding their analysis highlight the need for robust and easy-to-use computational analytic methods that can streamline the process and provide valuable biological insights. Our solution, aPEAch, is an automated pipeline that facilitates the end-to-end analysis of both DNA- and RNA-sequencing assays, including small RNA sequencing, from assessing the quality of the input sample files to answering meaningful biological questions by exploiting the rich information embedded in biological data. Our method is implemented in Python, based on a modular approach that enables users to choose the path and extent of the analysis and the representations of the results. The pipeline can process samples with single or multiple replicates in batches, allowing the ease of use and reproducibility of the analysis across all samples. aPEAch provides a variety of sample metrics such as quality control reports, fragment size distribution plots, and all intermediate output files, enabling the pipeline to be re-executed with different parameters or algorithms, along with the publication-ready visualization of the results. Furthermore, aPEAch seamlessly incorporates advanced unsupervised learning analyses by automating clustering optimization and visualization, thus providing invaluable insight into the underlying biological mechanisms.

摘要

随着下一代测序(NGS)技术的出现,能够捕捉DNA位点或RNA分子生物学意义的实验技术已成为在全基因组范围内研究表观基因组和转录调控的基本工具。所生成数据的量以及分析其潜在复杂性凸显了对强大且易于使用的计算分析方法的需求,这些方法可以简化流程并提供有价值的生物学见解。我们的解决方案aPEAch是一个自动化流程,它有助于对DNA测序和RNA测序检测进行端到端分析,包括小RNA测序,从评估输入样本文件的质量到通过利用生物数据中嵌入的丰富信息回答有意义的生物学问题。我们的方法是用Python实现的,基于模块化方法,使用户能够选择分析的路径和范围以及结果的表示形式。该流程可以批量处理单个或多个重复样本,从而使所有样本的分析易于使用且具有可重复性。aPEAch提供各种样本指标,如质量控制报告、片段大小分布图以及所有中间输出文件,使该流程能够使用不同的参数或算法重新执行,并能对结果进行可用于发表的可视化展示。此外,aPEAch通过自动执行聚类优化和可视化无缝整合了先进的无监督学习分析,从而为潜在的生物学机制提供了宝贵的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/34519593a60a/biology-13-00492-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/e8be4eddb962/biology-13-00492-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/6132e292ea03/biology-13-00492-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/34519593a60a/biology-13-00492-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/e8be4eddb962/biology-13-00492-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/6132e292ea03/biology-13-00492-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e27/11273691/34519593a60a/biology-13-00492-g003.jpg

相似文献

1
aPEAch: Automated Pipeline for End-to-End Analysis of Epigenomic and Transcriptomic Data.aPEAch:用于表观基因组和转录组数据端到端分析的自动化流程
Biology (Basel). 2024 Jul 2;13(7):492. doi: 10.3390/biology13070492.
2
CATCH-UP: A High-Throughput Upstream-Pipeline for Bulk ATAC-Seq and ChIP-Seq Data.CATCH-UP:一种用于批量 ATAC-Seq 和 ChIP-Seq 数据的高通量上游管道。
J Vis Exp. 2023 Sep 22(199). doi: 10.3791/65633.
3
Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses.Unipro UGENE NGS 流水线和组件,用于变体调用、RNA-seq 和 ChIP-seq 数据分析。
PeerJ. 2014 Nov 4;2:e644. doi: 10.7717/peerj.644. eCollection 2014.
4
Octopus-toolkit: a workflow to automate mining of public epigenomic and transcriptomic next-generation sequencing data.章鱼工具包:一个自动化挖掘公共表观基因组和转录组下一代测序数据的工作流程。
Nucleic Acids Res. 2018 May 18;46(9):e53. doi: 10.1093/nar/gky083.
5
Automated Isoform Diversity Detector (AIDD): a pipeline for investigating transcriptome diversity of RNA-seq data.自动化异构体多样性检测工具(AIDD):用于研究 RNA-seq 数据转录组多样性的管道。
BMC Bioinformatics. 2020 Dec 30;21(Suppl 18):578. doi: 10.1186/s12859-020-03888-6.
6
ALGAEFUN with MARACAS, microALGAE FUNctional enrichment tool for MicroAlgae RnA-seq and Chip-seq AnalysiS.玛拉卡斯藻趣,一种微藻 RNA-seq 和 Chip-seq 分析的微藻功能富集工具。
BMC Bioinformatics. 2022 Mar 31;23(1):113. doi: 10.1186/s12859-022-04639-5.
7
CIPHER: a flexible and extensive workflow platform for integrative next-generation sequencing data analysis and genomic regulatory element prediction.CIPHER:一个用于整合下一代测序数据分析和基因组调控元件预测的灵活且功能广泛的工作流程平台。
BMC Bioinformatics. 2017 Aug 8;18(1):363. doi: 10.1186/s12859-017-1770-1.
8
CANEapp: a user-friendly application for automated next generation transcriptomic data analysis.CANEapp:一款用于自动化下一代转录组数据分析的用户友好型应用程序。
BMC Genomics. 2016 Jan 13;17:49. doi: 10.1186/s12864-015-2346-y.
9
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
10
CoBRA: Containerized Bioinformatics Workflow for Reproducible ChIP/ATAC-seq Analysis.CoBRA:用于可重复 ChIP/ATAC-seq 分析的集装箱化生物信息学工作流程。
Genomics Proteomics Bioinformatics. 2021 Aug;19(4):652-661. doi: 10.1016/j.gpb.2020.11.007. Epub 2021 Jul 18.

引用本文的文献

1
Transcriptome-wide analyses delineate the genetic architecture of expression variation in atopic dermatitis.全转录组分析描绘了特应性皮炎中表达变异的遗传结构。
HGG Adv. 2025 Apr 10;6(2):100422. doi: 10.1016/j.xhgg.2025.100422. Epub 2025 Feb 26.

本文引用的文献

1
PyDESeq2: a python package for bulk RNA-seq differential expression analysis.PyDESeq2:一个用于批量 RNA-seq 差异表达分析的 Python 包。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad547.
2
Next-Generation Sequencing Technology: Current Trends and Advancements.下一代测序技术:当前趋势与进展
Biology (Basel). 2023 Jul 13;12(7):997. doi: 10.3390/biology12070997.
3
DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.DNAscan2:一种功能多样、可扩展且易于使用的人类下一代测序数据分析管道。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad152.
4
GSEApy: a comprehensive package for performing gene set enrichment analysis in Python.GSEApy:一个用于在 Python 中进行基因集富集分析的综合软件包。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac757.
5
Ensembl Genomes 2022: an expanding genome resource for non-vertebrates.Ensembl Genomes 2022:一个不断扩展的非脊椎动物基因组资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D996-D1003. doi: 10.1093/nar/gkab1007.
6
Integrative, normalization-insusceptible statistical analysis of RNA-Seq data, with improved differential expression and unbiased downstream functional analysis.RNA-Seq 数据的综合、不受归一化影响的统计分析,可改善差异表达分析,并进行无偏下游功能分析。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa156.
7
MethylStar: A fast and robust pre-processing pipeline for bulk or single-cell whole-genome bisulfite sequencing data.MethylStar:一个快速且稳健的用于批量或单细胞全基因组 bisulfite 测序数据的预处理流水线。
BMC Genomics. 2020 Jul 13;21(1):479. doi: 10.1186/s12864-020-06886-3.
8
ImmGen at 15.免疫基因组计划15年
Nat Immunol. 2020 Jul;21(7):700-703. doi: 10.1038/s41590-020-0687-4.
9
CSI NGS Portal: An Online Platform for Automated NGS Data Analysis and Sharing.CSI NGS 门户:一个用于自动化 NGS 数据分析和共享的在线平台。
Int J Mol Sci. 2020 May 28;21(11):3828. doi: 10.3390/ijms21113828.
10
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2020 update.Galaxy 平台,用于实现可访问、可重现和协作的生物医学分析:2020 年更新。
Nucleic Acids Res. 2020 Jul 2;48(W1):W395-W402. doi: 10.1093/nar/gkaa434.