• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

验证一种重新实现的算法,以整合转录组和 ChIP-seq 数据。

Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data.

机构信息

Department of Biochemistry and Convergence Medical Sciences and Institute of Medical Sciences, College of Medicine, Gyeongsang National University, Jinju, South Korea.

出版信息

PeerJ. 2023 Oct 20;11:e16318. doi: 10.7717/peerj.16318. eCollection 2023.

DOI:10.7717/peerj.16318
PMID:37876906
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10592348/
Abstract

Transcription factor binding to a gene regulatory region induces or represses its expression. Binding and expression target analysis (BETA) integrates the binding and gene expression data to predict this function. First, the regulatory potential of the factor is modeled based on the distance of its binding sites from the transcription start sites in a decay function. Then the differential expression statistics from an experiment where this factor was perturbed represent the binding effect. The rank product of the two values is employed to order in importance. This algorithm was originally implemented in Python. We reimplemented the algorithm in R to take advantage of existing data structures and other tools for downstream analyses. Here, we attempted to replicate the findings in the original BETA paper. We applied the new implementation to the same datasets using default and varying inputs and cutoffs. We successfully replicated the original results. Moreover, we showed that the method was appropriately influenced by varying the input and was robust to choices of cutoffs in statistical testing.

摘要

转录因子与基因调控区域结合会诱导或抑制其表达。结合和表达靶标分析(BETA)将结合和基因表达数据整合在一起,以预测该功能。首先,基于其结合位点与转录起始位点之间的距离,使用衰减函数对因子的调控潜力进行建模。然后,使用该因子受到干扰的实验中的差异表达统计数据来表示结合效应。这两个值的秩积用于重要性排序。该算法最初是用 Python 实现的。我们在 R 中重新实现了该算法,以利用现有的数据结构和其他工具进行下游分析。在这里,我们尝试复制 BETA 论文中的原始发现。我们使用默认值和变化的输入和截止值将新实现应用于相同的数据集。我们成功复制了原始结果。此外,我们还表明,该方法受输入变化的适当影响,并且在统计检验中对截止值的选择具有鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caab/10592348/d2ef6a4a103b/peerj-11-16318-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caab/10592348/d6fe1998a899/peerj-11-16318-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caab/10592348/d2ef6a4a103b/peerj-11-16318-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caab/10592348/d6fe1998a899/peerj-11-16318-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caab/10592348/d2ef6a4a103b/peerj-11-16318-g002.jpg

相似文献

1
Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data.验证一种重新实现的算法,以整合转录组和 ChIP-seq 数据。
PeerJ. 2023 Oct 20;11:e16318. doi: 10.7717/peerj.16318. eCollection 2023.
2
ChIP-R: Assembling reproducible sets of ChIP-seq and ATAC-seq peaks from multiple replicates.ChIP-R:从多个重复样本中组装可重复的ChIP-seq和ATAC-seq峰集。
Genomics. 2021 Jul;113(4):1855-1866. doi: 10.1016/j.ygeno.2021.04.026. Epub 2021 Apr 18.
3
From binding motifs in ChIP-Seq data to improved models of transcription factor binding sites.从染色质免疫沉淀测序(ChIP-Seq)数据中的结合基序到转录因子结合位点的改进模型
J Bioinform Comput Biol. 2013 Feb;11(1):1340004. doi: 10.1142/S0219720013400040. Epub 2013 Jan 16.
4
A novel statistical method for quantitative comparison of multiple ChIP-seq datasets.一种用于多个ChIP-seq数据集定量比较的新型统计方法。
Bioinformatics. 2015 Jun 15;31(12):1889-96. doi: 10.1093/bioinformatics/btv094. Epub 2015 Feb 13.
5
xcore: an R package for inference of gene expression regulators.xcore:用于推断基因表达调控因子的 R 包。
BMC Bioinformatics. 2023 Jan 11;24(1):14. doi: 10.1186/s12859-022-05084-0.
6
Target analysis by integration of transcriptome and ChIP-seq data with BETA.使用BETA整合转录组和ChIP-seq数据进行靶标分析。
Nat Protoc. 2013 Dec;8(12):2502-15. doi: 10.1038/nprot.2013.150. Epub 2013 Nov 21.
7
Virtual ChIP-seq: predicting transcription factor binding by learning from the transcriptome.虚拟 ChIP-seq:通过学习转录组预测转录因子结合。
Genome Biol. 2022 Jun 10;23(1):126. doi: 10.1186/s13059-022-02690-2.
8
DiffChIPL: a differential peak analysis method for high-throughput sequencing data with biological replicates based on limma.DiffChIPL:一种基于 limma 的具有生物学重复的高通量测序数据差异峰分析方法。
Bioinformatics. 2022 Sep 2;38(17):4062-4069. doi: 10.1093/bioinformatics/btac498.
9
MotifGenie: a Python application for searching transcription factor binding sequences using ChIP-Seq datasets.MotifGenie:一个使用 ChIP-Seq 数据集搜索转录因子结合序列的 Python 应用程序。
Bioinformatics. 2021 Nov 18;37(22):4238-4239. doi: 10.1093/bioinformatics/btab379.
10
De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。
Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.

本文引用的文献

1
Integrating binding and expression data to predict transcription factors combined function.整合结合和表达数据来预测转录因子的组合功能。
BMC Genomics. 2020 Sep 7;21(1):610. doi: 10.1186/s12864-020-06977-1.
2
Target analysis by integration of transcriptome and ChIP-seq data with BETA.使用BETA整合转录组和ChIP-seq数据进行靶标分析。
Nat Protoc. 2013 Dec;8(12):2502-15. doi: 10.1038/nprot.2013.150. Epub 2013 Nov 21.
3
TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelity.TET1 和羟甲基胞嘧啶在转录和 DNA 甲基化保真度中的作用。
Nature. 2011 May 19;473(7347):343-8. doi: 10.1038/nature10066. Epub 2011 Apr 13.
4
On the detection and refinement of transcription factor binding sites using ChIP-Seq data.利用 ChIP-Seq 数据检测和改进转录因子结合位点。
Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.
5
Predicting functionality of protein-DNA interactions by integrating diverse evidence.通过整合多种证据预测蛋白质- DNA相互作用的功能
Bioinformatics. 2009 Jun 15;25(12):i137-44. doi: 10.1093/bioinformatics/btp213.
6
A hierarchical network of transcription factors governs androgen receptor-dependent prostate cancer growth.一个转录因子的层级网络控制着雄激素受体依赖性前列腺癌的生长。
Mol Cell. 2007 Aug 3;27(3):380-92. doi: 10.1016/j.molcel.2007.05.041.
7
Genome-wide mapping of in vivo protein-DNA interactions.体内蛋白质-DNA相互作用的全基因组图谱绘制。
Science. 2007 Jun 8;316(5830):1497-502. doi: 10.1126/science.1141319. Epub 2007 May 31.
8
Genome-wide analysis of estrogen receptor binding sites.雌激素受体结合位点的全基因组分析。
Nat Genet. 2006 Nov;38(11):1289-97. doi: 10.1038/ng1901. Epub 2006 Oct 1.
9
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.基因集富集分析:一种基于知识的方法用于解读全基因组表达谱。
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50. doi: 10.1073/pnas.0506580102. Epub 2005 Sep 30.
10
Transcription factors: bound to activate or repress.
Trends Biochem Sci. 2001 Apr;26(4):211-3. doi: 10.1016/s0968-0004(01)01812-6.