• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2024 年的 HOCOMOCO:人类和小鼠转录因子结合模型的精选集合的重建。

HOCOMOCO in 2024: a rebuild of the curated collection of binding models for human and mouse transcription factors.

机构信息

Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991 Moscow, Russia.

Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia.

出版信息

Nucleic Acids Res. 2024 Jan 5;52(D1):D154-D163. doi: 10.1093/nar/gkad1077.

DOI:10.1093/nar/gkad1077
PMID:37971293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10767914/
Abstract

We present a major update of the HOCOMOCO collection that provides DNA binding specificity patterns of 949 human transcription factors and 720 mouse orthologs. To make this release, we performed motif discovery in peak sets that originated from 14 183 ChIP-Seq experiments and reads from 2554 HT-SELEX experiments yielding more than 400 thousand candidate motifs. The candidate motifs were annotated according to their similarity to known motifs and the hierarchy of DNA-binding domains of the respective transcription factors. Next, the motifs underwent human expert curation to stratify distinct motif subtypes and remove non-informative patterns and common artifacts. Finally, the curated subset of 100 thousand motifs was supplied to the automated benchmarking to select the best-performing motifs for each transcription factor. The resulting HOCOMOCO v12 core collection contains 1443 verified position weight matrices, including distinct subtypes of DNA binding motifs for particular transcription factors. In addition to the core collection, HOCOMOCO v12 provides motif sets optimized for the recognition of binding sites in vivo and in vitro, and for annotation of regulatory sequence variants. HOCOMOCO is available at https://hocomoco12.autosome.org and https://hocomoco.autosome.org.

摘要

我们呈现了 HOCOMOCO 集合的重大更新,其中提供了 949 个人类转录因子和 720 个鼠标同源物的 DNA 结合特异性模式。为了发布此版本,我们在源自 14183 个 ChIP-Seq 实验和 2554 个 HT-SELEX 实验的峰集中进行了基序发现,产生了超过 40 万个候选基序。候选基序根据其与已知基序的相似性以及各自转录因子的 DNA 结合域层次结构进行了注释。接下来,对基序进行了人类专家策展,以对不同的基序亚型进行分层,并去除非信息性模式和常见的人工制品。最后,对经过策展的 10 万个基序子集进行了自动基准测试,以选择每个转录因子表现最佳的基序。由此产生的 HOCOMOCO v12 核心集合包含 1443 个经过验证的位置权重矩阵,其中包括特定转录因子的 DNA 结合基序的独特亚型。除了核心集合之外,HOCOMOCO v12 还提供了针对体内和体外结合位点识别以及调控序列变体注释优化的基序集。HOCOMOCO 可在 https://hocomoco12.autosome.org 和 https://hocomoco.autosome.org 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/d9405d98cbe1/gkad1077fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/ceb7273df360/gkad1077figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/504774f2e7e6/gkad1077fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/1f62ca0fdb29/gkad1077fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/d9405d98cbe1/gkad1077fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/ceb7273df360/gkad1077figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/504774f2e7e6/gkad1077fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/1f62ca0fdb29/gkad1077fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d39d/10767914/d9405d98cbe1/gkad1077fig3.jpg

相似文献

1
HOCOMOCO in 2024: a rebuild of the curated collection of binding models for human and mouse transcription factors.2024 年的 HOCOMOCO:人类和小鼠转录因子结合模型的精选集合的重建。
Nucleic Acids Res. 2024 Jan 5;52(D1):D154-D163. doi: 10.1093/nar/gkad1077.
2
HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.HOCOMOCO:通过大规模的 ChIP-Seq 分析,构建人类和小鼠转录因子结合模型的完整集合。
Nucleic Acids Res. 2018 Jan 4;46(D1):D252-D259. doi: 10.1093/nar/gkx1106.
3
HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models.HOCOMOCO:转录因子结合位点模型集合的扩展与增强
Nucleic Acids Res. 2016 Jan 4;44(D1):D116-25. doi: 10.1093/nar/gkv1249. Epub 2015 Nov 19.
4
HOCOMOCO: a comprehensive collection of human transcription factor binding sites models.HOCOMOCO:一个全面的人类转录因子结合位点模型集合。
Nucleic Acids Res. 2013 Jan;41(Database issue):D195-202. doi: 10.1093/nar/gks1089. Epub 2012 Nov 21.
5
Factorbook: an updated catalog of transcription factor motifs and candidate regulatory motif sites.Factorbook:转录因子基序和候选调控基序位点的更新目录。
Nucleic Acids Res. 2022 Jan 7;50(D1):D141-D149. doi: 10.1093/nar/gkab1039.
6
Sequence homology in eukaryotes (SHOE): interactive visual tool for promoter analysis.真核生物序列同源性(SHOE):用于启动子分析的交互式可视化工具。
BMC Genomics. 2018 Sep 27;19(1):715. doi: 10.1186/s12864-018-5101-3.
7
abc4pwm: affinity based clustering for position weight matrices in applications of DNA sequence analysis.abc4pwm:基于亲和度的位置权重矩阵聚类在 DNA 序列分析中的应用。
BMC Bioinformatics. 2022 Mar 3;23(1):83. doi: 10.1186/s12859-022-04615-z.
8
A De Novo Shape Motif Discovery Algorithm Reveals Preferences of Transcription Factors for DNA Shape Beyond Sequence Motifs.一种新的形状基序发现算法揭示了转录因子对 DNA 形状的偏好,超越了序列基序。
Cell Syst. 2019 Jan 23;8(1):27-42.e6. doi: 10.1016/j.cels.2018.12.001. Epub 2019 Jan 16.
9
GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments.GTRD:一个通过染色质免疫沉淀测序(ChIP-seq)实验鉴定出的转录因子结合位点数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D61-D67. doi: 10.1093/nar/gkw951. Epub 2016 Oct 24.
10
Improved linking of motifs to their TFs using domain information.利用域信息改进基序与其 TF 的关联。
Bioinformatics. 2020 Mar 1;36(6):1655-1662. doi: 10.1093/bioinformatics/btz855.

引用本文的文献

1
Dose-dependent interferon programs in myeloid cells after mRNA and adenovirus COVID-19 vaccination.mRNA和腺病毒COVID-19疫苗接种后髓系细胞中剂量依赖性干扰素程序
bioRxiv. 2025 Aug 18:2025.08.15.668720. doi: 10.1101/2025.08.15.668720.
2
Interactions between the genome and the nuclear lamina are multivalent and cooperative.基因组与核纤层之间的相互作用是多价且协同的。
Nat Struct Mol Biol. 2025 Sep 1. doi: 10.1038/s41594-025-01655-w.
3
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA.DART-Eval:一个关于调控DNA的全面DNA语言模型评估基准。

本文引用的文献

1
rs71327024 Associated with COVID-19 Hospitalization Reduces Promoter Activity in Human CD4 T Cells via Disruption of c-Myb Binding.rs71327024 与 COVID-19 住院治疗相关,通过破坏 c-Myb 结合降低人 CD4 T 细胞启动子活性。
Int J Mol Sci. 2023 Sep 7;24(18):13790. doi: 10.3390/ijms241813790.
2
ExplaiNN: interpretable and transparent neural networks for genomics.ExplaiNN:基因组学的可解释和透明神经网络。
Genome Biol. 2023 Jun 27;24(1):154. doi: 10.1186/s13059-023-02985-y.
3
A survey on algorithms to characterize transcription factor binding sites.
ArXiv. 2025 Aug 4:arXiv:2412.05430v2.
4
Dengue virus susceptibility in Aedes aegypti linked to natural cytochrome P450 promoter variants.埃及伊蚊对登革病毒的易感性与天然细胞色素P450启动子变体有关。
Nat Commun. 2025 Aug 12;16(1):7468. doi: 10.1038/s41467-025-62693-y.
5
Aedes aegypti VLG-1 challenges the assumed antiviral nature of Vago genes.埃及伊蚊VLG-1对Vago基因假定的抗病毒特性提出了挑战。
BMC Biol. 2025 Jul 28;23(1):223. doi: 10.1186/s12915-025-02325-5.
6
TRIM33 loss reduces androgen receptor transcriptional output and H2BK120 ubiquitination.TRIM33缺失会降低雄激素受体转录输出和H2BK120泛素化。
Commun Biol. 2025 Jul 11;8(1):1043. doi: 10.1038/s42003-025-08449-2.
7
CanASM: a comprehensive database for genome-wide allele-specific DNA methylation identification and annotation in cancer.CanASM:一个用于癌症全基因组等位基因特异性DNA甲基化鉴定和注释的综合数据库。
BMC Genomics. 2025 Jul 9;26(1):648. doi: 10.1186/s12864-025-11849-7.
8
Enhancer RNA-mediated transcriptional regulatory programs reveal the malignant progression of glioma.增强子RNA介导的转录调控程序揭示了胶质瘤的恶性进展。
Sci Adv. 2025 Jun 6;11(23):eadu9487. doi: 10.1126/sciadv.adu9487.
9
Single-cell ultra-high-throughput multiplexed chromatin and RNA profiling reveals gene regulatory dynamics.单细胞超高通量多重染色质和RNA分析揭示基因调控动态
Nat Methods. 2025 May 26. doi: 10.1038/s41592-025-02700-8.
10
Characterization and identification of extrachromosomal circular DNA in cholangiocarcinoma.胆管癌中染色体外环状DNA的表征与鉴定
PLoS One. 2025 May 5;20(5):e0322173. doi: 10.1371/journal.pone.0322173. eCollection 2025.
一种用于刻画转录因子结合位点的算法研究综述。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad156.
4
Transcription factor binding site orientation and order are major drivers of gene regulatory activity.转录因子结合位点的方向和顺序是基因调控活性的主要驱动因素。
Nat Commun. 2023 Apr 22;14(1):2333. doi: 10.1038/s41467-023-37960-5.
5
gDesigner: computational design of synthetic gRNAs for Cas12a-based transcriptional repression in mammalian cells.gDesigner:基于 Cas12a 的转录抑制的哺乳动物细胞中合成 gRNA 的计算设计。
NPJ Syst Biol Appl. 2022 Sep 16;8(1):34. doi: 10.1038/s41540-022-00241-w.
6
Positional weight matrices have sufficient prediction power for analysis of noncoding variants.位置权重矩阵对于分析非编码变异具有足够的预测能力。
F1000Res. 2022 Jan 12;11:33. doi: 10.12688/f1000research.75471.3. eCollection 2022.
7
ANANASTRA: annotation and enrichment analysis of allele-specific transcription factor binding at SNPs.ANANASTRA:SNP 处等位基因特异性转录因子结合的注释和富集分析。
Nucleic Acids Res. 2022 Jul 5;50(W1):W51-W56. doi: 10.1093/nar/gkac262.
8
Interrogating cell type-specific cooperation of transcriptional regulators in 3D chromatin.探究三维染色质中转录调节因子的细胞类型特异性协同作用。
iScience. 2021 Nov 18;24(12):103468. doi: 10.1016/j.isci.2021.103468. eCollection 2021 Dec 17.
9
JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles.JASPAR 2022:转录因子结合谱开放获取数据库的第 9 个版本。
Nucleic Acids Res. 2022 Jan 7;50(D1):D165-D173. doi: 10.1093/nar/gkab1113.
10
A GO catalogue of human DNA-binding transcription factors.人类 DNA 结合转录因子的 GO 目录。
Biochim Biophys Acta Gene Regul Mech. 2021 Nov-Dec;1864(11-12):194765. doi: 10.1016/j.bbagrm.2021.194765. Epub 2021 Oct 18.