• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

密码本的视角:未表征的人类转录因子的序列特异性

Perspectives on Codebook: sequence specificity of uncharacterized human transcription factors.

作者信息

Jolma Arttu, Laverty Kaitlin U, Fathi Ali, Yang Ally W H, Yellan Isaac, Vorontsov Ilya E, Inukai Sachi, Kribelbauer-Swietek Judith F, Gralak Antoni J, Razavi Rozita, Albu Mihai, Brechalov Alexander, Patel Zain M, Nozdrin Vladimir, Meshcheryakov Georgy, Kozin Ivan, Abramov Sergey, Boytsov Alexandr, Fornes Oriol, Makeev Vsevolod J, Grau Jan, Grosse Ivo, Bucher Philipp, Deplancke Bart, Kulakovskiy Ivan V, Hughes Timothy R

机构信息

Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada.

Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA.

出版信息

bioRxiv. 2024 Nov 12:2024.11.11.622097. doi: 10.1101/2024.11.11.622097.

DOI:10.1101/2024.11.11.622097
PMID:39605729
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11601247/
Abstract

We describe an effort ("Codebook") to determine the sequence specificity of 332 putative and largely uncharacterized human transcription factors (TFs), as well as 61 control TFs. Nearly 5,000 independent experiments across multiple and assays produced motifs for just over half of the putative TFs analyzed (177, or 53%), of which most are unique to a single TF. The data highlight the extensive contribution of transposable elements to TF evolution, both in and , and identify tens of thousands of conserved, base-level binding sites in the human genome. The use of multiple assays provides an unprecedented opportunity to benchmark and analyze TF sequence specificity, function, and evolution, as further explored in accompanying manuscripts. 1,421 human TFs are now associated with a DNA binding motif. Extrapolation from the Codebook benchmarking, however, suggests that many of the currently known binding motifs for well-studied TFs may inaccurately describe the TF's true sequence preferences.

摘要

我们描述了一项工作(“密码本”),以确定332种假定的、大多未被表征的人类转录因子(TFs)以及61种对照TFs的序列特异性。通过多个实验和检测方法进行的近5000次独立实验,为所分析的略超过一半的假定TFs(177个,即53%)生成了基序,其中大多数是单个TF所特有的。数据突出了转座元件在TF进化中的广泛贡献,包括在[具体方面1]和[具体方面2],并在人类基因组中识别出数以万计的保守的、碱基水平的结合位点。多种检测方法的使用为基准测试和分析TF序列特异性、功能及进化提供了前所未有的机会,正如随附手稿中进一步探讨的那样。现在有1421种人类TFs与一个DNA结合基序相关联。然而,从“密码本”基准测试推断,许多目前已知的、经过充分研究的TFs的结合基序可能无法准确描述TF的真实序列偏好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/0a0b9009c887/nihpp-2024.11.11.622097v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/267da3c70f65/nihpp-2024.11.11.622097v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/d201a9efad58/nihpp-2024.11.11.622097v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/45aa5315cdbe/nihpp-2024.11.11.622097v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/b1c2e08b6b78/nihpp-2024.11.11.622097v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/d25d996b7789/nihpp-2024.11.11.622097v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/0a0b9009c887/nihpp-2024.11.11.622097v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/267da3c70f65/nihpp-2024.11.11.622097v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/d201a9efad58/nihpp-2024.11.11.622097v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/45aa5315cdbe/nihpp-2024.11.11.622097v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/b1c2e08b6b78/nihpp-2024.11.11.622097v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/d25d996b7789/nihpp-2024.11.11.622097v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01e8/11601247/0a0b9009c887/nihpp-2024.11.11.622097v1-f0006.jpg

相似文献

1
Perspectives on Codebook: sequence specificity of uncharacterized human transcription factors.密码本的视角:未表征的人类转录因子的序列特异性
bioRxiv. 2024 Nov 12:2024.11.11.622097. doi: 10.1101/2024.11.11.622097.
2
Extensive binding of uncharacterized human transcription factors to genomic dark matter.未鉴定的人类转录因子与基因组暗物质的广泛结合。
bioRxiv. 2024 Nov 12:2024.11.11.622123. doi: 10.1101/2024.11.11.622123.
3
Cross-platform DNA motif discovery and benchmarking to explore binding specificities of poorly studied human transcription factors.跨平台DNA基序发现与基准测试,以探索研究较少的人类转录因子的结合特异性。
bioRxiv. 2024 Nov 13:2024.11.11.619379. doi: 10.1101/2024.11.11.619379.
4
GHT-SELEX demonstrates unexpectedly high intrinsic sequence specificity and complex DNA binding of many human transcription factors.GHT-SELEX显示出许多人类转录因子具有出乎意料的高内在序列特异性和复杂的DNA结合能力。
bioRxiv. 2024 Nov 12:2024.11.11.618478. doi: 10.1101/2024.11.11.618478.
5
Insights gained from a comprehensive all-against-all transcription factor binding motif benchmarking study.从全面的全对全转录因子结合基序基准测试研究中获得的见解。
Genome Biol. 2020 May 11;21(1):114. doi: 10.1186/s13059-020-01996-3.
6
Transcription factor-binding k-mer analysis clarifies the cell type dependency of binding specificities and cis-regulatory SNPs in humans.转录因子结合 k- -mer 分析阐明了人类结合特异性和顺式调控 SNP 的细胞类型依赖性。
BMC Genomics. 2023 Oct 7;24(1):597. doi: 10.1186/s12864-023-09692-9.
7
High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions.转录因子-DNA 亲和力的高分辨率模型可改善体外和体内结合预测。
PLoS Comput Biol. 2010 Sep 9;6(9):e1000916. doi: 10.1371/journal.pcbi.1000916.
8
DNA-dependent formation of transcription factor pairs alters their binding specificity.DNA 依赖性转录因子对的形成改变了它们的结合特异性。
Nature. 2015 Nov 19;527(7578):384-8. doi: 10.1038/nature15518. Epub 2015 Nov 9.
9
Computational identification of diverse mechanisms underlying transcription factor-DNA occupancy.计算鉴定转录因子-DNA 结合的多种机制。
PLoS Genet. 2013;9(8):e1003571. doi: 10.1371/journal.pgen.1003571. Epub 2013 Aug 1.
10
High-Throughput Protein Production Combined with High- Throughput SELEX Identifies an Extensive Atlas of Ciona robusta Transcription Factor DNA-Binding Specificities.高通量蛋白质生产与高通量SELEX相结合,确定了强大海鞘转录因子DNA结合特异性的广泛图谱。
Methods Mol Biol. 2019;2025:487-517. doi: 10.1007/978-1-4939-9624-7_23.

本文引用的文献

1
Statistical framework for calling allelic imbalance in high-throughput sequencing data.用于在高通量测序数据中检测等位基因不平衡的统计框架。
Nat Commun. 2025 Feb 18;16(1):1739. doi: 10.1038/s41467-024-55513-2.
2
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
3
Hold out the genome: a roadmap to solving the cis-regulatory code.伸出基因组:解决顺式调控代码的路线图。
Nature. 2024 Jan;625(7993):41-50. doi: 10.1038/s41586-023-06661-w. Epub 2023 Dec 13.
4
HOCOMOCO in 2024: a rebuild of the curated collection of binding models for human and mouse transcription factors.2024 年的 HOCOMOCO:人类和小鼠转录因子结合模型的精选集合的重建。
Nucleic Acids Res. 2024 Jan 5;52(D1):D154-D163. doi: 10.1093/nar/gkad1077.
5
JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles.JASPAR 2024:转录因子结合谱开放获取数据库的 20 周年纪念
Nucleic Acids Res. 2024 Jan 5;52(D1):D174-D182. doi: 10.1093/nar/gkad1059.
6
Ensembl 2024.Ensembl 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D891-D899. doi: 10.1093/nar/gkad1049.
7
Short tandem repeats bind transcription factors to tune eukaryotic gene expression.短串联重复序列结合转录因子来调节真核生物基因表达。
Science. 2023 Sep 22;381(6664):eadd1250. doi: 10.1126/science.add1250.
8
ExplaiNN: interpretable and transparent neural networks for genomics.ExplaiNN:基因组学的可解释和透明神经网络。
Genome Biol. 2023 Jun 27;24(1):154. doi: 10.1186/s13059-023-02985-y.
9
GENCODE: reference annotation for the human and mouse genomes in 2023.GENCODE:2023 年人类和小鼠基因组的参考注释。
Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949. doi: 10.1093/nar/gkac1071.
10
The UCSC Genome Browser database: 2023 update.UCSC 基因组浏览器数据库:2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1188-D1195. doi: 10.1093/nar/gkac1072.