• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LASAGNA:一种用于转录因子结合位点比对的新算法。

LASAGNA: a novel algorithm for transcription factor binding site alignment.

机构信息

Department of Computer Science and Engineering, University of Connecticut, Fairfield Road, Storrs, CT 06269, USA.

出版信息

BMC Bioinformatics. 2013 Mar 24;14:108. doi: 10.1186/1471-2105-14-108.

DOI:10.1186/1471-2105-14-108
PMID:23522376
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3747862/
Abstract

BACKGROUND

Scientists routinely scan DNA sequences for transcription factor (TF) binding sites (TFBSs). Most of the available tools rely on position-specific scoring matrices (PSSMs) constructed from aligned binding sites. Because of the resolutions of assays used to obtain TFBSs, databases such as TRANSFAC, ORegAnno and PAZAR store unaligned variable-length DNA segments containing binding sites of a TF. These DNA segments need to be aligned to build a PSSM. While the TRANSFAC database provides scoring matrices for TFs, nearly 78% of the TFs in the public release do not have matrices available. As work on TFBS alignment algorithms has been limited, it is highly desirable to have an alignment algorithm tailored to TFBSs.

RESULTS

We designed a novel algorithm named LASAGNA, which is aware of the lengths of input TFBSs and utilizes position dependence. Results on 189 TFs of 5 species in the TRANSFAC database showed that our method significantly outperformed ClustalW2 and MEME. We further compared a PSSM method dependent on LASAGNA to an alignment-free TFBS search method. Results on 89 TFs whose binding sites can be located in genomes showed that our method is significantly more precise at fixed recall rates. Finally, we described LASAGNA-ChIP, a more sophisticated version for ChIP (Chromatin immunoprecipitation) experiments. Under the one-per-sequence model, it showed comparable performance with MEME in discovering motifs in ChIP-seq peak sequences.

CONCLUSIONS

We conclude that the LASAGNA algorithm is simple and effective in aligning variable-length binding sites. It has been integrated into a user-friendly webtool for TFBS search and visualization called LASAGNA-Search. The tool currently stores precomputed PSSM models for 189 TFs and 133 TFs built from TFBSs in the TRANSFAC Public database (release 7.0) and the ORegAnno database (08Nov10 dump), respectively. The webtool is available at http://biogrid.engr.uconn.edu/lasagna_search/.

摘要

背景

科学家通常会扫描 DNA 序列以寻找转录因子(TF)结合位点(TFBS)。大多数可用的工具都依赖于从对齐的结合位点构建的位置特异性评分矩阵(PSSM)。由于用于获取 TFBS 的测定方法的分辨率,TRANSFAC、ORegAnno 和 PAZAR 等数据库存储未对齐的可变长度 DNA 片段,其中包含 TF 的结合位点。这些 DNA 片段需要对齐以构建 PSSM。虽然 TRANSFAC 数据库为 TF 提供了评分矩阵,但在公共版本中,近 78%的 TF 没有可用的矩阵。由于 TFBS 对齐算法的工作有限,因此非常需要一种针对 TFBS 量身定制的对齐算法。

结果

我们设计了一种名为 LASAGNA 的新算法,它可以识别输入 TFBS 的长度并利用位置依赖性。在 TRANSFAC 数据库中的 5 个物种的 189 个 TF 上的结果表明,我们的方法明显优于 ClustalW2 和 MEME。我们进一步比较了依赖于 LASAGNA 的 PSSM 方法和无对齐 TFBS 搜索方法。在 89 个其结合位点可以定位在基因组中的 TF 上的结果表明,在固定召回率下,我们的方法具有更高的精度。最后,我们描述了 LASAGNA-ChIP,这是一种用于 ChIP(染色质免疫沉淀)实验的更复杂版本。在一个序列一个的模型下,它在发现 ChIP-seq 峰序列中的基序方面与 MEME 表现相当。

结论

我们得出结论,LASAGNA 算法在对齐可变长度结合位点方面简单有效。它已被集成到一个名为 LASAGNA-Search 的用户友好的 TFBS 搜索和可视化网络工具中。该工具目前为 189 个 TF 和分别来自 TRANSFAC Public 数据库(版本 7.0)和 ORegAnno 数据库(08Nov10 转储)的 133 个 TF 存储了预先计算的 PSSM 模型。该网络工具可在 http://biogrid.engr.uconn.edu/lasagna_search/ 访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/04271c68f866/1471-2105-14-108-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/716f37854477/1471-2105-14-108-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/21699bdb191b/1471-2105-14-108-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/32383952e07b/1471-2105-14-108-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/04271c68f866/1471-2105-14-108-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/716f37854477/1471-2105-14-108-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/21699bdb191b/1471-2105-14-108-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/32383952e07b/1471-2105-14-108-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cd/3747862/04271c68f866/1471-2105-14-108-4.jpg

相似文献

1
LASAGNA: a novel algorithm for transcription factor binding site alignment.LASAGNA:一种用于转录因子结合位点比对的新算法。
BMC Bioinformatics. 2013 Mar 24;14:108. doi: 10.1186/1471-2105-14-108.
2
LASAGNA-Search: an integrated web tool for transcription factor binding site search and visualization.LASAGNA-Search:一个用于转录因子结合位点搜索和可视化的集成网络工具。
Biotechniques. 2013 Mar;54(3):141-53. doi: 10.2144/000113999.
3
LASAGNA-Search 2.0: integrated transcription factor binding site search and visualization in a browser.LASAGNA-Search 2.0:在浏览器中进行整合的转录因子结合位点搜索与可视化。
Bioinformatics. 2014 Jul 1;30(13):1923-5. doi: 10.1093/bioinformatics/btu115. Epub 2014 Feb 26.
4
Improvement of TRANSFAC matrices using multiple local alignment of transcription factor binding site sequences.利用转录因子结合位点序列的多重局部比对改进TRANSFAC矩阵。
Genome Inform. 2005;16(1):68-72.
5
A novel method for improved accuracy of transcription factor binding site prediction.一种提高转录因子结合位点预测准确性的新方法。
Nucleic Acids Res. 2018 Jul 6;46(12):e72. doi: 10.1093/nar/gky237.
6
The next generation of transcription factor binding site prediction.下一代转录因子结合位点预测。
PLoS Comput Biol. 2013;9(9):e1003214. doi: 10.1371/journal.pcbi.1003214. Epub 2013 Sep 5.
7
Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data.从 ChIP-seq 数据中发现未知的人和小鼠转录因子结合位点及其特征。
Proc Natl Acad Sci U S A. 2021 May 18;118(20). doi: 10.1073/pnas.2026754118.
8
Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment.基于拓扑基序富集改进ChIP-Seq数据中转录因子结合位点的分析。
BMC Genomics. 2014 Jun 13;15(1):472. doi: 10.1186/1471-2164-15-472.
9
A Bayesian search for transcriptional motifs.贝叶斯搜索转录基序。
PLoS One. 2010 Nov 18;5(11):e13897. doi: 10.1371/journal.pone.0013897.
10
MEME-ChIP: motif analysis of large DNA datasets.MEME-ChIP:大 DNA 数据集的基序分析。
Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

引用本文的文献

1
FADD Activation in Hepatocellular Carcinoma Potentiates CD8+T Cell Responses and Sensitizes to Immune Checkpoint Inhibitors.肝细胞癌中FADD激活增强CD8 + T细胞反应并增强对免疫检查点抑制剂的敏感性。
Cancer Res. 2025 Jul 14. doi: 10.1158/0008-5472.CAN-24-3854.
2
Non-Canonical TERT Activity Initiates Osteogenesis in Calcific Aortic Valve Disease.非经典端粒酶逆转录酶(TERT)活性在钙化性主动脉瓣疾病中启动成骨作用。
Circ Res. 2025 Feb 14;136(4):403-421. doi: 10.1161/CIRCRESAHA.122.321889. Epub 2025 Jan 21.
3
SKA2 enhances stress-related glucocorticoid receptor signaling through FKBP4-FKBP5 interactions in neurons.

本文引用的文献

1
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
2
RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets.RSAT 峰基序:全尺寸 ChIP-seq 数据集的基序分析。
Nucleic Acids Res. 2012 Feb;40(4):e31. doi: 10.1093/nar/gkr1104. Epub 2011 Dec 8.
3
The UCSC Genome Browser database: extensions and updates 2011.UCSC 基因组浏览器数据库:扩展和更新 2011 年版。
SKA2通过神经元中FKBP4与FKBP5的相互作用增强应激相关的糖皮质激素受体信号传导。
Proc Natl Acad Sci U S A. 2024 Dec 24;121(52):e2417728121. doi: 10.1073/pnas.2417728121. Epub 2024 Dec 20.
4
Resequencing of the TMF-1 (TATA Element Modulatory Factor) regulated protein (TRNP1) gene in domestic and wild canids.家犬和野生犬科动物中TMF-1(TATA元件调节因子)调控蛋白(TRNP1)基因的重测序
Canine Med Genet. 2023 Nov 15;10(1):10. doi: 10.1186/s40575-023-00133-0.
5
Orphan Nuclear Receptor NR4A2 Is Constitutively Expressed in Cartilage and Upregulated in Inflamed Synovium From hTNF-Alpha Transgenic Mice.孤儿核受体NR4A2在软骨中组成性表达,并在hTNF-α转基因小鼠的炎性滑膜中上调。
Front Pharmacol. 2022 Apr 20;13:835697. doi: 10.3389/fphar.2022.835697. eCollection 2022.
6
Pro-inflammatory signals induce 20α-HSD expression in myometrial cells: A key mechanism for local progesterone withdrawal.促炎信号诱导肌层细胞中 20α-HSD 的表达:局部孕酮耗竭的关键机制。
J Cell Mol Med. 2021 Jul;25(14):6773-6785. doi: 10.1111/jcmm.16681. Epub 2021 Jun 10.
7
Flexible comparative genomics of prokaryotic transcriptional regulatory networks.原核生物转录调控网络的灵活比较基因组学。
BMC Genomics. 2020 Dec 16;21(Suppl 5):466. doi: 10.1186/s12864-020-06838-x.
8
Differential Glucocorticoid-Dependent Regulation and Function of the ERRFI1 Gene in Triple-Negative Breast Cancer.差异糖皮质激素依赖性调节和 ERRFI1 基因在三阴性乳腺癌中的功能。
Endocrinology. 2020 Jul 1;161(7). doi: 10.1210/endocr/bqaa082.
9
Gene expression patterns of chicken neuregulin 3 in association with copy number variation and frameshift deletion.鸡神经调节蛋白3的基因表达模式与拷贝数变异和移码缺失的关系
BMC Genet. 2017 Jul 21;18(1):69. doi: 10.1186/s12863-017-0537-z.
10
Every Site Counts: Submitting Transcription Factor-Binding Site Information through the CollecTF Portal.每个位点都很重要:通过CollecTF门户提交转录因子结合位点信息。
J Bacteriol. 2015 Aug 1;197(15):2454-7. doi: 10.1128/JB.00031-15. Epub 2015 May 26.
Nucleic Acids Res. 2012 Jan;40(Database issue):D918-23. doi: 10.1093/nar/gkr1055. Epub 2011 Nov 15.
4
c-Myb and GATA-3 cooperatively regulate IL-13 expression via conserved GATA-3 response element and recruit mixed lineage leukemia (MLL) for histone modification of the IL-13 locus.c-Myb 和 GATA-3 通过保守的 GATA-3 反应元件协同调节 IL-13 的表达,并募集混合谱系白血病(MLL)对 IL-13 基因座进行组蛋白修饰。
J Immunol. 2011 Dec 1;187(11):5974-82. doi: 10.4049/jimmunol.1100550. Epub 2011 Oct 28.
5
SiTaR: a novel tool for transcription factor binding site prediction.SiTaR:一种用于转录因子结合位点预测的新型工具。
Bioinformatics. 2011 Oct 15;27(20):2806-11. doi: 10.1093/bioinformatics/btr492. Epub 2011 Sep 4.
6
Recognition of beta-structural motifs using hidden Markov models trained with simulated evolution.使用模拟进化训练的隐马尔可夫模型识别β结构基序。
Bioinformatics. 2010 Jun 15;26(12):i287-93. doi: 10.1093/bioinformatics/btq199.
7
TransFind--predicting transcriptional regulators for gene sets.TransFind——用于预测基因集转录调控因子的工具。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W275-80. doi: 10.1093/nar/gkq438. Epub 2010 May 28.
8
Inclusion of neighboring base interdependencies substantially improves genome-wide prokaryotic transcription factor binding site prediction.纳入相邻碱基的相互依赖关系可极大地提高全基因组原核转录因子结合位点预测的准确性。
Nucleic Acids Res. 2010 Jul;38(12):e135. doi: 10.1093/nar/gkq274. Epub 2010 May 3.
9
Evidence-ranked motif identification.证据排序基序识别。
Genome Biol. 2010;11(2):R19. doi: 10.1186/gb-2010-11-2-r19. Epub 2010 Feb 15.
10
Insights from genomic profiling of transcription factors.转录因子基因组分析的见解。
Nat Rev Genet. 2009 Sep;10(9):605-16. doi: 10.1038/nrg2636. Epub 2009 Aug 11.