• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类着丝粒区转录因子结合基序的显著聚类及其生物学意义。

Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance.

机构信息

Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama-shi, Shiga-ken, 526-0829, Japan.

出版信息

Chromosome Res. 2013 Aug;21(5):461-74. doi: 10.1007/s10577-013-9371-y. Epub 2013 Jul 30.

DOI:10.1007/s10577-013-9371-y
PMID:23896648
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3761090/
Abstract

Since oligonucleotide composition in the genome sequence varies significantly among species even among those possessing the same genome G + C%, the composition has been used to distinguish a wide range of genomes and called as "genome signature". Oligonucleotides often represent motif sequences responsible for sequence-specific protein binding (e.g., transcription-factor binding). Occurrences of such motif oligonucleotides in the genome should be biased compared to those observed in random sequences and may differ among genomes and genomic portions. Self-Organizing Map (SOM) is a powerful tool for clustering high-dimensional data such as oligonucleotide composition on one plane. We previously modified the conventional SOM for genome informatics to batch learning SOM or "BLSOM". When we constructed BLSOMs to analyze pentanucleotide composition in 20-, 50-, and 100-kb sequences derived from the human genome, BLSOMs did not classify human sequences according to chromosome but revealed several specific zones composed primarily of sequences derived from pericentric regions. Interestingly, various transcription-factor-binding motifs were characteristically overrepresented in pericentric regions but underrepresented in most genomic sequences. When we focused on much shorter sequences (e.g., 1 kb), the clustering of transcription-factor-binding motifs was evident in pericentric, subtelomeric and sex chromosome pseudoautosomal regions. The biological significance of the clustering in these regions was discussed in connection with cell-type and -stage-dependent chromocenter formation and nuclear organization.

摘要

由于基因组序列中的寡核苷酸组成在物种间甚至在具有相同基因组 G+C%的物种间差异很大,因此该组成已被用于区分广泛的基因组,并称为“基因组特征”。寡核苷酸通常代表负责序列特异性蛋白结合的基序序列(例如,转录因子结合)。与随机序列中观察到的寡核苷酸相比,基因组中此类基序寡核苷酸的出现应该存在偏向性,并且可能在基因组和基因组部分之间存在差异。自组织映射 (SOM) 是一种将基因组信息学中的高维数据(例如寡核苷酸组成)聚类到一个平面上的强大工具。我们之前对传统 SOM 进行了修改,以用于基因组信息学中的批量学习 SOM 或“BLSOM”。当我们构建 BLSOM 以分析来自人类基因组的 20、50 和 100 kb 序列中的五核苷酸组成时,BLSOM 并没有根据染色体对人类序列进行分类,而是揭示了由着丝粒区域主要组成的几个特定区域。有趣的是,各种转录因子结合基序在着丝粒区域中特征性地过表达,而在大多数基因组序列中则表达不足。当我们关注更短的序列(例如 1 kb)时,着丝粒、端粒和性染色体假常染色体区域中的转录因子结合基序聚类明显。讨论了这些区域中聚类的生物学意义,涉及细胞类型和阶段依赖性染色质中心形成和核组织。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/068a92c10900/10577_2013_9371_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/d20544469019/10577_2013_9371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/49a9c596eec3/10577_2013_9371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/b6361c4dff44/10577_2013_9371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/677285ad7e8b/10577_2013_9371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/da39c97d28a5/10577_2013_9371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/e8e5644e3899/10577_2013_9371_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/068a92c10900/10577_2013_9371_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/d20544469019/10577_2013_9371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/49a9c596eec3/10577_2013_9371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/b6361c4dff44/10577_2013_9371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/677285ad7e8b/10577_2013_9371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/da39c97d28a5/10577_2013_9371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/e8e5644e3899/10577_2013_9371_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc83/3761090/068a92c10900/10577_2013_9371_Fig7_HTML.jpg

相似文献

1
Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance.人类着丝粒区转录因子结合基序的显著聚类及其生物学意义。
Chromosome Res. 2013 Aug;21(5):461-74. doi: 10.1007/s10577-013-9371-y. Epub 2013 Jul 30.
2
CG-containing oligonucleotides and transcription factor-binding motifs are enriched in human pericentric regions.含CG的寡核苷酸和转录因子结合基序在人类着丝粒周围区域富集。
Genes Genet Syst. 2015;90(1):43-53. doi: 10.1266/ggs.90.43.
3
A novel bioinformatics method for efficient knowledge discovery by BLSOM from big genomic sequence data.一种通过BLSOM从大型基因组序列数据中进行高效知识发现的新型生物信息学方法。
Biomed Res Int. 2014;2014:765648. doi: 10.1155/2014/765648. Epub 2014 Apr 3.
4
Self-Organizing Map (SOM) unveils and visualizes hidden sequence characteristics of a wide range of eukaryote genomes.自组织映射(SOM)揭示并可视化了多种真核生物基因组的隐藏序列特征。
Gene. 2006 Jan 3;365:27-34. doi: 10.1016/j.gene.2005.09.040. Epub 2005 Dec 20.
5
Informatics for unveiling hidden genome signatures.用于揭示隐藏基因组特征的信息学。
Genome Res. 2003 Apr;13(4):693-702. doi: 10.1101/gr.634603.
6
Mb-level CpG and TFBS islands visualized by AI and their roles in the nuclear organization of the human genome.通过人工智能可视化的兆碱基级别的CpG和转录因子结合位点岛及其在人类基因组核组织中的作用。
Genes Genet Syst. 2020 Apr 22;95(1):29-41. doi: 10.1266/ggs.19-00027. Epub 2020 Mar 12.
7
Comparative genomic analysis of the human genome and six bat genomes using unsupervised machine learning: Mb-level CpG and TFBS islands.使用无监督机器学习对人类基因组和六倍体蝙蝠基因组进行比较基因组分析:Mb 级 CpG 和 TFBS 岛。
BMC Genomics. 2022 Jul 8;23(1):497. doi: 10.1186/s12864-022-08664-9.
8
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).一种用于分析微生物大序列数据以实现高效知识发现的新型生物信息学策略:批学习自组织映射(BLSOM)。
Microorganisms. 2013 Nov 20;1(1):137-157. doi: 10.3390/microorganisms1010137.
9
Unsupervised AI reveals insect species-specific genome signatures.无监督人工智能揭示昆虫物种特异性基因组特征。
PeerJ. 2024 Mar 6;12:e17025. doi: 10.7717/peerj.17025. eCollection 2024.
10
AI for the collective analysis of a massive number of genome sequences: various examples from the small genome of pandemic SARS-CoV-2 to the human genome.用于大量基因组序列集体分析的人工智能:从大流行的严重急性呼吸综合征冠状病毒2的小基因组到人类基因组的各种实例。
Genes Genet Syst. 2021 Dec 16;96(4):165-176. doi: 10.1266/ggs.21-00025. Epub 2021 Sep 27.

引用本文的文献

1
The developmental and evolutionary characteristics of transcription factor binding site clustered regions based on an explainable machine learning model.基于可解释机器学习模型的转录因子结合位点聚类区域的发育和进化特征。
Nucleic Acids Res. 2024 Jul 22;52(13):7610-7626. doi: 10.1093/nar/gkae441.
2
Unsupervised AI reveals insect species-specific genome signatures.无监督人工智能揭示昆虫物种特异性基因组特征。
PeerJ. 2024 Mar 6;12:e17025. doi: 10.7717/peerj.17025. eCollection 2024.
3
Comparative genomic analysis of the human genome and six bat genomes using unsupervised machine learning: Mb-level CpG and TFBS islands.

本文引用的文献

1
On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE.论电视机的不朽:根据无进化福音的 ENCODE,人类基因组中的“功能”。
Genome Biol Evol. 2013;5(3):578-90. doi: 10.1093/gbe/evt028.
2
Heterochromatin establishment in the context of genome-wide epigenetic reprogramming.在全基因组表观遗传重编程的背景下建立异染色质。
Trends Genet. 2011 May;27(5):177-85. doi: 10.1016/j.tig.2011.02.002. Epub 2011 Apr 15.
3
SUMOylation promotes de novo targeting of HP1α to pericentric heterochromatin.
使用无监督机器学习对人类基因组和六倍体蝙蝠基因组进行比较基因组分析:Mb 级 CpG 和 TFBS 岛。
BMC Genomics. 2022 Jul 8;23(1):497. doi: 10.1186/s12864-022-08664-9.
4
Comparative genomics of using unsupervised AI reveals a high CG frequency.利用无监督人工智能进行比较基因组学研究揭示了 的高 CG 频率。
Life Sci Alliance. 2021 Mar 12;4(5). doi: 10.26508/lsa.202000905. Print 2021 May.
5
Implication of a new function of human tDNAs in chromatin organization.人 tDNA 新功能在染色质组织中的意义。
Sci Rep. 2020 Oct 15;10(1):17440. doi: 10.1038/s41598-020-74499-7.
6
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).一种用于分析微生物大序列数据以实现高效知识发现的新型生物信息学策略:批学习自组织映射(BLSOM)。
Microorganisms. 2013 Nov 20;1(1):137-157. doi: 10.3390/microorganisms1010137.
7
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.tRNADB-CE:大数据时代的及时tRNA基因数据库。
Front Genet. 2014 May 1;5:114. doi: 10.3389/fgene.2014.00114. eCollection 2014.
8
A novel bioinformatics method for efficient knowledge discovery by BLSOM from big genomic sequence data.一种通过BLSOM从大型基因组序列数据中进行高效知识发现的新型生物信息学方法。
Biomed Res Int. 2014;2014:765648. doi: 10.1155/2014/765648. Epub 2014 Apr 3.
9
Evolutionary changes in vertebrate genome signatures with special focus on coelacanth.脊椎动物基因组特征的进化变化,特别关注腔棘鱼。
DNA Res. 2014 Oct;21(5):459-67. doi: 10.1093/dnares/dsu012. Epub 2014 May 6.
10
Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.通过批量学习自组织映射可视化真核生物基因组的基因组特征,特别强调果蝇基因组。
Biomed Res Int. 2014;2014:985706. doi: 10.1155/2014/985706. Epub 2014 Mar 11.
SUMOylation 促进 HP1α 从头定位于着丝粒异染色质。
Nat Genet. 2011 Mar;43(3):220-7. doi: 10.1038/ng.765. Epub 2011 Feb 13.
4
Genome-wide transcription factor binding: beyond direct target regulation.全基因组转录因子结合:超越直接靶标调控。
Trends Genet. 2011 Apr;27(4):141-8. doi: 10.1016/j.tig.2011.01.001. Epub 2011 Feb 4.
5
A strand-specific burst in transcription of pericentric satellites is required for chromocenter formation and early mouse development.着丝粒卫星的转录特异性爆发是染色中心形成和早期小鼠发育所必需的。
Dev Cell. 2010 Oct 19;19(4):625-38. doi: 10.1016/j.devcel.2010.09.002.
6
DNA binding of centromere protein C (CENPC) is stabilized by single-stranded RNA.着丝粒蛋白 C(CENPC)的 DNA 结合由单链 RNA 稳定。
PLoS Genet. 2010 Feb 5;6(2):e1000835. doi: 10.1371/journal.pgen.1000835.
7
Epigenetic inheritance during the cell cycle.细胞周期中的表观遗传继承。
Nat Rev Mol Cell Biol. 2009 Mar;10(3):192-206. doi: 10.1038/nrm2640.
8
Centromere RNA is a key component for the assembly of nucleoproteins at the nucleolus and centromere.着丝粒RNA是核仁与着丝粒处核蛋白组装的关键组成部分。
Genome Res. 2007 Aug;17(8):1146-60. doi: 10.1101/gr.6022807. Epub 2007 Jul 10.
9
Novel phylogenetic studies of genomic sequence fragments derived from uncultured microbe mixtures in environmental and clinical samples.对环境和临床样本中未培养微生物混合物来源的基因组序列片段进行的新型系统发育研究。
DNA Res. 2005;12(5):281-90. doi: 10.1093/dnares/dsi015. Epub 2006 Jan 10.
10
Self-Organizing Map (SOM) unveils and visualizes hidden sequence characteristics of a wide range of eukaryote genomes.自组织映射(SOM)揭示并可视化了多种真核生物基因组的隐藏序列特征。
Gene. 2006 Jan 3;365:27-34. doi: 10.1016/j.gene.2005.09.040. Epub 2005 Dec 20.