• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加州大学圣克鲁兹分校重复序列浏览器可实现对重复序列家族间进化冲突的发现与可视化展示。

The UCSC repeat browser allows discovery and visualization of evolutionary conflict across repeat families.

作者信息

Fernandes Jason D, Zamudio-Hurtado Armando, Clawson Hiram, Kent W James, Haussler David, Salama Sofie R, Haeussler Maximilian

机构信息

1Genomics Institute, University of California, Santa Cruz, USA.

2Department of Biomolecular Engineering, University of California, Santa Cruz, USA.

出版信息

Mob DNA. 2020 Mar 31;11:13. doi: 10.1186/s13100-020-00208-w. eCollection 2020.

DOI:10.1186/s13100-020-00208-w
PMID:32266012
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7110667/
Abstract

BACKGROUND

Nearly half the human genome consists of repeat elements, most of which are retrotransposons, and many of which play important biological roles. However repeat elements pose several unique challenges to current bioinformatic analyses and visualization tools, as short repeat sequences can map to multiple genomic loci resulting in their misclassification and misinterpretation. In fact, sequence data mapping to repeat elements are often discarded from analysis pipelines. Therefore, there is a continued need for standardized tools and techniques to interpret genomic data of repeats.

RESULTS

We present the UCSC Repeat Browser, which consists of a complete set of human repeat reference sequences derived from annotations made by the commonly used program RepeatMasker. The UCSC Repeat Browser also provides an alignment from the human genome to these references, uses it to map the standard human genome annotation tracks, and presents all of them as a comprehensive interface to facilitate work with repetitive elements. It also provides processed tracks of multiple publicly available datasets of particular interest to the repeat community, including ChIP-seq datasets for KRAB Zinc Finger Proteins (KZNFs) - a family of proteins known to bind and repress certain classes of repeats. We used the UCSC Repeat Browser in combination with these datasets, as well as RepeatMasker annotations in several non-human primates, to trace the independent trajectories of species-specific evolutionary battles between LINE 1 retroelements and their repressors. Furthermore, we document at https://repeatbrowser.ucsc.edu how researchers can map their own human genome annotations to these reference repeat sequences.

CONCLUSIONS

The UCSC Repeat Browser allows easy and intuitive visualization of genomic data on consensus repeat elements, circumventing the problem of multi-mapping, in which sequencing reads of repeat elements map to multiple locations on the human genome. By developing a reference consensus, multiple datasets and annotation tracks can easily be overlaid to reveal complex evolutionary histories of repeats in a single interactive window. Specifically, we use this approach to retrace the history of several primate specific LINE-1 families across apes, and discover several species-specific routes of evolution that correlate with the emergence and binding of KZNFs.

摘要

背景

近一半的人类基因组由重复元件组成,其中大部分是逆转录转座子,许多重复元件发挥着重要的生物学作用。然而,重复元件给当前的生物信息分析和可视化工具带来了几个独特的挑战,因为短重复序列可以映射到多个基因组位点,导致它们被错误分类和错误解读。事实上,映射到重复元件的序列数据在分析流程中常常被丢弃。因此,持续需要标准化的工具和技术来解读重复序列的基因组数据。

结果

我们展示了加州大学圣克鲁兹分校重复序列浏览器(UCSC Repeat Browser),它由一组完整的人类重复序列参考序列组成,这些序列源自常用程序RepeatMasker所做的注释。UCSC重复序列浏览器还提供了从人类基因组到这些参考序列的比对,利用它来映射标准的人类基因组注释轨道,并将所有这些作为一个综合界面呈现出来,以便于处理重复元件。它还提供了重复序列研究群体特别感兴趣的多个公开可用数据集的处理轨道,包括KRAB锌指蛋白(KZNFs)的染色质免疫沉淀测序(ChIP-seq)数据集——这是一类已知能结合并抑制某些重复序列类别的蛋白质家族。我们将UCSC重复序列浏览器与这些数据集以及几种非人类灵长类动物中的RepeatMasker注释结合使用,以追踪LINE 1逆转录元件与其抑制因子之间物种特异性进化斗争的独立轨迹。此外,我们在https://repeatbrowser.ucsc.edu上记录了研究人员如何将他们自己的人类基因组注释映射到这些参考重复序列上。

结论

UCSC重复序列浏览器允许轻松直观地可视化共有重复元件上的基因组数据,规避了多重映射问题,即重复元件的测序读数映射到人类基因组上的多个位置的问题。通过开发一个参考共有序列,可以轻松叠加多个数据集和注释轨道,在单个交互式窗口中揭示重复序列复杂的进化历史。具体来说,我们使用这种方法追溯了几种灵长类特异性LINE-1家族在猿类中的历史,并发现了几条与KZNFs的出现和结合相关的物种特异性进化途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/86fa5d411bfc/13100_2020_208_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/5ac2667d4658/13100_2020_208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/8675eb784ff9/13100_2020_208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/2355de24006f/13100_2020_208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/f76a7a0cd544/13100_2020_208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/7f7742112ada/13100_2020_208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/f7e46b3640ed/13100_2020_208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/86fa5d411bfc/13100_2020_208_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/5ac2667d4658/13100_2020_208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/8675eb784ff9/13100_2020_208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/2355de24006f/13100_2020_208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/f76a7a0cd544/13100_2020_208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/7f7742112ada/13100_2020_208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/f7e46b3640ed/13100_2020_208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcf/7110667/86fa5d411bfc/13100_2020_208_Fig7_HTML.jpg

相似文献

1
The UCSC repeat browser allows discovery and visualization of evolutionary conflict across repeat families.加州大学圣克鲁兹分校重复序列浏览器可实现对重复序列家族间进化冲突的发现与可视化展示。
Mob DNA. 2020 Mar 31;11:13. doi: 10.1186/s13100-020-00208-w. eCollection 2020.
2
Comparative genomic analysis using the UCSC genome browser.使用加州大学圣克鲁兹分校(UCSC)基因组浏览器进行比较基因组分析。
Methods Mol Biol. 2007;395:17-34. doi: 10.1007/978-1-59745-514-5_2.
3
RNASeqBrowser: a genome browser for simultaneous visualization of raw strand specific RNAseq reads and UCSC genome browser custom tracks.RNA序列浏览器:一种用于同时可视化原始链特异性RNA序列读数和加州大学圣克鲁兹分校(UCSC)基因组浏览器自定义轨迹的基因组浏览器。
BMC Genomics. 2015 Mar 1;16(1):145. doi: 10.1186/s12864-015-1346-2.
4
The UCSC genome browser and associated tools.UCSC 基因组浏览器及相关工具。
Brief Bioinform. 2013 Mar;14(2):144-61. doi: 10.1093/bib/bbs038. Epub 2012 Aug 20.
5
The UCSC Genome Browser database: 2019 update.UCSC 基因组浏览器数据库:2019 年更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D853-D858. doi: 10.1093/nar/gky1095.
6
Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser.轨迹数据枢纽允许在 UCSC 基因组浏览器上可视化用户定义的全基因组注释。
Bioinformatics. 2014 Apr 1;30(7):1003-5. doi: 10.1093/bioinformatics/btt637. Epub 2013 Nov 13.
7
The UCSC Genome Browser Database: update 2009.加州大学圣克鲁兹分校基因组浏览器数据库:2009年更新
Nucleic Acids Res. 2009 Jan;37(Database issue):D755-61. doi: 10.1093/nar/gkn875. Epub 2008 Nov 7.
8
Visualizing Genomic Annotations with the UCSC Genome Browser.使用UCSC基因组浏览器可视化基因组注释。
Cold Spring Harb Protoc. 2016 Nov 1;2016(11):pdb.prot093062. doi: 10.1101/pdb.prot093062.
9
MakeHub: Fully Automated Generation of UCSC Genome Browser Assembly Hubs.MakeHub:全自动生成 UCSC 基因组浏览器组装中心。
Genomics Proteomics Bioinformatics. 2019 Oct;17(5):546-549. doi: 10.1016/j.gpb.2019.05.003. Epub 2020 Jan 28.
10
The UCSC Genome Browser database: extensions and updates 2013.UCSC 基因组浏览器数据库:扩展和更新 2013 年版
Nucleic Acids Res. 2013 Jan;41(Database issue):D64-9. doi: 10.1093/nar/gks1048. Epub 2012 Nov 15.

引用本文的文献

1
Co-profiling of in situ RNA-protein interactions and transcriptome in single cells and tissues.单细胞和组织中原位RNA-蛋白质相互作用与转录组的联合分析
Nat Methods. 2025 Aug 11. doi: 10.1038/s41592-025-02774-4.
2
Comprehensive fragmentation of cell-free repetitive DNA for enhanced cancer detection in plasma.用于增强血浆中癌症检测的游离重复DNA的全面片段化
Front Cell Dev Biol. 2025 Jul 9;13:1630231. doi: 10.3389/fcell.2025.1630231. eCollection 2025.
3
Anthracyclines induce global changes in cardiomyocyte chromatin accessibility that overlap with cardiovascular disease loci.

本文引用的文献

1
Transposable element expression in tumors is associated with immune infiltration and increased antigenicity.转座元件在肿瘤中的表达与免疫浸润和抗原性增加有关。
Nat Commun. 2019 Nov 19;10(1):5228. doi: 10.1038/s41467-019-13035-2.
2
Postmortem Cortex Samples Identify Distinct Molecular Subtypes of ALS: Retrotransposon Activation, Oxidative Stress, and Activated Glia.尸检大脑皮质样本鉴定出 ALS 的不同分子亚型:逆转录转座子激活、氧化应激和激活的神经胶质细胞。
Cell Rep. 2019 Oct 29;29(5):1164-1177.e5. doi: 10.1016/j.celrep.2019.09.066.
3
The UCSC Genome Browser database: 2019 update.
蒽环类药物可诱导心肌细胞染色质可及性发生全局性变化,这些变化与心血管疾病相关基因座重叠。
bioRxiv. 2025 Jun 16:2025.06.11.658997. doi: 10.1101/2025.06.11.658997.
4
Structural features of somatic and germline retrotransposition events in humans.人类体细胞和生殖系逆转录转座事件的结构特征。
Mob DNA. 2025 Apr 22;16(1):20. doi: 10.1186/s13100-025-00357-w.
5
Comprehensive analysis of H3K27me3 LOCKs under different DNA methylation contexts reveal epigenetic redistribution in tumorigenesis.对不同DNA甲基化背景下H3K27me3锁定区的综合分析揭示了肿瘤发生过程中的表观遗传重分布。
Epigenetics Chromatin. 2025 Jan 20;18(1):6. doi: 10.1186/s13072-025-00570-0.
6
Age, sex, and cell type-resolved hypothalamic gene expression across the pubertal transition in mice.年龄、性别和细胞类型解析的小鼠青春期过渡期间的下丘脑基因表达。
Biol Sex Differ. 2024 Oct 24;15(1):83. doi: 10.1186/s13293-024-00661-9.
7
Accelerating de novo SINE annotation in plant and animal genomes.加速动植物基因组中从头SINE注释
Mob DNA. 2024 Oct 19;15(1):24. doi: 10.1186/s13100-024-00331-y.
8
Abnormal activation of genomic LINE1 elements caused by DNA demethylation contributes to lncRNA CASC9 overexpression in esophageal squamous cell carcinoma.DNA去甲基化导致的基因组LINE1元件异常激活促进了食管鳞状细胞癌中lncRNA CASC9的过表达。
Heliyon. 2024 Jun 13;10(12):e32857. doi: 10.1016/j.heliyon.2024.e32857. eCollection 2024 Jun 30.
9
Reprogramming Chromosome Ends by Functional Histone Acetylation.通过功能化组蛋白乙酰化重编程染色体末端。
Int J Mol Sci. 2024 Mar 31;25(7):3898. doi: 10.3390/ijms25073898.
10
Correlation of Myeloid-Derived Suppressor Cell Expansion with Upregulated Transposable Elements in Severe COVID-19 Unveiled in Single-Cell RNA Sequencing Reanalysis.单细胞RNA测序再分析揭示严重COVID-19中髓源性抑制细胞扩增与转座元件上调的相关性
Biomedicines. 2024 Jan 29;12(2):315. doi: 10.3390/biomedicines12020315.
UCSC 基因组浏览器数据库:2019 年更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D853-D858. doi: 10.1093/nar/gky1095.
4
Computational tools to unmask transposable elements.用于揭示转座元件的计算工具。
Nat Rev Genet. 2018 Nov;19(11):688-704. doi: 10.1038/s41576-018-0050-x.
5
Transcription factor profiling reveals molecular choreography and key regulators of human retrotransposon expression.转录因子谱分析揭示了人类逆转录转座子表达的分子协调和关键调控因子。
Proc Natl Acad Sci U S A. 2018 Jun 12;115(24):E5526-E5535. doi: 10.1073/pnas.1722565115. Epub 2018 May 25.
6
The case for not masking away repetitive DNA.不掩盖重复DNA的理由。
Mob DNA. 2018 May 1;9:15. doi: 10.1186/s13100-018-0120-9. eCollection 2018.
7
The Neuronal Gene Arc Encodes a Repurposed Retrotransposon Gag Protein that Mediates Intercellular RNA Transfer.神经元基因Arc编码一种经过功能改造的逆转录转座子Gag蛋白,该蛋白介导细胞间RNA转移。
Cell. 2018 Mar 22;173(1):275. doi: 10.1016/j.cell.2018.03.024.
8
Spliced integrated retrotransposed element (SpIRE) formation in the human genome.人类基因组中拼接整合的反转录转座子(SpIRE)形成。
PLoS Biol. 2018 Mar 5;16(3):e2003067. doi: 10.1371/journal.pbio.2003067. eCollection 2018 Mar.
9
An ultra-fast and scalable quantification pipeline for transposable elements from next generation sequencing data.一种用于从下一代测序数据中对转座元件进行超快速且可扩展的定量流程。
Pac Symp Biocomput. 2018;23:168-179.
10
Systematic identification and characterization of regulatory elements derived from human endogenous retroviruses.对源自人类内源性逆转录病毒的调控元件进行系统鉴定和表征。
PLoS Genet. 2017 Jul 12;13(7):e1006883. doi: 10.1371/journal.pgen.1006883. eCollection 2017 Jul.