• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用McClintock 2对转座元件检测器进行可重复评估,可准确推断酵母中Ty插入模式。

Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.

作者信息

Chen Jingxuan, Basting Preston J, Han Shunhua, Garfinkel David J, Bergman Casey M

机构信息

Institute of Bioinformatics, University of Georgia, Athens, GA.

Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA.

出版信息

bioRxiv. 2023 Mar 21:2023.02.13.528343. doi: 10.1101/2023.02.13.528343.

DOI:10.1101/2023.02.13.528343
PMID:36824955
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9948991/
Abstract

BACKGROUND

Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors.

RESULTS

We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide a consistent and biologically meaningful view of non-reference TE insertions in a species-wide panel of ∼1000 yeast genomes, as evaluated by coverage-based abundance estimates and expected patterns of tRNA promoter targeting. Finally, we show that best-in-class predictors for yeast have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences first revealed experimentally for Ty1 to natural insertions and related copia-superfamily retrotransposons in yeast.

CONCLUSION

McClintock (https://github.com/bergmanlab/mcclintock/) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors for other species.

摘要

背景

已经开发了许多计算方法,用于使用短读长全基因组测序数据检测非参考转座元件(TE)插入。这些方法的多样性和复杂性常常给试图可重复地安装、执行或评估多个TE插入检测器的新用户带来挑战。

结果

我们之前开发了McClintock元管道,以方便六个第一代短读长TE检测器的安装、执行和评估。在此,我们报告了一个使用Snakemake和Conda以Python编写的完全重新实现的McClintock版本,它改进了安装、错误处理、速度、稳定性和可扩展性。McClintock 2现在包括12个短读长TE检测器、辅助预处理和分析模块、交互式HTML报告以及一个模拟框架,用于可重复地评估组件TE检测器的准确性。当应用于模式微生物真核生物酿酒酵母时,我们发现McClintock 2组件识别非参考TE插入精确位置的能力存在很大差异,RelocaTE2在模拟数据中显示出最高的召回率和精确率。我们发现,通过基于覆盖度的丰度估计和tRNA启动子靶向的预期模式评估,RelocaTE2、TEMP、TEMP2和TEBreak在约1000个酵母基因组的全物种面板中提供了关于非参考TE插入的一致且具有生物学意义的视图。最后,我们表明,酵母中的一流预测器具有足够的分辨率,可揭示酵母tRNA基因上游核小体结合区域中Ty1、Ty2和Ty4的二元整合模式,这使我们能够将最初通过实验揭示的关于Ty1的精细尺度靶标偏好的知识扩展到酵母中的自然插入和相关的copia超家族逆转座子。

结论

McClintock(https://github.com/bergmanlab/mcclintock/)提供了一个用户友好的管道,用于使用多个TE检测器在短读长全基因组测序数据中鉴定TE,这将使研究广泛不同生物体中TE插入变异的研究人员受益。将改进后的McClintock系统应用于模拟和实证酵母基因组数据,揭示了一流的方法以及对研究最广泛的模式真核生物之一的新生物学见解,并为评估和选择其他物种的非参考TE检测器提供了范例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/9e57a37ae34e/nihpp-2023.02.13.528343v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/8029c3dc6c26/nihpp-2023.02.13.528343v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/19b717a29346/nihpp-2023.02.13.528343v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/7593c91f30d7/nihpp-2023.02.13.528343v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/aba318f8afb0/nihpp-2023.02.13.528343v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/9e57a37ae34e/nihpp-2023.02.13.528343v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/8029c3dc6c26/nihpp-2023.02.13.528343v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/19b717a29346/nihpp-2023.02.13.528343v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/7593c91f30d7/nihpp-2023.02.13.528343v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/aba318f8afb0/nihpp-2023.02.13.528343v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/466f/10037708/9e57a37ae34e/nihpp-2023.02.13.528343v2-f0005.jpg

相似文献

1
Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.利用McClintock 2对转座元件检测器进行可重复评估,可准确推断酵母中Ty插入模式。
bioRxiv. 2023 Mar 21:2023.02.13.528343. doi: 10.1101/2023.02.13.528343.
2
Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.使用麦克林托克2对转座元件检测器进行可重复评估,可准确推断酵母中Ty插入模式。
Mob DNA. 2023 Jul 14;14(1):8. doi: 10.1186/s13100-023-00296-4.
3
McClintock: An Integrated Pipeline for Detecting Transposable Element Insertions in Whole-Genome Shotgun Sequencing Data.麦克林托克:一种用于在全基因组鸟枪法测序数据中检测转座元件插入的集成流程。
G3 (Bethesda). 2017 Aug 7;7(8):2763-2778. doi: 10.1534/g3.117.043893.
4
RelocaTE2: a high resolution transposable element insertion site mapping tool for population resequencing.RelocaTE2:一种用于群体重测序的高分辨率转座子插入位点定位工具。
PeerJ. 2017 Jan 26;5:e2942. doi: 10.7717/peerj.2942. eCollection 2017.
5
T-lex3: an accurate tool to genotype and estimate population frequencies of transposable elements using the latest short-read whole genome sequencing data.T-lex3:一种准确的工具,可使用最新的短读长全基因组测序数据对转座子进行基因分型和估计种群频率。
Bioinformatics. 2020 Feb 15;36(4):1191-1197. doi: 10.1093/bioinformatics/btz727.
6
Evolutionary genomics of transposable elements in Saccharomyces cerevisiae.酵母中转座元件的进化基因组学。
PLoS One. 2012;7(11):e50978. doi: 10.1371/journal.pone.0050978. Epub 2012 Nov 30.
7
Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence.转座元件与基因组组织:通过酿酒酵母全基因组序列揭示的逆转座子综合调查
Genome Res. 1998 May;8(5):464-78. doi: 10.1101/gr.8.5.464.
8
Targeted identification of TE insertions in a genome through hemi-specific PCR.通过半特异性聚合酶链反应对基因组中TE插入进行靶向鉴定。
Mob DNA. 2017 Jul 28;8:10. doi: 10.1186/s13100-017-0092-1. eCollection 2017.
9
HiTea: a computational pipeline to identify non-reference transposable element insertions in Hi-C data.HiTea:一种用于鉴定 Hi-C 数据中非参考转座元件插入的计算流程。
Bioinformatics. 2021 May 23;37(8):1045-1051. doi: 10.1093/bioinformatics/btaa923.
10
Ty1-copia elements reveal diverse insertion sites linked to polymorphisms among flax (Linum usitatissimum L.) accessions.Ty1-copia类元件揭示了亚麻(Linum usitatissimum L.)种质中与多态性相关的不同插入位点。
BMC Genomics. 2016 Dec 7;17(1):1002. doi: 10.1186/s12864-016-3337-3.

本文引用的文献

1
Paths to adaptation under fluctuating nitrogen starvation: The spectrum of adaptive mutations in Saccharomyces cerevisiae is shaped by retrotransposons and microhomology-mediated recombination.在氮饥饿波动条件下的适应途径:酿酒酵母适应突变的范围是由逆转录转座子和微同源介导的重组形成的。
PLoS Genet. 2023 May 16;19(5):e1010747. doi: 10.1371/journal.pgen.1010747. eCollection 2023 May.
2
Caffeine-tolerant mutations selected through an at-home yeast experimental evolution teaching lab.通过一个居家酵母实验进化教学实验室筛选出的咖啡因耐受性突变。
MicroPubl Biol. 2023 Feb 9;2023. doi: 10.17912/micropub.biology.000749. eCollection 2023.
3
Stage-specific transposon activity in the life cycle of the fairy-ring mushroom .
特定阶段转座子在仙女环蘑菇生命周期中的活性。
Proc Natl Acad Sci U S A. 2022 Nov 16;119(46):e2208575119. doi: 10.1073/pnas.2208575119. Epub 2022 Nov 7.
4
Multiple origins, one evolutionary trajectory: gradual evolution characterizes distinct lineages of allotetraploid Brachypodium.多种起源,一条进化轨迹:异源四倍体短柄草的不同谱系呈现渐进式进化特征。
Genetics. 2023 Feb 9;223(2). doi: 10.1093/genetics/iyac146.
5
yEvo: experimental evolution in high school classrooms selects for novel mutations that impact clotrimazole resistance in Saccharomyces cerevisiae.yEvo:高中课堂中的实验进化选择了影响酿酒酵母克霉唑抗性的新突变。
G3 (Bethesda). 2022 Nov 4;12(11). doi: 10.1093/g3journal/jkac246.
6
Local assembly of long reads enables phylogenomics of transposable elements in a polyploid cell line.长读段的局部组装使多倍体细胞系中转座元件的系统发生基因组学成为可能。
Nucleic Acids Res. 2022 Nov 28;50(21):e124. doi: 10.1093/nar/gkac794.
7
Bergerac strains of Caenorhabditis elegans revisited: expansion of Tc1 elements imposes a significant genomic and fitness cost.重新研究伯格氏线虫伯格氏株系:Tc1 元件的扩张带来了显著的基因组和适应代价。
G3 (Bethesda). 2022 Nov 4;12(11). doi: 10.1093/g3journal/jkac214.
8
Exploring transposable element-based markers to identify allelic variations underlying agronomic traits in rice.利用转座子标记探索水稻农艺性状相关等位基因变异。
Plant Commun. 2022 May 9;3(3):100270. doi: 10.1016/j.xplc.2021.100270. Epub 2021 Dec 20.
9
Ongoing transposition in cell culture reveals the phylogeny of diverse Drosophila S2 sublines.细胞培养中的持续转座揭示了不同 Drosophila S2 亚系的系统发育。
Genetics. 2022 Jul 4;221(3). doi: 10.1093/genetics/iyac077.
10
Epistatic selection on a selfish supergene - drive, recombination, and genetic load.上位性选择对自私的超级基因驱动、重组和遗传负荷的影响。
Elife. 2022 Apr 29;11:e78981. doi: 10.7554/eLife.78981.