• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

哺乳动物基因组中独特区域的检测与注释。

Detection and annotation of unique regions in mammalian genomes.

作者信息

Vieira Mourato Beatriz, Haubold Bernhard

机构信息

Research Group Bioinformatics, Max-Planck-Institute for Evolutionary Biology, August-Thienemann-Str. 2, Plön, Schleswig-Holstein 24306, Germany.

出版信息

G3 (Bethesda). 2025 Jan 8;15(1). doi: 10.1093/g3journal/jkae257.

DOI:10.1093/g3journal/jkae257
PMID:39503253
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11708210/
Abstract

Long unique genomic regions have been reported to be highly enriched for developmental genes in mice and humans. In this paper, we identify unique genomic regions using an efficient method based on fast string matching. We quantify the resource consumption and accuracy of this method before applying it to the genomes of 18 mammals. We annotate their unique regions (URs) of at least 10 kb and find that they are strongly enriched for developmental genes across the board. We then investigated the subset of URs that lack annotations, which we call "anonymous." The longest anonymous UR in the Tasmanian devil spanned 83 kb and contained the gene encoding inositol polyphosphate-5-phosphatase A, which is an essential part of intracellular signaling. This discovery of an essential gene in a UR implies that URs might be given priority when annotating mammalian genomes. Our documented pipeline for annotating URs in any mammalian genome is available from the repository github.com/evolbioinf/auger; the additional data for this study are available from the dataverse at doi.org/10.17617/3.4IKQAG.

摘要

据报道,在小鼠和人类中,长的独特基因组区域高度富集发育基因。在本文中,我们使用一种基于快速字符串匹配的有效方法来识别独特基因组区域。在将该方法应用于18种哺乳动物的基因组之前,我们对其资源消耗和准确性进行了量化。我们注释了它们至少10 kb的独特区域(URs),发现这些区域普遍强烈富集发育基因。然后,我们研究了缺乏注释的URs子集,我们将其称为“匿名”子集。袋獾中最长的匿名UR跨度为83 kb,包含编码肌醇多磷酸-5-磷酸酶A的基因,该基因是细胞内信号传导的重要组成部分。在一个UR中发现一个必需基因意味着在注释哺乳动物基因组时,URs可能应被优先考虑。我们记录的用于注释任何哺乳动物基因组中URs的流程可从github.com/evolbioinf/auger存储库获取;本研究的其他数据可从doi.org/10.17617/3.4IKQAG的数据存储库获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/2d8bcb7e486c/jkae257f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/1bded614a097/jkae257f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/d649f86664a3/jkae257f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/3cda82c08bea/jkae257f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/e56f6e55c49b/jkae257f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/c2e0b9514b73/jkae257f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/147529ec785d/jkae257f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/b97a9ce72832/jkae257f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/8b9a7f8637fa/jkae257f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/089364efab45/jkae257f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/2d8bcb7e486c/jkae257f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/1bded614a097/jkae257f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/d649f86664a3/jkae257f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/3cda82c08bea/jkae257f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/e56f6e55c49b/jkae257f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/c2e0b9514b73/jkae257f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/147529ec785d/jkae257f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/b97a9ce72832/jkae257f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/8b9a7f8637fa/jkae257f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/089364efab45/jkae257f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1f4/11708210/2d8bcb7e486c/jkae257f10.jpg

相似文献

1
Detection and annotation of unique regions in mammalian genomes.哺乳动物基因组中独特区域的检测与注释。
G3 (Bethesda). 2025 Jan 8;15(1). doi: 10.1093/g3journal/jkae257.
2
A genome alignment of 120 mammals highlights ultraconserved element variability and placenta-associated enhancers.120 种哺乳动物的基因组比对突出了超保守元件的可变性和胎盘相关增强子。
Gigascience. 2020 Jan 1;9(1). doi: 10.1093/gigascience/giz159.
3
Beav: a bacterial genome and mobile element annotation pipeline.Beav:细菌基因组和移动元件注释流水线。
mSphere. 2024 Aug 28;9(8):e0020924. doi: 10.1128/msphere.00209-24. Epub 2024 Jul 22.
4
High-complexity regions in mammalian genomes are enriched for developmental genes.哺乳动物基因组中的高复杂度区域富含发育基因。
Bioinformatics. 2019 Jun 1;35(11):1813-1819. doi: 10.1093/bioinformatics/bty922.
5
zDB: bacterial comparative genomics made easy.zDB:轻松进行细菌比较基因组学研究。
mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28.
6
Best genome sequencing strategies for annotation of complex immune gene families in wildlife.野生动物复杂免疫基因家族注释的最佳基因组测序策略。
Gigascience. 2022 Oct 30;11. doi: 10.1093/gigascience/giac100.
7
MicrobeAnnotator: a user-friendly, comprehensive functional annotation pipeline for microbial genomes.微生物注释器:一个用户友好、全面的微生物基因组功能注释管道。
BMC Bioinformatics. 2021 Jan 6;22(1):11. doi: 10.1186/s12859-020-03940-5.
8
Coding exon-structure aware realigner (CESAR) utilizes genome alignments for accurate comparative gene annotation.编码外显子结构感知重排器(CESAR)利用基因组比对进行准确的比较基因注释。
Nucleic Acids Res. 2016 Jun 20;44(11):e103. doi: 10.1093/nar/gkw210. Epub 2016 Mar 25.
9
A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.转座元件注释流水线和表达分析揭示微藻新月菱形藻中潜在活跃的元件。
BMC Genomics. 2018 May 22;19(1):378. doi: 10.1186/s12864-018-4763-1.
10
ISEScan: automated identification of insertion sequence elements in prokaryotic genomes.ISEScan:原核生物基因组中插入序列元件的自动识别。
Bioinformatics. 2017 Nov 1;33(21):3340-3347. doi: 10.1093/bioinformatics/btx433.

引用本文的文献

1
Fast detection of unique genomic regions.独特基因组区域的快速检测。
Comput Struct Biotechnol J. 2025 Feb 27;27:843-850. doi: 10.1016/j.csbj.2025.02.025. eCollection 2025.

本文引用的文献

1
eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale.eggNOG-mapper v2:宏基因组尺度的功能注释、直系同源物分配和结构域预测。
Mol Biol Evol. 2021 Dec 9;38(12):5825-5829. doi: 10.1093/molbev/msab293.
2
High-complexity regions in mammalian genomes are enriched for developmental genes.哺乳动物基因组中的高复杂度区域富含发育基因。
Bioinformatics. 2019 Jun 1;35(11):1813-1819. doi: 10.1093/bioinformatics/bty922.
3
MUMmer4: A fast and versatile genome alignment system.MUMmer4:一种快速且通用的基因组比对系统。
PLoS Comput Biol. 2018 Jan 26;14(1):e1005944. doi: 10.1371/journal.pcbi.1005944. eCollection 2018 Jan.
4
ModelFinder: fast model selection for accurate phylogenetic estimates.ModelFinder:用于准确系统发育估计的快速模型选择
Nat Methods. 2017 Jun;14(6):587-589. doi: 10.1038/nmeth.4285. Epub 2017 May 8.
5
Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies.准确的转座元件注释对于分析新的基因组组装至关重要。
Genome Biol Evol. 2016 Jan 21;8(2):403-10. doi: 10.1093/gbe/evw009.
6
The MEME Suite.MEME 套件。
Nucleic Acids Res. 2015 Jul 1;43(W1):W39-49. doi: 10.1093/nar/gkv416. Epub 2015 May 7.
7
Inositol 5-phosphatases: insights from the Lowe syndrome protein OCRL.肌醇 5-磷酸酶:Lowe 综合征蛋白 OCRL 的研究进展。
Trends Biochem Sci. 2012 Apr;37(4):134-43. doi: 10.1016/j.tibs.2012.01.002. Epub 2012 Feb 28.
8
CpG islands and the regulation of transcription.CpG 岛与转录调控。
Genes Dev. 2011 May 15;25(10):1010-22. doi: 10.1101/gad.2037511.
9
Functional relevance of CpG island length for regulation of gene expression.CpG 岛长度对基因表达调控的功能相关性。
Genetics. 2011 Apr;187(4):1077-83. doi: 10.1534/genetics.110.126094. Epub 2011 Feb 1.
10
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.