• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
k-mer-based approaches to bridging pangenomics and population genetics.基于k-mer的方法在泛基因组学和群体遗传学之间架起桥梁。
ArXiv. 2024 Sep 18:arXiv:2409.11683v1.
2
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
3
Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis.用于宏基因组差异分析的k-mer谱适用性评估。
BMC Bioinformatics. 2016 Jan 16;17:38. doi: 10.1186/s12859-015-0875-7.
4
An alignment- and reference-free strategy using -mer present pattern for population genomic analyses.一种使用-mer呈现模式的无比对和无参考策略用于群体基因组分析。
Mycology. 2024 Jun 5;16(1):309-323. doi: 10.1080/21501203.2024.2358868. eCollection 2025.
5
Determining population structure from k-mer frequencies.从k-mer频率确定群体结构。
PeerJ. 2025 Mar 5;13:e18939. doi: 10.7717/peerj.18939. eCollection 2025.
6
SAKE: Strobemer-assisted k-mer extraction.SAKE:频闪辅助 k-mer 提取。
PLoS One. 2023 Nov 29;18(11):e0294415. doi: 10.1371/journal.pone.0294415. eCollection 2023.
7
A general near-exact k-mer counting method with low memory consumption enables de novo assembly of 106× human sequence data in 2.7 hours.一种通用的、近精确的低内存消耗 k-mer 计数方法,可在 2.7 小时内完成 106×人类序列数据的从头组装。
Bioinformatics. 2020 Dec 30;36(Suppl_2):i625-i633. doi: 10.1093/bioinformatics/btaa890.
8
Reference-free Association Mapping from Sequencing Reads Using k-mers.使用k-mer从测序读数中进行无参考关联映射。
Bio Protoc. 2020 Nov 5;10(21):e3815. doi: 10.21769/BioProtoc.3815.
9
Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms.优化从头转录组组装从高通量短读测序数据提高非模式生物的功能注释。
BMC Bioinformatics. 2012 Jul 18;13:170. doi: 10.1186/1471-2105-13-170.
10
A k-mer-based bulked segregant analysis approach to map seed traits in unphased heterozygous potato genomes.基于 k- -mer 的 bulked segregant 分析方法在未测序的杂合马铃薯基因组中定位种子性状。
G3 (Bethesda). 2024 Apr 3;14(4). doi: 10.1093/g3journal/jkae035.

基于k-mer的方法在泛基因组学和群体遗传学之间架起桥梁。

k-mer-based approaches to bridging pangenomics and population genetics.

作者信息

Roberts Miles D, Davis Olivia, Josephs Emily B, Williamson Robert J

出版信息

ArXiv. 2024 Sep 18:arXiv:2409.11683v1.

PMID:39398200
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11468241/
Abstract

Many commonly studied species now have more than one chromosome-scale genome assembly, revealing a large amount of genetic diversity previously missed by approaches that map short reads to a single reference. However, many species still lack multiple reference genomes and correctly aligning references to build pangenomes is challenging, limiting our ability to study this missing genomic variation in population genetics. Here, we argue that $k$-mers are a crucial stepping stone to bridging the reference-focused paradigms of population genetics with the reference-free paradigms of pangenomics. We review current literature on the uses of $k$-mers for performing three core components of most population genetics analyses: identifying, measuring, and explaining patterns of genetic variation. We also demonstrate how different $k$-mer-based measures of genetic variation behave in population genetic simulations according to the choice of $k$, depth of sequencing coverage, and degree of data compression. Overall, we find that $k$-mer-based measures of genetic diversity scale consistently with pairwise nucleotide diversity ($\pi$) up to values of about $\pi = 0.025$ ($R^2 = 0.97$) for neutrally evolving populations. For populations with even more variation, using shorter $k$-mers will maintain the scalability up to at least $\pi = 0.1$. Furthermore, in our simulated populations, $k$-mer dissimilarity values can be reliably approximated from counting bloom filters, highlighting a potential avenue to decreasing the memory burden of $k$-mer based genomic dissimilarity analyses. For future studies, there is a great opportunity to further develop methods to identifying selected loci using $k$-mers.

摘要

许多常见的研究物种现在有不止一个染色体水平的基因组组装,揭示了大量以前通过将短读长映射到单个参考基因组的方法所遗漏的遗传多样性。然而,许多物种仍然缺乏多个参考基因组,并且将参考基因组正确比对以构建泛基因组具有挑战性,这限制了我们在群体遗传学中研究这种缺失的基因组变异的能力。在这里,我们认为k-mer是连接群体遗传学中以参考为中心的范式与泛基因组学中无参考范式的关键垫脚石。我们回顾了当前关于使用k-mer进行大多数群体遗传学分析的三个核心组成部分的文献:识别、测量和解释遗传变异模式。我们还展示了根据k的选择、测序覆盖深度和数据压缩程度,不同的基于k-mer的遗传变异测量方法在群体遗传模拟中的表现。总体而言,我们发现对于中性进化的群体,基于k-mer的遗传多样性测量与成对核苷酸多样性(π)一致,直到π约为0.025(R² = 0.97)。对于变异更多的群体,使用更短的k-mer将至少保持可扩展性到π = 0.1。此外,在我们的模拟群体中,k-mer差异值可以通过计数布隆过滤器可靠地近似,这突出了减少基于k-mer的基因组差异分析的内存负担的潜在途径。对于未来的研究,有很大的机会进一步开发使用k-mer识别选择位点的方法。