• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Skmer:使用基因组草图进行无组装和无比对的样本识别。

Skmer: assembly-free and alignment-free sample identification using genome skims.

机构信息

Department of Electrical & Computer Engineering, University of California, San Diego, La Jolla, 92093, CA, USA.

Evolutionary Genomics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark.

出版信息

Genome Biol. 2019 Feb 13;20(1):34. doi: 10.1186/s13059-019-1632-4.

DOI:10.1186/s13059-019-1632-4
PMID:30760303
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6374904/
Abstract

The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate and biodiversity changes. The recent genome-skimming approach extends current barcoding practices beyond short markers by applying low-pass sequencing and recovering whole organelle genomes computationally. This approach discards the nuclear DNA, which constitutes the vast majority of the data. In contrast, we suggest using all unassembled reads. We introduce an assembly-free and alignment-free tool, Skmer, to compute genomic distances between the query and reference genome skims. Skmer shows excellent accuracy in estimating distances and identifying the closest match in reference datasets.

摘要

在快速的气候和生物多样性变化时代,能够廉价地描述分类多样性至关重要。最近的基因组掠过方法通过应用低通测序和计算恢复整个细胞器基因组,将当前的条形码实践扩展到短标记之外。这种方法丢弃了构成数据绝大多数的核 DNA。相比之下,我们建议使用所有未组装的读取。我们引入了一种无组装和无比对的工具 Skmer,用于计算查询和参考基因组掠过之间的基因组距离。Skmer 在估计距离和识别参考数据集的最近匹配方面表现出出色的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/680b4f04f692/13059_2019_1632_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/4d0ba9470106/13059_2019_1632_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/122fb0b3f96b/13059_2019_1632_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/3dd6bc6b88e9/13059_2019_1632_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/655610a767ec/13059_2019_1632_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/f0d7caf87640/13059_2019_1632_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/a035871d5809/13059_2019_1632_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/e4d167334b12/13059_2019_1632_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/156c9780f8d3/13059_2019_1632_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/680b4f04f692/13059_2019_1632_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/4d0ba9470106/13059_2019_1632_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/122fb0b3f96b/13059_2019_1632_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/3dd6bc6b88e9/13059_2019_1632_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/655610a767ec/13059_2019_1632_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/f0d7caf87640/13059_2019_1632_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/a035871d5809/13059_2019_1632_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/e4d167334b12/13059_2019_1632_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/156c9780f8d3/13059_2019_1632_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3b/6374904/680b4f04f692/13059_2019_1632_Fig9_HTML.jpg

相似文献

1
Skmer: assembly-free and alignment-free sample identification using genome skims.Skmer:使用基因组草图进行无组装和无比对的样本识别。
Genome Biol. 2019 Feb 13;20(1):34. doi: 10.1186/s13059-019-1632-4.
2
Testing Efficacy of Assembly-Free and Alignment-Free Methods for Species Identification Using Genome Skims, with Patellogastropoda as a Test Case.利用基因组草图,免组装和免比对方法对物种鉴定的功效测试,以帽贝形腹足纲软体动物作为测试案例。
Genes (Basel). 2022 Jul 2;13(7):1192. doi: 10.3390/genes13071192.
3
Analyses of Nuclear Reads Obtained Using Genome Skimming.基于基因组重测序的核型分析。
Methods Mol Biol. 2024;2744:247-265. doi: 10.1007/978-1-0716-3581-0_16.
4
Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification.超越 DNA 条形码:基因组 skimming 数据在样本鉴定中的未实现潜力。
Mol Ecol. 2020 Jul;29(14):2521-2534. doi: 10.1111/mec.15507. Epub 2020 Jun 29.
5
The impact of contaminants on the accuracy of genome skimming and the effectiveness of exclusion read filters.污染物对基因组掠取准确性和排除读取过滤器有效性的影响。
Mol Ecol Resour. 2020 May;20(3). doi: 10.1111/1755-0998.13135. Epub 2020 Feb 4.
6
Skimming for barcodes: rapid production of mitochondrial genome and nuclear ribosomal repeat reference markers through shallow shotgun sequencing.条码扫描:通过浅层鸟枪法测序快速生成线粒体基因组和核核糖体重复参考标记。
PeerJ. 2022 Aug 5;10:e13790. doi: 10.7717/peerj.13790. eCollection 2022.
7
DNA barcoding of Cymbidium by genome skimming: Call for next-generation nuclear barcodes.通过基因组浅层测序对大花蕙兰进行DNA条形码分析:呼吁开发新一代核条形码。
Mol Ecol Resour. 2023 Feb;23(2):424-439. doi: 10.1111/1755-0998.13719. Epub 2022 Nov 4.
8
CONSULT: accurate contamination removal using locality-sensitive hashing.咨询:使用局部敏感哈希进行精确的污染去除。
NAR Genom Bioinform. 2021 Aug 5;3(3):lqab071. doi: 10.1093/nargab/lqab071. eCollection 2021 Sep.
9
The use of Oxford Nanopore native barcoding for complete genome assembly.使用牛津纳米孔原生条形码技术进行全基因组组装。
Gigascience. 2017 Mar 1;6(3):1-6. doi: 10.1093/gigascience/gix001.
10
An assembly and alignment-free method of phylogeny reconstruction from next-generation sequencing data.一种从下一代测序数据中重建系统发育树的无需组装和比对的方法。
BMC Genomics. 2015 Jul 14;16(1):522. doi: 10.1186/s12864-015-1647-5.

引用本文的文献

1
Inter-Island Whole-Genome Comparison Reveals Micro-Evolutionary Dynamics of the Red Fox, Stimulated Through Post-Glacial Sea-Level Alterations.岛屿间全基因组比较揭示了红狐的微进化动态,这种动态受到冰后期海平面变化的刺激。
Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf152.
2
A k-mer-based estimator of the substitution rate between repetitive sequences.一种基于k-mer的重复序列间替换率估计方法。
bioRxiv. 2025 Jun 25:2025.06.19.660607. doi: 10.1101/2025.06.19.660607.
3
Further investigation of (Poaceae, Bambusoideae) in China based on Skmer analysis and morphology.

本文引用的文献

1
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.高通量 ANI 分析 9 万余组原核基因组揭示了清晰的物种界限。
Nat Commun. 2018 Nov 30;9(1):5114. doi: 10.1038/s41467-018-07641-9.
2
fastp: an ultra-fast all-in-one FASTQ preprocessor.fastp:一个超快速的一体化 FASTQ 预处理程序。
Bioinformatics. 2018 Sep 1;34(17):i884-i890. doi: 10.1093/bioinformatics/bty560.
3
Highly Contiguous Genome Assemblies of 15 Species Generated Using Nanopore Sequencing.使用纳米孔测序生成的15个物种的高度连续基因组组装
基于Skmer分析和形态学对中国(禾本科,竹亚科)的进一步研究。
PhytoKeys. 2025 Jul 2;259:177-198. doi: 10.3897/phytokeys.259.151683. eCollection 2025.
4
A composite universal DNA signature for the tree of life.一种用于生命之树的复合通用DNA特征。
Nat Ecol Evol. 2025 Jun 25. doi: 10.1038/s41559-025-02752-1.
5
PISAD: reference-free intraspecies sample anomalies detection tool based on k-mer counting.PISAD:基于k-mer计数的无参考种内样本异常检测工具。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf061.
6
EvANI benchmarking workflow for evolutionary distance estimation.用于进化距离估计的EvANI基准测试工作流程。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf267.
7
Estimation of substitution and indel rates via -mer statistics.通过 - 聚体统计估计替换和插入缺失率。 (这里原文中的“ -mer”表述不完整,正常应该是如“k-mer”等具体形式,翻译时按照现有内容进行了直译)
bioRxiv. 2025 Jun 21:2025.05.14.653858. doi: 10.1101/2025.05.14.653858.
8
No evidence of transposable element bursts in the Galápagos Scalesia adaptive radiation despite hybridization, diversification and ecological niche shifts.尽管存在杂交、多样化和生态位转移现象,但在加拉帕戈斯斯氏菊属植物的适应性辐射中没有转座元件爆发的证据。
Mob DNA. 2025 May 31;16(1):23. doi: 10.1186/s13100-025-00362-z.
9
A curated benchmark dataset for molecular identification based on genome skimming.一个基于基因组浅层测序的用于分子鉴定的精选基准数据集。
Sci Data. 2025 May 29;12(1):906. doi: 10.1038/s41597-025-05230-2.
10
Genomic DNA barcodes provide novel insights into species delimitation in the complex Camellia sect. Thea (Theaceae).基因组DNA条形码为复杂的茶组(山茶科)中的物种界定提供了新的见解。
BMC Plant Biol. 2025 May 1;25(1):570. doi: 10.1186/s12870-025-06612-9.
G3 (Bethesda). 2018 Oct 3;8(10):3131-3141. doi: 10.1534/g3.118.200160.
4
Phylogeny reconstruction based on the length distribution of -mismatch common substrings.基于错配公共子串长度分布的系统发育重建。
Algorithms Mol Biol. 2017 Dec 11;12:27. doi: 10.1186/s13015-017-0118-8. eCollection 2017.
5
A new species of (Annelida, Eunicida, Dorvilleidae) from hydrothermal vents on the Southwest Indian Ridge.来自西南印度洋中脊热液喷口的一种新的(环节动物门,矶沙蚕目,多毛纲)物种。
Zookeys. 2017 Aug 1(687):1-9. doi: 10.3897/zookeys.687.13046. eCollection 2017.
6
Alignment-free sequence comparison: benefits, applications, and tools.无比对信息的序列比对:优势、应用和工具。
Genome Biol. 2017 Oct 3;18(1):186. doi: 10.1186/s13059-017-1319-7.
7
What can we do with 1000 plastid genomes?我们能用1000个质体基因组做什么?
Plant J. 2017 May;90(4):808-818. doi: 10.1111/tpj.13491. Epub 2017 Mar 30.
8
Fast and accurate phylogeny reconstruction using filtered spaced-word matches.使用过滤后的间隔词匹配进行快速准确的系统发育重建。
Bioinformatics. 2017 Apr 1;33(7):971-979. doi: 10.1093/bioinformatics/btw776.
9
Information-optimal genome assembly via sparse read-overlap graphs.通过稀疏读段重叠图实现信息最优的基因组组装
Bioinformatics. 2016 Sep 1;32(17):i494-i502. doi: 10.1093/bioinformatics/btw450.
10
Comparative evaluation of the genomes of three common Drosophila-associated bacteria.三种常见果蝇相关细菌基因组的比较评估
Biol Open. 2016 Sep 15;5(9):1305-16. doi: 10.1242/bio.017673.