• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 k- -mer 高效关联映射-在寻找性别特异性序列中的应用。

Efficient association mapping from k-mers-An application in finding sex-specific sequences.

机构信息

Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh.

Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh.

出版信息

PLoS One. 2021 Jan 7;16(1):e0245058. doi: 10.1371/journal.pone.0245058. eCollection 2021.

DOI:10.1371/journal.pone.0245058
PMID:33411830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7790365/
Abstract

Genome wide association studies (GWAS) attempt to map genotypes to phenotypes in organisms. This is typically performed by genotyping individuals using microarray or by aligning whole genome sequencing reads to a reference genome. Both approaches require knowledge of a reference genome which hinders their application to organisms with no or incomplete reference genomes. This caveat can be removed by using alignment-free association mapping methods based on k-mers from sequencing reads. Here we present an improved implementation of an alignment free association mapping method. The new implementation is faster and includes additional features to make it more flexible than the original implementation. We have tested our implementation on an E. Coli ampicillin resistance dataset and observe improvement in execution time over the original implementation while maintaining accuracy in results. We also demonstrate that the method can be applied to find sex specific sequences.

摘要

全基因组关联研究(GWAS)试图在生物体内将基因型映射到表型。这通常通过使用微阵列对个体进行基因分型,或者通过将全基因组测序reads 与参考基因组对齐来完成。这两种方法都需要参考基因组的知识,这限制了它们在没有或不完全参考基因组的生物中的应用。通过使用基于测序reads 的 k-mer 的无比对关联映射方法,可以消除此限制。本文介绍了一种改进的无比对关联映射方法的实现。新的实现更快,并且包含更多的功能,使其比原始实现更灵活。我们已经在大肠杆菌氨苄青霉素抗性数据集上测试了我们的实现,并观察到执行时间相对于原始实现的提高,同时保持了结果的准确性。我们还证明该方法可用于查找性别特异性序列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/605a4412e5f8/pone.0245058.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/e51e23e6eb50/pone.0245058.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/f2db427a8af8/pone.0245058.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/39411af3560b/pone.0245058.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/abc3aa627e3a/pone.0245058.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/d06937d6cbe6/pone.0245058.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/605a4412e5f8/pone.0245058.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/e51e23e6eb50/pone.0245058.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/f2db427a8af8/pone.0245058.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/39411af3560b/pone.0245058.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/abc3aa627e3a/pone.0245058.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/d06937d6cbe6/pone.0245058.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9117/7790365/605a4412e5f8/pone.0245058.g006.jpg

相似文献

1
Efficient association mapping from k-mers-An application in finding sex-specific sequences.从 k- -mer 高效关联映射-在寻找性别特异性序列中的应用。
PLoS One. 2021 Jan 7;16(1):e0245058. doi: 10.1371/journal.pone.0245058. eCollection 2021.
2
Association mapping from sequencing reads using -mers.基于 -mers 的测序reads 的关联作图。
Elife. 2018 Jun 13;7:e32920. doi: 10.7554/eLife.32920.
3
Reference-free Association Mapping from Sequencing Reads Using k-mers.使用k-mer从测序读数中进行无参考关联映射。
Bio Protoc. 2020 Nov 5;10(21):e3815. doi: 10.21769/BioProtoc.3815.
4
Predicting bacterial resistance from whole-genome sequences using k-mers and stability selection.基于 k- -mer 和稳定性选择预测全基因组序列中的细菌耐药性。
BMC Bioinformatics. 2018 Oct 17;19(1):383. doi: 10.1186/s12859-018-2403-z.
5
A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events.一种快速且无偏倚的细菌全基因组关联研究方法:弥合 k- mers 与遗传事件之间的差距。
PLoS Genet. 2018 Nov 12;14(11):e1007758. doi: 10.1371/journal.pgen.1007758. eCollection 2018 Nov.
6
Assessing the impact of exact reads on reducing the error rate of read mapping.评估精确读取对降低读取映射错误率的影响。
BMC Bioinformatics. 2018 Nov 6;19(1):406. doi: 10.1186/s12859-018-2432-7.
7
Insights from genome-wide approaches to identify variants associated to phenotypes at pan-genome scale: Application to L. monocytogenes' ability to grow in cold conditions.从全基因组方法中获得的关于表型相关变体的见解,在泛基因组范围内:应用于单核细胞增生李斯特菌在低温条件下生长的能力。
Int J Food Microbiol. 2019 Feb 16;291:181-188. doi: 10.1016/j.ijfoodmicro.2018.11.028. Epub 2018 Nov 29.
8
Unique -mers as Strain-Specific Barcodes for Phylogenetic Analysis and Natural Microbiome Profiling.独特的单核苷酸多态性作为系统发育分析和自然微生物组分析的菌株特异性条码。
Int J Mol Sci. 2020 Jan 31;21(3):944. doi: 10.3390/ijms21030944.
9
SeqSero2: Rapid and Improved Serotype Determination Using Whole-Genome Sequencing Data.SeqSero2:利用全基因组测序数据进行快速且改进的血清型测定。
Appl Environ Microbiol. 2019 Nov 14;85(23). doi: 10.1128/AEM.01746-19. Print 2019 Dec 1.
10
Next generation sequencing reads comparison with an alignment-free distance.使用无比对距离的下一代测序读数比较
BMC Res Notes. 2014 Dec 3;7:869. doi: 10.1186/1756-0500-7-869.

引用本文的文献

1
Bacterial genome-wide association studies: exploring the genetic variation underlying bacterial phenotypes.细菌全基因组关联研究:探索细菌表型背后的遗传变异
Appl Environ Microbiol. 2025 Jun 18;91(6):e0251224. doi: 10.1128/aem.02512-24. Epub 2025 May 16.
2
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
3
Previously unmeasured genetic diversity explains part of Lewontin's paradox in a -mer-based meta-analysis of 112 plant species.

本文引用的文献

1
Identifying genetic variants underlying phenotypic variation in plants without complete genomes.鉴定没有完整基因组的植物表型变异的遗传变异。
Nat Genet. 2020 May;52(5):534-540. doi: 10.1038/s41588-020-0612-7. Epub 2020 Apr 13.
2
A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events.一种快速且无偏倚的细菌全基因组关联研究方法:弥合 k- mers 与遗传事件之间的差距。
PLoS Genet. 2018 Nov 12;14(11):e1007758. doi: 10.1371/journal.pgen.1007758. eCollection 2018 Nov.
3
Association mapping from sequencing reads using -mers.
在一项基于112种植物物种的-mer元分析中,先前未测量的遗传多样性解释了部分莱翁汀悖论。
bioRxiv. 2024 Sep 8:2024.05.17.594778. doi: 10.1101/2024.05.17.594778.
4
kGWASflow: a modular, flexible, and reproducible Snakemake workflow for k-mers-based GWAS.kGWASflow:一种基于 k-mer 的 GWAS 的模块化、灵活和可重复的 Snakemake 工作流程。
G3 (Bethesda). 2023 Dec 29;14(1). doi: 10.1093/g3journal/jkad246.
5
k mdiff, large-scale and user-friendly differential k-mer analyses.kmdiff,大规模且用户友好的差异 k-mer 分析。
Bioinformatics. 2022 Dec 13;38(24):5443-5445. doi: 10.1093/bioinformatics/btac689.
基于 -mers 的测序reads 的关联作图。
Elife. 2018 Jun 13;7:e32920. doi: 10.7554/eLife.32920.
4
Linear assembly of a human centromere on the Y chromosome.线性组装人类着丝粒于 Y 染色体上。
Nat Biotechnol. 2018 Apr;36(4):321-323. doi: 10.1038/nbt.4109. Epub 2018 Mar 19.
5
Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes.序列元件富集分析确定细菌表型的遗传基础。
Nat Commun. 2016 Sep 16;7:12797. doi: 10.1038/ncomms12797.
6
Identifying lineage effects when controlling for population structure improves power in bacterial association studies.在控制群体结构时识别谱系效应可提高细菌关联研究的效能。
Nat Microbiol. 2016 Apr 4;1:16041. doi: 10.1038/nmicrobiol.2016.41.
7
Genome-wide association study identifies vitamin B5 biosynthesis as a host specificity factor in Campylobacter.全基因组关联研究鉴定出维生素 B5 生物合成是弯曲菌属宿主特异性的一个因素。
Proc Natl Acad Sci U S A. 2013 Jul 16;110(29):11923-7. doi: 10.1073/pnas.1305559110. Epub 2013 Jul 1.
8
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
9
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.
10
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.一种快速、无锁的方法,用于高效并行计数 k-mer 的出现次数。
Bioinformatics. 2011 Mar 15;27(6):764-70. doi: 10.1093/bioinformatics/btr011. Epub 2011 Jan 7.