• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 -mers 的测序reads 的关联作图。

Association mapping from sequencing reads using -mers.

机构信息

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, United States.

Department of Statistics, University of California, Berkeley, Berkeley, United States.

出版信息

Elife. 2018 Jun 13;7:e32920. doi: 10.7554/eLife.32920.

DOI:10.7554/eLife.32920
PMID:29897334
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6044908/
Abstract

Genome wide association studies (GWAS) rely on microarrays, or more recently mapping of sequencing reads, to genotype individuals. The reliance on prior sequencing of a reference genome limits the scope of association studies, and also precludes mapping associations outside of the reference. We present an alignment free method for association studies of categorical phenotypes based on counting [Formula: see text]-mers in whole-genome sequencing reads, testing for associations directly between [Formula: see text]-mers and the trait of interest, and local assembly of the statistically significant [Formula: see text]-mers to identify sequence differences. An analysis of the 1000 genomes data show that sequences identified by our method largely agree with results obtained using the standard approach. However, unlike standard GWAS, our method identifies associations with structural variations and sites not present in the reference genome. We also demonstrate that population stratification can be inferred from [Formula: see text]-mers. Finally, application to an dataset on ampicillin resistance validates the approach.

摘要

全基因组关联研究(GWAS)依赖于微阵列,或最近的测序读段映射,对个体进行基因分型。对参考基因组进行先前测序的依赖限制了关联研究的范围,并且也排除了参考之外的关联映射。我们提出了一种基于全基因组测序读段中计数 [Formula: see text]-mers 的分类表型关联研究的无比对方法,直接在 [Formula: see text]-mers 和感兴趣的性状之间进行关联测试,并对统计学上显著的 [Formula: see text]-mers 进行局部组装,以识别序列差异。对 1000 基因组数据的分析表明,我们的方法识别的序列在很大程度上与使用标准方法获得的结果一致。然而,与标准的 GWAS 不同,我们的方法可以识别与结构变异和参考基因组中不存在的位点相关的关联。我们还证明了可以从 [Formula: see text]-mers 推断出种群分层。最后,对氨苄青霉素抗性数据集的应用验证了该方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/208f7b67225b/elife-32920-app1-fig14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/4d80ec2ce17d/elife-32920-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/2fe9190c7355/elife-32920-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/54fd0980d904/elife-32920-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/76c747a3e6e9/elife-32920-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/efda84e33412/elife-32920-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/679aa51cb844/elife-32920-app1-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/e83a05555bc9/elife-32920-app1-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/208df219440e/elife-32920-app1-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/73631abb97bd/elife-32920-app1-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/48affcabb598/elife-32920-app1-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/843352618370/elife-32920-app1-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/cfff6ab7af73/elife-32920-app1-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/2e6c7ac84ebc/elife-32920-app1-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/e9f406ae9e54/elife-32920-app1-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/7d5af922440b/elife-32920-app1-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/a55e52197a08/elife-32920-app1-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/8b58fb17a0ef/elife-32920-app1-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/24973182f567/elife-32920-app1-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/208f7b67225b/elife-32920-app1-fig14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/4d80ec2ce17d/elife-32920-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/2fe9190c7355/elife-32920-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/54fd0980d904/elife-32920-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/76c747a3e6e9/elife-32920-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/efda84e33412/elife-32920-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/679aa51cb844/elife-32920-app1-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/e83a05555bc9/elife-32920-app1-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/208df219440e/elife-32920-app1-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/73631abb97bd/elife-32920-app1-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/48affcabb598/elife-32920-app1-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/843352618370/elife-32920-app1-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/cfff6ab7af73/elife-32920-app1-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/2e6c7ac84ebc/elife-32920-app1-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/e9f406ae9e54/elife-32920-app1-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/7d5af922440b/elife-32920-app1-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/a55e52197a08/elife-32920-app1-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/8b58fb17a0ef/elife-32920-app1-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/24973182f567/elife-32920-app1-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d8/6044908/208f7b67225b/elife-32920-app1-fig14.jpg

相似文献

1
Association mapping from sequencing reads using -mers.基于 -mers 的测序reads 的关联作图。
Elife. 2018 Jun 13;7:e32920. doi: 10.7554/eLife.32920.
2
Efficient association mapping from k-mers-An application in finding sex-specific sequences.从 k- -mer 高效关联映射-在寻找性别特异性序列中的应用。
PLoS One. 2021 Jan 7;16(1):e0245058. doi: 10.1371/journal.pone.0245058. eCollection 2021.
3
Reference-free Association Mapping from Sequencing Reads Using k-mers.使用k-mer从测序读数中进行无参考关联映射。
Bio Protoc. 2020 Nov 5;10(21):e3815. doi: 10.21769/BioProtoc.3815.
4
Efficient mapping of accurate long reads in minimizer space with mapquik.使用 mapquik 在 minimizer 空间中高效映射准确的长读段。
Genome Res. 2023 Jul;33(7):1188-1197. doi: 10.1101/gr.277679.123. Epub 2023 Jun 30.
5
Kmer2SNP: Reference-Free Heterozygous SNP Calling Using k-mer Frequency Distributions.Kmer2SNP:基于 k-mer 频率分布的无参考杂合 SNP 调用。
Methods Mol Biol. 2022;2493:257-265. doi: 10.1007/978-1-0716-2293-3_16.
6
Next generation sequencing reads comparison with an alignment-free distance.使用无比对距离的下一代测序读数比较
BMC Res Notes. 2014 Dec 3;7:869. doi: 10.1186/1756-0500-7-869.
7
Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.使用 FlexTyper 对索引短读取进行灵活序列查询的实用性展示。
PLoS Comput Biol. 2021 Mar 22;17(3):e1008815. doi: 10.1371/journal.pcbi.1008815. eCollection 2021 Mar.
8
Identifying genetic variants underlying phenotypic variation in plants without complete genomes.鉴定没有完整基因组的植物表型变异的遗传变异。
Nat Genet. 2020 May;52(5):534-540. doi: 10.1038/s41588-020-0612-7. Epub 2020 Apr 13.
9
FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads.FastGT:一种从原始测序读段中直接调用常见单核苷酸变异(SNVs)的无需比对方法。
Sci Rep. 2017 May 31;7(1):2537. doi: 10.1038/s41598-017-02487-5.
10
KATK: Fast genotyping of rare variants directly from unmapped sequencing reads.KATK:直接从未映射测序reads 中快速进行稀有变体的基因分型。
Hum Mutat. 2021 Jun;42(6):777-786. doi: 10.1002/humu.24197. Epub 2021 Apr 1.

引用本文的文献

1
Genome-wide association analysis provides insights into the genomics and extracellular expression of proteases.全基因组关联分析为蛋白酶的基因组学和细胞外表达提供了见解。
Virulence. 2025 Dec;16(1):2543116. doi: 10.1080/21505594.2025.2543116. Epub 2025 Aug 14.
2
Pseudoassembly of k-mers.k-mer的伪组装
bioRxiv. 2025 May 13:2025.05.11.653354. doi: 10.1101/2025.05.11.653354.
3
Bacterial genome-wide association studies: exploring the genetic variation underlying bacterial phenotypes.细菌全基因组关联研究:探索细菌表型背后的遗传变异

本文引用的文献

1
Visualizing the geography of genetic variants.可视化基因变异的分布情况。
Bioinformatics. 2017 Feb 15;33(4):594-595. doi: 10.1093/bioinformatics/btw643.
2
Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes.序列元件富集分析确定细菌表型的遗传基础。
Nat Commun. 2016 Sep 16;7:12797. doi: 10.1038/ncomms12797.
3
Identifying lineage effects when controlling for population structure improves power in bacterial association studies.在控制群体结构时识别谱系效应可提高细菌关联研究的效能。
Appl Environ Microbiol. 2025 Jun 18;91(6):e0251224. doi: 10.1128/aem.02512-24. Epub 2025 May 16.
4
Extreme heterochiasmy and high rates of sex-reversed recombination result in large yet homomorphic sex chromosomes in the Emei moustache toad.极端的异态性和高频率的性反转重组导致峨眉髭蟾拥有大而形态相似的性染色体。
Genome Res. 2025 Jun 2;35(6):1325-1336. doi: 10.1101/gr.280161.124.
5
Hiding in plain sight: the Y chromosome and its reinvigorated role in evolutionary processes.隐匿于众目睽睽之下:Y染色体及其在进化过程中焕发出的新作用。
Evol Lett. 2024 Nov 27;9(2):165-171. doi: 10.1093/evlett/qrae065. eCollection 2025 Apr.
6
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
7
An alignment- and reference-free strategy using -mer present pattern for population genomic analyses.一种使用-mer呈现模式的无比对和无参考策略用于群体基因组分析。
Mycology. 2024 Jun 5;16(1):309-323. doi: 10.1080/21501203.2024.2358868. eCollection 2025.
8
Determining population structure from k-mer frequencies.从k-mer频率确定群体结构。
PeerJ. 2025 Mar 5;13:e18939. doi: 10.7717/peerj.18939. eCollection 2025.
9
SNP-Based and Kmer-Based eQTL Analysis Using Transcriptome Data.使用转录组数据的基于单核苷酸多态性(SNP)和基于k-mer的表达数量性状基因座(eQTL)分析
Animals (Basel). 2024 Oct 11;14(20):2941. doi: 10.3390/ani14202941.
10
Simple and accurate genomic classification model for distinguishing between human and pig Staphylococcus aureus.用于区分人源和猪源金黄色葡萄球菌的简单且准确的基因组分类模型。
Commun Biol. 2024 Sep 18;7(1):1171. doi: 10.1038/s42003-024-06883-2.
Nat Microbiol. 2016 Apr 4;1:16041. doi: 10.1038/nmicrobiol.2016.41.
4
WASP: allele-specific software for robust molecular quantitative trait locus discovery.WASP:用于可靠分子数量性状基因座发现的等位基因特异性软件。
Nat Methods. 2015 Nov;12(11):1061-3. doi: 10.1038/nmeth.3582. Epub 2015 Sep 14.
5
Integrated allelic, transcriptional, and phenomic dissection of the cardiac effects of titin truncations in health and disease.健康与疾病状态下肌联蛋白截短对心脏影响的等位基因、转录组及表型组综合剖析
Sci Transl Med. 2015 Jan 14;7(270):270ra6. doi: 10.1126/scitranslmed.3010134.
6
Error-prone polymerase activity causes multinucleotide mutations in humans.易出错的聚合酶活性会导致人类出现多核苷酸突变。
Genome Res. 2014 Sep;24(9):1445-54. doi: 10.1101/gr.170696.113. Epub 2014 Jul 30.
7
Genomic characterization of large heterochromatic gaps in the human genome assembly.人类基因组组装中大型异染色质间隙的基因组特征分析。
PLoS Comput Biol. 2014 May 15;10(5):e1003628. doi: 10.1371/journal.pcbi.1003628. eCollection 2014 May.
8
Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms.旗鱼能够使用轻量级算法从RNA测序读段中进行无比对的异构体定量分析。
Nat Biotechnol. 2014 May;32(5):462-4. doi: 10.1038/nbt.2862. Epub 2014 Apr 20.
9
Genome-wide analysis of wild-type Epstein-Barr virus genomes derived from healthy individuals of the 1,000 Genomes Project.对来自千人基因组计划中健康个体的野生型爱泼斯坦-巴尔病毒基因组进行全基因组分析。
Genome Biol Evol. 2014 Apr;6(4):846-60. doi: 10.1093/gbe/evu054.
10
Titin gene mutations are common in families with both peripartum cardiomyopathy and dilated cardiomyopathy.肌联蛋白基因突变常见于围生期心肌病和扩张型心肌病的家族中。
Eur Heart J. 2014 Aug 21;35(32):2165-73. doi: 10.1093/eurheartj/ehu050. Epub 2014 Feb 20.