• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无比对群体基因组学:一种高效估计序列多样性的方法。

Alignment-free population genomics: an efficient estimator of sequence diversity.

机构信息

Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany.

出版信息

G3 (Bethesda). 2012 Aug;2(8):883-9. doi: 10.1534/g3.112.002527. Epub 2012 Aug 1.

DOI:10.1534/g3.112.002527
PMID:22908037
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3411244/
Abstract

Comparative sequencing contributes critically to the functional annotation of genomes. One prerequisite for successful analysis of the increasingly abundant comparative sequencing data is the availability of efficient computational tools. We present here a strategy for comparing unaligned genomes based on a coalescent approach combined with advanced algorithms for indexing sequences. These algorithms are particularly efficient when analyzing large genomes, as their run time ideally grows only linearly with sequence length. Using this approach, we have derived and implemented a maximum-likelihood estimator of the average number of mismatches per site between two closely related sequences, π. By allowing for fluctuating coalescent times, we are able to improve a previously published alignment-free estimator of π. We show through simulation that our new estimator is fast and accurate even with moderate recombination (ρ ≤ π). To demonstrate its applicability to real data, we compare the unaligned genomes of Drosophila persimilis and D. pseudoobscura. In agreement with previous studies, our sliding window analysis locates the global divergence minimum between these two genomes to the pericentromeric region of chromosome 3.

摘要

比较测序对基因组的功能注释至关重要。成功分析日益丰富的比较测序数据的一个前提条件是拥有高效的计算工具。我们在此提出了一种基于合并方法并结合高级序列索引算法的未对齐基因组比较策略。当分析大型基因组时,这些算法特别有效,因为它们的运行时间理想情况下仅随序列长度呈线性增长。使用这种方法,我们推导出并实现了两个密切相关序列之间每个位置的平均错配数 π 的最大似然估计值。通过允许合并时间波动,我们能够改进以前发布的基于无比对的 π 估计值。通过模拟,我们表明,即使存在适度的重组(ρ ≤ π),我们的新估计值也快速且准确。为了证明它在实际数据中的适用性,我们比较了黑腹果蝇和拟暗果蝇的未对齐基因组。与先前的研究一致,我们的滑动窗口分析将这两个基因组之间的全局分歧最小值定位在染色体 3 的着丝粒区域。

相似文献

1
Alignment-free population genomics: an efficient estimator of sequence diversity.无比对群体基因组学:一种高效估计序列多样性的方法。
G3 (Bethesda). 2012 Aug;2(8):883-9. doi: 10.1534/g3.112.002527. Epub 2012 Aug 1.
2
Alignment-free estimation of nucleotide diversity.无比对核苷酸多样性估计。
Bioinformatics. 2011 Feb 15;27(4):449-55. doi: 10.1093/bioinformatics/btq689. Epub 2010 Dec 14.
3
Estimating mutation distances from unaligned genomes.从未比对的基因组估计突变距离。
J Comput Biol. 2009 Oct;16(10):1487-500. doi: 10.1089/cmb.2009.0106.
4
Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.果蝇威氏果蝇基因组中转座子伽利略的结构和序列多样性。
BMC Genomics. 2014 Sep 13;15(1):792. doi: 10.1186/1471-2164-15-792.
5
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.使用准比对快速发现和可视化 DNA 序列中的保守区域。
BMC Bioinformatics. 2013;14 Suppl 11(Suppl 11):S2. doi: 10.1186/1471-2105-14-S11-S2. Epub 2013 Sep 13.
6
Parametric alignment of Drosophila genomes.果蝇基因组的参数比对
PLoS Comput Biol. 2006 Jun 23;2(6):e73. doi: 10.1371/journal.pcbi.0020073.
7
The genomics of speciation in Drosophila: diversity, divergence, and introgression estimated using low-coverage genome sequencing.果蝇物种形成的基因组学:利用低覆盖度基因组测序估算多样性、分化及基因渗入
PLoS Genet. 2009 Jul;5(7):e1000550. doi: 10.1371/journal.pgen.1000550. Epub 2009 Jul 3.
8
Highly Contiguous Genome Assemblies of 15 Species Generated Using Nanopore Sequencing.使用纳米孔测序生成的15个物种的高度连续基因组组装
G3 (Bethesda). 2018 Oct 3;8(10):3131-3141. doi: 10.1534/g3.118.200160.
9
The speciation history of Drosophila pseudoobscura and close relatives: inferences from DNA sequence variation at the period locus.拟暗果蝇及其近缘种的物种形成历史:基于周期基因座DNA序列变异的推断
Genetics. 1996 Nov;144(3):1113-26. doi: 10.1093/genetics/144.3.1113.
10
Prediction of similarly acting cis-regulatory modules by subsequence profiling and comparative genomics in Drosophila melanogaster and D.pseudoobscura.通过序列分析和比较基因组学预测黑腹果蝇和拟暗果蝇中具有相似作用的顺式调控模块
Bioinformatics. 2004 Nov 1;20(16):2738-50. doi: 10.1093/bioinformatics/bth320. Epub 2004 May 14.

引用本文的文献

1
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
2
DNA-protein quasi-mapping for rapid differential gene expression analysis in non-model organisms.用于非模式生物中快速差异基因表达分析的 DNA-蛋白质拟作图。
BMC Bioinformatics. 2024 Oct 24;25(Suppl 2):335. doi: 10.1186/s12859-024-05924-1.
3
Inferring phylogenies of evolving sequences without multiple sequence alignment.无需多序列比对推断进化序列的系统发育树。

本文引用的文献

1
Alignment-free detection of horizontal gene transfer between closely related bacterial genomes.密切相关细菌基因组间水平基因转移的无比对检测
Mob Genet Elements. 2011 Sep;1(3):230-235. doi: 10.4161/mge.1.3.18065. Epub 2011 Sep 1.
2
Alignment-free detection of local similarity among viral and bacterial genomes.基于比对的病毒和细菌基因组之间局部相似性的检测。
Bioinformatics. 2011 Jun 1;27(11):1466-72. doi: 10.1093/bioinformatics/btr176. Epub 2011 Apr 6.
3
Alignment-free estimation of nucleotide diversity.无比对核苷酸多样性估计。
Sci Rep. 2014 Sep 30;4:6504. doi: 10.1038/srep06504.
4
Whole genome phylogeny for 21 Drosophila species using predicted 2b-RAD fragments.利用预测的 2b-RAD 片段对 21 种果蝇进行全基因组系统发育分析。
PeerJ. 2013 Dec 23;1:e226. doi: 10.7717/peerj.226.
5
An alignment-free test for recombination.无比对重组测试。
Bioinformatics. 2013 Dec 15;29(24):3121-7. doi: 10.1093/bioinformatics/btt550. Epub 2013 Sep 23.
Bioinformatics. 2011 Feb 15;27(4):449-55. doi: 10.1093/bioinformatics/btq689. Epub 2010 Dec 14.
4
Alignment-free sequence comparison (II): theoretical power of comparison statistics.无比对序列比较(II):比较统计量的理论功效
J Comput Biol. 2010 Nov;17(11):1467-90. doi: 10.1089/cmb.2010.0056. Epub 2010 Oct 25.
5
Alignment-free sequence comparison (I): statistics and power.无比对序列比较(I):统计学与效能
J Comput Biol. 2009 Dec;16(12):1615-34. doi: 10.1089/cmb.2009.0198.
6
Efficient estimation of pairwise distances between genomes.高效估计基因组之间的成对距离。
Bioinformatics. 2009 Dec 15;25(24):3221-7. doi: 10.1093/bioinformatics/btp590. Epub 2009 Oct 13.
7
Estimating mutation distances from unaligned genomes.从未比对的基因组估计突变距离。
J Comput Biol. 2009 Oct;16(10):1487-500. doi: 10.1089/cmb.2009.0106.
8
SOAP2: an improved ultrafast tool for short read alignment.SOAP2:一种用于短读序列比对的改进型超快速工具。
Bioinformatics. 2009 Aug 1;25(15):1966-7. doi: 10.1093/bioinformatics/btp336. Epub 2009 Jun 3.
9
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
10
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.