• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无比对信息的序列比对:优势、应用和工具。

Alignment-free sequence comparison: benefits, applications, and tools.

机构信息

Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University in Poznan, Umultowska 89, 61-614, Poznan, Poland.

IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001, Lisbon, Portugal.

出版信息

Genome Biol. 2017 Oct 3;18(1):186. doi: 10.1186/s13059-017-1319-7.

DOI:10.1186/s13059-017-1319-7
PMID:28974235
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5627421/
Abstract

Alignment-free sequence analyses have been applied to problems ranging from whole-genome phylogeny to the classification of protein families, identification of horizontally transferred genes, and detection of recombined sequences. The strength of these methods makes them particularly useful for next-generation sequencing data processing and analysis. However, many researchers are unclear about how these methods work, how they compare to alignment-based methods, and what their potential is for use for their research. We address these questions and provide a guide to the currently available alignment-free sequence analysis tools.

摘要

无比对序列分析方法已经被应用于从全基因组系统发生到蛋白质家族分类、水平基因转移的鉴定以及重组序列的检测等各种问题。这些方法的优势使得它们特别适用于下一代测序数据的处理和分析。然而,许多研究人员并不清楚这些方法的工作原理、它们与基于比对的方法相比的优劣,以及它们在研究中的潜在用途。我们解决了这些问题,并提供了一份当前可用的无比对序列分析工具指南。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/73b88787ea61/13059_2017_1319_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/fa011daf2f3b/13059_2017_1319_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/f720d289322d/13059_2017_1319_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/73b88787ea61/13059_2017_1319_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/fa011daf2f3b/13059_2017_1319_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/f720d289322d/13059_2017_1319_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eba/5627421/73b88787ea61/13059_2017_1319_Fig3_HTML.jpg

相似文献

1
Alignment-free sequence comparison: benefits, applications, and tools.无比对信息的序列比对:优势、应用和工具。
Genome Biol. 2017 Oct 3;18(1):186. doi: 10.1186/s13059-017-1319-7.
2
New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.无比对序列比较的新进展:度量、统计学与新一代测序
Brief Bioinform. 2014 May;15(3):343-53. doi: 10.1093/bib/bbt067. Epub 2013 Sep 23.
3
An overview of multiple sequence alignment.多序列比对概述。
Curr Protoc Bioinformatics. 2003 Nov;Chapter 3:3.7.1-3.7.26. doi: 10.1002/0471250953.bi0307s03.
4
Multiple Sequence Alignment.多序列比对
Methods Mol Biol. 2017;1525:167-189. doi: 10.1007/978-1-4939-6622-6_8.
5
Benchmarking of alignment-free sequence comparison methods.无比对信息的序列比较方法的基准测试。
Genome Biol. 2019 Jul 25;20(1):144. doi: 10.1186/s13059-019-1755-7.
6
Alignment-free sequence comparison-a review.无比对序列比较——综述
Bioinformatics. 2003 Mar 1;19(4):513-23. doi: 10.1093/bioinformatics/btg005.
7
Information theory applications for biological sequence analysis.信息论在生物序列分析中的应用。
Brief Bioinform. 2014 May;15(3):376-89. doi: 10.1093/bib/bbt068. Epub 2013 Sep 20.
8
Using MUMmer to identify similar regions in large sequence sets.使用MUMmer在大型序列集中识别相似区域。
Curr Protoc Bioinformatics. 2003 Feb;Chapter 10:Unit 10.3. doi: 10.1002/0471250953.bi1003s00.
9
Gap mapping: a paradigm for aligning two sequences.缺口定位:一种比对两个序列的范例。
Appl Bioinformatics. 2003;2(3 Suppl):S31-5.
10
A Review of Parallel Implementations for the Smith-Waterman Algorithm.《Smith-Waterman 算法的并行实现综述》。
Interdiscip Sci. 2022 Mar;14(1):1-14. doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.

引用本文的文献

1
Energy entropy vector: a novel approach for efficient microbial genomic sequence analysis and classification.能量熵向量:一种用于高效微生物基因组序列分析和分类的新方法。
Brief Bioinform. 2025 Sep 6;26(5). doi: 10.1093/bib/bbaf459.
2
CAKL: Commutative algebra k-mer learning of genomics.CAKL:基因组学的交换代数k-mer学习
ArXiv. 2025 Aug 13:arXiv:2508.09406v1.
3
Genomic organization, domain assortments, and nucleotide-binding domain diversity of NLR proteins in Sordariales fungi.粪壳菌纲真菌中NLR蛋白的基因组组织、结构域分类及核苷酸结合结构域多样性

本文引用的文献

1
Alignment-free inference of hierarchical and reticulate phylogenomic relationships.基于无比对的方法推断系统发生的分支和网状结构关系。
Brief Bioinform. 2019 Mar 22;20(2):426-435. doi: 10.1093/bib/bbx067.
2
A greedy alignment-free distance estimator for phylogenetic inference.一种用于系统发育推断的贪婪无比对距离估计器。
BMC Bioinformatics. 2017 Jun 7;18(Suppl 8):238. doi: 10.1186/s12859-017-1658-0.
3
FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads.FastGT:一种从原始测序读段中直接调用常见单核苷酸变异(SNVs)的无需比对方法。
PLoS Genet. 2025 Jul 7;21(7):e1011739. doi: 10.1371/journal.pgen.1011739. eCollection 2025 Jul.
4
New groups of highly divergent proteins in families as old as cellular life with important biological functions in the ocean.在与细胞生命一样古老的家族中,出现了新的高度分化的蛋白质群体,它们在海洋中具有重要的生物学功能。
Environ Microbiome. 2025 Jun 11;20(1):65. doi: 10.1186/s40793-025-00697-3.
5
SEQSIM: A novel bioinformatics tool for comparisons of promoter regions-a case study of calcium binding protein spermatid associated 1 (CABS1).SEQSIM:一种用于比较启动子区域的新型生物信息学工具——以钙结合蛋白精子细胞相关蛋白1(CABS1)为例的研究
BMC Bioinformatics. 2025 Jun 9;26(1):156. doi: 10.1186/s12859-025-06160-x.
6
Estimation of substitution and indel rates via -mer statistics.通过 - 聚体统计估计替换和插入缺失率。 (这里原文中的“ -mer”表述不完整,正常应该是如“k-mer”等具体形式,翻译时按照现有内容进行了直译)
bioRxiv. 2025 Jun 21:2025.05.14.653858. doi: 10.1101/2025.05.14.653858.
7
Conservation of regulatory elements with highly diverged sequences across large evolutionary distances.在大的进化距离上具有高度分化序列的调控元件的保守性。
Nat Genet. 2025 May 27. doi: 10.1038/s41588-025-02202-5.
8
Transmission pathways of Campylobacter jejuni between humans and livestock in rural Ethiopia are highly complex and interdependent.在埃塞俄比亚农村地区,空肠弯曲菌在人类和牲畜之间的传播途径极为复杂且相互依存。
Gut Pathog. 2025 May 3;17(1):26. doi: 10.1186/s13099-025-00691-7.
9
Automated classification of giant virus genomes using a random forest model built on trademark protein families.使用基于标志性蛋白质家族构建的随机森林模型对巨型病毒基因组进行自动分类。
Npj Viruses. 2024 Mar 8;2(1):9. doi: 10.1038/s44298-024-00021-9.
10
Alignment-free viral sequence classification at scale.大规模无比对病毒序列分类
BMC Genomics. 2025 Apr 18;26(1):389. doi: 10.1186/s12864-025-11554-5.
Sci Rep. 2017 May 31;7(1):2537. doi: 10.1038/s41598-017-02487-5.
4
StrainSeeker: fast identification of bacterial strains from raw sequencing reads using user-provided guide trees.菌株搜索器:利用用户提供的引导树从原始测序读数中快速鉴定细菌菌株。
PeerJ. 2017 May 18;5:e3353. doi: 10.7717/peerj.3353. eCollection 2017.
5
Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data.使用全转录本 RT-qPCR 表达数据对 RNA-seq 分析工作流程进行基准测试。
Sci Rep. 2017 May 8;7(1):1559. doi: 10.1038/s41598-017-01617-3.
6
CAFE: aCcelerated Alignment-FrEe sequence analysis.CAFE:加速无比对序列分析。
Nucleic Acids Res. 2017 Jul 3;45(W1):W554-W559. doi: 10.1093/nar/gkx351.
7
ChimeRScope: a novel alignment-free algorithm for fusion transcript prediction using paired-end RNA-Seq data.ChimeRScope:一种使用双端RNA测序数据进行融合转录本预测的新型无比对算法。
Nucleic Acids Res. 2017 Jul 27;45(13):e120. doi: 10.1093/nar/gkx315.
8
A coevolution analysis for identifying protein-protein interactions by Fourier transform.一种通过傅里叶变换识别蛋白质-蛋白质相互作用的协同进化分析。
PLoS One. 2017 Apr 21;12(4):e0174862. doi: 10.1371/journal.pone.0174862. eCollection 2017.
9
DLTree: efficient and accurate phylogeny reconstruction using the dynamical language method.DLTree:使用动态语言方法进行高效准确的系统发育重建。
Bioinformatics. 2017 Jul 15;33(14):2214-2215. doi: 10.1093/bioinformatics/btx158.
10
Comprehensive evaluation of RNA-seq quantification methods for linearity.RNA测序定量方法线性的综合评估
BMC Bioinformatics. 2017 Mar 22;18(Suppl 4):117. doi: 10.1186/s12859-017-1526-y.