SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences.

Authors

Pickett B D, Karlinsey S M, Penrod C E, Cormier M J, Ebbert M T W, Shiozawa D K, Whipple C J, Ridge P G

Affiliation

Department of Biology, Brigham Young University, Provo, UT 84602, USA.

Publication

Bioinformatics. 2016 Sep 1;32(17):2707-9. doi: 10.1093/bioinformatics/btw298. Epub 2016 May 11.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5013907/

Abstract

Simple sequence repeats (SSRs) are used to address research questions in a variety of fields (e.g. population genetics, phylogenetics and forensics) because of their high mutability within and between species. Here, we present an innovative algorithm, SA-SSR, based on suffix and longest common prefix arrays for efficiently detecting SSRs in large sets of sequences. Existing SSR detection applications are hampered by one or more limitations (e.g. speed, accuracy or ease of use). Our algorithm addresses these challenges while being the most comprehensive and correct SSR detection software available. SA-SSR is 100% accurate and detected >1000 more SSRs than the second best algorithm, while offering greater control to the user than any existing software.
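
To make the idea of finding SSRs with suffix and longest common prefix (LCP) arrays concrete, the sketch below shows one possible way to do it in Python. It is not the SA-SSR implementation and makes no claim about how the authors scan the arrays. The function names (`suffix_array`, `lcp_array`, `find_ssrs`) and the parameters (`min_period`, `max_period`, `min_copies`) are hypothetical, the suffix array is built naively, and the LCP of two arbitrary suffixes is taken as a plain minimum over a slice of the LCP array, so the code is only suitable for short sequences. It relies on one fact: if the suffixes starting at positions i and i + p share a prefix of length at least p, the sequence is periodic with period p starting at i.

```python
def suffix_array(s):
    # Naive construction by sorting suffixes; fine for a short illustration only.
    return sorted(range(len(s)), key=lambda i: s[i:])


def lcp_array(s, sa):
    # Kasai's algorithm: lcp[r] is the length of the common prefix of the
    # suffixes at sa[r - 1] and sa[r]; lcp[0] is 0 by convention.
    n = len(s)
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r
    lcp = [0] * n
    h = 0
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]
            while i + h < n and j + h < n and s[i + h] == s[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0
    return lcp, rank


def find_ssrs(s, min_period=1, max_period=6, min_copies=3):
    # Returns a sorted list of (start, repeat_unit, copy_count) tuples.
    # All parameter names and defaults are illustrative assumptions.
    sa = suffix_array(s)
    lcp, rank = lcp_array(s, sa)
    n = len(s)
    hits = []
    for p in range(min_period, max_period + 1):
        for i in range(n - p):
            # Skip positions where the period-p run extends further left,
            # so each run is reported once, at its leftmost start.
            if i > 0 and s[i - 1] == s[i - 1 + p]:
                continue
            # LCP of suffixes i and i + p is the minimum of the LCP array
            # between their ranks (a linear scan here, for simplicity).
            lo, hi = sorted((rank[i], rank[i + p]))
            shared = min(lcp[lo + 1:hi + 1])
            copies = (shared + p) // p
            unit = s[i:i + p]
            # Reject units that are themselves repeats of a shorter unit.
            is_primitive = unit not in (unit + unit)[1:-1]
            if copies >= min_copies and is_primitive:
                hits.append((i, unit, copies))
    return sorted(hits)


if __name__ == "__main__":
    print(find_ssrs("GGTACACACACTTTTTG"))
```

On the toy sequence in the usage line, the sketch reports the `AC` run starting at position 3 and the `T` run starting at position 11 (0-based). A practical tool would replace the naive suffix sort and the linear minimum with a linear-time construction and a range-minimum structure.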

AVAILABILITY AND IMPLEMENTATION

SA-SSR is freely available at http://github.com/ridgelab/SA-SSR

CONTACT

perry.ridge@byu.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
