Rotate: A command-line program to rotate circular DNA sequences to start at a given position or string.

Authors

Durbin Richard, De Sanctis Bianca, Blumer Moritz

Affiliation

Department of Genetics, University of Cambridge, Cambridge, England, CB2 3EH, UK.

Publication information

Wellcome Open Res. 2023 Sep 13;8:401. doi: 10.12688/wellcomeopenres.19568.1. eCollection 2023.

DOI:10.12688/wellcomeopenres.19568.1
PMID:38680652
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11056001/
Abstract

Sequences derived from circular DNA molecules (i.e. most bacterial, viral and plastid genomes) are expected to be linearised and rotated to a common start position for most downstream analyses including alignment. Despite this being a common and straightforward task, available software is either limited to a small number of input sequences, lacks the option to specify a custom anchor string, or requires a commercial license. Here, we present rotate, a simple, open source command line program written in C with no external dependencies, which can rotate a set of input sequences to a custom anchor string (allowing for a specified number of mismatches), or offset the input sequences to the desired position. The combination of both functionalities allows the rotation of all input sequences to any desired starting position, enabling downstream analysis. rotate is extremely fast and scales linearly with the number of input sequences, taking only seconds to rotate over a thousand mitochondrial sequences.
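
The two operations the abstract describes, rotating to an anchor string with a bounded number of mismatches and shifting by an offset, can be illustrated with a minimal Python sketch. This is not the rotate program's implementation or command-line interface: the function names (`rotate_by_offset`, `find_anchor`, `rotate_to_anchor`), the 0-based indexing, the substitution-only mismatch model, and the choice of the first matching position are all assumptions made for illustration.

```python
from typing import Optional


def rotate_by_offset(seq: str, offset: int) -> str:
    """Rotate a circular sequence so that 0-based index `offset` comes first."""
    if not seq:
        return seq
    offset %= len(seq)  # negative or oversized offsets wrap around the circle
    return seq[offset:] + seq[:offset]


def find_anchor(seq: str, anchor: str, max_mismatches: int = 0) -> Optional[int]:
    """Return the first 0-based start where `anchor` matches the circular
    sequence with at most `max_mismatches` substitutions, or None.

    Assumption: mismatches are substitutions only (no insertions or deletions).
    """
    n, m = len(seq), len(anchor)
    if m == 0 or m > n:
        return None
    # Append the first m-1 bases so a match may span the linearisation point.
    doubled = seq + seq[:m - 1]
    for start in range(n):
        mismatches = 0
        for a, b in zip(anchor, doubled[start:start + m]):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break
        else:
            # Inner loop finished without exceeding the mismatch budget.
            return start
    return None


def rotate_to_anchor(seq: str, anchor: str,
                     max_mismatches: int = 0, offset: int = 0) -> Optional[str]:
    """Rotate `seq` to start at `anchor`, then shift by `offset` bases.

    Returns None when the anchor is not found within the mismatch budget.
    """
    pos = find_anchor(seq, anchor, max_mismatches)
    if pos is None:
        return None
    return rotate_by_offset(seq, pos + offset)


if __name__ == "__main__":
    # Two linearisations of the same hypothetical circular molecule.
    a = "GGTACCATTT"
    b = "CATTTGGTAC"  # here the anchor spans the end and start of the string
    print(rotate_to_anchor(a, "ACCA"))
    print(rotate_to_anchor(b, "ACCA"))
```

Both example inputs are linearisations of the same circle, so rotating each to the anchor `ACCA` yields the same string, which is the precondition for aligning them. Passing a non-zero `offset` together with an anchor corresponds to the combination the abstract mentions: the anchor fixes a common reference point, and the offset moves the start to any desired position relative to it.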

Cited by

1. Complete mitogenomes reveal high diversity and recent population dynamics in Antarctic krill.
   BMC Genomics. 2025 Apr 29;26(1):419. doi: 10.1186/s12864-025-11579-w.
2. sp. nov, . sp. nov, . sp. nov, . sp. nov and . sp. nov: five new species isolated from water sources in the Midwestern United States.
   Int J Syst Evol Microbiol. 2025 Jan;75(1). doi: 10.1099/ijsem.0.006595.
