Sim4db and Leaff: utilities for fast batch spliced alignment and sequence indexing.

Affiliation

The J. Craig Venter Institute, Rockville, MD 20850, USA.

Publication information

Bioinformatics. 2011 Jul 1;27(13):1869-70. doi: 10.1093/bioinformatics/btr285. Epub 2011 May 6.

DOI: 10.1093/bioinformatics/btr285
PMID: 21551146
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3117389/

Abstract

The large number of genomes that will be sequenced will need to be annotated with genes and other functional features. Aligning gene sequences from a related species to the target genome is an economical and highly reliable method to identify genes; unfortunately, existing tools have been lacking in sensitivity and speed. A program we reported, sim4cc, was shown to be highly accurate but is limited to comparing one cDNA with one genomic sequence. We present here an optimization of the tool, implemented in the packages sim4db and leaff. The new tool performs batch alignments of cDNA and genomic sequences in a fraction of the time required by its predecessor, and thus is very well suited for genome-wide analyses.

AVAILABILITY

Sim4db and leaff are written in C, C++ and Perl for Linux and other Unix platforms. Source code is distributed free of charge from http://sourceforge.net/projects/kmer/.

CONTACT

florea@umiacs.umd.edu
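
The abstract names sequence indexing as one of the ingredients that makes batch alignment practical. As a rough illustration of that general idea only, the Python sketch below records the byte offset of every FASTA header so that any single record can later be fetched without rescanning the file. It is not the leaff implementation and does not reproduce its file format or command-line interface; the function names `build_fasta_index` and `fetch_sequence` and the demo records are hypothetical.

```python
import io


def build_fasta_index(handle):
    """Map each record ID to the byte offset of its header line.

    `handle` must be a seekable binary stream containing FASTA text.
    Hypothetical helper for illustration; not part of leaff or sim4db.
    """
    index = {}
    while True:
        offset = handle.tell()
        line = handle.readline()
        if not line:
            break
        if line.startswith(b">"):
            fields = line[1:].split()
            if fields:  # skip headers that carry no identifier
                index[fields[0].decode("ascii")] = offset
    return index


def fetch_sequence(handle, index, seq_id):
    """Return one sequence by seeking straight to its indexed offset."""
    handle.seek(index[seq_id])
    handle.readline()  # skip the header line
    parts = []
    for line in iter(handle.readline, b""):
        if line.startswith(b">"):
            break  # reached the next record
        parts.append(line.strip())
    return b"".join(parts).decode("ascii")


# Tiny in-memory FASTA file with made-up records.
demo = io.BytesIO(b">cdna1 example\nACGT\nTTGA\n>cdna2\nGGCC\n")
index = build_fasta_index(demo)
print(fetch_sequence(demo, index, "cdna2"))
```

With an index like this, a batch driver could look up each cDNA and genomic record by identifier rather than reading whole files for every pair, which is the kind of saving random access is meant to provide.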

References

1. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 2010 Sep 7;8(9):e1000475. doi: 10.1371/journal.pbio.1000475.
2. Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species. J Hered. 2009 Nov-Dec;100(6):659-74. doi: 10.1093/jhered/esp086. Epub 2009 Nov 5.
3. Sim4cc: a cross-species spliced alignment program. Nucleic Acids Res. 2009 Jun;37(11):e80. doi: 10.1093/nar/gkp319. Epub 2009 May 8.
4. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics. 2005 May 1;21(9):1859-75. doi: 10.1093/bioinformatics/bti310. Epub 2005 Feb 22.
5. BLAT--the BLAST-like alignment tool. Genome Res. 2002 Apr;12(4):656-64. doi: 10.1101/gr.229202.
6. A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 1998 Sep;8(9):967-74. doi: 10.1101/gr.8.9.967.