• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GMAP:一种用于mRNA和EST序列的基因组图谱绘制与比对程序。

GMAP: a genomic mapping and alignment program for mRNA and EST sequences.

作者信息

Wu Thomas D, Watanabe Colin K

机构信息

Department of Bioinformatics Genentech, Inc., South San Francisco, CA 94080, USA.

出版信息

Bioinformatics. 2005 May 1;21(9):1859-75. doi: 10.1093/bioinformatics/bti310. Epub 2005 Feb 22.

DOI:10.1093/bioinformatics/bti310
PMID:15728110
Abstract

MOTIVATION

We introduce GMAP, a standalone program for mapping and aligning cDNA sequences to a genome. The program maps and aligns a single sequence with minimal startup time and memory requirements, and provides fast batch processing of large sequence sets. The program generates accurate gene structures, even in the presence of substantial polymorphisms and sequence errors, without using probabilistic splice site models. Methodology underlying the program includes a minimal sampling strategy for genomic mapping, oligomer chaining for approximate alignment, sandwich DP for splice site detection, and microexon identification with statistical significance testing.

RESULTS

On a set of human messenger RNAs with random mutations at a 1 and 3% rate, GMAP identified all splice sites accurately in over 99.3% of the sequences, which was one-tenth the error rate of existing programs. On a large set of human expressed sequence tags, GMAP provided higher-quality alignments more often than blat did. On a set of Arabidopsis cDNAs, GMAP performed comparably with GeneSeqer. In these experiments, GMAP demonstrated a several-fold increase in speed over existing programs.

AVAILABILITY

Source code for gmap and associated programs is available at http://www.gene.com/share/gmap

SUPPLEMENTARY INFORMATION

http://www.gene.com/share/gmap.

摘要

动机

我们介绍了GMAP,一个用于将cDNA序列映射和比对到基因组的独立程序。该程序以最少的启动时间和内存需求来映射和比对单个序列,并能对大型序列集进行快速批量处理。即使存在大量多态性和序列错误,该程序也能生成准确的基因结构,且不使用概率性剪接位点模型。该程序的基础方法包括用于基因组映射的最小采样策略、用于近似比对的寡聚物链接、用于剪接位点检测的夹心动态规划以及具有统计显著性检验的微外显子识别。

结果

在一组以1%和3%的速率存在随机突变的人类信使RNA上,GMAP在超过99.3%的序列中准确识别了所有剪接位点,这是现有程序错误率的十分之一。在一大组人类表达序列标签上,GMAP比blat更常提供更高质量的比对。在一组拟南芥cDNA上,GMAP的表现与GeneSeqer相当。在这些实验中,GMAP的速度比现有程序提高了几倍。

可用性

gmap及相关程序的源代码可在http://www.gene.com/share/gmap获取。

补充信息

http://www.gene.com/share/gmap。

相似文献

1
GMAP: a genomic mapping and alignment program for mRNA and EST sequences.GMAP:一种用于mRNA和EST序列的基因组图谱绘制与比对程序。
Bioinformatics. 2005 May 1;21(9):1859-75. doi: 10.1093/bioinformatics/bti310. Epub 2005 Feb 22.
2
Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus.基于与同一基因组位点匹配的多个EST的一致性剪接比对进行基因结构预测。
Bioinformatics. 2004 May 1;20(7):1157-69. doi: 10.1093/bioinformatics/bth058. Epub 2004 Feb 5.
3
PALMA: mRNA to genome alignments using large margin algorithms.帕尔马:使用大间隔算法将信使核糖核酸与基因组进行比对。
Bioinformatics. 2007 Aug 1;23(15):1892-900. doi: 10.1093/bioinformatics/btm275. Epub 2007 May 30.
4
EasyCluster: a fast and efficient gene-oriented clustering tool for large-scale transcriptome data.EasyCluster:一种用于大规模转录组数据的快速高效的面向基因的聚类工具。
BMC Bioinformatics. 2009 Jun 16;10 Suppl 6(Suppl 6):S10. doi: 10.1186/1471-2105-10-S6-S10.
5
A fast and sensitive algorithm for aligning ESTs to the human genome.一种用于将EST序列与人类基因组进行比对的快速且灵敏的算法。
J Bioinform Comput Biol. 2003 Jul;1(2):363-86. doi: 10.1142/s0219720003000058.
6
MGAlignIt: A web service for the alignment of mRNA/EST and genomic sequences.MGAlignIt:一种用于mRNA/EST与基因组序列比对的网络服务。
Nucleic Acids Res. 2003 Jul 1;31(13):3533-6. doi: 10.1093/nar/gkg561.
7
Modeling splicing sites with pairwise correlations.使用成对相关性对剪接位点进行建模。
Bioinformatics. 2002;18 Suppl 2:S27-34. doi: 10.1093/bioinformatics/18.suppl_2.s27.
8
SpliceMachine: predicting splice sites from high-dimensional local context representations.拼接机器:从高维局部上下文表示中预测剪接位点。
Bioinformatics. 2005 Apr 15;21(8):1332-8. doi: 10.1093/bioinformatics/bti166. Epub 2004 Nov 25.
9
SPA: a probabilistic algorithm for spliced alignment.SPA:一种用于剪接比对的概率算法。
PLoS Genet. 2006 Apr;2(4):e24. doi: 10.1371/journal.pgen.0020024. Epub 2006 Apr 28.
10
Fast and sensitive algorithm for aligning ESTs to human genome.用于将EST序列与人类基因组进行比对的快速灵敏算法。
Proc IEEE Comput Soc Bioinform Conf. 2002;1:43-53.

引用本文的文献

1
Integrating Full-Length and Second-Generation Transcriptomes to Elucidate the ApNPV-Induced Transcriptional Reprogramming in Midgut.整合全长转录组和第二代转录组以阐明苜蓿银纹夜蛾核多角体病毒诱导的中肠转录重编程
Insects. 2025 Jul 31;16(8):792. doi: 10.3390/insects16080792.
2
Full-Length Transcriptome Analysis of Alternative Splicing and Polyadenylation in the Molecular Regulation of Labor Division in .[物种名称]劳动分工分子调控中可变剪接和多聚腺苷酸化的全长转录组分析
Int J Mol Sci. 2025 Aug 14;26(16):7859. doi: 10.3390/ijms26167859.
3
Chromosome-level haplotype-resolved genome assembly provides insights into the highly heterozygous genome of Italian ryegrass (Lolium multiflorum Lam.).
染色体水平单倍型解析的基因组组装为多花黑麦草(Lolium multiflorum Lam.)高度杂合的基因组提供了见解。
Plant Genome. 2025 Sep;18(3):e70079. doi: 10.1002/tpg2.70079.
4
Better together: Subgenomes for allotetraploid potato wild relative Solanum acaule Bitt. reveal origins in Petota Clade 3 and 4.携手共进:异源四倍体马铃薯野生近缘种智利茄的亚基因组揭示其起源于马铃薯进化分支3和4。
Plant Genome. 2025 Sep;18(3):e70095. doi: 10.1002/tpg2.70095.
5
Analysis wheat wild relatives Thinopyrum intermedium and Roegneria kamoji genomes reveal different polyploid evolution paths.对小麦野生近缘种中间偃麦草和鹅观草基因组的分析揭示了不同的多倍体进化路径。
Nat Commun. 2025 Aug 18;16(1):7693. doi: 10.1038/s41467-025-63007-y.
6
Pangenome analysis of transposable element insertion polymorphisms reveals features underlying cold tolerance in rice.转座元件插入多态性的泛基因组分析揭示了水稻耐寒性的潜在特征。
Nat Commun. 2025 Aug 16;16(1):7634. doi: 10.1038/s41467-025-62887-4.
7
Mechanism of parent-of-origin effects revealed by multi-omic data in euro-chinese hybrid pigs.欧洲猪与中国猪杂交后代多组学数据揭示的亲本来源效应机制
Nat Commun. 2025 Aug 14;16(1):7542. doi: 10.1038/s41467-025-62243-6.
8
A telomere-to-telomere genome of wild soybean with resistance to soybean cyst nematode X12.对大豆胞囊线虫X12具有抗性的野生大豆的端粒到端粒基因组。
Sci Data. 2025 Aug 13;12(1):1412. doi: 10.1038/s41597-025-05741-y.
9
Exonize: a tool for finding and classifying exon duplications in annotated genomes.Exonize:一种用于在注释基因组中查找和分类外显子重复的工具。
Bioinform Adv. 2025 Jul 28;5(1):vbaf177. doi: 10.1093/bioadv/vbaf177. eCollection 2025.
10
Haplotype-resolved, gap-free genome assemblies provide insights into the divergence between Asian and European pears.单倍型解析、无间隙基因组组装为亚洲梨和欧洲梨之间的差异提供了见解。
Nat Genet. 2025 Aug 6. doi: 10.1038/s41588-025-02273-4.