Computational inference of homologous gene structures in the human genome.

Authors

Yeh R F, Lim L P, Burge C B

Affiliation

Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.

Publication

Genome Res. 2001 May;11(5):803-16. doi: 10.1101/gr.175701.

DOI:10.1101/gr.175701
PMID:11337476
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC311055/

Abstract

With the human genome sequence approaching completion, a major challenge is to identify the locations and encoded protein sequences of all human genes. To address this problem we have developed a new gene identification algorithm, GenomeScan, which combines exon-intron and splice signal models with similarity to known protein sequences in an integrated model. Extensive testing shows that GenomeScan can accurately identify the exon-intron structures of genes in finished or draft human genome sequence with a low rate of false-positives. Application of GenomeScan to 2.7 billion bases of human genomic DNA identified at least 20,000-25,000 human genes out of an estimated 30,000-40,000 present in the genome. The results show an accurate and efficient automated approach for identifying genes in higher eukaryotic genomes and provide a first-level annotation of the draft human genome.