Assembly, annotation, and integration of UNIGENE clusters into the human genome draft.

Authors

Zhuo D, Zhao W D, Wright F A, Yang H Y, Wang J P, Sears R, Baer T, Kwon D H, Gordon D, Gibbs S, Dai D, Yang Q, Spitzner J, Krahe R, Stredney D, Stutz A, Yuan B

Affiliation

Bioinformatics Group, James Cancer Hospital and Solove Research Institute, The Ohio State University, Columbus, Ohio 43210, USA.

Publication information

Genome Res. 2001 May;11(5):904-18. doi: 10.1101/gr.gr-1645r.

DOI: 10.1101/gr.gr-1645r
PMID: 11337484
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC311045/
Abstract

The recent release of the first draft of the human genome provides an unprecedented opportunity to integrate human genes and their functions in a complete positional context. However, at least three significant technical hurdles remain: first, to assemble a complete and nonredundant human transcript index; second, to accurately place the individual transcript indices on the human genome; and third, to functionally annotate all human genes. Here, we report the extension of the UNIGENE database through the assembly of its sequence clusters into nonredundant sequence contigs. Each resulting consensus was aligned to the human genome draft. A unique location for each transcript within the human genome was determined by the integration of the restriction fingerprint, assembled genomic contig, and radiation hybrid (RH) maps. A total of 59,500 UNIGENE clusters were mapped on the basis of at least three independent criteria as compared with the 30,000 human genes/ESTs currently mapped in Genemap'99. Finally, the extension of the human transcript consensus in this study enabled a greater number of putative functional assignments than the 11,000 annotated entries in UNIGENE. This study reports a draft physical map with annotations for a majority of the human transcripts, called the Human Index of Nonredundant Transcripts (HINT). Such information can be immediately applied to the discovery of new genes and the identification of candidate genes for positional cloning.
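
The abstract states that each transcript's location was determined by integrating three map types (restriction fingerprint, assembled genomic contig, and RH maps). The sketch below illustrates one simple way such evidence could be combined: a placement is accepted only when enough independent sources agree on it. It is not the authors' method. The function name `consensus_chromosome`, the source keys, the chromosome-level granularity, and the `min_support` threshold are all assumptions made for illustration.

```python
from collections import Counter
from typing import Optional

# Hypothetical source labels, named after the three map types in the abstract.
SOURCES = ("fingerprint", "genomic_contig", "rh_map")


def consensus_chromosome(
    evidence: dict[str, Optional[str]], min_support: int = 3
) -> Optional[str]:
    """Return the chromosome backed by at least `min_support` sources.

    `evidence` maps a source label to the chromosome that source suggests,
    or to None when that source has no placement for the transcript.
    Returns None when support is insufficient or the top candidates tie.
    """
    # Count votes only from sources that actually report a placement.
    votes = Counter(
        evidence.get(s) for s in SOURCES if evidence.get(s) is not None
    )
    if not votes:
        return None

    ranked = votes.most_common(2)
    best, count = ranked[0]

    # Assumption: a placement needs `min_support` agreeing sources.
    if count < min_support:
        return None
    # Assumption: a tie between two candidates means no unique location.
    if len(ranked) > 1 and ranked[1][1] == count:
        return None
    return best
```

With the default threshold, a transcript is placed only when all three sources agree; lowering `min_support` to 2 accepts a two-of-three majority. A real pipeline would compare positions and intervals rather than chromosome labels alone.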
