• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Evola:H-InvDB中所有人类基因的直系同源数据库,并对系统发育树进行人工整理。

Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees.

作者信息

Matsuya Akihiro, Sakate Ryuichi, Kawahara Yoshihiro, Koyanagi Kanako O, Sato Yoshiharu, Fujii Yasuyuki, Yamasaki Chisato, Habara Takuya, Nakaoka Hajime, Todokoro Fusano, Yamaguchi Kaori, Endo Toshinori, Oota Satoshi, Makalowski Wojciech, Ikeo Kazuho, Suzuki Yoshiyuki, Hanada Kousuke, Hashimoto Katsuyuki, Hirai Momoki, Iwama Hisakazu, Saitou Naruya, Hiraki Aiko T, Jin Lihua, Kaneko Yayoi, Kanno Masako, Murakami Katsuhiko, Noda Akiko Ogura, Saichi Naomi, Sanbonmatsu Ryoko, Suzuki Mami, Takeda Jun-ichi, Tanaka Masayuki, Gojobori Takashi, Imanishi Tadashi, Itoh Takeshi

机构信息

Integrated Database Group, Japan Biological Information Research Center, Japan Biological Informatics Consortium, Tokyo, Japan.

出版信息

Nucleic Acids Res. 2008 Jan;36(Database issue):D787-92. doi: 10.1093/nar/gkm878. Epub 2007 Nov 3.

DOI:10.1093/nar/gkm878
PMID:17982176
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2238928/
Abstract

Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called 'Evola'. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.

摘要

直系同源基因是不同物种中通过物种形成从共同祖先基因进化而来的基因。目前,随着各种物种转录组数据的快速增长,更可靠的直系同源信息是进一步研究的先决条件。然而,如果使用基于成对距离的方法,如相互BLAST搜索,直系同源基因的检测可能会出现错误。因此,作为H-InvDB(一个注释人类基因的综合数据库,网址为http://h-invitational.jp/)的子数据库,我们构建了一个经过全面策划的人类基因进化特征数据库,称为“Evola”。在直系同源基因检测过程中,基于保守基因组共线性和转录本序列相似性的计算分析之后,研究人员会通过检查系统发育树进行人工策划。总共有18968个人类基因在11种脊椎动物(黑猩猩、小鼠、牛、鸡、斑马鱼等)中有直系同源基因,这些直系同源基因要么是通过计算检测到的,要么是经过人工策划的。Evola提供直系同源基因和同源基因的氨基酸序列比对和系统发育树。在“d(N)/d(S)视图”中,可以分析人类与其他物种之间基因的自然选择情况。在“基因座图谱”中,可以比较直系同源基因座之间的所有转录本变体及其外显子/内含子结构。我们期望Evola能够作为一个全面且可靠的数据库,用于比较分析,以获取有关人类基因的新知识。Evola可在http://www.h-invitational.jp/evola/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb6/2238928/618508de009a/gkm878f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb6/2238928/7de9b7d19198/gkm878f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb6/2238928/618508de009a/gkm878f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb6/2238928/7de9b7d19198/gkm878f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb6/2238928/618508de009a/gkm878f2.jpg

相似文献

1
Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees.Evola:H-InvDB中所有人类基因的直系同源数据库,并对系统发育树进行人工整理。
Nucleic Acids Res. 2008 Jan;36(Database issue):D787-92. doi: 10.1093/nar/gkm878. Epub 2007 Nov 3.
2
The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts.H-Invitation数据库(H-InvDB),一个关于人类基因和转录本的综合注释资源库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D793-9. doi: 10.1093/nar/gkm999. Epub 2007 Dec 18.
3
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.通过成对物种比较对直系同源基因和旁系同源基因进行自动聚类。
J Mol Biol. 2001 Dec 14;314(5):1041-52. doi: 10.1006/jmbi.2000.5197.
4
H-InvDB in 2009: extended database and data mining resources for human genes and transcripts.H-InvDB 于 2009 年推出:为人类基因和转录本扩展了数据库和数据挖掘资源。
Nucleic Acids Res. 2010 Jan;38(Database issue):D626-32. doi: 10.1093/nar/gkp1020. Epub 2009 Nov 23.
5
H-InvDB in 2013: an omics study platform for human functional gene and transcript discovery.H-InvDB 于 2013 年:人类功能基因和转录本发现的组学研究平台。
Nucleic Acids Res. 2013 Jan;41(Database issue):D915-9. doi: 10.1093/nar/gks1245. Epub 2012 Nov 28.
6
The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.比较基因组学的另一面:牛和其他哺乳动物物种之间没有同源基因。
BMC Genomics. 2009 Dec 14;10:604. doi: 10.1186/1471-2164-10-604.
7
Integrative annotation of 21,037 human genes validated by full-length cDNA clones.由全长cDNA克隆验证的21,037个人类基因的综合注释。
PLoS Biol. 2004 Jun;2(6):e162. doi: 10.1371/journal.pbio.0020162. Epub 2004 Apr 20.
8
Computational methods for Gene Orthology inference.基因直系同源推断的计算方法。
Brief Bioinform. 2011 Sep;12(5):379-91. doi: 10.1093/bib/bbr030. Epub 2011 Jun 19.
9
PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium.PANTHER 版本 7:改进了系统发育树、直系同源物,以及与基因本体论联盟的合作。
Nucleic Acids Res. 2010 Jan;38(Database issue):D204-10. doi: 10.1093/nar/gkp1019. Epub 2009 Dec 16.
10
Investigation of protein functions through data-mining on integrated human transcriptome database, H-Invitational database (H-InvDB).通过对整合的人类转录组数据库H-Invitational数据库(H-InvDB)进行数据挖掘来研究蛋白质功能。
Gene. 2005 Dec 30;364:99-107. doi: 10.1016/j.gene.2005.05.036. Epub 2005 Sep 26.

引用本文的文献

1
VarPPUD: Variant post prioritization developed for undiagnosed genetic disorders.VarPPUD:为未确诊的遗传疾病开发的变异后优先级排序
medRxiv. 2024 Apr 20:2024.04.15.24305876. doi: 10.1101/2024.04.15.24305876.
2
Characterization, comparative, and functional analysis of arylacetamide deacetylase from Gnathostomata organisms.颚口纲生物芳基乙酰胺脱乙酰酶的表征、比较及功能分析。
J Genet Eng Biotechnol. 2022 Dec 21;20(1):169. doi: 10.1186/s43141-022-00443-z.
3
Identifying digenic disease genes via machine learning in the Undiagnosed Diseases Network.

本文引用的文献

1
The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts.H-Invitation数据库(H-InvDB),一个关于人类基因和转录本的综合注释资源库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D793-9. doi: 10.1093/nar/gkm999. Epub 2007 Dec 18.
2
Rates of genome evolution and branching order from whole genome analysis.基于全基因组分析的基因组进化速率和分支顺序
Mol Biol Evol. 2007 Aug;24(8):1722-30. doi: 10.1093/molbev/msm094. Epub 2007 May 9.
3
New developments in the InterPro database.InterPro数据库的新进展。
通过机器学习在未确诊疾病网络中鉴定双基因疾病基因。
Am J Hum Genet. 2021 Oct 7;108(10):1946-1963. doi: 10.1016/j.ajhg.2021.08.010. Epub 2021 Sep 15.
4
Functional characterization of rare NRXN1 variants identified in autism spectrum disorders and schizophrenia.鉴定自闭症谱系障碍和精神分裂症中罕见 NRXN1 变异的功能特征。
J Neurodev Disord. 2020 Sep 17;12(1):25. doi: 10.1186/s11689-020-09325-2.
5
Investigation of Rare Single-Nucleotide PCDH15 Variants in Schizophrenia and Autism Spectrum Disorders.精神分裂症和自闭症谱系障碍中罕见的原钙黏蛋白15单核苷酸变异的研究。
PLoS One. 2016 Apr 8;11(4):e0153224. doi: 10.1371/journal.pone.0153224. eCollection 2016.
6
Archaeal Clusters of Orthologous Genes (arCOGs): An Update and Application for Analysis of Shared Features between Thermococcales, Methanococcales, and Methanobacteriales.古菌直系同源基因簇 (arCOGs):对热球菌目、甲烷球菌目和甲烷杆菌目之间共享特征进行分析的更新与应用。
Life (Basel). 2015 Mar 10;5(1):818-40. doi: 10.3390/life5010818.
7
The Hsp90-dependent proteome is conserved and enriched for hub proteins with high levels of protein-protein connectivity.热休克蛋白90(Hsp90)依赖的蛋白质组具有保守性,且富含蛋白质-蛋白质连接性高的中心蛋白。
Genome Biol Evol. 2014 Oct 13;6(10):2851-65. doi: 10.1093/gbe/evu226.
8
H-InvDB in 2013: an omics study platform for human functional gene and transcript discovery.H-InvDB 于 2013 年:人类功能基因和转录本发现的组学研究平台。
Nucleic Acids Res. 2013 Jan;41(Database issue):D915-9. doi: 10.1093/nar/gks1245. Epub 2012 Nov 28.
9
Revealing mammalian evolutionary relationships by comparative analysis of gene clusters.通过基因簇的比较分析揭示哺乳动物的进化关系。
Genome Biol Evol. 2012;4(4):586-601. doi: 10.1093/gbe/evs032. Epub 2012 Mar 27.
10
Developing a community-based genetic nomenclature for anole lizards.建立基于社区的安乐蜥遗传命名法。
BMC Genomics. 2011 Nov 11;12:554. doi: 10.1186/1471-2164-12-554.
Nucleic Acids Res. 2007 Jan;35(Database issue):D224-8. doi: 10.1093/nar/gkl841.
4
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2007 Jan;35(Database issue):D5-12. doi: 10.1093/nar/gkl1031. Epub 2006 Dec 14.
5
Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56,419 completely sequenced and manually annotated full-length cDNAs.利用56419条完全测序且经过人工注释的全长cDNA对人类基因转录本的可变剪接变体进行大规模鉴定和表征。
Nucleic Acids Res. 2006;34(14):3917-28. doi: 10.1093/nar/gkl507. Epub 2006 Aug 12.
6
TreeFam: a curated database of phylogenetic trees of animal gene families.TreeFam:一个经过精心策划的动物基因家族系统发育树数据库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D572-80. doi: 10.1093/nar/gkj118.
7
Ensembl 2006.Ensembl 2006。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D556-61. doi: 10.1093/nar/gkj133.
8
Investigation of protein functions through data-mining on integrated human transcriptome database, H-Invitational database (H-InvDB).通过对整合的人类转录组数据库H-Invitational数据库(H-InvDB)进行数据挖掘来研究蛋白质功能。
Gene. 2005 Dec 30;364:99-107. doi: 10.1016/j.gene.2005.05.036. Epub 2005 Sep 26.
9
A web tool for comparative genomics: G-compass.一种用于比较基因组学的网络工具:G-compass。
Gene. 2005 Dec 30;364:45-52. doi: 10.1016/j.gene.2005.05.043. Epub 2005 Oct 5.
10
Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases.系统发育树中的树形模式匹配:在同源基因序列数据库中自动搜索直系同源基因或旁系同源基因。
Bioinformatics. 2005 Jun 1;21(11):2596-603. doi: 10.1093/bioinformatics/bti325. Epub 2005 Feb 15.