• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GIFtS:利用 GeneCards 进行注释景观分析。

GIFtS: annotation landscape analysis with GeneCards.

机构信息

Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel.

出版信息

BMC Bioinformatics. 2009 Oct 23;10:348. doi: 10.1186/1471-2105-10-348.

DOI:10.1186/1471-2105-10-348
PMID:19852797
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2774327/
Abstract

BACKGROUND

Gene annotation is a pivotal component in computational genomics, encompassing prediction of gene function, expression analysis, and sequence scrutiny. Hence, quantitative measures of the annotation landscape constitute a pertinent bioinformatics tool. GeneCards is a gene-centric compendium of rich annotative information for over 50,000 human gene entries, building upon 68 data sources, including Gene Ontology (GO), pathways, interactions, phenotypes, publications and many more.

RESULTS

We present the GeneCards Inferred Functionality Score (GIFtS) which allows a quantitative assessment of a gene's annotation status, by exploiting the unique wealth and diversity of GeneCards information. The GIFtS tool, linked from the GeneCards home page, facilitates browsing the human genome by searching for the annotation level of a specified gene, retrieving a list of genes within a specified range of GIFtS value, obtaining random genes with a specific GIFtS value, and experimenting with the GIFtS weighting algorithm for a variety of annotation categories. The bimodal shape of the GIFtS distribution suggests a division of the human gene repertoire into two main groups: the high-GIFtS peak consists almost entirely of protein-coding genes; the low-GIFtS peak consists of genes from all of the categories. Cluster analysis of GIFtS annotation vectors provides the classification of gene groups by detailed positioning in the annotation arena. GIFtS also provide measures which enable the evaluation of the databases that serve as GeneCards sources. An inverse correlation is found (for GIFtS>25) between the number of genes annotated by each source, and the average GIFtS value of genes associated with that source. Three typical source prototypes are revealed by their GIFtS distribution: genome-wide sources, sources comprising mainly highly annotated genes, and sources comprising mainly poorly annotated genes. The degree of accumulated knowledge for a given gene measured by GIFtS was correlated (for GIFtS>30) with the number of publications for a gene, and with the seniority of this entry in the HGNC database.

CONCLUSION

GIFtS can be a valuable tool for computational procedures which analyze lists of large set of genes resulting from wet-lab or computational research. GIFtS may also assist the scientific community with identification of groups of uncharacterized genes for diverse applications, such as delineation of novel functions and charting unexplored areas of the human genome.

摘要

背景

基因注释是计算基因组学的关键组成部分,包括预测基因功能、表达分析和序列分析。因此,注释景观的定量度量是一种相关的生物信息学工具。GeneCards 是一个以基因为中心的综合资源,包含超过 50,000 个人类基因条目丰富的注释信息,基于 68 个数据源,包括基因本体论 (GO)、途径、相互作用、表型、出版物等。

结果

我们提出了 GeneCards 推断功能评分 (GIFtS),通过利用 GeneCards 信息的独特丰富性和多样性,可以对基因的注释状态进行定量评估。GIFtS 工具链接到 GeneCards 主页,通过搜索指定基因的注释水平,可以方便地浏览人类基因组,检索特定 GIFtS 值范围内的基因列表,获取特定 GIFtS 值的随机基因,并针对各种注释类别进行 GIFtS 加权算法的实验。GIFtS 分布的双峰形状表明,人类基因库可以分为两个主要组:高 GIFtS 峰几乎完全由编码蛋白质的基因组成;低 GIFtS 峰由所有类别的基因组成。通过 GIFtS 注释向量的聚类分析,可以对基因组进行详细定位的分类。GIFtS 还提供了评估作为 GeneCards 来源的数据库的度量标准。发现(对于 GIFtS>25)每个来源注释的基因数量与与该来源相关的基因的平均 GIFtS 值之间存在负相关关系。通过其 GIFtS 分布揭示了三种典型的源原型:全基因组源、主要包含高度注释基因的源以及主要包含低度注释基因的源。通过 GIFtS 测量的给定基因的累积知识程度(对于 GIFtS>30)与基因的出版物数量以及 HGNC 数据库中该条目的年龄相关。

结论

GIFtS 可以成为分析湿实验室或计算研究产生的大量基因列表的计算过程的有价值的工具。GIFtS 还可以帮助科学界识别各种应用的未表征基因组,例如描绘新功能和绘制人类基因组的未探索区域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/3828010050b6/1471-2105-10-348-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/8afdf2d13946/1471-2105-10-348-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/e1aa20055ab5/1471-2105-10-348-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/7c17058ca694/1471-2105-10-348-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/cb8f1cb66f78/1471-2105-10-348-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/de0d44735ce7/1471-2105-10-348-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/d69d728ec492/1471-2105-10-348-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/3828010050b6/1471-2105-10-348-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/8afdf2d13946/1471-2105-10-348-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/e1aa20055ab5/1471-2105-10-348-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/7c17058ca694/1471-2105-10-348-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/cb8f1cb66f78/1471-2105-10-348-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/de0d44735ce7/1471-2105-10-348-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/d69d728ec492/1471-2105-10-348-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd6/2774327/3828010050b6/1471-2105-10-348-7.jpg

相似文献

1
GIFtS: annotation landscape analysis with GeneCards.GIFtS:利用 GeneCards 进行注释景观分析。
BMC Bioinformatics. 2009 Oct 23;10:348. doi: 10.1186/1471-2105-10-348.
2
GeneCards Version 3: the human gene integrator.GeneCards 版本 3:人类基因综合数据库。
Database (Oxford). 2010 Aug 5;2010:baq020. doi: 10.1093/database/baq020.
3
Rational confederation of genes and diseases: NGS interpretation via GeneCards, MalaCards and VarElect.基因与疾病的合理关联:通过基因卡片、疾病卡片和变异电子工具进行二代测序解读
Biomed Eng Online. 2017 Aug 18;16(Suppl 1):72. doi: 10.1186/s12938-017-0359-2.
4
MalaCards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search.MalaCards:一个整合了多种临床和基因注释以及结构化搜索功能的人类疾病综合纲要。
Nucleic Acids Res. 2017 Jan 4;45(D1):D877-D887. doi: 10.1093/nar/gkw1012. Epub 2016 Nov 28.
5
6
Genic insights from integrated human proteomics in GeneCards.来自基因卡片中整合人类蛋白质组学的基因见解。
Database (Oxford). 2016 Apr 5;2016. doi: 10.1093/database/baw030. Print 2016.
7
Non-redundant compendium of human ncRNA genes in GeneCards.GeneCards 中人类非冗余 ncRNA 基因简编
Bioinformatics. 2013 Jan 15;29(2):255-61. doi: 10.1093/bioinformatics/bts676. Epub 2012 Nov 19.
8
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
9
DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists.DAVID生物信息学资源:扩展注释数据库和新颖算法,以便从大型基因列表中更好地提取生物学信息。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W169-75. doi: 10.1093/nar/gkm415. Epub 2007 Jun 18.
10
GeneCaRNA: A Comprehensive Gene-centric Database of Human Non-coding RNAs in the GeneCards Suite.GeneCaRNA:GeneCards 套件中人类非编码 RNA 的综合基因中心数据库。
J Mol Biol. 2021 May 28;433(11):166913. doi: 10.1016/j.jmb.2021.166913. Epub 2021 Mar 4.

引用本文的文献

1
Pre-training Genomic Language Model with Variants for Better Modeling Functional Genomics.使用变异体预训练基因组语言模型以更好地建模功能基因组学。
bioRxiv. 2025 Aug 23:2025.02.26.640468. doi: 10.1101/2025.02.26.640468.
2
PTPN11 is a potential biomarker for type 2 diabetes mellitus complicated with colorectal cancer.PTPN11 是 2 型糖尿病合并结直肠癌的潜在生物标志物。
Sci Rep. 2024 Oct 24;14(1):25155. doi: 10.1038/s41598-024-75889-x.
3
Expanding and Enriching the LncRNA Gene-Disease Landscape Using the GeneCaRNA Database.利用GeneCaRNA数据库扩展和丰富长链非编码RNA基因-疾病图谱

本文引用的文献

1
GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation.GeneDecks:利用 GeneCards 注释进行基因冗余搜索和基因集提取。
OMICS. 2009 Dec;13(6):477-87. doi: 10.1089/omi.2009.0069.
2
Interpreting neonatal lethal phenotypes in mouse mutants: insights into gene function and human diseases.解读小鼠突变体中的新生儿致死表型:对基因功能和人类疾病的见解。
Physiol Rev. 2009 Jan;89(1):1-26. doi: 10.1152/physrev.00040.2007.
3
Gene characterization index: assessing the depth of gene annotation.基因特征索引:评估基因注释的深度。
Biomedicines. 2024 Jun 12;12(6):1305. doi: 10.3390/biomedicines12061305.
4
Matrine Targets BTF3 to Inhibit the Growth of Canine Mammary Tumor Cells.苦参碱靶向 BTF3 抑制犬乳腺肿瘤细胞生长。
Int J Mol Sci. 2023 Dec 30;25(1):540. doi: 10.3390/ijms25010540.
5
Identification of Rare Genetic Variants in Familial Spontaneous Coronary Artery Dissection and Evidence for Shared Biological Pathways.家族性自发性冠状动脉夹层中罕见基因变异的鉴定及共享生物学途径的证据
J Cardiovasc Dev Dis. 2023 Sep 12;10(9):393. doi: 10.3390/jcdd10090393.
6
Biochemical Targets and Molecular Mechanism of Matrine against Aging.苦参碱抗衰老的生化靶标及分子机制。
Int J Mol Sci. 2023 Jun 14;24(12):10098. doi: 10.3390/ijms241210098.
7
Anti-proliferative and pro-apoptotic effects of curcumin on skin cutaneous melanoma: Bioinformatics analysis and experimental studies.姜黄素对皮肤黑色素瘤的抗增殖和促凋亡作用:生物信息学分析与实验研究
Front Genet. 2022 Sep 12;13:983943. doi: 10.3389/fgene.2022.983943. eCollection 2022.
8
Glioblastoma gene network reconstruction and ontology analysis by online bioinformatics tools.通过在线生物信息学工具进行胶质母细胞瘤基因网络重建和本体分析。
J Integr Bioinform. 2021 Nov 16;18(4):20210031. doi: 10.1515/jib-2021-0031.
9
Network pharmacology, molecular docking integrated surface plasmon resonance technology reveals the mechanism of Toujie Quwen Granules against coronavirus disease 2019 pneumonia.网络药理学、分子对接结合表面等离子共振技术揭示透解祛瘟颗粒治疗 2019 冠状病毒病肺炎的作用机制。
Phytomedicine. 2021 May;85:153401. doi: 10.1016/j.phymed.2020.153401. Epub 2020 Oct 28.
10
Functionally Enigmatic Genes in Cancer: Using TCGA Data to Map the Limitations of Annotations.癌症中功能神秘的基因:利用 TCGA 数据绘制注释的局限性图谱。
Sci Rep. 2020 Mar 5;10(1):4106. doi: 10.1038/s41598-020-60456-x.
PLoS One. 2008 Jan 23;3(1):e1440. doi: 10.1371/journal.pone.0001440.
4
Gene Ontology annotation quality analysis in model eukaryotes.模式真核生物中的基因本体注释质量分析
Nucleic Acids Res. 2008 Feb;36(2):e12. doi: 10.1093/nar/gkm1167. Epub 2008 Jan 10.
5
Inflammation-related genes up-regulated in schizophrenia brains.在精神分裂症患者大脑中上调的炎症相关基因。
BMC Psychiatry. 2007 Sep 6;7:46. doi: 10.1186/1471-244X-7-46.
6
Characterization of protein-interaction networks in tumors.肿瘤中蛋白质相互作用网络的表征
BMC Bioinformatics. 2007 Jun 27;8:224. doi: 10.1186/1471-2105-8-224.
7
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.ENCODE试点项目对人类基因组1%的功能元件进行鉴定与分析。
Nature. 2007 Jun 14;447(7146):799-816. doi: 10.1038/nature05874.
8
What is a gene, post-ENCODE? History and updated definition.后ENCODE时代的基因是什么?历史与更新后的定义。
Genome Res. 2007 Jun;17(6):669-81. doi: 10.1101/gr.6339607.
9
Gene expression profiling of esophageal cancer: comparative analysis of Barrett's esophagus, adenocarcinoma, and squamous cell carcinoma.食管癌的基因表达谱分析:巴雷特食管、腺癌和鳞状细胞癌的比较分析。
Int J Cancer. 2007 May 1;120(9):1914-21. doi: 10.1002/ijc.22501.
10
The HUGO Gene Nomenclature Database, 2006 updates.《人类基因组组织基因命名数据库》2006年更新版。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D319-21. doi: 10.1093/nar/gkj147.