• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通路分析软件:注释错误及解决方法。

Pathway analysis software: annotation errors and solutions.

机构信息

Department of Pediatrics, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095-7088, USA.

出版信息

Mol Genet Metab. 2010 Oct-Nov;101(2-3):134-40. doi: 10.1016/j.ymgme.2010.06.005. Epub 2010 Jun 22.

DOI:10.1016/j.ymgme.2010.06.005
PMID:20663702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2950253/
Abstract

Genetic databases contain a variety of annotation errors that often go unnoticed due to the large size of modern genetic data sets. Interpretation of these data sets requires bioinformatics tools that may contribute to this problem. While providing gene symbol annotations for identifiers (IDs) such as microarray probe set, RefSeq, GenBank, and Entrez Gene is seemingly trivial, the accuracy is fundamental to any subsequent conclusions. We examine gene symbol annotations and results from three commercial pathway analysis software (PAS) packages: Ingenuity Pathways Analysis, GeneGO, and Pathway Studio. We compare gene symbol annotations and canonical pathway results over time and among different input ID types. We find that PAS results can be affected by variation in gene symbol annotations across software releases and the input ID type analyzed. As a result, we offer suggestions for using commercial PAS and reporting microarray results to improve research quality. We propose a wiki type website to facilitate communication of bioinformatics software problems within the scientific community.

摘要

遗传数据库包含各种注释错误,由于现代遗传数据集的规模庞大,这些错误常常被忽视。这些数据集的解释需要生物信息学工具,而这些工具可能会导致这个问题。虽然为标识符(ID)提供基因符号注释(例如微阵列探针集、RefSeq、GenBank 和 Entrez Gene)看似微不足道,但准确性对于任何后续结论都是至关重要的。我们检查了三个商业通路分析软件 (PAS) 包的基因符号注释和结果:Ingenuity Pathways Analysis、GeneGO 和 Pathway Studio。我们比较了不同软件版本和不同输入 ID 类型之间的基因符号注释和规范通路结果。我们发现 PAS 结果可能会受到软件版本之间基因符号注释的变化以及分析的输入 ID 类型的影响。因此,我们为使用商业 PAS 和报告微阵列结果提出了建议,以提高研究质量。我们提议建立一个维基类型的网站,以促进科学界内部生物信息学软件问题的交流。

相似文献

1
Pathway analysis software: annotation errors and solutions.通路分析软件:注释错误及解决方法。
Mol Genet Metab. 2010 Oct-Nov;101(2-3):134-40. doi: 10.1016/j.ymgme.2010.06.005. Epub 2010 Jun 22.
2
MADGene: retrieval and processing of gene identifier lists for the analysis of heterogeneous microarray datasets.MADGene:用于分析异质微阵列数据集的基因标识符列表的检索和处理。
Bioinformatics. 2011 Mar 1;27(5):725-6. doi: 10.1093/bioinformatics/btq710. Epub 2011 Jan 6.
3
GeneTools--application for functional annotation and statistical hypothesis testing.基因工具——用于功能注释和统计假设检验的应用程序。
BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470.
4
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
5
Onto-Tools: new additions and improvements in 2006.Onto-Tools:2006年的新增加内容及改进
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W206-11. doi: 10.1093/nar/gkm327. Epub 2007 Jun 21.
6
A web-based platform for rice microarray annotation and data analysis.一个基于网络的水稻芯片注释和数据分析平台。
Sci China Life Sci. 2010 Dec;53(12):1467-73. doi: 10.1007/s11427-010-4101-6. Epub 2010 Dec 23.
7
RACE: Remote Analysis Computation for gene Expression data.RACE:基因表达数据的远程分析计算
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W638-43. doi: 10.1093/nar/gki490.
8
CRCView: a web server for analyzing and visualizing microarray gene expression data using model-based clustering.CRCView:一个用于使用基于模型的聚类分析和可视化微阵列基因表达数据的网络服务器。
Bioinformatics. 2007 Jul 15;23(14):1843-5. doi: 10.1093/bioinformatics/btm238. Epub 2007 May 7.
9
Recent additions and improvements to the Onto-Tools.Onto-Tools最近的新增功能和改进。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W762-5. doi: 10.1093/nar/gki472.
10
Consistent annotation of gene expression arrays.基因表达谱的一致注释。
BMC Genomics. 2010 May 11;11:294. doi: 10.1186/1471-2164-11-294.

引用本文的文献

1
A genome-wide association study identified PRKCB as a causal gene and therapeutic target for Mycobacterium avium complex disease.一项全基因组关联研究确定PRKCB是鸟分枝杆菌复合群疾病的致病基因和治疗靶点。
Cell Rep Med. 2025 Feb 18;6(2):101923. doi: 10.1016/j.xcrm.2024.101923. Epub 2025 Jan 22.
2
High-throughput analysis and functional interpretation of extracellular vesicle content in hematological malignancies.血液系统恶性肿瘤中细胞外囊泡内容物的高通量分析及功能解读
Comput Struct Biotechnol J. 2020 Sep 24;18:2670-2677. doi: 10.1016/j.csbj.2020.09.027. eCollection 2020.
3
An introductory review of parallel independent component analysis (p-ICA) and a guide to applying p-ICA to genetic data and imaging phenotypes to identify disease-associated biological pathways and systems in common complex disorders.

本文引用的文献

1
ArrayIDer: automated structural re-annotation pipeline for DNA microarrays.ArrayIDer:用于DNA微阵列的自动化结构重新注释流程
BMC Bioinformatics. 2009 Jan 23;10:30. doi: 10.1186/1471-2105-10-30.
2
The public road to high-quality curated biological pathways.通往高质量精选生物途径的公共道路。
Drug Discov Today. 2008 Oct;13(19-20):856-62. doi: 10.1016/j.drudis.2008.06.013. Epub 2008 Aug 27.
3
nuID: a universal naming scheme of oligonucleotides for illumina, affymetrix, and other microarrays.nuID:一种用于Illumina、Affymetrix及其他微阵列的寡核苷酸通用命名方案。
并行独立成分分析(p-ICA)简介及将p-ICA应用于遗传数据和成像表型以识别常见复杂疾病中与疾病相关的生物途径和系统的指南。
Front Genet. 2015 Sep 7;6:276. doi: 10.3389/fgene.2015.00276. eCollection 2015.
4
Gene Network Analysis in Amygdala following Taste Aversion Learning in Rats.大鼠味觉厌恶学习后杏仁核中的基因网络分析
Neurosci J. 2013;2013:739764. doi: 10.1155/2013/739764. Epub 2013 May 23.
5
Systems proteomics for translational network medicine.用于转化网络医学的系统蛋白质组学。
Circ Cardiovasc Genet. 2012 Aug 1;5(4):478. doi: 10.1161/CIRCGENETICS.110.958991.
6
Investigating the secretome: lessons about the cells that comprise the heart.探索分泌蛋白组:关于构成心脏的细胞的经验教训。
Circ Cardiovasc Genet. 2012 Feb 1;5(1):o8-o18. doi: 10.1161/CIRCGENETICS.111.960187.
Biol Direct. 2007 May 31;2:16. doi: 10.1186/1745-6150-2-16.
4
Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: high-resolution annotation for microarrays.使用AceView基于转录本对分组寡核苷酸探针集进行重新定义:微阵列的高分辨率注释
BMC Bioinformatics. 2007 Mar 29;8:108. doi: 10.1186/1471-2105-8-108.
5
Comparisons of annotation predictions for affymetrix GeneChips.Affymetrix基因芯片注释预测的比较。
Appl Bioinformatics. 2006;5(4):237-48. doi: 10.2165/00822942-200605040-00006.
6
Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression.数据繁多,但可互换性低:从挖掘基因表达组织特异性的公共数据中获得的教训。
Biol Direct. 2006 Oct 25;1:33. doi: 10.1186/1745-6150-1-33.
7
The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.微阵列质量控制(MAQC)项目展示了基因表达测量在不同平台间和同一平台内的可重复性。
Nat Biotechnol. 2006 Sep;24(9):1151-61. doi: 10.1038/nbt1239.
8
Comparison of gene coverage of mouse oligonucleotide microarray platforms.小鼠寡核苷酸微阵列平台的基因覆盖度比较。
BMC Genomics. 2006 Mar 21;7:58. doi: 10.1186/1471-2164-7-58.
9
Targeted disruption of glycerol kinase gene in mice: expression analysis in liver shows alterations in network partners related to glycerol kinase activity.小鼠甘油激酶基因的靶向破坏:肝脏中的表达分析显示与甘油激酶活性相关的网络伙伴发生改变。
Hum Mol Genet. 2006 Feb 1;15(3):405-15. doi: 10.1093/hmg/ddi457. Epub 2005 Dec 20.
10
Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data.不断演变的基因/转录本定义显著改变了对基因芯片数据的解读。
Nucleic Acids Res. 2005 Nov 10;33(20):e175. doi: 10.1093/nar/gni179.