• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于表型高通量比较生物学的术语映射

Terminological mapping for high throughput comparative biology of phenotypes.

作者信息

Lussier Y A, Li J

机构信息

Department of Biomedical Informatics, Columbia University College of Physicians and Surgeons, New York, NY 10032, USA.

出版信息

Pac Symp Biocomput. 2004:202-13. doi: 10.1142/9789812704856_0020.

DOI:10.1142/9789812704856_0020
PMID:14992504
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2891021/
Abstract

Comparative biological studies have led to remarkable biomedical discoveries. While genomic science and technologies are advancing rapidly, our ability to precisely specify a phenotype and compare it to related phenotypes of other organisms remains challenging. This study has examined the systematic use of terminology and knowledge based technologies to enable high-throughput comparative phenomics. More specifically, we measured the accuracy of a multi-strategy automated classification method to bridge the phenotype gap between a phenotypic terminology (MGD: Phenoslim) and a broad-coverage clinical terminology (SNOMED CT). Furthermore, we qualitatively evaluate the additional emerging properties of the combined terminological network for comparative biology and discovery science. According to the gold standard (n = 100), the accuracies (precision / recall) of the composite automated methods were 67% / 97% (mapping for identical concepts) and 85% / 98% (classification). Quantitatively, only 2% of the phenotypic concepts were missing from the clinical terminology, however, qualitatively the gap was larger: conceptual scope, granularity and subtle yet significant, homonymy problems were observed. These results suggest that, as observed in other domains, additional strategies are required for combining terminologies.

摘要

比较生物学研究带来了卓越的生物医学发现。虽然基因组科学和技术正在迅速发展,但我们精确确定一种表型并将其与其他生物的相关表型进行比较的能力仍然具有挑战性。本研究考察了基于术语和知识的技术的系统应用,以实现高通量比较表型组学。更具体地说,我们测量了一种多策略自动分类方法的准确性,以弥合表型术语(MGD:Phenoslim)和广泛覆盖的临床术语(SNOMED CT)之间的表型差距。此外,我们定性评估了用于比较生物学和发现科学的组合术语网络的其他新兴特性。根据金标准(n = 100),复合自动方法的准确率(精确率/召回率)分别为67% / 97%(相同概念的映射)和85% / 98%(分类)。从数量上看,临床术语中仅2%的表型概念缺失,然而,从质量上看差距更大:观察到概念范围、粒度以及细微但显著的同名问题。这些结果表明,正如在其他领域所观察到的那样,组合术语需要额外的策略。

相似文献

1
Terminological mapping for high throughput comparative biology of phenotypes.用于表型高通量比较生物学的术语映射
Pac Symp Biocomput. 2004:202-13. doi: 10.1142/9789812704856_0020.
2
A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization.用于语义协调的SNOMED CT概念内容密度的比较分析。
Artif Intell Med. 2015 May;64(1):29-40. doi: 10.1016/j.artmed.2015.03.002. Epub 2015 Apr 2.
3
Automating terminological networks to link heterogeneous biomedical databases.自动化术语网络以连接异构生物医学数据库。
Stud Health Technol Inform. 2004;107(Pt 1):555-9.
4
An evaluation of hybrid methods for matching biomedical terminologies: mapping the gene ontology to the UMLS.用于匹配生物医学术语的混合方法评估:将基因本体映射到统一医学语言系统。
Stud Health Technol Inform. 2003;95:62-7.
5
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience.将临床表型数据元素映射到标准化元数据存储库和受控术语:eMERGE 网络的经验。
J Am Med Inform Assoc. 2011 Jul-Aug;18(4):376-86. doi: 10.1136/amiajnl-2010-000061. Epub 2011 May 19.
6
Linking biomedical language information and knowledge resources: GO and UMLS.链接生物医学语言信息与知识资源:基因本体论(GO)与统一医学语言系统(UMLS)
Pac Symp Biocomput. 2003:439-50. doi: 10.1142/9789812776303_0041.
7
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
8
Putting data integration into practice: using biomedical terminologies to add structure to existing data sources.将数据整合付诸实践:利用生物医学术语为现有数据源增添结构。
AMIA Annu Symp Proc. 2003;2003:125-9.
9
A health terminological system for inherited retinal diseases: Content coverage evaluation and a proposed classification.遗传性视网膜疾病健康术语系统:内容覆盖评估及分类建议。
PLoS One. 2023 Aug 4;18(8):e0281858. doi: 10.1371/journal.pone.0281858. eCollection 2023.
10
Issues in mapping LOINC laboratory tests to SNOMED CT.将LOINC实验室检查映射到SNOMED CT中的问题。
AMIA Annu Symp Proc. 2008 Nov 6;2008:51-5.

引用本文的文献

1
Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes.通过对非结构化表型进行映射和模型理论语义分解来整合神经影像和微阵列数据集。
Summit Transl Bioinform. 2009 Mar 1;2009:85-9.
2
Homolonto: generating homology relationships by pairwise alignment of ontologies and application to vertebrate anatomy.同源关系生成:通过本体的两两比对生成同源关系及其在脊椎动物解剖学中的应用。
Bioinformatics. 2010 Jul 15;26(14):1766-71. doi: 10.1093/bioinformatics/btq283. Epub 2010 Jun 2.
3
PhenoHM: human-mouse comparative phenome-genome server.PhenoHM:人类-小鼠比较表型-基因组服务器。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W165-74. doi: 10.1093/nar/gkq472. Epub 2010 May 27.
4
Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes.通过对非结构化表型进行映射和模型理论语义分解实现神经影像数据集与微阵列数据集的整合。
Cancer Inform. 2009 Jun 8;8:75-94. doi: 10.4137/cin.s1046.
5
PhenoGO: an integrated resource for the multiscale mining of clinical and biological data.PhenoGO:一个用于临床和生物数据多尺度挖掘的综合资源库。
BMC Bioinformatics. 2009 Feb 5;10 Suppl 2(Suppl 2):S8. doi: 10.1186/1471-2105-10-S2-S8.
6
Biomedical ontologies in action: role in knowledge management, data integration and decision support.生物医学本体的应用:在知识管理、数据集成和决策支持中的作用。
Yearb Med Inform. 2008:67-79.
7
Standards and ontologies for functional genomics 2.功能基因组学的标准与本体2。
Comp Funct Genomics. 2004;5(8):618-22. doi: 10.1002/cfg.448.
8
Mapping proteins to disease terminologies: from UniProt to MeSH.将蛋白质映射到疾病术语:从通用蛋白质数据库(UniProt)到医学主题词表(MeSH)。
BMC Bioinformatics. 2008 Apr 29;9 Suppl 5(Suppl 5):S3. doi: 10.1186/1471-2105-9-S5-S3.
9
Mapping the categories of the Swedish primary health care version of ICD-10 to SNOMED CT concepts: rule development and intercoder reliability in a mapping trial.将ICD - 10瑞典初级卫生保健版本的类别映射到SNOMED CT概念:映射试验中的规则制定与编码员间可靠性
BMC Med Inform Decis Mak. 2007 May 2;7:9. doi: 10.1186/1472-6947-7-9.
10
Computational approaches to phenotyping: high-throughput phenomics.表型分析的计算方法:高通量表型组学
Proc Am Thorac Soc. 2007 Jan;4(1):18-25. doi: 10.1513/pats.200607-142JG.

本文引用的文献

1
Putting data integration into practice: using biomedical terminologies to add structure to existing data sources.将数据整合付诸实践:利用生物医学术语为现有数据源增添结构。
AMIA Annu Symp Proc. 2003;2003:125-9.
2
An evaluation of hybrid methods for matching biomedical terminologies: mapping the gene ontology to the UMLS.用于匹配生物医学术语的混合方法评估:将基因本体映射到统一医学语言系统。
Stud Health Technol Inform. 2003;95:62-7.
3
From biological databases to platforms for biomedical discovery.从生物数据库到生物医学发现平台。
Trends Biotechnol. 2003 Jun;21(6):263-8. doi: 10.1016/s0167-7799(03)00108-2.
4
Genetic Models in Applied Physiology. Functional genomics in the mouse: powerful techniques for unraveling the basis of human development and disease.
J Appl Physiol (1985). 2003 Jun;94(6):2502-9. doi: 10.1152/japplphysiol.00209.2003.
5
The human phenome project.人类表型组计划
Nat Genet. 2003 May;34(1):15-21. doi: 10.1038/ng0503-15.
6
The Human Genome Project: lessons from large-scale biology.人类基因组计划:来自大规模生物学的经验教训。
Science. 2003 Apr 11;300(5617):286-90. doi: 10.1126/science.1084564.
7
The genetics and genomics of cancer.癌症的遗传学与基因组学
Nat Genet. 2003 Mar;33 Suppl:238-44. doi: 10.1038/ng1107.
8
Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease.发现人类表型背后的基因型:孟德尔疾病的既往成功案例,复杂疾病的未来研究方法
Nat Genet. 2003 Mar;33 Suppl:228-37. doi: 10.1038/ng1090.
9
Evaluation of ontology merging tools in bioinformatics.生物信息学中本体合并工具的评估
Pac Symp Biocomput. 2003:589-600. doi: 10.1142/9789812776303_0055.
10
Linking biomedical language information and knowledge resources: GO and UMLS.链接生物医学语言信息与知识资源:基因本体论(GO)与统一医学语言系统(UMLS)
Pac Symp Biocomput. 2003:439-50. doi: 10.1142/9789812776303_0041.