• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从科学文献中挖掘骨骼表型描述。

Mining skeletal phenotype descriptions from scientific literature.

机构信息

School of ITEE, The University of Queensland, Australia.

出版信息

PLoS One. 2013;8(2):e55656. doi: 10.1371/journal.pone.0055656. Epub 2013 Feb 8.

DOI:10.1371/journal.pone.0055656
PMID:23409017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3568099/
Abstract

Phenotype descriptions are important for our understanding of genetics, as they enable the computation and analysis of a varied range of issues related to the genetic and developmental bases of correlated characters. The literature contains a wealth of such phenotype descriptions, usually reported as free-text entries, similar to typical clinical summaries. In this paper, we focus on creating and making available an annotated corpus of skeletal phenotype descriptions. In addition, we present and evaluate a hybrid Machine Learning approach for mining phenotype descriptions from free text. Our hybrid approach uses an ensemble of four classifiers and experiments with several aggregation techniques. The best scoring technique achieves an F-1 score of 71.52%, which is close to the state-of-the-art in other domains, where training data exists in abundance. Finally, we discuss the influence of the features chosen for the model on the overall performance of the method.

摘要

表型描述对于我们理解遗传学很重要,因为它们能够计算和分析与相关特征的遗传和发育基础相关的各种问题。文献中包含大量这样的表型描述,通常以自由文本形式报告,类似于典型的临床总结。在本文中,我们专注于创建和提供一个注释的骨骼表型描述语料库。此外,我们还提出并评估了一种从自由文本中挖掘表型描述的混合机器学习方法。我们的混合方法使用了四个分类器的集成,并尝试了几种聚合技术。得分最高的技术达到了 71.52%的 F1 分数,这接近在其他领域的最新水平,在这些领域中,训练数据非常丰富。最后,我们讨论了模型选择的特征对方法整体性能的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7ed/3568099/6f683597f460/pone.0055656.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7ed/3568099/6f683597f460/pone.0055656.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7ed/3568099/6f683597f460/pone.0055656.g001.jpg

相似文献

1
Mining skeletal phenotype descriptions from scientific literature.从科学文献中挖掘骨骼表型描述。
PLoS One. 2013;8(2):e55656. doi: 10.1371/journal.pone.0055656. Epub 2013 Feb 8.
2
Knowledge based word-concept model estimation and refinement for biomedical text mining.用于生物医学文本挖掘的基于知识的词概念模型估计与优化。
J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.
3
Supervised segmentation of phenotype descriptions for the human skeletal phenome using hybrid methods.使用混合方法对人类骨骼表型进行表型描述的监督分割。
BMC Bioinformatics. 2012 Oct 15;13:265. doi: 10.1186/1471-2105-13-265.
4
Learning to recognize phenotype candidates in the auto-immune literature using SVM re-ranking.利用 SVM 重新排序学习识别自身免疫文献中的表型候选物。
PLoS One. 2013 Oct 14;8(10):e72965. doi: 10.1371/journal.pone.0072965. eCollection 2013.
5
Assessment of disease named entity recognition on a corpus of annotated sentences.基于带注释句子语料库的疾病命名实体识别评估。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-9-S3-S3.
6
Portable automatic text classification for adverse drug reaction detection via multi-corpus training.通过多语料库训练实现用于药物不良反应检测的便携式自动文本分类
J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8.
7
Recognition of medication information from discharge summaries using ensembles of classifiers.使用分类器集成识别出院小结中的药物信息。
BMC Med Inform Decis Mak. 2012 May 7;12:36. doi: 10.1186/1472-6947-12-36.
8
Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.从生物医学文献中挖掘基因型-表型关系以用于数据库管理和精准医学。
PLoS Comput Biol. 2016 Nov 30;12(11):e1005017. doi: 10.1371/journal.pcbi.1005017. eCollection 2016 Nov.
9
Discriminative and informative features for biomolecular text mining with ensemble feature selection.基于集成特征选择的生物分子文本挖掘的判别和信息特征。
Bioinformatics. 2010 Sep 15;26(18):i554-60. doi: 10.1093/bioinformatics/btq381.
10
Extraction of temporal relations from clinical free text: A systematic review of current approaches.从临床自由文本中提取时间关系:当前方法的系统评价。
J Biomed Inform. 2020 Aug;108:103488. doi: 10.1016/j.jbi.2020.103488. Epub 2020 Jul 13.

引用本文的文献

1
A new synonym-substitution method to enrich the human phenotype ontology.一种丰富人类表型本体的新同义词替换方法。
BMC Bioinformatics. 2017 Oct 10;18(1):446. doi: 10.1186/s12859-017-1858-7.
2
The digital revolution in phenotyping.表型分析中的数字革命。
Brief Bioinform. 2016 Sep;17(5):819-30. doi: 10.1093/bib/bbv083. Epub 2015 Sep 29.
3
Concept selection for phenotypes and diseases using learn to rank.使用排序学习法进行表型和疾病的概念选择。

本文引用的文献

1
Open semantic annotation of scientific publications using DOMEO.使用DOMEO对科学出版物进行开放语义标注。
J Biomed Semantics. 2012 Apr 24;3 Suppl 1(Suppl 1):S1. doi: 10.1186/2041-1480-3-S1-S1.
2
Mouse genetic and phenotypic resources for human genetics.用于人类遗传学的小鼠遗传和表型资源。
Hum Mutat. 2012 May;33(5):826-36. doi: 10.1002/humu.22077.
3
Standard terminology for phenotypic variations: the elements of morphology project, its current progress, and future directions.表型变异的标准术语:形态学项目的要素、当前进展及未来方向。
J Biomed Semantics. 2015 Jun 1;6:24. doi: 10.1186/s13326-015-0019-z. eCollection 2015.
4
Automatic concept recognition using the human phenotype ontology reference and test suite corpora.使用人类表型本体参考和测试套件语料库进行自动概念识别。
Database (Oxford). 2015 Feb 27;2015. doi: 10.1093/database/bav005. Print 2015.
5
Toward knowledge support for analysis and interpretation of complex traits.迈向对复杂性状分析和解释的知识支持。
Genome Biol. 2013;14(9):214. doi: 10.1186/gb-2013-14-9-214.
Hum Mutat. 2012 May;33(5):781-6. doi: 10.1002/humu.22053. Epub 2012 Apr 13.
4
Boosting performance of gene mention tagging system by hybrid methods.通过混合方法提高基因提及标记系统的性能。
J Biomed Inform. 2012 Feb;45(1):156-64. doi: 10.1016/j.jbi.2011.10.004. Epub 2011 Oct 28.
5
PhenomeNET: a whole-phenome approach to disease gene discovery.表型网络(PhenomeNET):一种全表型方法用于疾病基因发现。
Nucleic Acids Res. 2011 Oct;39(18):e119. doi: 10.1093/nar/gkr538. Epub 2011 Jul 6.
6
Entity/quality-based logical definitions for the human skeletal phenome using PATO.使用PATO对人类骨骼表型组进行基于实体/特征的逻辑定义。
Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:7069-72. doi: 10.1109/IEMBS.2009.5333362.
7
Linking human diseases to animal models using ontology-based phenotype annotation.利用基于本体的表型注释将人类疾病与动物模型联系起来。
PLoS Biol. 2009 Nov;7(11):e1000247. doi: 10.1371/journal.pbio.1000247. Epub 2009 Nov 24.
8
Clinical diagnostics in human genetics with semantic similarity searches in ontologies.基于本体中语义相似性搜索的人类遗传学临床诊断
Am J Hum Genet. 2009 Oct;85(4):457-64. doi: 10.1016/j.ajhg.2009.09.003.
9
The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.人类表型本体论:一种用于注释和分析人类遗传病的工具。
Am J Hum Genet. 2008 Nov;83(5):610-5. doi: 10.1016/j.ajhg.2008.09.017. Epub 2008 Oct 23.
10
Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature.利用基于词典的生物实体名称识别在生物医学文献中的性能。
Comput Biol Chem. 2008 Aug;32(4):287-91. doi: 10.1016/j.compbiolchem.2008.03.008. Epub 2008 Apr 1.