使用数据挖掘进行自动化3D表型分析。

Automated 3D phenotype analysis using data mining.

作者信息

Plyusnin Ilya, Evans Alistair R, Karme Aleksis, Gionis Aristides, Jernvall Jukka

机构信息

Institute of Biotechnology, University of Helsinki, Helsinki, Finland.

出版信息

PLoS One. 2008 Mar 5;3(3):e1742. doi: 10.1371/journal.pone.0001742.

DOI:10.1371/journal.pone.0001742

PMID:18320060

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2254194/

Abstract

The ability to analyze and classify three-dimensional (3D) biological morphology has lagged behind the analysis of other biological data types such as gene sequences. Here, we introduce the techniques of data mining to the study of 3D biological shapes to bring the analyses of phenomes closer to the efficiency of studying genomes. We compiled five training sets of highly variable morphologies of mammalian teeth from the MorphoBrowser database. Samples were labeled either by dietary class or by conventional dental types (e.g. carnassial, selenodont). We automatically extracted a multitude of topological attributes using Geographic Information Systems (GIS)-like procedures that were then used in several combinations of feature selection schemes and probabilistic classification models to build and optimize classifiers for predicting the labels of the training sets. In terms of classification accuracy, computational time and size of the feature sets used, non-repeated best-first search combined with 1-nearest neighbor classifier was the best approach. However, several other classification models combined with the same searching scheme proved practical. The current study represents a first step in the automatic analysis of 3D phenotypes, which will be increasingly valuable with the future increase in 3D morphology and phenomics databases.

摘要

对三维（3D）生物形态进行分析和分类的能力，落后于对其他生物数据类型（如基因序列）的分析。在此，我们将数据挖掘技术引入到3D生物形状的研究中，以使表型分析更接近基因组研究的效率。我们从MorphoBrowser数据库中汇编了五组具有高度可变形态的哺乳动物牙齿训练集。样本根据饮食类别或传统牙齿类型（如裂齿、月型齿）进行标记。我们使用类似地理信息系统（GIS）的程序自动提取了大量拓扑属性，然后将这些属性用于多种特征选择方案和概率分类模型的组合中，以构建和优化用于预测训练集标签的分类器。在分类准确率、计算时间和所用特征集的大小方面，非重复最佳优先搜索与1-最近邻分类器相结合是最佳方法。然而，其他几种与相同搜索方案相结合的分类模型也被证明是可行的。当前的研究代表了3D表型自动分析的第一步，随着未来3D形态学和表型组学数据库的增加，这将变得越来越有价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9876/2254194/f9beda28b7db/pone.0001742.g001.jpg

相似文献

Automated 3D phenotype analysis using data mining.使用数据挖掘进行自动化3D表型分析。

PLoS One. 2008 Mar 5;3(3):e1742. doi: 10.1371/journal.pone.0001742.

Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings.

IEEE Trans Pattern Anal Mach Intell. 2004 Nov;26(11):1395-407. doi: 10.1109/tpami.2004.104.

Effect of finite sample size on feature selection and classification: a simulation study.有限样本大小对特征选择和分类的影响：一项模拟研究。

Med Phys. 2010 Feb;37(2):907-20. doi: 10.1118/1.3284974.

Automatic SNOMED classification--a corpus-based method.自动SNOMED分类——一种基于语料库的方法。

Comput Methods Programs Biomed. 1997 Sep;54(1-2):115-22. doi: 10.1016/s0169-2607(97)00040-0.

Evaluating feature selection strategies for high dimensional, small sample size datasets.评估高维小样本数据集的特征选择策略。

Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:949-52. doi: 10.1109/IEMBS.2011.6090214.

Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类

BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.

An Extensive Empirical Comparison of Probabilistic Hierarchical Classifiers in Datasets of Ageing-Related Genes.衰老相关基因数据集中概率分层分类器的广泛实证比较

IEEE/ACM Trans Comput Biol Bioinform. 2016 Nov-Dec;13(6):1045-1058. doi: 10.1109/TCBB.2015.2505288. Epub 2015 Dec 3.

Enhancing prototype reduction schemes with recursion: a method applicable for "large" data sets.

IEEE Trans Syst Man Cybern B Cybern. 2004 Jun;34(3):1384-97. doi: 10.1109/tsmcb.2004.824524.

EvDTree: structure-dependent substitution profiles based on decision tree classification of 3D environments.EvDTree：基于三维环境决策树分类的结构相关替代概况

BMC Bioinformatics. 2005 Jan 10;6:4. doi: 10.1186/1471-2105-6-4.

Is cross-validation better than resubstitution for ranking genes?在对基因进行排名时，交叉验证是否比重替代法更好？

Bioinformatics. 2004 Jan 22;20(2):253-8. doi: 10.1093/bioinformatics/btg399.

引用本文的文献

Experimental assessment of diffusible iodine-based contrast-enhanced computed tomography (diceCT) protocols.基于弥散碘的对比增强计算机断层扫描（diceCT）方案的实验评估。

PeerJ. 2024 Sep 5;12:e17919. doi: 10.7717/peerj.17919. eCollection 2024.

Digital image processing: A new tool for morphological measurements of freshwater turtles under rehabilitation.数字图像处理：一种用于康复中淡水龟形态测量的新工具。

PLoS One. 2024 Mar 14;19(3):e0300253. doi: 10.1371/journal.pone.0300253. eCollection 2024.

Dental complexity and diet in amniotes: A meta-analysis.羊膜动物的牙齿复杂性与饮食：一项荟萃分析。

PLoS One. 2024 Feb 2;19(2):e0292358. doi: 10.1371/journal.pone.0292358. eCollection 2024.

Mapping molar shapes on signaling pathways.在信号通路中描绘摩尔形状。

PLoS Comput Biol. 2020 Dec 14;16(12):e1008436. doi: 10.1371/journal.pcbi.1008436. eCollection 2020 Dec.

Repeatability, reproducibility and consistency of horse shape data and its association with linearly described conformation traits in Franches-Montagnes stallions.重复力、可重复性和一致性的马体型数据及其与弗朗什-蒙塔涅种公马线性描述的体貌特征的关系。

PLoS One. 2018 Aug 27;13(8):e0202931. doi: 10.1371/journal.pone.0202931. eCollection 2018.

MorphoTester: An Open Source Application for Morphological Topographic Analysis.形态测试仪：一款用于形态地形分析的开源应用程序。

PLoS One. 2016 Feb 3;11(2):e0147649. doi: 10.1371/journal.pone.0147649. eCollection 2016.

Prospective in (Primate) dental analysis through tooth 3D topographical quantification.通过牙齿三维形貌量化进行灵长类前瞻性牙分析。

PLoS One. 2013 Jun 24;8(6):e66142. doi: 10.1371/journal.pone.0066142. Print 2013.

Three-dimensional analysis of tooth dimensions in the MSX1-missense mutation.MSX1 错义突变中牙齿尺寸的三维分析。

Clin Oral Investig. 2013 Jun;17(5):1437-45. doi: 10.1007/s00784-012-0828-8. Epub 2012 Aug 31.

The Digital Fish Library: using MRI to digitize, database, and document the morphological diversity of fish.数字化鱼类文库：利用 MRI 技术对鱼类的形态多样性进行数字化、数据库化和文献记录。

PLoS One. 2012;7(4):e34499. doi: 10.1371/journal.pone.0034499. Epub 2012 Apr 6.

Modeling three-dimensional morphological structures using spherical harmonics.使用球谐函数对三维形态结构进行建模。

Evolution. 2009 Apr;63(4):1003-16. doi: 10.1111/j.1558-5646.2008.00557.x. Epub 2009 Oct 17.

本文引用的文献

Nature. 2007 Jan 4;445(7123):78-81. doi: 10.1038/nature05433. Epub 2006 Dec 13.

Dental topography and diets of Australopithecus afarensis and early Homo.阿法南方古猿和早期人类的牙齿形态与饮食

J Hum Evol. 2004 May;46(5):605-22. doi: 10.1016/j.jhevol.2004.03.004.

How different types of pattern formation mechanisms affect the evolution of form and development.不同类型的模式形成机制如何影响形态的进化与发育。

Evol Dev. 2004 Jan-Feb;6(1):6-16. doi: 10.1111/j.1525-142x.2004.04002.x.

Confocal imaging, visualization and 3-D surface measurement of small mammalian teeth.小型哺乳动物牙齿的共聚焦成像、可视化及三维表面测量

J Microsc. 2001 Nov;204(Pt 2):108-18. doi: 10.1046/j.1365-2818.2001.00939.x.

The Protein Data Bank.蛋白质数据库。

Nucleic Acids Res. 2000 Jan 1;28(1):235-42. doi: 10.1093/nar/28.1.235.

Technical note: Modeling primate occlusal topography using geographic information systems technology.技术说明：使用地理信息系统技术对灵长类动物咬合面地形进行建模

Am J Phys Anthropol. 1998 Sep;107(1):137-42. doi: 10.1002/(SICI)1096-8644(199809)107:1<137::AID-AJPA11>3.0.CO;2-1.

Molar tooth diversity, disparity, and ecology in Cenozoic ungulate radiations.新生代有蹄类动物辐射演化中的臼齿多样性、差异及生态学

Science. 1996 Nov 29;274(5292):1489-92. doi: 10.1126/science.274.5292.1489.

Evolution and mammalian dental morphology.进化与哺乳动物牙齿形态

J Biol Buccale. 1983 Dec;11(4):285-302.

The functional adaptations of primate molar teeth.灵长类动物臼齿的功能适应性

Am J Phys Anthropol. 1975 Sep;43(2):195-216. doi: 10.1002/ajpa.1330430207.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用数据挖掘进行自动化3D表型分析。

Automated 3D phenotype analysis using data mining.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献