• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习工具和算法分析果蝇形态计量学。

Leveraging machine learning tools and algorithms for analysis of fruit fly morphometrics.

机构信息

International Centre of Insect Physiology and Ecology (icipe), P.O. Box 30772-00100, Nairobi, Kenya.

Department of Statistics, Jomo Kenyatta University of Agriculture and Technology, P.O. Box 62000-00200, Nairobi, Kenya.

出版信息

Sci Rep. 2022 May 3;12(1):7208. doi: 10.1038/s41598-022-11258-w.

DOI:10.1038/s41598-022-11258-w
PMID:35505067
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9065030/
Abstract

Analysis of landmark-based morphometric measurements taken on body parts of insects have been a useful taxonomic approach alongside DNA barcoding in insect identification. Statistical analysis of morphometrics have largely been dominated by traditional methods and approaches such as principal component analysis (PCA), canonical variate analysis (CVA) and discriminant analysis (DA). However, advancement in computing power creates a paradigm shift to apply modern tools such as machine learning. Herein, we assess the predictive performance of four machine learning classifiers; K-nearest neighbor (KNN), random forest (RF), support vector machine (the linear, polynomial and radial kernel SVMs) and artificial neural network (ANNs) on fruit fly morphometrics that were previously analysed using PCA and CVA. KNN and RF performed poorly with overall model accuracy lower than "no-information rate" (NIR) (p value > 0.1). The SVM models had a predictive accuracy of > 95%, significantly higher than NIR (p < 0.001), Kappa > 0.78 and area under curve (AUC) of the receiver operating characteristics was > 0.91; while ANN model had a predictive accuracy of 96%, significantly higher than NIR, Kappa of 0.83 and AUC was 0.98. Wing veins 2, 3, 8, 10, 14 and tibia length were of higher importance than other variables based on both SVM and ANN models. We conclude that SVM and ANN models could be used to discriminate fruit fly species based on wing vein and tibia length measurements or any other morphologically similar pest taxa. These algorithms could be used as candidates for developing an integrated and smart application software for insect discrimination and identification. Variable importance analysis results in this study would be useful for future studies for deciding what must be measured.

摘要

基于昆虫身体部位的地标形态测量分析,结合 DNA 条码技术,已成为昆虫鉴定的一种有用的分类方法。形态计量学的统计分析主要由传统方法和方法主导,如主成分分析(PCA)、典范变量分析(CVA)和判别分析(DA)。然而,计算能力的进步使得应用现代工具(如机器学习)成为可能。在这里,我们评估了四种机器学习分类器的预测性能;K 近邻(KNN)、随机森林(RF)、支持向量机(线性、多项式和径向核 SVM)和人工神经网络(ANNs),它们之前用于分析 PCA 和 CVA 分析的果蝇形态计量学。KNN 和 RF 的整体模型精度低于“无信息率”(NIR)(p 值>0.1),表现不佳。SVM 模型的预测精度>95%,显著高于 NIR(p<0.001),Kappa>0.78,接收者操作特征曲线下面积(AUC)>0.91;而 ANN 模型的预测精度为 96%,显著高于 NIR,Kappa 为 0.83,AUC 为 0.98。基于 SVM 和 ANN 模型,第二、三、八、十、十四翅脉和胫骨长度比其他变量更为重要。我们得出结论,SVM 和 ANN 模型可用于基于翅脉和胫骨长度测量或任何其他形态相似的害虫分类来区分果蝇种类。这些算法可以作为开发昆虫识别和鉴定的集成和智能应用软件的候选算法。本研究的变量重要性分析结果将有助于未来的研究,以决定必须测量哪些变量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/79edbbfe860d/41598_2022_11258_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/832ce18e31a6/41598_2022_11258_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/bd89a6bae014/41598_2022_11258_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/524828f4db7f/41598_2022_11258_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/5fa2c16812ca/41598_2022_11258_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/79edbbfe860d/41598_2022_11258_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/832ce18e31a6/41598_2022_11258_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/bd89a6bae014/41598_2022_11258_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/524828f4db7f/41598_2022_11258_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/5fa2c16812ca/41598_2022_11258_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd69/9065030/79edbbfe860d/41598_2022_11258_Fig5_HTML.jpg

相似文献

1
Leveraging machine learning tools and algorithms for analysis of fruit fly morphometrics.利用机器学习工具和算法分析果蝇形态计量学。
Sci Rep. 2022 May 3;12(1):7208. doi: 10.1038/s41598-022-11258-w.
2
Geometric morphometrics and machine learning as tools for the identification of sibling mosquito species of the Maculipennis complex (Anopheles).几何形态测量学和机器学习作为鉴定库蚊复合体(按蚊属)中同胞蚊种的工具
Infect Genet Evol. 2021 Nov;95:105034. doi: 10.1016/j.meegid.2021.105034. Epub 2021 Aug 9.
3
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.
4
Can machine learning predict pharmacotherapy outcomes? An application study in osteoporosis.机器学习能预测药物治疗效果吗?一项在骨质疏松症中的应用研究。
Comput Methods Programs Biomed. 2022 Oct;225:107028. doi: 10.1016/j.cmpb.2022.107028. Epub 2022 Jul 21.
5
Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer.基于机器学习的放射组学模型预测局部进展期胃癌网膜转移能力的比较评估。
Sci Rep. 2024 Jul 13;14(1):16208. doi: 10.1038/s41598-024-66979-x.
6
[Rapid identification of geographic origins of Zingiberis Rhizoma by NIRS combined with chemometrics and machine learning algorithms].近红外光谱结合化学计量学和机器学习算法快速鉴定干姜的地理来源
Zhongguo Zhong Yao Za Zhi. 2022 Sep;47(17):4583-4592. doi: 10.19540/j.cnki.cjcmm.20220514.103.
7
Classifying Alzheimer's disease and normal subjects using machine learning techniques and genetic-environmental features.使用机器学习技术和遗传-环境特征对阿尔茨海默病和正常受试者进行分类。
J Formos Med Assoc. 2024 Jun;123(6):701-709. doi: 10.1016/j.jfma.2023.10.021. Epub 2023 Dec 2.
8
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
9
Raman spectroscopy combined with machine learning algorithms for rapid detection Primary Sjögren's syndrome associated with interstitial lung disease.拉曼光谱结合机器学习算法用于快速检测与间质性肺病相关的原发性干燥综合征。
Photodiagnosis Photodyn Ther. 2022 Dec;40:103057. doi: 10.1016/j.pdpdt.2022.103057. Epub 2022 Aug 6.
10
[Construction of a predictive model for in-hospital mortality of sepsis patients in intensive care unit based on machine learning].基于机器学习构建重症监护病房脓毒症患者院内死亡率预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Jul;35(7):696-701. doi: 10.3760/cma.j.cn121430-20221219-01104.

引用本文的文献

1
Opportunities and Challenges in Applying AI to Evolutionary Morphology.将人工智能应用于进化形态学的机遇与挑战。
Integr Org Biol. 2024 Sep 23;6(1):obae036. doi: 10.1093/iob/obae036. eCollection 2024.
2
Assessment of Body Morphometry to Classify Two Colombian Creole Pigs Using Statistical and Machine Learning Methods.使用统计和机器学习方法评估身体形态测量学以对两种哥伦比亚克里奥尔猪进行分类
Life (Basel). 2025 Apr 24;15(5):693. doi: 10.3390/life15050693.
3
Data Augmentation and Machine Learning algorithms for multi-class imbalanced morphometrics data of stingless bees.

本文引用的文献

1
Derivation and Validation of a Simplified Clinical Prediction Rule for Identifying Children at Increased Risk for Clinically Important Traumatic Brain Injuries Following Minor Blunt Head Trauma.一种用于识别轻度钝性头部创伤后有发生具有临床重要性创伤性脑损伤高风险儿童的简化临床预测规则的推导与验证
J Pediatr X. 2020 Apr 22;3:100026. doi: 10.1016/j.ympdx.2020.100026. eCollection 2020 Summer.
2
Artificial Neural Network: Understanding the Basic Concepts without Mathematics.人工神经网络:无需数学知识理解基本概念
Dement Neurocogn Disord. 2018 Sep;17(3):83-89. doi: 10.12779/dnd.2018.17.3.83. Epub 2018 Dec 13.
3
Wing morphometrics as a tool in species identification of forensically important blow flies of Thailand.
用于无刺蜂多类不平衡形态测量数据的数据增强和机器学习算法
Heliyon. 2025 Jan 23;11(3):e42214. doi: 10.1016/j.heliyon.2025.e42214. eCollection 2025 Feb 15.
4
Classifying forensically important flies using deep learning to support pathologists and rescue teams during forensic investigations.利用深度学习对具有法医学重要性的苍蝇进行分类,以便在法医调查期间为病理学家和救援队提供支持。
PLoS One. 2024 Dec 5;19(12):e0314533. doi: 10.1371/journal.pone.0314533. eCollection 2024.
5
Enhancing mosquito classification through self-supervised learning.通过自监督学习增强蚊子分类。
Sci Rep. 2024 Nov 7;14(1):27123. doi: 10.1038/s41598-024-78260-2.
6
Spatio-temporal characterization of phenotypic resistance in malaria vector species.疟疾病媒物种表型抗性的时空特征。
BMC Biol. 2024 May 20;22(1):117. doi: 10.1186/s12915-024-01915-z.
7
Factors influencing fruit cracking: an environmental and agronomic perspective.影响果实裂果的因素:环境与农艺学视角
Front Plant Sci. 2024 Feb 16;15:1343452. doi: 10.3389/fpls.2024.1343452. eCollection 2024.
翅形态测量学作为泰国法医重要丽蝇物种鉴定的工具。
Parasit Vectors. 2017 May 10;10(1):229. doi: 10.1186/s13071-017-2163-z.
4
Evolution of wing shape in hornets: why is the wing venation efficient for species identification?黄蜂翅膀形状的演变:为何翅脉对物种识别有效?
J Evol Biol. 2014 Dec;27(12):2665-75. doi: 10.1111/jeb.12523. Epub 2014 Oct 27.
5
Morphometrical diagnosis of the malaria vectors Anopheles cruzii, An. homunculus and An. bellator.疟蚊属中克鲁兹按蚊、趋人按蚊和凶猛按蚊的形态学诊断。
Parasit Vectors. 2012 Nov 13;5:257. doi: 10.1186/1756-3305-5-257.
6
Taxonomic identity of the invasive fruit fly pest, Bactrocera invadens: concordance in morphometry and DNA barcoding.入侵果实蝇害虫 Bactrocera invadens 的分类学身份:形态计量学和 DNA 条码的一致性。
PLoS One. 2012;7(9):e44862. doi: 10.1371/journal.pone.0044862. Epub 2012 Sep 17.
7
Wing morphometry as a tool for correct identification of primary and secondary New World screwworm fly.翅形态测量作为正确鉴定新大陆螺旋蝇(初级和次级)的工具。
Bull Entomol Res. 2010 Feb;100(1):19-26. doi: 10.1017/S0007485309006762. Epub 2009 Mar 23.
8
Artificial neural networks. Methods and applications. Preface.人工神经网络。方法与应用。前言。
Methods Mol Biol. 2008;458:v.
9
Comparison of five allopatric fruit fly parasitoid populations (Psyttalia species) (Hymenoptera: Braconidae) from coffee fields using morphometric and molecular methods.使用形态测量和分子方法对来自咖啡种植园的五个异域果蝇寄生蜂种群(潜蝇茧蜂属物种)(膜翅目:茧蜂科)进行比较。
Bull Entomol Res. 2008 Feb;98(1):63-75. doi: 10.1017/S000748530700541X. Epub 2007 Dec 13.
10
What is a support vector machine?什么是支持向量机?
Nat Biotechnol. 2006 Dec;24(12):1565-7. doi: 10.1038/nbt1206-1565.