• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于缺失数据插补的新型排序最近邻算法。

A novel ranked -nearest neighbors algorithm for missing data imputation.

作者信息

Khan Yasir, Shah Said Farooq, Asim Syed Muhammad

机构信息

Government College of Management Sciences Jamrud, Jamrud, KP, Pakistan.

Department of Statistics, University of Peshawar, Peshawar, KP, Pakistan.

出版信息

J Appl Stat. 2024 Oct 11;52(5):1103-1127. doi: 10.1080/02664763.2024.2414357. eCollection 2025.

DOI:10.1080/02664763.2024.2414357
PMID:40160485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11951327/
Abstract

Missing data is a common problem in many domains that rely on data analysis. The Nearest Neighbors imputation method has been widely used to address this issue, but it has limitations in accurately imputing missing values, especially for datasets with small pairwise correlations and small values of . In this study, we proposed a method, Ranked Nearest Neighbors imputation that uses a similar approach to Nearest Neighbor, but utilizing the concept of Ranked set sampling to select the most relevant neighbors for imputation. Our results show that the proposed method outperforms the standard nearest neighbor method in terms of imputation accuracy both in case of Missing Completely at Random and Missing at Random mechanism, as demonstrated by consistently lower MSIE and MAIE values across all datasets. This suggests that the proposed method is a promising alternative for imputing missing values in datasets with small pairwise correlations and small values of . Thus, the proposed Ranked Nearest Neighbor method has important implications for data imputation in various domains and can contribute to the development of more efficient and accurate imputation methods without adding any computational complexity to an algorithm.

摘要

缺失数据是许多依赖数据分析的领域中常见的问题。最近邻插补方法已被广泛用于解决这一问题,但它在准确插补缺失值方面存在局限性,特别是对于成对相关性较小且[此处原文缺失具体内容]值较小的数据集。在本研究中,我们提出了一种排序最近邻插补方法,该方法采用与最近邻类似的方法,但利用排序集抽样的概念来选择最相关的邻居进行插补。我们的结果表明,在完全随机缺失和随机缺失机制的情况下,所提出的方法在插补准确性方面优于标准最近邻方法,这体现在所有数据集上的MSIE和MAIE值始终较低。这表明所提出的方法是插补成对相关性较小且[此处原文缺失具体内容]值较小的数据集缺失值的一种有前途的替代方法。因此,所提出的排序最近邻方法对各领域的数据插补具有重要意义,并且可以在不增加算法计算复杂度的情况下,为开发更高效、准确的插补方法做出贡献。

相似文献

1
A novel ranked -nearest neighbors algorithm for missing data imputation.一种用于缺失数据插补的新型排序最近邻算法。
J Appl Stat. 2024 Oct 11;52(5):1103-1127. doi: 10.1080/02664763.2024.2414357. eCollection 2025.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
3
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
4
Long-acting inhaled therapy (beta-agonists, anticholinergics and steroids) for COPD: a network meta-analysis.慢性阻塞性肺疾病的长效吸入疗法(β受体激动剂、抗胆碱能药物和类固醇):一项网状荟萃分析。
Cochrane Database Syst Rev. 2014 Mar 26;2014(3):CD010844. doi: 10.1002/14651858.CD010844.pub2.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
9
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
10
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

本文引用的文献

1
Comparison of imputation methods for missing laboratory data in medicine.医学中缺失实验室数据的插补方法比较。
BMJ Open. 2013 Aug 1;3(8):e002847. doi: 10.1136/bmjopen-2013-002847.
2
Missing value estimation methods for DNA microarrays.DNA微阵列的缺失值估计方法。
Bioinformatics. 2001 Jun;17(6):520-5. doi: 10.1093/bioinformatics/17.6.520.
3
'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns.“基因消减”作为一种识别具有相似表达模式的不同基因集的方法。
Genome Biol. 2000;1(2):RESEARCH0003. doi: 10.1186/gb-2000-1-2-research0003. Epub 2000 Aug 4.