Zhou Shally, Brady Brooke, Anstey Kaarin J
School of Psychology, University of New South Wales, Sydney, Australia.
UNSW Ageing Futures Institute, Sydney, Australia.
Behav Res Methods. 2025 Jan 22;57(2):69. doi: 10.3758/s13428-024-02583-1.
With recent technical advances, many cognitive and sensory tasks have been adapted for smartphone testing. This study aimed to assess the criterion validity of a subset of self-administered, open-source, app-based cognitive and sensory tasks by comparing test performance with lab-based alternatives. Forty-three participants (aged 21 to 82) from the larger Labs without Walls project (Brady et al., 2023) completed an in-person baseline session in which the self-administered, app-based tasks were compared with researcher-administered equivalents. Four preset tasks sourced from Apple's ResearchKit (Spatial Memory, Trail Making Test, Stroop Test, and dBHL Tone Audiometry) and one custom-built task (Ishihara Color Deficiency Test) were compared. All tasks except the Spatial Memory task demonstrated high comparability to the researcher-administered version. Specifically, the Trail Making Tests were strongly correlated (r = .77 and .78 for parts A and B, respectively), Stroop correlations ranged from .77 to .89, and the Ishihara tasks were moderately correlated (r = .69). Intraclass correlation coefficients (ICCs) for the Audiometry task ranged from .56 to .96 (moderate to excellent), with 83% sensitivity and 100% specificity. Bland-Altman plots revealed a mean bias ranging from -5.35 to 9.67 dB across ears and frequencies, with overall biases of 3.02 dB and 1.98 dB for the left and right ears, respectively, both within the minimum testing interval. Furthermore, all app-based tasks were significantly correlated with age. These results offer preliminary evidence of the validity of four open-source cognitive and sensory tasks, with implications for effective remote testing in non-lab settings.
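The agreement statistics reported for the Audiometry task (Bland-Altman mean bias, sensitivity, and specificity) can be illustrated with a minimal sketch. This is not the authors' analysis code. The function names, the app-minus-lab direction of the difference, the conventional 1.96 SD limits of agreement, and all data values below are assumptions chosen purely for illustration.

```python
from statistics import mean, stdev


def bland_altman(app, lab):
    """Return (bias, lower_loa, upper_loa) for paired measurements.

    Bias is the mean of (app - lab); the sign convention is an assumption.
    Limits of agreement use the conventional bias +/- 1.96 * SD of the differences.
    """
    if len(app) != len(lab) or len(app) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [a - b for a, b in zip(app, lab)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the paired differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def sensitivity_specificity(app_flags, lab_flags):
    """Treat the lab result as the reference standard for a binary outcome."""
    pairs = list(zip(app_flags, lab_flags))
    tp = sum(1 for a, l in pairs if a and l)          # both flag a loss
    fn = sum(1 for a, l in pairs if not a and l)      # app misses a loss
    tn = sum(1 for a, l in pairs if not a and not l)  # both report normal
    fp = sum(1 for a, l in pairs if a and not l)      # app over-flags
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("reference standard needs both positive and negative cases")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical thresholds in dB HL for five participants (not study data).
app_db = [25, 30, 15, 40, 20]
lab_db = [20, 30, 10, 35, 20]
bias, lower, upper = bland_altman(app_db, lab_db)

# Hypothetical hearing-loss classifications (not study data).
app_loss = [True, True, False, False, False, True]
lab_loss = [True, True, True, False, False, False]
sens, spec = sensitivity_specificity(app_loss, lab_loss)
```

A positive bias under this sign convention would mean the app reports higher thresholds than the lab measurement. The abstract does not state which direction the authors used.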