Vincent Jack P, Economo Michael N
Department of Biomedical Engineering, Boston University, Boston, MA.
Center for Neurophotonics, Boston University, Boston, MA.
bioRxiv. 2023 Dec 23:2023.12.21.572882. doi: 10.1101/2023.12.21.572882.
Recent advances in extracellular electrophysiology now facilitate the recording of spikes from hundreds or thousands of neurons simultaneously. This has necessitated both the development of new computational methods for spike sorting and better methods to determine spike sorting accuracy. One longstanding method of assessing the false discovery rate (FDR) of spike sorting - the rate at which spikes are misassigned to the wrong cluster - has been the rate of inter-spike-interval (ISI) violations. Despite their near ubiquitous usage in spike sorting, our understanding of how exactly ISI violations relate to FDR, as well as best practices for using ISI violations as a quality metric, remain limited. Here, we describe an analytical solution that can be used to predict FDR from ISI violation rate. We test this model in silico through Monte Carlo simulation, and apply it to publicly available spike-sorted electrophysiology datasets. We find that the relationship between ISI violation rate and FDR is highly nonlinear, with additional dependencies on firing rate, the correlation in activity between neurons, and contaminant neuron count. Predicted median FDRs in public datasets were found to range from 3.1% to 50.0%. We find that stochasticity in the occurrence of ISI violations as well as uncertainty in cluster-specific parameters make it difficult to predict FDR for single clusters with high confidence, but that FDR can be estimated accurately across a population of clusters. Our findings will help the growing community of researchers using extracellular electrophysiology assess spike sorting accuracy in a principled manner.
细胞外电生理学的最新进展使得同时记录数百或数千个神经元的尖峰成为可能。这就需要开发新的尖峰分类计算方法以及更好地确定尖峰分类准确性的方法。一种长期用于评估尖峰分类错误发现率(FDR)(即尖峰被错误分配到错误簇的速率)的方法是尖峰间隔(ISI)违反率。尽管ISI违反率在尖峰分类中几乎被普遍使用,但我们对ISI违反率与FDR的确切关系以及将ISI违反率用作质量指标的最佳实践的理解仍然有限。在这里,我们描述了一种解析解,可用于根据ISI违反率预测FDR。我们通过蒙特卡罗模拟在计算机上对该模型进行测试,并将其应用于公开可用的尖峰分类电生理数据集。我们发现ISI违反率与FDR之间的关系高度非线性,还依赖于放电率、神经元之间活动的相关性以及污染神经元数量。在公共数据集中预测的中位数FDR范围为3.1%至50.0%。我们发现ISI违反发生的随机性以及特定簇参数的不确定性使得难以高度自信地预测单个簇的FDR,但可以准确估计整个簇群体的FDR。我们的研究结果将帮助越来越多使用细胞外电生理学的研究人员以有原则的方式评估尖峰分类的准确性。