Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215.
Center for Neurophotonics, Boston University, Boston, Massachusetts 02215.
eNeuro. 2024 Aug 28;11(8). doi: 10.1523/ENEURO.0554-23.2024. Print 2024 Aug.
Recent advances in extracellular electrophysiology now facilitate the recording of spikes from hundreds or thousands of neurons simultaneously. This has necessitated both the development of new computational methods for spike sorting and better methods to determine spike-sorting accuracy. One long-standing method of assessing the false discovery rate (FDR) of spike sorting-the rate at which spikes are assigned to the wrong cluster-has been the rate of interspike interval (ISI) violations. Despite their near ubiquitous usage in spike sorting, our understanding of how exactly ISI violations relate to FDR, as well as best practices for using ISI violations as a quality metric, remains limited. Here, we describe an analytical solution that can be used to predict FDR from the ISI violation rate (ISI). We test this model in silico through Monte Carlo simulation and apply it to publicly available spike-sorted electrophysiology datasets. We find that the relationship between ISI and FDR is highly nonlinear, with additional dependencies on firing frequency, the correlation in activity between neurons, and contaminant neuron count. Predicted median FDRs in public datasets recorded in mice were found to range from 3.1 to 50.0%. We found that stochasticity in the occurrence of ISI violations as well as uncertainty in cluster-specific parameters make it difficult to predict FDR for single clusters with high confidence but that FDR can be estimated accurately across a population of clusters. Our findings will help the growing community of researchers using extracellular electrophysiology assess spike-sorting accuracy in a principled manner.
近年来,细胞外电生理学的进展使得同时记录数百或数千个神经元的尖峰成为可能。这不仅需要开发新的用于尖峰分类的计算方法,还需要更好的方法来确定尖峰分类的准确性。评估尖峰分类错误发现率(FDR)的一种长期方法——即分配给错误集群的尖峰率——一直是尖峰间间隔(ISI)违反率。尽管在尖峰分类中广泛使用了 ISI 违反率,但我们对 ISI 违反率与 FDR 的关系以及将 ISI 违反率用作质量指标的最佳实践的理解仍然有限。在这里,我们描述了一个可以用来从 ISI 违反率(ISI)预测 FDR 的解析解。我们通过蒙特卡罗模拟在计算机上测试了该模型,并将其应用于公开可用的尖峰分类电生理数据集。我们发现,ISI 与 FDR 之间的关系是非线性的,与神经元活动之间的相关性、尖峰发放频率以及污染物神经元计数等因素有关。在小鼠记录的公共数据集的预测中位数 FDR 范围从 3.1 到 50.0%。我们发现,ISI 违反的随机性以及特定于集群的参数的不确定性使得很难对单个集群的 FDR 进行高置信度的预测,但可以准确估计集群群体的 FDR。我们的研究结果将帮助越来越多使用细胞外电生理学的研究人员以有原则的方式评估尖峰分类的准确性。