Phillips Sophie, Schoenberg Frederic
Department of Statistics, UCLA, Los Angeles, CA, USA.
J Appl Stat. 2024 Nov 12;52(7):1405-1422. doi: 10.1080/02664763.2024.2426019. eCollection 2025.
Several approaches to estimating the productivity function for a Hawkes point process with variable productivity are discussed, improved upon, and compared in terms of their root-mean-squared error and computational efficiency for various data sizes, and for binned as well as unbinned data. We find that for unbinned data, a regularized version of the analytic maximum likelihood estimator proposed by Schoenberg is the most accurate but is computationally burdensome. The unregularized version of the estimator is faster to compute but has lower accuracy, though both estimators outperform empirical or binned least squares estimators in terms of root-mean-squared error, especially when the mean productivity is 0.2 or greater. For binned data, binned least squares estimates are highly efficient both in terms of computation time and root-mean-squared error. An extension to estimating transmission time density is discussed, and an application to estimating the productivity of Covid-19 in the United States as a function of time from January 2020 to July 2022 is provided.
本文讨论了几种用于估计具有可变生产率的霍克斯点过程生产率函数的方法,并在不同数据规模以及分箱和未分箱数据的情况下,根据均方根误差和计算效率对这些方法进行了改进和比较。我们发现,对于未分箱数据,Schoenberg提出的解析最大似然估计器的正则化版本最为准确,但计算量较大。该估计器的非正则化版本计算速度更快,但准确性较低,不过在均方根误差方面,这两种估计器均优于经验或分箱最小二乘估计器,尤其是当平均生产率为0.2或更高时。对于分箱数据,分箱最小二乘估计在计算时间和均方根误差方面都非常高效。本文还讨论了估计传播时间密度的扩展,并提供了一个应用示例,用于估计2020年1月至2022年7月期间美国新冠疫情生产率随时间的变化情况。