References & Citations
Statistics > Applications
Title: Clustering US States by Time Series of COVID-19 New Case Counts with Non-negative Matrix Factorization
(Submitted on 29 Nov 2020 (v1), last revised 15 Jan 2021 (this version, v3))
Abstract: The spreading pattern of COVID-19 differ a lot across the US states under different quarantine measures and reopening policies. We proposed to cluster the US states into distinct communities based on the daily new confirmed case counts via a nonnegative matrix factorization (NMF) followed by a k-means clustering procedure on the coefficients of the NMF basis. A cross-validation method was employed to select the rank of the NMF. Applying the method to the entire study period from March 22 to July 25, we clustered the 49 continental states (including District of Columbia) into 7 groups, two of which contained a single state. To investigate the dynamics of the clustering results over time, the same method was successively applied to the time periods with increment of one week, starting from the period of March 22 to March 28. The results suggested a change point in the clustering in the week starting on May 30, which might be explained by a combined impact of both quarantine measures and reopening policies.
Submission history
From: Panpan Zhang [view email][v1] Sun, 29 Nov 2020 18:27:02 GMT (232kb,D)
[v2] Sun, 6 Dec 2020 02:07:36 GMT (232kb,D)
[v3] Fri, 15 Jan 2021 05:53:45 GMT (232kb,D)
Link back to: arXiv, form interface, contact.