We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.AP

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Applications

Title: Clustering US States by Time Series of COVID-19 New Case Counts with Non-negative Matrix Factorization

Abstract: The spreading pattern of COVID-19 differ a lot across the US states under different quarantine measures and reopening policies. We proposed to cluster the US states into distinct communities based on the daily new confirmed case counts via a nonnegative matrix factorization (NMF) followed by a k-means clustering procedure on the coefficients of the NMF basis. A cross-validation method was employed to select the rank of the NMF. Applying the method to the entire study period from March 22 to July 25, we clustered the 49 continental states (including District of Columbia) into 7 groups, two of which contained a single state. To investigate the dynamics of the clustering results over time, the same method was successively applied to the time periods with increment of one week, starting from the period of March 22 to March 28. The results suggested a change point in the clustering in the week starting on May 30, which might be explained by a combined impact of both quarantine measures and reopening policies.
Subjects: Applications (stat.AP)
MSC classes: 62H30, 62H12, 62M10
Cite as: arXiv:2011.14412 [stat.AP]
  (or arXiv:2011.14412v3 [stat.AP] for this version)

Submission history

From: Panpan Zhang [view email]
[v1] Sun, 29 Nov 2020 18:27:02 GMT (232kb,D)
[v2] Sun, 6 Dec 2020 02:07:36 GMT (232kb,D)
[v3] Fri, 15 Jan 2021 05:53:45 GMT (232kb,D)

Link back to: arXiv, form interface, contact.