References & Citations
Mathematics > Statistics Theory
Title: Ultrametric embedding: application to data fingerprinting and to fast data clustering
(Submitted on 19 May 2006 (v1), last revised 28 Jan 2007 (this version, v2))
Abstract: We begin with pervasive ultrametricity due to high dimensionality and/or spatial sparsity. How extent or degree of ultrametricity can be quantified leads us to the discussion of varied practical cases when ultrametricity can be partially or locally present in data. We show how the ultrametricity can be assessed in text or document collections, and in time series signals. An aspect of importance here is that to draw benefit from this perspective the data may need to be recoded. Such data recoding can also be powerful in proximity searching, as we will show, where the data is embedded globally and not locally in an ultrametric space.
Submission history
From: Fionn Murtagh [view email][v1] Fri, 19 May 2006 22:28:18 GMT (19kb)
[v2] Sun, 28 Jan 2007 10:57:40 GMT (28kb)
Link back to: arXiv, form interface, contact.