Clustering methods based on variational analysis in the space of measures

M.N.M. Van Lieshout, I.S. Molchanov, S.A. Zuev

    Research output: Contribution to journal › Article › peer-review


    Abstract

    We formulate clustering as a minimisation problem in the space of measures by modelling the cluster centres as a Poisson process with unknown intensity function. We derive a Ward-type clustering criterion which, under the Poisson assumption, can easily be evaluated explicitly in terms of the intensity function. We show that asymptotically, i.e. for increasing total intensity, the optimal intensity function is proportional to a dimension-dependent power of the density of the observations. For fixed finite total intensity, no explicit solution seems available. However, the Ward-type criterion to be minimised is convex in the intensity function, so that the steepest descent method of Molchanov and Zuyev (2001) can be used to approximate the global minimum. It turns out that the gradient is similar in form to the functional to be optimised. If we discretise over a grid, the steepest descent algorithm at each iteration step increases the current intensity function at those points where the gradient is minimal, at the expense of regions with a large gradient value. The algorithm is applied to a toy one-dimensional example, a simulation from a popular spatial cluster model and a real-life dataset from Strauss (1975) concerning the positions of redwood seedlings. Finally, we discuss the relative merits of our approach compared to classical hierarchical and partition clustering techniques as well as to modern model-based clustering methods using Markov point processes and mixture distributions.
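The grid-based descent step described in the abstract can be illustrated with a rough one-dimensional sketch. It is not the authors' implementation. It assumes a squared-distance criterion truncated at a radius `u`, evaluated through the Poisson void probability exp(-mu(B(x, t))), so that the expected truncated squared distance from a data point x to the nearest centre is the integral of 2t exp(-mu(B(x, t))) over 0 <= t <= u. The truncation radius, the midpoint quadrature, the step size with halving, and all function names are assumptions made for this illustration.

```python
import numpy as np

def criterion_and_gradient(data, grid, mass, u=1.0, n_radii=200):
    """Discretised criterion and its gradient with respect to the grid masses.

    Assumed criterion: sum_i int_0^u 2 t exp(-mu(B(x_i, t))) dt, where mu puts
    mass[j] on grid[j]. The integral uses a midpoint rule with n_radii nodes.
    """
    dt = u / n_radii
    radii = (np.arange(n_radii) + 0.5) * dt
    dist = np.abs(data[:, None] - grid[None, :])              # shape (n, m)
    inside = (dist[:, :, None] <= radii).astype(float)        # shape (n, m, r)
    mu_ball = np.einsum("j,ijr->ir", mass, inside)            # mu(B(x_i, t))
    integrand = 2.0 * radii * np.exp(-mu_ball) * dt           # shape (n, r)
    value = integrand.sum()
    # Same integrand, restricted to radii that reach grid point j, with a minus sign.
    grad = -np.einsum("ir,ijr->j", integrand, inside)
    return value, grad

def steepest_descent(data, grid, total_mass=4.0, n_iter=30, step=0.5, u=1.0):
    """Move mass from the largest-gradient grid point to the smallest-gradient one."""
    mass = np.full(grid.size, total_mass / grid.size)         # uniform start
    value, grad = criterion_and_gradient(data, grid, mass, u)
    history = [value]
    for _ in range(n_iter):
        support = np.flatnonzero(mass > 0)
        j_min = int(np.argmin(grad))
        j_max = int(support[np.argmax(grad[support])])
        if grad[j_min] >= grad[j_max]:
            break                                             # no descent direction left
        delta = min(step, mass[j_max])
        while delta > 1e-8:
            trial = mass.copy()
            trial[j_max] -= delta
            trial[j_min] += delta
            trial_value, trial_grad = criterion_and_gradient(data, grid, trial, u)
            if trial_value < value:                           # accept only a decrease
                mass, value, grad = trial, trial_value, trial_grad
                break
            delta *= 0.5                                      # otherwise halve the step
        else:
            break                                             # no acceptable step found
        history.append(value)
    return mass, history

# Toy data (hypothetical): two tight groups of points on [0, 1].
rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(0.25, 0.03, 30), rng.normal(0.75, 0.03, 30)])
grid = np.linspace(0.0, 1.0, 41)
mass, history = steepest_descent(data, grid)
print("criterion: start", history[0], "end", history[-1])
print("grid points carrying mass:", grid[mass > 0])
```

Each accepted step keeps the total mass fixed and lowers the discretised criterion. Because the criterion is convex in the masses, a sufficiently small transfer along a direction with negative directional derivative always gives a decrease, which is why step halving is enough here.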
    Original language: English
    Pages (from-to): 1021-1033
    Number of pages: 12
    Journal: Biometrika
    Volume: 88
    Issue number: 4
    Publication status: Published - 31 May 2001

    Keywords

    • cluster analysis
    • Poisson point process
    • steepest descent
    • statistics
    • modelling science
    • biometrics
