Eindhoven SPOR Seminar

May 4, 15:45 - 16:45

Matthieu Jonckheere (UBA)

Distance learning using Euclidean percolation: Following Fermat's principle

In unsupervised statistical learning tasks such as clustering, recommendation, or dimension reduction, a notion of distance or similarity between points is crucial but usually not directly available as an input. We propose a new estimator for weighted geodesic distances that takes into account the underlying density of the data and is suitable for nonuniform data lying on a manifold of lower dimension than the ambient space. The consistency of the estimator is proven using tools from first passage percolation. We then discuss its properties and implementation, and evaluate its performance on clustering tasks. We also discuss other applications, such as intrinsic persistent homology.
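
As a rough illustration of the kind of estimator discussed, the sketch below assumes a sample-based distance defined as the minimum, over paths through the data points, of the sum of Euclidean step lengths raised to a power alpha > 1. The abstract does not spell out this form, so it is an assumption here, and the function name, the parameter alpha, and the toy points are purely illustrative. With alpha > 1, long jumps are penalized, so cheap paths tend to pass through densely sampled regions.

```python
import heapq
import math


def power_weighted_distances(points, source, alpha=2.0):
    """Shortest-path distances from points[source] on the complete graph
    whose edge weights are Euclidean distances raised to the power alpha.

    Illustrative sketch only: the name and signature are hypothetical.
    """
    n = len(points)
    best = [math.inf] * n
    best[source] = 0.0
    done = [False] * n
    heap = [(0.0, source)]
    # Dijkstra's algorithm on the complete graph over the sample.
    while heap:
        d, i = heapq.heappop(heap)
        if done[i]:
            continue
        done[i] = True
        for j in range(n):
            if done[j]:
                continue
            cand = d + math.dist(points[i], points[j]) ** alpha
            if cand < best[j]:
                best[j] = cand
                heapq.heappush(heap, (cand, j))
    return best


# Toy sample: three close points on a line and one far-away point.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (2.0, 5.0)]
distances = power_weighted_distances(points, source=0, alpha=2.0)
```

With alpha = 1 the triangle inequality makes the direct hop optimal, so the sketch returns plain Euclidean distances. For alpha > 1, taking two unit steps to reach the third point costs 1 + 1 = 2 rather than 2^2 = 4 for the direct jump. This brute-force version examines all pairs of points; restricting to a nearest-neighbor graph is a common way to reduce the cost, though that choice is also an assumption here.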

Joint work with P. Groisman (University of Buenos Aires) and F. Sapienza (Berkeley).



