NeurIPS Poster Near-Linear Time Algorithm for the Chamfer Distance

Poster

Near-Linear Time Algorithm for the Chamfer Distance

Ainesh Bakshi · Piotr Indyk · Rajesh Jayaram · Sandeep Silwal · Erik Waingarten

Great Hall & Hall B1+B2 (level 1) #1823

[ Abstract ]

[ Paper] [ OpenReview]

Abstract: For any two point sets

A, B \subset R^{d}

$A,B \subset \mathbb{R}^d$ of size up to

n

$n$ , the Chamfer distance from

A

$A$ to

B

$B$ is defined as

CH (A, B) = \sum_{a \in A} min_{b \in B} d_{X} (a, b)

$\texttt{CH}(A,B)=\sum_{a \in A} \min_{b \in B} d_X(a,b)$ , where

d_{X}

$d_X$ is the underlying distance measure (e.g., the Euclidean or Manhattan distance). The Chamfer distance is a popular measure of dissimilarity between point clouds, used in many machine learning, computer vision, and graphics applications, and admits a straightforward

O (d n^{2})

$O(d n^2)$ -time brute force algorithm. Further, Chamfer distance is often used as a proxy for the more computationally demanding Earth-Mover (Optimal Transport) Distance. However, the \emph{quadratic} dependence on

n

$n$ in the running time makes the naive approach intractable for large datasets.We overcome this bottleneck and present the first

(1 + ϵ)

$(1+\epsilon)$ -approximate algorithm for estimating Chamfer distance with a near-linear running time. Specifically, our algorithm runs in time

O (n d \log (n) / ϵ^{2})

$O(nd \log (n)/\epsilon^2)$ and is implementable. Our experiments demonstrate that it is both accurate and fast on large high-dimensional datasets. We believe that our algorithm will open new avenues for analyzing large high-dimensional point clouds. We also give evidence that if the goal is to report a

(1 + ϵ)

$(1+\epsilon)$ -approximate mapping from

A

$A$ to

B

$B$ (as opposed to just its value), then any sub-quadratic time algorithm is unlikely to exist.

Chat is not available.