Skip to yearly menu bar Skip to main content


Poster

A Combinatorial Algorithm for the Semi-Discrete Optimal Transport Problem

Pankaj Agarwal · Sharath Raghvendra · Pouyan Shirzadian · Keegan Yao

[ ]
Wed 11 Dec 11 a.m. PST — 2 p.m. PST

Abstract: Optimal Transport (OT, also known as the Wasserstein distance) is a popular metric for comparing probability distributions and has been successfully used in many machine-learning applications. In the semi-discrete $2$-Wasserstein problem, we wish to compute the cheapest way to transport all the mass from a continuous distribution $\mu$ to a discrete distribution $\nu$ in $\mathbb{R}^d$ for $d\ge 1$, where the cost of transporting unit mass between points $a$ and $b$ is $d(a,b)=||a-b||^2$. When both distributions are discrete, a simple combinatorial framework has been used to find the exact solution (see e.g. [Orlin, STOC 1988]). In this paper, we propose a combinatorial framework for the semi-discrete OT, which can be viewed as an extension of the combinatorial framework for the discrete OT but requires several new ideas, and we present a new algorithm that computes an $\varepsilon$-additive approximate semi-discrete transport plan between $2$-dimensional distributions in $O(n^{4}\log n\log \frac{1}{\varepsilon})$ time (in the worst case), assuming that the mass of $\mu$ inside a triangle can be computed in $O(1)$ time. Our algorithm is significantly faster than the known algorithms and unlike many numerical algorithms does not make any assumptions on the smoothness of $\mu$. As an application of our algorithm, we describe a data structure to store a large discrete distribution $\mu$ (with support size $N$) using $O(N)$ space so that, given a query discrete distribution $\nu$ (with support size $k$), an $\varepsilon$-additive approximate transport plan can be computed in $O(k^{3}\sqrt{N}\log \frac{1}{\varepsilon})$ time in $2$ dimensions. Our algorithm and data structure extend to any dimension $d$ as well as to $p$-Wasserstein problem for any $p \ge 1$.

Live content is unavailable. Log in and register to view live content