Timezone: »
We seek an entropy estimator for discrete distributions with fully empirical accuracy bounds. As stated, this goal is infeasible without some prior assumptions on the distribution. We discover that a certain information moment assumption renders the problem feasible. We argue that the moment assumption is natural and, in some sense, {\em minimalistic} --- weaker than finite support or tail decay conditions. Under the moment assumption, we provide the first finite-sample entropy estimates for infinite alphabets, nearly recovering the known minimax rates. Moreover, we demonstrate that our empirical bounds are significantly sharper than the state-of-the-art bounds, for various natural distributions and non-trivial sample regimes. Along the way, we give a dimension-free analogue of the Cover-Thomas result on entropy continuity (with respect to total variation distance) for finite alphabets, which may be of independent interest.
Author Information
Doron Cohen (Ben-Gurion University of the Negev)
Aryeh Kontorovich (Ben Gurion University)
Aaron Koolyk (Hebrew University of Jerusalem, Israel)
Geoffrey Wolfer (Tokyo University of Agriculture and Technology)
More from the Same Authors
-
2023 Poster: Near-optimal learning with average Hölder smoothness »
Guy Kornowski · Aryeh Kontorovich · Steve Hanneke -
2020 Poster: Learning discrete distributions with infinite support »
Doron Cohen · Aryeh Kontorovich · Geoffrey Wolfer -
2018 Poster: Learning convex polytopes with margin »
Lee-Ad Gottlieb · Eran Kaufman · Aryeh Kontorovich · Gabriel Nivasch -
2017 Poster: Nearest-Neighbor Sample Compression: Efficiency, Consistency, Infinite Dimensions »
Aryeh Kontorovich · Sivan Sabato · Roi Weiss -
2016 Poster: Active Nearest-Neighbor Learning in Metric Spaces »
Aryeh Kontorovich · Sivan Sabato · Ruth Urner -
2015 Poster: Mixing Time Estimation in Reversible Markov Chains from a Single Sample Path »
Daniel Hsu · Aryeh Kontorovich · Csaba Szepesvari -
2014 Poster: Near-optimal sample compression for nearest neighbors »
Lee-Ad Gottlieb · Aryeh Kontorovich · Pinhas Nisnevitch -
2014 Poster: Consistency of weighted majority votes »
Daniel Berend · Aryeh Kontorovich -
2013 Poster: Predictive PAC Learning and Process Decompositions »
Cosma Shalizi · Aryeh Kontorovich