Timezone: »
We propose a new fast Gaussian summation algorithm for high-dimensional datasets with high accuracy. First, we extend the original fast multipole-type methods to use approximation schemes with both hard and probabilistic error. Second, we utilize a new data structure called subspace tree which maps each data point in the node to its lower dimensional mapping as determined by any linear dimension reduction method such as PCA. This new data structure is suitable for reducing the cost of each pairwise distance computation, the most dominant cost in many kernel methods. Our algorithm guarantees probabilistic relative error on each kernel sum, and can be applied to high-dimensional Gaussian summations which are ubiquitous inside many kernel methods as the key computational bottleneck. We provide empirical speedup results on low to high-dimensional datasets up to 89 dimensions.
Author Information
Dongryeol Lee (Independent Researcher)
Alexander Gray (Skytree Inc. and Georgia Tech)
More from the Same Authors
-
2013 Poster: Which Space Partitioning Tree to Use for Search? »
Parikshit Ram · Alexander Gray -
2012 Poster: Minimax Multi-Task Learning and a Generalized Loss-Compositional Paradigm for MTL »
Nishant A Mehta · Dongryeol Lee · Alexander Gray -
2009 Workshop: Large-Scale Machine Learning: Parallelism and Massive Datasets »
Alexander Gray · Arthur Gretton · Alexander Smola · Joseph E Gonzalez · Carlos Guestrin -
2009 Poster: Submanifold density estimation »
Arkadas Ozakin · Alexander Gray -
2009 Poster: Linear-time Algorithms for Pairwise Statistical Problems »
Parikshit Ram · Dongryeol Lee · William B March · Alexander Gray -
2009 Spotlight: Linear-time Algorithms for Pairwise Statistical Problems »
Parikshit Ram · Dongryeol Lee · William B March · Alexander Gray -
2009 Poster: Rank-Approximate Nearest Neighbor Search: Retaining Meaning and Speed in High Dimensions »
Parikshit Ram · Dongryeol Lee · Hua Ouyang · Alexander Gray -
2008 Poster: QUIC-SVD: Fast SVD Using Cosine Trees »
Michael Holmes · Alexander Gray · Charles Isbell -
2008 Demonstration: MLPACK: Scalable Machine Learning Software »
Alexander Gray -
2007 Poster: Multi-Stage Monte Carlo Approximation for Fast Generalized Data Summations »
Michael Holmes · Alexander Gray · Charles Isbell