firstbacksecondback
62 Results
Poster
|
Wed 7:45 |
Constant Regret, Generalized Mixability, and Mirror Descent Zakaria Mhammedi · Robert Williamson |
|
Poster
|
Thu 14:00 |
Policy Regret in Repeated Games Raman Arora · Michael Dinitz · Teodor Vanislavov Marinov · Mehryar Mohri |
|
Poster
|
Wed 7:45 |
Fast deep reinforcement learning using online adjustments from the past Steven Hansen · Alexander Pritzel · Pablo Sprechmann · Andre Barreto · Charles Blundell |
|
Poster
|
Wed 7:45 |
Learning Optimal Reserve Price against Non-myopic Bidders Jinyan Liu · Zhiyi Huang · Xiangning Wang |
|
Poster
|
Thu 7:45 |
Online Structure Learning for Feed-Forward and Recurrent Sum-Product Networks Agastya Kalra · Abdullah Rashwan · Wei-Shou Hsu · Pascal Poupart · Prashant Doshi · George Trimponias |
|
Poster
|
Thu 14:00 |
Adaptive Online Learning in Dynamic Environments Lijun Zhang · Shiyin Lu · Zhi-Hua Zhou |
|
Workshop
|
Fri 11:30 |
Model-free vs. Model-based Learning in a Causal World: Some Stories from Online Learning to Rank Csaba Szepesvari |
|
Poster
|
Thu 7:45 |
Preference Based Adaptation for Learning Objectives Yao-Xiang Ding · Zhi-Hua Zhou |
|
Poster
|
Wed 7:45 |
On Markov Chain Gradient Descent Tao Sun · Yuejiao Sun · Wotao Yin |
|
Poster
|
Thu 14:00 |
Learning Beam Search Policies via Imitation Learning Renato Negrinho · Matthew Gormley · Geoffrey Gordon |
|
Poster
|
Wed 14:00 |
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models Yining Wang · Xi Chen · Yuan Zhou |
|
Poster
|
Thu 14:00 |
Gen-Oja: Simple & Efficient Algorithm for Streaming Generalized Eigenvector Computation Kush Bhatia · Aldo Pacchiano · Nicolas Flammarion · Peter Bartlett · Michael Jordan |