firstbacksecondback
256 Results
Poster
|
Tue 14:00 |
Diversified Recommendations for Agents with Adaptive Preferences William Brown · Arpit Agarwal |
|
Workshop
|
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization Runlong Zhou · Yuandong Tian · YI WU · Simon Du |
||
Poster
|
Thu 14:00 |
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos Bowen Baker · Ilge Akkaya · Peter Zhokov · Joost Huizinga · Jie Tang · Adrien Ecoffet · Brandon Houghton · Raul Sampedro · Jeff Clune |
|
Workshop
|
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion Utkarsh Soni · Sarath Sreedharan · Mudit Verma · Lin Guan · Matthew Marquez · Subbarao Kambhampati |
||
Poster
|
Conditional Meta-Learning of Linear Representations Giulia Denevi · Massimiliano Pontil · Carlo Ciliberto |
||
Poster
|
Thu 14:00 |
Augmenting Online Algorithms with Anupam Gupta · Debmalya Panigrahi · Bernardo Subercaseaux · Kevin Sun |
|
Poster
|
Wed 9:00 |
Online Allocation and Learning in the Presence of Strategic Agents Steven Yin · Shipra Agrawal · Assaf Zeevi |
|
Poster
|
Thu 14:00 |
Optimal Comparator Adaptive Online Learning with Switching Cost Zhiyu Zhang · Ashok Cutkosky · Yannis Paschalidis |
|
Poster
|
Wed 9:00 |
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback Arya Akhavan · Evgenii Chzhen · Massimiliano Pontil · Alexandre Tsybakov |
|
Poster
|
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech Ziyue Jiang · Zhe Su · Zhou Zhao · Qian Yang · Yi Ren · Jinglin Liu · 振辉 叶 |
||
Poster
|
Tue 14:00 |
Learning from a Sample in Online Algorithms C.J. Argue · Alan Frieze · Anupam Gupta · Christopher Seiler |
|
Poster
|
Thu 9:00 |
Minimax Regret for Cascading Bandits Daniel Vial · Sujay Sanghavi · Sanjay Shakkottai · R. Srikant |