130 Results
Poster
|
Tue 7:45 |
An intriguing failing of convolutional neural networks and the CoordConv solution Rosanne Liu · Joel Lehman · Piero Molino · Felipe Petroski Such · Eric Frank · Alex Sergeev · Jason Yosinski |
|
Poster
|
Tue 7:45 |
Post: Device Placement with Cross-Entropy Minimization and Proximal Policy Optimization Yuanxiang Gao · Li Chen · Baochun Li |
|
Poster
|
Wed 7:45 |
End-to-End Differentiable Physics for Learning and Control Filipe de Avila Belbute Peres · Kevin Smith · Kelsey Allen · Josh Tenenbaum · J. Zico Kolter |
|
Poster
|
Wed 14:00 |
Learning to Navigate in Cities Without a Map Piotr Mirowski · Matt Grimes · Mateusz Malinowski · Karl Moritz Hermann · Keith Anderson · Denis Teplyashin · Karen Simonyan · Koray Kavukcuoglu · Andrew Zisserman · Raia Hadsell |
|
Poster
|
Wed 14:00 |
Learning to Play With Intrinsically-Motivated, Self-Aware Agents Nick Haber · Damian Mrowca · Stephanie Wang · Li Fei-Fei · Daniel Yamins |
|
Poster
|
Wed 14:00 |
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments Sriram Srinivasan · Marc Lanctot · Vinicius Zambaldi · Julien Perolat · Karl Tuyls · Remi Munos · Michael Bowling |
|
Poster
|
Wed 14:00 |
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization Hoi-To Wai · Zhuoran Yang · Zhaoran Wang · Mingyi Hong |
|
Poster
|
Wed 14:00 |
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing Chen Liang · Mohammad Norouzi · Jonathan Berant · Quoc V Le · Ni Lao |
|
Poster
|
Wed 14:00 |
Total stochastic gradient algorithms and applications in reinforcement learning Paavo Parmas |
|
Poster
|
Wed 14:00 |
Representation Balancing MDPs for Off-policy Policy Evaluation Yao Liu · Omer Gottesman · Aniruddh Raghu · Matthieu Komorowski · Aldo Faisal · Finale Doshi-Velez · Emma Brunskill |