Timezone: »
NeurIPS RL Competitions Results Presentations
Rohin Shah · Liam Paull · Tabitha Lee · Tim Rocktäschel · Heinrich Küttler · Sharada Mohanty · Manuel Wuethrich
Author Information
Rohin Shah (DeepMind)
Rohin is a Research Scientist on the technical AGI safety team at DeepMind. He completed his PhD at the Center for Human-Compatible AI at UC Berkeley, where he worked on building AI systems that can learn to assist a human user, even if they don't initially know what the user wants. He is particularly interested in big picture questions about artificial intelligence. What techniques will we use to build human-level AI systems? How will their deployment affect the world? What can we do to make this deployment go better? He writes up summaries and thoughts about recent work tackling these questions in the Alignment Newsletter.
Liam Paull (Université de Montréal)
Tabitha Lee (Carnegie Mellon University)
Tim Rocktäschel (Facebook AI Research)
Heinrich Küttler (Facebook AI Research)
Sharada Mohanty (AIcrowd SA)
Manuel Wuethrich (MPI Intelligent Systems)
More from the Same Authors
-
2021 : The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions »
Jennifer J Sun · Tomomi Karigo · Dipam Chakraborty · Sharada Mohanty · Benjamin Wild · Quan Sun · Chen Chen · David Anderson · Pietro Perona · Yisong Yue · Ann Kennedy -
2021 : MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research »
Mikayel Samvelyan · Robert Kirk · Vitaly Kurin · Jack Parker-Holder · Minqi Jiang · Eric Hambro · Fabio Petroni · Heinrich Kuttler · Edward Grefenstette · Tim Rocktäschel -
2021 Spotlight: Iterative Teaching by Label Synthesis »
Weiyang Liu · Zhen Liu · Hanchen Wang · Liam Paull · Bernhard Schölkopf · Adrian Weller -
2021 Spotlight: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2021 : An Empirical Investigation of Representation Learning for Imitation »
Cynthia Chen · Sam Toyer · Cody Wild · Scott Emmons · Ian Fischer · Kuang-Huei Lee · Neel Alex · Steven Wang · Ping Luo · Stuart Russell · Pieter Abbeel · Rohin Shah -
2021 : Grounding Aleatoric Uncertainty in Unsupervised Environment Design »
Minqi Jiang · Michael Dennis · Jack Parker-Holder · Andrei Lupu · Heinrich Kuttler · Edward Grefenstette · Tim Rocktäschel · Jakob Foerster -
2021 : That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities »
Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel -
2021 : Graph Backup: Data Efficient Backup Exploiting Markovian Data »
zhengyao Jiang · Tianjun Zhang · Robert Kirk · Tim Rocktäschel · Edward Grefenstette -
2021 : Return Dispersion as an Estimator of Learning Potential for Prioritized Level Replay »
Iryna Korshunova · Minqi Jiang · Jack Parker-Holder · Tim Rocktäschel · Edward Grefenstette -
2022 : Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response »
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry -
2022 : Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response »
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry -
2022 : Fifteen-minute Competition Overview Video »
Nico Gürtler · Georg Martius · Pavel Kolev · Sebastian Blaes · Manuel Wuethrich · Markus Wulfmeier · Cansu Sancaktar · Martin Riedmiller · Arthur Allshire · Bernhard Schölkopf · Annika Buchholz · Stefan Bauer -
2022 : Fifteen-minute Competition Overview Video »
Byron Galbraith · Anssi Kanervisto · Steven Wang · Stephanie Milani · Sharada Mohanty · Rohin Shah · Karolis Ramanauskas · Brandon Houghton -
2022 : Fifteen-minute Competition Overview Video »
Tianpei Yang · Iuliia Kotseruba · Montgomery Alban · Amir Rasouli · Soheil Mohamad Alizadeh Shabestary · Randolph Goebel · Matthew Taylor · Liam Paull · Florian Shkurti -
2023 Competition: The CityLearn Challenge 2023 »
Zoltan Nagy · Kingsley Nweye · Sharada Mohanty · Ruchi Choudhary · Max Langtry · Gregor Henze · Jan Drgona · Sourav Dey · Alfonso Capozzoli · Mohamed Ouf -
2023 Competition: The NeurIPS 2023 Neural MMO Challenge: Multi-Task Reinforcement Learning and Curriculum Generation »
Joseph Suarez · Phillip Isola · David Bloomin · Kyoung Choe · Hao Li · Ryan Sullivan · Nishaanth Kanna · Daniel Scott · Rose Shuman · Herbie Bradley · Louis Castricato · Chenghui Yu · Yuhao Jiang · Qimai Li · Jiaxin Chen · Xiaolong Zhu · Dipam Chakrabroty · Sharada Mohanty -
2022 Competition: The CityLearn Challenge 2022 »
Zoltan Nagy · Kingsley Nweye · Sharada Mohanty · Siva Sankaranarayanan · Jan Drgona · Tianzhen Hong · Sourav Dey · Gregor Henze -
2022 Competition: Driving SMARTS »
Amir Rasouli · Matthew Taylor · Iuliia Kotseruba · Tianpei Yang · Randolph Goebel · Soheil Mohamad Alizadeh Shabestary · Montgomery Alban · Florian Shkurti · Liam Paull -
2022 Competition: The MineRL BASALT Competition on Fine-tuning from Human Feedback »
Anssi Kanervisto · Stephanie Milani · Karolis Ramanauskas · Byron Galbraith · Steven Wang · Brandon Houghton · Sharada Mohanty · Rohin Shah -
2022 Competition: Real Robot Challenge III - Learning Dexterous Manipulation from Offline Data in the Real World »
Nico Gürtler · Georg Martius · Sebastian Blaes · Pavel Kolev · Cansu Sancaktar · Stefan Bauer · Manuel Wuethrich · Markus Wulfmeier · Martin Riedmiller · Arthur Allshire · Annika Buchholz · Bernhard Schölkopf -
2022 Poster: Dungeons and Data: A Large-Scale NetHack Dataset »
Eric Hambro · Roberta Raileanu · Danielle Rothermel · Vegard Mella · Tim Rocktäschel · Heinrich Küttler · Naila Murray -
2022 Poster: Grounding Aleatoric Uncertainty for Unsupervised Environment Design »
Minqi Jiang · Michael Dennis · Jack Parker-Holder · Andrei Lupu · Heinrich Küttler · Edward Grefenstette · Tim Rocktäschel · Jakob Foerster -
2021 : AI Driving Olympics + Q&A »
Andrea Censi · Liam Paull · Jacopo Tani · Emilio Frazzoli · Holger Caesar · Matthew Walter · Andrea Daniele · Sahika Genc · Sharada Mohanty -
2021 : BASALT: A MineRL Competition on Solving Human-Judged Task + Q&A »
Rohin Shah · Cody Wild · Steven Wang · Neel Alex · Brandon Houghton · William Guss · Sharada Mohanty · Stephanie Milani · Nicholay Topin · Pieter Abbeel · Stuart Russell · Anca Dragan -
2021 : The NetHack Challenge + Q&A »
Eric Hambro · Sharada Mohanty · Dipam Chakrabroty · Edward Grefenstette · Minqi Jiang · Robert Kirk · Vitaly Kurin · Heinrich Kuttler · Vegard Mella · Nantas Nardelli · Jack Parker-Holder · Roberta Raileanu · Tim Rocktäschel · Danielle Rothermel · Mikayel Samvelyan -
2021 Poster: Replay-Guided Adversarial Environment Design »
Minqi Jiang · Michael Dennis · Jack Parker-Holder · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel -
2021 : Learning By Doing: Controlling a Dynamical System using Control Theory, Reinforcement Learning, or Causality + Q&A »
Sebastian Weichwald · Niklas Pfister · Dominik Baumann · Isabelle Guyon · Oliver Kroemer · Tabitha Lee · Søren Wengel Mogensen · Jonas Peters · Sebastian Trimpe -
2021 Poster: Iterative Teaching by Label Synthesis »
Weiyang Liu · Zhen Liu · Hanchen Wang · Liam Paull · Bernhard Schölkopf · Adrian Weller -
2021 : Diamond: A MineRL Competition on Training Sample-Efficient Agents + Q&A »
William Guss · Alara Dirik · Byron Galbraith · Brandon Houghton · Anssi Kanervisto · Noboru Kuno · Stephanie Milani · Sharada Mohanty · Karolis Ramanauskas · Ruslan Salakhutdinov · Rohin Shah · Nicholay Topin · Steven Wang · Cody Wild -
2021 Poster: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2020 : Concluding Remarks »
Sharada Mohanty -
2020 : Sample Efficiency & Generalization in RL : An assortment of tricks (talks by top participants) »
Sharada Mohanty -
2020 : Winner Announcements & Analysis of top submissions »
Sharada Mohanty -
2020 : NeurIPS 2020 Procgen Challenge Design »
Sharada Mohanty -
2020 : Introduction - Procgen »
Sharada Mohanty -
2020 : Concluding Remarks »
Sharada Mohanty -
2020 : "Real world applications of Flatland" : Panel Discussion with SBB, DeutschBahn, SNCF »
Sharada Mohanty -
2020 : Winner Talks : Team ai-team-flatland »
Sharada Mohanty -
2020 : Winner Talks : Team JBR_HSE »
Sharada Mohanty -
2020 : Winner Talks : Team An Old Driver »
Sharada Mohanty -
2020 : Flatland Competition Design & Results »
Sharada Mohanty -
2020 : Introduction - Flatland »
Sharada Mohanty -
2020 : Conclusions and Wrap up »
Liam Paull -
2020 : Interviews with winners »
Liam Paull -
2020 : Live robot competition (LF, LFP, lFVM) »
Liam Paull -
2020 : Intro to Urban League (includes highlights from semifinals) »
Liam Paull -
2020 : Advanced Perception League »
Liam Paull -
2020 : Introduction to AIDO »
Liam Paull -
2020 : Spotlight Talk: Benefits of Assistance over Reward Learning »
Rohin Shah -
2020 : NeurIPS RL Competitions: Procgen challenge »
Sharada Mohanty -
2020 : NeurIPS RL Competitions: Flatland challenge »
Sharada Mohanty -
2020 : Q&A #1 »
Oren Etzioni · Tim Rocktäschel · Victoria Lin -
2020 : Invited Talk #3 »
Tim Rocktäschel -
2020 Workshop: Differentiable computer vision, graphics, and physics in machine learning »
Krishna Murthy Jatavallabhula · Kelsey Allen · Victoria Dean · Johanna Hansen · Shuran Song · Florian Shkurti · Liam Paull · Derek Nowrouzezahrai · Josh Tenenbaum -
2020 Poster: Your GAN is Secretly an Energy-based Model and You Should Use Discriminator Driven Latent Sampling »
Tong Che · Ruixiang ZHANG · Jascha Sohl-Dickstein · Hugo Larochelle · Liam Paull · Yuan Cao · Yoshua Bengio -
2020 Poster: The NetHack Learning Environment »
Heinrich Küttler · Nantas Nardelli · Alexander Miller · Roberta Raileanu · Marco Selvatici · Edward Grefenstette · Tim Rocktäschel -
2020 Poster: Look-ahead Meta Learning for Continual Learning »
Gunshi Gupta · Karmesh Yadav · Liam Paull -
2020 Oral: Look-ahead Meta Learning for Continual Learning »
Gunshi Gupta · Karmesh Yadav · Liam Paull -
2020 Poster: The MAGICAL Benchmark for Robust Imitation »
Sam Toyer · Rohin Shah · Andrew Critch · Stuart Russell -
2020 Poster: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks »
Patrick Lewis · Ethan Perez · Aleksandra Piktus · Fabio Petroni · Vladimir Karpukhin · Naman Goyal · Heinrich Küttler · Mike Lewis · Wen-tau Yih · Tim Rocktäschel · Sebastian Riedel · Douwe Kiela -
2019 : Lunch break & Poster session »
Breandan Considine · Michael Innes · Du Phan · Dougal Maclaurin · Robin Manhaeve · Alexey Radul · Shashi Gowda · Ekansh Sharma · Eli Sennesh · Maxim Kochurov · Gordon Plotkin · Thomas Wiecki · Navjot Kukreja · Chung-chieh Shan · Matthew Johnson · Dan Belov · Neeraj Pradhan · Wannes Meert · Angelika Kimmig · Luc De Raedt · Brian Patton · Matthew Hoffman · Rif A. Saurous · Daniel Roy · Eli Bingham · Martin Jankowiak · Colin Carroll · Junpeng Lao · Liam Paull · Martin Abadi · Angel Rojas Jimenez · JP Chen -
2019 : AI Driving Olympics 3 »
Caglayan Dicle · Liam Paull · Jacopo Tani · Sunil Mallya · Sahika Genc · Kirsten Bowser · Tao Sun · Yunzhe Tao · Philippe Marcotte · Hsu-kuang Chiu · Eric Wolff -
2019 Poster: On the Utility of Learning about Humans for Human-AI Coordination »
Micah Carroll · Rohin Shah · Mark Ho · Tom Griffiths · Sanjit Seshia · Pieter Abbeel · Anca Dragan -
2018 : Live competition The AI Driving Olympics: Introduction to Duckietown and the AI Driving Olympics »
Liam Paull · Jacopo Tani · Kirsten Bowser · Lin Jin · Cameron Peron