Author Information
Byron Galbraith (Seva)
Byron Galbraith is the CTO of Seva, where he applies the latest advances in machine learning and natural language processing to build AI-powered conversational agents. Byron has a PhD in Cognitive and Neural Systems from Boston University and an MS in Bioinformatics from Marquette University. His research expertise includes brain-computer interfaces, neuromorphic robotics, spiking neural networks, high-performance computing, and natural language processing. Byron has also held several software engineering roles, including back-end system engineer, full-stack web developer, office automation consultant, and game engine developer, at companies ranging in size from a two-person startup to a multinational enterprise.
Anssi Kanervisto (Microsoft Research)
Steven Wang (UC Berkeley)
Stephanie Milani (Carnegie Mellon University)
Sharada Mohanty (AIcrowd SA)
Rohin Shah (DeepMind)
Karolis Ramanauskas (University of Bath)
PhD Student in Reinforcement Learning
Brandon Houghton (OpenAI)
More from the Same Authors
-
2021 : The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions »
Jennifer J Sun · Tomomi Karigo · Dipam Chakraborty · Sharada Mohanty · Benjamin Wild · Quan Sun · Chen Chen · David Anderson · Pietro Perona · Yisong Yue · Ann Kennedy -
2021 Spotlight: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2021 : An Empirical Investigation of Representation Learning for Imitation »
Cynthia Chen · Sam Toyer · Cody Wild · Scott Emmons · Ian Fischer · Kuang-Huei Lee · Neel Alex · Steven Wang · Ping Luo · Stuart Russell · Pieter Abbeel · Rohin Shah -
2021 : General Characterization of Agents by States they Visit »
Anssi Kanervisto · Ville Hautamäki -
2022 : Imitating Human Behaviour with Diffusion Models »
Tim Pearce · Tabish Rashid · Anssi Kanervisto · David Bignell · Mingfei Sun · Raluca Georgescu · Sergio Valcarcel Macua · Shan Zheng Tan · Ida Momennejad · Katja Hofmann · Sam Devlin -
2022 Competition: The CityLearn Challenge 2022 »
Zoltan Nagy · Kingsley Nweye · Sharada Mohanty · Siva Sankaranarayanan · Jan Drgona · Tianzhen Hong · Sourav Dey · Gregor Henze -
2022 Competition: The MineRL BASALT Competition on Fine-tuning from Human Feedback »
Anssi Kanervisto · Stephanie Milani · Karolis Ramanauskas · Byron Galbraith · Steven Wang · Brandon Houghton · Sharada Mohanty · Rohin Shah -
2022 Poster: Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos »
Bowen Baker · Ilge Akkaya · Peter Zhokov · Joost Huizinga · Jie Tang · Adrien Ecoffet · Brandon Houghton · Raul Sampedro · Jeff Clune -
2022 Poster: Uni[MASK]: Unified Inference in Sequential Decision Problems »
Micah Carroll · Orr Paradise · Jessy Lin · Raluca Georgescu · Mingfei Sun · David Bignell · Stephanie Milani · Katja Hofmann · Matthew Hausknecht · Anca Dragan · Sam Devlin -
2021 : NeurIPS RL Competitions Results Presentations »
Rohin Shah · Liam Paull · Tabitha Lee · Tim Rocktäschel · Heinrich Küttler · Sharada Mohanty · Manuel Wuethrich -
2021 : AI Driving Olympics + Q&A »
Andrea Censi · Liam Paull · Jacopo Tani · Emilio Frazzoli · Holger Caesar · Matthew Walter · Andrea Daniele · Sahika Genc · Sharada Mohanty -
2021 : BASALT: A MineRL Competition on Solving Human-Judged Task + Q&A »
Rohin Shah · Cody Wild · Steven Wang · Neel Alex · Brandon Houghton · William Guss · Sharada Mohanty · Stephanie Milani · Nicholay Topin · Pieter Abbeel · Stuart Russell · Anca Dragan -
2021 : The NetHack Challenge + Q&A »
Eric Hambro · Sharada Mohanty · Dipam Chakraborty · Edward Grefenstette · Minqi Jiang · Robert Kirk · Vitaly Kurin · Heinrich Küttler · Vegard Mella · Nantas Nardelli · Jack Parker-Holder · Roberta Raileanu · Tim Rocktäschel · Danielle Rothermel · Mikayel Samvelyan -
2021 : Diamond: A MineRL Competition on Training Sample-Efficient Agents + Q&A »
William Guss · Alara Dirik · Byron Galbraith · Brandon Houghton · Anssi Kanervisto · Noboru Kuno · Stephanie Milani · Sharada Mohanty · Karolis Ramanauskas · Ruslan Salakhutdinov · Rohin Shah · Nicholay Topin · Steven Wang · Cody Wild -
2021 Poster: Optimal Policies Tend To Seek Power »
Alex Turner · Logan Smith · Rohin Shah · Andrew Critch · Prasad Tadepalli -
2020 : Concluding Remarks »
Sharada Mohanty -
2020 : Sample Efficiency & Generalization in RL : An assortment of tricks (talks by top participants) »
Sharada Mohanty -
2020 : Winner Announcements & Analysis of top submissions »
Sharada Mohanty -
2020 : NeurIPS 2020 Procgen Challenge Design »
Sharada Mohanty -
2020 : Introduction - Procgen »
Sharada Mohanty -
2020 : Concluding Remarks »
Sharada Mohanty -
2020 : "Real world applications of Flatland" : Panel Discussion with SBB, Deutsche Bahn, SNCF »
Sharada Mohanty -
2020 : Winner Talks : Team ai-team-flatland »
Sharada Mohanty -
2020 : Winner Talks : Team JBR_HSE »
Sharada Mohanty -
2020 : Winner Talks : Team An Old Driver »
Sharada Mohanty -
2020 : Flatland Competition Design & Results »
Sharada Mohanty -
2020 : Introduction - Flatland »
Sharada Mohanty -
2020 : Spotlight Talk: Benefits of Assistance over Reward Learning »
Rohin Shah -
2020 : NeurIPS RL Competitions: Procgen challenge »
Sharada Mohanty -
2020 : NeurIPS RL Competitions: Flatland challenge »
Sharada Mohanty -
2020 Poster: The MAGICAL Benchmark for Robust Imitation »
Sam Toyer · Rohin Shah · Andrew Critch · Stuart Russell -
2019 : The MineRL competition »
Misa Ogura · Joe Booth · Sophia Sun · Nicholay Topin · Brandon Houghton · William Guss · Stephanie Milani · Oriol Vinyals · Katja Hofmann · JIA KIM · Karolis Ramanauskas · Florian Laurent · Daichi Nishio · Anssi Kanervisto · Alexey Skrynnik · Artemij Amiranashvili · Christian Scheller · KAIXIN WANG · Yanick Schraner -
2019 Poster: On the Utility of Learning about Humans for Human-AI Coordination »
Micah Carroll · Rohin Shah · Mark Ho · Tom Griffiths · Sanjit Seshia · Pieter Abbeel · Anca Dragan