Workshop
Reinforcement Learning for Real Life (RL4RealLife) Workshop
Yuxi Li · Emma Brunskill · Minmin Chen · Omer Gottesman · Lihong Li · Yao Liu · Zhiwei Tony Qin · Matthew Taylor
Theater A
Sat 3 Dec, 5:30 a.m. PST
Discover how to improve the adoption of RL in practice by discussing key research problems, the state of the art, and success stories, insights, and lessons regarding practical RL algorithms, practical issues, and applications with leading experts from both academia and industry at the NeurIPS 2022 RL4RealLife workshop.
Timezone: America/Los_Angeles
Schedule
Sat 5:30 a.m. - 6:25 a.m.
|
posters (for early birds, optional)
(
posters
)
>
|
Sat 6:25 a.m. - 6:30 a.m.
|
opening remarks
(
opening remarks
)
>
SlidesLive Video |
Sat 6:31 a.m. - 7:00 a.m.
|
Invited talk: Outracing Champion Gran Turismo Drivers with Deep Reinforcement Learning
(
talk
)
>
SlidesLive Video |
Peter Stone |
Sat 7:01 a.m. - 7:30 a.m.
|
Invited talk: Scaling reinforcement learning in the real world, from gaming to finance to manufacturing
(
talk
)
>
SlidesLive Video |
Robert Nishihara |
Sat 7:30 a.m. - 7:31 a.m.
|
Intro speaker
(
In-person Intro
)
>
|
Sat 7:31 a.m. - 8:00 a.m.
|
Invited talk: Deep Reinforcement Learning for Real-World Inventory Management
(
talk
)
>
SlidesLive Video |
Dhruv Madeka |
Sat 8:00 a.m. - 8:20 a.m.
|
Coffee break
(
Coffee break
)
>
|
Sat 8:20 a.m. - 9:10 a.m.
|
Panel RL Implementation
(
Panel
)
>
SlidesLive Video |
Xiaolin Ge · Alborz Geramifard · Kence Anderson · Craig Buhr · Robert Nishihara · Yuandong Tian |
Sat 9:10 a.m. - 10:00 a.m.
|
Panel RL Benchmarks
(
Panel
)
>
SlidesLive Video |
Minmin Chen · Pablo Samuel Castro · Caglar Gulcehre · Tony Jebara · Peter Stone |
Sat 10:00 a.m. - 11:30 a.m.
|
Lunch Break / Posters
(
Poster/Break
)
>
|
Sat 11:31 a.m. - 12:00 p.m.
|
Invited talk: AlphaTensor: Discovering faster matrix multiplication algorithms with RL
(
talk
)
>
SlidesLive Video |
Matej Balog |
Sat 12:00 p.m. - 12:55 p.m.
|
Panel RL Theory-Practice Gap
(
Panel
)
>
SlidesLive Video |
Peter Stone · Matej Balog · Jonas Buchli · Jason Gauci · Dhruv Madeka |
Sat 12:55 p.m. - 1:00 p.m.
|
closing remarks
(
closing remarks
)
>
|
Sat 1:00 p.m. - 1:30 p.m.
|
Coffee break / Posters
(
Coffee break / Posters
)
>
|
Sat 1:30 p.m. - 3:00 p.m.
|
Posters
(
Posters
)
>
|
-
|
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
(
Poster
)
>
|
Danil Provodin · Pratik Gajane · Mykola Pechenizkiy · Maurits Kaptein |
-
|
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
(
Spotlight
)
>
SlidesLive Video |
Danil Provodin · Pratik Gajane · Mykola Pechenizkiy · Maurits Kaptein |
-
|
MARLIM: Multi-Agent Reinforcement Learning for Inventory Management
(
Poster
)
>
SlidesLive Video |
Rémi Leluc · Elie Kadoche · Antoine Bertoncello · Sébastien Gourvénec |
-
|
MARLIM: Multi-Agent Reinforcement Learning for Inventory Management
(
Spotlight
)
>
|
Rémi Leluc · Elie Kadoche · Antoine Bertoncello · Sébastien Gourvénec |
-
|
A Versatile and Efficient Reinforcement Learning Approach for Autonomous Driving
(
Poster
)
>
|
Guan Wang · Haoyi Niu · desheng zhu · Jianming HU · Xianyuan Zhan · Guyue Zhou |
-
|
A Versatile and Efficient Reinforcement Learning Approach for Autonomous Driving
(
Spotlight
)
>
SlidesLive Video |
Guan Wang · Haoyi Niu · desheng zhu · Jianming HU · Xianyuan Zhan · Guyue Zhou |
-
|
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
Yuri Chervonyi · Praneet Dutta |
-
|
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
(
Spotlight
)
>
|
Yuri Chervonyi · Praneet Dutta |
-
|
Structured Q-learning For Antibody Design
(
Poster
)
>
SlidesLive Video |
Alexander Cowen-Rivers · Philip John Gorinski · aivar sootla · Asif Khan · Jun WANG · Jan Peters · Haitham Bou Ammar |
-
|
Structured Q-learning For Antibody Design
(
Spotlight
)
>
|
Alexander Cowen-Rivers · Philip John Gorinski · aivar sootla · Asif Khan · Jun WANG · Jan Peters · Haitham Bou Ammar |
-
|
Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
(
Poster
)
>
SlidesLive Video |
Xinhan Di · Pengqian Yu |
-
|
Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
(
Spotlight
)
>
|
Xinhan Di · Pengqian Yu |
-
|
Learning an Adaptive Forwarding Strategy for Mobile Wireless Networks: Resource Usage vs. Latency
(
Poster
)
>
SlidesLive Video |
Victoria Manfredi · Alicia Wolfe · Xiaolan Zhang · Bing Wang |
-
|
Learning an Adaptive Forwarding Strategy for Mobile Wireless Networks: Resource Usage vs. Latency
(
Spotlight
)
>
SlidesLive Video |
Victoria Manfredi · Alicia Wolfe · Xiaolan Zhang · Bing Wang |
-
|
Safe Reinforcement Learning for Automatic Insulin Delivery in Type I Diabetes
(
Poster
)
>
SlidesLive Video |
Maxime Louis · Hector Romero Ugalde · Pierre Gauthier · Alice Adenis · Yousra Tourki · Erik Huneker |
-
|
Safe Reinforcement Learning for Automatic Insulin Delivery in Type I Diabetes
(
Spotlight
)
>
|
Maxime Louis · Hector Romero Ugalde · Pierre Gauthier · Alice Adenis · Yousra Tourki · Erik Huneker |
-
|
Power Grid Congestion Management via Topology Optimization with AlphaZero
(
Poster
)
>
|
Matthias Dorfer · Anton R. Fuxjaeger · Kristián Kozák · Patrick Blies · Marcel Wasserer |
-
|
Power Grid Congestion Management via Topology Optimization with AlphaZero
(
Spotlight
)
>
SlidesLive Video |
Matthias Dorfer · Anton R. Fuxjaeger · Kristián Kozák · Patrick Blies · Marcel Wasserer |
-
|
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
(
Poster
)
>
SlidesLive Video |
Yuandong Ding · Mingxiao Feng · Guozi Liu · Wei Jiang · Chuheng Zhang · Li Zhao · Lei Song · Houqiang Li · Yan Jin · Jiang Bian |
-
|
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
(
Spotlight
)
>
SlidesLive Video |
Yuandong Ding · Mingxiao Feng · Guozi Liu · Wei Jiang · Chuheng Zhang · Li Zhao · Lei Song · Houqiang Li · Yan Jin · Jiang Bian |
-
|
LibSignal: An Open Library for Traffic Signal Control
(
Poster
)
>
SlidesLive Video |
Hao Mei · Xiaoliang Lei · Longchao Da · Bin Shi · Hua Wei |
-
|
LibSignal: An Open Library for Traffic Signal Control
(
Spotlight
)
>
|
Hao Mei · Xiaoliang Lei · Longchao Da · Bin Shi · Hua Wei |
-
|
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
(
Poster
)
>
SlidesLive Video |
Benjamin Fuhrer · Yuval Shpigelman · Chen Tessler · Shie Mannor · Gal Chechik · Eitan Zahavi · Gal Dalal |
-
|
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
(
Spotlight
)
>
|
Benjamin Fuhrer · Yuval Shpigelman · Chen Tessler · Shie Mannor · Gal Chechik · Eitan Zahavi · Gal Dalal |
-
|
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
(
Poster
)
>
|
Kaixuan Huang · Yu Wu · Xuezhou Zhang · Shenyinying Tu · Qingyun Wu · Mengdi Wang · Huazheng Wang |
-
|
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
(
Spotlight
)
>
SlidesLive Video |
Kaixuan Huang · Yu Wu · Xuezhou Zhang · Shenyinying Tu · Qingyun Wu · Mengdi Wang · Huazheng Wang |
-
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
-
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
(
Spotlight
)
>
|
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
-
|
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices
(
Poster
)
>
SlidesLive Video |
Toygun Basaklar · Yigit Tuncel · Umit Ogras |
-
|
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices
(
Spotlight
)
>
|
Toygun Basaklar · Yigit Tuncel · Umit Ogras |
-
|
Optimizing Audio Recommendations for the Long-Term
(
Poster
)
>
SlidesLive Video |
Lucas Maystre · Daniel Russo · Yu Zhao |
-
|
Optimizing Audio Recommendations for the Long-Term
(
Spotlight
)
>
|
Lucas Maystre · Daniel Russo · Yu Zhao |
-
|
Controlling Commercial Cooling Systems Using Reinforcement Learning
(
Poster
)
>
|
27 presenters: Jerry Luo · Cosmin Paduraru · Octavian Voicu · Yuri Chervonyi · Scott Munns · Jerry Li · Crystal Qian · Praneet Dutta · Daniel Mankowitz · Jared Quincy Davis · Ningjia Wu · Xingwei Yang · Chu-Ming Chang · Ted Li · Rob Rose · Mingyan Fan · Hootan Nakhost · Tinglin Liu · Deeni Fatiha · Neil Satra · Juliet Rothenberg · Molly Carlin · Satish Tallapaka · Sims Witherspoon · David Parish · Peter Dolan · Chenyu Zhao |
-
|
Controlling Commercial Cooling Systems Using Reinforcement Learning
(
Spotlight
)
>
SlidesLive Video |
27 presenters: Jerry Luo · Cosmin Paduraru · Octavian Voicu · Yuri Chervonyi · Scott Munns · Jerry Li · Crystal Qian · Praneet Dutta · Daniel Mankowitz · Jared Quincy Davis · Ningjia Wu · Xingwei Yang · Chu-Ming Chang · Ted Li · Rob Rose · Mingyan Fan · Hootan Nakhost · Tinglin Liu · Deeni Fatiha · Neil Satra · Juliet Rothenberg · Molly Carlin · Satish Tallapaka · Sims Witherspoon · David Parish · Peter Dolan · Chenyu Zhao |
-
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response
(
Poster
)
>
SlidesLive Video |
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry |
-
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response
(
Spotlight
)
>
|
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry |
-
|
Identifying Disparities in Sepsis Treatment by Learning the Expert Policy
(
Poster
)
>
SlidesLive Video |
Hyewon Jeong · Siddharth Nayak · Taylor Killian · Sanjat Kanjilal · Marzyeh Ghassemi |
-
|
Identifying Disparities in Sepsis Treatment by Learning the Expert Policy
(
Spotlight
)
>
|
Hyewon Jeong · Siddharth Nayak · Taylor Killian · Sanjat Kanjilal · Marzyeh Ghassemi |
-
|
Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
(
Poster
)
>
|
20 presenters: Vashist Avadhanula · Omar Abdul Baki · Hamsa Bastani · Osbert Bastani · Caner Gocmen · Daniel Haimovich · Darren Hwang · Dmytro Karamshuk · Thomas Leeper · Jiayuan Ma · Gregory macnamara · Jake Mullet · Christopher Palow · Sung Park · Varun S Rajagopal · Kevin Schaeffer · Parikshit Shah · Deeksha Sinha · Nicolas Stier-Moses · Ben Xu |
-
|
Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
(
Spotlight
)
>
SlidesLive Video |
20 presenters: Vashist Avadhanula · Omar Abdul Baki · Hamsa Bastani · Osbert Bastani · Caner Gocmen · Daniel Haimovich · Darren Hwang · Dmytro Karamshuk · Thomas Leeper · Jiayuan Ma · Gregory macnamara · Jake Mullet · Christopher Palow · Sung Park · Varun S Rajagopal · Kevin Schaeffer · Parikshit Shah · Deeksha Sinha · Nicolas Stier-Moses · Ben Xu |
-
|
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
(
Poster
)
>
|
11 presenters: Melody Wolk · Andy Applebaum · Camron Dennler · Patrick Dwyer · Marina Moskowitz · Harold Nguyen · Nicole Nichols · Nicole Park · Paul Rachwalski · Frank Rau · Adrian Webster |
-
|
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
(
Spotlight
)
>
SlidesLive Video |
11 presenters: Melody Wolk · Andy Applebaum · Camron Dennler · Patrick Dwyer · Marina Moskowitz · Harold Nguyen · Nicole Nichols · Nicole Park · Paul Rachwalski · Frank Rau · Adrian Webster |
-
|
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
William Wong · Praneet Dutta · Octavian Voicu · Yuri Chervonyi · Cosmin Paduraru · Jerry Luo |
-
|
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
(
Spotlight
)
>
|
William Wong · Praneet Dutta · Octavian Voicu · Yuri Chervonyi · Cosmin Paduraru · Jerry Luo |
-
|
Reinforcement Learning Approaches for Traffic Signal Control under Missing Data
(
Poster
)
>
SlidesLive Video |
Hao Mei · Junxian Li · Bin Shi · Hua Wei |
-
|
Reinforcement Learning Approaches for Traffic Signal Control under Missing Data
(
Spotlight
)
>
SlidesLive Video |
Hao Mei · Junxian Li · Bin Shi · Hua Wei |
-
|
Reinforcement Learning-Based Air Traffic Deconfliction
(
Poster
)
>
|
Denis Osipychev · Dragos Margineantu |
-
|
Reinforcement Learning-Based Air Traffic Deconfliction
(
Spotlight
)
>
SlidesLive Video |
Denis Osipychev · Dragos Margineantu |
-
|
Automatic Evaluation of Excavator Operators using Learned Reward Functions
(
Poster
)
>
SlidesLive Video |
Pranav Agarwal · Marek Teichmann · Sheldon Andrews · Samira Ebrahimi Kahou |
-
|
Automatic Evaluation of Excavator Operators using Learned Reward Functions
(
Spotlight
)
>
|
Pranav Agarwal · Marek Teichmann · Sheldon Andrews · Samira Ebrahimi Kahou |
-
|
Function Approximations for Reinforcement Learning Controller for Wave Energy Converters
(
Poster
)
>
SlidesLive Video |
Soumyendu Sarkar · Vineet Gundecha · Alexander Shmakov · Sahand Ghorbanpour · Ashwin Ramesh Babu · Alexandre Pichard · Mathieu Cocho |
-
|
Function Approximations for Reinforcement Learning Controller for Wave Energy Converters
(
Spotlight
)
>
SlidesLive Video |
Soumyendu Sarkar · Vineet Gundecha · Alexander Shmakov · Sahand Ghorbanpour · Ashwin Ramesh Babu · Alexandre Pichard · Mathieu Cocho |