firstbacksecondback
634 Results
Workshop
|
Crystal Design Amidst Noisy DFT Signals: A Reinforcement Learning Approach Prashant Govindarajan · Mathieu Reymond · Santiago Miret · Mariano Phielipp · Sarath Chandar |
||
Poster
|
Wed 11:00 |
Offline Multitask Representation Learning for Reinforcement Learning Haque Ishfaq · Thanh Nguyen-Tang · Songtao Feng · Raman Arora · Mengdi Wang · Ming Yin · Doina Precup |
|
Poster
|
Thu 11:00 |
Reinforcement Learning with LTL and ω-Regular Objectives via Optimality-Preserving Translation to Average Rewards Xuan Bach Le · Dominik Wagner · Leon Witzman · Alexander Rabinovich · Luke Ong |
|
Poster
|
Thu 16:30 |
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang · Shaofei Cai · Zhancun Mu · Haowei Lin · Ceyao Zhang · Xuejie Liu · Qing Li · Anji Liu · Xiaojian (Shawn) Ma · Yitao Liang |
|
Workshop
|
Sat 12:00 |
CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++? Vaishnavi Bhargava · Rajat Ghosh · Debojyoti Dutta |
|
Workshop
|
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning Alicia Li · Nishanth Kumar · Tomás Lozano-Pérez · Leslie Kaelbling |
||
Poster
|
Wed 16:30 |
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs Davide Maran · Alberto Maria Metelli · Matteo Papini · Marcello Restelli |
|
Poster
|
Wed 11:00 |
Autoregressive Policy Optimization for Constrained Allocation Tasks David Winkel · Niklas Strauß · Maximilian Bernhard · Zongyue Li · Thomas Seidl · Matthias Schubert |
|
Poster
|
Wed 16:30 |
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX Alexander Rutherford · Benjamin Ellis · Matteo Gallici · Jonathan Cook · Andrei Lupu · Garðar Ingvarsson Juto · Timon Willi · Ravi Hammond · Akbir Khan · Christian Schroeder de Witt · Alexandra Souly · Saptarashmi Bandyopadhyay · Mikayel Samvelyan · Minqi Jiang · Robert Lange · Shimon Whiteson · Bruno Lacerda · Nick Hawes · Tim Rocktäschel · Chris Lu · Jakob Foerster |
|
Poster
|
Fri 11:00 |
Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence Rakshit Trivedi · Akbir Khan · Jesse Clifton · Lewis Hammond · Edgar Duenez-Guzman · Dipam Chakraborty · John Agapiou · Jayd Matyas · Sasha Vezhnevets · Barna Pásztor · Yunke Ao · Omar G. Younis · Jiawei Huang · Benjamin Swain · Haoyuan Qin · Deng · Ziwei Deng · Utku Erdoğanaras · Yue Zhao · Marko Tesic · Natasha Jaques · Jakob Foerster · Vincent Conitzer · José Hernández-Orallo · Dylan Hadfield-Menell · Joel Leibo |