firstbacksecondback
62 Results
Poster
|
Thu 16:30 |
WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs Seungju Han · Kavel Rao · Allyson Ettinger · Liwei Jiang · Bill Yuchen Lin · Nathan Lambert · Yejin Choi · Nouha Dziri |
|
Workshop
|
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning Alicia Li · Nishanth Kumar · Tomás Lozano-Pérez · Leslie Kaelbling |