Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Dec 14 01:20 PM -- 01:35 PM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
Are Emergent Abilities of Large Language Models a Mirage?
Rylan Schaeffer · Brando Miranda · Sanmi Koyejo
[ OpenReview
Oral
Thu Dec 14 01:35 PM -- 01:50 PM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models
Guillermo Ortiz-Jimenez · Alessandro Favero · Pascal Frossard
[ OpenReview
Oral
Thu Dec 14 01:50 PM -- 02:05 PM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks
Ziqian Zhong · Ziming Liu · Max Tegmark · Jacob Andreas
[ OpenReview
Oral
Thu Dec 14 02:05 PM -- 02:20 PM (PST) @ Hall C2 (level 1 gate 9 south of food court) None
Jailbroken: How Does LLM Safety Training Fail?
Alexander Wei · Nika Haghtalab · Jacob Steinhardt
[ OpenReview