firstbacksecondback
7 Results
Workshop
|
Linear Probe Penalties Reduce LLM Sycophancy Henry Papadatos · Rachel Freedman |
||
Workshop
|
Semantic Entropy Neurons: Encoding Semantic Uncertainty in the Latent Space of LLMs Jiatong Han · Jannik Kossen · Muhammed Razzak · Yarin Gal |
||
Workshop
|
Do LLMs internally know'' when they follow instructions? Juyeon Heo · Christina Heinze-Deml · Shirley Ren · Oussama Elachqar · Udhyakumar Nallasamy · Andy Miller · Jaya Narain |
||
Workshop
|
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks Nathalie Kirch · Severin Field · Stephen Casper |
||
Poster
|
Fri 16:30 |
Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective Akiyoshi Tomihari · Issei Sato |
|
Workshop
|
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy Kian Kenyon-Dean · Jerry Wang · John Urbanik · Konstantin Donhauser · Jason Hartford · Saber Saberian · Nil Sahin · Ihab Bendidi · Safiye Celik · Marta Fay · Juan Rodriguez · Imran Haque · Oren Kraus |
||
Workshop
|
Sun 12:10 |
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy Kian Kenyon-Dean · Jerry Wang · John Urbanik · Konstantin Donhauser · Jason Hartford · Saber Saberian · Nil Sahin · Ihab Bendidi · Safiye Celik · Marta Fay · Juan Rodriguez · Imran Haque · Oren Kraus |