113 Results
Workshop
|
Evaluating Chemistry Prompts for Large-Language Model Fine-Tuning Carmelo Gonzales · Michael Pieler · Kevin Maik Jablonka · Santiago Miret |
||
Poster
|
Wed 11:00 |
Weak Supervision Performance Evaluation via Partial Identification Felipe Maia Polo · Subha Maity · Mikhail Yurochkin · Moulinath Banerjee · Yuekai Sun |
|
Session
|
Thu 16:30 |
Dialogue with the Machine and Dialogue with the Art World: A Method for Evaluating AI as a Tool for Creativity Remi Denton · Farbod Mehr · Aroussiak Gabriellan · Rida Qadri · Huma Gupta · Pamela Karimi · Piotr Mirowski |
|
Workshop
|
GPAI Evaluations Standards Taskforce: towards effective AI governance Patricia Paskov · Lukas Berglund · Everett Smith · Lisa Soder |
||
Workshop
|
Towards Deliberating Agents: Evaluating the Ability of Large Language Models to Deliberate Arjun Karanam · Farnaz Jahanbakhsh · Sanmi Koyejo |
||
Poster
|
Thu 11:00 |
Task-oriented Time Series Imputation Evaluation via Generalized Representers Zhixian Wang · Linxiao Yang · Liang Sun · Qingsong Wen · Yi Wang |
|
Poster
|
Wed 11:00 |
MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning Yifan Jiang · Jiarui Zhang · Kexuan Sun · Zhivar Sourati · Kian Ahrabian · Kaixin Ma · Filip Ilievski · Jay Pujara |
|
Workshop
|
Declare and Justify: Explicit assumptions in AI evaluations are necessary for effective regulation Peter Barnett · Lisa Thiergart |
||
Affinity Event
|
Evaluating Generative AI for Scenario Variation in Automated Driving Validation Manasa Mariam Mammen · Zafer Kayatas · Eva Zimmermann · Pavel Nedvědický |
||
Workshop
|
GenAI Evaluation Maturity Framework (GEMF) to assess and improve GenAI Evaluations Yilin Zhang · Frank J. Kanayet |
||
Workshop
|
Trust but Verify: Reliable VLM evaluation in-the-wild with program synthesis Viraj Uday Prabhu · Senthil Purushwalkam · Jieyu Zhang · An Yan · Caiming Xiong · Ran Xu |