Poster
in
Workshop: Workshop on Multi-Turn Interactions in Large Language Models Sat, Dec 6, 2025 • 3:45 PM – 4:45 PM PST

Toward Community-Driven Agents for Machine Learning Engineering

Sijie Li · Weiwei Sun · Shanda Li · Ameet Talwalkar · Yiming Yang

Project Page [ OpenReview]

Abstract

Large language model-based machine learning (ML) agents have shown great promise in automating ML research.However, existing agents typically operate in isolation on a given research problem, without engaging with the broader research community, where human researchers often gain insights and contribute by sharing knowledge.To bridge this gap, we introduce MLE-Live, a live evaluation framework designed to assess an agent's ability to communicate with and leverage collective knowledge from a simulated Kaggle research community.Building on this framework, we propose CoMind, a novel agent that excels at exchanging insights and developing novel solutions within a community context.CoMind achieves state-of-the-art performance on MLE-Live and outperforms 79.2% human competitors on average across four ongoing Kaggle competitions.

Chat is not available.