AInstein: Can AI Rediscover Scientific Concepts from First Principles?
Shambhavi Mishra · Gaurav Sahu · Marco Pedersoli · Laurent Charlin · Jose Dolz · Chris Pal
Abstract
Large language models have demonstrated remarkable capabilities across diverse tasks, yet a fundamental question remains: can these models genuinely rediscover complex scientific insights, or do they merely recite memorized information? We present AInstein, a novel framework for evaluating whether language models can derive established scientific concepts from first principles when stripped of domain-specific terminology. Rather than testing the recall of scientific facts, we reformulate landmark discoveries as conceptual puzzles, challenging models to reconstruct the underlying technical solutions independently.