OLMoE: Open Mixture-of-Experts Language Models

Niklas Muennighoff ⋅ Luca Soldaini ⋅ Dirk Groeneveld ⋅ Kyle Lo ⋅ Jacob Morrison ⋅ Sewon Min ⋅ Weijia Shi ⋅ Evan Walsh ⋅ Oyvind Tafjord ⋅ Nathan Lambert ⋅ Yuling Gu ⋅ Shane Arora ⋅ Akshita Bhagia ⋅ Dustin Schwenk ⋅ David Wadden ⋅ Alexander Wettig ⋅ Binyuan Hui ⋅ Tim Dettmers ⋅ Douwe Kiela ⋅ Noah Smith ⋅ Pang Wei Koh ⋅ Amanpreet Singh ⋅ Hannaneh Hajishirzi
Keywords: Efficient Training

Abstract
