Autoregressive Models Beyond Language
Abstract
Autoregressive modeling is no longer confined to language. Recent work shows that the same next-element prediction principle can achieve state-of-the-art performance in generative modeling, representation learning, and multi-modal tasks across images, video, audio, robotics, and scientific data. Yet extending autoregressive methods to these modalities is far from straightforward: many inductive biases that autoregressive language models rely on do not hold for other kinds of data, and a range of new techniques has therefore been proposed in recent years to adapt autoregressive models to data beyond language.
This tutorial will review the core theory of autoregressive models, present practical design choices for generative modeling, representation learning, and multi-modal learning, and spotlight open challenges in the area. We hope the tutorial provides attendees with a clear conceptual roadmap and hands-on resources for applying and extending autoregressive techniques across diverse data domains.
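For reference, the next-element prediction principle mentioned above is the chain-rule factorization of a sequence's joint distribution, trained by maximizing the likelihood of each element given its prefix. The notation below is a generic sketch (x may stand for words, image patches, audio frames, or action tokens; x_{<t} denotes the prefix before position t), not notation taken from any specific work covered in the tutorial:

    % Autoregressive factorization and its training objective:
    % each element x_t is predicted from the prefix x_{<t}.
    p_\theta(x) = \prod_{t=1}^{T} p_\theta(x_t \mid x_{<t}),
    \qquad
    \mathcal{L}(\theta) = -\sum_{t=1}^{T} \log p_\theta(x_t \mid x_{<t})

The factorization itself is modality-agnostic; what changes across modalities is how the data is serialized into the sequence x_1, ..., x_T and in what order, which is precisely where the design choices discussed in this tutorial come in.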
Schedule