How we ship models in VS Code | LIVE161

Julia Kasper and Seth Juarez share how the VS Code and Copilot teams approach selecting, testing, and rolling out AI models for different tasks.

Overview

The session explains why shipping “the right model for the right task” is hard in practice, and how the team uses structured evaluation to decide when a model change is safe to roll out.

Key themes covered:

Resources