Glue doesn't lock you in. Use the best model for the job — swap with a single flag, no restart needed.
Different tasks need different models. Use Opus for complex architecture decisions, Sonnet for everyday coding, Haiku for quick fixes.
Switch models mid-session with the /model command. Your conversation context carries over — no restart needed.
Glue tracks token usage per session and shows real-time cost estimates in the status bar. Stay in budget.
| Model | Best For | Speed | Context |
|---|---|---|---|
| Claude 4 Opus | Complex refactors, architecture | Moderate | 200K |
| Claude 4 Sonnet | General coding, best balance | Fast | 200K |
| GPT-5 | Reasoning-heavy tasks | Fast | 128K |
| Gemini 3 Pro | Large codebase analysis | Fast | 1M |
| Local (Ollama) | Offline, private, experimentation | Varies | Varies |