Small code edits
Start: GPT-5 mini, Claude Haiku 4.5, or GPT-5.4 nano.
Escalate: Claude Sonnet 4.6 or GPT-5.3-Codex if the edit touches several files.
Copilot Models
A compact reference for choosing models in Copilot Chat and agent workflows. Use it to balance model quality, premium request cost, latency, and the type of work you are asking Copilot to do.
Reference Table
Paid-plan cost is shown as GitHub's premium request multiplier. Copilot Free has a limited monthly premium request allowance; included models may not consume premium requests on paid plans.
| Model | Provider | Status | Paid Cost | Free Plan | Best At | Use It For |
|---|---|---|---|---|---|---|
| GPT-4.1 | OpenAI | Retires Jun 1 | Included | Included | Fast everyday coding help, explanations, small changes, and fallback chat. | Use only while migrating routine workflows to GPT-5 mini or GPT-5.5. |
| GPT-4o | OpenAI | Billing ref | Included | 1 request | General chat, multimodal help, and broad assistant tasks where available. | Use as a baseline included model if it appears in your Copilot model picker. |
| GPT-5 mini | OpenAI | GA | Included | Included | General coding, writing, quick reasoning, and low-cost daily usage. | Default to this for routine prompts before escalating to premium models. |
| GPT-5.2 | OpenAI | Retires Jun 1 | 1x | 1 request | Deep reasoning, debugging, architectural analysis, and tradeoff decisions. | Use only while migrating planning prompts to GPT-5.5 or GPT-5.4. |
| GPT-5.2-Codex | OpenAI | Retires Jun 1 | 1x | 1 request | Long-horizon coding work, multi-file implementation, and agent mode tasks. | Use only while migrating repository-edit workflows to GPT-5.3-Codex. |
| GPT-5.3-Codex | OpenAI | GA | 1x | 1 request | Agentic software development, tests, refactors, reviews, and codebase navigation. | Use as a strong default for implementation-heavy agent workflows. |
| GPT-5.4 | OpenAI | GA | 1x | 1 request | Complex reasoning, debugging, code review, and architecture-level analysis. | Use when you need a strong reviewer or planner without jumping to premium-heavy cost. |
| GPT-5.5 | OpenAI | GA | 7.5x | 7.5 requests | Frontier reasoning, difficult architecture decisions, and high-stakes code review. | Use narrowly for ambiguous planning, security-sensitive review, or GPT-4.1 migration checks. |
| GPT-5.4 mini | OpenAI | GA | 0.33x | 1 request | Agentic codebase exploration, search-heavy work, and lightweight implementation. | Use to scan a repo, find likely files, draft a plan, or handle medium-risk edits. |
| GPT-5.4 nano | OpenAI | GA | 0.25x | 1 request | Low-cost snippets, short explanations, small rewrites, and quick checks. | Use for cheap iteration when mistakes are easy to spot and fix. |
| Claude Haiku 4.5 | Anthropic | GA | 0.33x | 1 request | Fast answers, lightweight edits, small explanations, and cheap support loops. | Use for quick iteration before escalating to Sonnet or Opus. |
| Claude Sonnet 4.5 | Anthropic | GA | 1x | 1 request | General coding, agent tasks, reasoning, and practical development support. | Use as a balanced implementation model when you want quality at 1x. |
| Claude Sonnet 4.6 | Anthropic | GA | 1x | 1 request | General coding, debugging, visual reasoning, and agentic development. | Use as the main "build it" model after a harder model has planned the approach. |
| Claude Opus 4.5 | Anthropic | GA | 3x | 3 requests | Difficult refactors, codebase reasoning, and complex problem solving. | Use for architectural planning when Sonnet is missing context or nuance. |
| Claude Opus 4.6 | Anthropic | GA | 3x | 3 requests | Deep reasoning, hard debugging, and complex tradeoff analysis. | Use for planning and review on complicated work when 3x cost is acceptable. |
| Claude Opus 4.6 fast mode | Anthropic | Preview | 30x | 30 requests | Low-latency Opus-class reasoning when response speed matters more than budget. | Reserve for rare interactive sessions where premium request burn is acceptable. |
| Claude Opus 4.7 | Anthropic | GA | 15x | 15 requests | Anthropic's strongest Copilot model for advanced coding and reasoning. | Use for the first planning pass on large, ambiguous, or risky engineering tasks. |
| Gemini 2.5 Pro | GA | 1x | 1 request | Complex code generation, debugging, research, and broad technical analysis. | Use as a strong alternative reviewer or researcher at standard cost. | |
| Gemini 3 Flash | Preview | 0.33x | 1 request | Fast simple tasks, repetitive edits, summaries, and low-cost iteration. | Use for quick work when latency and cost matter more than depth. | |
| Gemini 3.1 Pro | Preview | 1x | 1 request | Long-context reasoning, edit-test loops, and technical analysis. | Use for reading large contexts, comparing alternatives, and research-heavy prompts. | |
| Grok Code Fast 1 | xAI | Deprecated | 0.25x | 1 request | Fast coding, code generation, and debugging across common languages. | Avoid for new workflows; move speed-focused prompts to GPT-5 mini or Claude Haiku 4.5. |
| Raptor mini | Fine-tuned GPT-5 mini | Preview | Included | Included | Inline suggestions, explanations, and everyday developer assistance. | Use for included-cost support when available in your environment. |
| Goldeneye | Fine-tuned GPT-5.1-Codex | Preview | Free only | Included | Evaluation-oriented reasoning and problem solving. | Use for comparison and testing when GitHub exposes it in your picker. |
Current caveat: GitHub may auto-select a model for some requests, especially if the selected model is overloaded. Models also vary by Copilot plan, client, policy settings, and preview access.
Task Guide
Start with the cheapest model that can do the job, then escalate only when the task is ambiguous or risky.
Start: GPT-5 mini, Claude Haiku 4.5, or GPT-5.4 nano.
Escalate: Claude Sonnet 4.6 or GPT-5.3-Codex if the edit touches several files.
Start: Claude Sonnet 4.6 or GPT-5.3-Codex.
Escalate: Claude Opus 4.7 or GPT-5.4 for the plan if the feature is unclear.
Start: GPT-5.4 or Gemini 2.5 Pro.
Escalate: Claude Opus 4.7 for difficult root-cause analysis.
Start: Claude Opus 4.7 or GPT-5.4 for scope and risk analysis.
Implement: Claude Sonnet 4.6 or GPT-5.3-Codex in smaller verified chunks.
Start: GPT-5.4, Gemini 2.5 Pro, or Claude Sonnet 4.6.
Escalate: Claude Opus 4.7 for high-risk logic, security-sensitive areas, or architecture drift.
Start: GPT-5 mini, Gemini 3 Flash, or Claude Haiku 4.5.
Escalate: Gemini 3.1 Pro for long context or GPT-5.4 for technical precision.
Start: Claude Sonnet 4.6, Gemini 2.5 Pro, or GPT-5.4.
Escalate: Opus when layout, product behavior, and code changes all interact.
Start: included models, GPT-5.4 nano, Claude Haiku 4.5, or Gemini 3 Flash.
Escalate: Pay 1x or higher only after the problem is narrowed.
Recommended Flows
Use different models for different phases instead of trying to make one model do every job.
Cost Notes
Premium request multipliers are not dollar prices. They are how quickly a request consumes your included or purchased premium request allowance.
GPT-5 mini and Raptor mini are the safest starting point for routine prompts because paid-plan usage is included according to GitHub's multiplier table. GPT-4.1 should be treated as a migration-only option.
Sonnet, Codex, GPT-5.4, and Gemini Pro models are usually where implementation, review, and deeper analysis should happen.
GPT-5.5, Claude Opus 4.7, and Opus fast mode should answer narrow, high-value questions rather than carrying every iteration of a long coding session.
Sources
The model list changes often. Re-check these sources before making budget or policy decisions.