When was GPT-5.6 released?

OpenAI previewed GPT-5.6 on June 26, 2026 as three models — Sol, Terra, and Luna — through a limited Codex and API preview. On July 9, 2026, @OpenAI posted that Sol, Terra, and Luna are starting to roll out in ChatGPT, Codex, and the API with GA benchmarks including ALE 53.6 and AA Coding Agent Index 80.0.

What are GPT-5.6 Sol, Terra, and Luna?

Sol is OpenAI's GPT-5.6 flagship; Terra delivers GPT-5.5-competitive performance at roughly 2× lower cost; Luna is the fastest and most affordable tier for high-volume work. The 5.6 number marks the generation; Sol/Terra/Luna are durable capability tiers.

How does GPT-5.6 compare to Claude Fable 5?

OpenAI's July 9 GA claims: Sol leads Agents' Last Exam at 53.6 (+13.1 vs Fable adaptive), Artificial Analysis Coding Agent Index at 80.0 (+2.8 vs Fable), and Terminal-Bench 2.1 at 91.9% (Sol Ultra). Fable 5 leads SWE-Bench Pro at 80.3%. Both are publicly accessible as of July 2026.

What improvements does GPT-5.6 Sol make over GPT-5.5?

OpenAI calls Sol a step-function improvement over GPT-5.5 — new max reasoning effort, Ultra mode with subagents, state-of-the-art Terminal-Bench 2.1 scores, stronger biology (GeneBench v1) and cyber evals, and a layered safeguard stack. Terra and Luna trade capability for cost within the same generation.

How much does GPT-5.6 cost?

Official API pricing per 1M tokens: Sol $5/$30 (same list as GPT-5.5), Terra $2.50/$15, Luna $1/$6. GPT-5.6 adds explicit cache breakpoints and 30-minute minimum cache life.

What are Polymarket odds on GPT-5.6 release date?

On July 7, 2026, Polymarket posted ~80% odds on July 10 — a prediction market, not OpenAI. On July 8, 2026, @OpenAI officially said Sol, Terra, and Luna launch publicly this Thursday (July 9) with global preview expansion — superseding Polymarket as the primary schedule signal. Treat markets as crowd sentiment only.

GPT-5.6: Rolling Out July 9 — Sol, Terra, Luna | explainx.ai Blog

Update — July 9, 2026 (OpenAI rollout): @OpenAI posted:

Sol, Terra, and Luna, our GPT‑5.6 family of models, are starting to roll out now in ChatGPT, Codex, and the API.

New GA benchmarks from the thread:

Benchmark	GPT-5.6 Sol	vs Fable 5
Agents' Last Exam	53.6	+13.1 (adaptive)
Artificial Analysis Coding Agent Index	80.0	+2.8; <½ tokens, ~⅓ cost
Terminal-Bench 2.1 (Ultra)	91.9%	+8.5 pts
Ultra mode	Ships at launch	Parallel multi-agent coordination

Full Fable comparison → · Preview guide · Access timeline

Official OpenAI announcement graphic — Sol, Terra, and Luna rollout July 9, 2026.

TL;DR (July 9 rollout):

Model	Role	API (in/out per 1M)	Key GA score
Sol Ultra	Parallel multi-agent	—	91.9% TB · 53.6 ALE
Sol	Flagship	$5 / $30	80.0 AA Coding Index
Terra	GPT-5.5-class, 2× cheaper	$2.50 / $15	Beats Fable ALE @ ~1/16 cost
Luna	Volume tier	$1 / $6	Same efficiency story
Claude Fable 5	Live July 1	$10 / $50	80.3% SWE-Bench Pro

Walkthrough of the official GPT-5.6 preview — Sol, Terra, Luna tiers, benchmarks, and access timeline.

Official launch (June 26, 2026)

OpenAI published Previewing GPT-5.6 Sol and a preview system card. Key facts:

Three tiers: Sol (flagship), Terra (balanced), Luna (affordable volume)
Naming: generation number + durable tier names (Sol/Terra/Luna advance on their own cadence)
Availability: limited preview via API and Codex for trusted partners; GA planned in weeks
Government: U.S. requested vetted-partner start; OpenAI opposes making that permanent
Modes: max reasoning effort on Sol; Ultra mode uses subagents beyond a single agent
Cerebras: Sol at up to 750 tps in July for select customers

For the full launch narrative — cyber safeguards, ExploitBench², caching rules — see GPT-5.6 Sol, Terra, Luna preview guide.

Pre-release signals (what leaks got right)

Before June 26, GPT-5.6 surfaced only through indirect signals. Most proved directionally correct:

Codex log traces: Developers using Codex Computer Use have reported model identifiers referencing "gpt-5.6" appearing in system-level logs during extended agentic sessions. These are not publicly documented model names.

Context window reports: A subset of ChatGPT Pro OAuth users invoking Codex in extended sessions have reported context windows exceeding 1.4–1.5 million tokens—substantially above GPT-5.5's reported capabilities—in unofficial early-access configurations.

OpenAI's release cadence: GPT-5.4 in March 2026, GPT-5.5 on April 23, 2026 — sub-60-day increments put mid-July GA in range. June 2026 Polymarket/Metaculus contracts priced high odds by June 30; that date passed with preview still limited. July 7, 2026: Polymarket reposted ~80% on July 10 — see GA timeline post.

Training data signals: Researchers analyzing GPT-5.6 responses in early access have noted knowledge of events through approximately May 2026—consistent with a refreshed training cutoff ahead of a June public release.

Leaks correctly pointed at June timing, agentic gains, and government gating. OpenAI's official framing for Sol is stronger than "incremental" — a step function over GPT-5.5 on frontier agentic work — while Terra and Luna handle cost-optimized tiers.

Confirmed improvements (Sol vs GPT-5.5)

OpenAI's official claims concentrate on agentic, biology, and cyber — not single-turn chat polish:

Hands-on with GPT-5: where it exceeds expectations and where it still falls short.

1. Context Window: Up to 1.5 Million Tokens

GPT-5.5 operates with a context window that most production applications have treated as ~400K tokens effective for complex tasks. GPT-5.6 is expected to push this to approximately 1.5 million tokens—a 43% increase over the developer-reported ceiling for 5.5.

Why this matters: long-context handling is one of the clearest capability signals in the current frontier race. Claude Fable 5 and Gemini 3.1 Pro have both pushed long-context as a differentiator. A 1.5M token GPT model changes the calculus for use cases like full-codebase analysis, book-length document review, and multi-session agent state persistence.

At 1.5M tokens you can fit roughly:

An entire mid-size software project's worth of source code
A legal document corpus for a full case discovery process
Several full academic papers plus all their cited sources
Hours of meeting transcripts from a long project

2. Agentic Task Completion: Meaningful Reliability Gains

The most technically significant expected improvement is in multi-hour agentic task completion rates—particularly for Codex Computer Use workloads where an AI agent plans, executes, debugs, and iterates on a task autonomously over extended time horizons.

GPT-5.5 made progress here with its 82.7% Terminal-Bench 2.0 score, but early reports suggest GPT-5.6's agentic reliability improvement is meaningful enough that developers noticed it without being told the model changed. The improvement is attributed to:

A cleaner reward signal in training that reduces reward hacking in long agent loops
Tighter persona-isolation (the model less frequently "breaking character" or contradicting its system prompt mid-task)
An improved SFT pipeline that doesn't recycle contaminated rollouts—a subtle but important training quality fix that affects how reliably the model follows complex multi-step instructions

For developers building with Codex or custom agent frameworks, this kind of reliability improvement matters more than raw benchmark scores. A 10% improvement in task completion rate on a 20-step agent pipeline means the agent succeeds more than twice as often end-to-end.

3. Refreshed Training Data Through Mid-2026

GPT-5.5 launched in April 2026 with a training cutoff that left a gap for events from early 2026 onward. GPT-5.6 is expected to include training data through approximately May 2026, closing this window.

For most tasks, training cutoff doesn't matter. For tasks involving recent software ecosystems (new library releases, framework updates), recent world events, or current competitive intelligence, a model trained 6–8 weeks more recently is meaningfully more useful.

4. FrontierMath Tier 4 Reasoning

GPT-5.5 posted 35.4% on FrontierMath Tier 4—the hardest mathematical reasoning benchmark. GPT-5.6 is expected to show improvement here, potentially pushing past 40%. This would be the most direct counter to OpenAI's o3-pro positioning as the reasoning-first model: if GPT-5.6 meaningfully improves frontier math without being explicitly a "reasoning model," it blurs the product line distinction.

5. Token Efficiency for Long Tasks

For long-running agentic sessions, GPT-5.6 reportedly uses fewer tokens to accomplish the same work—a result of the cleaner SFT pipeline reducing repetition, self-correction loops, and unnecessary verbosity. For API users with high-volume agentic workloads, this efficiency gain translates directly to lower cost even if per-token pricing stays the same.

GPT-5.6 family vs GPT-5.5: The upgrade picture

Capability	GPT-5.5	GPT-5.6 Sol (official)	Terra / Luna
Terminal-Bench 2.1	88.0%	88.8% (Ultra 91.9%)	82.5% / 84.3%
API input / output	$5 / $30 per 1M	$5 / $30	$2.50/$15 · $1/$6
Agentic modes	Standard	Max reasoning · Ultra subagents	Tiered
GeneBench v1	Baseline	Better, fewer tokens	Improved cyber stack
Availability	Public	Preview → GA weeks	Same

Routing rule: Sol for hardest agent work; Terra when GPT-5.5-class is enough at half cost; Luna for volume. Context-window leak numbers (~1.5M) were not in OpenAI's June 26 preview post — treat as unconfirmed until GA docs.

GPT-5.6 vs Claude Fable 5: The Frontier Battle

This is the comparison that makes GPT-5.6 interesting. Claude Fable 5 ($10/$50 per million tokens) has been Anthropic's dominant position at the frontier since its launch: highest per-token price, highest capability ceiling, the model Claude Code runs on for complex agent tasks.

GPT-5.6's expected profile maps directly onto Fable 5's strongest territory:

Context length: Fable 5 has a 200K context window—a standard frontier spec. A GPT-5.6 at 1.5M tokens would be a 7.5× advantage on this single dimension. For use cases that push context limits, GPT-5.6 would win outright.

Agentic coding: Fable 5 leads the frontier on long-horizon autonomous coding tasks. GPT-5.6's reported improvements in multi-hour task completion rates are specifically targeting this category. Whether the gap closes entirely depends on benchmark results, but OpenAI is clearly aiming at Fable's core strength.

Pricing: Claude Fable 5 at $10/$50 per million tokens is 2× GPT-5.5's pricing. If GPT-5.6 stays near GPT-5.5's price point, it creates a scenario where a model with comparable or better capability costs half as much—which would reshape which frontier model enterprises default to.

Multimodal: Fable 5 is strong on multimodal reasoning. GPT-5.5 Vision already competes here, and GPT-5.6 is expected to maintain or improve that standing.

Single-turn quality: Fable 5 leads on the Artificial Analysis Intelligence Index and closely-contested benchmarks like SWE-bench Verified (87% range). GPT-5.6 is not expected to dramatically change this competitive position—Anthropic's RLHF quality at the fine-tuning stage is a real advantage.

The honest prediction: GPT-5.6 probably ties Fable 5 on aggregate intelligence metrics and leads Fable 5 on context length. On the hardest agentic coding tasks at the absolute frontier, whether GPT-5.6 closes Fable 5's lead depends on benchmark results that don't exist yet.

What's notable is how close this matchup is expected to be. Six months ago, Claude Fable 5 was a clear tier above GPT-5.5 on agentic capability. GPT-5.6's reported improvements would make this a genuine coin-flip race rather than a clear hierarchy.

GPT-5.6 vs Claude Fable 5: Quick Comparison

Dimension	GPT-5.6 (expected)	Claude Fable 5
Input price (per 1M)	~$5.00–$6.00	$10.00
Output price (per 1M)	~$30.00–$35.00	$50.00
Context window	~1.5M tokens	200K tokens
SWE-bench Verified	~87–89% (estimate)	~87%
Agentic task completion	Improved (TBD)	Strong
FrontierMath Tier 4	~40% (estimate)	~36% (estimate)
Training cutoff	~May 2026	~Mar 2026
Multimodal	Strong	Strong
Self-hosting	No	No

At these expected specs, the pricing story is significant: if GPT-5.6 delivers frontier-comparable capability at roughly half the per-token cost of Fable 5, the enterprise default for high-volume agentic workloads shifts. Teams spending $50,000/month on Fable 5 could potentially run the same workloads on GPT-5.6 for $25,000–$30,000.

What This Means for Developers Right Now

If you're currently on GPT-5.5: The upgrade case for GPT-5.6 is strong if you're doing agentic work or long-context tasks. For single-turn quality, the upgrade is marginal—you can wait for benchmark confirmation before migrating.

If you're currently on Claude Fable 5: Watch the first independent benchmark results closely when GPT-5.6 launches. The context window advantage alone (1.5M vs 200K) is material for certain workloads. On coding benchmarks, if GPT-5.6 matches Fable 5 at roughly half the price, the ROI calculation for high-volume use cases changes.

If you're building something new: Hold off on committing to either model until GPT-5.6 official benchmarks are published. A model at GPT-5.5's price point with Fable 5-class capability changes the math significantly.

If you're considering local open-source models: GPT-5.6 and Claude Fable 5 competing on context window and agentic capability doesn't change the underlying economics for the 70–80% of tasks where open-weight models like Qwen3 235B or DeepSeek V3 are already good enough. The frontier race is relevant for the hardest agentic and reasoning tasks; most practical workflows are better served by matching the right open model to the task.

The Bigger OpenAI Cadence Picture

GPT-5.6 is not an event—it's a data point in a pattern. OpenAI has compressed its release cadence to under 60 days between incremental model updates. This means:

GPT-5.5 is already ~8 weeks old at the expected GPT-5.6 release date
A GPT-5.7 would be expected in August 2026
The frontier model you adopt in January may be two model generations behind by July

This cadence creates a different kind of lock-in pressure than before. Rather than committing to a model and trusting it for a year, enterprise AI teams are now managing rolling model upgrades, regression testing, and prompt compatibility across quarterly update cycles.

The teams managing this most effectively in 2026 are those with model abstraction layers in their AI infrastructure—routing specific task types through specific models and swapping models at the routing layer without rewriting application logic. Whether GPT-5.6 beats Fable 5 matters less if your architecture allows you to swap in the winner within a week of benchmark publication.

Timeline: what happened

June 26, 2026 — limited preview live (Codex + API, trusted partners).

July 1, 2026 — Fable 5 restored globally; GPT-5.6 preview continues.

July 9, 2026 — @OpenAI rollout thread: Sol, Terra, Luna starting to roll out in ChatGPT, Codex, and API; ALE 53.6, AA Coding Index 80.0, Ultra mode at launch.

July 2026 — Cerebras Sol deployment (up to 750 tps) for select customers.

Watch next — independent AA verification; SWE-Bench Pro Sol scores; tier-by-tier ChatGPT picker rollout.

When your tier unlocks, benchmark Terra for cost and Sol for agentic pipelines — see comparison vs Fable 5.

Updated July 10, 2026 with July 9 rollout benchmarks (ALE, AA Coding Index). Pre-release leak sections retained for context. Verify rollout in your ChatGPT picker on openai.com.

Update — July 9, 2026 (OpenAI rollout): @OpenAI posted:

Sol, Terra, and Luna, our GPT‑5.6 family of models, are starting to roll out now in ChatGPT, Codex, and the API.

New GA benchmarks from the thread:

Benchmark	GPT-5.6 Sol	vs Fable 5
Agents' Last Exam	53.6	+13.1 (adaptive)
Artificial Analysis Coding Agent Index	80.0	+2.8; <½ tokens, ~⅓ cost
Terminal-Bench 2.1 (Ultra)	91.9%	+8.5 pts
Ultra mode	Ships at launch	Parallel multi-agent coordination

Full Fable comparison → · Preview guide · Access timeline

Official OpenAI announcement graphic — Sol, Terra, and Luna rollout July 9, 2026.

TL;DR (July 9 rollout):

Model	Role	API (in/out per 1M)	Key GA score
Sol Ultra	Parallel multi-agent	—	91.9% TB · 53.6 ALE
Sol	Flagship	$5 / $30	80.0 AA Coding Index
Terra	GPT-5.5-class, 2× cheaper	$2.50 / $15	Beats Fable ALE @ ~1/16 cost
Luna	Volume tier	$1 / $6	Same efficiency story
Claude Fable 5	Live July 1	$10 / $50	80.3% SWE-Bench Pro

Walkthrough of the official GPT-5.6 preview — Sol, Terra, Luna tiers, benchmarks, and access timeline.

Official launch (June 26, 2026)

OpenAI published Previewing GPT-5.6 Sol and a preview system card. Key facts:

Three tiers: Sol (flagship), Terra (balanced), Luna (affordable volume)
Naming: generation number + durable tier names (Sol/Terra/Luna advance on their own cadence)
Availability: limited preview via API and Codex for trusted partners; GA planned in weeks
Government: U.S. requested vetted-partner start; OpenAI opposes making that permanent
Modes: max reasoning effort on Sol; Ultra mode uses subagents beyond a single agent
Cerebras: Sol at up to 750 tps in July for select customers

For the full launch narrative — cyber safeguards, ExploitBench², caching rules — see GPT-5.6 Sol, Terra, Luna preview guide.

Pre-release signals (what leaks got right)

Before June 26, GPT-5.6 surfaced only through indirect signals. Most proved directionally correct:

Confirmed improvements (Sol vs GPT-5.5)

OpenAI's official claims concentrate on agentic, biology, and cyber — not single-turn chat polish:

Hands-on with GPT-5: where it exceeds expectations and where it still falls short.

1. Context Window: Up to 1.5 Million Tokens

At 1.5M tokens you can fit roughly:

An entire mid-size software project's worth of source code
A legal document corpus for a full case discovery process
Several full academic papers plus all their cited sources
Hours of meeting transcripts from a long project

2. Agentic Task Completion: Meaningful Reliability Gains

A cleaner reward signal in training that reduces reward hacking in long agent loops
Tighter persona-isolation (the model less frequently "breaking character" or contradicting its system prompt mid-task)
An improved SFT pipeline that doesn't recycle contaminated rollouts—a subtle but important training quality fix that affects how reliably the model follows complex multi-step instructions

3. Refreshed Training Data Through Mid-2026

4. FrontierMath Tier 4 Reasoning

5. Token Efficiency for Long Tasks

GPT-5.6 family vs GPT-5.5: The upgrade picture

Capability	GPT-5.5	GPT-5.6 Sol (official)	Terra / Luna
Terminal-Bench 2.1	88.0%	88.8% (Ultra 91.9%)	82.5% / 84.3%
API input / output	$5 / $30 per 1M	$5 / $30	$2.50/$15 · $1/$6
Agentic modes	Standard	Max reasoning · Ultra subagents	Tiered
GeneBench v1	Baseline	Better, fewer tokens	Improved cyber stack
Availability	Public	Preview → GA weeks	Same

GPT-5.6 vs Claude Fable 5: The Frontier Battle

GPT-5.6's expected profile maps directly onto Fable 5's strongest territory:

Multimodal: Fable 5 is strong on multimodal reasoning. GPT-5.5 Vision already competes here, and GPT-5.6 is expected to maintain or improve that standing.

GPT-5.6 vs Claude Fable 5: Quick Comparison

Dimension	GPT-5.6 (expected)	Claude Fable 5
Input price (per 1M)	~$5.00–$6.00	$10.00
Output price (per 1M)	~$30.00–$35.00	$50.00
Context window	~1.5M tokens	200K tokens
SWE-bench Verified	~87–89% (estimate)	~87%
Agentic task completion	Improved (TBD)	Strong
FrontierMath Tier 4	~40% (estimate)	~36% (estimate)
Training cutoff	~May 2026	~Mar 2026
Multimodal	Strong	Strong
Self-hosting	No	No

What This Means for Developers Right Now

The Bigger OpenAI Cadence Picture

GPT-5.6 is not an event—it's a data point in a pattern. OpenAI has compressed its release cadence to under 60 days between incremental model updates. This means:

GPT-5.5 is already ~8 weeks old at the expected GPT-5.6 release date
A GPT-5.7 would be expected in August 2026
The frontier model you adopt in January may be two model generations behind by July

Timeline: what happened

June 26, 2026 — limited preview live (Codex + API, trusted partners).

July 1, 2026 — Fable 5 restored globally; GPT-5.6 preview continues.

July 9, 2026 — @OpenAI rollout thread: Sol, Terra, Luna starting to roll out in ChatGPT, Codex, and API; ALE 53.6, AA Coding Index 80.0, Ultra mode at launch.

July 2026 — Cerebras Sol deployment (up to 750 tps) for select customers.

Watch next — independent AA verification; SWE-Bench Pro Sol scores; tier-by-tier ChatGPT picker rollout.

When your tier unlocks, benchmark Terra for cost and Sol for agentic pipelines — see comparison vs Fable 5.

Updated July 10, 2026 with July 9 rollout benchmarks (ALE, AA Coding Index). Pre-release leak sections retained for context. Verify rollout in your ChatGPT picker on openai.com.

GPT-5.6 Guide: Sol, Terra, Luna Models, Pricing, and Benchmarks

Official launch (June 26, 2026)

Pre-release signals (what leaks got right)

Confirmed improvements (Sol vs GPT-5.5)

1. Context Window: Up to 1.5 Million Tokens

2. Agentic Task Completion: Meaningful Reliability Gains

3. Refreshed Training Data Through Mid-2026

4. FrontierMath Tier 4 Reasoning

5. Token Efficiency for Long Tasks

GPT-5.6 family vs GPT-5.5: The upgrade picture

GPT-5.6 vs Claude Fable 5: The Frontier Battle

GPT-5.6 vs Claude Fable 5: Quick Comparison

What This Means for Developers Right Now

The Bigger OpenAI Cadence Picture

Timeline: what happened

GPT-5.6 Guide: Sol, Terra, Luna Models, Pricing, and Benchmarks

Official launch (June 26, 2026)

Pre-release signals (what leaks got right)

Confirmed improvements (Sol vs GPT-5.5)

1. Context Window: Up to 1.5 Million Tokens

2. Agentic Task Completion: Meaningful Reliability Gains

3. Refreshed Training Data Through Mid-2026

4. FrontierMath Tier 4 Reasoning

5. Token Efficiency for Long Tasks

GPT-5.6 family vs GPT-5.5: The upgrade picture

GPT-5.6 vs Claude Fable 5: The Frontier Battle

GPT-5.6 vs Claude Fable 5: Quick Comparison

What This Means for Developers Right Now

The Bigger OpenAI Cadence Picture

Timeline: what happened

Related posts

GPT-5.6 Sol, Terra, Luna vs Claude Fable 5: Complete Frontier Comparison

GPT-5.6 Sol, Terra, and Luna: OpenAI Preview Launch Explained

GPT-5.5, Claude Opus, Gemini vs Their Best Local Open-Source Alternatives (2026)

Related posts

GPT-5.6 Sol, Terra, Luna vs Claude Fable 5: Complete Frontier Comparison

GPT-5.6 Sol, Terra, and Luna: OpenAI Preview Launch Explained

GPT-5.5, Claude Opus, Gemini vs Their Best Local Open-Source Alternatives (2026)