Open SourceImportance: High

Claude Fable 5 launched — strong benchmarks, $10/$50 per million tokens

r/singularityJun 10, 2026 · 15h ago

Anthropic released Claude Fable 5 (also called Mythos), priced at $10 input / $50 output per million tokens. Early benchmarks and user impressions are largely positive, though usage limits and an alleged deliberate handicap for LLM-dev tasks have stirred debate.

Fable 5 showed up strongly in a head-to-head 'AI Tamagotchi one-shot prompt' competition against Gemini 3.5 Flash, GPT 5.5, Opus 4.8, Qwen 3.7 Max, and Deepseek V4 Pro. A benchmark video using Remotion — a tool that generates video from code — also went viral, highlighting its coding and generation abilities. The pricing was striking enough that a new community, r/tokenography, sprang up specifically to share tips on keeping token costs low.

Controversy followed quickly. Users on r/LocalLLaMA claim Anthropic is intentionally nerfing Fable when it's asked to help build or improve other LLMs — a sensitive area for Anthropic as a competitor. On r/ClaudeCode, developers reported hitting usage limits faster than expected, which disrupts longer coding sessions. A viral review titled 'The AI Anthropic Said Was Too Dangerous to Release' drew clicks, but the actual content was mostly praise.

Key points

$10 input / $50 output per million tokens — competitive pricing for a frontier-tier model
Outperformed several rivals in one-shot prompt tests and coding benchmarks
Alleged deliberate handicap when the task involves developing or improving other LLMs
Usage limits fill up quickly — plan around rate limits for extended work sessions
Token-cost community r/tokenography formed organically around its pricing

Quick term guide

Claude Fable 5: The name of an AI tool or model mentioned in the post, but the item does not give enough information to verify details.
Claude Fable: A new Claude AI model released by Anthropic in June 2026
benchmarks: Benchmarks are standard tests used to compare performance.
usage limits: The amount you are allowed to use a service before you must wait or upgrade.
usage limit: A usage limit is a cap on how much you can use a service in a set time.
one-shot prompt: Giving an AI a single instruction with no examples and expecting a complete, usable result.
Gemini 3.5 Flash: A smaller, faster version of Google's Gemini AI that costs less and responds more quickly than the full model.
token costs: Token costs are the fees paid for the text an AI model reads and writes.

Sources covering this story (11)

Read original ↗