Claude Fable 5 launched — strong benchmarks, $10/$50 per million tokens

Anthropic released Claude Fable 5 (also called Mythos), priced at $10 input / $50 output per million tokens. Early benchmarks and user impressions are largely positive, though usage limits and an alleged deliberate handicap for LLM-dev tasks have stirred debate.

Fable 5 performed strongly in a head-to-head 'AI Tamagotchi one-shot prompt' competition against Gemini 3.5 Flash, GPT 5.5, Opus 4.8, Qwen 3.7 Max, and Deepseek V4 Pro. A benchmark video made with Remotion, a tool that generates video from code, also went viral and highlighted the model's coding and generation abilities. The pricing was striking enough that a new community, r/tokenography, sprang up specifically to share tips on keeping token costs low.
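To make the per-million-token pricing concrete, here is a minimal Python sketch of the arithmetic. The two prices come from the post; the function name `estimate_cost` and the token counts in the example are hypothetical, chosen only for illustration, and real bills may differ if discounts or other fees apply.

```python
# Prices quoted in the post, in USD per one million tokens.
INPUT_PRICE_PER_MTOK = 10.00
OUTPUT_PRICE_PER_MTOK = 50.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MTOK
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MTOK
    return input_cost + output_cost


# Hypothetical request: 20,000 tokens read, 4,000 tokens written.
cost = estimate_cost(20_000, 4_000)
print(f"Estimated cost: ${cost:.2f}")
```

Because output tokens cost five times as much as input tokens at these rates, the 4,000 output tokens in this example cost as much as the 20,000 input tokens.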

Controversy followed quickly. Users on r/LocalLLaMA claim Anthropic is intentionally nerfing Fable when it is asked to help build or improve other LLMs, a sensitive area because such models could compete with Anthropic's own. On r/ClaudeCode, developers reported hitting usage limits faster than expected, which disrupts longer coding sessions. A viral review titled 'The AI Anthropic Said Was Too Dangerous to Release' drew clicks, but its actual content was mostly praise.

Key points

  • $10 input / $50 output per million tokens — competitive pricing for a frontier-tier model
  • Performed strongly against several rivals in a one-shot prompt competition and a viral coding benchmark video
  • Alleged deliberate handicap when the task involves developing or improving other LLMs
  • Usage limits are reached quickly, so plan extended work sessions around them
  • Token-cost community r/tokenography formed organically around its pricing

Quick term guide

Claude Fable 5
A new Claude AI model released by Anthropic in June 2026.
benchmarks
Standard tests used to compare the performance of AI models.
usage limits
Caps on how much you can use a service in a set time before you must wait or upgrade.
one-shot prompt
Giving an AI a single instruction with no examples and expecting a complete, usable result.
Gemini 3.5 Flash
A smaller, faster version of Google's Gemini AI that costs less and responds more quickly than the full model.
token costs
The fees paid for the text an AI model reads and writes.

