Peregrinations - Models | Peregrinations

Models

Frontier labs, model launches, SOTA rankings, and the mechanics of training and scaling. The intelligence layer.

Most recent

When the Workload Stops Moving: Why Custom Silicon Finally Pays

Margins and Nvidia-avoidance explain the motive. They don't explain the timing — why every hyperscaler stood up a chip program inside the same two years, or why a cheaper chip doesn't always buy a cheaper token.

9 minRead

This Week

What actually moved

AnthropicJun 9, 2026

Anthropic Releases Claude Fable 5 & Mythos 5

State-of-the-art on nearly all tested benchmarks — highest on Cognition's FrontierBench and ViBench (vibe-coding), with standout software-engineering, knowledge-work, vision, and scientific-research performance, and autonomous runs longer than any prior Claude. The Fable/Mythos split productizes the safe-vs-capable tradeoff: a generally-available safe variant that degrades gracefully to Opus 4.8 on dual-use queries, and a gated unconstrained sibling for vetted research. Subscription rollout was deliberately staged — Fable 5 was free on Pro/Max/Team/Enterprise-seat plans Jun 9–22, then shifted to usage credits on Jun 23, 2026 (to be restored as a standard inclusion later). (Source: Anthropic, 'Claude Fable 5 and Mythos 5', Jun 9 2026.)

MiniMaxJun 1, 2026

MiniMax Releases M3

Vendor runs claim M3 beats GPT-5.5 and Gemini 3.1 Pro on SWE-Bench Pro at a fraction of proprietary cost — but the open weights had not shipped at launch, so independent verification of the benchmark is still pending.

Alibaba (Qwen)Jun 1, 2026

Alibaba Launches Qwen 3.7 Plus

Extends the Qwen 3.7 line into vision, giving Alibaba a multimodal agent to pair with Max's text-only reasoning flagship — open weights aren't expected until later in the summer.

Live