The Parallel Stack — Daily Magazine · No. 9

Issue

No. 9 · April 28

Spreads

Twelve

For You

Five flagged

Anchor

The proxy worker

01 · Lead story · Cloud politics

No. one

OpenAI buys its way out of Azure with a fifty-billion Amazon line.

The renegotiated Microsoft pact, finalized April 27, replaces the old exclusivity with a non-exclusive license through 2032 — and ends Microsoft's threat of legal action over OpenAI's $50B AWS deal announced in February. Microsoft keeps its ~27% equity, a revenue share through 2030, and the right to call Azure the "primary" partner for new products. OpenAI gets to sell every product across every cloud. The stateless API stays Azure-exclusive on paper; everything else, including the Amazon-funded training runs, is fair game.

Read on TechCrunch →

TechCrunch · April 27, 2026 · Multi-cloud is now policy

02 · Frontier research · London

No. two · Funding

A $1.1 billion seed for an AI that refuses to read us.

David Silver — the DeepMind veteran behind AlphaGo and AlphaZero — left Google to start Ineffable Intelligence, and walked into the largest seed round Europe has ever recorded: $1.1B at a $5.1B valuation, led by Sequoia and Lightspeed, with checks from Nvidia, Google, Index, and the UK government's Sovereign AI fund. The thesis is a return to first principles: a "superlearner" that discovers knowledge through reinforcement learning without leaning on human-generated data. Silver is pledging 100% of his personal equity gains to Founders Pledge.

Read on TechCrunch →

Beijing · Order to unwind

03 · Geopolitics · M&A

No. three

China cancels Meta's $2B Manus deal — after the deal already closed.

The NDRC blocked Meta's $2B acquisition of agentic-AI startup Manus on April 27, citing export-control and overseas-investment compliance. The catch: Meta closed the deal in late December, integrated Manus into its internal stack, and seated Manus's founders on its agent team. Now they have to pull all of it back out. The order doubles as a signal — Beijing is willing to claw back deals that route Chinese-founded AI talent into US frontier labs, even after the wires have cleared.

Read on TechCrunch →

$ cursor changelog --release 3.2

No. four · Dev toolsFor You

cursor 3.2: async subagents, multi-root workspaces, cross-repo edits.

The headline feature in Cursor 3.2 (April 24): async subagents you fire-and-forget, plus multi-root workspaces that let a single session edit across repos. The companion 3.1 release pushed Bugbot's autofix resolution rate to ~80% (15 points ahead of the next AI code review tool), shipped /debug in the CLI for hypothesis-driven root-cause work, and introduced Canvases — interactive React surfaces inside the agent window for evals, PR reviews, and research. The IDE is quietly becoming an agent dispatcher.

→ cursor.com/changelog

05 · Leaked feature · Claude Code

No. fiveFor You

Anthropic's Bugcrawl: ten Claudes in parallel, hunting for what you didn't ask about.

Bugcrawl appeared in Claude Code preview builds this week as a dedicated side-nav entry — a tool that scans a repo with ten parallel agents looking for general correctness issues, the messy band that sits between automated security scans and PR-level review. It rounds out Anthropic's plan to push Claude Code from a reactive assistant into a proactive QA mechanism. Like its Security and Code Review siblings, the working assumption is Teams and Enterprise pricing. No release window yet, no production builds — but the side-nav slot is real, and the orchestration layer it leans on (parallel sessions with shared context and dependency awareness) is already shipping.

Read on TestingCatalog →

06 · Capital pulse · Q2 surge

No. six

$1.5B+

Monday's other rounds, all closed in a single news cycle.

Outside Ineffable's record seed: Avoca AI raised $125M led by Kleiner Perkins to automate missed-call answering for HVAC, plumbing, and roofing — a $1B valuation built on home-services dispatch. Shanghai's Robot Era closed $200M+ for industrial humanoids. Quantum Art extended its Series A by $140M for trapped-ion quantum systems. And Google stood up a separate $750M fund earmarked for partners building agents on Gemini Enterprise. The applied-AI round is no longer a side track — it's where the cleared paperwork lands.

Read on Tech Startups →

07 · Postmortem · Claude Code

No. sevenFor You

Three bugs, three timelines, one illusion of decline.

Anthropic's April 23 postmortem on weeks of Claude Code complaints reads like a primer on how concurrent regressions disguise themselves as drift. Three independent changes overlapped: a default reasoning-effort downgrade (March 4 → April 7), a thinking-cache bug that discarded reasoning history every turn instead of once (March 26 → April 10), and a 25-word-between-tool-calls verbosity prompt that cost 3% intelligence (April 16 → April 20). API was clean throughout. Different user cohorts hit different bugs on different days — which is why the early complaints "didn't reproduce" and why the team initially missed it.

Reasoning effort default downgradeMar 4 → Apr 7

Thinking-cache discard bugMar 26 → Apr 10

25-word verbosity system promptApr 16 → Apr 20

Read on anthropic.com →

08 · Vibe-coding case study · iOS

No. eightFor You

Fourteen thousand lines of Swift, written entirely through conversation.

Stripe's design manager Kris Puckett shipped Epilogue — an ambient reading-tracker iOS app — to the App Store after months of building it almost exclusively through Claude Code. No Figma, no traditional design tools: 14,000 lines of Swift, multi-column-aware camera quote capture, custom Metal shaders, all driven by spoken intent and feedback loops. His takeaway, repeated in interviews this week: the bottleneck was never coding ability. It was articulation — describing the thing precisely enough that a model could build it. A useful data point on what a senior IC actually ships when the editor disappears.

View Epilogue on App Store →

09 · Pricing · API economics

No. nine

DeepSeek cuts cached-input pricing to one-tenth across the V4 line.

Days after the V4 Preview drop (V4-Pro at 1.6T total / 49B active, V4-Flash at 284B / 13B, both 1M context), DeepSeek slashed input-cache prices to 1/10 of the original across the entire API series. V4-Flash now bills $0.14/M input and $0.28/M output; V4-Pro $1.74/M input and $3.48/M output — with the cache-hit path an order of magnitude cheaper. V4-Pro's Codeforces rating of 3,206 already eclipsed GPT-5.4's 3,168 at release. The frontier-quality coding model with a 1M window now costs less than most teams' staging bills.

Read on DeepSeek API docs →

10 · Model release · OpenAI

No. ten

GPT-5.5 arrives as a unified "super app."

OpenAI's GPT-5.5 announcement, the day before the Microsoft renegotiation cleared, is the strategic complement to it: one model spec that fuses ChatGPT, Codex, and the browser into a single front-end. Vercel's AI Gateway picked it up the same week alongside DeepSeek V4 (Pro and Flash) — both pushed as more token-efficient surfaces for long-running agentic coding, computer-use, and research workflows. The pattern is consistent across providers this month: collapse the surface, expand the context, charge for the thinking, give the chat away.

Read on CNBC →

From Hacker News.

Top 5 · Last 24 hours · Firebase API

The front page, distilled.

Editor's notes · April 28

01

Talkie: a 13B "vintage" language model from 1930

389 points · 141 comments · @jekude

A 13-billion parameter LM trained exclusively on text published before 1930 — a deliberate cutoff that excludes the entire arc of modernist English, computers, and most of the engineering vocabulary the field is built on. The site frames it as a research artifact about how grammar, style, and "world model" depend on training-data era; the demo replies in the cadence of a Victorian periodical. Useful as a counterweight when teams insist on always-fresh data — sometimes the constraint is the experiment.

HN thread →

02

Is my blue your blue?

570 points · 383 comments · @theogravity

An interactive that asks where you personally draw the line between blue and green by walking you through ~100 hue swatches, then plots your boundary against the global distribution. The viral surface hides a surprisingly clean piece of perceptual-science instrumentation: monitor calibration caveats spelled out, a simple log-likelihood model fit to your answers, and an honest discussion of why the population mean isn't the "right" answer. Worth running with a colleague to see how far apart your two cutoffs are.

HN thread →

03

GTFOBins

223 points · 63 comments · @StefanBatory

The classic curated index of Unix binaries that can be abused — when SUID, sudo, or capabilities are misconfigured — to bypass restricted shells, exfiltrate files, or pop a root shell. The post resurfacing on the front page is a reminder, not a launch: if you ship Linux containers or harden multi-tenant systems, GTFOBins is the canonical "what a low-priv attacker has on hand" reference. Pair it with LOLBAS for the Windows mirror.

HN thread →

04

An update on GitHub availability

71 points · 31 comments · @salkahfi

GitHub posted a public-availability update covering recent regional incidents and the changes underway in their networking and traffic-management layer. The interesting part isn't the apology — it's the operational shape: more aggressive traffic shaping at the edge, a renewed focus on regional failure containment, and a candid note that "more visible status for partial outages" is now a roadmap item. If you've been getting flaky pushes in the past two weeks, the timeline lines up.

HN thread →

05

WASM is not quite a stack machine

55 points · 16 comments · @signa11

A short, sharp blog post that pokes at the common shorthand "WASM is a stack machine." The author walks through validation rules, typed locals, and structured control flow to show that the runtime is closer to a typed register-with-stack-conventions hybrid — which matters if you're writing a compiler backend, a JIT, or trying to reason about why certain stack manipulations are illegal. Compact reading; the diagrams are the payoff.

HN thread →

Architecture in the wild.

Anchor piece · April 28

Cloudflare Engineering · 30-day report

The single proxy worker: Cloudflare's internal AI stack, built on the platform they ship.

By Ayush Thakur, Scott Roe-Meschke, Rajesh Bhatia · Cloudflare Blog · April 20, 2026

Cloudflare published a rare full-stack tour of how they actually run AI internally — not a vendor pitch, but a 30-day production accounting of the architecture every Cloudflare engineer touches when they prompt anything.

The shape of it: every internal AI request passes through a single proxy Worker that fronts AI Gateway, which fronts both frontier models (~91% of requests) and Workers AI (~9%, for the cost-sensitive long tail). Auth is Cloudflare Access at the edge; portal layer collapses MCP tool schemas via "Code Mode" so token budgets don't explode; agentic state lives in McpAgent and Durable Objects via the Agents SDK; sandboxed code execution runs in dynamic Workers; orchestration uses Cloudflare Workflows; and a Backstage-based knowledge graph plus repo-level AGENTS.md files give every agent the context map it needs.

3,683

Active users · 30 days

241B

Tokens · AI Gateway

295

Teams using agentic tools

The architectural insight that earned the lede: by routing everything through a proxy Worker from day one, they could add per-user attribution, model-catalog management, and permission enforcement after the fact, without changing a single client config. It's a textbook lesson in placing the indirection point before you know what you'll need it for. The same essay quietly admits the cost: portal-level Code Mode requires architectural changes upstream, and the 91/9 split between frontier and in-house inference is an active cost-management lever, not a stable equilibrium.

"Centralizing through a Worker meant we could add per-user attribution, model catalog management, and permission enforcement later without touching any client configs."

Why it matters to anyone shipping agentic systems: the post is one of the few public, instrumented references for what an actual production-grade AI platform looks like once 60% of the company is in it. If you're scaffolding internal AI tooling now, the order of operations is the lesson — proxy first, governance second, models third — not the specific Cloudflare primitives.

Read on the Cloudflare blog →