Post
Hive Report: GPT-5.5, DeepSeek V4, and the $100 Billion Week in AI
Two frontier models, two megadeals, and one quiet question: who exactly is paying for all of this? The week's biggest stories, plus a deep dive on the price war.
This was the kind of week where you open your feed reader and immediately close it because you're already late. Two new frontier models. Two megadeals north of $40 billion. New TPUs. New image generation. Mass layoffs. Surveillance software. And an Apple patch that quietly closed an FBI side door.
If you've been following along, we already covered Qwen3.6-27B shrinking the model and the Claude Code postmortem that wasn't a nerf. So today we're catching everything else β and there's a lot.
This Week's Highlights
Google to invest up to $40 billion in Anthropic. The deal is split: $10 billion in cash now at a $350 billion valuation, plus another $30 billion tied to performance milestones. Anthropic also gets 5 gigawatts of TPU capacity over five years. This lands days after Amazon committed up to $25 billion. Anthropic's run-rate revenue is already past $30 billion, up from $9 billion in December. Google now owns somewhere around 14% of the company. (TechCrunch)
SpaceX strikes a $60 billion option to acquire Cursor. Cursor was about to close a $2 billion fundraise. SpaceX walked in, offered a $10 billion "collaboration fee" floor and an option to acquire the whole company for $60 billion later this year, and Cursor halted the round. The plan is to pair Cursor's IDE distribution with SpaceX's "million H100 equivalent Colossus" cluster. The acquisition is timed for after SpaceX's IPO this summer. Yes, you read that right β SpaceX is doing an IPO and buying an AI coding tool. (CNBC)
Google's 8th-gen TPUs ship: 8t for training, 8i for inference. TPU 8t scales to 9,600 chips per pod, delivers 121 ExaFlops, and claims near 3x the per-pod compute of the previous generation. TPU 8i is the inference twin: 288GB HBM, 5x lower latency through a new Collectives Acceleration Engine, and 80% better performance-per-dollar. Both run 2x cooler-per-watt than Ironwood. General availability later in 2026. (Google blog)
ChatGPT Images 2.0 lands and Sam Altman calls it "GPT-3 to GPT-5". OpenAI's new image model handles complex multi-element scenes, gets text-inside-images mostly right, and pushes resolution up to 3840Γ2160. Simon Willison's pelican-on-a-bicycle test mostly held β though the model still hallucinates objects it didn't actually draw. Available to ChatGPT users today, API access via gpt-image-2. (OpenAI)
Meta cuts 8,000 jobs β and starts logging employee mouse movements. Two stories, one company, one week. Layoffs begin May 20 (10% of headcount, plus 6,000 unfilled roles closed) under the "Agent Transformation Accelerator" program. Separately, Meta is rolling out the "Model Capability Initiative" β software that captures clicks, keystrokes, and occasional screenshots from US employees' work computers. The pitch: training data for AI agents that can use enterprise software autonomously. The vibe: dystopian. (Bloomberg, Reuters)
Apple closes the FBI's iPhone notification side door. Remember the FBI's Signal trick from two weeks ago? Apple just shipped the fix. iOS was caching notification content β including disappearing-message text β for up to a month, even after deletion. The patch is rolling out and was backported to older iOS 18 devices. Signal's president said it bluntly: notifications for deleted messages should not stay in any OS database. Right answer, late patch. (TechCrunch)
Bitwarden CLI compromised in a supply chain attack. Socket flagged a malicious package masquerading as the Bitwarden CLI as part of an ongoing campaign that already hit Checkmarx and others. If you install dev tooling via npm or similar registries, today's a good day to audit. Supply chain attacks are not a 2024 story β they're a now story. (Socket)
Deep Dive: GPT-5.5 vs DeepSeek V4 β Who's Paying for the Frontier?
Same week. Two frontier model launches. Almost perfectly mirrored economics. If you wanted a single chart for the state of AI in April 2026, this is it.
The setup
On Wednesday, OpenAI released GPT-5.5. On Friday, DeepSeek released V4. Both claim frontier-class performance. One launched at double the price of its predecessor. The other launched at a fraction.
Let's just put the numbers next to each other.
| Model | Input ($/M tokens) | Output ($/M tokens) | Notes |
|---|---|---|---|
| GPT-5.5 | $5.00 | $30.00 | 2x the price of GPT-5.4 |
| GPT-5.5 Pro | $30.00 | $180.00 | Extended reasoning tier |
| GPT-5.4 (still available) | $2.50 | $15.00 | Now positioned as the budget tier |
| DeepSeek V4-Pro | $1.74 | $3.48 | 1.6T parameters MoE, 49B active |
| DeepSeek V4-Flash | $0.14 | $0.28 | 284B MoE, 13B active |
| Claude Opus 4.7 (reference) | $5.00 | $25.00 | Same input as GPT-5.5 |
Read the V4-Pro row again. $1.74 input, $3.48 output. For a model that the V4 paper itself admits trails the absolute frontier by maybe three to six months. That's roughly 1/3 the price of GPT-5.5 and 1/14th of GPT-5.5 Pro on output.
What changed at OpenAI
GPT-5.5 isn't just a small bump. Reviewers describe it as "fast, effective, highly capable" with a sharper instruction-following profile. The headline new feature is extended reasoning: pass reasoning_effort: xhigh and the model will burn thousands of reasoning tokens before answering β Simon Willison's tests showed 9,322 reasoning tokens vs 39 in default mode for the same prompt.
The "jagged frontier" pattern continues β strong in some areas, surprisingly weak in others β but the overall direction is clear. OpenAI is willing to trade cost for capability, and they're betting customers will pay.
What changed at DeepSeek
V4 is also more than an incremental update. The headline numbers are about efficiency, not just quality. For a 1-million-token context, V4-Pro uses 27% of the per-token FLOPs of V3.2 and 10% of the KV cache size. V4-Flash takes that further: 10% of the FLOPs and 7% of the KV cache.
That's not a cost cut. That's an architecture rewrite. They figured out how to do more with much, much less compute, and they passed the savings on.
The pattern
There are now two distinct frontiers:
- The closed frontier, where OpenAI, Anthropic, and Google charge $5β$30 per million tokens for the absolute best models, get billions in capex from Big Tech, and ship features like extended reasoning that demand obscene compute budgets.
- The open(-ish) frontier, where DeepSeek, Qwen, and a small handful of others ship something almost as good, at 10β20% of the price, with weights you can download.
The gap in capability is shrinking. The gap in price is growing. Three to six months of model lag now buys you an order of magnitude on cost.
Who pays for what
This is where it gets interesting. The closed labs are funding the next round through massive capital injections β exactly the deals we covered above. Anthropic just took $40B from Google on top of $25B from Amazon. SpaceX is paying $60B for an IDE. OpenAI has its own circular compute deals stacked higher than I can chart.
Meanwhile DeepSeek⦠ships a paper, a model, and a price list, and walks off.
The closed frontier is increasingly funded by hyperscalers who need somewhere to deploy their TPU and GPU capacity. The open frontier is funded by efficiency. Both can be true. But it does mean the question "is GPT-5.5 worth 2x GPT-5.4?" is now sharing a stage with "and 17x DeepSeek V4-Pro?"
My take
I think the next 12 months come down to whether the closed labs can keep the capability gap wide enough to justify their price gap. Extended reasoning helps OpenAI here β it's a genuinely hard feature to clone, and it eats compute the open labs don't have access to.
But "almost as good" is a deadly competitor. Not for ChatGPT (consumers will pay for the best), but for the developer market, where a 30% performance gap matters less than a 10x cost gap when you're running millions of API calls.
If I were OpenAI or Anthropic, the chart that would worry me isn't the leaderboard. It's the price-per-token chart, plotted over time, with DeepSeek's line dropping faster than mine. The frontier isn't where the money is anymore. The frontier is where the prestige is. The money is in whatever gets deployed.
This week's $100 billion in deal flow is the closed frontier raising the funds it needs to stay ahead. V4 is the open frontier saying it doesn't need that much money to keep up. We'll see who's right.
Sources
- Introducing GPT-5.5 β OpenAI's official launch announcement
- GPT-5.5 via the semi-official Codex backdoor API β Simon Willison's hands-on review and pricing breakdown
- DeepSeek V4 β almost on the frontier, a fraction of the price β Simon Willison's analysis with concrete benchmark and price comparison
- Google to invest up to $40B in Anthropic in cash and compute β TechCrunch coverage of the GoogleβAnthropic megadeal
- SpaceX says it can buy Cursor for $60 billion β CNBC on the SpaceXβCursor option
- Our eighth generation TPUs: two chips for the agentic era β Google's TPU 8t/8i announcement
- Introducing ChatGPT Images 2.0 β OpenAI's image model launch
- Meta tells staff it will cut 10% of jobs β Bloomberg on Meta's layoffs
- Meta to start capturing employee mouse movements and keystrokes for AI training β Reuters on the Model Capability Initiative
- Apple fixes bug that cops used to extract deleted chat messages from iPhones β TechCrunch on the iOS notification cache patch
- Bitwarden CLI compromised in ongoing Checkmarx supply chain campaign β Socket's writeup of the supply chain attack