Everything you need to know about artificial intelligence from the last 24 hours.
OpenAI co-founder, ex-Tesla Autopilot director and Eureka Labs founder returns to frontier R&D — and will help bootstrap a sub-team using Claude itself to accelerate pretraining research.
nn-zero-to-hero, micrograd, makemore and llm.c tutorials.
"Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time."
— Andrej Karpathy, on X
The hire lands at a delicate moment for OpenAI and slots cleanly into Anthropic's thesis that the next phase of frontier AI is won by automating research itself. Putting Karpathy — arguably the most respected pedagogue/practitioner in the field — on Nick Joseph's pre-training team, and asking him to bootstrap a "use Claude to do pretraining research" team, signals where Anthropic believes its leverage is: not just more H100s, but agentic Claude-on-Claude research loops.
Implications: Expect renewed talent-market pressure on OpenAI; an uptick in Anthropic recruiting momentum; and likely visible product fingerprints over the next 6-12 months in how Claude tools its own training pipeline.
Open-source browser automation built for LLM agents — no Playwright, no Node, just a Rust binary and Chrome for Testing.
npm i -g agent-browser, Homebrew, or Cargo. No Playwright or Node.js required for the daemon.
snapshot (accessibility tree with refs the model can click by), click @e2, fill @e3, screenshot --annotate, pdf, eval <js>, and a built-in chat REPL for natural-language control.
find role button click --name "Submit") and traditional CSS selectors.
react tree, react inspect, react renders), Core Web Vitals reporting, SPA pushstate navigation, and repeatable --init-script + --enable feature flags.
Implications: Positions the browser as a first-class agent surface with primitives that match how LLMs reason (refs from an a11y tree, not pixel-perfect XPath). The React-DevTools integration in particular reads as Vercel making the case that "your agent should understand the framework, not just the DOM."
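The ref-based addressing mentioned above (a snapshot of the accessibility tree whose elements carry refs such as @e2 that the model can click by) can be illustrated with a toy sketch. This is not agent-browser's implementation: the Node class, the snapshot function, the set of interactive roles and the text layout are all hypothetical, chosen only to show why a short ref is an easier target for a model than a long selector.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from itertools import count

# Hypothetical set of roles treated as interactive (an assumption, not from the tool).
INTERACTIVE_ROLES = {"button", "textbox", "link", "checkbox"}


@dataclass
class Node:
    """A simplified accessibility-tree node: a role, a name, and children."""
    role: str
    name: str = ""
    children: list[Node] = field(default_factory=list)


def snapshot(root: Node) -> tuple[list[str], dict[str, Node]]:
    """Flatten the tree into indented lines and tag interactive nodes with refs."""
    lines: list[str] = []
    refs: dict[str, Node] = {}
    counter = count(1)

    def walk(node: Node, depth: int) -> None:
        label = f'{"  " * depth}{node.role} "{node.name}"'
        if node.role in INTERACTIVE_ROLES:
            ref = f"@e{next(counter)}"  # refs are numbered in document order
            refs[ref] = node
            label += f" [{ref}]"
        lines.append(label)
        for child in node.children:
            walk(child, depth + 1)

    walk(root, 0)
    return lines, refs


# A made-up login page used only for illustration.
page = Node("document", "Login", [
    Node("heading", "Sign in"),
    Node("textbox", "Email"),
    Node("textbox", "Password"),
    Node("button", "Submit"),
])

lines, refs = snapshot(page)
print("\n".join(lines))
```

In this sketch the model would see a few short lines of text and could answer with a ref such as @e3; the caller then resolves that ref back to a concrete node through the refs mapping.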
Electric Atlas hoists mini-fridges using proprioception — not vision — after sim-trained RL generalises from 50-70 lb training payloads to 100+ lb in the real world.
"You cannot lift a fridge just by looking at it and using your hands."
— Boston Dynamics Atlas team
Implications: Proprioception-first RL is emerging as the dominant recipe for industrial humanoid manipulation; combined with Hyundai's 30k/year roadmap, this is the clearest Western signal that heavy-duty humanoids are leaving the demo phase.
Apollo pilots with Mercedes-Benz and GXO Logistics scale up; headcount grows from ~300 to ~500.
Implications: US humanoid leaders are pivoting from R&D-mode to industrial buildout (factories, training data, headcount) — capital and footprint are becoming the moats, not just robot demos.
Average selling price projected to drop from ~$115k (2024) to ~$37k by 2030; commercial scale still gated by capability gaps.
Implications: For the first time the unit-economics narrative is being pegged to specific payback windows — but IDTechEx explicitly cautions that effective task generalisation, not cost curves, decides which vendors actually scale.
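As a rough check on the price projection above, the two quoted endpoints (~$115k in 2024, ~$37k by 2030) imply a compound annual change. The sketch below assumes a smooth, constant yearly rate between those endpoints, which the forecast itself does not state; the function name is illustrative only.

```python
def implied_annual_change(start_value: float, end_value: float, years: int) -> float:
    """Constant yearly rate that turns start_value into end_value over `years` years."""
    return (end_value / start_value) ** (1 / years) - 1


# Endpoints as quoted in the item; both are approximate figures.
rate = implied_annual_change(115_000, 37_000, 2030 - 2024)
print(f"Implied average annual price change: {rate:.1%}")
```

Under that constant-rate assumption, the average selling price would need to fall by roughly 17% per year.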
~$35k one-way attack drones get swarm-capable AI — one operator commands the swarm, humans retain strike authority.
"Mass without coordination is limited in value. Hivemind is the AI pilot that makes that mass intelligent."
— Brandon Tseng, President, Shield AI
Implications: LUCAS gives the Pentagon a swarm-ready Shahed-class attritable munition, and Hivemind — already on one-way attack drones in Ukraine — becomes a de facto autonomy reference stack for cheap mass strike.
JIATF 401 deal covers AI-enabled interceptors and strike drones already battle-tested against Iranian UAVs.
"Drones are the defining threat of our time. Deploy and scale low-cost, attritable air-to-air drone interceptors at all our facilities."
— Brig. Gen. Matt Ross
Sit(x) + zero-trust + DETS fold into Anduril's C4 and Lattice — operational integrations demoed at SOF Week 2026.
"Allows us to connect to compromised networks all over the world and still utilize mission applications without compromising the operators on the ground."
— Brett Melancon, Anduril Chief Solutions Architect
SOCOM + Global SOF Foundation event runs May 18-21; exhibition floor live May 19-21.
Implications: SOF Week has become the de facto deal floor for autonomy-at-the-edge vendors — European players are now planting U.S. flags directly into SOCOM's backyard rather than approaching through Beltway primes.
Second consecutive delay for SpaceX's redesigned V3 stack; Booster 19 + Ship 39 stand 408 ft tall with Raptor 3 engines.
Implications: NASA's Artemis IV lunar landing in 2028 depends on Starship working as the Human Landing System; a clean V3 debut is on the critical path. Multiple slips with no public technical explanation suggest preflight checks on the new pad and Raptor 3 stack are running tight.
Falcon 9 lifts 24 Starlink v2 Mini from SLC-4E — 58th SpaceX launch of 2026, 612th booster recovery overall.
Mission re-scoped as crewed Earth-orbit rendezvous-and-docking test with commercial landers; SLS core stage mated to engine section May 12.
Implications: Re-scoping Artemis III away from a surface landing is a tacit acknowledgment that neither SpaceX's Starship HLS nor Blue Origin's MK2 lander will be flight-ready on the original timeline — making Starship V3 Flight 12 a higher-stakes test for the entire Artemis program.
Series B co-led by Overmatch, BlackRock and 8090 Industries; Galleon Forge One will manufacture megawatt-scale modular data centers for edge AI.
"The AI race will be won by companies that manufacture, deploy, and improve infrastructure with speed, scale and sovereignty."
— Dan Wright, CEO, Armada
Implications: Modular, edge-sited AI compute is becoming an investable subcategory alongside the GPU-cloud goliaths. Expect more strategic capital from industrial OEMs (HVAC, power, defense primes) entering infra-startup cap tables.
Highland Europe leads; LLM-agnostic enterprise AI delivery with outcome-based pricing — only pay after live trial proves out.
"Every enterprise we speak with has a backlog of high-impact AI use cases and almost nothing in production. We built Unframe to close the gap between ambition and execution."
— Shay Levi, CEO, Unframe
Implications: Outcome-based pricing flips the usual enterprise pilot script. If this scales, expect copycats in consulting-heavy verticals and pressure on OpenAI's newly announced Deployment Company.
Menlo Ventures leads; 10,000+ dietitians + AI agents covered for 200M+ lives, documented A1C and LDL deltas at scale.
Implications: The "human expert + AI agent" hybrid is becoming the dominant clinical-AI archetype for regulated, insurance-paid care.
Ceiling sensors + RFID + computer vision cut BOPIS cancellation rates from 25% to 3% in deployed stores.