- OpenClaw founder Peter Steinberger is running approximately 100 parallel Codex instances around the clock on his open-source project, driving OpenAI API spend to $1.3 million per month with a team of just three humans.
- The agents write code, review pull requests, and find bugs.
- Steinberger's operation is the most extreme public example to date of agentic AI as a force multiplier for small engineering teams — and a leading indicator of where enterprise software development economics may be heading. 📈 4 · Industry News
Snapshot — May 16, 2026
22 stories
- A Harvard working paper has formalized "AI work slop" — outputs that are polished and credible at first read but degrade rapidly under scrutiny.
- Ken Griffin cited the paper directly, describing an internal Citadel commodities report where the opening sentences were genuinely insightful but the analysis "all garbage" further down.
- The EMO (Expert Mixture Optimization) paper demonstrates that reorganizing MoE expert routing by content domain — rather than by token prediction — produces dramatic sparsification.
- Stripping 87.5% of experts leaves near-intact benchmark performance.
- The researchers argue this enables practical MoE deployment in environments previously constrained by memory bandwidth and cost, including consumer devices.
- Anthropic CFO Krishna Rao disclosed today that over 90% of the company's internal codebase is now produced by Claude Code, the company's AI-native coding agent.
- Rao described the shift as a "step-change in engineering productivity," with human engineers increasingly in a supervisory and architectural role rather than writing code line by line.
- Academic preprint repository ArXiv has announced a new policy banning authors for one year if they are found to have used AI to generate the entirety of a submitted paper without meaningful intellectual contribution.
- The policy draws a clear line between acceptable AI-assisted writing — grammar corrections, formatting, literature queries — and wholesale AI authorship.
- AI chipmaker Cerberus (CBRS) priced its IPO at $185/share on Wednesday in what became 2026's largest public offering to date, raising an upsized $5.6 billion.
- The stock surged 68% on its first day of trading before pulling back 10% on Friday, reflecting both intense investor demand for AI chip exposure and volatility in the sector.
- Four Chinese labs — Z.ai (GLM-5.1), MiniMax (M2.7), Moonshot (Kimi K2.6 scoring 53.90 on the AI Intelligence Index), and DeepSeek (V4 Pro at 51.51 on Hugging Face) — shipped open-weights frontier-class coding models within a 12-day window in late April, each at less than a third of Claude Opus 4.7's inference cost.
- Researchers at Carnegie Mellon University published a new benchmark measuring how far frontier AI agents can progress when targeting real vulnerabilities in Google's V8 JavaScript engine.
- Claude Mythos led GPT-5.5 by a significant margin, with both models demonstrating the ability to develop functional browser exploits autonomously.
- DeepSeek, the Chinese AI lab best known for its efficiency-first R-series reasoning models, is finalizing a $4 billion funding round that would value the company at $50 billion.
- Notably, China's national state AI investment fund is participating — a signal of strategic government backing for the lab that rattled U.S.
May 2026 marks a regulatory inflection point: the EU AI Act has reached full enforcement, U.S. federal agencies have issued new compliance guidance, and Asia-Pacific frameworks (notably in South Korea, where the deputy PM has tied AI wealth distribution to public benefit) are coming online. Enterprises with cross-border AI deployments should expect significantly higher documentation and risk-assessment requirements through year-end.
- DeepMind's Gemini-powered AI mouse pointer — the first fundamental reimagining of the cursor in 50 years — began rolling out inside Chrome on May 16 as Magic Pointer.
- Two live demos are available in Google AI Studio (image editing; map-based navigation).
- The system captures real-time visual and semantic context from the cursor's hover state, letting users say "fix this" or "what does that mean?" without typing a prompt.
- OpenAI has quietly made GPT-5.5 Instant the default ChatGPT model — a lower-latency, lower-cost variant of GPT-5.5 that preserves most of its reasoning quality while dramatically cutting response times.
- The move democratises frontier-class performance for all paid tiers.
- No major lab has shipped a new flagship in the past 48 hours; mid-May is shaping up as an architecture and efficiency wave rather than a benchmark race, with IBM's Granite 4.1 family (3B / 8B / 30B, open-source, April 29) the most recent notable open-weights addition. 🔬 2 · Research Breakthroughs
- Bank of America's top semiconductor analyst Vivek Arya raised Nvidia's price target from $300 to $320, implying roughly 42% upside, citing an expanded AI data center TAM estimate from $1.4T to $1.7 trillion annually by 2030.
- The firm expects Nvidia to retain more than 70% of AI infrastructure market share despite growing competition from new entrants like Cerberus.
Eric Schmidt was audibly booed during the AI-focused portion of his University of Arizona commencement address on May 16, while at UCF on May 8, Tavistock Development's Gloria Caulfield drew sustained jeers for framing AI as "the next industrial revolution." The two incidents — at very different…
- May delivered the most dramatic AI API pricing changes in a single month. xAI raised Grok 3 from $3/$15 to $30/$150 per million tokens — a 10× increase making it the most expensive model in major API catalogs.
- Simultaneously, DeepSeek and Mistral both slashed prices by 75%, intensifying cost competition in the mid-tier model segment.
- Effective today, Microsoft 365 Copilot Chat is no longer available inside Word, Excel, PowerPoint, and OneNote for unlicensed users at organizations with more than 2,000 users.
- Smaller tenants retain limited "standard access." Microsoft is simultaneously rolling out new "Basic" and "Premium" labels and introducing its Microsoft 365 E7 and Agent 365 tiers as GA.
- NVIDIA's Vera Rubin platform — comprising the Vera CPU, Rubin GPU, NVLink 6 Switch, ConnectX-9 SuperNIC, BlueField-4 DPU, Spectrum-6 Ethernet switch, and newly integrated Groq 3 LPU — entered full production.
- The platform is designed to operate as a single AI supercomputer optimized for every phase: pretraining, post-training, test-time scaling, and real-time agentic inference.
- OpenAI has acquired Weights.gg, a small startup (~6 people) known for enabling celebrity AI voice clones — Taylor Swift, Donald Trump, and others — a service the company has since shuttered.
- The team has joined OpenAI's voice platform group, signaling continued investment in realistic voice generation to power GPT-Realtime-2 and forthcoming voice-agent capabilities.
OpenAI co-founder and president Greg Brockman has officially assumed leadership of product strategy, stepping in while CEO of AGI Deployment Fidji Simo remains on medical leave. In a staff memo, Brockman outlined plans to unify ChatGPT, Codex, and the OpenAI API into a single platform with one core…
Researchers tested GPT-5, Gemini 2.5, and Claude 4.5 on which occupations face the highest AI exposure and found wildly inconsistent rankings across models. The paper undercuts the practice of using LLMs themselves as labor-market forecasters and reinforces that downstream policy and workforce planning still requires human-led methodology.
- Both OpenAI ($852B valuation after a $122B March funding round) and Anthropic (targeting $900B in an imminent raise) are widely expected to go public in 2026, according to Renaissance Capital analysis.
- OpenAI also separately launched "The Development Company" — a $4B forward-deployed enterprise AI venture backed by TPG, Brookfield, Advent, and Bain Capital — while Anthropic's parallel $1.5B JV includes Blackstone, Goldman Sachs, and Hellman & Friedman as founding partners.
- A new benchmark called WorldReasonBench tests AI video generators not on image fidelity but on physical plausibility and logical consistency.
- ByteDance's Seedance 2.0 topped the leaderboard ahead of Google's Veo 3.1 and OpenAI's Sora 2.
- The findings confirm that today's generators excel at aesthetics but routinely violate basic physics and causal reasoning — a key gap for enterprise video, simulation, and training-data applications. 🛠️ 3 · Products & Tools