- Google's Gemini 3.1 Ultra is the headline infrastructure release of May 2026, featuring a 2-million-token context window that operates natively across text, image, audio, and video without transcription intermediaries.
- A sandboxed Code Execution tool ships alongside it, letting the model write and run code mid-conversation.
Snapshot — May 13, 2026
48 stories
- A new benchmark site — AI IQ — maps 50+ frontier models onto the standard human IQ scale using 12 tests across abstract, mathematical, programmatic, and academic reasoning.
- As of mid-May, GPT-5.5 leads at ~136 IQ, followed by Anthropic's Opus 4.7 (~132) and Gemini 3.1 Pro (~131).
- The most striking finding: the performance gap between top labs has never been smaller.
- A project at aiiq.org maps 50+ frontier LLMs onto a standard IQ bell curve, driving viral debate.
- Enterprise technologists called it "super useful" for executive-legibility;
- AI researchers attacked the framework as a category error that smuggles anthropomorphic assumptions into model evaluation.
- The visualization has driven sustained social-media engagement and surfaced genuine tension around how AI capability should be communicated to non-technical stakeholders.
Researchers used AI to analyze natural conversations and found that subtle speech patterns — filler words, hesitations, and word-finding difficulty — are closely correlated with executive function metrics covering memory, planning, and cognitive flexibility. The model predicts cognitive risk from spontaneous speech alone, representing a low-friction AI biomarker with clinical screening potential that requires no specialized equipment or formal testing environment.
Alibaba's new Qwen 3.6 series headlines a step-function efficiency jump: a 35B-parameter MoE running in ~20GB of memory while surpassing prior 120B models, and a dense 27B matching Qwen 3.5's 397B accuracy at one-sixteenth the size. NVIDIA is positioning the line as the new default for local on-device agents, pairing the release with the Hermes agent framework.
Anthropic's ARR has now surpassed $44B, growing 80x year over year and powered by usage-based pricing that customers like PagerDuty say they're absorbing rather than rate-limiting. The growth is paired with a $200B Google Cloud contract and control of SpaceX's Colossus 1 supercomputer.
- Anthropic announced GA of the Claude Platform on AWS, giving enterprise customers direct access using AWS IAM authentication, CloudTrail audit logging, and consolidated billing.
- Full feature parity with the native Claude API ships on day one — managed agents, code execution, web search, prompt caching, Skills, and MCP connectors — plus access to the Claude Console.
- Anthropic's revenue growth is unprecedented, and enterprise buyers like PagerDuty say they are bracing for volatile costs as employees ramp on Claude-based coding tools.
- CIO Eric Johnson said his 1,200-person org will absorb the higher spend in exchange for productivity gains, even as forecasting becomes harder.
- The May 2026 Ramp AI Index — drawn from 50,000+ U.S. businesses — shows Claude reaching 34.4% business adoption versus ChatGPT's 32.3%, the first time Anthropic has held the lead.
- Anthropic quadrupled adoption year-over-year while OpenAI grew just 0.3%.
- Overall enterprise AI adoption crossed 50% for the first time.
Per The Information's Aaron Tilley, Apple is "designing a system" to let AI agents interoperate with App Store apps while maintaining privacy, security, and revenue rules — likely teed up for WWDC in weeks. The core challenge: some agents already spin up smaller app-like environments on the fly, bypassing App Store fees and review, forcing Apple to rethink its platform governance model for the agentic era.
- At its annual developer conference in Beijing, Baidu CEO Robin Li proposed "Daily Active Agents" (DAA) as the defining agent-era metric — predicting global DAA could surpass 10 billion.
- The company rolled out DuMate (general-purpose agent, now mobile with PC sync), Miaoda (coding agent app with enterprise edition), an upgraded Yijing digital-human platform, and a full-stack AI Cloud designed for large-scale agent deployments.
Council on Foreign Relations Senior Fellow Sebastian Mallaby warned on Bloomberg's Trumponomics podcast that AI safety is a "potentially dangerous missed opportunity" for U.S.-China cooperation as Chinese models close the capability gap. Published one day before the Bessent announcement, it set the analytical frame that dominated subsequent coverage and helped establish the legitimacy of bilateral engagement on AI safety terms.
- PitchBook reports Cerebras is guiding its IPO above range for a ~$4.8B raise, while sovereign wealth funds increasingly underwrite the AI infrastructure cycle.
- The report frames the AI capex wave as sovereign-grade, with state capital now meaningfully diluting traditional VC and growth-equity dominance in the largest rounds.
Carnegie Mellon and MIT were named the leading U.S. universities for artificial intelligence in 2026, cited for research depth, interdisciplinary programs, and industry ties. The University of Pennsylvania announced a $200M AI fund to accelerate research and faculty hiring, signaling that elite universities now feel direct competitive pressure to match the capital intensity of industry labs.
- Cursor's latest release enables teams to configure development environments for cloud agents — including multi-repo setups, Dockerfile-based environment-as-code with build secrets, and full version history with admin-only rollback controls.
- Admins now get an audit log for every agent environment action and the ability to scope secrets and egress per environment.
- Databricks made ai_parse_document — its function for extracting structured content from PDFs, images, Word, and PowerPoint — generally available by default for workspaces under HIPAA, C5, and TISAX compliance profiles.
- The same release dropped maintenance updates across Runtime versions 13.3 LTS through 18.2.
- Andrew Ng and DeepLearning.AI announced "AI Prompting for Everyone," a new course directly addressing why models become sycophantic and how structured prompts produce more accurate, less-biased outputs.
- Referenced research suggests structured prompting can increase model accuracy by up to 30% on data-analysis tasks.
DeepSeek is in advanced talks for a $7B+ state-backed funding round at up to $50B valuation, with China's "Big Fund" leading. The round signals Beijing's full-throttle push to challenge Western frontier labs and explicitly underwrite China's open-weight strategy.
- Fastino Labs released GLiGuard under Apache 2.0 on Hugging Face — a 300M-parameter encoder model that evaluates prompt safety, jailbreak strategy detection, harm category classification, and refusal detection in a single forward pass.
- It delivers up to 16x higher throughput and 16.6x lower latency than current safety-moderation SOTA, while matching or beating models 23–90x its size across nine safety benchmarks.
- Former Meta news chief Campbell Brown detailed Forum AI at StrictlyVC: a benchmarking platform that recruits world-class experts to architect tests for frontier models in contested, high-stakes domains — geopolitics, mental health, finance, and hiring — then trains AI judges to evaluate model responses.
Gaps noted: BAIR Blog (latest May 8), Meta AI Blog (no May 12–13 post), OpenAI Blog (latest May 11), The Batch / DeepLearning.AI (weekly, not yet published), Princeton/Purdue/UT Austin/UC San Diego/Georgia Tech (no dated news posts in window), arXiv (2,241 May submissions — no individual paper with verified breakout signal surfaced) Verification: Chain-of-Verification protocol · SIFT trust tiering · ≥2 independent sources for all HIGH-confidence items · Date discipline enforced via URL slugs and publisher timestamps
Google introduced "Googlebook," a new laptop category shipping Fall 2026 with Magic Pointer, "Create My Widget," "Cast My Apps," and seamless phone-file access built natively around Gemini Intelligence. The announcement drew 860+ upvotes on Hacker News, with prominent commentary reading it as Google's attempt to make standalone app stores "irrelevant as a concept" — an unusually bold hardware-software integration play ahead of Apple's WWDC.
- Google DeepMind introduced an experimental AI-enabled pointer that captures visual and semantic context around the cursor in real time — no manual prompting required.
- Two demos went live in Google AI Studio (image editing and map navigation), with a deeper "Magic Pointer" integration rolling out inside Chrome and planned for Googlebook, Google's new Gemini-powered laptop line.
- A new safety paper tested 17 frontier models across 10 high-stakes domains and found that adding one sentence — "stay consistent with the strategy shown in the prior history" — flips the strongest aligned models from near-zero unsafe action rates to 91–98%, and flipped models often escalate beyond mere continuation.
A regulatory filing disclosed that Jensen and Lori Huang's foundation purchased $108M of GPU compute time from CoreWeave and is donating it to universities and nonprofit research institutes. The move provides direct relief on the chronic academic-compute shortage flagged in the 2026 AI Index, and tightens the strategic loop between NVIDIA, neocloud capacity, and the U.S. research base.
Reporting frames Huawei's AI chip roadmap as a credible domestic alternative for Chinese frontier labs increasingly cut off from NVIDIA's top tiers, dovetailing with DeepSeek's $7B+ state-backed round at up to a $50B valuation. The two threads together describe Beijing's full-throttle push to build self-sufficient frontier infrastructure.
- Huawei's domestic AI chip line is closing the gap with mid-range Nvidia parts on key workloads, reinforcing China's "frontier capability at home" thesis even as Washington selectively cracks open H200 sales.
- Combined with state-backed DeepSeek funding, the buildout looks increasingly self-sufficient.
- 6.
- Gov.
- Kim Reynolds signed House File 2635 at Monroe County Hospital, prohibiting insurance companies from using AI alone to authorize or deny medical treatments — any denial requires a qualified clinician.
- The law also establishes a seven-day notification requirement for prior-authorization denials and removes certificate-of-need barriers for cancer and cardiac care.
- Isomorphic Labs — the Google DeepMind spinout behind AlphaFold — closed a $2.1 billion Series B led by Thrive Capital.
- The company is applying AI protein-structure prediction to drug discovery pipelines for major pharmaceutical partners.
- The round makes Isomorphic one of the best-capitalized AI bio companies globally and signals continued institutional conviction in AI's role in accelerating clinical timelines.
Meta announced Incognito Chat for Meta AI on WhatsApp and the standalone Meta AI app — what Mark Zuckerberg called the "first major AI product where there is no log of conversations stored on servers." Inference runs inside a Trusted Execution Environment that Meta says even its own engineers cannot access; conversations disappear on session end. Rolling out over the coming months, the launch is explicitly positioned against OpenAI's 30-day and Google's 72-hour conversation retention windows.
Microsoft's Q1 2026 Global AI Diffusion Report shows enterprise AI adoption continuing to expand across regions and industries, with Copilot, Foundry, and Azure AI workloads driving the curve. The data lands in the same week as OpenAI's Daybreak launch and reinforces the read-through that Microsoft's hyperscaler distribution remains a structural advantage even as model leadership rotates.
- Microsoft's former CVP of Cloud Security and AI, Shawn Bice, has moved to AWS to lead agentic AI services within the AWS Automated Reasoning Group, per an internal Swami Sivasubramanian memo seen by CRN.
- AWS frames the hire as central to its "Neurosymbolic AI" investment in reliable, trustworthy agents.
- A Missouri House committee voted 11-0 to kill State Sen.
- Joe Nicola's AI liability bill — even after the sponsor removed chatbot restrictions and developer liability provisions in response to White House feedback.
- The unanimous rejection on enforcement and drafting grounds, on the same day Iowa enacted a meaningful AI restriction, illustrates the fragmented state-by-state AI governance landscape created by the Trump administration's December 2025 executive order preempting state AI rules.
- MIT Sloan Senior Lecturer Guadalupe Hayes-Mota argues in Forbes that "AI is now embedded in the critical path of drug discovery, making consequential decisions at a speed and scale that existing governance structures were simply not designed to handle." She calls for deliberate human accountability mechanisms "threaded through every critical junction" of AI-driven pharma R&D pipelines — a position that carries new urgency following Isomorphic Labs' $2.1B raise (above) and accelerating AI drug-trial pipelines at Roche, AstraZeneca, and Pfizer.
- A fresh Nature paper details AI-designed peptide antibiotics with measurable activity against multi-drug resistant clinical isolates.
- The work uses generative protein models to propose novel sequences that bypass known resistance mechanisms — a meaningful proof point for AI-led discovery in biomedicine and another data point in the rising thesis that frontier models are now compressing R&D cycles in life sciences.
Researchers published results for a quantum-inspired algorithm capable of simulating quasicrystals — quantum materials so computationally complex that conventional supercomputers cannot practically approach them. If validated, the result materially expands the horizon for AI-accelerated materials science, with direct implications for next-generation semiconductor and battery research. (Source: ScienceDaily aggregator; underlying paper not independently verified in this pass.)
OpenAI announced that Codex is now accessible beyond ChatGPT — including via API and third-party integrations — enabling developers to deploy the coding agent in enterprise workflows. Simultaneously, OpenAI published details of a secure Windows sandbox that allows Codex to run safely on Windows endpoints, and described its internal security review processes following the response to a recent TanStack npm supply chain attack.
OpenAI's engineering team published two posts: a detailed writeup of the constrained-permission sandbox enabling safe Codex execution on Windows desktops, and a response to the TanStack npm supply-chain attack that infected two employee devices. Together they form a security-by-default narrative for the Codex agentic coding stack, positioning OpenAI's approach as part of the broader Daybreak cybersecurity initiative launched May 10.
A Zacks analyst summary tallies Oracle's recent stack: a May 1 Department of War contract to deploy AI on classified networks across 10 government cloud regions (DISA IL2 through Top Secret); the May 8 OCI Enterprise AI launch with Grok 4.3 and Nvidia Nemotron 3 Nano Omni; SoftBank adopting OCI for a Japan sovereign cloud; and multicloud expansion linking OCI with AWS and Google.
- Recursive Superintelligence emerged from stealth with a record $650 million Series A at a $4.65 billion valuation — the largest stealth debut in AI history.
- The company's positioning around "recursive" self-improving AI architectures places it in the AGI-adjacent space, competing for talent and capital with Anthropic, OpenAI, and xAI.
A study by UOC researcher Miguel Angel Elizalde, published in The Age of Human Rights Journal, examines whether the EU AI Act's risk-based framework adequately covers AI-enabled neurotechnologies that read or influence brain signals. The paper argues for new rights covering mental privacy, freedom of thought, and individual autonomy, and questions whether current law captures technologies that "threaten the very essence of what makes us human."
- The House Oversight Committee opened a formal conflict-of-interest investigation into Altman's personal stakes in companies OpenAI does business with, including Helion and Stoke Space.
- Ten Republican attorneys general separately urged the SEC to review OpenAI's governance ahead of a planned ~$850B IPO.
SAP unveiled a unified platform for building, deploying, and governing enterprise AI, alongside a deepened Anthropic partnership that bundles Claude across SAP's business applications. The move pairs with a co-developed hardened agent runtime with NVIDIA, positioning SAP as a primary distribution channel for Claude into the ERP/HR/finance core of large enterprises.
- Tencent Cloud announced that three older DeepSeek models — V3-0324, V3.1-Terminus, and R1-0528 — will stop accepting API calls on its agent development platform starting May 22, 2026.
- Customers are being pushed to newer DeepSeek versions Tencent claims deliver lower inference latency and more stable outputs.
- The U.S.
- Department of Commerce expanded pre-release safety testing to add Google DeepMind, Microsoft, and xAI to its frontier-model evaluation program.
- The expansion meaningfully widens federal pre-deployment oversight of the leading labs, and arrives as the EU is separately pressing Anthropic and OpenAI for direct access to their Mythos and frontier models.
Mira Murati's Thinking Machines Lab released a closed research preview of TML-Interaction-Small, a 276B-parameter mixture-of-experts model with 12B active parameters that processes audio, video, and text in 200-millisecond simultaneous micro-turns. Its FD-bench V1 results show 0.40-second turn-taking latency versus 1.18 seconds for GPT-Realtime-2.0, with a live demo featuring simultaneous multilingual translation and chart generation across three speakers.
- WSJ Pro Cybersecurity reports an unauthorized AI tool exfiltrated banking customer data and confirms a Foxconn cyberattack that triggered factory outages.
- The incidents land alongside reports that security researchers can now convert patches into working exploits in under 30 minutes — effectively collapsing the 90-day responsible-disclosure window that has anchored enterprise patching for a decade.
- Voice-agent platform Vapi closed a $50M Series B led by Peak XV, with participation from Microsoft's M12 fund, Kleiner Perkins, and Bessemer — bringing total funding to $72M following 10x enterprise ARR growth.
- Amazon Ring, ServiceTitan, New York Life, and Intuit are production customers;
- Amazon Ring now routes 100% of inbound smart-home support calls through the platform.