AI Radar

AI Radar — 15 May 2026

9 items · 6 verified · 3 secondary · 0 rumor · 20 sources · 44% exploration

Notion opens its workspace to external AI agents; xAI ships a Grok Build CLI to challenge Claude Code; Anthropic splits Agent SDK billing from subscriptions; Claude Code v2.1.142 upgrades Fast mode to Opus 4.7; Anthropic and Gates Foundation commit $200M for AI in global health and education.

Run: 12–15 May 2026 · 30 items reviewed → 9 published · 6 verified · 3 secondary · 0 rumor · 44% exploration · Run timestamp: 2026-05-15

Note: Claude for Small Business, Claude Code v2.1.141, GitHub Copilot CLI v1.0.45, Meta AI App features, Judgment Labs $32M, Google TIG zero-day, Wispr AI $260M talks, and Anthropic $30B round (all May 11–14) were published in the 2026-05-14 bulletin and are excluded here.


TL;DR


Items

Notion Launches Developer Platform with External Agents API, Workers, and Database Sync

Source: https://www.notion.com/releases/2026-05-13 · Notion · 2026-05-13
Verification: T2 verified · announcement · productivity-ai / agent-framework / workflow-automation

Notion shipped version 3.5 of its product on 13 May 2026, adding a developer platform that lets external AI agents operate inside Notion workspaces. The External Agents API (alpha) connects Claude Code, Cursor, Codex, and Decagon to Notion at launch, with a custom-agent pathway via the Agent SDK (alpha). Notion Workers — a hosted, serverless code-execution runtime deployed via CLI into a secure sandbox — powers database syncs, custom agent tools, and webhook triggers without server management; Workers are free during beta and move to Notion credits from 11 August 2026. The Developer Portal at app.notion.com/developers now centralizes connection and token management across integrations.

Why it matters for automation/productivity: Teams running Notion as their primary workspace can now connect Claude Code, Codex, or Cursor directly into Notion and build serverless functions that pull external API data into Notion databases on a schedule — making Notion a coordination layer for multi-step agent workflows rather than a passive notes tool.
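
To make the scheduled-sync pattern concrete, here is a minimal sketch of the reconciliation step such a function might run on each tick: compare records fetched from an external API against rows already stored, then decide what to create and what to update. Nothing here calls Notion's API. The helper name `plan_sync`, the `id` key, and the sample records are hypothetical, and the actual Workers interface is not described in the release summarized above.

```python
from typing import Any

Record = dict[str, Any]


def plan_sync(
    external: list[Record], existing: list[Record], key: str = "id"
) -> tuple[list[Record], list[Record]]:
    """Split external records into rows to create and rows to update."""
    # Index the rows already stored so each lookup is O(1).
    existing_by_key = {row[key]: row for row in existing}
    to_create: list[Record] = []
    to_update: list[Record] = []
    for record in external:
        current = existing_by_key.get(record[key])
        if current is None:
            to_create.append(record)   # not stored yet
        elif current != record:
            to_update.append(record)   # stored, but fields differ
    return to_create, to_update


# Hypothetical sample data, for illustration only.
external = [{"id": 1, "status": "open"}, {"id": 2, "status": "done"}]
existing = [{"id": 1, "status": "closed"}]
creates, updates = plan_sync(external, existing)
```

Here `creates` holds the record with `id` 2 and `updates` holds the changed record with `id` 1. Unchanged rows are skipped, which keeps a scheduled job from rewriting the whole database on every run.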

Caveats: External Agents API, Agent SDK, and Webhook Triggers are in alpha/beta. Pricing for Workers credits post-August 11 not disclosed at launch. Vendor framing; degree of actual agent autonomy within Notion depends on workflow design.


xAI Ships Grok Build CLI Early Beta for Agentic Coding, Requiring SuperGrok Heavy at $99/Month

Source: https://x.ai/news/grok-build-cli · xAI · 2026-05-14
Verification: T2 verified · announcement · dev-tools
Tier nuance: x.ai/news URL returned 403 from this sandbox; content confirmed via @xai official X post (T2) and Bloomberg reporting.

xAI released an early beta of Grok Build on 14 May 2026, an agentic CLI for coding and workflow automation available to SuperGrok Heavy subscribers. The tool plans projects, writes and edits files, executes shell commands, and dispatches up to 8 concurrent AI subagents to work in parallel, positioning it directly against Claude Code. The underlying model is Grok 4.3 beta in a 16-agent Heavy architecture with a vendor-stated 2M-token context window. Plugins, hooks, skills, and MCP servers work out of the box, and subagents can launch in their own git worktrees. SuperGrok Heavy is normally priced at $299/month, with an introductory rate of $99/month for the first six months.

Why it matters for automation/productivity: Development teams now have a second agentic coding CLI from a frontier lab, with a notably large stated context window and native multi-agent parallelism. The $99 introductory price sits just below Claude Code Max 5x ($100/month), making it a direct comparison for teams shopping the category. Grok Build’s MCP server and plugin compatibility reduces lock-in risk for teams already invested in that tooling.
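
For readers unfamiliar with the worktree-per-subagent pattern, the sketch below shows how a coordinator might plan one isolated git worktree per parallel task while respecting the vendor-stated cap of 8. It uses only the standard `git worktree add -b <branch> <path>` syntax. How Grok Build itself creates worktrees is not documented in the announcement; `worktree_commands`, the `agent/...` branch naming, and the directory layout are assumptions for illustration. The code builds the commands but does not run them.

```python
from pathlib import Path

MAX_SUBAGENTS = 8  # vendor-stated concurrency cap from the announcement


def worktree_commands(tasks: list[str], root: Path) -> list[list[str]]:
    """Build one `git worktree add` command per task (hypothetical helper)."""
    if len(tasks) > MAX_SUBAGENTS:
        raise ValueError(f"at most {MAX_SUBAGENTS} parallel tasks supported")
    commands = []
    for index, task in enumerate(tasks):
        branch = f"agent/{index}-{task}"           # assumed naming scheme
        path = (root / f"agent-{index}").as_posix()
        # `git worktree add -b <branch> <path>` creates a new branch
        # checked out in its own directory, isolated from other agents.
        commands.append(["git", "worktree", "add", "-b", branch, path])
    return commands


commands = worktree_commands(["fix-auth", "add-tests"], Path("/tmp/wt"))
```

Each subagent then works in its own checkout, so parallel edits do not collide in a shared working directory.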

Caveats: Early beta; 8-agent parallelism and 2M context window are vendor-claimed, no independent replication in this run. Access restricted to SuperGrok Heavy subscribers. xAI has documented service-reliability issues (recurring demand-outage errors) through April 2026.


Claude Code v2.1.142 Upgrades Fast Mode to Opus 4.7 and Adds Background Agent Configuration Flags

Source: https://github.com/anthropics/claude-code/releases · Anthropic (anthropics/claude-code) · 2026-05-14
Verification: T2 verified · changelog · dev-tools

Claude Code v2.1.142 shipped 14 May 2026 at 22:55 UTC. Fast mode now defaults to Claude Opus 4.7 (previously Opus 4.6); users can pin to 4.6 via CLAUDE_CODE_OPUS_4_6_FAST_MODE_OVERRIDE=1. Background agents now accept configuration flags at dispatch time (--add-dir, --settings, --mcp-config, --plugin-dir, --permission-mode, --model, --effort, and --dangerously-skip-permissions), enabling fine-grained per-session configuration when spawning headless agents programmatically. Plugins with a root-level SKILL.md and no skills/ subdirectory are now surfaced as skills. Bug fixes: MCP_TOOL_TIMEOUT now correctly raises per-request timeouts for remote HTTP/SSE MCP servers; background sessions now recognize pre-existing git worktrees; daemon no longer crash-loops after binary upgrade.

Why it matters for automation/productivity: The upgrade brings Opus 4.7’s April 2026 coding improvements (vendor-cited 13% lift on coding benchmarks) into Fast mode at no additional cost. The per-dispatch background agent flags immediately benefit teams running Claude Code in CI pipelines where each session needs different tool scopes, permission boundaries, or models.
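
A minimal sketch of how a CI job might assemble per-session configuration from the flags named in the changelog. The flag names and the `CLAUDE_CODE_OPUS_4_6_FAST_MODE_OVERRIDE` variable come from the release summary above. The `claude -p` entry point, the helper name `build_agent_command`, and every example value (model alias, permission mode, directory) are assumptions for illustration, so check `claude --help` before relying on them. The code only builds the argument list and environment; it does not launch anything.

```python
import os
from typing import Optional


def build_agent_command(
    prompt: str,
    *,
    model: Optional[str] = None,
    effort: Optional[str] = None,
    permission_mode: Optional[str] = None,
    mcp_config: Optional[str] = None,
    add_dirs: tuple[str, ...] = (),
    pin_fast_mode_to_opus_4_6: bool = False,
) -> tuple[list[str], dict[str, str]]:
    """Return (argv, env) for one headless session (hypothetical helper)."""
    # Assumption: `claude -p <prompt>` is the programmatic entry point.
    argv = ["claude", "-p", prompt]
    if model:
        argv += ["--model", model]
    if effort:
        argv += ["--effort", effort]
    if permission_mode:
        argv += ["--permission-mode", permission_mode]
    if mcp_config:
        argv += ["--mcp-config", mcp_config]
    for directory in add_dirs:
        argv += ["--add-dir", directory]

    env = dict(os.environ)
    if pin_fast_mode_to_opus_4_6:
        # Opt back into Opus 4.6 for Fast mode, per the changelog.
        env["CLAUDE_CODE_OPUS_4_6_FAST_MODE_OVERRIDE"] = "1"
    return argv, env


# Example values are placeholders, not recommended settings.
argv, env = build_agent_command(
    "run the test suite",
    model="opus",
    permission_mode="plan",
    add_dirs=("src",),
    pin_fast_mode_to_opus_4_6=True,
)
```

The resulting `argv` and `env` could be passed to `subprocess.run(argv, env=env)` in a pipeline step; keeping construction separate from execution makes each session's scope easy to review and test.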


OpenAI Brings Codex to ChatGPT Mobile App on iOS and Android in Preview

Source: https://openai.com/index/work-with-codex-from-anywhere/ · OpenAI · 2026-05-14
Verification: T3 secondary · announcement · dev-tools
Tier nuance: Primary openai.com URL returned 403 from this sandbox; content confirmed by TechCrunch, 9to5Mac, Engadget, and Thurrott (all T2).

OpenAI added Codex to the ChatGPT mobile app on 14 May 2026, rolling out preview access on iOS and Android across all plans including Free and Go. The mobile interface acts as an intermediary to Codex running on a macOS machine (Windows remote support coming), letting users review task status, approve commands, change models, and start new sessions from a phone while local files, credentials, and environment stay on the source machine. A secure relay layer keeps the remote machine reachable across devices without exposing it to the public internet. OpenAI states over 4 million people use Codex weekly — a vendor-cited figure without independent verification.

Why it matters for automation/productivity: Development teams running Codex on fixed machines can now supervise long-running tasks remotely, approve checkpoints mid-commute, and restart stalled jobs without returning to a workstation — a concrete improvement for asynchronous agentic workflows that outpace the workday.

Caveats: Preview only; Windows remote support not yet available. Primary URL 403 from this sandbox — tier limited to T3 secondary for this run.


Anthropic Splits Agent SDK and Programmatic Claude Usage Into a Separate Monthly Credit Pool, Effective June 15

Source: @ClaudeDevs (official Anthropic developer relations account on X) · Anthropic · 2026-05-13
Verification: T2 secondary · policy-change · ai-for-business / dev-tools

Anthropic announced on 13 May 2026 that, starting June 15, Claude Agent SDK calls, claude -p, Claude Code GitHub Actions, and third-party agents such as OpenClaw move from the interactive subscription limit to a separate monthly programmatic credit pool. Interactive use of Claude Code, Claude Cowork, and Claude.ai (user types, model responds) remains on subscription limits and is unaffected. Monthly credit amounts: Pro = $20, Max 5x = $100, Max 20x = $200. Credits do not roll over; once exhausted, programmatic usage stops unless users enable extra usage billing at standard API rates. Anthropic technical staffer Lydia Hallie clarified: “You don’t pay extra. It’s the same subscription, same price per month.”

Why it matters for automation/productivity: Teams running third-party Claude integrations (OpenClaw, Zed, custom Agent SDK pipelines) need to audit programmatic token consumption before June 15. At Pro ($20 credit), a single heavy-load pipeline can exhaust the monthly pool within days at API pricing. Max 20x users, with the largest pool ($200/month credit, equal to subscription cost), are least affected. Teams that regularly hit the ceiling will face per-token billing or need to upgrade tiers.
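
The "within days" estimate can be checked with simple arithmetic. The sketch below divides each plan's monthly credit (from the announcement) by a daily spend computed from token volume and per-million-token rates. The workload and the rates are placeholder values chosen for round numbers, not Anthropic's published prices, and `days_until_exhausted` is a hypothetical helper; substitute real usage data and current API rates before drawing conclusions.

```python
import math

# Monthly programmatic credit per plan in USD, as stated in the announcement.
MONTHLY_CREDIT_USD = {"Pro": 20.0, "Max 5x": 100.0, "Max 20x": 200.0}


def days_until_exhausted(
    credit_usd: float,
    daily_input_mtok: float,
    daily_output_mtok: float,
    input_usd_per_mtok: float,
    output_usd_per_mtok: float,
) -> float:
    """Days a credit pool lasts at a constant daily token volume."""
    daily_cost = (
        daily_input_mtok * input_usd_per_mtok
        + daily_output_mtok * output_usd_per_mtok
    )
    if daily_cost <= 0:
        return math.inf  # no spend, so the pool never runs out
    return credit_usd / daily_cost


# Placeholder workload and rates -- NOT Anthropic's published prices.
# 2 Mtok in and 0.5 Mtok out per day at $3 and $8 per Mtok = $10 per day.
for plan, credit in MONTHLY_CREDIT_USD.items():
    days = days_until_exhausted(credit, 2.0, 0.5, 3.0, 8.0)
    print(f"{plan}: {days:.1f} days")
```

With these placeholder numbers the daily cost is $10, so the Pro pool lasts 2 days, Max 5x lasts 10 days, and Max 20x lasts 20 days. Because credits do not roll over, the relevant comparison is whether that figure reaches the end of the billing month.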

Caveats: Announcement via @ClaudeDevs (official developer account) rather than an Anthropic blog post. Tier elevated to T2 secondary (official company account, confirmed by multiple T2 trade press outlets). Upgrade to T2 verified when Anthropic publishes a primary support article or pricing page update.


Hermes Agent Returns to Top of GitHub Trending, Crossing 145,000 Stars

Source: https://github.com/NousResearch/hermes-agent · NousResearch · 2026-05-13 (trending milestone)
Verification: T3 secondary · community-signal · agent-framework

Hermes Agent (NousResearch/hermes-agent) returned to the top of GitHub trending on 13 May 2026, adding 2,065 stars per day and crossing 145,000 cumulative stars, eleven weeks after its 25 February 2026 launch. The most recent release, v0.13.0 Tenacity (shipped 7 May 2026), added a multi-agent Kanban board with heartbeat monitoring, zombie detection, per-task retries, and a hallucination gate. On 10 May 2026, Hermes Agent overtook OpenClaw to become the #1 most-used agent on OpenRouter’s daily rankings, processing a community-reported 224 billion tokens in a single day. The self-improving architecture accumulates user interactions into a persistent skill library and reasoning-preference profile across sessions.

Why it matters for automation/productivity: Sustained GitHub growth more than two months after launch, rather than initial hype, combined with top OpenRouter usage, suggests that the self-improving agent design is finding production traction. The project is open-source and model-agnostic; for teams evaluating stateful agent frameworks that adapt to project conventions over time, Hermes Agent is now a peer to LangGraph and CrewAI in practitioner attention.

Caveats: GitHub stars and OpenRouter rankings are not production adoption evidence; stars ≠ active deployments. 224B-token-per-day figure is from a community blog citing NousResearch, not from OpenRouter’s own published stats. v0.13.0 release was May 7, outside the 72h window — the in-window event is the May 13 trending milestone.


Anthropic and Gates Foundation Form $200M Four-Year Partnership Targeting Global Health, Education, and Agriculture

Source: https://www.anthropic.com/news/gates-foundation-partnership · Anthropic · 2026-05-14
Verification: T2 verified · announcement · ai-for-business

Anthropic and the Bill & Melinda Gates Foundation announced a $200 million, four-year commitment on 14 May 2026, combining grants, Claude API credits, and technical support. The program spans four priority areas: vaccine and therapy development, disease tracking, and health data systems for low- and middle-income countries; K-12 tutoring, literacy, and career navigation in the US, sub-Saharan Africa, and India; agricultural decision-support tools in local languages; and portable skills records for economic mobility. Specific research commitments include AI-assisted drug candidate identification for HPV and preeclampsia, and collaboration with the Institute for Health Metrics and Evaluation for disease forecasting. The Gates Foundation simultaneously published a confirming press release.

Why it matters for automation/productivity: Informational for commercial operators. For NGOs, global health research institutes, and education nonprofits, this creates a pathway to Claude API access and technical support via the Gates Foundation umbrella without direct commercial contracting with Anthropic. Public-goods datasets and benchmarks are expected to be released, which may benefit AI-in-health deployments broadly.


PwC Expands Anthropic Alliance, Commits to Training 30,000 Staff on Claude Across Client and Internal Deployments

Source: https://www.prnewswire.com/news-releases/anthropic-and-pwc-expand-alliance-driving-impact-across-client-work-and-the-firm-302772321.html · PwC / PRNewswire · 2026-05-14
Verification: T2 verified · announcement · ai-for-business

PwC and Anthropic announced an expanded strategic alliance on 14 May 2026, building on a prior partnership to deploy Claude across PwC’s consulting, deals, and operating-model practices. PwC will roll out Claude Code and Claude Cowork starting with US teams, expanding toward its global professional workforce, and is committing to train and certify 30,000 professionals on Claude via a joint Center of Excellence. Focus areas: agentic tooling for engineering teams serving clients, AI across dealmaking workflows, and AI-driven operating model reinvention. Vendor-cited client results: insurance underwriting cycles cut from 10 weeks to 10 days, and vulnerability operations that reduced exposure windows from hours to minutes. No independent benchmarks were published.

Why it matters for automation/productivity: PwC’s 30,000-professional certification commitment creates a large certified practitioner pool for Claude implementation across financial services, insurance, and deal advisory clients. For organizations considering Claude adoption via a system integrator, PwC now has the most publicly committed Claude deployment practice.

Caveats: Client performance metrics are vendor-cited results from PwC’s own announcement; no independent audit published.


IBM Consulting Launches Forward Deployed Units: Six-Person AI-Agent Pods at Think 2026

Source: https://newsroom.ibm.com/2026-05-14-A-New-Way-to-Make-AI-Actually-Work-in-the-Real-World · IBM · 2026-05-14
Verification: T2 verified · announcement · ai-for-business

IBM Consulting introduced Forward Deployed Units (FDUs) on 14 May 2026 at IBM Think, a new delivery model combining a six-person senior human team with a specialized agent workforce handling coding, evaluation, testing, and documentation under human direction. IBM states a six-person FDU performs work equivalent to a 30-person traditional team — a 5:1 ratio that is vendor-claimed with no independent audit cited. Named live FDU deployments: Riyadh Air, Nestlé, Heineken, and Pearson. FDUs run on IBM Consulting Advantage, an AI platform providing reusable agent assets and industry accelerators.

Why it matters for automation/productivity: Informational only — no off-the-shelf tooling to adopt directly. For organizations scoping enterprise AI implementation contracts, IBM’s FDU model signals that major consultancies are restructuring their AI delivery economics around agent-augmented small teams. The 5:1 claimed team-size ratio, if independently validated, would materially change project cost structures for large-scale AI implementation.

Caveats: 5:1 efficiency ratio is IBM-stated with no third-party validation. No pricing or contract terms disclosed.


Dropped

Items considered but not published, with reason:

Title considered | Source | Reason
Claude for Small Business (15 workflows, 7 integrations) | anthropic.com · 2026-05-13 | Covered in 2026-05-14 bulletin
Claude Code v2.1.141 (terminalSequence hooks, ANTHROPIC_WORKSPACE_ID) | github.com · 2026-05-13 | Covered in 2026-05-14 bulletin
GitHub Copilot CLI v1.0.45 (/autopilot command) | github.com · 2026-05-11 | Covered in 2026-05-14 bulletin
Meta AI App rollout (voice, live camera, Threads) | 9to5Mac · 2026-05-12 | Covered in 2026-05-14 bulletin
Judgment Labs $32M (agent evaluation infrastructure) | BusinessWire · 2026-05-12 | Covered in 2026-05-14 bulletin
Google TIG first AI-generated zero-day | CNBC · 2026-05-11 | Covered in 2026-05-14 bulletin
Wispr AI $260M funding talks | Bloomberg · 2026-05-12 | Covered in 2026-05-14 bulletin
Anthropic $30B round at $900B valuation | Bloomberg · 2026-05-12 | Covered in 2026-05-14 bulletin
OpenAI Deployment Co. / Tomoro acquisition | openai.com · 2026-05-11 | Covered in 2026-05-12 bulletin
Claude Opus 4.7 release | anthropic.com · 2026-04-16 | Outside window (April 16)
Novo Nordisk-OpenAI strategic partnership | CNBC · 2026-04-14 | Outside window (April 14)
Gemma 4 (31B, Apache 2.0) | blog.google · 2026-05-05 | Outside 72h window
ThoughtSpot MCP Server GA | thoughtspot.com · 2026-03-20 | Outside window (March 20; May 1 Spotter 3 update also outside 72h)
Windsurf 2.0 | xda-developers.com | Release date not confirmed within 72h window
Grok SuperHeavy $299/$99 subscription tier | xai.com · 2026-05-14 | Folded into Grok Build item (same announcement)
Hermes Agent v0.13.0 Tenacity release | github.com/NousResearch/hermes-agent · 2026-05-07 | Outside 72h window; trending milestone (May 13) covered in Item 6
ServiceNow Autonomous Workforce (Knowledge 2026) | newsroom.servicenow.com · 2026-05-05 | Outside 72h window
ByteDance Doubao-Seed-2.0-lite (full-modal multimodal) | volcengine · 2026-05-06 | Outside 72h window
IBM Think 2026 other keynote announcements | ibm.com · 2026-05-12–14 | Maintenance-level; IBM FDU covers the headline
HuggingFace daily papers May 13–14 (MinT, MulTaBench, AnyFlow) | huggingface.co · 2026-05-13–14 | No immediate BD-actionable finding in window; EVA-Bench (ServiceNow) outside window
Google I/O 2026 Gemini model | Anticipated May 19 | Not yet announced

Limitations


Search log (compact)

Query | Yield | Type
Anthropic Claude announcement news May 14 15 2026 | 10 results, 3 high-rel | registry
OpenAI announcement release ChatGPT May 14 15 2026 | 9 results, 2 high-rel | registry
Google DeepMind Gemini AI announcement May 14 15 2026 | 10 results, 1 high-rel (outside window) | registry
new AI model release launch May 14 2026 | 10 results, 1 high-rel (IBM) | exploratory
Anthropic Claude Opus 4.7 release date announcement 2026 | 10 results, 0 in-window (April 16) | registry
OpenAI Codex Work with Codex from anywhere May 14 2026 | 1 result, 1 high-rel | registry
Anthropic Agent SDK credit split June 15 2026 | 10 results, 5 high-rel | registry
anthropic.com/news (web_fetch) | 1 in-window item (Gates Foundation) | registry
Claude Code release update May 14 15 2026 | 10 results, 3 high-rel | registry
MCP Model Context Protocol server release May 14 15 2026 | 10 results, 0 in-window | registry
Claude Code releases page (github.com, web_fetch) | v2.1.140, v2.1.141, v2.1.142 confirmed | registry
Anthropic subscription billing claude -p agent SDK May 13 14 2026 | 10 results, 4 high-rel | registry
IBM new way to make AI May 14 2026 FDU consulting | 5 results, 3 high-rel | exploratory
AI agent framework release launch new May 14 15 2026 | 10 results, 4 high-rel | exploratory
notion.com/releases/2026-05-13 (web_fetch) | Notion Developer Platform confirmed | registry
Notion developer platform Workers External Agent API May 13 2026 | 10 results, 5 high-rel | registry
PwC Claude Anthropic partnership deployment May 14 2026 | 10 results, 4 high-rel | exploratory
Novo Nordisk OpenAI partnership announcement May 2026 | 10 results, 0 in-window (April 14) | exploratory
OpenAI Codex mobile ChatGPT app May 14 2026 release | 10 results, 5 high-rel | registry
Anthropic agent programmatic billing blog post May 2026 | 10 results, 3 high-rel | registry
xAI Grok Build SuperHeavy beta May 14 2026 | 10 results, 4 high-rel | registry
xAI Grok Mistral Meta announcement May 14 15 2026 | 10 results, 1 high-rel (Grok Build) | registry
AI announcement launch news May 15 2026 | 10 results, 2 high-rel | exploratory
new LLM OR new model released May 14 15 2026 | 10 results, 0 confirmed in-window | exploratory
agent framework OR AI tool launch May 14 15 2026 | 10 results, 3 high-rel | exploratory
Grok Build criticism problems limitations concerns May 2026 | 10 results, 3 high-rel (reliability issues) | adversarial
huggingface.co papers (web_fetch, May 13–15) | 5 papers, 0 immediate BD application | exploratory
site:x.com AI agent OR MCP server announcement May 14 15 2026 | 10 results, 1 in-window | social-X
AI Indonesia OR startup AI Asia OR model AI lokal May 2026 | 10 results, 0 in-window primary | cross-lang
Anthropic Gates Foundation partnership criticism concerns May 2026 | 10 results, 2 high-rel | adversarial
Hermes Agent NousResearch github May 13 OR May 14 2026 | 10 results, 3 high-rel | exploratory
ThoughtSpot MCP server announcement date May 2026 | 10 results, 0 in-window (March 20) | exploratory

Total searches: 32, of which 14 exploratory/adversarial/social/cross-lang (44%).


Suggested next runs