Prates FYI

GPT-5.5 and Codex Land on Amazon Bedrock, Ending Azure's Seven-Year Lock

OpenAI's frontier models — GPT-5.5, GPT-5.4, and open-weight variants gpt-oss-20b and gpt-oss-120b — are now available in limited preview on Amazon Bedrock, alongside the Codex coding agent and Bedrock Managed Agents. The launch follows a renegotiated partnership with Microsoft that lifted Azure's near-exclusive hosting rights after seven years. Enterprise customers can now build with OpenAI models inside AWS, using existing security controls and procurement paths without a separate Microsoft relationship.

xAI techcrunch.com · cnbc.com

Musk Testifies Under Oath: xAI Used OpenAI Distillation to Train Grok

On the stand in a California federal court on Wednesday, Elon Musk confirmed that xAI used distillation techniques — querying OpenAI's models via API and using the outputs as training signals — while framing it as "general practice" among AI labs. It is the first public confirmation from a major U.S. AI leader that American companies train on each other's models. In the same testimony, Musk ranked Anthropic first among AI providers, followed by OpenAI and Google, placing xAI well behind the leaders. Musk is suing OpenAI over its conversion from nonprofit to for-profit structure.

Anthropic anthropic.com · techcrunch.com

Google Commits Up to $40 Billion in Anthropic as Revenue Triples

Alphabet confirmed a fresh investment of up to $40 billion in Anthropic — $10 billion immediately at a $350 billion valuation, with the remaining $30 billion conditional on performance targets — structured in partnership with Broadcom for compute capacity. The deal arrives as Anthropic's annualized revenue surpasses $30 billion run-rate, more than triple its approximately $9 billion exit rate from late 2025, reflecting accelerating enterprise demand for Claude across coding, analysis, and agentic workflows.

Coding & Developer Tools

Cursor thenewstack.io

Cursor 3 Goes Agent-First With Composer 2 to Rival Claude Code and Codex

Cursor launched its third-generation interface built around "Composer 2," an internally developed model optimized for agentic tasks that minimizes token usage while maximizing output quality. The redesign positions Cursor directly against Claude Code and Codex as all three tools converge on background-agent, CLI-capable workflows. Analysts describe the emerging landscape as a composable stack — orchestration, execution, and review layers — rather than a winner-take-all consolidation. Claude Code held a 46% developer "most loved" rating as of February 2026.

OpenAI industry

Codex Surpasses 3 Million Weekly Active Users, Gains Desktop Control

OpenAI's Codex coding agent crossed 3 million weekly active users — up 50% from 2 million just one month earlier. The growth coincides with a new capability: desktop control, allowing Codex to observe the screen and operate the mouse and keyboard as a virtual pair programmer, pushing it from code-completion into full agentic territory. The rapid user growth places Codex alongside Claude Code and Cursor as one of the most widely used AI developer tools.

Enterprise & Agents

OpenAI techcrunch.com

OpenAI Updates Agents SDK With Enterprise Governance and Safety Hooks

OpenAI shipped an updated Agents SDK adding guardrails and governance controls for enterprises deploying agents across internal systems. New features allow agents to traverse cross-system boundaries — moving through a company's tools and data — while providing monitoring and safety layers that satisfy enterprise risk requirements. Customers including Oracle, State Farm, and Uber are among the reference deployments. Industry surveys now project 40% of business applications will embed AI agents by end of 2026, up from under 5% in 2025.

Google theregister.com

Google Launches Gemini Enterprise Agent Platform to Address Agent Sprawl

Google unveiled a unified platform for developing, deploying, governing, and monitoring AI agents across enterprise environments — positioned as a connective governance layer between company data, employees, and autonomous agents. The announcement targets a growing pain point: organizations running dozens of disconnected agent deployments with no central oversight. Google's pitch is a single Gemini-powered control plane covering the full agent lifecycle from development through decommissioning.

Microsoft industry

Microsoft Open-Sources Agent Governance Toolkit for AI Security

Microsoft released the Agent Governance Toolkit as open source — a security shield protecting AI agents against 10 critical attack types including prompt injection and privilege escalation — with sub-0.1 millisecond blocking latency. The release comes as 97% of enterprises report expecting a major AI agent security incident in 2026. As agentic deployments move from pilot to production, security is becoming a first-class infrastructure requirement rather than an afterthought.

Models & Benchmarks

Benchmarks industry

The 2026 Model Leaderboard: Specialization Has Replaced the Single Winner

No frontier model dominates every benchmark in mid-2026. GPT-5.5 — a ground-up natively omnimodal rebuild internally codenamed "Spud," processing text, images, audio, and video in a unified architecture — leads Terminal-Bench 2.0 at 82.7% for agentic terminal workflows. Claude Opus 4.7 holds the SWE-bench Pro crown at 64.3% for complex software engineering. Gemini 3.1 Pro tops GPQA Diamond at 94.3% for scientific reasoning. Analysts call specialization the defining feature of 2026: developers pick models by task rather than by brand.