The Agent Stack #021 — Friday Signal


The AI coding wars just got serious. OpenAI dropped a massive update to Codex this week, adding desktop control, image generation, and persistent memory. This is a direct shot at Anthropic’s Claude Computer Use.

Main Analysis: The Battle for Your Desktop

OpenAI’s Codex overhaul isn’t just an incremental update—it’s a declaration of war against Anthropic’s Claude ecosystem. The new capabilities let Codex manipulate your entire macOS desktop, not just code editors. It can screenshot your screen, click buttons, and navigate applications whilst coding.

The timing is brutal. Anthropic just launched Claude Computer Use in October, positioning themselves as the leader in agentic desktop automation. Now OpenAI has matched those capabilities and bundled them with their already-popular coding assistant.

The real kicker? Codex now remembers context across sessions. Previous conversations, project structures, coding patterns—it all persists. This addresses the biggest frustration with AI coding tools: constantly re-explaining your codebase.

For practitioners, this changes the game. You’re no longer choosing between coding help and desktop automation. One tool does both. The integration is deeper than Anthropic’s bolt-on approach with Claude Computer Use.

Actionable takeaway: If you’re building agents that need desktop access, test both platforms immediately. The competitive pressure means rapid feature releases ahead.

Quick Hits

Physical Intelligence released π0.7—their “general-purpose robot brain” that handles tasks it was never explicitly trained on. Early demos show impressive transfer learning across different robotic platforms. Worth watching for embodied AI applications.

InsightFinder raised £12M to solve AI agent debugging. Their platform monitors entire tech stacks with AI components, not just individual models. Growing necessity as agent deployments scale beyond proof-of-concept.

Anthropic’s CPO left Figma’s board after reports of competing design tools. Another signal that AI labs are coming for traditional SaaS. The “SaaSpocalypse” thesis gains more evidence.

One Thing to Try

Set up both Codex and Claude Computer Use in your development environment this weekend. Run the same complex coding task on both—something that requires multiple file edits, terminal commands, and browser research. Document which feels more natural for your workflow. The winner becomes your primary agent platform for Q2.

The desktop is the new battleground. Choose your weapon wisely.