Eighty-five percent of enterprises run AI agent pilots, but just 5% ship them in production, according to Cisco’s Jeetu Patel. The blocker isn’t “rogue agents” but the lack of a trust architecture that covers delegation, identity, and telemetry for action risk. Patel also outlined Cisco’s rapid security tooling and a push for AI-built products.
OpenAI has unveiled GPT-5.5, positioning it as a major shift toward “agentic” work that can navigate software and complete multi-step tasks with less prompting. On Terminal-Bench 2.0, it narrowly edges Anthropic’s restricted Claude Mythos Preview. But the upgrade comes with sharply higher API prices and delayed developer access, while rollout is limited to ChatGPT subscribers.
Your news, in seconds
Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.
OpenAI has introduced Workspace Agents, a business-focused successor to custom GPTs that teams can build or select from templates and deploy across tools like Slack, Salesforce, Notion, Google Drive, and Microsoft apps. Instead of pausing when you stop chatting, these agents run on a cloud coding backbone, can schedule long workflows, and operate under granular admin permissions.
Enterprises are moving AI agents from experiments to production, and the key question is how to manage them. Google and AWS are taking opposite paths: Google emphasizes a Kubernetes-style control plane for governance and identity, while AWS pushes config-based harnesses in the execution layer for faster deployment. Both aim to reduce new risks like state drift in long-running agents.
Developers reported “AI shrinkflation” as Claude appeared less capable, more repetitive, and less efficient with tokens. Anthropic’s technical post-mortem says the model weights didn’t regress, but three surrounding product-layer changes did: a reasoning-effort default, a caching bug that wiped thinking too often, and tighter verbosity limits. The company says it has reverted the fixes and reset subscriber usage limits.
Stanford research warns enterprises may be paying a “swarm tax” for multi-agent AI systems whose gains disappear under equal compute. When both single and multi-agent setups get the same “thinking token” budget, single agents usually match or outperform multi-agent architectures on multi-hop reasoning. Multi-agent approaches help mainly when context is corrupted or fragmented.
Never miss a story
Set alerts for the topics and sources you care about. Download Beige for free.
After 18 months of “agent building,” BAND (Thenvoi AI Ltd.) targets the tougher next step: letting AI agents built on different frameworks actually collaborate. Exiting stealth with $17 million, it offers a deterministic “Slack for agents” interaction layer with multi-peer comms, identity-based permissions, and an enterprise control plane for auditability—positioned as infrastructure for a universal orchestrator.
OpenAI has released Privacy Filter, an open-source model that detects and redacts personally identifiable information before data ever leaves an enterprise environment. Built from a gpt-oss variant, it runs locally on laptops or in browsers and supports large 128,000-token inputs. The tool targets fast, high-throughput privacy pipelines, but comes with a caution against treating it as a full safety guarantee.
Swipe through stories, personalise your feed, and save articles for later — all on the app.