Alibaba’s AgentEvolver lifts model performance in tool use by ~30% using synthetic, auto-generated tasks

Researchers at Alibaba’s Tongyi Lab have developed a new framework for self-evolving agents that create their own training data by exploring their application environments. The framework, AgentEvolver, uses the knowledge and reasoning capabilities of large language models for autonomous learning, addressing the high costs and manual effort typically required to gather task-specific datasets. Experiments show […]

OpenAI now lets enterprises choose where to host their data

OpenAI expanded its data residency regions for ChatGPT and its API, giving enterprise users the option to store and process their data closest to their business operations and better comply with local regulations. This expansion removes one of the biggest compliance blockers preventing global enterprises from deploying ChatGPT at scale. Data residency, often an overlooked piece […]

Anthropic’s Claude Opus 4.5 is here: Cheaper AI, infinite chats, and coding skills that beat humans

Anthropic released its most capable artificial intelligence model yet on Monday, slashing prices by roughly two-thirds while claiming state-of-the-art performance on software engineering tasks — a strategic move that intensifies the AI startup’s competition with deep-pocketed rivals OpenAI and Google. The new model, Claude Opus 4.5, scored higher on Anthropic’s most challenging internal engineering assessment […]

DeepSeek injects 50% more security bugs when prompted with Chinese political triggers

China’s DeepSeek-R1 LLM generates up to 50% more insecure code when prompted with politically sensitive inputs such as “Falun Gong,” “Uyghurs,” or “Tibet,” according to new research from CrowdStrike. The latest in a series of discoveries — following Wiz Research’s January database exposure, NowSecure’s iOS app vulnerabilities, Cisco’s 100% jailbreak success rate, and NIST’s finding […]

How to avoid becoming an “AI-first” company with zero real AI usage

Remember the first time you heard your company was going AI-first? Maybe it came through an all-hands that felt different from the others. The CEO said, “By Q3, every team should have integrated AI into their core workflows,” and the energy in the room (or on the Zoom) shifted. You saw a mix of excitement […]