Why prompt debt, retrieval debt, and evaluation debt are quietly reshaping enterprise AI risk
Over the past two decades, technical debt meant outdated architecture, messy code, and poorly maintained documentation. That definition is no longer sufficient in the AI era, where failure modes are more subtle and often non-linear. AI systems are introducing new layers of technical debt that live across prompts, models, and data dependencies — making these […]
AI agents are quietly generating chaos engineering failures enterprises don’t track yet
There is a category of production incident that engineering teams are not tracking yet — because it doesn’t fit any existing postmortem template. The agent initiated an action. The action was technically correct given the agent’s context. The context was incomplete. The infrastructure cascaded. And, by the time the incident review happened, three teams were […]
Valid certificates, stolen accounts: how attackers broke npm’s last trust signal
On May 19, 633 malicious npm package versions passed Sigstore provenance verification. They were cleared by the system because the attacker had generated valid signing certificates from a compromised maintainer account. Sigstore worked exactly as designed: it verified the package was built in a CI environment, confirmed a valid certificate was issued, and recorded everything […]
Your AI agents need a terminal, not just a vector database
When agentic workflows fail, developers often assume the problem lies in the underlying model’s reasoning abilities. In reality, the limited information provided by the retrieval interface is often the primary limiting factor. Researchers at multiple universities propose a technique called direct corpus interaction (DCI) that lets agents bypass embedding models entirely, searching raw corpora directly […]
D&B’s database of 642 million businesses was built for humans, not AI agents. So they rebuilt it.
Dun & Bradstreet has spent over 180 years building a comprehensive commercial database. Its Commercial Graph, covering 642 million businesses and their relationships, corporate hierarchies and risk profiles, was designed for people. Credit analysts, risk managers and sales professionals who could wait for query results and work through ambiguous entity matches. AI agents cannot do […]
Alibaba’s proprietary Qwen3.7-Max can run for 35 hours autonomously and supports external harnesses like Anthropic’s Claude Code
The AI industry has fully entered the “agent era,” a paradigm where AI models do far more than generate text — they now actively plan, execute, and course-correct complex tasks over days rather than seconds. Thus, it’s perhaps unsurprising to see Chinese e-commerce giant Alibaba’s famed Qwen Team of AI researchers release a model capable […]
A 0.12% parameter add-on gives AI agents the working memory RAG can’t
AI agents forget. Every time a coding assistant loses track of a debugging thread, or a data analysis agent re-ingests the same context it already processed, the team pays in latency, token costs, and brittle workflows. The fix most teams reach for — expanding the context window or adding more RAG — is increasingly expensive […]
Americans can’t spot a deepfake, and that’s a business crisis, not just a consumer problem
Presented by Veriff Americans can’t reliably distinguish real from AI-generated content, and that’s not just a media literacy problem; it’s a direct threat to how businesses verify identity online. New research finds that while many people are aware of deepfakes, their ability to distinguish them from reality is barely better than a coin flip. A […]
Google’s Managed Agents API promises one-call deployment at the cost of execution layer control
At Google I/O, the company unveiled Managed Agents in its Gemini API — a service that promises to collapse weeks of agent deployment work into a single API call. It’s also a sign that Google believes its ecosystem, including the newly launched Antigravity CLI, is ready to own the execution layer end-to-end. Before a single […]
Resolve AI says the AI coding boom is breaking production systems. It wants to fix that.
Resolve AI, the production-operations startup backed by Greylock and Lightspeed Venture Partners, today announced a sweeping expansion of its platform that introduces always-on background agents, a redesigned investigation architecture, and a shared workspace where engineers and AI agents collaborate in real time on live incidents. The centerpiece of the release is a new multi-agent investigation […]
