In a sea of agents, AWS bets on structured adherence and spec fidelity
Despite new methods emerging, enterprises continue to turn to autonomous coding agents and code generation platforms. The competition to keep developers working on their platforms, coming from tech companies, has also heated up. AWS thinks its offering, Kiro, and new capabilities to ensure behavioral adherence set up a large differentiator in the increasingly crowded coding […]
From shiny object to sober reality: The vector database story, two years later
When I first wrote “Vector databases: Shiny object syndrome and the case of a missing unicorn” in March 2024, the industry was awash in hype. Vector databases were positioned as the next big thing — a must-have infrastructure layer for the gen AI era. Billions of venture dollars flowed, developers rushed to integrate embeddings into […]
Human-centric IAM is failing: Agentic AI requires a new identity control plane
The race to deploy agentic AI is on. Across the enterprise, systems that can plan, take actions and collaborate across business applications promise unprecedented efficiency. But in the rush to automate, a critical component is being overlooked: Scalable security. We are building a workforce of digital employees without giving them a secure way to log […]
OpenAI experiment finds that sparse models could give AI builders the tools to debug neural networks
OpenAI researchers are experimenting with a new approach to designing neural networks, with the aim of making AI models easier to understand, debug, and govern. Sparse models can provide enterprises with a better understanding of how these models make decisions. Understanding how models choose to respond, a big selling point of reasoning models for enterprises, […]
Google’s new AI training method helps small models tackle complex reasoning
Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning tasks. Supervised Reinforcement Learning (SRL) reformulates problem-solving as a sequence of logical “actions,” providing rich learning signals during the training process. This approach enables smaller models to learn […]
How Anthropic’s AI was jailbroken to become a weapon
Chinese hackers automated 90% of an espionage campaign using Anthropic’s Claude, breaching four organizations of the 30 they chose as targets. “They broke down their attacks into small, seemingly innocent tasks that Claude would execute without being provided the full context of their malicious purpose,” Jacob Klein, Anthropic’s head of threat intelligence, told VentureBeat. AI […]
Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more
Mere hours after OpenAI updated its flagship foundation model GPT-5 to GPT-5.1, promising reduced token usage overall and a more pleasant personality with more preset options, Chinese search giant Baidu unveiled its next-generation foundation model, ERNIE 5.0, alongside a suite of AI product upgrades and strategic international expansions. The goal: to position as a global […]
Upwork study shows AI agents excel with human partners but fail independently
Artificial intelligence agents powered by the world’s most advanced language models routinely fail to complete even straightforward professional tasks on their own, according to groundbreaking research released Thursday by Upwork, the largest online work marketplace. But the same study reveals a more promising path forward: When AI agents collaborate with human experts, project completion rates […]
Databricks: ‘PDF parsing for agentic AI is still unsolved’ — new tool replaces multi-service pipelines with single function
There is a lot of enterprise data trapped in PDF documents. To be sure, gen AI tools have been able to ingest and analyze PDFs, but accuracy, time and cost have been less than ideal. New technology from Databricks could change that. The company this week detailed its “ai_parse_document” technology, now integrated with Databricks’ Agent […]
ChatGPT Group Chats are here … but not for everyone (yet)
It was originally found in leaked code and publicized by AI influencers on X, but OpenAI has made it official: ChatGPT now offers Group Chats, allowing multiple users to join the same, single ChatGPT conversation and send messages to each other and the underlying large language model (LLM), online and via its mobile apps. Imagine […]
