Google’s new framework helps AI agents spend their compute and tool budget more wisely
In a new paper that studies tool-use in large language model (LLM) agents, researchers at Google and UC Santa Barbara have developed a framework that enables agents to make more efficient use of tool and compute budgets. The researchers introduce two new techniques: a simple “Budget Tracker” and a more comprehensive framework called “Budget Aware […]
Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam
Nous Research, the San Francisco-based artificial intelligence startup, released on Tuesday an open-source mathematical reasoning system called Nomos 1 that achieved near-elite human performance on this year’s William Lowell Putnam Mathematical Competition, one of the most prestigious and notoriously difficult undergraduate math contests in the world. The Putnam is known for its difficulty: While a […]
GPT-5.2 first impressions: a powerful update, especially for business tasks and workflows
OpenAI has officially released GPT-5.2, and the reactions from early testers — among whom OpenAI seeded the model several days prior to public release, in some cases weeks ago — paints a two toned picture: it is a monumental leap forward for deep, autonomous reasoning and coding, yet potentially an underwhelming “incremental” update for casual […]
OpenAI’s GPT-5.2 is here: what enterprises need to know
The rumors were true: OpenAI on Thursday announced the release of its new frontier large language model (LLM) family, GPT-5.2. It comes at a pivotal moment for the AI pioneer, which has faced intensifying pressure since rival Google’s Gemini 3 LLM seized the top spot on major third-party performance leaderboards and many key benchmarks last […]
Marble enters the race to bring AI to tax work, armed with $9 million and a free research tool
Marble, a startup building artificial intelligence agents for tax professionals, has raised $9 million in seed funding as the accounting industry grapples with a deepening labor shortage and mounting regulatory complexity. The round, led by Susa Ventures with participation from MXV Capital and Konrad Capital, positions Marble to compete in a market where AI adoption […]
Creating a glass box: How NetSuite is engineering trust into AI
Presented by Oracle NetSuite When any company tells you it is their biggest product release in almost three decades, it’s worth listening. When the person saying it founded the world’s first cloud computing company, it’s time to take note. At SuiteWorld 2025, Evan Goldberg, founder and EVP of Oracle NetSuite, did just that when he […]
The 70% factuality ceiling: why Google’s new ‘FACTS’ benchmark is a wake-up call for enterprise AI
There’s no shortage of generative AI benchmarks designed to measure the performance and accuracy of a given model on completing various helpful enterprise tasks — from coding to instruction following to agentic web browsing and tool use. But many of these benchmarks have one major shortcoming: they measure the AI’s ability to complete specific problems […]
Mistral launches powerful Devstral 2 coding model including open source, laptop-friendly version
French AI startup Mistral has weathered a rocky period of public questioning over the last year to emerge, now here in December 2025, with new, crowd-pleasing models for enterprise and indie developers. Just days after releasing its powerful open source, general purpose Mistral 3 LLM family for edge devices and local hardware, the company returned […]
The AI that scored 95% — until consultants learned it was AI
Presented by SAP When SAP ran a quiet internal experiment to gauge consultant attitudes toward AI, the results were striking. Five teams were asked to validate answers to more than 1,000 business requirements completed by SAP’s AI co-pilot, Joule for Consultants — a workload that would normally take several weeks. Four teams were told the […]
OpenAI report reveals a 6x productivity gap between AI power users and everyone else
The tools are available to everyone. The subscription is company-wide. The training sessions have been held. And yet, in offices from Wall Street to Silicon Valley, a stark divide is opening between workers who have woven artificial intelligence into the fabric of their daily work and colleagues who have barely touched it. The gap is […]
