Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more
Mere hours after OpenAI updated its flagship foundation model GPT-5 to GPT-5.1, promising reduced token usage overall and a more pleasant personality with more preset options, Chinese search giant Baidu unveiled its next-generation foundation model, ERNIE 5.0, alongside a suite of AI product upgrades and strategic international expansions. The goal: to position as a global […]
Upwork study shows AI agents excel with human partners but fail independently
Artificial intelligence agents powered by the world’s most advanced language models routinely fail to complete even straightforward professional tasks on their own, according to groundbreaking research released Thursday by Upwork, the largest online work marketplace. But the same study reveals a more promising path forward: When AI agents collaborate with human experts, project completion rates […]
Databricks: ‘PDF parsing for agentic AI is still unsolved’ — new tool replaces multi-service pipelines with single function
There is a lot of enterprise data trapped in PDF documents. To be sure, gen AI tools have been able to ingest and analyze PDFs, but accuracy, time and cost have been less than ideal. New technology from Databricks could change that. The company this week detailed its “ai_parse_document” technology, now integrated with Databricks’ Agent […]
ChatGPT Group Chats are here … but not for everyone (yet)
It was originally found in leaked code and publicized by AI influencers on X, but OpenAI has made it official: ChatGPT now offers Group Chats, allowing multiple users to join the same, single ChatGPT conversation and send messages to each other and the underlying large language model (LLM), online and via its mobile apps. Imagine […]
Weibo’s new open source AI model VibeThinker-1.5B outperforms DeepSeek-R1 on $7,800 post-training budget
Another day in late 2025, another impressive result from a Chinese company in open source artificial intelligence. Chinese social networking company Weibo’s AI division recently released its open source VibeThinker-1.5B—a 1.5 billion parameter large language model (LLM) that is a fine-tuned variant of rival Chinese tech firm Alibaba’s Qwen2.5-Math-1.5B. It’s available now for free download […]
OpenAI reboots ChatGPT experience with GPT-5.1 after mixed reviews of GPT-5
ChatGPT is about to become faster and more conversational as OpenAI upgrades its flagship model GPT-5 to GPT-5.1. OpenAI announced two updates to the GPT-5 series: GPT-5.1 Instant and GPT-5.1 Thinking. Both models are now accessible on ChatGPT. GPT-5.1 Instant, essentially the default and most-used model, is now “warmer, more intelligent, and better at following […]
Inside LinkedIn’s generative AI cookbook: How it scaled people search to 1.3 billion users
LinkedIn is launching its new AI-powered people search this week, after what seems like a very long wait for what should have been a natural offering for generative AI. It comes a full three years after the launch of ChatGPT and six months after LinkedIn launched its AI job search offering. For technical leaders, this […]
How Deductive AI saved DoorDash 1,000 engineering hours by automating software debugging
As software systems grow more complex and AI tools generate code faster than ever, a fundamental problem is getting worse: Engineers are drowning in debugging work, spending up to half their time hunting down the causes of software failures instead of building new products. The challenge has become so acute that it’s creating a new […]
Baidu just dropped an open-source multimodal AI that it claims beats GPT-5 and Gemini
Baidu Inc., China’s largest search engine company, released a new artificial intelligence model on Monday that its developers claim outperforms competitors from Google and OpenAI on several vision-related benchmarks despite using a fraction of the computing resources typically required for such systems. The model, dubbed ERNIE-4.5-VL-28B-A3B-Thinking, is the latest salvo in an escalating competition among […]
Meta’s SPICE framework lets AI systems teach themselves to reason
Researchers at Meta FAIR and the National University of Singapore have developed a new reinforcement learning framework for self-improving AI systems. Called Self-Play In Corpus Environments (SPICE), the framework pits two AI agents against each other, creating its own challenges and gradually improving without human supervision. While currently a proof-of-concept, this self-play mechanism could provide […]
