NYU’s new AI architecture makes high-quality image generation faster and cheaper
Researchers at New York University have developed a new architecture for diffusion models that improves the semantic representation of the images they generate. “Diffusion Transformer with Representation Autoencoders” (RAE) challenges some of the accepted norms of building diffusion models. The NYU researchers’ model is more efficient and accurate than standard diffusion models, takes advantage of […]
Moonshot’s Kimi K2 Thinking emerges as leading open source AI, outperforming GPT-5, Claude Sonnet 4.5 on key benchmarks
Even as concern and skepticism grow over U.S. AI startup OpenAI’s buildout strategy and high spending commitments, Chinese open source AI providers are escalating their competition, and one has even caught up to OpenAI’s flagship paid proprietary model, GPT-5, in key third-party performance benchmarks with a new, free model. The Chinese AI startup Moonshot AI’s […]
Why Google’s File Search could displace DIY RAG stacks in the enterprise
By now, enterprises understand that retrieval augmented generation (RAG) allows applications and agents to find the best, most grounded information for queries. However, typical RAG setups can be an engineering challenge and can also exhibit undesirable traits. To help solve this, Google released the File Search Tool on the Gemini API, a fully managed RAG system […]
Google debuts AI chips with 4X performance boost, secures Anthropic megadeal worth billions
Google Cloud is introducing what it calls its most powerful artificial intelligence infrastructure to date, unveiling a seventh-generation Tensor Processing Unit and expanded Arm-based computing options designed to meet surging demand for AI model deployment — what the company characterizes as a fundamental industry shift from training models to serving them to billions of users. […]
The compute rethink: Scaling AI where data lives, at the edge
Presented by Arm. AI is no longer confined to the cloud or data centers. Increasingly, it’s running directly where data is created: in devices, sensors, and networks at the edge. This shift toward on-device intelligence is being driven by latency, privacy, and cost concerns that companies are confronting as they continue their investments in […]
From prototype to production: What vibe coding tools must fix for enterprise adoption
Presented by Salesforce. Vibe coding, the fast-growing trend of using generative AI to spin up code from plain-language prompts, is quick, creative, and great for instant prototypes. But many argue that it’s not cut out for building production-ready business apps with the security, governance, and trusted infrastructure that enterprises require. In other words, […]
Google Cloud updates its AI Agent Builder with new observability dashboard and faster build-and-deploy tools
Google Cloud has introduced a major update in a bid to keep AI developers on its Vertex AI platform for concepting, designing, building, testing, deploying and modifying AI agents in enterprise use cases. The new features, announced today, include additional governance tools for enterprises and expanded capabilities for creating agents with just a few […]
AI’s capacity crunch: Latency risk, escalating costs, and the coming surge-pricing breakpoint
The latest big headline in AI isn’t model size or multimodality — it’s the capacity crunch. At VentureBeat’s latest AI Impact stop in NYC, Val Bercovici, chief AI officer at WEKA, joined Matt Marshall, VentureBeat CEO, to discuss what it really takes to scale AI amid rising latency, cloud lock-in, and runaway costs. Those forces, […]
From logs to insights: The AI breakthrough redefining observability
Presented by Elastic. Logs set to become the primary tool for finding the “why” in diagnosing network incidents. Modern IT environments have a data problem: there’s too much of it. Organizations that need to manage a company’s environment are increasingly challenged to detect and diagnose issues in real time, optimize performance, improve reliability, and ensure security […]
Databricks research reveals that building better AI judges isn’t just a technical concern, it’s a people problem
The intelligence of AI models isn’t what’s blocking enterprise deployments. It’s the inability to define and measure quality in the first place. That’s where AI judges are now playing an increasingly important role. In AI evaluation, a “judge” is an AI system that scores outputs from another AI system. Judge Builder is Databricks’ framework for […]
