NYU’s new AI architecture makes high-quality image generation faster and cheaper

Researchers at New York University have developed a new architecture for diffusion models that improves the semantic representation of the images they generate. “Diffusion Transformer with Representation Autoencoders” (RAE) challenges some of the accepted norms of building diffusion models. The NYU researcher’s model is more efficient and accurate than standard diffusion models, takes advantage of […]

Why Google’s File Search could displace DIY RAG stacks in the enterprise

By now, enterprises understand that retrieval augmented generation (RAG) allows applications and agents to find the best, most grounded information for queries. However, typical RAG setups could be an engineering challenge and also exhibit undesirable traits.  To help solve this, Google released the File Search Tool on the Gemini API, a fully managed RAG system […]

Google debuts AI chips with 4X performance boost, secures Anthropic megadeal worth billions

Google Cloud is introducing what it calls its most powerful artificial intelligence infrastructure to date, unveiling a seventh-generation Tensor Processing Unit and expanded Arm-based computing options designed to meet surging demand for AI model deployment — what the company characterizes as a fundamental industry shift from training models to serving them to billions of users. […]

The compute rethink: Scaling AI where data lives, at the edge

Presented by Arm AI is no longer confined to the cloud or data centers. Increasingly, it’s running directly where data is created — in devices, sensors, and networks at the edge. This shift toward on-device intelligence is being driven by latency, privacy, and cost concerns that companies are confronting as they continue their investments in […]

From prototype to production: What vibe coding tools must fix for enterprise adoption

Presented by Salesforce Vibe coding — the fast-growing trend of using generative AI to spin up code from plain-language prompts — is quick, creative, and great for instant prototypes. But many argue that it’s not cut out for building production-ready business apps with the security, governance, and trusted infrastructure that enterprises require. In other words, […]

From logs to insights: The AI breakthrough redefining observability

Presented by Elastic Logs set to become the primary tool for finding the “why” in diagnosing network incidents Modern IT environments have a data problem: there’s too much of it. Organizations that need to manage a company’s environment are increasingly challenged to detect and diagnose issues in real-time, optimize performance, improve reliability, and ensure security […]