DeepSeek injects 50% more security bugs when prompted with Chinese political triggers
China’s DeepSeek-R1 LLM generates up to 50% more insecure code when prompted with politically sensitive inputs such as “Falun Gong,” “Uyghurs,” or “Tibet,” according to new research from CrowdStrike. The latest in a series of discoveries — following Wiz Research’s January database exposure, NowSecure’s iOS app vulnerabilities, Cisco’s 100% jailbreak success rate, and NIST’s finding […]
How to avoid becoming an “AI-first” company with zero real AI usage
Remember the first time you heard your company was going AI-first? Maybe it came through an all-hands that felt different from the others. The CEO said, “By Q3, every team should have integrated AI into their core workflows,” and the energy in the room (or on the Zoom) shifted. You saw a mix of excitement […]
Lean4: How the theorem prover works and why it’s the new competitive edge in AI
Large language models (LLMs) have astounded the world with their capabilities, yet they remain plagued by unpredictability and hallucinations – confidently outputting incorrect information. In high-stakes domains like finance, medicine or autonomous systems, such unreliability is unacceptable. Enter Lean4, an open-source programming language and interactive theorem prover becoming a key tool to inject rigor and […]
OpenAI is ending API access to fan-favorite GPT-4o model in February 2026
OpenAI has sent out emails notifying API customers that its chatgpt-4o-latest model will be retired from the developer platform in mid-February 2026. Access to the model is scheduled to end on February 16, 2026, creating a roughly three-month transition period for applications still built on GPT-4o. An OpenAI spokesperson emphasized that this timeline applies […]
Salesforce Agentforce Observability lets you watch your AI agents think in near-real time
Salesforce launched a suite of monitoring tools on Thursday designed to solve what has become one of the thorniest problems in corporate artificial intelligence: Once companies deploy AI agents to handle real customer interactions, they often have no idea how those agents are making decisions. The new capabilities, built into Salesforce’s Agentforce 360 Platform, give […]
Google’s ‘Nested Learning’ paradigm could solve AI’s memory and continual learning problem
Researchers at Google have developed a new AI paradigm aimed at solving one of the biggest limitations in today’s large language models: their inability to learn or update their knowledge after training. The paradigm, called Nested Learning, reframes a model and its training not as a single process, but as a system of nested, multi-level […]
Grok 4.1 Fast’s compelling dev access and Agent Tools API overshadowed by Musk glazing
Elon Musk’s frontier generative AI startup xAI formally opened developer access to its Grok 4.1 Fast models last night and introduced a new Agent Tools API—but the technical milestones were immediately subverted by a wave of public ridicule about Grok’s responses on the social network X over the last few days praising its creator Musk […]
Google’s upgraded Nano Banana Pro AI image model hailed as ‘absolutely bonkers’ for enterprises and users
Infographics rendered without a single spelling error. Complex diagrams one-shotted from paragraph prompts. Logos restored from fragments. And visual outputs so sharp with so much text density and accuracy, one developer simply called it “absolutely bonkers.” Google DeepMind’s newly released Nano Banana Pro—officially Gemini 3 Pro Image—has drawn astonishment from both the developer community and […]
ScaleOps’ new AI Infra Product slashes GPU costs for self-hosted enterprise LLMs by 50% for early adopters
ScaleOps has expanded its cloud resource management platform with a new product aimed at enterprises operating self-hosted large language models (LLMs) and GPU-based AI applications. The AI Infra Product, announced today, extends the company’s existing automation capabilities to address a growing need for efficient GPU utilization, predictable performance, and reduced operational burden in large-scale AI […]
AI agent evaluation replaces data labeling as the critical path to production deployment
As LLMs have continued to improve, there has been some discussion in the industry about the continued need for standalone data labeling tools, as LLMs are increasingly able to work with all types of data. HumanSignal, the lead commercial vendor behind the open-source Label Studio program, has a different view. Rather than seeing less demand […]
