VentureBeat

Tech news that matters

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy
Researchers at Nvidia have developed a technique that can reduce the memory costs of large language model reasoning by up to eight times. Their technique, called dynamic memory sparsification (DMS), c...

MiniMax's new open M2.5 and M2.5 Lightning near state-of-the-art while costing 1/20th of Claude Opus 4.6
Chinese AI startup MiniMax, headquartered in Shanghai, has sent shockwaves through the AI industry today with the release of its new M2.5 language model in two variants, which promise to make high-end...

OpenAI deploys Cerebras chips for 'near-instant' code generation in first major move beyond Nvidia
OpenAI on Thursday launched GPT-5.3-Codex-Spark, a stripped-down coding model engineered for near-instantaneous response times, marking the company's first significant inference partnership outside it...

Google Chrome ships WebMCP in early preview, turning every website into a structured tool for AI agents
When an AI agent visits a website, it’s essentially a tourist who doesn’t speak the local language. Whether built on LangChain, Claude Code, or the increasingly popular OpenClaw framework, the age...

AI inference costs dropped up to 10x on Nvidia's Blackwell — but hardware is only half the equation
Lowering the cost of inference is typically a combination of hardware and software. A new analysis released Thursday by Nvidia details how four leading inference providers are reporting 4x to 10x redu...

z.ai's open source GLM-5 achieves record low hallucination rate and leverages new RL 'slime' technique
Chinese AI startup Zhipu, aka z.ai, is back this week with an eye-popping new frontier large language model: GLM-5. The latest in z.ai's ongoing and continually impressive GLM series, it retains an open...

Anthropic’s Claude Cowork finally lands on Windows — and it wants to automate your workday
Anthropic released its Claude Cowork AI agent software for Windows on Monday, bringing the file management and task automation tool to roughly 70 percent of the desktop computing market and intensifyi...

MIT's new fine-tuning method lets LLMs learn new skills without losing old ones
When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill. Researchers at MIT, the Improbab...

NanoClaw solves one of OpenClaw's biggest security issues — and it's already powering the creator's biz
The rapid viral adoption of Austrian developer Peter Steinberger's open source AI assistant OpenClaw in recent weeks has sent enterprises and indie developers into a tizzy. It's easy to see why: OpenC...

Why enterprise IT operations are breaking — and how AgenticOps fixes them
Presented by Cisco. AI agents are breaking traditional IT operations models, adding complexity, data silos, and fragmented workflows. DJ Sampath, Cisco's SVP of AI Software and Platform, believes that ...