Nvidia continues to set the bar higher. In his CES keynote on Monday, the chip giant’s CEO Jensen Huang announced the launch of the Vera Rubin supercomputing platform, which delivers five times the compute power of the previous-generation platform and is arriving months ahead of schedule.
Huang said that the Vera Rubin platform is now in “full production” and will be available to customers in the second half of 2026. The platform comprises six chip types and 1,152 GPUs in total across 16 server racks, all working in concert to reduce training time and inference costs.
- The platform is particularly well suited to agentic AI, advanced reasoning, and massive-scale mixture-of-experts, a machine-learning approach that draws on multiple specialized models.
- Huang touted the Rubin platform’s ability to do more with less: it cuts inference token costs by up to a factor of 10 and requires a quarter as many GPUs to train mixture-of-experts models compared with Nvidia’s current Blackwell platform.
“The amount of computation necessary for AI is skyrocketing,” Huang said in his keynote. “The demand for NVIDIA GPUs is skyrocketing. It's skyrocketing because models are increasing by a factor of 10, an order of magnitude every single year.”
Nvidia has been talking about Vera Rubin for a while now. The company first teased the platform at the Computex conference in 2024 and laid out a clearer roadmap for the tech at its GTC conference in March 2025. However, the platform wasn’t expected to enter full production until mid-2026, putting Monday’s announcement roughly six months ahead of schedule.