Alibaba Cloud has revealed a new GPU pooling system that cut the number of Nvidia accelerators needed for large-scale inference by more than 80%. The system, known as Aegaeon, was presented at the 2025 SOSP conference in Korea after being piloted in Alibaba's own production environment. By letting multiple large language models share a single GPU, it shrinks the hardware footprint for inference workloads to a fraction of what was previously required.
The company claims it served dozens of LLMs…
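The core idea of GPU pooling, serving several models from one accelerator by time-slicing it between them rather than dedicating a GPU per model, can be illustrated with a toy scheduler. This is a minimal sketch of the general technique, not Aegaeon's actual design or API; the class names, the fixed decode quantum, and the simulated weight swap are all assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass, field

@dataclass
class Request:
    model: str            # which LLM this request targets
    tokens_left: int      # decode steps remaining
    output: list = field(default_factory=list)

class PooledGPU:
    """Toy single-GPU pool: interleaves decode steps of requests for
    different models on one device, instead of pinning one GPU per model.
    Illustrative only; real systems schedule at finer granularity and
    overlap weight loading with compute."""

    def __init__(self, quantum=2):
        self.quantum = quantum      # decode steps before yielding the GPU
        self.queue = deque()
        self.loaded_model = None
        self.swaps = 0              # how often model weights were switched

    def submit(self, req):
        self.queue.append(req)

    def run(self):
        completed = []
        while self.queue:
            req = self.queue.popleft()
            if req.model != self.loaded_model:
                self.loaded_model = req.model   # simulate a weight swap
                self.swaps += 1
            for _ in range(self.quantum):       # run a short decode burst
                if req.tokens_left == 0:
                    break
                req.output.append(f"{req.model}-tok")
                req.tokens_left -= 1
            if req.tokens_left > 0:
                self.queue.append(req)          # not done: requeue, yield GPU
            else:
                completed.append(req)
        return completed

# Three models' requests all complete on a single pooled GPU.
gpu = PooledGPU(quantum=2)
for name, n in [("model-a", 3), ("model-b", 2), ("model-c", 4)]:
    gpu.submit(Request(model=name, tokens_left=n))
done = gpu.run()
```

The trade-off the sketch makes visible is the swap counter: sharing one GPU among many models saves hardware but pays for each model switch, which is exactly the overhead a production pooling system must keep small.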