New Deepseek model drastically reduces resource usage by converting text and documents into images — 'vision-text compression' uses up to 20 times fewer tokens

Chinese developers of Deepseek AI have released a new model that leverages its multi-modal capabilities to improve the efficiency of its handling of complex documents and large blocks of text, by converting them into images first, as per SCMP. Vision encoders were able to take large quantities of text and convert them into images, which, when accessed later, required between seven and 20 times fewer tokens, while maintaining an impressive level of accuracy.

Deepseek is the Chinese-developed AI that

Source link

New Deepseek model drastically reduces resource usage by converting text and documents into images — ‘vision-text compression’ uses up to 20 times fewer tokens

Tom’s Hardware

Tom’s Hardware

Events

Trending

35+ Mac apps – build your own bundle from $2.50

Issue Subscribed 5% On Day 1 So Far

Lehar Footwears announced H1FY26 and Q2FY26 results, Reports Strong Revenue and PAT Growth

Grab to invest $60m in Vay’s remote-driven EV service

Useful Links

Categories

Startups

Legal

Popular This Week

Editor's Pick

What Are You Looking For?

Recent

What Are You Looking For?

Recent

What Are You Looking For?

Recent

New Deepseek model drastically reduces resource usage by converting text and documents into images — ‘vision-text compression’ uses up to 20 times fewer tokens

iPad Pro M5 review: Apple’s powerhouse tablet just raised the bar again

I Tested the M5 iPad Pro: It’s Overkill for Most People, and That’s the Point

You may also like

Events

Trending

Useful Links

Categories

Startups

Legal

Popular This Week

Editor's Pick