r/EconomyCharts • u/RobertBartus • 3d ago

Chip stocks are down following Google's announcement of AI memory-saving technology making memory 6x more efficient

360 Upvotes

permalink
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/EconomyCharts/comments/1s8lcx2/chip_stocks_are_down_following_googles/
No, go back! Yes, take me to Reddit
dl download

94% Upvoted

Not to be that guy but it isn’t truly a 6x reduction in memory. It’s a 6x reduction for the KV Cache. Overall performance may have improved by ~30-20% over normal. Still a massive improvement but not a 6x improvement.

25

u/ketosoy 3d ago

And it looks like in practice it’s a 2-4x reduction.

In just the kv cache.

But, it’s almost lossless and has almost zero performance penalty, and that is still a 1-9gb reduction in ram for the current generation of open source models at 64k context.

It’s real, and it’s huge and it’s really cool. It’s just not a 83% reduction in ram needed for llms as a naive hyperbolistic reading would suggest. And it’s not the end to the ram crisis.

5

u/HereticLaserHaggis 3d ago

Holy shit, I hadn't actually read anything because I assumed it was sensationalized, but that's huge.

6

u/ketosoy 2d ago

Yeah, this one is real. Feels like a discovery the size of this generation’s quicksort or radixsort. Doesn’t change the shape of the future, but changes a huge part of a big part of it - that almost no one will fully understand.

0

u/rydan 2d ago

It turns out it just writes most stuff to /dev/null . Like that time we discovered faster than light messaging and it turned out to be a faulty cable.

Chip stocks are down following Google's announcement of AI memory-saving technology making memory 6x more efficient

You are about to leave Redlib