N
Hacker Next
new
past
show
ask
show
jobs
submit
login
▲
Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x
(
arstechnica.com
)
16 points by
gmays
22 hours ago
|
3 comments
add comment
Rendered at 13:42:57 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
redanddead 21 hours ago
[-]
You'd think it'd be bigger news on hn
axiologist 21 hours ago
[-]
See
https://news.ycombinator.com/item?id=47513475
from two days ago.