this post was submitted on 06 Jan 2025
775 points (99.9% liked)

TechTakes

1533 readers
151 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] killingspark@feddit.org 11 points 3 days ago (2 children)

I have seen these 3 bit ai papers on hacker news a few times. And the takeaway apparently is: the current models are being pretty shitty at what we want them to do, and we can reach a similar (but slightly worse) level of shittyness with 3 bits.

But that doesn't say anything about how both technologies could progress in the future. I guess you can compensate for having only three bits to pass between nodes by just having more nodes. But that doesn't really seem helpful, neither for storage nor compute.

Anyways yeah it always strikes me as a kind of trend that maybe has an application in a very specific niche but is likely bullshit if applied to the general case

[–] BlueMonday1984@awful.systems 3 points 2 days ago

Far as I can tell, the only real benefit here is significant energy savings, which would take LLMs from "useless waste of a shitload of power" to "useless waste of power".

[–] V0ldek@awful.systems 12 points 3 days ago (1 children)

If anything that sounds like an indictment? Like, the current models are so incredibly fucking bad that we could achieve the same with three bits and a ham sandwich

[–] killingspark@feddit.org 5 points 3 days ago

Oh it definitely says something about the current models for sure