this post was submitted on 28 Jan 2025
97 points (100.0% liked)

Technology

cross-posted from: https://lemmy.ml/post/25282200

[–] fuzzy_feeling@programming.dev 38 points 3 days ago* (last edited 3 days ago) (10 children)

ahh... the sound of a bursting bubble.

- plop -

[–] artificialfish@programming.dev 2 points 3 days ago (6 children)

Nah, o1 has been out how long? They are already on o3 in the office.

It’s completely normal a year later for someone to copy their work and publish it.

It probably cost them less because they probably just distilled o1 XD. Or they might have gotten insider knowledge (but honestly, how hard could CoT fine-tuning possibly be?)

[–] leisesprecher@feddit.org 16 points 3 days ago (5 children)

Deepseek showed that actually putting thought into the architecture achieves much more than just throwing more hardware at the problem.

This means a) there will be much less demand for hardware, since much more could be run locally on regular consumer devices, and b) the export restrictions don't really work and instead force China to create actually better models.

That means a lot of the investment in the thousands of AI companies is in jeopardy.

[–] artificialfish@programming.dev 1 points 21 hours ago

I think “just writing better code” is a lot harder than you think. You actually have to do research first, you know? Our universities and companies do research too. But I guarantee using R1 techniques on more compute would follow the scaling law too. It’s not either/or.
