artificialfish

joined 6 days ago
[–] artificialfish@programming.dev 2 points 8 hours ago (1 children)

Well I think you actually need to train a "discriminator" model on rationality tests. Probably an encoder only model like BERT just to assign a score to thoughts. Then you do monte carlo tree search.

 

Generate 5 thoughts, prune 3, branch, repeat. I think that’s what o1 pro and o3 do

[–] artificialfish@programming.dev 1 points 18 hours ago* (last edited 16 hours ago)

Meta? The one that released Llama 3.3? The one that actually publishes its work? What are you talking about?

Why is it so hard to believe that deepseek is just yet another amazing paper in a long line of research done by everyone. Just because it’s Chinese? Everyone will adapt to this amazing innovation and then stagnate and throw compute at it until the next one. That’s how research works.

Not to mention China has billions of more people to establish a research community…

[–] artificialfish@programming.dev 1 points 18 hours ago

I think “just writing better code” is a lot harder than you think. You actually have to do research first you know? Our universities and companies do research too. But I guarantee using R1 techniques on more compute would follow the scaling law too. It’s not either or.

[–] artificialfish@programming.dev 12 points 3 days ago* (last edited 3 days ago)
[–] artificialfish@programming.dev 2 points 3 days ago (6 children)

Nah, o1 has been out how long? They are already on o3 in the office.

It’s completely normal a year later for someone to copy their work and publish it.

It probably cost them less because they probably just distilled o1 XD. Or might have gotten insider knowledge (but honestly how hard could CoT fine tuning possibly be?)

They are fine at programming numpy and sympy given an interface, and they are surprisingly good at explaining advanced symbolic math concepts. I wouldn't expect them to be good at arithmetic, but a good reasoning model should be really good at mathematical reasoning.

Good catch, that’s probably what’s happening then

[–] artificialfish@programming.dev 5 points 5 days ago (2 children)

TBH, I find the search feature on the block instances on your profile tab useful, HOWEVER, it shouldn't require the instance to exist or show up in search. It should let me put in any string. Just in case search is broken somehow.

I just don't know what blocked means then. It looks like I can see hexbear communities, probably comment on the, and even subscribe to them. But I guess they can't comment in programming.dev communities or see our stuff?

 

So I want to block hexbear.net. I see it's in the instance blocked list. However I still see their communities on community search. HOWEVER, I also can't block them from my profile, it doesn't give me the option. Known bug? User Error?

 

With so many engineers on here I'm surprised it doesn't come up in search.

I don’t care who federates. We can always ban their instance: