this post was submitted on 22 Dec 2024

Technology

communist@lemmy.frozeninferno.xyz 2 points 20 hours ago* (last edited 20 hours ago)

You're probabilistic too; you just have an internal verifier. You think things that are silly, and then decide not to say them, all the time. A human being often thinks things that they realize are silly before they say them... That's an entirely unfair goal in the first place from my perspective. Why does it have to be non-probabilistic?

Are you not a general intelligence because sometimes your brain thinks silly things?

o3 currently works precisely that way, by the way: it generates hundreds of possible things, and then uses something that checks if the steps actually work before it outputs. In fact, they then reinforce it on these correct logical steps, so it becomes better at not outputting illogical answers, like you said.
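To make the generate-then-verify idea concrete, here's a toy sketch. This is not o3's actual implementation (which isn't public); `propose`, `verify`, and `answer` are made-up names for illustration, and the "generator" is just a noisy guesser for multiplication. The point is only that a sloppy probabilistic proposer plus a checker can give reliable output.

```python
import random

def propose(a, b, rng):
    """Hypothetical noisy generator: guesses a*b, but is often off by a little."""
    return a * b + rng.choice([-2, -1, 0, 0, 1, 2])

def verify(a, b, candidate):
    """Hypothetical verifier: checks the guess by an independent route (division).

    Assumes a != 0.
    """
    return candidate % a == 0 and candidate // a == b

def answer(a, b, n_samples=200, seed=0):
    """Sample many candidates, keep only those the verifier accepts."""
    rng = random.Random(seed)
    candidates = [propose(a, b, rng) for _ in range(n_samples)]
    verified = [c for c in candidates if verify(a, b, c)]
    # The "silly thoughts" are still generated; they just never get said.
    return verified[0] if verified else None

print(answer(17, 23))
```

Most of the individual guesses are wrong, but the only thing that ever gets output is a guess that passed the check.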

It's interesting that you said "not on the probability of the next word, but on context and rationality".

Context IS precisely that: you know what's likely to come next because of the context. That's you understanding context. YOU as a human being don't even always get this right. You must realize we are not perfect beings; we think of possibilities and choose the right one. I think we're much better at this right now, but I don't think that's a fundamental difference between us and o3.

Rationality is the internal verifier.

"Something that doesn’t require thousands of hours of training to update and instead is capable of ingesting and rationalize new information on the fly."

Being able to do this is... exactly what ARC-AGI was testing. That's literally the entire point of the benchmark, and it can do that.

I've done the test, by the way. I solved it by brute-forcing possible solutions in my head, then checking if they were true... Did you just divine the answers instantly?
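For what "brute-forcing possible solutions, then checking them" looks like on a grid puzzle, here's a minimal sketch. Real ARC-AGI tasks are far richer than this; the four candidate transformations and the `solve` helper are assumptions picked purely for illustration. The idea: try each candidate rule against the worked examples, and only apply a rule to the test input if it explains all of them.

```python
def identity(g):
    return [list(row) for row in g]

def flip_h(g):
    """Mirror each row left-to-right."""
    return [list(row[::-1]) for row in g]

def flip_v(g):
    """Mirror the grid top-to-bottom."""
    return [list(row) for row in g[::-1]]

def transpose(g):
    return [list(col) for col in zip(*g)]

# Hypothetical hypothesis space; a human solver's is much larger and fuzzier.
CANDIDATES = {
    "identity": identity,
    "flip_h": flip_h,
    "flip_v": flip_v,
    "transpose": transpose,
}

def solve(train_pairs, test_input):
    """Return (rule name, predicted output) for the first rule that fits every example."""
    for name, fn in CANDIDATES.items():
        if all(fn(x) == y for x, y in train_pairs):
            return name, fn(test_input)
    return None, None

train = [([[1, 0], [2, 3]], [[0, 1], [3, 2]])]
print(solve(train, [[4, 5, 6]]))  # ('flip_h', [[6, 5, 4]])
```

Nothing here is "divined": the wrong rules get tried and thrown out, and only the one that survives the check gets used.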