TechTakes

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

Google's Gemini 2.5 pro is out of beta. (awful.systems)

submitted 1 day ago* (last edited 1 day ago) by diz@awful.systems to c/techtakes@awful.systems

I love to show that kind of shit to AI boosters. (In case you're wondering, the numbers were chosen randomly and the answer is incorrect).

They go waaa waaa it's not a calculator, and then I can point out that it got the leading 6 digits and the last digit correct, which is a lot better than it did on the "softer" parts of the test.
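For anyone who wants to score an answer the same way, here is a minimal sketch of the "leading digits plus last digit" check. The operands are hypothetical stand-ins (the actual numbers from the post are not reproduced here), and the "claimed" answer is fabricated by bumping one middle digit of the exact product purely to have something wrong to compare against. It assumes positive integers.

```python
def digit_agreement(claimed: int, exact: int) -> tuple[int, bool]:
    """Return (count of matching leading digits, whether the last digit matches).

    Assumes both values are positive integers. If the two numbers have
    different lengths, the leading-digit count is still a plain
    left-to-right string comparison.
    """
    a, b = str(claimed), str(exact)
    leading = 0
    for x_digit, y_digit in zip(a, b):
        if x_digit != y_digit:
            break
        leading += 1
    return leading, a[-1] == b[-1]


# Hypothetical operands, not the ones from the post.
x, y = 48_271_593, 7_306_142
exact = x * y  # Python ints are arbitrary precision, so this is exact.

# Fabricate a wrong "claimed" answer by bumping one digit in the middle.
s = str(exact)
mid = len(s) // 2
bumped = str((int(s[mid]) + 1) % 10)
claimed = int(s[:mid] + bumped + s[mid + 1:])

leading, last_ok = digit_agreement(claimed, exact)
print(f"exact:   {exact}")
print(f"claimed: {claimed}")
print(f"matching leading digits: {leading}, last digit matches: {last_ok}")
```

An answer can score well on both counts and still be wrong, which is the whole point.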

[–] codexarcanum@lemmy.dbzer0.com 1 points 11 hours ago (1 children)

I posted a top-level comment about this too, but Anthropic has done some research on this. The section on reasoning models discusses math, I believe. The short version is that the model has a bunch of math in its corpus, so it can approximate math (kind of, seemingly, similar to how you'd do a back-of-the-envelope calculation in your head to get the orders of magnitude right), but it can't actually do calculations, which is why it often gets the specifics wrong.
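To make the back-of-the-envelope analogy concrete, here is a rough sketch of what such an estimate looks like next to the exact product. It illustrates the analogy only and says nothing about how any model computes internally. The operands and the helper names `round_sig` and `last_digit_of_product` are made up for illustration.

```python
def round_sig(n: int, sig: int = 2) -> int:
    """Round a positive integer to `sig` significant digits (half rounds up)."""
    digits = len(str(n))
    factor = 10 ** max(digits - sig, 0)
    return (n + factor // 2) // factor * factor


def last_digit_of_product(a: int, b: int) -> int:
    """The last digit of a*b depends only on the last digits of a and b."""
    return (a % 10) * (b % 10) % 10


# Hypothetical operands, chosen only for illustration.
x, y = 48_271_593, 7_306_142

exact = x * y                            # exact, arbitrary-precision product
estimate = round_sig(x) * round_sig(y)   # head-math style estimate

print(f"exact:    {exact}")
print(f"estimate: {estimate}")
print(f"relative error: {abs(estimate - exact) / exact:.2%}")
print(f"last digit from operands alone: {last_digit_of_product(x, y)}")
```

The estimate lands in the right order of magnitude and the last digit comes for free from the operands, while everything in between needs the actual multiplication.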

[–] froztbyte@awful.systems 3 points 6 hours ago

reasoning models

that’s a shot, everyone drink up