this post was submitted on 23 Nov 2023
184 points (91.8% liked)

Technology

58306 readers
4911 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] 5BC2E7@lemmy.world 27 points 10 months ago (3 children)

well now this is getting interesting beyond gossip. I doubt they made a significant AGI-related breakthrough but it might be something really cool and useful.

[–] guitarsarereal@sh.itjust.works 51 points 10 months ago* (last edited 10 months ago) (2 children)

According to the article, they got an experimental LLM to reliably perform basic arithmetic, which would be a pretty substantial improvement if true. IE instead of stochastically guessing or offloading it to an interpreter, the model itself was able to reliably perform a reasoning task that LLM's have struggled with so far.

It's rather exciting, tbh. it kicks open the door to a whole new universe of applications, if true. It's only technically a step in the direction of AGI, though, since technically if AGI is possible every improvement like this counts as a step towards it. If this development is really what triggered the board coup, though, then it sort of makes the board coup group look even more ridiculous than they did before. This is step 1 to making a model that can be tasked with ingesting spreadsheets and doing useful math on them. And I say that as someone who leans pretty pessimistically in the AI safety debate.

[–] maegul@lemmy.ml 16 points 10 months ago (3 children)

Being a layperson in this, I’d imagine part of the promise is that once you’ve got reliable arithmetic, you can get logic and maths in there too and so get the LLM to actually do more computer-y stuff but with the whole LLM/ChatGPT wrapped around it as the interface.

That would mean more functionality, perhaps a lot more of it works and scales, but also, perhaps more control and predictability and logical constraints. I can see how the development would get some excited. It seems like a categorical improvement.

[–] FaceDeer@kbin.social 15 points 10 months ago (1 children)

If that's the case then bad news for OpenAI's "moat" (and for people arguing for restraint in general): there's been some recent breakthroughs in getting open-source LLMs trained to understand math as well.

It'd be hilarious if OpenAI's board went through huge turmoil, tanked tens of billions of dollars worth of investments, disrupted their partnership with Microsoft to protect this huge revolution they've got brewing in their most secret and secure of laboratories... and then someone posts "hey, I got my AI Waifu to count good, check out this github to see how I did it" on Reddit.

[–] Buddahriffic@lemmy.world 2 points 10 months ago

It also brings into question (well, it adds to the questions, they were already brought up) the whole premise of IP law that "if we don't protect it properly, no one will want to invent things". It seems to me like people like creating things and humanity has a strange habit of converging on new inventions from multiple directions. Kinda like how calculus was invented by two different people at the same time.

[–] perviouslyiner@lemm.ee 2 points 10 months ago* (last edited 10 months ago) (1 children)

Always wondered why the text model didn't just put its output through something like MATLAB or Mathematica once it got as far as having something which requires domain-specific tools.

Like when Prof. Moriarty tried it on a quantum physics question and it got as far as writing out the correct formula before failing to actually calculate the result

[–] hamptonio@lemmy.world 3 points 10 months ago

There is definitely a lot of effort in this direction, seems very likely that a hybrid system could be very powerful.

[–] guitarsarereal@sh.itjust.works 1 points 10 months ago

More or less, yeah. It's been possible to do math with llm's by hooking in an interpreter, but it's a pain in the ass and in my limited experience required tons of custom prompting. Now, hooking in an interpreter and having it do math directly on the processor instead of using the neural net architecture is going to be orders of magnitude faster and more power-efficient, but also, with rudimentary reasoning comes the possibility of improving an llm's ability to hook itself into external tools without lots of help from the user.

[–] Wanderer@lemm.ee 2 points 10 months ago* (last edited 10 months ago)

I kinda just realised that the two aspects of this. The LLM part and the basic maths part. Doesn't this look set to destroy thousands of accounting jobs?

Surely this isn't far off doing a lot of the accounting work. Maybe even an app than a small business puts their info into it and that app keeps track of it for a year and then goes to an accountant that needs to look over it for an hour instead of sorting all the shit out for 10 hours

[–] Neato@kbin.social 17 points 10 months ago

If they had a real breakthrough this circus wouldn't be necessary.

[–] Benj1B@sh.itjust.works 8 points 10 months ago

The maker of ChatGPT had made progress on Q* (pronounced Q-Star), which some internally believe could be a breakthrough in the startup's search for superintelligence, also known as artificial general intelligence (AGI), one of the people told Reuters. OpenAI defines AGI as AI systems that are smarter than humans.

Definitely seems AGI related. Has to do with acing mathematical problems - I can see why a generative AI model that can learn, solve, and then extrapolate mathematical formulae could be a big breakthrough.