Technology

73429 readers

4741 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

487

AI is overhyped and unreliable -Goldman Sachs (calckey.world)

submitted 1 year ago by tek@calckey.world to c/technology@lemmy.world

80 comments fedilink hide all child comments

AI is overhyped and unreliable -Goldman Sachs

https://www.404media.co/goldman-sachs-ai-is-overhyped-wildly-expensive-and-unreliable/

"Despite its expensive price tag, the technology is nowhere near where it needs to be in order to be useful for even such basic tasks"

@technology@lemmy.world

you are viewing a single comment's thread
view the rest of the comments

[+] chrash0@lemmy.world -6 points 1 year ago (1 children)

it’s super weird that people think LLMs are so fundamentally different from neural networks, the underlying technology. neural network architectures are constantly improving, and LLMs are just a product of a ton of research and an emergence after the discovery of the transformer architecture. what LLMs have shown us is that we’re definitely on the right track using neural networks to solve a wide range of problems classified as “AI”

[–] HackyHorse3000@lemmy.world 16 points 1 year ago

I think the main problem is applying LLM outside the domain of "complete this sentence". It's fine for what it is, and trained on huge datasets it obviously appears impressive, but it doesn't know if it's right or wrong, and evaluation metrics are different. In most traditional applications of neural networks, you have datasets with right and wrong answers, that's not how these are trained, as there is no "right" answer to "tell me a joke." So the training has to be based on what would likely fill in the blank. This could be an actual joke, a bad joke, a completely different topic, there's no difference in the training data. The biases, incorrect answers, all the faults of this massive dataset are inherent in the model, and there's no fixing that. They are fundamentally different in their application and evaluation (this extends to training) methods from other neural networks that are actually effective at what they do, like image processing and identification. The scope of what they're trying to do with a finite dataset is not realistic and entirely unconstrained, as compared to more "traditional" neural networks, which are very narrow in scope exactly because of this issue.