this post was submitted on 17 Mar 2025
571 points (96.9% liked)


Half of LLM users (49%) think the models they use are smarter than they are, including 26% who think their LLMs are “a lot smarter.” Another 18% think LLMs are as smart as they are. Here are some of the other attributes they see:

  • Confident: 57% say the main LLM they use seems to act in a confident way.
  • Reasoning: 39% say the main LLM they use shows the capacity to think and reason at least some of the time.
  • Sense of humor: 32% say their main LLM seems to have a sense of humor.
  • Morals: 25% say their main model acts like it makes moral judgments about right and wrong at least sometimes.
  • Sarcasm: 17% say their prime LLM seems to respond sarcastically.
  • Sad: 11% say the main model they use seems to express sadness, while 24% say that model also expresses hope.
[–] blady_blah@lemmy.world 14 points 1 day ago (2 children)

You say this like it's wrong.

Think of a question that you would ask an average person, and then think of what the LLM would respond with. The vast majority of the time, the LLM would be more correct than most people.

[–] LifeInMultipleChoice@lemmy.dbzer0.com 17 points 1 day ago (1 children)

A good example is the post on here about tax brackets. Far more Republicans than Democrats didn't know how tax brackets worked. But every mainstream language model would have gotten the answer right.
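
As an aside on the tax-bracket point, here is a minimal sketch of how marginal brackets work (the thresholds and rates below are made up for illustration, not real tax law); the common misconception is that crossing into a higher bracket taxes your entire income at the higher rate:

```python
# Illustrative marginal tax calculation. Bracket boundaries and rates
# are hypothetical, chosen only to show the mechanism.
BRACKETS = [
    (0, 10_000, 0.10),             # first $10,000 taxed at 10%
    (10_000, 40_000, 0.20),        # income from $10,000 to $40,000 at 20%
    (40_000, float("inf"), 0.30),  # everything above $40,000 at 30%
]

def marginal_tax(income: float) -> float:
    """Tax owed when only the income inside each bracket is taxed at that bracket's rate."""
    tax = 0.0
    for lower, upper, rate in BRACKETS:
        if income > lower:
            tax += (min(income, upper) - lower) * rate
    return tax

# Crossing a bracket threshold does not re-tax the whole income:
print(marginal_tax(41_000))  # 1000 + 6000 + 300 = 7300.0, not 41_000 * 0.30 = 12_300
```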

[–] smeenz@lemmy.nz 6 points 1 day ago* (last edited 1 day ago)

I bet the LLMs also know who pays tariffs

[–] JacksonLamb@lemmy.world 8 points 1 day ago (1 children)
[–] blady_blah@lemmy.world 0 points 1 day ago (2 children)

Then ask it a logic question. What question are you asking that the LLMs are getting wrong and your average person is getting right? How are you proving intelligence here?

[–] JacksonLamb@lemmy.world 1 points 15 hours ago (1 children)

LLMs are autocorrect.

Let's use a standard definition like "intelligence is the ability to acquire, understand, and use knowledge."

It can acquire (learn) and use (access, output) data but it lacks the ability to understand it.

This is why we have AI telling people to use glue on pizza or drink bleach.

I suggest you sit down with an AI some time and put a few versions of the Trolley Problem to it. You will likely see what is missing.

[–] blady_blah@lemmy.world 1 points 15 hours ago (1 children)

I think this all has to do with how you are going to compare and pick a winner in intelligence. The traditional way is usually with questions, which LLMs tend to do quite well at. They have the tendency to hallucinate, but the amount they hallucinate is less than the amount they don't know, in my experience.

The issue is really all about how you measure intelligence. Is it a word problem? A knowledge problem? A logic problem? And then the issue is, can the average person get your question correct? A big part of my statement here is that the average person is not very capable of answering those types of questions.

In this day and age of alternate facts, vaccine denial, science denial, and other ways that your average person may try to be intentionally stupid... I put my money on an LLM winning the intelligence competition versus the average person. I think the LLM would beat me in 90% of the topics.

So, the question to you is: how do you create this competition? What are the questions you're going to ask that the average person is going to get right and the LLM will get wrong?

[–] JacksonLamb@lemmy.world 1 points 14 hours ago

I have already suggested it. Trolley problems etc. Ask it to tell you its reasoning. It always becomes preposterous sooner or later.

My point here is that remembering the correct answer or performing a mathematical calculation are not a measure of understanding.

What we are looking for that sets apart INTELLIGENCE is an ability to genuinely understand. LLMs don't have that, any more than older autocorrects did.

[–] eletes@sh.itjust.works 1 points 23 hours ago (2 children)

How many Rs are there in the word strawberry?

[–] blady_blah@lemmy.world 0 points 22 hours ago (1 children)

I asked Gemini and ChatGPT (the free one) and they both got it right. How many people do you think would get that right if you didn't write it down in front of them? If Copilot gets it wrong, as per eletes' post, then the AI success rate is 66%. Ask your average person walking down the street and I don't think you would do any better. Plus there are a million questions on which the LLMs would vastly outperform your average human.
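
For what it's worth, the ground truth here is trivial to verify in code (a throwaway check, not something any of the models ran):

```python
# Count the letter "r" in "strawberry". LLMs sometimes miscount letters
# because they process text as tokens rather than individual characters.
word = "strawberry"
print(word.count("r"))  # prints 3
```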

[–] JacksonLamb@lemmy.world 0 points 15 hours ago* (last edited 15 hours ago)

I think you might know some really stupid or perhaps just uneducated people. I would expect 100% of people to know how many Rs there are in "strawberry" without looking at it.

Nevertheless, spelling is memory and memory is not intelligence.

[–] BlushedPotatoPlayers@sopuli.xyz -1 points 22 hours ago (1 children)

That was a very long time ago; that's fine now

[–] eletes@sh.itjust.works 4 points 22 hours ago

Literally just asked Copilot through our work subscription

I know it looks like I'm shitting on LLMs, but really I'm just trying to highlight that they still have gaps in reasoning that they'll probably fix in this decade.