Technology

73767 readers

3605 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

318

Google is redesigning its search engine — and it’s AI all the way down (www.theverge.com)

submitted 1 year ago by morrowind@lemmy.ml to c/technology@lemmy.world

110 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] lemmyvore@feddit.nl 27 points 1 year ago* (last edited 1 year ago) (4 children)

They don't really have a choice. Classic website search will be useless in the near future because of the rapid rise of LLM-generated pages. Already for some searches 1 out of 3 results is generated crap.

Their only hope it's that somehow they'll be able to weed out LLM pages with LLM. Which is something that scientists say it's impossible because LLMs cannot learn from LLM results so they won't be able to reliably tell which content is good.

The fact they're even trying this shows they're desperate, so they will try.

[–] wagoner@infosec.pub 15 points 1 year ago

If they can't direct me to the right web site because they can't tell what's LLM junk, then how will they summarize an answer for me based on those same web sites they know about? It doesn't seem like LLM summaries are a way to avoid that issue at all.

[–] eager_eagle@lemmy.world 3 points 1 year ago

Well, it's not exactly impossible because of that, it's just unlikely they'll use a discriminator for the task because great part of generated content is effectively indistinguishable from human-written content - either because the model was prompted to avoid "LLM speak", or because the text was heavily edited. Thus they'd risk a high false positive rate.

[–] QuadratureSurfer@lemmy.world 3 points 1 year ago

Do you have a source for those scientists you're referring to?

I know that LLMs can be trained on data output by other LLMs, but you're basically diluting your results unless you do a lot of work to clean up the data.

I wouldn't say it's "impossible" to determine if content was generated by an LLM, but I agree that it will not be reliable.

[–] Etterra@lemmy.world 2 points 1 year ago

So far I'm mostly unaffected by this. That's probably because I usually internet mostly for niche hobbies and occasionally practical things and shopping. Like apartment hunting, since the industry is too spread out for anybody to get in bed with Google enough to get a big boost up the AI idiocy. Except maybe apartments.com, but that's where I've always ended up anyway even back before Google's enshitification.