this post was submitted on 07 Oct 2023
988 points (97.7% liked)

Technology

59317 readers
5567 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Previous posts: https://programming.dev/post/3974121 and https://programming.dev/post/3974080

Original survey link: https://forms.gle/7Bu3Tyi5fufmY8Vc8

Thanks for all the answers, here are the results for the survey in case you were wondering how you did!

Edit: People working in CS or a related field have a 9.59 avg score while the people that aren’t have a 9.61 avg.

People that have used AI image generators before got a 9.70 avg, while people that haven’t have a 9.39 avg score.

Edit 2: The data has slightly changed! Over 1,000 people have submitted results since posting this image, check the dataset to see live results. Be aware that many people saw the image and comments before submitting, so they've gotten spoiled on some results, which may be leading to a higher average recently: https://docs.google.com/spreadsheets/d/1MkuZG2MiGj-77PGkuCAM3Btb1_Lb4TFEx8tTZKiOoYI

you are viewing a single comment's thread
view the rest of the comments
[–] MooseBoys@lemmy.world 39 points 1 year ago (5 children)

I still don’t believe the avocado comic is one-shot AI-generated. Composited from multiple outputs, sure. But I have not once seen generative AI produce an image that includes properly rendered text like this.

[–] deranger@sh.itjust.works 57 points 1 year ago* (last edited 1 year ago) (1 children)

Bing image creator uses the new DALL-E model which does hands and text pretty good.

generated this first try with the prompt a cartoon avocado holding a sign that says 'help me'

[–] dotMonkey@lemmy.world 26 points 1 year ago (2 children)

People forget just how fast this tech is evolving

[–] S_H_K@lemmy.fmhy.net 12 points 1 year ago

Absolutely SDXL with loras already can do a lot of what it was thought impossible.

[–] seralth@lemmy.world 2 points 1 year ago

Yeah Everytime iv seen anyone say "iv never seen it" makes it really obvious how little people actually know about the tech or follow it.

They basically saw it once a year ago and think it's still the same.

[–] isildun@sh.itjust.works 10 points 1 year ago

Image generation tech has gone crazy over the past year and a half or so. At the speed it's improving I wouldn't rule out the possibility.

Here's a paper from this year discussing text generation within images (it's very possible these methods aren't SOTA anymore -- that's how fast this field is moving): https://openaccess.thecvf.com/content/WACV2023/html/Rodriguez_OCR-VQGAN_Taming_Text-Within-Image_Generation_WACV_2023_paper.html

[–] b000urns@lemmy.world 3 points 1 year ago (1 children)

Yeah I'm sceptical too, what tool and prompt was used to produce this?

[–] Mint@lemmy.one 1 points 1 year ago* (last edited 1 year ago) (1 children)

Its Dalle 3 its not that difficult to generate something like that using dalle 3 here's some shreks I generated as a showcase Shrek 1 inage

Shrek 2 Image

Shrek 3 Image

All of these are just generated nothing else

[–] b000urns@lemmy.world 1 points 1 year ago

Huh interesting it handles text relatively well

[–] kattenluik@feddit.nl 1 points 1 year ago

I found the avocado comic the easiest to tell, since the missing eyebrow was so insanely out of place.

[–] Mint@lemmy.one 1 points 1 year ago* (last edited 1 year ago) (1 children)

Its not that difficult to generate something like that using dalle 3 here's some shreks I generated as a showcase Shrek 1 inage

Shrek 2 Image

Shrek 3 Image

All of these are just generated nothing else

[–] MooseBoys@lemmy.world 1 points 1 year ago (1 children)

Prompt and tool links? I know there are tools that try to pick out label text in the prompt and composite it after the fact, but I don’t consider this one-shot AI generated, even if it’s a single tool from the user’s perspective.

[–] Mint@lemmy.one 1 points 1 year ago* (last edited 1 year ago)

Its Dalle 3 like I said. As far as in aware Dalle 3 doesn't do that since the text isn't always perfect still. Can't really provide prompts since its been a bit, and the history on it isn't great, but I was just mostly shrek in x style and saying "x" do mind you Dalle is very heavily censored now, so you're now unlikely to be able to recreate that.

It's on - https://bing.com/create