Technology

59466 readers

5251 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

Best text to image generator (lemmy.world)

submitted 10 months ago by Billd111@lemmy.world to c/technology@lemmy.world

13 comments fedilink hide all child comments

I have used several different generators. What they all seem to have in common is that they don't always display what I am asking for. Example: if I am looking for a person in jeans and t-shirt, I will get images of a person wear things totally different clothing and it isn't consistent. Another example is if I want a full body picture, that command seems to be ignored giving just waist up or just below the waist. Same goes if I ask for side views or back views. Sometimes they work. Sometimes they don't. More often they don't. I have also seen that none of the negative requests seem to actually work. If I ask for pictures of people and don't want them using cell phones or no tattoos, like magic they have cell phones. Some have tattoos. I have noticed this in every single generator I have used. Am I asking for things the wrong way or is the AI doing whatever it wants and not paying attention to my actual request?

Thanks

you are viewing a single comment's thread
view the rest of the comments

[–] rickdg@lemmy.world 12 points 10 months ago (1 children)

Can you give an example of a complete prompt? Are you using Dall-E, Midjourney, Stable Diffusion…?

It seems that all models need to have prompts crafted specifically for them and you need to follow-up with corrections. The follow-up is critical for pretty much anything these LMMs output.

[–] Ragdoll_X@lemmy.world 4 points 10 months ago* (last edited 10 months ago) (1 children)

Image-to-image also helps a lot with SD. Even some roughly-drawn blobs can be the difference between the image almost matching what you had in mind vs. looking exactly how you intended.

[–] BlueEther@no.lastname.nz 1 points 10 months ago

I just cant get img2img on SD to work for me to get images that are what I want(A1111 front end)