Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Ever since DALL-E 3 completely eclipsed Midjourney in terms of Prompt ADHERENCE (albeit not quality), I've had very little reason to make use of it. However, in my testing of Flux Dev, I can gen images in roughly 15 seconds in Forge, throw those at a SDXL model such as Dreamshaper in the form of a controlnet and get the best of both worlds, high detail and adherent.


Dall-E 3 (intentionally) leans away from realism though but in doing so what it leans into is a very tacky and aesthetically naive although competently executed type of image. Gives every image the feeling that you're seeing a bootleg version of a genuine thing and therefore makes everything else it touches feel tacky.

Same feeling you get looking at the airbrushed art on a state fairground ride.


>Same feeling you get looking at the airbrushed art on a state fairground ride.

That's a great way to describe it. A lot of articles and youtube pics are using these images lately and they all give that sort of vibe.


In 15 seconds? Really? On my machine with good specs and a 4090 flux-dev takes around 400 seconds for 512x512! And flux-schnell at least half. Do you recommend a tutorial for optimization?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: