You can get much better results with Ideogram 2 (also free): https://ideogram.ai...

vunderba · on Sept 8, 2024

You can also get reasonably close with an open model that you can run locally (flux dev).

https://replicate.com/p/xm41nvz05drm00chsywb6am7f0

https://replicate.com/p/kdw8bnkj39rm40chsyzbyg5e04

But of course anyone who has even a passing familiarity with scrabble is going to be able to tell that something's off.

GaggiX · on Sept 8, 2024

The biggest problem with the default Flux model is that it generates images with that strong AI look, probably caused by the distillation of the CFG. You should try some LoRAs for this, and also prompt the model to generate the rack that holds the letters.

vunderba · on Sept 8, 2024

Good point. I have a comfyui setup for it but its super basic right now just the diffusion model / clip loader / vae. Another thing you've probably noticed is that 99% of images from Flux tend to have that classic narrow depth of field look. I've seen people occasionally be able to get around it with pretty amusing prompt tokens like "instagram photo, selfie, gopro, etc." though.

renewiltord · on Sept 8, 2024

Great tip. The text handling here is far superior.

layer8 · on Sept 8, 2024

The “1”s are still inconsistent, and of course the numbers are all wrong.

skybrian · on Sept 8, 2024

This seems like a good idea for a contest.