Text-to-Image AI That Can Actually Spell!? Meet DeepFloyd IF

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis - A podcast by Nathaniel Whittemore

Categories:

Tehnologie

If you've ever used Midjourney, Dall-E, Stable Diffusion or another text-to-image generator, you'll know that words are a weakness. Text (such as on signs) tends to be gibberish. DeepFloyd IF has started to solve that problem and it's doing it open source. Referenced in the video: https://twitter.com/DeepFloydIF https://twitter.com/EMostaque/status/1652295961404645376 https://stability.ai/blog/deepfloyd-if-text-to-image-model https://twitter.com/hardmaru/status/1651822596844048385 https://the-decoder.com/deepfloyd-if-is-a-crazy-good-text-to-image-model-and-open-source/ https://wandb.ai/geekyrakshit/deepfloyd/reports/A-Gentle-Introduction-to-DeepFloydAI-s-New-Diffusion-Model-IF--VmlldzozNTY3Nzc4 https://twitter.com/javilopen/status/1652387049268297729 https://huggingface.co/DeepFloyd https://twitter.com/DavidVorick/status/1652070967412129793 Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/

Visit the podcast's native language site