Heh, I figured I should open a thread on this, though it's likely been discussed elsewhere already.
Does anyone know how difficult it would be to implement DeepFloyd support in InvokeAI, either currently or with the upcoming Nodes setup? I'm not really in the loop on how different it is from a typical Stable Diffusion setup, but I have seen the early results from it and it's definitely a remarkable advancement.
I'd be curious what the prospects are for having it supported in Invoke. 🙂
Here's a good video covering just what DF has to offer!
DeepFloyd IF is a state-of-the-art text-to-image model that can generate high-quality images from text prompts. It was introduced by Stability AI and its multimodal AI research lab DeepFloyd. The model consists of a frozen text encoder based on the T5 transformer and three cascaded pixel diffusion modules: a base model that generates 64x64 px...
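
For anyone trying to picture how that differs from a single-stage setup, here's a rough toy sketch of the data flow the description above implies: the prompt is encoded once by a frozen encoder, a base stage produces a 64x64 image, and further stages refine it in sequence. None of this is real DeepFloyd or InvokeAI code. Every name here (`encode_prompt`, `base_stage`, `make_upscaler`, `run_cascade`) is hypothetical, and the 4x upscale factor is just an assumption for illustration, since the quoted description cuts off before it gets to the later stages.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Image:
    """Placeholder for pixel data; only the dimensions are tracked."""
    width: int
    height: int


def encode_prompt(prompt: str) -> List[float]:
    # Hypothetical stand-in for the frozen T5 text encoder.
    # A real encoder returns learned embeddings; this just returns word lengths.
    return [float(len(word)) for word in prompt.split()]


def base_stage(embedding: List[float]) -> Image:
    # Stage 1: the base module generates a 64x64 image from the text embedding.
    return Image(64, 64)


def make_upscaler(factor: int) -> Callable[[Image, List[float]], Image]:
    # Later stages take the previous image plus the same text embedding.
    # The factor is an assumed value, not taken from the quoted description.
    def upscale(image: Image, embedding: List[float]) -> Image:
        return Image(image.width * factor, image.height * factor)
    return upscale


def run_cascade(
    prompt: str,
    upscalers: List[Callable[[Image, List[float]], Image]],
) -> Tuple[Image, List[Tuple[int, int]]]:
    embedding = encode_prompt(prompt)  # computed once, reused by every stage
    image = base_stage(embedding)
    sizes = [(image.width, image.height)]
    for upscale in upscalers:
        image = upscale(image, embedding)
        sizes.append((image.width, image.height))
    return image, sizes


if __name__ == "__main__":
    final, sizes = run_cascade("a corgi on a skateboard",
                               [make_upscaler(4), make_upscaler(4)])
    print(sizes)
```

The point of the sketch is just that each stage is its own model with its own inputs and outputs, which is probably why it seems like a natural fit for a node-based graph rather than a single pipeline call.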