#🆕｜sd3 | Stable Diffusion | Page 105

errant dust Sep 18, 2024, 3:58 PM

#

It is unlikely if there is no basis for the claim. I am interested in sources. I am not saying it is false, but I'd like to know where this was stated.

sullen moss Sep 18, 2024, 4:06 PM

#

errant dust It is unlikely if there is no basis for the claim. I am interested in sources. I...

They probably picked those who stubbornly kept praising SD3 despite the terrible release. LOL

dusky thistle Sep 18, 2024, 4:09 PM

#

errant dust Sep 18, 2024, 4:11 PM

#

sullen moss They probably picked those who stubbornly kept praising SD3 despite the terrible...

That isn't my question. My question is where the rumor of a 3.5 version started. Who said this, or wrote this and where?

noble coyote Sep 18, 2024, 4:13 PM

#

Mix of Toy Camera, Blue Future and Wraith_BW all at Civitai (by andreac75)

mortal mesa Sep 18, 2024, 4:14 PM

#

supposedly Prem said things at AI conference San Francisco 2024, haven't actually seen any confirmation

errant dust Sep 18, 2024, 4:22 PM

#

ok, well, that is interesting if true. If they are rebuilding a larger or better version of SD3, it will be fascinating to see what they do

#

I mean, despite all the other SD projects, like Audio, 3D, video, etc., the image AI was always their central calling card

#

And if Flux has shown anything, it is that you can not only give a full monster 12B to the community, but still be able to market and monetize one of your own (AKA Grok 2)

noble coyote Sep 18, 2024, 4:28 PM

#

Flux1_dev_Q4_1_GGUF and LoRA (Wreath_BW)

#

noble coyote Sep 18, 2024, 4:31 PM

#

errant dust ok, well, that is interesting if true. If they are rebuilding a larger or better...

If the new SD3.5 cannot match up to Flux - then perhaps SAI ought to go and do something else? 😄

#

#

#

#

noble coyote Sep 18, 2024, 4:40 PM

#

errant dust And if Flux has shown anything, it is that you can not only give a full monster ...

https://www.reddit.com/r/StableDiffusion/comments/1fj3g0s/rumors_about_stability_ai_ceo_confirming_sd35_8b/

From the StableDiffusion community on Reddit


mortal mesa Sep 18, 2024, 4:41 PM

#

nice, trust me bro confirmation

cunning lintel Sep 18, 2024, 4:43 PM

#

https://twitter.com/Lykon4072/status/1834917552209744318

Lykon (@Lykon4072) on X

@linaqruf_ @oron1208 Wait for 8b. It's basically Flux without distillation and heavy hands dpo. This should make it easy to finetune (and dpo).
We're also trying a new scaling down mechanism for mmdit, the new 2b is gonna work much better.

turbid grotto Sep 18, 2024, 4:43 PM

#

noble coyote If the new SD3.5 cannot match up to Flux - then perhaps SAI ought to go and do s...

It won't, and it doesn't have to.

SD3.5L is 66.6% size of Flux
It won't be distilled and probably no heavy DPO
It should be easier to train and run
Better License
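
As a rough check of the size figure above, here is a minimal sketch. It assumes the parameter counts mentioned elsewhere in this thread (8B for SD3.5 Large, 12B for Flux); neither number is confirmed by an official spec here.

```python
# Parameter counts in billions, as quoted in the chat; treat them as approximate.
SD35_LARGE_B = 8.0
FLUX_B = 12.0

ratio = SD35_LARGE_B / FLUX_B
print(f"SD3.5 Large is about {ratio:.1%} the size of Flux")  # prints 66.7%
```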

cunning lintel Sep 18, 2024, 4:45 PM

#

Plenty of hints on Twitter that something is brewing, though Lykon seems more occupied with finding all the faults in Flux than promoting SD3 these days

mortal mesa Sep 18, 2024, 4:46 PM

#

barely scratching the surface in what Flux can do, SD3 also just you know too many bad pics not worth time

cunning lintel Sep 18, 2024, 4:47 PM

#

And to be fair, I've been hearing that something would be released for a loooong time. Seeing the weights is believing. But taking into account all the chaos SAI has been in, I understand that things moved slowly or not at all, and only started up again in the last month(s)

#

I feel the 8b on the API looks nicer style-wise (not all smoothed) than Flux, but it generates many more flaws and has much weaker prompt understanding. Still, being able to actually use it will be nice

turbid grotto Sep 18, 2024, 4:51 PM

#

I wish them luck and hope to finally move from SDXL gonnabegood

sacred jewel Sep 18, 2024, 5:48 PM

#

noble coyote https://www.reddit.com/r/StableDiffusion/comments/1fj3g0s/rumors_about_stability...

Wasn't that the 10th and 11th? Did it actually get "announced"?

noble coyote Sep 18, 2024, 5:49 PM

#

sacred jewel Wasn't that the 10th and 11th? Did it actually get "announced"?

Dunno - just saw this in passing ...

#

Prolly just hearsay, rumour, chit chat etc

noble coyote Sep 18, 2024, 5:59 PM

#

#

A good attempt at "getting the camera out of the photo!"

#

marble lion Sep 18, 2024, 6:11 PM

#

Anyone still using SD3?

noble coyote Sep 18, 2024, 6:12 PM

#

I use SD3 images to feed img2img into FluX 😄

#

#

#

#

#

#

sacred jewel Sep 18, 2024, 7:05 PM

#

sacred jewel Sep 18, 2024, 7:21 PM

#

hexed dirge Sep 18, 2024, 7:33 PM

#

#

any sd3 news?

bitter hearth Sep 18, 2024, 7:39 PM

#

Lykon
@Lykon4072
Wait for 8b. It's basically Flux without distillation and heavy hands dpo.
sounds perfect

#

those are the two big problems with flux, the distil and the baked-in look

turbid grotto Sep 18, 2024, 7:45 PM

#

marble lion Anyone still using SD3?

still waiting sadcat

turbid grotto Sep 18, 2024, 7:47 PM

#

bitter hearth those are the two big problems with flux, the distil and the baked-in look

yeeees!

#

But we will probably get noticeably worse hands than Flux without overbaking (just a guess)

bitter hearth Sep 18, 2024, 7:48 PM

#

yeah that's possible

#

could do handfix pass with flux though with impact pack

hexed dirge Sep 18, 2024, 7:49 PM

#

main problem is that at this point Flux has a huge support by the community

turbid grotto Sep 18, 2024, 7:49 PM

#

bitter hearth could do handfix pass with flux though with impact pack

Yea, it won't be a problem

bitter hearth Sep 18, 2024, 7:51 PM

#

not really
only a few flux-specific nodes and tools have come out

#

would actually say the opposite, that flux doesn't have much tooling yet

pseudo owl Sep 18, 2024, 7:51 PM

#

flux seems to have better prompt following, text rendering as well but sd3.5 isn't distilled so some advantages

basically the community will get one more amazing choice, win-win for us.

#

i'm pretty sure tho, by the time sd3.5 large is released, black forest labs might release the text to video model.

turbid grotto Sep 18, 2024, 7:53 PM

#

hexed dirge main problem is that at this point Flux has a huge support by the community

SD3.5L has its own advantages, and if Stability releases training scripts with the weights and maybe a couple of ControlNets, it is gonna be a good start

bitter hearth Sep 18, 2024, 7:54 PM

#

the best prompt following is Auraflow V2 I think

#

not the latest Auraflow V3

#

it got a bit worse in that version

hexed dirge Sep 18, 2024, 7:54 PM

#

turbid grotto SD3.5L has own advantages and if Stability release training scripts with weights...

Really I hope so. Let's see in a couple of weeks

turbid grotto Sep 18, 2024, 7:54 PM

#

btw, do we have a good finetune of Flux already? Last time I heard, it was losing adherence

pseudo owl Sep 18, 2024, 7:55 PM

#

finetunes are much better now, there were a few bugs. there are already hyper/turbo loras as well

hexed dirge Sep 18, 2024, 7:55 PM

#

turbid grotto btw, do we have a good finetune of Flux already? Last time I heard, it was loosi...

for Lora yes, for model I don't know

bitter hearth Sep 18, 2024, 7:56 PM

#

no big checkpoint yet

turbid grotto Sep 18, 2024, 7:56 PM

#

LoRAs seem to be very good now

hexed dirge Sep 18, 2024, 7:56 PM

#

turbid grotto LoRAs seem to be very good now

mine yes 😂

bitter hearth Sep 18, 2024, 7:57 PM

#

been having trouble with some Civit flux loras
some are very nice but some are very overfit
a couple break the image above a very low strength

#

I liked these ones

hexed dirge Sep 18, 2024, 7:58 PM

#

bitter hearth been having trouble with some Civit flux loras some are very nice but some are v...

well, because it turns out that LoRAs also work very well with a small network dim and few steps

#

BTW I just won the Civ training contest for flux

bitter hearth Sep 18, 2024, 7:59 PM

#

wow nice

turbid grotto Sep 18, 2024, 7:59 PM

#

Does anyone know what this is? It is by the creator of realvisxl and I don't quite understand what that name means. Did he figure out how to reduce Flux's size down to 1b parameters?
https://huggingface.co/SG161222/RealFlux_1.0b

SG161222/RealFlux_1.0b · Hugging Face

hexed dirge Sep 18, 2024, 8:00 PM

#

turbid grotto Does anyone know what is this? It is by the creator of realvisxl and I don't qui...

it seems model training

bitter hearth Sep 18, 2024, 8:00 PM

#

if you look on the realvis civit page for SDXL he talks about it

#

its not cooked yet

#

jugger team are also cooking

cunning lintel Sep 18, 2024, 8:01 PM

#

turbid grotto Does anyone know what is this? It is by the creator of realvisxl and I don't qui...

version 1.0b of RealFlux finetune

hexed dirge Sep 18, 2024, 8:01 PM

#

and that's what I'm saying.. If big teams moved to flux SD3.5 must be really good

#

because training a model has costs

cunning lintel Sep 18, 2024, 8:03 PM

#

They move to flux cause SAI is tone-deaf and no one knows WTF the StableDiffusion future brings, don't really think it's about preference, it's taking the only sota model out there, despite the shortcomings

frail shoal Sep 18, 2024, 8:03 PM

#

Where can I download sd3.5 ?

pseudo owl Sep 18, 2024, 8:03 PM

#

hexed dirge and that's what I'm saying.. If big teams moved to flux SD3.5 must be really goo...

it won't beat it but it will be an alternative, it's faster/smaller and has more knowledge. not as good as flux in many important things like prompt following/text rendering/humans but still a great alternative.

pseudo owl Sep 18, 2024, 8:04 PM

#

frail shoal Where can I download sd3.5 ?

not out yet, will probably take a long time(months)

hexed dirge Sep 18, 2024, 8:04 PM

#

pseudo owl it won't beat it but it will be an alternative, its faster/smaller and has more ...

I really hope so

frail shoal Sep 18, 2024, 8:04 PM

#

pseudo owl not out yet, will probably take a long time(months)

Why are people talking about it then ?

pseudo owl Sep 18, 2024, 8:04 PM

#

frail shoal Why are people talking about it then ?

Some people got early access of the api for testing.

turbid grotto Sep 18, 2024, 8:05 PM

#

turbid grotto Does anyone know what is this? It is by the creator of realvisxl and I don't qui...

Ah got it, thanks everyone!

turbid grotto Sep 18, 2024, 8:05 PM

#

hexed dirge BTW I just won the Civ training contest for flux

Congrats!

hexed dirge Sep 18, 2024, 8:06 PM

#

turbid grotto Congrats!

thanks

frail shoal Sep 18, 2024, 8:10 PM

#

pseudo owl Some people got early access of the api for testing.

Ah , not me 😭

errant dust Sep 18, 2024, 8:13 PM

#

frail shoal Where can I download sd3.5 ?

Sorry, but we cannot tell that to you

#

😛

bitter hearth Sep 18, 2024, 8:15 PM

#

new Pixart model is due soon

#

and they joined Nvidia so it might be great

pseudo owl Sep 18, 2024, 8:30 PM

#

yeah looking forward to that, do you have any links for it? i wanna check it out.

bitter hearth Sep 18, 2024, 8:32 PM

#

no news yet

#

there was some big news today cos Fal.ai got funded https://blog.fal.ai/generative-media-needs-speed-fal-has-raised-23m-to-accelerate/

#

someone from Black Forest invested lol

pseudo owl Sep 18, 2024, 8:37 PM

#

bitter hearth no news yet

I saw the reddit post, they are thinking about video gen as well, which is pretty cool!
https://www.reddit.com/r/StableDiffusion/comments/1dm1kpv/pixart_team_joins_nvidia/

From the StableDiffusion community on Reddit: Pixart team joins Nvidia


bitter hearth Sep 18, 2024, 8:43 PM

#

there's also the lumina team

#

they released an LLM recently that can make images

#

so maybe after this they will make a diffusion model again

pseudo owl Sep 18, 2024, 8:53 PM

#

bitter hearth they released an LLM recently that can make images

yeah the aesthetic was good but kinda took lot of time and imo not worth it.

This seems like a very promising llm image gen method (model not released yet, but will be soon): https://github.com/VectorSpaceLab/OmniGen
It's very impressive imo, it's very small, just a measly 3.8b params, and has no text encoder, but supposedly performs as well as sd3 large in t2i.

The most impressive thing is that it can do reasoning, editing, step by step images, deblurring, and everything controlnets can do in just 3.8b params.

GitHub

GitHub - VectorSpaceLab/OmniGen


bitter hearth Sep 18, 2024, 8:54 PM

#

wow yeah looks good

alpine summit Sep 18, 2024, 11:58 PM

#

dusky thistle Sep 19, 2024, 12:21 AM

#

bitter hearth Lykon @Lykon4072 Wait for 8b. It's basically Flux without distillation and h...

So they're saying now that they will actually release 8b?

#

Id be thrilled if they did that

#

2b is... We all know... 8b is pretty fn good though

alpine summit Sep 19, 2024, 12:32 AM

#

sacred jewel Sep 19, 2024, 12:43 AM

#

alpine summit Sep 19, 2024, 12:58 AM

#

sacred jewel Sep 19, 2024, 1:03 AM

#

bitter hearth I liked these ones

ARS Midjourney LoRA mixed with a little sci_fi_future LoRA

sacred jewel Sep 19, 2024, 1:30 AM

#

sacred jewel Sep 19, 2024, 2:14 AM

#

hearty fossil Sep 19, 2024, 2:49 AM

#

#📝｜prompting-help

alpine summit Sep 19, 2024, 4:25 AM

#

alpine summit Sep 19, 2024, 4:40 AM

#

#

icy drift Sep 19, 2024, 8:30 AM

#

pseudo owl yeah the aesthetic was good but kinda took lot of time and imo not worth it. Th...

Yep just read through their paper and that looks 100% legit and 🔥 .
The model shouldn't be super capable at that size, but if the architecture works (and this is an absolutely awesome-looking architecture), you should be able to initialize from Pixtral and get something similar with SotA results.

#

Open source image-to-video from CogVideoX just dropped!!! https://huggingface.co/THUDM/CogVideoX-5b-I2V
(They had text-to-video public weights, and they had image-to-video private weights demoed in HF space for a while, but the image-to-video weights are now open. Downloading and testing now.)

THUDM/CogVideoX-5b-I2V · Hugging Face

#

#

Steps are happening! No errors so far...

noble coyote Sep 19, 2024, 8:49 AM

#

pseudo owl it won't beat it but it will be an alternative, its faster/smaller and has more ...

Flux allows use of living artists in your prompts (and training)

icy drift Sep 19, 2024, 8:50 AM

#

noble coyote Flux allows use of living artists in your prompts (and training)

Might end up being useful for upscaling or AnimateDiff or something IDK.

#

Image generated by Flux (on local PC), and video generated by CogVideoX also running on local PC! 🙂

#

I wouldn't call it good or useful by any means, but I'm going to try some other subjects and see what I get.

#

Why am I suddenly getting ridiculous times for Flux???

#

Oh it's because of the low resolution for CogVideoX images duh.

signal shuttle Sep 19, 2024, 9:10 AM

#

icy drift Sep 19, 2024, 9:13 AM

#

Over 8 minutes per video... Oh well. I'm rendering two tests now: a censored test and a spaghetti-eating test. I might leave it at that, or I might try some others if I think of anything I really want to know.

signal shuttle Sep 19, 2024, 9:15 AM

#

icy drift Image generated by Flux (on local PC), and video generated by CogVideoX also run...

Hey have you seen CogVideoX-Fun?

icy drift Sep 19, 2024, 9:15 AM

#

signal shuttle Hey have you seen CogVideoX-Fun?

No what's that?

signal shuttle Sep 19, 2024, 9:16 AM

#

icy drift No what's that?

It's a modification of CogVideoX https://github.com/aigc-apps/CogVideoX-Fun

GitHub

GitHub - aigc-apps/CogVideoX-Fun: 📹 A more flexible CogVideoX that ...

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

#

It's more flexible

icy drift Sep 19, 2024, 9:17 AM

#

signal shuttle Its a modification of CogVideoX https://github.com/aigc-apps/CogVideoX-Fun

I bet this model can be dropped in. I will try this next.

sullen moss Sep 19, 2024, 9:20 AM

#

signal shuttle Its a modification of CogVideoX https://github.com/aigc-apps/CogVideoX-Fun

ImgToImg work?

signal shuttle Sep 19, 2024, 9:21 AM

#

sullen moss ImgToImg work?

It's based on CogVideoX, so if IMG2IMG works on CogVideoX then CogVideoX-Fun should be able to do IMG2IMG as well

sullen moss Sep 19, 2024, 9:22 AM

#

Oh. I mean ImToVid

signal shuttle Sep 19, 2024, 9:23 AM

#

sullen moss Oh. I mean ImToVid

It can do Img2vid and vid2vid

bitter hearth Sep 19, 2024, 9:24 AM

#

sacred jewel ARS Midjourney LoRA mixed with a little sci_fi_future LoRA

wow yeah this is a really great sci-fi building
the shape is so complex

bitter hearth Sep 19, 2024, 9:26 AM

#

signal shuttle Its a modification of CogVideoX https://github.com/aigc-apps/CogVideoX-Fun

1024x1024x49 sounds great

signal shuttle Sep 19, 2024, 9:27 AM

#

signal shuttle Its a modification of CogVideoX https://github.com/aigc-apps/CogVideoX-Fun

The finetune was made by Alibaba PAI and open sourced by them

icy drift Sep 19, 2024, 9:29 AM

#

My (very lite) nsfw test worked fine.

#

Switching from CLI to comfy now.

bitter hearth Sep 19, 2024, 9:30 AM

#

the original was 720 x 480 so going to 1024 x 1024 is big
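
To put that jump in numbers, here is a minimal sketch comparing pixels per frame at the two resolutions quoted above. It only counts pixels. Actual render time and memory depend on the model and sampler and may not scale linearly with this figure.

```python
def pixels_per_frame(width: int, height: int) -> int:
    """Number of pixels in a single video frame."""
    return width * height


original = pixels_per_frame(720, 480)    # resolution quoted for the original CogVideoX
larger = pixels_per_frame(1024, 1024)    # resolution quoted for CogVideoX-Fun
ratio = larger / original

print(f"{original} -> {larger} pixels per frame ({ratio:.2f}x)")
```

So each frame carries roughly three times as many pixels at 1024 x 1024.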

bitter hearth Sep 19, 2024, 9:31 AM

#

icy drift My (very lite) nsfw test worked fine.

ye that worked ok

icy drift Sep 19, 2024, 9:33 AM

#

It needs deepspeed and 15 isn't compatible with windows. Will try manual install.

#

Success. Downloading models.

noble coyote Sep 19, 2024, 9:52 AM

#

Flux1.Dev + LoRAs

#

Flux1.Dev + LoRAs

#

icy drift Sep 19, 2024, 10:05 AM

#

Steps are happening in Comfy.

#

Urgh.

noble coyote Sep 19, 2024, 10:11 AM

#

icy drift Urgh.

Can you free-up space by placing your Models Folder on a separate drive at all?

icy drift Sep 19, 2024, 10:13 AM

#

noble coyote Can you free-up space by placing your Models Folder on a separate drive at all?

I can clear out HF's cache. And I can delete some obsolete stuff accumulated in my AI folder, like auraflow and infinigen.

#

Wow ComfyUI is WAAAYYYYY slower than the CLI! I have almost finished my breakfast and it's still rendering.

#

Maybe the resolution is different. I guess I'll find out when/if it ever finishes.

noble coyote Sep 19, 2024, 10:25 AM

#

Video rendering needs meadow-sized RAM 😄

icy drift Sep 19, 2024, 10:26 AM

#

noble coyote Video rendering needs meadow-sized RAM 😄

Just wondering why Comfy is slower than using Python in the console. It's the same model more or less.

#

Console was 8-9 minutes. Comfy is already at 20 minutes and the GPU is still pegged at 100%.

noble coyote Sep 19, 2024, 10:27 AM

#

bitter hearth Sep 19, 2024, 10:27 AM

#

maybe node isn't quite done right

noble coyote Sep 19, 2024, 10:27 AM

#

I'm no expert - I am an artist with a soupcon of technical know-how! 🙂

icy drift Sep 19, 2024, 10:27 AM

#

bitter hearth maybe node isn't quite done right

I think I accidentally used a higher resolution. 480 vertical pixels for the console and 768 vertical pixels for Comfy. I thought the 768 was referring to the width.

bitter hearth Sep 19, 2024, 10:28 AM

#

the Ali one can go up to 1024x1024 though

noble coyote Sep 19, 2024, 10:28 AM

#

icy drift Sep 19, 2024, 10:28 AM

#

bitter hearth the Ali one can go up to 1024x1024 though

I might queue a test for that before I go to work, but at this rate I'm going to have to cancel the current render.

sacred jewel Sep 19, 2024, 10:36 AM

#

bitter hearth wow yeah this is a really great sci-fi building the shape is so complex

Thanks. I felt the same way ... Have to see if it was mainly flux or the Lora doing the heavy lifting 🤭

icy drift Sep 19, 2024, 10:37 AM

#

I have to kill the Comfy render after 32 minutes running... 😕

#

I need to generate base images and queue up some renders before I leave for work. Oh well. I'll try a lower resolution and hopefully I'll have some results when I get home.

alpine summit Sep 19, 2024, 10:47 AM

#

icy drift Sep 19, 2024, 10:56 AM

#

Leaving a few tests running: Volcano erupting, spaceship flying, superhero, swimmer, running, dancer. No idea how many will get done, but I'm out of time.

bitter hearth Sep 19, 2024, 10:57 AM

#

would be interested in how Cog handles spaceships

#

got a lot of spaceship images

alpine summit Sep 19, 2024, 11:01 AM

#

noble coyote Sep 19, 2024, 11:08 AM

#

warm sonnet Sep 19, 2024, 11:17 AM

#

noble coyote

Your image looks great! Could you share your workflow?

noble coyote Sep 19, 2024, 11:23 AM

#

warm sonnet Your image looks great! Could you share your workflow?

The w/f is in the metadata of the PNG - click on it - then open-in-browser - right-click and d/load
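
For anyone who would rather pull the workflow out of a downloaded PNG with a script, here is a rough sketch using only the Python standard library. It assumes the workflow JSON is stored in a PNG `tEXt` chunk under the keyword `workflow`; that key name is an assumption about how the image was saved, and the function names are hypothetical. Compressed or international text chunks (`zTXt`, `iTXt`) are not handled.

```python
import json
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return all tEXt chunks of a PNG as a {keyword: text} dict."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, data, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length  # skip length + type + data + CRC (CRC is not verified)
    return chunks


def load_workflow(path: str, key: str = "workflow"):
    """Load the embedded workflow JSON, or None if the key is absent."""
    with open(path, "rb") as f:
        chunks = read_png_text_chunks(f.read())
    return json.loads(chunks[key]) if key in chunks else None
```

Calling `load_workflow("image.png")` on the downloaded original would return the parsed JSON if the chunk is present. A re-encoded preview image may have lost its metadata.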

#

bitter hearth Sep 19, 2024, 11:31 AM

#

I think this is how the model tells you that a setting is wrong

#

wtf

noble coyote Sep 19, 2024, 11:34 AM

#

Keyboardicality
Keyboarditis

#

Flux1.Dev + LoRAs

#

warm sonnet Sep 19, 2024, 12:21 PM

#

noble coyote

Got it, Thank you very much!

alpine summit Sep 19, 2024, 1:26 PM

#

Lol

steep widget Sep 19, 2024, 3:39 PM

#

https://www.youtube.com/shorts/W6xmpX-fdmg

YouTube

uisato

Audioreactive Video Playhead + Realtime MIDI Control - #touchdesign...

Audioreactive Video Playhead system, now with real-time MIDI control + 21GB of new timelapses, and SD configurations.

LK + UBridge + Smartphone → TDAbleton → TouchDesigner

You can access these project files, plus many more systems, tutorials, and experiments, through: https://linktr.ee/uisato


hallow lion Sep 19, 2024, 3:40 PM

#

sacred jewel

Nerdy Rodent, that u?

cursive frigate Sep 19, 2024, 3:59 PM

#

Does anyone here use JoyCaption?

I have been using it a lot as a local install lately to get descriptions for images with no prompt info for img2img.

I think it works great. It's in alpha and it blows my mind how good it is, but I was curious to see how it would work with different models.

Does anyone know how I could change the model, or if that is even possible with this type of thing? I think they are using a quantized VLM and not a typical LLM.

If anyone has any ideas or opinions please let me know. Or if you also use it let me know how your experience has been with it.

I will post the model folder structure below maybe that will help:

noble coyote Sep 19, 2024, 4:11 PM

#

Images made using Flux + LoRAs into Ollama img2img

cursive frigate Sep 19, 2024, 4:18 PM

#

Is there any way to feed more guidance into that setup? I like it, but I would like to do something more like this, and it never seems to work with that IF image to Prompt node:

noble coyote Sep 19, 2024, 4:19 PM

#

cursive frigate Is there any way to feed more guidance into that setup. I like it but I would li...

Ollama, Florence2 and Jan.ai are my go-to prompt generators. I couldn't get JoyCaption to work!!!

cursive frigate Sep 19, 2024, 4:22 PM

#

You can try my workflow if you want. It has JoyCaption, maybe it will work for you. You may want to or need to make some changes to it. It is the first workflow I have ever made from an empty workspace. Also if it does work I would like to know how I can improve the workflow. I don't understand how all this works as well as some of you.

#

noble coyote Sep 19, 2024, 4:33 PM

#

cursive frigate You can try my workflow if you want. It has JoyCaption, maybe it will work for y...

NYJY Nodes won't load 😦

cursive frigate Sep 19, 2024, 4:35 PM

#

I'm trying to find their github page. I remember I had to do something specific. I also had to translate most of the page to english, lol. but then it worked.

#

I needed all 3 of these:

noble coyote Sep 19, 2024, 4:40 PM

#

That last one doesn't like NYJY won't load

cursive frigate Sep 19, 2024, 4:40 PM

#

also this was translated from their github page. I dont remember if I had to install pytrans myself or if one of those node installs did it automatically.

I think it also downloads the model on its own the first time so it may take a while:

#

Maybe you have to install pytrans

#

This is on the NYJY Page

noble coyote Sep 19, 2024, 4:50 PM

#

#

Pygtrans is already installed on my system 🙂

cursive frigate Sep 19, 2024, 4:54 PM

#

I wish I could help you to get it working. It is pretty cool.

noble coyote Sep 19, 2024, 4:54 PM

#

CXH-JoyCaption also does not load 😦

#

Oh well, it's time to disable a whole bunch of nodes I guess until it starts to work ...

cursive frigate Sep 19, 2024, 4:56 PM

#

lol I do that too. It's a tedious process.

noble coyote Sep 19, 2024, 5:00 PM

#

toxic bone Sep 19, 2024, 5:12 PM

#

i don't think comfyui makes a great image tagging gui and people creating workflows for that are kind of wasting their time.

imo.

taggui among others exist. joy caption looks like a neat model but i've not seen how it's any different from WD tagger. I think it's trained on porn better. Hence the "inclusive" part of the description.

cursive frigate Sep 19, 2024, 5:15 PM

#

what is taggui?

toxic bone Sep 19, 2024, 5:16 PM

#

https://github.com/jhc13/taggui simple gui for managing captions in a folder

noble coyote Sep 19, 2024, 5:23 PM

#

van Gogh-y type stuff - Ollama img2img, with Flux output

cursive frigate Sep 19, 2024, 5:24 PM

#

Those look awesome.

cursive frigate Sep 19, 2024, 5:26 PM

#

toxic bone https://github.com/jhc13/taggui simple gui for managing captions in a folder

Thank you. I bet this would work pretty good for tagging non ai art images too. Like say product images for Etsy or any ecommerce store. 🙂

toxic bone Sep 19, 2024, 5:26 PM

#

love the whale tail in the shore. surrealism and van gogh? yes please

toxic bone Sep 19, 2024, 5:27 PM

#

cursive frigate Thank you. I bet this would work pretty good for tagging non ai art images too. ...

i use it for auto captioning, then cleaning those all up and manually tagging

#

has a bunch of models. i don't think joytag is in it yet. has blip2, wd tagger, florence 2, and a few others.

#

i'm going to start experimenting with the other models a bit. you can prompt some of them and instruct them on how to describe the image, so if there are particular tag styles you prefer, that would help

#

the UI is suited more to tags. not as good for long natural language tags, but it still manages

noble coyote Sep 19, 2024, 5:31 PM

#

toxic bone i use it for auto captioning, then cleaning those all up and manually tagging

Once you upload any image (except porn/gore) to fineartamerica, it tags and describes it for you

toxic bone Sep 19, 2024, 5:32 PM

#

noble coyote Once you upload any image (except porn/gore) to fineartamerica, it tags and desc...

yeah there are many ways to do things. i prefer a ui that streamlines it all since i manually caption hundreds of images in a session

cursive frigate Sep 19, 2024, 5:33 PM

#

I wish there was like a text merge node. Where you could take a node that you put Specific LoRA keywords into, then you take that AI output text and combine them leaving you with the AI image description and at the end or bottom you have your LoRA trigger words. Cause getting AI to include LoRA triggers without changing them is like impossible.

toxic bone Sep 19, 2024, 5:34 PM

#

pretty sure there are string concatting nodes

#

programmers don't use natural language. something intuitive like "text merging" is called "string concatenation"
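
Outside of ComfyUI, the same idea is a few lines of Python. This is only a sketch of the "merge" step described above: it appends fixed LoRA trigger words to an AI-generated caption without letting the caption model rewrite them. The function name and the example trigger words are made up for illustration.

```python
def append_triggers(caption: str, triggers: list, sep: str = ", ") -> str:
    """Join a generated caption with LoRA trigger words, unchanged, at the end."""
    parts = [caption.strip()] + [t.strip() for t in triggers]
    # Drop empty pieces so a blank caption or trigger does not leave stray separators.
    return sep.join(p for p in parts if p)


prompt = append_triggers(
    "a frog playing a harp in the air",
    ["batwing_style", "ohwx frog"],  # hypothetical trigger words
)
print(prompt)  # a frog playing a harp in the air, batwing_style, ohwx frog
```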

noble coyote Sep 19, 2024, 5:34 PM

#

WAS Nodes has concat

tired fiber Sep 19, 2024, 5:34 PM

#

Algae

noble coyote Sep 19, 2024, 5:35 PM

#

I have a ClownsShark BatWing Cascade w/f which uses concat ...

#

cursive frigate Sep 19, 2024, 5:35 PM

#

Can you please put the exact node name in the chat so I can search it in the node add section?

toxic bone Sep 19, 2024, 5:36 PM

#

@noble coyote recommended this one. i've used it before was-node-suite-comfyui

noble coyote Sep 19, 2024, 5:36 PM

#

cursive frigate Can you please put the exact node name in the chat so I can search it in the nod...

https://github.com/WASasquatch/was-node-suite-comfyui

GitHub

GitHub - WASasquatch/was-node-suite-comfyui: An extensive node suit...

An extensive node suite for ComfyUI with over 210 new nodes - WASasquatch/was-node-suite-comfyui

#

Vincent!!!

cursive frigate Sep 19, 2024, 5:38 PM

#

That is not Vincent.... He still has both ears. lmao

#

Those are really good

noble coyote Sep 19, 2024, 5:39 PM

#

(He reincarnated and bought a 3D Printer and made another ear!!!) 😄

toxic bone Sep 19, 2024, 5:39 PM

#

also didn't catch syphilis so now he's calm and cool instead of manic and schizo

cursive frigate Sep 19, 2024, 5:41 PM

#

lol

noble coyote Sep 19, 2024, 5:44 PM

#

cursive frigate Sep 19, 2024, 5:55 PM

#

Does anyone know if there is a way to get Ollama to list the models and allow one to be selected instead of having to type in the exact model? I have several and it's hard to keep up with the exact names.

noble coyote Sep 19, 2024, 6:03 PM

#

My Ollama checkpoint loader has all the models to select. No need to type

#

mortal mesa Sep 19, 2024, 6:30 PM

#

cursive frigate Does anyone know if there is a way to get Ollama to list the models and allow on...

ya if you're talking about in ComfyUI there are nodes that do that, the one you are using might not
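
If the node in use has no dropdown, the exact names can also be read straight from a local Ollama server. A minimal sketch, assuming Ollama is running at its default address `http://localhost:11434` and that its `/api/tags` endpoint returns a JSON object with a `models` list; the helper names are hypothetical.

```python
import json
import urllib.request


def parse_model_names(payload: dict) -> list:
    """Extract sorted model names from an /api/tags style response."""
    return sorted(m["name"] for m in payload.get("models", []))


def list_ollama_models(base_url: str = "http://localhost:11434") -> list:
    """Ask a running Ollama server which models are installed."""
    with urllib.request.urlopen(f"{base_url}/api/tags") as resp:
        return parse_model_names(json.load(resp))
```

Calling `list_ollama_models()` with the server running returns the exact strings to paste into a node. From a terminal, `ollama list` shows the same names.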

toxic bone Sep 19, 2024, 6:46 PM

#

https://github.com/AIrjen/OneButtonPrompt/pull/224 oh cool this works in the new forge again

GitHub

Fix UI for Gradio 4 update by zappityzap · Pull Request #224 · AIrj...

Refactor the two event listeners for OBP_preset.change() into a single listener.
Check gr.version to run the correct code for Gradio 3 and Gradio 4.

Seems to work in Automatic1111 and Forge wi...

#

that's a really interesting extension, since the same codebase is the comfyui node too

sacred jewel Sep 19, 2024, 7:44 PM

#

hallow lion Nerdy Rodent, that u?

Hahhaha

sacred jewel Sep 19, 2024, 7:44 PM

#

noble coyote Images made using Flux + LoRAs into Ollama img2img

I have two Frenchies so i have a soft spot for images of Frenchies 🥰🥰🥰

noble coyote Sep 19, 2024, 7:45 PM

#

sacred jewel I have two Frenchies so i have a soft spot for images of Frenchies 🥰🥰🥰

I will let you have the prompt; and will make sure to do some more 🙂

#

Actually, d/load the images yourself, as they contain the workflow

sacred jewel Sep 19, 2024, 7:58 PM

#

noble coyote Actually, d/load the images yourself, as they contain the workflow

Thank you, kind sir 👌👊

icy drift Sep 19, 2024, 8:02 PM

#

sacred jewel Sep 19, 2024, 9:10 PM

#

cursive frigate Sep 19, 2024, 9:31 PM

#

Does anyone know if there is a way to get an llm to offload itself after creating the prompt, so the resources can be used for the rest of the process like upscaling, facefix, LoRAs, etc.?

Maybe a node with an unload model boolean for on = true or false

Placed after the prompt is generated. Any ideas are welcome?
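
One possible route, if the LLM is served by Ollama: its HTTP API accepts a `keep_alive` field on `/api/generate`, and a value of `0` asks the server to unload the model as soon as the response is returned. The sketch below is only an illustration under that assumption; the function names and the default address are hypothetical, and whether a particular ComfyUI node exposes this setting depends on the node.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumption: default local Ollama address


def build_generate_payload(model: str, prompt: str, unload_after: bool = True) -> dict:
    """Build the JSON body for POST /api/generate."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    if unload_after:
        # keep_alive = 0 asks Ollama to free the model right after answering
        payload["keep_alive"] = 0
    return payload


def generate_prompt(model: str, prompt: str, unload_after: bool = True,
                    base_url: str = OLLAMA_URL) -> str:
    """Send the request and return the generated text."""
    body = json.dumps(build_generate_payload(model, prompt, unload_after)).encode("utf-8")
    req = urllib.request.Request(
        f"{base_url}/api/generate",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=300) as resp:
        return json.loads(resp.read())["response"]
```

With `unload_after=True` the VRAM should be available again before the image pipeline starts, at the cost of reloading the model on the next prompt.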

pseudo owl Sep 19, 2024, 10:05 PM

#

Yikes I am testing dalle3 and images really do look considerably worse than flux. Pretty good human anatomy though.

sacred jewel Sep 19, 2024, 10:18 PM

#

toxic bone Sep 19, 2024, 10:57 PM

#

👌👊 together makes me think of that game where you see someone doing that below their hips and they get to punch you twice for looking. UNLESS ||without looking you know they're doing it and poke your finger through their ring, thus granting you right to punch them twice||

sullen moss Sep 19, 2024, 11:33 PM

#

pseudo owl Yikes I am testing dalle3 and images really do look considerably worse than flux...

You're six months late...

bitter hearth Sep 19, 2024, 11:34 PM

#

anyone got some experience with flux hyperparams for character lora

#

i got 30 images, complex character, network dim 16 not enough to capture fur patterns

#

wanted to do LR 0.00025 with cos restarts and adam and just do lots of steps

#

but apparently people use way less steps and get good results
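
For reference, a "cosine with restarts" schedule can be sketched as below. This is a generic illustration, not the exact scheduler any particular trainer uses: the base LR of 0.00025 comes from the message above, while `cycle_steps=500` and `min_lr=0.0` are placeholder assumptions.

```python
import math


def cosine_with_restarts(step: int, base_lr: float = 2.5e-4,
                         min_lr: float = 0.0, cycle_steps: int = 500) -> float:
    """Learning rate at `step`: cosine decay from base_lr to min_lr,
    jumping back to base_lr at the start of every cycle."""
    position = (step % cycle_steps) / cycle_steps  # 0.0 at cycle start, approaches 1.0
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * position))
```

Each restart kicks the LR back up, so a long run behaves like several shorter runs chained together, which is one reason fewer total steps can still give usable results.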

sacred jewel Sep 20, 2024, 12:57 AM

#

#

#

#

#

#

steel beacon Sep 20, 2024, 1:15 AM

#

lapis plaza Sep 20, 2024, 1:54 AM

#

a happy batwinged frog playing a harp in the air

sacred jewel Sep 20, 2024, 2:00 AM

#

sacred jewel Sep 20, 2024, 2:17 AM

#

sacred jewel Sep 20, 2024, 2:35 AM

#

#

cursive frigate Sep 20, 2024, 2:37 AM

#

@noble coyote I like your setup but it keeps giving me this response.

#

sacred jewel Sep 20, 2024, 2:41 AM

#

sacred jewel Sep 20, 2024, 2:41 AM

#

cursive frigate @noble coyote I like your setup but it keeps giving me this response.

you need LLAVA not LLAMA

#

(the vision models are mostly LLAVA based or Florence from Microsoft.)

Llama is a text llm

cursive frigate Sep 20, 2024, 2:42 AM

#

ohhh

#

do you have one you would recommend that is similar to the one I have?

#

do they have abliterated models? I guess that is how they categorize uncensored.

sacred jewel Sep 20, 2024, 2:47 AM

#

I haven't seen any vision models specifically for uncensored image descriptions.

cursive frigate Sep 20, 2024, 2:49 AM

#

Let's dig here:

JoyCaption uses:

Both of these for this image description node

cursive frigate Sep 20, 2024, 3:36 AM

#

Does anyone get this when running Img2Img
WARNING: IFImagePrompt.IS_CHANGED() got an unexpected keyword argument 'image_prompt'
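
A warning of this shape is what Python raises when a method is called with a keyword argument it does not declare. The sketch below reproduces the pattern with two hypothetical classes (they are not the real `IFImagePrompt` code), assuming the host passes every node input to `IS_CHANGED` as a keyword argument:

```python
class StrictNode:
    """Hypothetical node whose IS_CHANGED declares only one input."""

    @classmethod
    def IS_CHANGED(cls, prompt):
        return prompt


class TolerantNode:
    """Same idea, but extra inputs are absorbed by **kwargs."""

    @classmethod
    def IS_CHANGED(cls, prompt=None, **kwargs):
        return prompt


def call_is_changed(node_cls, inputs: dict):
    """Mimic a host that forwards all node inputs as keyword arguments."""
    try:
        return node_cls.IS_CHANGED(**inputs)
    except TypeError as exc:
        return f"WARNING: {exc}"
```

If that is the cause here, the fix belongs in the node's code (or an updated version of the node pack) rather than in the workflow.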

vast condor Sep 20, 2024, 4:21 AM

#

sacred jewel I haven't seen any vision models specifically for uncensored image descriptions.

it can be done though

alpine summit Sep 20, 2024, 4:23 AM

#

#

craggy crest Sep 20, 2024, 5:05 AM

#

sacred jewel

wrong variety of tomato

toxic bone Sep 20, 2024, 5:06 AM

#

trying out this new schedule free support in flux

#

meta research's new big training thing

#

https://github.com/kohya-ss/sd-scripts/pull/1250

#

https://github.com/kohya-ss/sd-scripts/pull/1600 meant this one

GitHub

New ScheduleFree support for Flux by sdbds · Pull Request #1600 · k...

Old version Details

[ Support new optimizer Schedule free #1250 ]

Whats news?
Because testing found that Flux training works well using AdamWScheduleFree.

1、Add version in requirements.txt and a...

toxic bone Sep 20, 2024, 5:50 AM

#

whoa. it's fast.

noble coyote Sep 20, 2024, 8:36 AM

#

cursive frigate @noble coyote I like your setup but it keeps giving me this response.

I find that my setup "gets cold feet!" I ask it to make the images as if van Gogh had painted them ...
It comes back with a lame excuse saying it cannot do that, or I should modify my input etc.
I let it run a few turns, and then lo and behold! The van Gogh begins to show up.
I've had the same problems with DallE-3 - lame excuses, and after a few goes it complies.
I've always said "computers and software are like cricketers: they sometimes drop the ball!"
My answer to you: p e r s e v e r e 🙂

noble coyote Sep 20, 2024, 8:37 AM

#

sacred jewel you need LLAVA not LLAMA

I've had results with llama, llava, zephyr, qwen ...

noble coyote Sep 20, 2024, 8:39 AM

#

cursive frigate ohhh

Qwen2.5, Zephyr, Llama3, Llava2 all work for me
I'd use Claude Sonnet but I don't want to pay!

alpine summit Sep 20, 2024, 9:08 AM

#

noble coyote Sep 20, 2024, 9:30 AM

#

img2img using #Ollama and #Flux in ComfyUI

alpine summit Sep 20, 2024, 11:21 AM

#

#

alpine summit Sep 20, 2024, 11:42 AM

#

sacred jewel Sep 20, 2024, 12:29 PM

#

#

alpine summit Sep 20, 2024, 12:37 PM

#

alpine summit Sep 20, 2024, 12:59 PM

#

bitter hearth Sep 20, 2024, 1:11 PM

#

SDXL

#

sacred jewel Sep 20, 2024, 2:13 PM

#

#

alpine summit Sep 20, 2024, 2:25 PM

#

sacred jewel Sep 20, 2024, 2:28 PM

#

noble coyote Sep 20, 2024, 2:30 PM

#

img2img Ollama and Flux plus LoRAs

brittle pollen Sep 20, 2024, 2:40 PM

#

img2img Ollama and Flux plus LoRAs

bitter hearth Sep 20, 2024, 2:44 PM

#

brittle pollen img2img Ollama and Flux plus LoRAs

there's not a bot

sacred jewel Sep 20, 2024, 2:53 PM

#

noble coyote Sep 20, 2024, 2:55 PM

#

"Ollamba!!!" 😄

sacred jewel Sep 20, 2024, 2:55 PM

#

sacred jewel Sep 20, 2024, 2:55 PM

#

noble coyote "Ollamba!!!" 😄

i read that as "Ollambada" like the dance 😛

noble coyote Sep 20, 2024, 2:58 PM

#

https://www.youtube.com/watch?v=iyLdoQGBchQ

YouTube

Club Music 80

Kaoma - Lambada (Official Video) 1989 HD

Kaoma - The Lambada (also known as Llorando se fue)
The full-screen HD official video of the worldwide #1 smash hit record from 1989


sacred jewel Sep 20, 2024, 2:59 PM

#

bitter hearth Sep 20, 2024, 3:01 PM

#

I like when it fakes small print

noble coyote Sep 20, 2024, 3:02 PM

#

Eve had a cunning plan: if she could do that with just one apple, what would happen if she had one hundred apples?!

#

Ollambada etc 😄

sacred jewel Sep 20, 2024, 3:08 PM

#

urban arch Sep 20, 2024, 3:50 PM

#

noble coyote Eve had a cunning plan: if she could do that with just one apple, what would hap...

I'd eat her apple.

noble coyote Sep 20, 2024, 3:53 PM

#

Har har!!! 😄

bitter hearth Sep 20, 2024, 3:55 PM

#

#

this discord has higher file size limits than others

#

oh it's cos it's level 2 server boost

dull star Sep 20, 2024, 4:14 PM

#

thank god for that 50mb

bitter hearth Sep 20, 2024, 4:16 PM

#

yeah I need it cos I never go below 4-6k any more

#

I'd rather wait for the slow generation times than go lower

noble coyote Sep 20, 2024, 5:02 PM

#

A pirate-themed Furby standing on comically tall peg legs, depicted in the style of an oil painting. The Furby is of normal size but standing on exaggerated peg legs, with stormy seas in the background. The mood is dramatic, with dark clouds and turbulent waters adding to the pirate atmosphere, while the Furby maintains its characteristic fuzzy, round appearance. The scene captures a sense of adventure and whimsy, blending the quirky appearance of the Furby with pirate aesthetics.

bitter hearth Sep 20, 2024, 5:05 PM

#

incredible prompt

#

gonna try that later

fleet meteor Sep 20, 2024, 5:35 PM

#

noble coyote A pirate-themed Furby standing on comically tall peg legs, depicted in the style...

noble coyote Sep 20, 2024, 5:41 PM

#

#🏞｜general-with-images message

#

Borrowed from DallE Theme of the Day

#

"Minimalist rugged oil painting in faded earthy blue hues, capturing delicate details in vast solid patches. A weary and anxious queen wearing a night robe standing near an open stained glass window in the high tower of a castle, holding a candle in a simple holder. View from out of the window. She gazes into the starlit night, as a distant search party of horse riders at the bottom of the castle run away on a dirt trail into the trees."

sacred jewel Sep 20, 2024, 6:07 PM

#

sacred jewel Sep 20, 2024, 6:35 PM

#

#

bitter hearth Sep 20, 2024, 6:48 PM

#

flux colours are so much nicer than SDXL

sacred jewel Sep 20, 2024, 7:46 PM

#

sacred jewel Sep 20, 2024, 8:12 PM

#

cursive frigate Sep 20, 2024, 8:56 PM

#

Does anyone know how models get on ollama.com?

I am really interested in trying this one but have no idea how to get it on ollama if it isnt on the website.

aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning

https://github.com/aimagelab/LLaVA-MORE

GitHub

GitHub - aimagelab/LLaVA-MORE: LLaVA-MORE: Enhancing Visual Instruc...

LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 - aimagelab/LLaVA-MORE

icy drift Sep 20, 2024, 9:05 PM

#

cursive frigate Does anyone know how models get on ollama.com? I am really interested in tryin...

That reminds me I've been meaning to check out Pixtral 12b. I wish I could run it in Comfy somehow...

pseudo owl Sep 20, 2024, 9:21 PM

#

@icy drift Pixtral 12b is very good for its size, but this is a better choice: https://huggingface.co/openbmb/MiniCPM-V-2_6

It's similar in terms of quality but is considerably smaller, faster, and supports video.

@cursive frigate That model isn't really that great. Although it uses llama 3.1, there are still far better alternatives. It only barely beats llava 1.5, which is kind of ancient compared to models today. The one I mentioned above is better, and ollama supports it from what I see.

cursive frigate Sep 20, 2024, 9:32 PM

#

pseudo owl @icy drift Pixtral 12b is very good for its size, but this is a bett...

Can you send me a link for this one from ollama, I am having trouble finding it. The filtering and search on ollama is not great.

MiniCPM-V-2_6

If this is the one you are recommending that is.

pseudo owl Sep 20, 2024, 9:33 PM

#

cursive frigate Can you send me a link for this one from ollama, I am having trouble finding it....

Yeah here: https://ollama.com/library/minicpm-v

cursive frigate Sep 20, 2024, 9:34 PM

#

pseudo owl Yeah here: <https://ollama.com/library/minicpm-v>

Thank you.

cursive frigate Sep 20, 2024, 10:46 PM

#

pseudo owl Yeah here: <https://ollama.com/library/minicpm-v>

That model works really well. thanks.. I have to go make dinner. I'll post some images here later.

sacred jewel Sep 20, 2024, 11:25 PM

#

sacred jewel Sep 20, 2024, 11:44 PM

#

Woodcut LoRA

#

sacred jewel Sep 21, 2024, 12:16 AM

#

pseudo owl Yeah here: <https://ollama.com/library/minicpm-v>

Nice model... thank you.

sacred jewel Sep 21, 2024, 12:33 AM

#

hallow lion Sep 21, 2024, 12:47 AM

#

I love the R2D2s everywhere.

craggy crest Sep 21, 2024, 1:41 AM

#

sacred jewel

interesting window glass she's got her foot in

sacred jewel Sep 21, 2024, 2:19 AM

#

fleet meteor

sacred jewel Sep 21, 2024, 2:19 AM

#

craggy crest interesting window glass she's got her foot in

It's elastic transparent aluminum 😛 😛

#

Mix of Spandex, Aluminum and Lexan

craggy crest Sep 21, 2024, 2:21 AM

#

sacred jewel

too cute award!

dusky thistle Sep 21, 2024, 2:26 AM

#

hallow lion Sep 21, 2024, 2:46 AM

#

#

Becoming harder and harder to tell, but there are still a few weird things here and there

noble coyote Sep 21, 2024, 6:44 AM

#

ollama run minicpm-v:latest

noble coyote Sep 21, 2024, 7:01 AM

#

img2img Ollama Flux LoRAs

#

#

#

#

noble coyote Sep 21, 2024, 7:42 AM

#

https://www.reddit.com/r/ollama/comments/1f3g2ac/any_news_about_minicpm_26_coming_to_ollama/

From the ollama community on Reddit

Explore this post and more from the ollama community

#

My installation errors out - Error: llama runner process has terminated: GGML_ASSERT(new_clip->has_llava_projector) failed

#

#

#

#

noble coyote Sep 21, 2024, 8:28 AM

#

#

This Ollama does great prompt mashups! Surprising 😄

#

#

noble coyote Sep 21, 2024, 12:42 PM

#

#

bitter hearth Sep 21, 2024, 12:44 PM

#

#

and boom waow

dusky thistle Sep 21, 2024, 12:59 PM

#

noble coyote Sep 21, 2024, 1:13 PM

#

alpine summit Sep 21, 2024, 1:26 PM

#

fleet meteor Sep 21, 2024, 1:55 PM

#

noble coyote https://www.reddit.com/r/ollama/comments/1f3g2ac/any_news_about_minicpm_26_comin...

minicpm 2.6 works well on my ollama installation, which version have you installed?

#

noble coyote Sep 21, 2024, 1:58 PM

#

minicpm-v:latest

#

I will try your version

sacred jewel Sep 21, 2024, 2:02 PM

#

fleet meteor Sep 21, 2024, 2:12 PM

#

noble coyote I will try your version

"ollama run minicpm-v:8b-2.6-q8_0" , take into account that this uses around 10gb of vram

#

There are other versions that use a bit less vram
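
A rough way to compare quants before downloading is to estimate the size of the weights alone. The sketch below assumes about 8 billion parameters (from the `8b` tag) and the nominal bits per weight of the two GGUF formats named in this thread (8.5 for `q8_0`, 6.5625 for `q6_K`); it ignores the vision projector, context cache, and runtime overhead, so real VRAM use will be higher.

```python
# Nominal bits per weight for the two quant formats mentioned in the chat
BITS_PER_WEIGHT = {"q8_0": 8.5, "q6_K": 6.5625}


def weight_gigabytes(params_billion: float, quant: str) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    total_bits = params_billion * 1e9 * BITS_PER_WEIGHT[quant]
    return total_bits / 8 / 1e9
```

Under these assumptions an 8B model is about 8.5 GB at `q8_0` and about 6.6 GB at `q6_K`, which is in line with the "around 10gb" figure once overhead is added.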

sacred jewel Sep 21, 2024, 2:15 PM

#

noble coyote Sep 21, 2024, 2:15 PM

#

... just loading it into my Ollama w/f ...

#

Error: llama runner process has terminated: GGML_ASSERT(new_clip->has_llava_projector) failed

noble coyote Sep 21, 2024, 2:17 PM

#

fleet meteor "ollama run minicpm-v:8b-2.6-q8_0" , take into account that this uses around 10g...

I have it running now on 8Gb VRAM 😄

#

When I say "running" - prompt box states "Failed to fetch response from Ollama" - so not running then! 😦

sacred jewel Sep 21, 2024, 2:23 PM

#

fleet meteor Sep 21, 2024, 2:26 PM

#

noble coyote When I say "running" - prompt box states "Failed to fetch response from Ollama" ...

:C

#

Maybe you can try with this one (its a q6 quant) "ollama run minicpm-v:8b-2.6-q6_K"

noble coyote Sep 21, 2024, 2:29 PM

#

I get an error message - llama runner process has terminated: GGML_ASSERT(new_clip->has_llava_projector) failed

sacred jewel Sep 21, 2024, 2:30 PM

#

CyberSociety LoRA

#

noble coyote Sep 21, 2024, 2:34 PM

#

Ollama and qwen2:0.5b

sacred jewel Sep 21, 2024, 2:37 PM

#

noble coyote Sep 21, 2024, 2:39 PM

#

fleet meteor :C

I was running Ollama v0.3.9 - now I've upgraded to v0.3.11 - minicpm-v:8b-2.6-q8_0 works well 😄

#

First fruits

fleet meteor Sep 21, 2024, 2:40 PM

#

noble coyote I was running Ollama v0.3.9 - now I've upgraded to v0.3.11 - minicpm-v:8b-2.6-q8...

Nice!

sacred jewel Sep 21, 2024, 2:43 PM

#

dusky thistle Sep 21, 2024, 2:43 PM

#

noble coyote Sep 21, 2024, 2:44 PM

#

#

Somehow, Ollama gets itself into the prompt!!!!!

mortal mesa Sep 21, 2024, 2:55 PM

#

If you were trying to be more descriptive you could say which model in ollama. I know I'd be curious which models, but I'm not going to keep asking you

#

just saying Ollama is like saying i used windows to generate the image

noble coyote Sep 21, 2024, 3:01 PM

#

This only happened when there was no response from Ollama.

#

#

noble coyote Sep 21, 2024, 3:03 PM

#

mortal mesa If you were trying to be more descriptive you could say what model in ollama, i ...

My goto model is llava2:latest; then llama3:latest.
Qwen2:0.5b is also cool; and Zephyr:latest

#

My latest addition is minicpm-v:8b-2.6-q8_0

mortal mesa Sep 21, 2024, 3:05 PM

#

ya i saw you/someone mention that, looked good, will try eventually also

noble coyote Sep 21, 2024, 3:06 PM

#

mortal mesa ya i saw you/someone mention that, looked good, will try eventually also

Make sure you have Ollama v0.3.11 for minicpm to work!
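
A quick way to check this before loading the model is to compare version numbers as tuples of integers, since a plain string comparison would rank "0.3.9" above "0.3.11". The sketch below is an illustration only: the 0.3.11 threshold is the version reported to work in this chat, the helper names are hypothetical, and the installed version is assumed to come from `ollama --version` or the server's `GET /api/version` endpoint.

```python
def parse_version(text: str) -> tuple[int, ...]:
    """Turn 'v0.3.11' or '0.3.11-rc1' into (0, 3, 11)."""
    core = text.strip().lstrip("v").split("-")[0]
    return tuple(int(part) for part in core.split("."))


def is_new_enough(installed: str, required: str = "0.3.11") -> bool:
    """True if the installed version is at least the required one."""
    return parse_version(installed) >= parse_version(required)
```

For example, `is_new_enough("0.3.9")` is false while `is_new_enough("0.3.11")` is true, matching what was seen before and after the upgrade.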

#

minicpm gives a radically different look to llava2

mortal mesa Sep 21, 2024, 3:11 PM

#

you do a ton, but I'd also suggest trying llama finetunes, there are some better than the base. I'm still exploring but this one was a big improvement: ajindal/llama3.1-storm

noble coyote Sep 21, 2024, 3:15 PM

#

mortal mesa you do a ton, but I'd also suggest trying llama finetunes, there are some better...

Just installing this one ...

#

minicpm has great image-making - yet poor prompt coherence!

#

Someone said it was good for producing video

#

llama3.1-storm successfully installed

noble coyote Sep 21, 2024, 4:10 PM

#

llama3.1-storm also poor prompt coherence, sad to say

#

noble coyote Sep 21, 2024, 4:31 PM

#

#

noble coyote Sep 21, 2024, 5:26 PM

#

A candle burning in the vacuum of space, with cosmic swirls of color replacing traditional smoke and flame shapes. The candle's flame merges with celestial elements, creating a blend of fantasy and abstract art, with vibrant colors and surreal space forms dancing around the candle.

short thicket Sep 21, 2024, 5:38 PM

#

#

#

#

short thicket Sep 21, 2024, 6:25 PM

#

https://civitai.com/models/783736 for that cheesy 80's low budget sci fi look. 🙂

Ice Pirates Style - v1.0 | Stable Diffusion LyCORIS | Civitai

159 images from the 1984 Ice Pirates sci fi film trained for 7000 steps with SimpleTuner to get that old school 80's low budget sci fi look. Traini...

dusky thistle Sep 21, 2024, 6:42 PM

#

tight storm Sep 21, 2024, 7:46 PM

#

short thicket https://civitai.com/models/783736 for that cheesy 80's low budget sci fi look. 🙂...

this looks sweet for recreating a more authentic retro sci fi style tbh.

short thicket Sep 21, 2024, 7:48 PM

#

tight storm this looks sweet for recreating a more authentic retro sci fi style tbh.

Thanks! I always love seeing AI but with that retro cheese style.

fleet meteor Sep 21, 2024, 10:10 PM

#

short thicket Sep 21, 2024, 11:40 PM

#

short thicket Sep 22, 2024, 12:09 AM

#

sacred jewel Sep 22, 2024, 12:13 AM

#

#

Western Comic LoRA

#

Intricate Details LoRA

sacred jewel Sep 22, 2024, 12:56 AM

#

sacred jewel Sep 22, 2024, 1:16 AM

#

Davinci Flux LoRA

short thicket Sep 22, 2024, 1:58 AM

#

#

130 loras merged so far. Flux1-dev on the left, Mangled Merge on the right.

#

#

sacred jewel Sep 22, 2024, 2:07 AM

#

#

#

#

Xerox LoRA

sacred jewel Sep 22, 2024, 2:45 AM

#

William Mortensen LoRA

real terrace Sep 22, 2024, 2:53 AM

#

hi, I wasn't able to run the flux nf4 models (because AMD GPU and stuff I couldn't solve), I wonder if there is something new I could try

#

what's this ollama thing, something extra in the generation process?

pseudo owl Sep 22, 2024, 3:19 AM

#

real terrace what's this ollama thing, something extra in the generation process?

Ollama runs llms or multimodal llms to help enhance prompts or convert images into prompts. It’s pretty popular since it’s fast, uses low vram, and is very easy to use.

real terrace Sep 22, 2024, 3:26 AM

#

pseudo owl Ollama runs llms or multimodal llms to help enhance prompts or convert images in...

thanks

#

how is it installed or used?

pseudo owl Sep 22, 2024, 3:40 AM

#

real terrace thanks

I believe there are comfy ui nodes for it, you can also use it normally, here are the instructions: https://github.com/ollama/ollama

The only “negative” thing about it is that most of it is basically just llama.cpp with a thin layer of code on top to make it simpler. It’s much more famous, even though it relies on llama.cpp to support new models and do basically all the hard work, and it doesn’t really mention that.

GitHub

GitHub - ollama/ollama: Get up and running with Llama 3.1, Mistral,...

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models. - ollama/ollama

real terrace Sep 22, 2024, 3:40 AM

#

pseudo owl I believe there are comfy ui nodes for it, you can also use it normally, here are...

ty!

errant dust Sep 22, 2024, 5:12 AM

#

https://techcrunch.com/2024/09/20/black-forest-labs-the-company-that-powers-groks-image-generation-is-raising-another-100m-on-a-1b-valuation-say-sources/

TechCrunch

Ingrid Lunden

Exclusive: Black Forest Labs, the company that powers Grok's image ...

Black Forest Labs, an image GenAI startup that only came out of stealth two months ago, has closed a monster new round, sources say.

dusky thistle Sep 22, 2024, 7:26 AM

#

stable tide Sep 22, 2024, 7:53 AM

#

#✍🏼｜rules-and-tos

dusky thistle Sep 22, 2024, 7:58 AM

#

alpine summit Sep 22, 2024, 8:48 AM

#

noble coyote Sep 22, 2024, 8:55 AM

#

img2img Ollama and Flux and LoRA

#

#

alpine summit Sep 22, 2024, 9:13 AM

#

noble coyote Sep 22, 2024, 9:14 AM

#

#

#

#

Vape this thing!!!!

alpine summit Sep 22, 2024, 9:29 AM

#

noble coyote Sep 22, 2024, 9:38 AM

#

Oh! Llama

noble coyote Sep 22, 2024, 10:03 AM

#

#

img2img Ollama, Flux, LoRA

#

#

noble coyote Sep 22, 2024, 12:36 PM

#

vast condor Sep 22, 2024, 1:41 PM

#

toxic bone trying out this new schedule free support in flux

hey, saw this post a couple days ago, what are your thoughts?

bitter hearth Sep 22, 2024, 2:28 PM

#

from what I read that optimisation away from comfy has been around for a few months now

#

and it seems that it could be a nice efficiency saving

#

just nothing huge

#

hey guys cat bot here once again on my custom youtube modpack run episode 2

sacred jewel Sep 22, 2024, 2:50 PM

#

Well, he [is] a zombie, so the third arm is plausible? 🤷‍♂️

#

Steel Polished LoRA

hexed dirge Sep 22, 2024, 2:59 PM

#

#

#

testing now

#

sacred jewel Sep 22, 2024, 3:07 PM

#

Timeless LoRA

bitter hearth Sep 22, 2024, 3:07 PM

#

do flux loras tend to need a low strength?

#

the amount of Civit loras I find where they are burnt out at 1.0 strength is weird

#

I'm often setting it to like 0.1-0.5 at most

hexed dirge Sep 22, 2024, 3:09 PM

#

bitter hearth do flux loras tend to need a low strength?

depends on how they are trained. I noticed that with 2k or more steps and 8 network dim they look overburnt, so with my new loras I use lower values. BTW I put the strength in every model I publish
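
As background on what the strength slider does: in the usual LoRA formulation the adapter adds a low-rank update to each weight matrix, and the strength simply scales that update, roughly `W' = W + strength * (alpha / rank) * (up @ down)`. The toy sketch below uses plain Python lists and made-up 2x2 numbers to show the effect; the function names are hypothetical and real loaders work on tensors.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def apply_lora(weight, up, down, alpha, strength):
    """Return weight + strength * (alpha / rank) * (up @ down)."""
    rank = len(down)  # number of rows in the down-projection
    scale = strength * alpha / rank
    delta = matmul(up, down)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(weight, delta)]


base = [[1.0, 0.0], [0.0, 1.0]]
up, down = [[1.0], [2.0]], [[3.0, 4.0]]  # rank-1 toy adapter
half = apply_lora(base, up, down, alpha=1.0, strength=0.5)
```

Halving the strength halves the change applied to every weight, which is why an over-trained ("burnt") LoRA can look fine at 0.3 to 0.5 but overpower the base model at 1.0.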

#

#

https://civitai.com/models/786682

Fantastic Realism - v1.0 | Stable Diffusion LoRA | Civitai

Keyword : f4nt4st1c Strength : 0.8/0.9 . Guidance : 4 The style of Fantastic Realism blends highly detailed, lifelike representations with imaginat...

toxic bone Sep 22, 2024, 3:25 PM

#

vast condor hey, saw this post a couple days ago, what are your thoughts?

it converges very quickly and doesn't over fit. i can't seem to give it a learn rate it hates.

#

well, low ones.

hexed dirge Sep 22, 2024, 3:36 PM

#

#

#

sacred jewel Sep 22, 2024, 3:48 PM

#

bitter hearth do flux loras tend to need a low strength?

Some do... my own Magritte one is quite strong... needs to be around .60 in most cases. Similarly, my SpyWorld one is a bit strong and can work at 1.00 but mostly better at lower strengths.

DISCLAIMER: I have no clue how any of this works 😛 ... my LoRAs were done with default settings on either Replicate, Civit or local

#

hexed dirge Sep 22, 2024, 3:53 PM

#

Also I noticed that some loras at 1.0 strength tend to draw too many things in a picture

sacred jewel Sep 22, 2024, 4:01 PM

#

hexed dirge https://civitai.com/models/786682

Fantastic Realism LoRA

#

Above is 1.0 strength, Guidance 3.5

#

Strength .80, Guidance 4.0

short thicket Sep 22, 2024, 4:07 PM

#

230 lora merged. Dev on the left, MangledMerge on the right.

#

#

#

noble coyote Sep 22, 2024, 4:11 PM

#

Ollama Flux Fantastic_Realism LoRA

sacred jewel Sep 22, 2024, 4:21 PM

#

#

Mixed with my Rene Magritte LoRA 🤭

noble coyote Sep 22, 2024, 4:25 PM

#

Ollama Flux Fantastic_Realism LoRA

sacred jewel Sep 22, 2024, 4:27 PM

#

If I use the trigger on my LoRA, it kicks into overdrive LOL. Same strength, same settings as before, just added the trigger and goodbye prompt LOL

noble coyote Sep 22, 2024, 4:33 PM

#

noble coyote Sep 22, 2024, 5:02 PM

#

DallE Theme-of-the-Day Prompt = A surreal, imaginative scene featuring an AirPod Pro floating in mid-air with glowing sound waves emanating from it in multiple directions, creating an entirely new dimension of audio. The sound waves transform into vibrant, swirling patterns that ripple through space, merging with abstract, colorful landscapes that represent different sounds and environments. The AirPod Pro is white and sleek, with its details highlighted by the surreal light and spatial elements around it, giving a futuristic and immersive effect.

#

Fabulous Realism LoRA on a van Gogh style image

#

noble coyote Sep 22, 2024, 5:30 PM

#

sacred jewel Sep 22, 2024, 5:52 PM

#

gilded silo Sep 22, 2024, 5:55 PM

#

more people added into sd3.5 testing, dpo soon i hope

bitter hearth Sep 22, 2024, 6:21 PM

#

gilded silo more people added into sd3.5 testing, dpo soon i hope

hopefully, I'm really looking forward to SD3.5

#

need an undistilled model with 16 channel VAE

sacred jewel Sep 22, 2024, 6:34 PM

#

Desolation and Lines LoRAs combined

sacred jewel Sep 22, 2024, 7:09 PM

#

#

Blue Future added to the mix

sacred jewel Sep 22, 2024, 7:36 PM

#

Victorian Gothic Horror LoRA

sacred jewel Sep 22, 2024, 7:57 PM

#

Anatomica v9 LoRA

hexed dirge Sep 22, 2024, 8:17 PM

#

https://civitai.com/models/787516

Metal Logo - v1.0 | Stable Diffusion LoRA | Civitai

Instructions : keyword : m3t4ll0g0 Strength : start with 0.7 but go up to over 1 if it doesn't come out a logo but a normal drawing. Guidance : 3.5...

sacred jewel Sep 22, 2024, 8:26 PM

#

more Anatomica

sacred jewel Sep 22, 2024, 8:36 PM

#

hexed dirge https://civitai.com/models/787516

hexed dirge Sep 22, 2024, 8:36 PM

#

sacred jewel

wonderful

hexed dirge Sep 22, 2024, 8:37 PM

#

sacred jewel

can you try the prompt adding "Metal Logo" ?

sacred jewel Sep 22, 2024, 8:39 PM

#

sacred jewel Sep 22, 2024, 8:39 PM

#

hexed dirge can you try the prompt adding "Metal Logo" ?

hang on

hexed dirge Sep 22, 2024, 8:40 PM

#

sacred jewel hang on

yes here we are

#

I'm changing the description of lora

sacred jewel Sep 22, 2024, 8:50 PM

#

prompt was m3t4ll0g0 Text : "WHOA". metal logo. zombies running screaming

sacred jewel Sep 22, 2024, 8:55 PM

#

sacred jewel

Same settings but strength 1.00 instead of 0.70

#

hexed dirge Sep 22, 2024, 9:01 PM

#

very good

#

vast condor Sep 22, 2024, 9:07 PM

#

toxic bone it converges very quickly and doesn't over fit. i can't seem to give it a learn...

interesting, I'm too far along in my current multi-concept lora to restart it with a new scheduler, but I'll have to check that out. It reminds me of early XL and prodigy, which converged way way faster than AdamW

sacred jewel Sep 22, 2024, 9:10 PM

#

hexed dirge Sep 22, 2024, 9:29 PM

#

sacred jewel Sep 22, 2024, 9:42 PM

#

toxic bone Sep 22, 2024, 10:20 PM

#

vast condor interesting, I'm too far along in my current multi-concept lora to restart it wi...

i'd run some smaller tests before dedicating time to it. i did a 200 image dataset and it took it in well. mostly doing a lot of 30 image sets.

sacred jewel Sep 22, 2024, 11:33 PM

#

sacred jewel Sep 23, 2024, 1:20 AM

#

short thicket Sep 23, 2024, 1:56 AM

#

https://civitai.com/models/788136?modelVersionId=881383 Version 0

Mangled Merge Flux - v0 BFloat16 | Stable Diffusion Checkpoint | Ci...

It is my pleasure to introduce Mangled Merge Flux to the Civitai community. Continuing the tradition started with Stable Diffusion 2.1, and then SD...

sacred jewel Sep 23, 2024, 2:21 AM

#

#

short thicket Sep 23, 2024, 2:51 AM

#

i wish civitai had an easier way to manage versions. this is gonna be a mess with all these quants in a few months.

alpine summit Sep 23, 2024, 3:53 AM

#

Sd1.5

#

alpine summit Sep 23, 2024, 4:35 AM

#

#

bitter hearth Sep 23, 2024, 6:07 AM

#

craggy crest Sep 23, 2024, 6:51 AM

#

noble coyote Sep 23, 2024, 7:13 AM

#

Ollama/Flux img2img

#

noble coyote Sep 23, 2024, 7:50 AM

#

noble coyote Sep 23, 2024, 8:48 AM

#

dusky thistle Sep 23, 2024, 9:20 AM

#

#

alpine summit Sep 23, 2024, 9:53 AM

#

#

alpine summit Sep 23, 2024, 10:49 AM

#

#

dusky thistle Sep 23, 2024, 10:54 AM

#

alpine summit Sep 23, 2024, 11:07 AM

#

bitter hearth Sep 23, 2024, 11:49 AM

#

I scrolled through a bunch of new messages really quickly

#

I could see them

#

The whoas

#

thomas

noble coyote Sep 23, 2024, 12:22 PM

#

Made using Flux alone with @hexed dirge m3t4ll0g0 LoRA

#

#

#

#

#

sacred jewel Sep 23, 2024, 2:24 PM

#

dusky thistle Sep 23, 2024, 3:34 PM

#

#

#

#

#

#

#

#

#

noble coyote Sep 23, 2024, 5:17 PM

#

A surreal landscape depicting the equinox with the Sun positioned exactly overhead, casting minimal shadows. The scene is inspired by the mystical and symbolic style of Alejandro Jodorowsky's 'The Holy Mountain.' Elements of surrealism and esotericism are present, with abstract shapes, towering mountain-like structures, and enigmatic figures standing in meditative poses. The colors are vivid and otherworldly, with golden, deep blues, and crimson hues blending together. Rays of light emanate from the Sun, creating an ethereal and almost divine atmosphere, reminiscent of a dreamlike and spiritual world.

#

#

#

#

#

hallow lion Sep 23, 2024, 5:31 PM

#

alpine summit Sd1.5

This girl is the amalgamation of all girls.

#

she is nobody and everybody

noble coyote Sep 23, 2024, 5:32 PM

#

"I don't know her!!!" 🥳

#

hallow lion Sep 23, 2024, 5:33 PM

#

Quigglestink!

noble coyote Sep 23, 2024, 5:33 PM

#

😄

#

Quigglestink and Schtumm - a bad detective agency

turbid grotto Sep 23, 2024, 5:34 PM

#

guys, does quantized Flux work with controlnet?

noble coyote Sep 23, 2024, 5:34 PM

#

#

#

cursive frigate Sep 23, 2024, 6:22 PM

#

Getting some great results today. 🙂

#

#

pseudo owl Sep 23, 2024, 6:23 PM

#

Anyone test this? dev one will come a bit later it seems(still in training)

https://huggingface.co/SG161222/RealFlux_1.0b_Schnell

SG161222/RealFlux_1.0b_Schnell · Hugging Face

bitter hearth Sep 23, 2024, 6:28 PM

#

pseudo owl Anyone test this? dev one will come a bit later it seems(still in training) htt...

samples are up https://civitai.com/models/788550?modelVersionId=881836

RealFlux 1.0b - 1.0b_Compact_Schnell | Stable Diffusion Checkpoint ...

You can support me directly on Boosty The model is still in training and will improve with each update RealFlux Hugging Face Full Collection Using ...

pseudo owl Sep 23, 2024, 6:30 PM

#

bitter hearth samples are up https://civitai.com/models/788550?modelVersionId=881836

Yeah samples actually do seem pretty nice, but not sure if it still has as good prompt following and text rendering as normal schnell.

bitter hearth Sep 23, 2024, 6:31 PM

#

I might switch to this model

#

cos I already liked Schnell's compositions and layouts more

pseudo owl Sep 23, 2024, 6:52 PM

#

Yeah schnell composition was more creative than dev and even pro’s I believe. Not quality tho but realflux might help

icy drift Sep 23, 2024, 8:21 PM

#

bitter hearth samples are up https://civitai.com/models/788550?modelVersionId=881836

I can't handle Schnell's tendency to make foam-noise out of details like building windows and flower fields, and I already get 6-step renders out of Dev with the Hyper lora. But this is 💯 the best looking Schnell output I have seen so far. For close-up stuff, I bet this is good enough. Downloading now to see what it can handle at 4 steps.

#

Here's a 4MP image with this model generated in 28 seconds with 3 rounds of 4 steps each. Notice the grainy mess of the food products.

#

Here's the exact same prompt and workflow using the Schnell base model. Notice the crisp definition of every item on every shelf.

pseudo owl Sep 23, 2024, 8:36 PM

#

icy drift Here's a 4MP image with this model generated in 28 seconds with 3 rounds of 4 st...

you need distilled cfg, did you use that?

icy drift Sep 23, 2024, 8:37 PM

#

I think this model was finetuned for realistic textures, and in the process it lost some general object knowledge.

pseudo owl Sep 23, 2024, 8:37 PM

#

icy drift I think this model was finetuned for realistic textures, and in the process it l...

You need the optimal settings, try it with this
Euler Beta

Sampling Steps: 4-6

Distilled CFG Scale: 3.5

CFG Scale: 1.0

icy drift Sep 23, 2024, 8:38 PM

#

pseudo owl You need the optimal settings, try it with this Euler Beta Sampling Steps: 4-6 ...

Yeah I followed the directions and used the same for the base Schnell, just for consistency.

#

Also, I was only testing the 4-step performance, because I already have 6-step dev.

#

I wonder if there's some specific subject where it could outperform. Hmm.

pseudo owl Sep 23, 2024, 8:41 PM

#

icy drift Yeah I followed the directions and used the same for the base Schnell, just for ...

Oh interesting, both seem to have pros and cons. Real flux's looks better, the cart is not weirdly opened or a weird shape, the human behind is pretty weird (big head but small legs), but some objects behind in realvisxl's are mushy.

It does look better than normal schnell for sure but doesn't fix all flaws. Can you try text with it?

icy drift Sep 23, 2024, 8:45 PM

#

pseudo owl Oh interesting, both seem to have cons and pros. Real flux's looks better, the ...

Trying now with base, and then I'll try with real. This is the last test I'm doing though, gotta get some editing done.

#

Target text is: "Does this mawashi make me look fat?"
Base model:

pseudo owl Sep 23, 2024, 8:49 PM

#

icy drift Trying now with base, and then I'll try with real. This is the last test I'm doi...

Yep, thanks for testing!

icy drift Sep 23, 2024, 8:49 PM

#

In this case I think it got a better result overall, and the texture is much more realistic. The win goes to real. Gotta edit though. See ya.

cursive frigate Sep 23, 2024, 9:03 PM

#

To upgrade pytorch + cuda, do I need to be in the ComfyUI_windows_portable folder? Or do I need to be in the python_embeded folder?
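
A hedged sketch, since nobody answers this in the thread: as far as I know the working folder matters less than which Python runs pip, so the upgrade should go through the `python.exe` bundled in `python_embeded`. The helper below only builds the command and does not run it. `build_upgrade_command`, the example install path and the default `cu124` wheel tag are all assumptions for illustration; check the PyTorch install page for the tag that matches your driver.

```python
from pathlib import PureWindowsPath


def build_upgrade_command(portable_root, cuda_tag="cu124"):
    """Build (but do not run) a pip upgrade command for the bundled Python.

    portable_root: path to the ComfyUI_windows_portable folder (assumed layout).
    cuda_tag: PyTorch wheel index tag; the default is only an example.
    """
    python_exe = PureWindowsPath(portable_root) / "python_embeded" / "python.exe"
    return [
        str(python_exe),
        "-s",  # keep the user site-packages folder out of the way
        "-m", "pip", "install", "--upgrade",
        "torch", "torchvision", "torchaudio",
        "--extra-index-url", f"https://download.pytorch.org/whl/{cuda_tag}",
    ]


# Hypothetical install location; replace with your own.
command = build_upgrade_command(r"C:\ComfyUI_windows_portable")
print(" ".join(command))
```

Because the command uses the full path to the embedded interpreter, it can be pasted into a terminal opened in either folder.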

bitter hearth Sep 23, 2024, 9:12 PM

#

it's a realvis model
they are always only for photos

pseudo owl Sep 23, 2024, 9:13 PM

#

bitter hearth its realvis model they are always only for photos

Seems considerably better at text as well, so it's actually pretty nice. Probably going to replace hyper for me when an nf4 comes out.

bitter hearth Sep 23, 2024, 9:19 PM

#

sampler matters a lot too, compare these two images of same seed, 6 steps:

#

#

pseudo owl Sep 23, 2024, 10:47 PM

#

bitter hearth sampler matters a lot too, compare these two images of same seed, 6 steps:

yeah 2nd one seems much better

craggy crest Sep 24, 2024, 12:07 AM

#

sacred jewel Sep 24, 2024, 1:21 AM

#

sacred jewel Sep 24, 2024, 1:21 AM

#

craggy crest

Nice!

craggy crest Sep 24, 2024, 1:22 AM

#

🙂 thanks

sacred jewel Sep 24, 2024, 1:22 AM

#

#

#

#

I don't make the rules

#

#

#

cinder junco Sep 24, 2024, 1:32 AM

#

dusky thistle Sep 24, 2024, 2:45 AM

#

#

sturdy dune Sep 24, 2024, 3:00 AM

#

What's the best way to copy JUST the style of an image?

dusky thistle Sep 24, 2024, 3:37 AM

#

sturdy dune What's the best way to copy JUST the style of an image?

Cubiq's ipadapter nodes within comfyui

cursive frigate Sep 24, 2024, 3:55 AM

#

I'm not sure where else to ask this... It seems like the perfect place...

Does anyone know if Nerdy Rodent has a discord server?

craggy crest Sep 24, 2024, 4:51 AM

#

cursive frigate I'm not sure where else to ask this... It seems like the perfect place... Doe...

nope. just his twitter account and his youtube channel

#

he's pretty good about responding on twitter though

sturdy dune Sep 24, 2024, 5:11 AM

#

Thanks @topaz valley and @dusky thistle https://github.com/cubiq/ComfyUI_IPAdapter_plus/tree/main/examples

I'm just going through them now; do you know which example? Kolors and image didn't do much.

GitHub

ComfyUI_IPAdapter_plus/examples at main · cubiq/ComfyUI_IPAdapter_p...

Contribute to cubiq/ComfyUI_IPAdapter_plus development by creating an account on GitHub.

#

To me it doesn't seem like it does anything more than copy the image like a noise / blur, it's not matching the style.

#

The top is the style I'd like to copy

#

Okay so for example:

#

The top left image is the style that I would like,

#

It's not BAD but is that the best quality I'll get?

#

Auto 11 is the way to train a lora?

#

#

https://github.com/hako-mikan/sd-webui-traintrain ?

GitHub

GitHub - hako-mikan/sd-webui-traintrain: LoRA training extention fo...

LoRA training extention for Stable Diffusion Web-UI - hako-mikan/sd-webui-traintrain

#

Yeah I know, that's why I'm confused.

#

Yes, pixel art.

#

But I think I'm just going to train locally.

#

Is the traintrain good?

#

I don't know what idk civit :goat: means

#

civitai?

muted dove Sep 24, 2024, 8:27 AM

#

#

#

muted dove Sep 24, 2024, 8:50 AM

#

noble coyote Sep 24, 2024, 8:53 AM

#

#

Flux/Florence2 i2i

bitter hearth Sep 24, 2024, 10:01 AM

#

tranquil sinew Sep 24, 2024, 10:27 AM

#

metropol parasol

bitter hearth Sep 24, 2024, 12:26 PM

#

bitter hearth Sep 24, 2024, 2:11 PM

#

Schnell

craggy crest Sep 24, 2024, 4:18 PM

#

well, well, well

noble coyote Sep 24, 2024, 5:01 PM

#

Vaguely abstracted modernist oil painting in an expressionist painterly style. A young child stands by the glass wall of a zoo, a gorilla sitting on the other side in its leafy green enclosure. They both hold up a hand to sign a kind "I love you". Subtle imperfections and splattery effect. Bold textures.

mortal mesa Sep 24, 2024, 5:04 PM

#

2024-09-24-010519-flux1-dev-Q8_0.gguf_00002_.png

craggy crest Sep 24, 2024, 5:15 PM

#

Full post is here https://stability.ai/news/james-cameron-joins-stability-ai-board-of-directors

Stability AI

James Cameron, Academy Award-Winning Filmmaker, Joins Stability AI ...

Today we announced that legendary filmmaker, technology innovator, and visual effects pioneer James Cameron has joined our Board of Directors.

noble coyote Sep 24, 2024, 5:17 PM

#

pseudo owl Sep 24, 2024, 5:18 PM

#

craggy crest Full post is here https://stability.ai/news/james-cameron-joins-stability-ai-boa...

I hope they cook something good, sd3 was very disappointing.

craggy crest Sep 24, 2024, 5:20 PM

#

pseudo owl I hope they cook something good, sd3 was very disappointing.

SD3 2B is a very good model. the only reason it was disappointing is because 1. people didn't bother to learn how to use it and 2. it isn't unet, and has a couple of core issues. however, it is much much better than flux, which is seriously broken, extremely rigid, and massively overfit for several concepts to mask the same core issues that SD3 2B has

#

yet people are falling all over themselves to use flux because robin HID those issues and they haven't noticed them

pseudo owl Sep 24, 2024, 5:22 PM

#

craggy crest SD3 2B is a very good model. the only reason it was disappointing is because 1. ...

I respect your opinion.

craggy crest Sep 24, 2024, 5:23 PM

#

pseudo owl I respect your opinion.

i did the testing - and the work - to drill down and figure out why sd3 2b has the issues it has, and just spent hundreds of hours and the last month walking through flux's latent space. it's not an opinion, it's hard facts

pseudo owl Sep 24, 2024, 5:24 PM

#

craggy crest i did the testing - and the work - to drill down and figure out why sd3 2b has t...

Sure 👍

craggy crest Sep 24, 2024, 5:25 PM

#

pseudo owl Sure 👍

i honestly don't care if you believe me or not

bitter hearth Sep 24, 2024, 5:25 PM

#

craggy crest yet people are falling all over themselves to use flux because robin HID those i...

i do agree with what you say, hope sd3.5 won't flop, distilled models are mid

craggy crest Sep 24, 2024, 5:25 PM

#

bitter hearth i do agree with what you say, hope sd3.5 won't flop, distilled models are mid

we'll have to wait and see, won't we?

lunar canopy Sep 24, 2024, 5:29 PM

#

appreciate your fondness for sd3, however can we tone back on the constant trying to "win" others over? definitely okay to have differing opinions. @craggy crest

craggy crest Sep 24, 2024, 5:29 PM

#

lunar canopy appreciate your fondness for sd3, however can we tone back on the constant tryin...

i'm not trying to win anyone over

mortal mesa Sep 24, 2024, 5:30 PM

#

too late for that

lunar canopy Sep 24, 2024, 5:31 PM

#

yes, which is why I used quotes. A lot of your history here is arguing that others are wrong, and you are right. So, I'm asking to tone back so the environment is nice and clear, @craggy crest

bitter hearth Sep 24, 2024, 5:55 PM

#

craggy crest well, well, well

we'll have good movies now ?!

#

thomas

#

fruit waow

#

some of the stuff that is known not to work on Flux because it's distilled might actually work

#

I'll post a miku in anime

#

SDE and ancestral sampling work now if done right

#

and it's possible SAG/PAG will work if you tonemap the inevitable CFG burn away

#

there might also be a sneaky way to get something like tiled control net without training one

#

so many issues

#

can't you just use a1111 sadcat

#

I actually tried the other day to read a1111 code

#

it's very confusing

#

(every piece of code is confusing to me)

pliant oar Sep 24, 2024, 6:01 PM

#

craggy crest i did the testing - and the work - to drill down and figure out why sd3 2b has t...

and did you find anything?:D

craggy crest Sep 24, 2024, 6:01 PM

#

pliant oar and did you find anything?:D

yes. and turned the information in to the developers

pliant oar Sep 24, 2024, 6:01 PM

#

it's secret? :D I'm just nosey

lunar canopy Sep 24, 2024, 6:06 PM

#

bitter hearth fruit waow

notLikeMiku

bitter hearth Sep 24, 2024, 6:12 PM

#

bitter hearth (every piece of code is confusing to me)

diffusers is probably the nicest overall code base out there for this
A1111 is essentially legacy code at this point, it's not really made well for scaling

#

there are UIs that run on diffusers too like SD-next and Invoke, it's not always command line

#

sd next is pretty cool

#

UI is questionable but

#

kinda cool

#

it runs a tiny bit better with zluda than a1111 on my pc, not anything worth using still sadcat

pseudo owl Sep 24, 2024, 6:27 PM

#

bitter hearth i do agree with what you say, hope sd3.5 won't flop, distilled models are mid

Yeah sd3.5 8b already looks good, and it's not even fully done training yet from what I have heard. Hope they release the 8b one instead of 2b this time.

sacred jewel Sep 24, 2024, 6:32 PM

#

#

#

sacred jewel Sep 24, 2024, 6:35 PM

#

pseudo owl I hope they cook something good, sd3 was very disappointing.

SD4.0 in a couple of weeks.

#

#

#

#

#

#

#

bitter hearth Sep 24, 2024, 6:48 PM

#

sacred jewel

ball

#

waow

sacred jewel Sep 24, 2024, 6:50 PM

#

#

brittle nexus Sep 24, 2024, 8:43 PM

#

#

cog locally

brittle nexus Sep 24, 2024, 9:00 PM

#

brittle nexus Sep 24, 2024, 10:35 PM

#

sacred jewel Sep 24, 2024, 11:04 PM

#

cursive frigate Sep 25, 2024, 2:11 AM

#

Not sure if I am doing something wrong. Maybe someone can load up this workflow and check it out, but these images seem to be grainy or pixelated.

#

#

#

Any advice here would be appreciated on this.

brittle nexus Sep 25, 2024, 2:50 AM

#

I won't run that, looks like an upscale only, so without the original image i can't say

brittle nexus Sep 25, 2024, 2:52 AM

#

cursive frigate Not sure if I am doing something wrong. Maybe someone can load up this workflow ...

And yes. It's a bad upscale

cursive frigate Sep 25, 2024, 3:15 AM

#

brittle nexus And yes. It's bad upscale

This is the original that I am using to do img2img with.

#

Here is the base image it spit out for this example

#

here is the bad upscale.

brittle nexus Sep 25, 2024, 3:47 AM

#

cursive frigate This is the original that I am using to do img2img with.

Do you want to change the original that much?

magic isle Sep 25, 2024, 4:23 AM

#

Imagine: chotta bheem white dress

brittle nexus Sep 25, 2024, 4:55 AM

#

cursive frigate here is the bad upscale.

It's better, right? i'll try to build a workflow tomorrow

errant dust Sep 25, 2024, 5:20 AM

#

https://arstechnica.com/information-technology/2024/09/james-cameron-once-warned-us-about-ai-now-hes-joined-an-ai-companys-board/

Ars Technica

Terminator’s Cameron joins AI company behind controversial image ge...

Famed sci-fi director joins board of embattled Stability AI, creator of Stable Diffusion.

bitter hearth Sep 25, 2024, 5:24 AM

#

looks like SAI are a going concern I guess

cursive frigate Sep 25, 2024, 5:33 AM

#

brittle nexus Do you want to change the original that much?

i want to keep the colors, lighting, and subject characteristics close to the original. the only real change i want is for it to go from a rendered cgi image to a realistic looking photo if possible.

#

the images i produced so far are not really the end goal. I just couldn't figure out the upscaling issue with that workflow. It's one I got from Nerdy Rodent

bitter hearth Sep 25, 2024, 5:36 AM

#

could you tell us a bit more about
the img-to-img method you used
and then the upscale method

cursive frigate Sep 25, 2024, 5:38 AM

#

bitter hearth could you tell us a bit more about the img-to-img method you used and then the ...

i left it embedded in the images. I'm already off the computer for the night. I'll upload the json for the workflow in the morning.

bitter hearth Sep 25, 2024, 5:39 AM

#

ah ok, I don't need the JSON I can get the workflow from the image when I have a server up 👍

#

we're in a rough spot for upscaling at the moment due to a lack of good Flux control nets

#

so the choice is to either use Flux with no control net, which requires re-rolling tiles a lot

#

or to use SDXL with SUPIR (the best control net around) or other SDXL tiled control nets

#

if the image is very large (and so the number of tiles is high) then SD 1.5 can be good too
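
To put a number on "the number of tiles is high", here is a minimal sketch that counts overlapping tiles for a tiled upscale. The tile size and overlap defaults are assumptions picked for illustration (something like 1024 for SDXL-sized tiles, 512 for SD 1.5), and real upscale nodes may lay tiles out differently.

```python
import math


def tiles_per_axis(size, tile, overlap):
    """Number of overlapping tiles needed to cover `size` pixels on one axis."""
    if size <= tile:
        return 1
    stride = tile - overlap  # how far each new tile advances
    return math.ceil((size - tile) / stride) + 1


def tile_count(width, height, tile=1024, overlap=128):
    """Total tiles for a width x height image (defaults are illustrative only)."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be at least 0 and smaller than the tile size")
    return tiles_per_axis(width, tile, overlap) * tiles_per_axis(height, tile, overlap)


# Same target image, two assumed tile sizes.
large_tiles = tile_count(4096, 4096, tile=1024, overlap=128)
small_tiles = tile_count(4096, 4096, tile=512, overlap=64)
print(large_tiles, small_tiles)
```

Every tile is a separate diffusion pass, so the per-tile cost of the model gets multiplied by this count, which is the trade-off the messages above are describing.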

noble coyote Sep 25, 2024, 7:28 AM

#

GGUF Flux

#

#

#

#

#

#

#

#

noble coyote Sep 25, 2024, 8:26 AM

#

dusky thistle Sep 25, 2024, 9:12 AM

#

#

#

#

dusky thistle Sep 25, 2024, 10:13 AM

#

#

#

alpine summit Sep 25, 2024, 10:48 AM

#

#

muted dove Sep 25, 2024, 11:36 AM

#

Genuine Trump rally sign

#

#

bitter hearth Sep 25, 2024, 12:02 PM

#

with flux even memes are high quality

noble coyote Sep 25, 2024, 12:55 PM

#

civic trail Sep 25, 2024, 1:38 PM

#

noble coyote Sep 25, 2024, 1:52 PM

#

GGUF Flux

#

#

cursive frigate Sep 25, 2024, 3:39 PM

#

bitter hearth ah ok, I don't need the JSON I can get the workflow from the image when I have a...

I started over and I am working with a new workflow. It seems to be working so much better. Hopefully going forward we get some better upscale models.

sacred jewel Sep 25, 2024, 3:47 PM

#

noble coyote Sep 25, 2024, 3:48 PM

#

Anyone got an IPAdapter for Flux - but not X-Flux - as my VRAM isn't up to it?! 😄

sacred jewel Sep 25, 2024, 3:48 PM

#

noble coyote Anyone got an IPAdapter for Flux - but not X-Flux - as my VRAM isn't up to it?! ...

Until Matteo makes one, I am out 😛 😛 😛

noble coyote Sep 25, 2024, 3:49 PM

#

I guess ...

#

#

GGUF_Q8 Flux

bitter hearth Sep 25, 2024, 3:52 PM

#

sacred jewel Sep 25, 2024, 4:04 PM

#

dusky thistle Sep 25, 2024, 4:17 PM

#

#

dusky thistle Sep 25, 2024, 4:35 PM

#

sacred jewel Sep 25, 2024, 4:51 PM

#

noble coyote Sep 25, 2024, 5:06 PM

#

A close-up, intense view of a baseball catcher signaling a low curveball, with the focus on the right hand giving the sign. The catcher’s fingers are clearly extended downward, hidden behind his legs, showing two fingers as he discreetly calls for the pitch. His left hand holds the mitt low to the ground, but the real attention is on the precise and subtle movement of the fingers, communicating strategy in a tense, pressure-filled moment. The dirt-covered ground and beads of sweat on his hand add to the intensity, while the shadowy atmosphere heightens the focus on the sign itself.

#

noble coyote Sep 25, 2024, 6:21 PM

#

#

sacred jewel Sep 25, 2024, 6:35 PM

#

sacred jewel Sep 25, 2024, 7:02 PM

#

I tried this in SDXL and SD3 base models and they were a big fail... Flux does it properly

#

This one, not so much

bitter hearth Sep 25, 2024, 7:34 PM

#

anyone seen the blueberry model on: https://artificialanalysis.ai/text-to-image ?

Text to Image Models and Providers Leaderboard | Artificial Analysis

Analysis of Text to Image AI models and providers across quality, generation time and price. Analysis to help you choose the best Text to Image model and provider for your use-case.

#

bitter hearth Sep 25, 2024, 7:35 PM

#

sacred jewel I tried this in SDXL and SD3 base models and were a big fail... Flux does it pro...

what did you try

#

I just see blurry

#

thomas

sacred jewel Sep 25, 2024, 7:36 PM

#

Exactly that ... 😄
Prompt is:
smooth color gradient, representing as many colors of the spectrum as possible

#

SDXL and SD3 generated a muddled mess...

I am sure it is a Skillz issue though 🤭

bitter hearth Sep 25, 2024, 7:38 PM

#

#

thomas

#

lmao

noble coyote Sep 25, 2024, 7:38 PM

#

bitter hearth anyone seen the blueberry model on: https://artificialanalysis.ai/text-to-image ...

where do u get blueberry?

bitter hearth Sep 25, 2024, 7:38 PM

#

sacred jewel Exactly that ... 😄 Prompt is: smooth color gradient, representing as many ...

show the one you got from flux

bitter hearth Sep 25, 2024, 7:39 PM

#

noble coyote where do u get blueberry?

from trees

#

waow 👍

bitter hearth Sep 25, 2024, 7:40 PM

#

noble coyote where do u get blueberry?

you can go through https://artificialanalysis.ai/text-to-image/arena for the actual arena, but it's on the leaderboard tab, it's at the top above flux pro

Text to Image Arena | Artificial Analysis

Understand which AI text-to-image models to use by choosing your preferred image without knowing the provider.

sacred jewel Sep 25, 2024, 7:40 PM

#

bitter hearth show the one you got from flux

The one I posted was from Flux...

bitter hearth Sep 25, 2024, 7:40 PM

#

bitter hearth you can go through the https://artificialanalysis.ai/text-to-image/arena for the...

4 second generation time with cfg would be a 6-8b model, maybe sd 3.5?? There are two models on there, both close in elo, so guessing they are doing a/b testing

bitter hearth Sep 25, 2024, 7:40 PM

#

sacred jewel The one I posted was from Flux...

catwhaaa I thought that was a fail it looked ugly sadcat

#

waow

bitter hearth Sep 25, 2024, 7:42 PM

#

bitter hearth 4 second generation time with cfg would be 6-8b model, maybe sd 3.5?? There's tw...

sample generation

#

doggo!

#

now this is cool @sacred jewel waow

hallow lion Sep 25, 2024, 8:45 PM

#

dusky thistle

these are flux dev + realism lora only?

dusky thistle Sep 25, 2024, 8:51 PM

#

hallow lion these are flux dev + realism lora only?

also using ClownSampler from my repo which is using a heavily modified version of the refined exponential solver (RES)

hallow lion Sep 25, 2024, 9:16 PM

#

dusky thistle also using ClownSampler from my repo which is using a heavily modified version o...

Is it publicly available? 😉

dusky thistle Sep 25, 2024, 9:19 PM

#

hallow lion Is it publicly available? 😉

yep, in my res4lyf repo on github

hallow lion Sep 25, 2024, 9:35 PM

#

oh my

#

that example workflow catwhaaa

hallow lion Sep 25, 2024, 9:53 PM

#

i installed it but it does not show up in comfy

pseudo owl Sep 25, 2024, 10:18 PM

#

Alright first time using comfyui, lets see how it works.

cursive frigate Sep 25, 2024, 10:40 PM

#

This is a bit freaky... It had nothing to do with the prompt I initially gave it or the image I put in for img2img...

#

I call it the demon llama and the monk.

#

lol

errant dust Sep 25, 2024, 11:26 PM

#

Although this will come across as COMPLETELY obvious, stock images of basic stuff are just going to die as a market

#

I needed an image, on a clean background, of random stacks of books. Random sizes, colors, and age

#

Flux Pro:

#

#

Ideogram 2.0:

#

#

(among many for both generators)

#

I mean with such ease, why would anyone waste time or money on stock images of such?

#

I show both, not as a competition between the two, but to show that any top generator can do the job

sacred jewel Sep 26, 2024, 12:58 AM

#

bitter hearth now this is cool @sacred jewel waow

Nice

dusky thistle Sep 26, 2024, 2:53 AM

#

hallow lion i installed it but it does not show up in comfy

errors at the console on startup maybe?

hallow lion Sep 26, 2024, 3:51 AM

#

they stay red and its not in the manager yet

#

i'm trying to run the flux controlnets too and they don't work. do they work with the schnell fp8 version?

dusky thistle Sep 26, 2024, 7:04 AM

#

No idea with controlnet

#

My nodes work with fp8

noble coyote Sep 26, 2024, 7:18 AM

#

GGUF Flux + LoRAs

#

#

noble coyote Sep 26, 2024, 7:23 AM

#

errant dust I show both, not as a competition between the two, but to show that any top gene...

With Adobe, they try to guarantee that their Stock Photos have not been based on copyrighted material
Most AI, on the other hand, has shedloads of copyrighted material behind it ... !

#

#

#

#

#

#

bitter hearth Sep 26, 2024, 8:28 AM

#

Have some furry @sage burrow sadcat

alpine summit Sep 26, 2024, 8:39 AM

#

full night Sep 26, 2024, 8:57 AM

#

errant dust I mean with such ease, why would anyone waste time or money on stock images of s...

Simple answer - Copyrights. lol ✌️ money was the reason

full night Sep 26, 2024, 9:01 AM

#

errant dust I mean with such ease, why would anyone waste time or money on stock images of s...

You can save $15 and nobody cares, but a business can't afford such a risk. For them such a mistake could cost $15M 👻

tidal oasis Sep 26, 2024, 9:04 AM

#

Nude

noble coyote Sep 26, 2024, 9:05 AM

#

#

Newed

full night Sep 26, 2024, 9:07 AM

#

@errant dust have you seen "The Billion Dollar Code" on Netflix? Opposite example though - Google stole Planet Earth algo, and then won the lawsuit vs the German founders.

noble coyote Sep 26, 2024, 9:08 AM

#

The Laion, the Witch and the Wardrobe?!

#

#

#

dusky thistle Sep 26, 2024, 9:35 AM

#

#

civic trail Sep 26, 2024, 9:40 AM

#

alpine summit Sep 26, 2024, 9:42 AM

#

noble coyote Sep 26, 2024, 9:53 AM

#

#

#

#

#

#

#

GGUF Flux + LoRAs

sage burrow Sep 26, 2024, 10:44 AM

#

I've been offline for a couple of weeks, what did I miss?

#

and why aren't there really many new flux loras and checkpoints? I made a few just to prove it could be done, hoping that others would create a bunch 😄

muted dove Sep 26, 2024, 11:19 AM

#

#

#

#

#

#

#

#

drowsy falcon Sep 26, 2024, 11:45 AM

#

Apartment building on the street with beige clinker bricks, neighboring buildings with red clinker bricks

muted dove Sep 26, 2024, 11:50 AM

#

drowsy falcon Apartment building on the street with beige clinker bricks, neighboring building...

#

#

#

alpine summit Sep 26, 2024, 12:15 PM

#

#

muted dove Sep 26, 2024, 12:43 PM

#

#

#

#

#

alpine summit Sep 26, 2024, 1:15 PM

#

bitter hearth Sep 26, 2024, 1:25 PM

#

#

#

#

#

alpine summit Sep 26, 2024, 1:31 PM

#

#

alpine summit Sep 26, 2024, 2:01 PM

#

noble coyote Sep 26, 2024, 2:31 PM

#

#

noble coyote Sep 26, 2024, 2:55 PM

#

noble coyote Sep 26, 2024, 5:11 PM

#

A charming chalk drawing of a futuristic spacescape, featuring a campsite with tents, sleeping bags, and outdoor essentials, the sky is a glimpse of outer space with stars and comets. The landscape radiates warmth and comfort, bathed in a golden glow that entices viewers to explore its hidden secrets. Looming over the campsite is a sleek, modern space station, connecting to the lunar surface via a shimmering energy bridge that glows with life.

sacred jewel Sep 26, 2024, 5:58 PM

#

bitter hearth Sep 26, 2024, 5:59 PM

#

#