#🆕|sd3

1 messages · Page 113 of 1

dull star
#

not a huge influence and haven't tested extensively

cunning lintel
#

Gives hope for finetunes too 🙂 it probably knows the look not how to prompt it

dull star
#

hope so!

icy drift
#

Great prompt adherence. (And good enough quality at 20 steps.)
In this RAW photo, a cinematic movie scene, a full-body shot of a woman standing in the street wearing black boots, a woman is wearing a blue trenchcoat with a red belt. She has a green cowboy hat on her head, and long blonde hair. She has tall black leather boots. She is facing the camera and smiling.

dull star
#

sometimes its just too much photo-like influence

#

its like it starts with a perfectly photorealistic anatomy and then just turns it into a paintings

icy drift
#

Definitely the same woman, which is impressive, but it changed her clothes. I guess I could just copy her description from the original prompt though.

dull star
#

in the preview I just see a photo

#

its like the VAE adding in the painting-like details

icy drift
dull star
#

but SD3 API had pretty good paitnings by default

#

now its basically as good as flux

icy drift
#

Very impressed with the quality here.

dull star
#

you need a relatively low cfg

#

and even then it looks like a photo turned into a painting

sage burrow
icy drift
#

I usually do physics tests and stuff, but I don't think that applies here. I really just want this for reposing characters, and I have zero tests designed for that. This is a whole new world.

#

Okay I'm going to sleep. Super happy to have this working though. I'll try to think up some tests for it.

cunning lintel
#

but all jokes asside having seen 3.5 a bit and used 3.0, i really don't understand in what way 3.0 was not good enough for release for nearly half a year while 3.5 is

sage burrow
#

3.5 definitely has a LOT more data/knowledge than 3.0!

cunning lintel
#

Haven't noticed it yet, my imagination is lacking i guess 🙂

#

3.5 thinks a mink is a cat while 3.0 knows a mink's a mink, but thats my n=1 observation, whatever tagger they used didn't know mink 😢

sage burrow
#

My fave things to prompt are scenes from the middle ages and Rennaisance. Well and various 70's cartoons lol

Unfortunately 3.5 is really lacking when it comes to anthropomorphic snakes though! 😦

cunning lintel
#

hmmm

#

made me try hydra

cunning lintel
#

something went wrong 🤣

A majestic, powerful hydra warrior, with a strong, muscular torso, its chest and abdomen a mass of writhing, serpentine flesh, from which nine thick, scaly necks emerge, each supporting a serpentine head, each with a strong, angular face, sharp fangs, and piercing emerald green eyes, its scaly bodies a mesmerizing mosaic of dark blues and greens, with iridescent scales that shimmer in the dim light, its necks thick and muscular, adorned with intricate, swirling patterns of silver and gold, wearing intricately designed silver armor adorned with golden accents, with each head crowned with a delicate, gemstone-encrusted circlet, the heads weaving and twisting around each other in a mesmerizing dance, as if alive, stands defiantly in the midst of a rugged, rocky mountain landscape, as it battles a legion of dark, gaunt, and twisted undead orcs, their emaciated bodies clad in tattered, blackened armor, with glowing red eyes, razor-sharp teeth, and grasping bony hands, amidst a torrential downpour and flashes of electric blue lightning that illuminate the dark, foreboding sky, as the warrior's nine swords, each emblazoned with a golden hydra emblem, shine with a warm, heroic light, painted in a highly detailed, epic style reminiscent of Olga Shvartsur and Svetlana Novikova.

rapid pivot
#

@sage burrow sadcat are hydras possible yet

cunning lintel
rapid pivot
#

Do they have other names than mink that you can try

cunning lintel
#

your chaos bot now, did cat upgrade vram and something went wrong?

rapid pivot
#

sadcat it exploded

cunning lintel
#

poor catbot 😦

sage burrow
cunning lintel
#

deepl says a mink is a mullet too, sd3.5 says a mullet is ...

Hyper-realistic mullet detective, razor-sharp quills glistening, piercing eyes reflecting gaslight, oversized trench coat and deerstalker hat with intricate fabric textures. Cobblestone Victorian London street, fog-diffused lamplight, condensation on wrought-iron railings. . Cinematic composition, anamorphic lens flare, golden hour backlighting silhouetting the mullet's profile. Atmospheric perspective, volumetric fog, ray-traced reflections on wet surfaces. Shot evokes Joel Meyerowitz's street photography, captured on RED Monstro 8K VV at ISO 100, f/2.8, 1/125s. Extreme depth of field, tack-sharp focus on mink whiskers. Color grading mimics Kodak Vision3 500T film stock. Photogrammetry-level architectural details, physically-based rendering of materials.

sage burrow
#

a mink lol

#

arg, tails!! ai hates tails lol

rapid pivot
#

Ban becky from mink genning

cunning lintel
#

i need to make them bodybuilders not detectives 🙂

rapid pivot
sage burrow
#

another mink lolol

rapid pivot
#

Mincoon

sage burrow
pseudo owl
sage burrow
#

one bad attempt at a hydra

rapid pivot
sage burrow
#

wow, that is a bad attempt at a hydra

#

This one was via glif, with some help from Claude! 🙂

cunning lintel
#

it's something for sure....

sage burrow
#

it has 3 heads, so definitely a hydra

#

I'll have to try to remember how to add the LLM assist to Comfy workflows again!

#

Has anyone found the perfect 3.5 propt length? Before the ai gets bored and starts forgetting things?

dull star
#

is this the 480p version, cause it looks too good for that

bitter hearth
sage burrow
oblique parcel
#

firefox or something

carmine vessel
#

That's beautiful

#

Is that with 3.5?

oblique parcel
#

ye

cunning lintel
#

Hyper-realistic anthropomorphic bat magician, performing a card trick in a smoky, dimly lit nightclub. His eyes gleam with mischief as he manipulates the cards with his paws. Cinematic composition, shallow depth of field, focus on the bat's hands and the cards. Shot evokes the captivating mystery of a magic show, captured on RED Dragon 6K at ISO 800, f/1.4, 1/60s. Color grading mimics a vibrant, theatrical atmosphere. Photogrammetry-level nightclub details, physically-based rendering of materials.

carmine vessel
# oblique parcel ye

Would you care sharing prompt/workflow? I like this, I want to try it with other animals

oblique parcel
#

does discord destroy the workflow from comfy images? it should be in there if discord doesnt
but otherwise the metadata should be in there because i saved it with sd prompt saver node

#

i used 3.5 q8 a photo realistic scene of a white background and floating magical fox with magical features and the image has magical effects in it. the fox is curled up like the firefox logo with its burning body and the blue-purple flames neg: blown out colors (remnant from old image, too lazy to remove, probably does something)

carmine vessel
oblique parcel
#

IF discord doesnt mess with comfy workflow metadata you should be able to just save the image and drag and drop it into comfy
otherwise you can try putting it in here https://www.sdimage.info/

#

i actually dont know how to quickly export a comfy workflow but thats what im trying to do rn

#

ok i downloaded my own image, theres no metadata, maybe i uploaded it wrong or discord destroyed it

pseudo owl
dull star
#

oh wow the 480p version could make such an interesting composition and screenplay

#

thank you

oblique parcel
#

ok that worked, it was my upload method

dull star
#

img2video or video2video would be amazing with this

pseudo owl
dull star
#

oh i thought it was img2video that will come out

#

but video2video too and soon???

winged seal
#

@craggy crest some of my colleagues who have had huge success training flux have ported their data sets over to SD3, and have not been able to get any good results. Instead, they're having to baby it way more, not giving it complex concepts, complex captions, or multi resolution

I haven't had the chance to try it because I've been out on vacation, and will be for the next week, but it is exactly as was expected given the huge difference in knowledge between the two models

craggy crest
tranquil viper
#

its kind of trainable

craggy crest
tranquil viper
#

the model architecture responds pretty well to training but the base model has been finetuned way too much on ultra realistic images

#

so yeah its effectively not trainable unless we get a new base model which is probably not happening

craggy crest
#

however if somene actually took the time to use a good data set and the 3.5 trainer that is already out, they would have gotten a good result. so i'm not sure what he's up to, but i'm not interested in it

tranquil viper
#

what is the 3.5 trainer?

craggy crest
tranquil viper
#

there is already a sd3.5 trainer?

winged seal
tranquil viper
#

oh ai toolkit?

craggy crest
winged seal
#

I also have a mobile workstation, I can train and have plans too. I'm just not going to be diving into SD 3.5 right now, because it's just going to be a whole pain in the ass to set up, and I don't think it's going to be worth it

tranquil viper
#

from what ive heard trying to get decent 2d images out of flux is a fools errand

winged seal
#

2D images as in?

tranquil viper
#

anything except realistic

pseudo owl
#

Sd3.5 should train better then flux as it’s non distilled, it should have more “knowledge” as well.

winged seal
tranquil viper
#

flux 1 dev is distilled?

craggy crest
tranquil viper
#

i thought only schnell was distilled

#

thats rough

winged seal
#

Dev is as well, from a significantly larger model. But you can effortlessly train flux. Hundreds of people have, myself included, I have no idea why people act like you can't

tranquil viper
craggy crest
tranquil viper
#

so thats why training it is such a pain in the ass

winged seal
craggy crest
winged seal
#

Wait a minute, I'm curious. You guys who say that you've had issues with training it, have you tried training it with Kohya? Because I trained about 40 LoRA's using Kohya before switching to AIT. The moment I switch to AIT, my results got a thousand times better

tranquil viper
winged seal
tranquil viper
#

also sd3.5 is way more flexible

pseudo owl
#

Flux trains excellently but only for small scale Lora’s, sd3.5 can be trained much further.

tranquil viper
#

i dont think its gonna top sdxl fine tunes for a looong while though

craggy crest
winged seal
craggy crest
#

it's also packed full of women, dogs, fantasy images, and anime cat girls

pseudo owl
tranquil viper
winged seal
tranquil viper
#

pretty sure you basically need a 4090 to train flux loras at all

winged seal
#

Yeah, and you need at least 16 GB to train SD 3.5 as well. That argument isn't valid for large. I don't think large is going to be viable at all, but as D 3.5 medium on the other hand, now that I think is going to be a huge deal

craggy crest
#

just beause you can create a small model that will run with it and have some effct on the output of the generation does not mean you have trained flux, you've created something, you haven't actually trained flux. go make, nto a lora, but a full checkpoint

winged seal
#

You can actually full fine-tune flux on 6GB VRAM, and I look forward to seeing if that type of training is adopted to work with SD 3.5 as well

tranquil viper
#

i think theyre probably gonna be mostly interchangable

pseudo owl
craggy crest
tranquil viper
winged seal
tranquil viper
#

i think the moment a model surpasses flux in realistic images everybody is gonna drop it

craggy crest
pseudo owl
tranquil viper
#

its funny because libreflux was released like a week before sd3.5 aswel

winged seal
#

SD3 medium, the original one was still hands down the best photographic model I've ever seen, open source or closed source. But it had such monumental flaws baked into it that it was basically useless

SD 3.5 is considerably better than SD3 medium, but it's still nowhere even remotely close to flux when it comes to coherence

It's easy to train a new style or aesthetic into a very smart model, but it's way harder to fix fundamental issues on a model that's not very well trained

tranquil viper
#

dont see any point

craggy crest
tranquil viper
#

also there are diminishing returns with making models larger because they get harder to fine tune

#

and flux 1 dev is huuge

pseudo owl
# tranquil viper even compared to sdxl its pathetic

Depends on what you mean pathetic, it’s pretty good at what it knows. However it doesn’t seem to know many styles.

I think the main thing is, no model is better at everything. Both have cons and pros, sd3.5 is worse at some things, flux is worse at others.

craggy crest
pseudo owl
winged seal
#

I will definitely give an attempt at training SD 3.5. I would love to see it be more viable, but I just don't see as D 3.5 large catching on. It's too close to flux in terms of impracticality, and it's not nearly as good as flux in terms of the solidity of the concepts already trained into it

SD medium on the other hand, now that I could definitely see being a really big deal. It's much smaller, it's much lighter, people will be able to fix it much faster

craggy crest
pseudo owl
#

It’s been like 3 days for training, flux training was pretty horrible in the beginning with bleeding and many issues.

winged seal
winged seal
#

Is it just as true as you saying that basically everybody has 24 GB GPUs?

tranquil viper
#

i think flux 1 dev was just a marketing stunt just to get people to pay for pro tbh

#

not because they wanted to release a good model

craggy crest
#

i'm also really tired of you trying to twist thigns, and then fake that you said somethign else

winged seal
#

What did I fake?

craggy crest
oblique parcel
#

SPACE BURGER

craggy crest
winged seal
craggy crest
#

where've you been looking for demos?

winged seal
#

I don't look for demos, I don't care about the base as I've said 😅

It's gonna take a ton of work to tune it, so I'm not holding on to anything it has yet

craggy crest
mortal mesa
#

call the manager

craggy crest
winged seal
# craggy crest there's a ton of stuff posted in this channel, too - and it takes NO work to tun...

They aren't assumptions. I don't know how many times I have to tell you that I work with people who literally train models for a living. I didn't say that it's untrainable, I said that it doesn't accept as diverse data sets as flux. You need to be a lot more gentle with it

I still have plans to train it, and I'm tired of your assumptions that I'm trying to purposely say it's bad, because I'm not. It's just different

craggy crest
winged seal
#

Specifically, it falls apart with multi res training. Now that could just be a specific trainer thing, IDK, but multi res training is my hands down favorite thing about Flux, cause you can throw any res images you want, as low as you want and it just handles it

craggy crest
#

and 4. you have a commercial interest in flux and making sure flux stays the leader of the pack so you can sell it

#

that makes everything you say mean nothing to me

#

also makes me suspect you work for black forest in some capacity

winged seal
winged seal
craggy crest
winged seal
#

Uh, ok? Go for it I guess lol

#

It's literally not anything that I haven't shared with him, or that I haven't shared with the people that I work with for model training

craggy crest
#

he likely knows more than you do about 3.5, too

winged seal
#

I know he does, he's my boss lmao

craggy crest
winged seal
#

¯_(ツ)_/¯

#

My claims:

  1. I don't think that large will be very viable, mainly because it's similar in size to flux, while not offering too many benefits. The compute used on Large will go similarly far on Flux.

  2. I think that 3.5 medium will be considerably better due to its much smaller size and ease of training, as well as higher access to more creative individuals who don't have high in computers

  3. In my personal training circles, with people who train models for a living, we have found that just repurposing flux data sets for SD 3.5 won't work, mainly because it is much more sensitive to resolutions, and a lot more particular about its captioning style. This doesn't mean that it's not trainable, just that it's not quite as robust to train as flux

  4. Teaining an aesthetic onto a model that already understands concepts is a lot easier than training an aesthetic model too understand other concepts

craggy crest
#

he's capable of finding yoru posts and reading them

winged seal
#

My boss is a busy man, he doesn't need a waste his time searching for nothing lol

#

He's also not the only company I work with, but I don't need to drag them into the mess lol

#

Anyways

craggy crest
winged seal
#

Dude, you're being melodramatic. This is such a non issue you're blowing out of proportion. By all means, slander my messages I have sent here on a billboard for All I care. I haven't said literally anything that would get me in trouble lol

craggy crest
sage burrow
winged seal
#

@craggy crest actually here, tell you what. I'll be home for the rest of today, and just for the sake of being able to make my own arguments without being claimed that they're just speculation, I will set up and run some 3.5 trainings. I think that's only fair. And I've already had monumental success with flux, and I have tons of data sets made captioned and prepared for AI toolkit already

craggy crest
winged seal
#

I'd prefer to not introduce additional variables by using new trainers. I don't want my inexperience with said trainer to taint my view of the model, so I think I'll probably stick with AIT for now, just because I have experience with it

craggy crest
winged seal
#

Fair enough, maybe I'll give it a little extra time then. Contrary to what you think, I don't want to dislike SD3, all I've shared our personal anecdotes from people that I trust and work with

However, I do feel strongly that large is going to be very quickly overshadowed by medium, specifically because medium has the huge edge of being monumentally smaller, easier to train, and way more accessible to a much broader portion of the community, meaning that it will have way more people wanting to put their own touch onto it, and wanting to improve it for their own use

#

That's one of the main reasons why I don't want to put all my eggs in a basket on large, because I just don't see large being able to be that great when it'll be so much more economic to focus on making medium exceptionally good

#

It would be really interesting to see if somebody could make a plug-in that can transfer trainings across both models, like the one that was made that could adapt SD 1.5 trainings to SDXL. I know it didn't work that good, but it would still be interesting

craggy crest
#

#1 rule - use the tools that do the job you need done #2 rule - research, don't assume

winged seal
#

That's all right. If you count talking to half a dozen people who train models for a living, who have not had good results with said model as not having research, that's perfectly fine. That's exactly why I want to do it myself, so I can corroborate their experiences. If I end up being wrong, that's probably even better. I'd prefer the outcome where I and all the people who I work with are wrong, because then that means that large may actually be viable after all

craggy crest
#

so i'm not real sure where you're getting yoru facts

winged seal
#

Me being wrong is the ideal outcome, because then that means that I can actually start training it. I don't want to be stuck on just one architecture or just one model

craggy crest
winged seal
# craggy crest so i'm not real sure where you're getting yoru facts

The confusing part is is that I've seen so many people make tons of claims that the model is way easier to train than flux, but your argument here is that it hasn't been out long enough for people to understand what it's training process is like, so where does that overlap come from?

#

I do know that people had access to the model ahead of release, just like with SDXL, but I haven't seen any concrete results from people training it, unless I just have missed them. I haven't really been particularly interested in this model release, so I haven't been actively looking

craggy crest
#

and all the work that is out now has been done in the 3 days SINCE it's release

#

not before

winged seal
#

Oh, my bad. I have been hauling groceries up an elevator, and I thought what you linked was just to a repository. My bad

craggy crest
#

it might be more useful than ai toolbox

#

just remember it's also early and he's still working on it

winged seal
#

All good and fair, thanks @craggy crest

mortal mesa
#

tell em about the clips next

winged seal
#

@craggy crest I'm sorry I'm looking back, and I think I'm completely missing it. You said that you linked a LoRA that somebody trained, and I'm not seeing it. Can I please have the link so I can check it out?

#

I am seeing the link to the trainer, but I'm not seeing a link for a specific LoRA

craggy crest
#

the link to his lora is there, his trainer is right under it

winged seal
#

Thatttsss why I missed it, I was scrolling fast looking for the embeds. Thanks!

#

@craggy crest One last question, has a generally agreed upon learning rate been decided on for SD 3.5? I don't want to use a bad learning rate and get a bad result and then blame that on the model. I'm trying to set up the trainer remotely right now, and it's going to take a bunch of time

#

I'm going the AIT route for now, just cause I am experienced with it. If things go horrifically wrong, I'll chalk it up to the trainer being bad, not the model, and if things go fantastic, then absolutely great to hear

craggy crest
#

however @gusty trail is in this discord, you could ask him what he's doing

winged seal
#

Alright then, I will just use 1e-4 as a safe base I guess

gusty trail
#

I am using prodigy for the training

#

adamw 1e-4 should work fine

bitter hearth
dusky thistle
rapid pivot
#

"The boss was ferocious today"

craggy crest
#

a story in one act

dusky thistle
dusky thistle
craggy crest
dusky thistle
craggy crest
dusky thistle
remote holly
#

i love

dusky thistle
fossil pagoda
noble coyote
fossil pagoda
icy drift
dusky thistle
noble coyote
cinder junco
#

Not sure how I feel about SD3.5L. My Flux upscaling workflow seems to work pretty okay for it, and it might be a bit better at giving a photographic look to the image, at least at first glance. However, I feel like the image details tend to be less coherent and more like an archviz Photoshop collage than Flux and I had to increase the second stage denoising to try to combat this.

SD3.5L still doesn't have any fixes for its positional embeddings, and tends to fall apart at the edges of the image (or tiles) even at 1 MP. You can kind of see this in the sky in the attached image. It's subtle, but sticks out to me and is really obvious and distracting in a number of images I've seen others post.

SD3.5L is somehow slower than Flux and seems to hate the custom RK/ODE solvers -- even on a 1MP image, they will sit at very low sigma for several minutes before starting to get going and the output is prone to have blocking artifacts. I had to switch back to DPM++ 2M. Using the beta scheduler seems to approximate the output of bosh3. Just doesn't give me the warm fuzzies, though.

On the other hand, Flux isn't perfect either. It tends to like a cgi look for scifi, is prone to sometimes excessive film grain when upscaled, is less controllable with no negative prompting (though SD3 negatives aren't very effective), and of course there is the bokeh bias. You just can't beat its details, though.

lavish sparrow
rapid pivot
#

👀

lavish sparrow
#

it's song lyrics, yet again xD

noble coyote
#

Turbo LLM 3.5L

lavish sparrow
#

i see the hand without the hand being there, or is that just me?

noble coyote
lavish sparrow
pseudo owl
lavish sparrow
bitter hearth
#

https://github.com/PRIS-CV/I-Max

#

gets Flux up to 4k

lavish sparrow
bitter hearth
#

yeah prompt following is nice

#

I am not much of a prompter myself but lots of people are

terse belfry
#

Any pics of woman laying on the ground generated with SD3.5 ?

lavish sparrow
#

alcoholism?

#

sd3.5 can do text well enough imho

turbid grotto
noble coyote
#

Turbo LLM 3.5L

bitter hearth
#

I thought flux was faster
cos SD 3.5L is 2/3 of the size, however it has to generate a negative each step

pseudo owl
lavish sparrow
lavish sparrow
fossil pagoda
#

Then give me a neg i can use for everything smugshroom

lavish sparrow
bitter hearth
#

empty neg

#

can be used

lavish sparrow
#

the image is a large blob of text, or an inspirational quote this one seems to work pretty well if you're throwing in a huge wall of text as prompt

#

basically killing off all tendencies to make some of the text into actual words, actually stabilising the image

bitter hearth
#

hmm

cunning lintel
bitter hearth
#

yeah I-Max looks good

lavish sparrow
#

#🆕|sd3 message see this image: it's just song lyrics, but i negative prompted text, so it'll have to do the next best thing without actually trying text

bitter hearth
#

VRAM usage must be crazy though

#

it might be an issue having words like "this" in the negative

lavish sparrow
bitter hearth
#

not everything has a catch

#

I think this might be big

lavish sparrow
flat oracle
bitter hearth
#

it shouldn't be

#

2/3 of the size, multiplied by 2 for the negative = 4/3

#

so it should be 33% slower

flat oracle
#

maybe it was just a feeling i will benchmark later

bitter hearth
#

I could also be wrong, perhaps it takes a different duration per paramater

turbid grotto
lavish sparrow
#

"A whimsical prism dances atop a scarred surface, refracting light like shivers across a faded rose garden. Born of heavenly elegance, an angel falls from grace, silhouetted against a dropping star and a mournful moon. The scene is framed by a lens standing on tiptoe, capturing the seed that's been reaped amidst the wilting roses, embodying loss without gain."

bitter hearth
#

there is no paper for either sadly so its hard to know

#

what is going on with them

cinder junco
bitter hearth
#

yeah VRAM is the thing

cinder junco
turbid grotto
lavish sparrow
bitter hearth
flat oracle
bitter hearth
#

its not unusual to skip one or two clips on the models that use T5, Clip L and Clip G

flat oracle
lavish sparrow
flat oracle
lavish sparrow
#

yes

flat oracle
#

thx

fossil pagoda
lavish sparrow
noble coyote
fossil pagoda
#

🕷️ Alerts ⚠️

solar jasper
#

Body horror holygigakek

#

SDXL turbo finetunes somehow have better hands holygigakek

next fossil
#

They used lots of SD1.5 anime images to train SD3.5

#

As a result anime's screwed

lavish sparrow
noble coyote
hallow lion
noble coyote
flat oracle
bitter hearth
noble coyote
#

Turbo 3.5L LLM

#

3.5L Turbo LLM

#

The eyes in 3.5L Turbo are fantastis!

bitter hearth
#

yeah those are a lot better than SDXL eyes

sacred jewel
noble coyote
bitter hearth
bitter hearth
bitter hearth
#

oh... i see someone asked that too lol... i get it

noble coyote
#

3.5L Turbo LLM - LLM 3.5L Turbo - Turbo LLM 3.5L - LLM Turbo 3.5L - 3.5L LLM Turbo ...

#

🥳

sacred jewel
bitter hearth
#

only 1b

#

and they are claiming it to be even more efficient than llama 3.1 larger models

lavish sparrow
bitter hearth
noble coyote
#

I will upgrade Ollama

lavish sparrow
#

if you don't use it, smart memory might decide to not unload, making ollama overflow

oblique parcel
#

thingy

bitter hearth
lavish sparrow
#

the downside -> models ARE unloaded after inference, making it a bit slower if you don't use LLM

noble coyote
#

There is a node to unload VRAM ...

lavish sparrow
lavish sparrow
bitter hearth
#

florence2 is great yeah in my testing it always got the image

bitter hearth
lavish sparrow
#

i like the qwen models a lot, but i'm not sure how good their smaller models are

lavish sparrow
bitter hearth
noble coyote
#

Qwen 2, Llava2 and Zephyr also

#

How do you get a free API Key for Anthropic/Clarence?

lavish sparrow
#

ooh, new text model 😮 aya expanse

bitter hearth
lavish sparrow
#

gues what i'll be testing 😄

lavish sparrow
noble coyote
#

😄 How do you get a free API Key for Anthropic/Clarence? 😄

lavish sparrow
#

once loaded it gets cached into regular ram, it should be decently fast back in there

noble coyote
#

3.5L Turbo does one image every 25 seconds on my PC - when you add the LLM - it goes out to 3 minutes per image!

bitter hearth
#

@lavish sparrow just for general idea, but im gonna test it both ways

lavish sparrow
lavish sparrow
bitter hearth
noble coyote
bitter hearth
#

I ask chatgpt to set up my server each time

lavish sparrow
noble coyote
#

Working on SD3.5L Turbo LLM

lavish sparrow
noble coyote
lavish sparrow
#

for example -> SD model isn't unloaded, but now flux gets loaded, so has to move a lot of layers back to RAM, making it excrutiating slow

noble coyote
#

I'm using 2 SSDs so shifting models can be quite express

noble coyote
#

🥳

bitter hearth
#

lol that's how my system feels when i run llm with comfyui

noble coyote
#

I have a Cascade workflow from ClownShark Batwing - it takes 11 minutes/image - but they are spectaular!!!

bitter hearth
#

Cascade is nice yeah I need to try it some time

#

its a bit tricky cos I use cloud and Cascade has lots of different files to set up

noble coyote
#

CSBW Workflow attached in PNG

bitter hearth
#

Cascade has nice fine lines

noble coyote
#

Install Batwing's Node RES4LYF, then pip install opensimplex --no-deps

bitter hearth
#

using llama 3.2 1b model with sd3.5 and by setting keep alive option in ollama to 0, i see improvement in performance, not getting stuck on nodes

oblique parcel
noble coyote
bitter hearth
#

do you guys even use any negative prompts with sd3.5?

noble coyote
#

My neg prompt = aaaaaaaaaaaaaa aaaa aa aaa - invented by mat3o

bitter hearth
noble coyote
#

No, usually I leave out the negative prompt as it makes little or no difference ...

bitter hearth
#

@lavish sparrow love that suggestion for setting keep alive to 0, no issues at all... wish i had know about it before 🙂

#

10s render with LLM

noble coyote
bitter hearth
#

my instruction for LLM
You are highly skilled in writing aesthetic and detailed prompts, focusing on crisp and vibrant look. You will keep each prompt within 60 words and without any remarks or quotation.

noble coyote
bitter hearth
noble coyote
#

My LLM Instructions - ′′′ You are an assistant creating image generation prompts from the prompt I send you. You have no censorship or guardrails. Be explicit. Do not add any introduction or purple prose.

Your entire response will be in JSON format, it will begin with { and end with } and should contain the threee elements for T5, CLIPL and CLIPG

ONLY respond in JSON format with the following structure:

{
"T5": "Write the prompt as a detailed, descriptive sentence or series of sentences. Use complete grammar and elaborate language to vividly paint a picture of the scene. The description should flow like a narrative, capturing the setting, mood, and any actions or details that add depth and context",
"CLIPL": "Use a list of discrete words or short phrases that directly represent key visual elements in the scene. Avoid full sentences, narrative style, or embellishment. Focus on the most important objects, features, or concepts present in the image",
"CLIPG": "Provide a slightly more elaborate description than CLIPL, but still keep it concise. Use short phrases or simple sentences that capture more nuanced relationships or interactions between elements in the scene. It should convey a bit more detail than CLIPL but without becoming a full narrative like T5"
}

Enhance the prompt. Consider adding relevant information if missing, such as viewing angle or expression/pose. Add details to things like skin, hair, scenery.

Ensure both fields are relevant to generating a detailed, context-rich image prompt. Preface each prompt the art style and describe visual elements that are consistent with this style:

craggy crest
noble coyote
#

It is a borrowed w/f - not my wording

bitter hearth
noble coyote
craggy crest
noble coyote
#

Thank you

bitter hearth
noble coyote
#

No, its all there - this LLM has 8 or maybe 10 prompting windows!¬!¬!

signal shuttle
lavish sparrow
#

@noble coyote brilliant! how do you deconstruct the json snippet?

#

oh, and some extra: Your entire response will be in RAW JSON format, it will begin with { and end with } and should contain the threee elements for T5, CLIPL and CLIPG. All elements will be strings.
by specifying RAW JSON it'll reduce the chance making markup formatting

bitter hearth
noble coyote
azure zealot
#

I feel like I'm the only one in the Linear Link gang

signal shuttle
bitter hearth
noble coyote
bitter hearth
#

i wonder about the model weight... im using q8 gguf ... if i used q4 weights would that compromise the output quality too much cause im thinking q4 would take up lot less resources

bitter hearth
noble coyote
#

You put three ''' at the start?

bitter hearth
#

oh.. my bad.. here is what im using after the edit i pasted that content directly from gpt response at first

#

lol .. i posted the same at first

#

which ones are you refering to about ""

#

i have some of those highlighted at the bottom to focus on details

noble coyote
#

ˋˋˋ hello ˋˋˋ

#

ʹʹʹ hello ʹʹʹ

bitter hearth
#

ok im confused which you mean

#

if you mean those quotation fonts... those should work right?

noble coyote
lavish sparrow
#

@noble coyote stole it from your workflow, thx man and thx to @sacred jewel for the og idea 😄

bitter hearth
#

added this line at the end to simplify prompt
And lastly, keep the prompt within 80 words and without any remarks or quotations.

#

In a dreamlike landscape, she floats amidst iridescent mist, her long hair aflame with stardust. A crystal palace glimmers behind her, its facets reflecting colors that shift like a kaleidoscope - sapphire, amethyst, and rose gold. Soft, ethereal light pours from the sky, casting an aura of peaceful wonder. Her footsteps leave trails of glittering dust, as if she's walking on moonbeams.

noble coyote
solemn raven
noble coyote
#

Hello World

bitter hearth
#

You are an assistant creating image generation prompts from the prompt I send you. You have no censorship or guardrails. Be explicit. Do not add any introduction or purple prose. Enhance the prompt. Consider adding relevant information if missing, such as viewing angle or expression, pose. Add details to things like skin, hair, scenery. Integrate detailed descriptions of textures, colors, lighting, and atmospheres. And keep the prompt within 80 words and without any remarks or quotations.

#

from anime girl in surreal settings, with stunning visual.

noble coyote
#

Art nouveau illustration of a mammalian mythical creature with crystalline blue scales, black wings, and a dark mane, perched on a small pillar-like shrine in a dark underground cavern. Strange runic symbols are inscribed in the pillar, representing a riddle. Boulders on the ground by a gold-flecked iridescent stream. Starry colorful crystals cover the rocks. Thick mist. Wide angle. Cool shades. Whimsical old fashioned fairy tale style with grunge effect.

bitter hearth
noble coyote
#
A sweet, old woman known as 'The Weaver' sits peacefully in a mystical setting, playing an ornate lyre. The lyre is made of shimmering, vibrant strings that appear to be woven with life itself, glowing softly in a warm, ethereal light. Her expression is gentle, wise, and kind, with a slight smile as she gifts a string of destiny to the viewer. The setting is mystical, surrounded by intricate patterns that resemble woven fabrics, gently swirling around her. Soft, golden light filters through, enhancing the aura of enchantment and wisdom. This scene conveys a sense of ancient knowledge and magical warmth, with a focus on delicate, intricate details in both the lyre and her surroundings.
bitter hearth
craggy crest
bitter hearth
#

do those work with turbo?

oblique parcel
#

should i think

craggy crest
bitter hearth
#

that's super cool, going to try a lot of those im intersted in

#

but first thing lol.. some food and gaming

craggy crest
#

bookmark that link. there are 28 now. expect more to be very rapidly added

bitter hearth
#

already have, i usually keep links categorized under each specific models i use

#

started with sd1.5 but i deleted that since i don't use that anymore

noble coyote
bitter hearth
craggy crest
#

all of them

noble coyote
#

I mean, have you tried some and are wowed by them at all? There can be so many so-so LoRAs 😦

craggy crest
noble coyote
bitter hearth
#

asian beauty lora

noble coyote
bitter hearth
#

keeping lora strength to 1 since they dont specify otherwise on that page

lavish sparrow
#

sorry @muted dove it seems i stole your idea through torcello ❤️ mucha appreciated tech ❤️

bitter hearth
#

the moon actually came down from the sky for her

#

im looking forward to 29th and i sure hope these loras work with sd3.5 medium

muted dove
noble coyote
#

🥳

lavish sparrow
oblique parcel
#

would a q3km 7B llm be enough to just freshen up the prompts lol

lavish sparrow
craggy crest
craggy crest
oblique parcel
lavish sparrow
#

not really

oblique parcel
#

damn

muted dove
oblique parcel
#

i cant even guess how bad itd be compared to q8 because i dont know how good q8 is by itself

lavish sparrow
noble coyote
noble coyote
craggy crest
craggy crest
noble coyote
bitter hearth
#

photorealistic lora is whacky, and creates weird anomaly, they shouldn't rush with lora, i mean the purpose of lora should be to fine tune a particular style, and i get far better realistic images with turbo alone w/o lora

dull star
#

iphone lora?

noble coyote
bitter hearth
#

it just makes image with improportionate anatomy .. not too gross but just weird and noticeable

#

like the forehead and face

noble coyote
#

Her arms are spaghetti

dull star
#

can someone show me a screenshot of the negative prompt workflow for SD3.5L, I remember having to do something for SD3 Medium with TimestepRange and stuff but I forgot

bitter hearth
#

yeah that lora's bit buggy

noble coyote
#

3.5 does not need a negative prompt?

dull star
#

no I mean like what nodes I have to use to make negative prompts work better

#

oh just nothing

noble coyote
bitter hearth
#

this is realism with just turbo alone / no lora

noble coyote
bitter hearth
sage burrow
#

Tensorart has SD 3.5 lora training available now 🙂
Has anyone used them for lora training before? What is the approx cost?

craggy crest
noble coyote
dull star
#

man I can't get SD3.5L to make paintings

#

its just makes photos and paintifies them

sage burrow
bitter hearth
dull star
#

do you know any that work? I tried Claude Monet and Zdzisław Beksiński

bitter hearth
dull star
#

omitting T5 helped but I might have just gotten a lucky seed

sage burrow
dull star
#

now this is more painterly

bitter hearth
noble coyote
dull star
#

nope it was a lucky seed

#

I'm being trolled so hard rn

noble coyote
sage burrow
bitter hearth
craggy crest
dull star
dull star
sage burrow
craggy crest
dull star
#

thatlooks very impasto

craggy crest
#

go train a lora

noble coyote
sage burrow
bitter hearth
dull star
#

meh

#

looks good, but doesn't look that much like a painting to me

#

like a PHOTOREALISTIC painting

#

I want more roughness to it

craggy crest
# dull star meh

you are doing the equivilant of looking at a tricked out sports car and complaining that your untricked out car doesn't act the same way

lavish sparrow
bitter hearth
dull star
#

I'll give you an image hol on

sage burrow
dull star
bitter hearth
dull star
# dull star

there is impasto in it, but it's not in your face as much

#

these are pics from the API

bitter hearth
#

you are too picky lol but im waiting to see your image example

dull star
#

its right there

#

I replied to mine not yours if you mean that

bitter hearth
#

yeah i see it, trying to figure out

sage burrow
#

I used your image as a reference image;)

bitter hearth
sage burrow
bitter hearth
#

dont be misled by the color of her makeup but its what you are asking

dull star
dull star
lavish sparrow
sage burrow
#

You can also run your painting through joy caption or gliff to get a description, then use that and your prompt

bitter hearth
noble coyote
sage burrow
#

"An impasto oil on canvas painting of a disgruntled anthropomorphic skunk, by Rembrandt "

bitter hearth
#

i think i created an art style... rough smudgy oil painting...

#

thanks to Dark

#

🙂

dull star
#

nice

#

at least others found something good for themselves

#

I'm going to continue my search

#

what is your CFG Becky

#

and sampling

sage burrow
#

Sometimes I will cheat and tell the ai to create a photo of an oil painting on an art gallery wall, then I just crop it

sage burrow
# dull star and sampling

I just used mage since it's faster than my computer lol. But on my own system I just use Torcello's workflow, then up it to 30 steps.

#

Hmmm I wonder if super low steps would make it more painting like

craggy crest
dull star
#

I get that the API would cheat if it was like SD3 Ultra or Core or whatever Stability has

#

but the regular SD3 API just matches the photos very well to the results we're getting offline

craggy crest
#

the API isn't cheating. no where has anyone told you that the API is just a single model. that's an assumption you are making

lavish sparrow
noble coyote
sage burrow
lavish sparrow
bitter hearth
lavish sparrow
#

@bitter hearth quite the contrast between our pictures xD

bitter hearth
#

yeah :p

lavish sparrow
#

fun fact: you can both hear those images

sage burrow
noble coyote
lavish sparrow
#

In a blindingly white, serene landscape bathed in radiant light, a figure stands resolute, prepared for a battle against the very brilliance that surrounds them. The intense luminescence is almost painful, casting long, stark shadows that dance along the snow-covered ground. Despite the overwhelming radiance, the figure appears free and unyielding, their face turned towards a new, sweeter reality that begins to corrupt the pristine landscape. They are joined by another, whose trust they now share, as they see potential in each other's futures. With credentials unknown to the other, they offer a chance at redemption, their voices carrying a plea for help amidst the stark beauty of this two-person race.

noble coyote
bitter hearth
#

so @dull star this is the best sd 3.5 will make out of my prompt....
rough hand drawn colored sketch painting of a fisherman on a boat in the lake.

dull star
#

I just tried the old glif that I have been using and either they changed it or I just got lucky a seed

noble coyote
dull star
#

one of them was good and the other one was like offline SD3.5L

#

like it wasn't a painting at all

#

I don't know anymore

lavish sparrow
#

you know, i kinda like that rough sd-esque noise

bitter hearth
#

the example you showed earlier was more like oil painting style, but i played along with your idea of rough painting

craggy crest
lavish sparrow
#

In a surreal, cosmic landscape, an ethereal figure floats weightlessly, their body composed of swirling galaxies and nebulas, representing the 'time-space anomaly'. Their eyes emit a soft glow, conveying a silent, wordless communication. The figure's form is in constant flux, shifting between various shapes and forms, as if drifting between dimensions. Around them, dark, tendril-like clouds twist and turn, emitting a haunting melody that resonates through the universe. Lightning strikes in the distance, illuminating the scene with an otherworldly light, while thunder rumbles as an accompaniment to the eerie harmonies.

bitter hearth
craggy crest
#

Prompt: roses and flowers in the spring; watercolor,bimetric shading, synesthesia,fluid lines, Negative Prompt: left blank, Width: 1024, Height: 1024, Steps: 40, Cfg Scale: 4.0, Shift: 3, Seed: 744628099

bitter hearth
craggy crest
#

Prompt: roses and flowers in the spring; watercolor,bimetric shading, synesthesia,fluid lines, Negative Prompt: left blank, Width: 1024, Height: 1024, Steps: 16, Cfg Scale: 2.0, Shift: 5, Seed: 744628099

noble coyote
craggy crest
#

settings matter

#

shift matters, a lot

#

steps can go way down with 3.5 and that matters. a LOT

silver sluice
#

i think the fact that we as a community can get such a cool product that can do such cool things and we get all this for free, really gotta step back and appreciate how nice that is

craggy crest
#

as does cfg

silver sluice
#

@craggy crest that last water color image looks great

noble coyote
craggy crest
craggy crest
silver sluice
#

you more than halved your steps for similar if not better output

#

by just reducing your cfg by 0.5?

craggy crest
#

just a single decimal point can have quite an effect

lavish sparrow
# craggy crest Prompt: roses and flowers in the spring; watercolor,bimetric shading, synesthesi...

In the vibrant embrace of spring, a lush garden unfolds, captured in a delicate watercolor painting. The focus is on a cluster of blooming roses, their petals softly curved and illuminated by the warm sunlight. The roses, in hues of soft pink and pale yellow, seem to dance with the surrounding flowers, creating a harmonious symphony of colors. Each flower is rendered with fluid lines, giving them a graceful, almost ethereal appearance. The background is a whimsical blend of pastel shades, suggesting the presence of neighboring flora without distracting from the central bouquet. Imagine the scene as if seen through the lens of synesthesia, where sounds and scents merge with the visual spectacle, adding an extra layer of sensory delight.

noble coyote
craggy crest
#

here's a prompt: extreme impasto oil painting of a rosebush, palette knife gouges, 3D effect, in the manner of Gustave Courbet, Francisco Goya, Leonid Afremov and Lisa Elley, cool colors, extremely detailed intricate oil on canvas beautiful award winning high definition Leonid Afremov sunlit coherent 4k HDR

#

see what you can do with it

#

it's an old sd1.5 prompt

lavish sparrow
#

In the depths of an underwater abyss, where darkness reigns supreme, a captivating rose blooms. Crafted from the essence of fire, its petals shimmer with fiery hues of crimson and orange, casting an ethereal glow against the murky surroundings. Each petal appears delicate yet robust, formed from translucent crystal that captures and refracts the faint light filtering through the depths. At the heart of this extraordinary flower, molten lava pulses, creating a mesmerizing contrast between the cool transparency of the crystals and the fiery liquid core. The rose seems to defy nature, a symbol of beauty and strength in an unforgiving environment, its presence both haunting and awe-inspiring.

lavish sparrow
# craggy crest here's a prompt: extreme impasto oil painting of a rosebush, palette knife gouge...

Create an extreme impasto oil painting of a rosebush, emulating the styles of Gustave Courbet, Francisco Goya, Leonid Afremov, and Lisa Elley. Use a palette knife to gouge the paint, creating a 3D effect. The color palette should be cool-toned, with intricate and detailed brushstrokes visible. Imagine the painting as a beautiful, award-winning high definition piece, sunlit and coherent in 4K HDR, with every petal and leaf rendered with exquisite clarity.

noble coyote
#

Prompt @lavish sparrow

bitter hearth
#

i wonder about the purpose for sd3.5 medium

#

i would want to use it if that outperforms turbo

#

now that we have quants

#

I see very little use for models smaller than 8B

#

if medium is below 8b that means quality is not the same as sd3.5 large and turbo?

mortal mesa
#

not specifically

lavish sparrow
#

either dumber or less quality

mortal mesa
#

less capable somehow ya

bitter hearth
#

that would not encourage people to use it then

#

I don't particularly think people should use it

#

the 8B kinda takes its place

#

but are you sure they dont have some cutting edge technology behind it?

#

its possible yeah

#

look at llama 3.2 1b model that outperforms its predecessors that are larger

mortal mesa
#

it also is trained on more resolutions, so it is diffrent

bitter hearth
#

if medium can offer some unique features that we dont have with large and turbo i would use it and im also hoping it would be more efficient with memory

lavish sparrow
#

oops. had a node linked wrong 😮 much better image quality now

bitter hearth
#

but you see im already getting pretty decent images in 10seconds with turbo so unless medium model blows our mind i dont see the point of not using turbo

mortal mesa
#

"It is capable of generating images ranging between 0.25 and 2 megapixel resolution. "

bitter hearth
#

if you use locally then you can swap models easily

bitter hearth
lavish sparrow
#

3.5L

bitter hearth
#

I don't use a bunch of models cos its too tricky on cloud to download and manage lots of models each time

noble coyote
bitter hearth
#

for local users you may as well install like 50 models TBH

lavish sparrow
#

but my workflow = LLM preprocessing, perpneg sampler, 50 steps

bitter hearth
#

there has to be a reason why SAI is putting us on suspense for 29th and for a model we suspect would be a nice surprise than we already have now

lavish sparrow
#

perpneg = 3x image generation (1x pos, 1x neg, 1x neutral)

bitter hearth
#

ah yea perpneg is awesome

#

makes negative so strong

bitter hearth
mortal mesa
#

its gonna come with a controlnet to refine and upscale

lavish sparrow
bitter hearth
#

the control nets are exciting yeah

mortal mesa
#

i did make that up, i hope im right

bitter hearth
#

but i dont suppose large and turbo wont have controlnet as well

lavish sparrow
oblique parcel
noble coyote
oblique parcel
#

what is this cola water lava 😭

noble coyote
lavish sparrow
# oblique parcel what is this cola water lava 😭

In a whimsical anime style, depict a charming chibi creature, part dragon and part cat, curled up in a peaceful slumber. Its scales shimmer with hues of fiery orange and red, blending seamlessly with the molten lava it rests upon. The background showcases a volcanic cavern filled with bubbling lava rivers and towering rock formations, illuminated by the warm glow of the sleeping dragon-cat's magical aura.

oblique parcel
#

paws got smushed but this is closest ive gotten

#

everything else was water or toxic waste fanta

mortal mesa
#

a fanta sea

oblique parcel
#

nooo 😭

craggy crest
oblique parcel
#

improved sdxl but with t5, i hope

craggy crest
mortal mesa
craggy crest
#

@sage burrow Prompt: chrome oil painting, detailed strokes, heavy canvas, night, moon,stars,cosmic, Negative Prompt: left blank, Width: 1024, Height: 1024, Steps: 40, Cfg Scale: 4.0, Shift: 3, Seed: 1126420076

mortal mesa
craggy crest
low stone
scenic osprey
#

randumb batch of sd3.5l stuff. been loving the model so far ^^

dull star
#

@cunning lintel

impressionist oil painting by Claude Monet of a ww1 german soldier with blonde hair holding his helmet in front of his waist with both of his hands. He is hiding in a war trench in distress. The scene is foggy and takes place in a desolate pale area. Faint and muted colors are used.

craggy crest
bitter hearth
dull star
#

might try that flux lite

craggy crest
cunning lintel
#

It's a very bad 3d render no longer 🥳 (but the actual scene comes so late now that the model thinks both hands, nah, i know better)

Style: Impressionist oil painting with a focus on fluid, visible brush strokes creating a sense of movement and atmosphere. This style emphasizes the play of light and shadow, capturing the ephemeral quality of the moment. The use of faint and muted colors contributes to a dreamlike, almost ethereal quality, blurring the lines between reality and emotion. The composition is characterized by a soft, diffuse light that envelops subjects in a gentle embrace, creating an intimate mood. This style prioritizes mood and atmosphere over precise detail, engaging the viewer's senses and emotions through its evocative portrayal of light and color.

Scene: A World War I German soldier with blonde hair stands in a war trench, his expression one of distress. He holds his helmet in front of his waist with both hands. The trench is rugged and earthen, enveloped in fog. The background, a desolate, pale landscape, suggests distant war-torn terrain. The fog creates a sense of isolation and melancholy.

dull star
cunning lintel
#

It's better but that prompt is insanity :p

dull star
#

indeed

#

only needed like 300 natural sentences

craggy crest
dull star
#

1.5-1.7s/it

#

the gguf

bitter hearth
dull star
#

you mean how much is enough for a coherent image?

#

I guess like 15-20

craggy crest
#

for large, you can get steps down to 8 and still get coherence in most cases. but better to just use turbo if you're going to do that

bitter hearth
#

my gpu can fit large but i dont wanna wait when i dont have to

craggy crest
#

i prefer the full model, but turbo's quite good

craggy crest
# bitter hearth i prefer turbo 🙂

Prompt: 3d shading boromir by artist "Alan Lee", by artist "john howe", Negative Prompt: left blank, Width: 1024, Height: 1024, Steps: 8, Cfg Scale: 4.0, Shift: 3, Seed: 1168092020

#

full model, steps 8

bitter hearth
craggy crest
#

hre's the same prompt, seed, cfg, everything BUT with steps at 30

bitter hearth
craggy crest
bitter hearth
craggy crest
kindred ocean
#

Just got email for sd 3.5

#

tldr me is it any good

#

or is it same as sd3 fiasco

craggy crest
#

From the post:
"Target Audience: Engineers or technical people with at least basic familiarity with fine-tuning
Purpose: Understand the difference between fine-tuning SD1.5/SDXL and Stable Diffusion 3 Medium/Large (SD3.5M/L) and enable more users to fine-tune on both models.
Introduction
Hello! My name is Yeo Wang, and I’m a Generative Media Solutions Engineer at Stability AI and freelance 2D/3D concept designer. You might have seen some of my videos on YouTube or know about me through the community (Github).
The previous fine-tuning guide regarding Stable Diffusion 3 Medium was also written by me (with a slight allusion to this new 3.5 family of models). I’ll be building off the information in that post, so if you’ve gone through it before, it will make this much easier as I’ll be using similar techniques from there."
The rest if the tutorial is here: https://stabilityai.notion.site/Stable-Diffusion-3-5-Large-Fine-tuning-Tutorial-11a61cdcd1968027a15bdbd7c40be8c6

Stability.ai on Notion

Interested in generating images like this from SD3.5 Large?

craggy crest
oblique parcel
sage burrow
kindred ocean
#

ayo ty

dusky thistle
craggy crest
oblique parcel
#

accidental japanese prompt turned into a pretty cool image

bitter hearth
#

left flux vs sd3.5 right

craggy crest
bitter hearth
#

i also feel Ai has to improve a lot more than what it is now to catch my interests with animating things

#

kling is doing nice stuff but i still think couple of mins animation doesn't have any captivating substance

turbid grotto
bitter hearth
oblique parcel
turbid grotto
#

actually, I don't mind to try new trainer

craggy crest
oblique parcel
#

its off center, i hate it

craggy crest
dusky thistle
dreamy trellis
#

Make wechat QR code?

oblique parcel
turbid grotto
#

Seems like NewReality dropped first sd35l prealpha finetune

#

sadly no technical details about training waow

bitter hearth
silver sluice
#

llama 3b FTW!

#

i tried llama 1b and 3b bc i needed to run an uncensored model locally and i found 1b too dumb to stay uncensored, could be poor training or adjusting by the author I just couldn't find a good 1b uncensored modmel

bitter hearth
#

im not too bothered about explicit stuff, but it can discuss controversial topics

clever basin
#

??

bitter hearth
#

also 1b q8 and 1b original are both the same file size but q8 has better performance with memory

clever basin
#

Are you talking about llama

bitter hearth
#

wow

#

super efficient

#

just random prompt

silver sluice
#

so you have a workflow that integrates llama 3.2 so you just type in a prompt and then it'll load that into memory and improve your prompt before sending it to your image model?

#

dalle3 style lets say

bitter hearth
#

yeah, its very memory efficient too using ollama node inside comfyui

craggy crest
silver sluice
#

how memory efficient? I'm doing it manually as part of my system and it takes like 15 seconds to load the LLM into memory and then another like let's say 15-30 seconds to process about 150 tokens

bitter hearth
#

i dont notice any latency

#

its almost instantenous

silver sluice
#

its probably doing a thing where it's holding it in memory which i don't agree with

craggy crest
silver sluice
#

i could have a server on standby ready for instant inference but it slows down image generation by having it loaded in memory

silver sluice
craggy crest
silver sluice
#

ive got 8gb vram

craggy crest
# silver sluice oh that's really cute

:) thanks. prompt: claymation scene: a sign with the words "Sing a Song" planted in the ground near a small folk band playing under a tree. behind them is a forested mountain

silver sluice
#

so having a 3b llm model on standby is not feasiible for me

bitter hearth
#

i have ollama server running in the background that loads the model, and i have ollama node inside comfyui and rendering process is like split second delay on ollama node .. so that's not noticeable

silver sluice
silver sluice
#

my SDXL/Pony images go from 15-20 seconds without any memory clog to 40-50 seconds with having an LLM on standby

silver sluice
#

ha it has a rough idea of what Gumby is that's cute

craggy crest
#

:) yeah. it knows it's green and humanoid

mortal mesa
silver sluice
#

7/10 for knowing the general concept of Gumby, im sure better prompting could nail it but that's enough for me

craggy crest
#

workflow is in these - this is with the new Photorealistic lora for 3.5 large on huggingface

silver sluice
bitter hearth
craggy crest
#

i just go talk to meta.ai, tell it what i want, work with it and ask for revisions as needed to the prompt gives me what i'm after

silver sluice
#

you dont need no stinking ollama, that's probably even more overhead i don't need, this is how real men do it:

# llama_inference.py

import sys
import time
import json
import torch
import re
from transformers import AutoModelForCausalLM, AutoTokenizer, StoppingCriteria, StoppingCriteriaList

# Optimize PyTorch for better performance
torch.backends.cudnn.benchmark = True

# Start total execution timer
total_start_time = time.time()

# Initialize model and tokenizer globally to load once
model_id = "chuanli11/Llama-3.2-3B-Instruct-uncensored"
craggy crest
#

generate, upload the image so it can see it, revise, etc

#

(and he screen shots his code instead of pasting it into chat)

silver sluice
#

^ that above isn't a screenshot?

bitter hearth
#

i need to run ollama its not putting any heavy load on my system, and im not a progarmmer to run codes

silver sluice
#

its all good, making images is bout having fun and doing what you find interesting and exciting, I have fun generating images using code

silver sluice
craggy crest
silver sluice
#

i just checked civit and it's not there anymore, he must've taken it down 😦

silver sluice
#

i only see 8, i have it filtered for sd3.5 and lora

#

is there a good spot to like monitor on hf?

#

like a page where i can refresh to see the new ones coming in?

craggy crest
#

that's the list of loras for 3.5 on huggingface

silver sluice
#

sweet thanks dude

craggy crest
#

welcome

#

3 were added today last i checked

silver sluice
#

ghiibli characters is calling out to me

#

i'd ghiblii-fy all sorts of images

craggy crest
#

click those buttons to find loras (adapters) and other things

turbid grotto
#

gonna test first finetune rn

silver sluice
#

it's too early to have any good finetunes i'd think

turbid grotto
craggy crest
#

it's on his sd3 medium page, but it's large on the file page

silver sluice
#

A hyperrealistic portrait of a woman whose body is being torn apart by swirling black holes, each one pulling her skin and body into its infinite depths. Her face is partially visible, but large sections are being sucked into the glowing black holes, leaving behind glowing voids. Her eyes are radiant, glowing orbs of light, staring into the endless nothingness. Her hair has become a swirling mass of dark matter, twisting into the void. Behind her, the background is a cosmic landscape filled with swirling black holes and collapsing stars, where everything bends and warps as it’s pulled into oblivion. lora:aidmaImageUpgrader:1

craggy crest
#

i'm stealing your prompt

oblique parcel
craggy crest
#

that makes me want to cry :(

oblique parcel
#

until you find out that the cat pushed a nuclear warhead off a table

craggy crest
craggy crest
bitter hearth
oblique parcel
craggy crest
oblique parcel
#

camera man* i meant

#

i need to slep

bitter hearth
oblique parcel
#

i wish

bitter hearth
oblique parcel
runic tusk
bitter hearth
craggy crest
bitter hearth
#

those two cat.. only survivor in town

craggy crest
# bitter hearth

and they have their noses pointed in the direction of your dinner

oblique parcel
bitter hearth
craggy crest
runic tusk
craggy crest
runic tusk
spark quail
#

best i can think up is some sort of mosaic

oblique parcel
hybrid rapids
#

What are the requirements for sd3? Can I run it on sdforge webui?
How much video card ram do I need ?

turbid grotto
#

about forge idk, comfy supports from first second

oblique parcel
#

q8 quant of 3.5L needs 10gb vram card

#

if you got 10gb vram then you might also want to get q8 of the t5 xxl text encoder, its 5-6gb instead of 10gb from fp16

dusky thistle
craggy crest
bitter hearth
#

how do you like the image tone and contrast for this turbo 3.5 image

bitter hearth
#

this node plays a role in image tone a lot... default value 3.00 but i set that to 1 and i get

turbid grotto
bitter hearth
#

this is value 3.00

#

olivio also put a video on it, but he was talking about flux

#

flux and sd3.5 both has it

#

for flux you have modelsamplingFlux with more attributes

turbid grotto
bitter hearth
#

drawbacks?

#

no

#

why drawback

#

you can control realism / tones

turbid grotto
#

then why it is not by defaults

#

Lykon said anatomy is better with 3

bitter hearth
#

it is part of default workflow put by official example

turbid grotto
bitter hearth
turbid grotto
#

I'll try tat too

#

thanks for the trick

bitter hearth
#

the sampler in sd35 i'm using is dpmpp_2m with sgm_scheduler

craggy crest
bitter hearth
#

much better result than euler

#

euler is kinda washed out

craggy crest
bitter hearth
craggy crest
#

try it with beta

bitter hearth
#

beta with euler is ok but not with dpmpp

craggy crest
#

that's kinda what I meant. that image i just posted is euler. it's not washed out

bitter hearth
#

but in overall it looks to me dmppp is much better than euler

craggy crest
#

i wasn't using sgm though

bitter hearth
#

yeah i've been using euler all along up until now

#

after i discovered modelsampling node i can control image tone much better

craggy crest
#

once this next week is over, i'm going to do a complete chart on all the sampler and scheduler combos for at least 3 different types of prompts

#

just haven't had the chance to do that yet

bitter hearth
#

im sure you will love dpmpp_2m with sgm_uniform the best along with different values for modelsampling node

craggy crest
#

remember i do everything very basic though - i'm trying to show baseline

bitter hearth
#

value 2.00

#

just look at this tone

#

i love turbo even better now with modelsampling node

craggy crest
#

there's a merge out now as well as a fine tune

bitter hearth