#🏞|general-with-images
1 messages · Page 22 of 1
Mixed ww cosplay w a bunch of different words and got 1 out of 100 that turned out good. Wdy guys think?
my LoRA data set is about to reach 150 images
This is crazy
It is single-handedly going to take my mega LoRA to the next level, I hope
looks gigachad na'vi
I have a feeling thats exactly what it is
yeah
because its half gigachad lora and half avtar lora
love to see gigachad like that x)
u mean just gigachad?
so now for the second time you have gone out of your way to try and undermine something I am working on
let's exchange same prompts and see how different results we get
saying you got this, with that big dataset
A compliment how?
not sure, you were flexing on a big dataset, seemed appropriate x)
I am not sharing any of my info with you. None
just prompts
You already took an idea I wanted to work on well after I said I didn't want to say it, out of fear of somebody taking it... to which you took it
You have none of my trust, especially not after that
so yeah, I'll stick on my own, thanks
why can't we work together>?
I'm all for sharing, but I can get that people want to keep their ideas for themselves too.
Just keeping it civil and moving on to something else, or to PM maybe ?
I don't trust you enough to actually work together, rather than just steal my work/idea just like you did yesterday after you told me you wouldn't lmao
it has nothing to do with working together, or avatar. Its the fact that I specifically said I did not want to share what I was working on, out of fear of somebody taking the idea. You assured me nobody would, to which I shared the idea. And then you came back 30 minutes, having done that exact idea. Again, I have no trust for you
went out of your way to tell me you wouldn't, and then turned around and did
if that isn't shitty, then IDK what is
It doesn't. All it did was show me you are not to be trusted. Last thing I need is for you to say you wanna work together, then steal my idea and beat me to if even though I am well over 15 hours in
Its a project for myself, at my speed. I don't wanna have to deal with somebody else
I beat you huh, well . um.. thanks
congrats on the low blow man
you really got me by saying you wouldn
well
welcome to discord
you can't tell anyone to NOT do a training because it was their idea first
this is something younger audience do, in school
I didn't say not to, you said you wouldn't. AKA, you lied. So you aren't worth my trust
moving on
I didn't say anything about avatar
never said you did. This discussion is over
yeah
Please, do me a favor and do not interact with me or ping me again
Anyways, moving on
Hello guys, do you know if there is a script to automate the test of differentes parameters to generate a table like this ?
Yeah! I use it all the time
its in the bottom of A1111, in the scripts
X/Y/Z plot
Nice thank you it will help me
you can find good info by googling how to use it. I use it for basic syuff
no problem. I know how much it has helped me lol
I can only tell you that this is the XYZ plot that is in the scripts tab at the bottom of txt2img section
but ashamed to say I have never used it
I love to use it too. For lots of things, comparing parameters is one but also prompt testing, or checkpoint comparison when doing training
Really useful once you get the hang of it
Yeah, I use it for all sorts of things now
If that is how you feel, you should just block the other user and not even see their messages again. Just wanted to say thanks, you voiced a disagreement and managed to keep it civil, it's a lot too rare a thing, that I need to say kudos
So we can use X Y Z plot to compare the outputs of different checkpoint but with the same prompt?
Yes !
I don't like to block people, and he hasn't done anything severe enough to justify that. I learned to not waste time on people who are clearly trying to get a rise out of you/be malicious. Thanks for the recognition <3
I just wanna have a fun time and do what I do
Yuppers
it can do allll sorts of things
Wow that's really awesome. This will save a lot of time for me when I'm checking which model gives the best image
There is a "checkpoint name" in the drop-down in the XYZ feature, you just put the names of the models you want to compare
And you still have 2 axis to play with
prompt S/R will look for a word in your prompt, then replace it with whatever you type
I see
so for example
Prompt S/R :.1,.2,.3,.4,.5,.6,.7,.8,.9,1
It will look for all .1 text, and replace it each time with the next number
or you could have it be a word, which I use
Prompt S/R: realistic, unrealistic, hyperealistic, ultra realistic, abstract realism
and so on
Like so
Wow this just became even more helpful
Same prompt, but the word changes, and so does the sampler. This allows you to see how different words affect different samplers with the same seed and prompt
Absolutely perfect
for example

you can see here, this model reacts a lot to hyper realistic
yeah, everything is the same, unless you use one of the X/Y/Z axes to change something
I actually use this specifically to test my LoRA values
Can't wait to try it out once work finishes!
You can see how I used it here
1 LoRA, 2 subjects. Wanted to do a batch test to see how much they affect each other
trying out my first 2 concept dedicated LoRA
Quite impressive
So basically u can set any parameters on X Y Z
Be it checkpoint, seed, cfg scale
Btw how good is it with vram usage?
I currently use SD with 3080 so 10 gigs vram
Is it more burdensome on the gpu?
It will still generate each image after the next one. So no real difference to normal batch count
Ah so it's sequential
Makes sense
That's perfect
Yea also every output gets saved as 1 image before it makes the comparison collage
If you compare 3 models, you will get 3 images and the comparison

heyo, wonderful SD community ^^
i have a question and im sure it already has been answered before, but i dont know what to even search for...
basically it is about those blue-ish smudges like the one i marked in the image. SD based models seem to generate these when there is a sharp color gradient from dark top-left to bright bottom-right. i get those with many different models (original SD and tons of custom ones) and many different samplers.
based on how often i get those, i assume they are well known by now. do you have any explanation of how they come to be?
Hey which model is that? You may need a vae file for it
Mostly these issues are from not using a vae file
ooh i thought vae was only for face restoration ^^
that particular model is CounterfeitV25_25, but i get those smudges with every model i use, roughly one smudge per 50 generated images or so
Nope gfpgan and codeformer are for face restoration, also upscalers help
shows what i know xD
Counterfeit needs a vae, let me search it quick
Do you have any other anime model ?
So Counterfeit uses the vae of AnythingV3
im in that phase where i go on a download spree on civitai and try ALL THE THINGS 😄 so yeah i have many anime models lying around by now ^^
how do i know which models need that vae stuff? id like to learn, so i dont need to steal community time every time something is off 😅
Okay a VAE file is used for color and Detail correction and goes on top of a model while generating. Most models have the vae already "baked in".
98% of the Anime models not.
So they need a .vae.pt file together with the model named like:
Example123.safetensor
Example123.vae.pt
Most anime models can use the same vae because there are mostly merges.
So your Counterfeit model is based on AnythingV3 and this needs a vae. So the vae will work for Counterfeit too if you rename it accordingly.
Here is the file:
https://huggingface.co/AdamOswald1/Anything-Preservation/blob/acc67d36406e41252aa936c43248c4ad988db33f/Anything-V3.0.vae.pt
You then need a restart and switch models that it applies
wow, thanks for the explanation. you are awesome ❤️
No problem, you want to learn, then i explain it 😉
Most model creators also Provide a needed vae for their model or mention the one they used
im pretty sure my pet store used midjourney to make this cat, what yall think? lowkey cool
Open pose + unipc varient bh2,codeformer+gfpgan+gpen,inpaint face.No extra post processing.Overall satisfied with the result.
a rembrandt and van gogh cyberpunk collaboration
wow
wow wow even
that's incredible
everyone click that ⭐
haha, thanks!
trully, great composition
Thank you for the recommendation.
I looked into Nod.ai's implementation and I can't find any documentation regarding intel arc implementation.
You certain it supports arc?
Supposedly a month ago someone in that discord mentioned they were working on it 🤷♂️
You can ask on their discord, someone of the Team mentioned they have a arc770 and test with it
I have oneAPI's base toolkit.
I've also tried the oneAPI implementation of Automatic1111's WebUI.
The problem with it unlike a normal WebUI interface is it requires extreme levels of careful prompting unlike a typical 2.1 inference where simple prompts can be used to achieve great results
The DirectML implementation also currently works, and that's what I'm using right now despite it being slower.
But it does affect it.
Especially considering I've compared the implementations, OneAPI and DirectML.
Same model.
SD 2.1 x768.
I know what model I had loaded.
I literally copypasted the safetensors version of the model between.
That's just how it be my guy
Literally in the reddit post for the oneAPI implementation:
Based on my experience on A770 LE, the second implementation requires a bit of careful tunings to get good results. Aim for at least 75 positive prompts but no more than 90. For negative prompts, probably no more than 75 (?). Anything outside of these range may increase the odds of generating weird image / failure to save image at the end of inference but you are encouraged to explore the limits. As a workaround, you can repeat your prompts to get it into that range and it may somehow magically work.
If I had an answer as to why it does this I would say it.
I just don't know why.
I'm not ignoring you, but I am directly telling you that I've tested it for over 30 minutes on both implementations.
The OneAPI version is much less stable.
i made an everywhere at the end of time lora
oh. we're there then. nevermind forget i said anything. You do you. Figure your own way through this maze.
Not sure why you deleted your last comment.
Seems you took me the wrong way, clearly.
🤷♂️
Oh well.
I'll see my way out of this convo.
asks for help, argues with advice lol
I never argued with you. I literally told you I disagreed.
But instead you went the other way around and continue to enforce that what I've tested is wrong.
bro just stop. i gave some good explanations but you disagreed lol. it's the same typical "i know better than the people i'm asking advice from" nonsense that is all over
no those words are all over. yours are just very similar
yes i see. you're very good at stable diffusion. good job
you are the one who knocks
Okay, now you're just being rude. It's obvious in the context now.
😉
Disagrees with you
Throws everything under the bus because I disagreed with what you initially said.
i'll burn everything down if i'm a little annoyed. no lie
good luck with your intel arc gpu!
is this good for my first lora? i attempted to make an Ivan Seal lora
this is the same prompt, but without the lora
Is Ivan Seal a frost zombie?
no, he is the dude that made the abum cover for everywhere at the end of time
somewhat
i'm not sure of the good tricks to train loras but i think you might need to tune some parameters or captioning maybe 😉 heh
OH its his art style
not a famous biker
i was like "that doesn't look like a guy at all"
for an art style it comes out good
This is what I mean btw
In OneAPI for WSL2's implementation I get images like this half the time.
@desert sandal It's this box
This doesn’t really fit anywhere but this the first actual physical result of my experiments with SD. A 12x12” framed print of ballerina Elsa for my daughter’s room. I’ve never been able to draw or produce any art at all, so it feels great to be able to do this.
Got zoom_enhance working with multiple subjects, should be out later today 😉
you're the one that made that post one reddit ?
yeah
it seemed very interesting, I didn't have the time to try it out yet
I didn't get one thing, you said it detects lots of things, thinner details to be upgraded, but in your examples you explained it did faces first and you needed to specify what to upgrade ? or did I missread ?
I need to test :p
great choice of a name btw
There are two settings: the search query, and another for the thing you want to replace it with. Looks like this:
yay for handposting
that's using only the "depth" preprocessor
it was an example for @supple ravine
all hands on deck !
yeah
Advancing ❤️ ❤️
it goes fast for sure
Now maybe I can finally do this
I had a blast, remaking some "crazy girlfriend "meme
some what?
let me find those
Things i've been able to generate.
wow
(thanks Paint 3D for the controlnet picture there)
dark cat is great
Oh yeah, Paint 3D is a thing lol
who'd have known
finally an excuse to use that
EW
I wanna do some sd stuff I just don't know what to make
Maybe some desktop backgrounds
desktop background is great to do
make someone that seems uncertain about what to do
I always enjoy making them, I already have a few in a folder that my computer cycles through
But not enough
in #1002293361526460608 a while ago I suggested a place for sharing desktop background gens but I guess it never happened
just "desktop background" as the prompt
god cat
its probably an alien cat, idk man
Gods don't abide by the rules of mortals, obviously.
cool i guess
just another cat hidden behind
That makes sense
strangely fits
dude idk why it sometimes comes out with duplicates
like 5% of my iterations are conjoined twins
its just an issue with how sd works
it used to work better :/ i didn't change anything with my settings
negative prompting can help, but not always.
Are you using different models? Also higher resolutions often cause duplication, which is why hires fix exists.
what do you think i use? i use high res 1.5 upscale
well, to be fair, you are generating 1568x1024 on a model trained on 512x512
even with high res, this tends to show
merged models
one of the models is actually capable of wide images
😮 nice clip
CLIPY ! I didn't managed to make him as alive as I'd like
but yeah, controlnet didn't do better than that
trying to mix Edward Norton and Brad Pitt into fight club though...
half the face is supposed to be of each actor
@barren anvil
this is one example, there are multiple models in the controlnet extension that work very well with architecture
this is the depth model
and here is a guide on it : https://www.reddit.com/r/StableDiffusion/comments/119o71b/a1111_controlnet_extension_explained_like_youre_5/
how can you mix images? artbreeder? or there is an sd extension already?
nah, I just had a base controlnet image with both the faces stitch together badly, and used latent couple to have a different prompt in different part of the picture
not wide enough
these are some other backgrounds I have made
love the top right especially
Thanks!
a little under my ratio though :p
I'm working on getting some good ones on mine but it gets harder
I have both a ultra widescreen and a widescreen monitor and it looks fine on both
I just have it set to fill the screen
so it zooms in a little
(I see you writing LOX :p give it to us :p)
yeah I'm bored too x)
I'm going to play a little monster hunter afterwards. Still wondering what model to make out of the monsters
@glossy herald Noob questions here, up to now I've been trying to coerce various online interfaces to render image A in style B (SD generated based on a prompt, so it knows the style) and I'm not getting anywhere. But what you show is way more promising. How are you generating a depth map? Any good resources for starting with depth maps?
check the guide I sent, but yeah, i'll try something on your image.
No way I can do any bush there, there would need to be any scribble of those at least in the source image, but let me see if I get somewhere with that.
Also to answer you, the "controlnet" extension does that quite well. you just put an image and choose the "depth" preprocessor
but for architecture
mlsd seems even better
so, I'm not the most trained on that mode, I don't do architecture a lot, but here is what I can get from what you sent
I'll go less cinematic
lots better
a modern glass building, office, green grass, scenery, sunlight, highest quality, RAW photo, 8k
using the mlsd controlnet model
damn it's quite impressive even
it's a very basic prompt I used, and the rendering is also quite basic, so i'm sure with a little more tinkering you could get better
model is SD 1.5 by the way
a modern glass building, office, green grass, path,professional building, scenery, sunlight, highest quality, RAW photo, 8k
Negative prompt: reflections
Steps: 15, Sampler: UniPC, CFG scale: 7.5, Seed: 3860954534, Size: 640x512, Model hash: 4d4f85a738, Model: Base_1.x_sd_v1-5_vae, ControlNet-0 Enabled: True, ControlNet-0 Module: mlsd, ControlNet-0 Model: controlnetPreTrained_mlsdV10 [e3705cfa], ControlNet-0 Weight: 1, ControlNet-0 Guidance Start: 0, ControlNet-0 Guidance End: 1Time taken: 10.67sTorch active/reserved: 4802/5600 MiB, Sys VRAM: 8087/24564 MiB (32.92%)
Man... Yeah this is great. It's a major use case IMO.
it should also work on interiors
would you have some sample pictures for me to play with ?
and for you to have more examples to show friends I guess :p
Yeah let's switch to dms?
I am waiting for mlsd for 2.1 to show up eventually. There are a couple of 2.1 controlnet developers out there. no mlsd so far.
I'm not sure how people train controlnet yet. making a dataset for that seems like something out of an horror movie
@glossy herald thanks for the Kind words ^^ nice that you have the mod finally back
security was key, but yeah, happy to get it back too. and thank for real, you know it and you rock still like a ball in a field, never stoping
Yea everyone should be able to use SD 🙂 so im gonna support that the best way i can
that's how I started around here
when i got it running myself, I couldn't stop helping
it felt like Narnia or a magic wand or whatnot, and a right for everyone to get it up and running
day number however many its been working on the ultimate Na'vi LoRA
Having to deal with new LoRA issues I have never had before. Hmm
Hey! If anyone is interested in checking out my new model I'm going to be hanging out in a voice channel for a bit
anyone know this extension? for the buttons on top
Settings > User interface > Quicksettings > add , sd_vae, sd_lora, sd_hypernetwork on the end
@static tusk brilliant bro thank you, one more quick question for ya if you know, is there a good way i can automate different lora weights with xyz script?
never tried sorry. not sure about that
no worries ty
there is add weight 1, maybe play with that
I know how to do that!
I specialize in LoRA XYZ plots
so, can you explain a little more what exactly you are trying to do?
@smoky oak i just want to automate the weight of the lora on my prompt from .1 - 1
pagbounce
XYZ, prompt S/R
okay
set the value of your LoRA to .1
then in the value, put this
and you're done
it looks for the .1, and it replaces it with the next every image
oh sick, should i be careful if i have .1 anywhere else, like in the weight of a keyword, will it do that too
Yeah, it will change all .1's
so, I recommend possibly doing this
(lora:VALUE)
VALUE,.1,.2,.3,.4,.5,.6,.7,.8,.9,1
ofc awesome
and I mean the word value
yessir
you can make it whatever you want, just make sure the first value matches
nice ty bro such a crucial thing to know
no worries. Very useful!
I used it to test a Hybrid LoRA I have
1 LoRA with 2 very different styles/characters in it
looks amazing
Thanks. Though I did find they look way better on their own haha
as you can see haha
doing 2 subjects in one LoRA lowers both a bit
Quantum Headquarters 🔥
Follow my IG https://www.instagram.com/p/CpvpqC3vpMK/?igshid=YmMyMTA2M2Y=
Anybody know of any GANs I can run an SD image through to get rid of the oversharpening effects?
Idk but I remember animeganv3 online I used to use
oh, thats nothing like what I am looking for 😅
Thank you though
I am trying to keep realistic images as realistic haha
I am really trying to reverse oversharpening
it happens when you get really high resolution outputs out of SD
Whats the perf look like? I know Arc is a mess with support/drivers
which arc GPU?
A770.
not too bad
thats close to what i get on my 3060ti
Which is expected because the cards aren't far in terms of hardware specifications
Especially so considering intel arc's drivers are still new.
So not bad at all.
Still stands.
Newer card. Newer drivers.
🤷♂️
None of the current intel implementations fully utilize the card either from my knowledge.
yeah, doesn't seem too bad
10IT on what sampler?
I am using A1111, which has a considerable loss compared to other GUI's, IIRC
despite having botched outputs
not bad
So am I, Sytan.
Oh, I thought you were using OneAPI
oh my bad, I see
nevermind
I only use DDIM now
I AM using OneAPI.
its better than Euler A in terms of speed, and matches in output (even better with LORA's)
A OneAPI fork of Webui.
yeah, I realized what you meant
It wasnt actually Ddim that froze me
ah
ahhh, I see
Lol
I tested all of the samplers, and Euler, Euler A, and DDIM came out on top
Euler (non A) actually managed to be around 6% faster than Euler A on average
What's weird forcme
is that I need to hit 75 tokens
Otherwise I get botched outputs lmao
oh, thats weird haha
Simple though since you can repeat s prompt to reach said token count
DDIM gets decent results on 15 samples IMO
Damn, even 10 isn't that bad
huh...
Let me do a comparison, just a sec
oop-
dang, DDIM is working hard right now haha
even 5 isn't that bad
10 is close enough to the others for batch work
10 batch
9.6s
so
1.2s/img
I am using a realism focused model and tweaked prompting
I have been testing extreme upscaling
native (512x768)
1.5x high res fic (768x1152)
Additional 3x super upscale (2304x3556)
What is super upscale and how do I get it?
You can download it in the extensions tab
its called Ultimate SD upscale
I recommend googling how to use it
If you have a fast modern card kde karras and 20 steps for photo realism seems to be the best. Second would be euler_a 20-30 steps
this is, of course, for 2.x
I have that, bit mine never come out that good
I recommend going to their page where they talk about how to use it
Oh I’ve done that too
Ah this is kinda my problem. See how the fur changed?
so let me get this straight, you want more res, with the same look?
you're not gonna find anything that does that
2x Upscale + Color correction goes a long way 🪄🖼️
very nice :>
Thanks 🙏🏾
Your 3060Ti have more than 8GB ? I want to buy something in that price range but in RTX I dont find any except 3060
no, its 8GB, which is pretty much all you need for most stuff
I would get a 3060ti with 8GB over a 3060 with 12 any day. The 3060 may have more VRAM, but its also a way slower/weaker card
I'm trying to get some nice pics out of a logo
anyone want to play with me ?
using it as controlnet
not there yet
hard to keep the logo intact while pushing some real style into it
I can't find the right controlnet mode for it yet
normal map strangely makes it easier to get portals like that
cute one
trying mlsd
Comfi UI makes me so fucking hyped for the future of SD, OMG
Now its a WAY more personal creative process
/drean
you convinced me to try :p
seems fun
@queen lagoon
Welcome! There's currently no bot on the server to generate your images. Start by heading over to #1072220168534642768 to get yourself situated and help find the channels you are looking for! Please make sure you review our #✍🏼|rules-and-tos and feel free to assign yourself some #👥|roles as well! Answer any questions your may have at our #1072229020520947753. There are many ways of accessing Stable Diffusion, take a look at #1080946152318443610 to start your journey!
like, it starts in 3 seconds ? nice
ho damn it's strong
ho that's going to require some adaptation and setting up
but if it has most modules I need I'm going to switch for sure
Here's a demo of a technique for animating Stable Diffusion images now using controlnet and a custom application I made. Could be for live music visuals https://twitter.com/vibeke_udart/status/1635627814476029953?s=20
Still experimenting with animating #StableDiffusion images through the use of 3D depth and light. Next step a tool for live music visuals using these techniques #AIart #AIIA #ControlNet
How to try and crack this?
I saw it here - https://twitter.com/dannypostmaa/status/1635630263148355585
didnt even know this was possible
quite interested too, i'll follow that
Yup it is...!
Cldn't get you
I often think about what folk write in their prompts and want to know what they want for result from those tags/words, the word I got stuck on now is "ultra realistic" and there is so many that swear by this prompt. So for any that use "ultra realistic", can you create an example with two images that show me the difference between normal realism and what you want a ultra realistic image to look like?
Edit: I looking what folk see as "ultra realistic" in an artistic way not is this is better or worse, just what folk see as the difference between realism and ultra realism.
people who have test ComfyUI?
I've had some logo remix fun in the past. It was very cool when Control net came out just a couple weeks after stadia burnt down, so i made these
i plan on having a lot more logo fun in the future
professional photo of the woman of your dreams hiking in yellowstone, hot springs, half dome [amateur ugly crayon scribble amateur basic draft point and shoot cartoon, 3d, (bad art), (deformed), (poorly drawn), (close up), (b&w), weird colors, nude, not realistic, deformed, limbs messed up, multiple creatures, deformed limbs, multiple heads]
closeup professional photo portrait of the woman of your dreams (blonde hair, thin, athletic) hiking in yellowstone, hot springs, half dome [amateur ugly crayon scribble amateur basic draft point and shoot cartoon, 3d, (bad art), (deformed), (poorly drawn), (close up), (b&w), weird colors, nude, not realistic, deformed, limbs messed up, multiple...
unique (dark dim dramatic atmosphere)+ closeup portrait of a stained glass winged fairy princess sitting on a toadstool looking forlorn. intricate, elegant, highly detailed, majestic, digital photography, art by artgerm ruan jia and greg rutkowski [gun amateur ugly anime crayon scribble amateur basic draft point and shoot cartoon, 3d, (bad art)...
I see people in midjourney doing this sort of stuff
using Niji+StableDiff i manage to make genshin-like characters leeet's gooo
time to make some fake leaked characters 😈
I think that is actually against TOS
boddul of whine
i mean, it's not like they're going to take me seriously, chances are none of them will fall for the bait 🫠
true haha, just letting you know in case tehre are some problems is all
why is it so bright o_O
idk 🫠
Been messing with comfyUI
its super cool, but needs a lot of work/optimizations to match A1111
it doesn't seem to have nearly as good of an xformers implementation
and because of that, my resolution outputs are greatly limited, and generations are considerably slower
I wouldn't say its that much slower
but the resolutions are a huge limit
theally made a pretty tall portrait scale demo image using comfy ui
to put it into perspective, I hit a VRAM issue in comfy when doing 8x 512x768 images
I regularly do 8x 540x1280 batches in A1111 with no issues
ok 8x images.
8 images at the same time, yeah
i guess theally made just 1 tall image (looks like maybe 3x normal 512)
so not pushing like going 8x
I hope there will be a colab for comfy coming out soon
I just hope comfy can get on the same level of A1111 in terms of VRAM efficiency soon
someone's gotta try the avengers end game battle type of big scene with many custom prompted characters lol
New Locon trained on watercolor and sketch drawings
using comfy, 1 prompt per character x 100 lol
So I would say ComfyUI is about 20% slower, and uses at least 2x the VRAM of A1111 (effective to output)
Speaking of watercolor, I made this an hour ago
original image was
10,240x4320
I downscaled to 5120x2160
what is the selling point of lycoris? I read this and don't quite get it yet: https://github.com/KohakuBlueleaf/LyCORIS
Honestly you're asking the wrong person. Loha can't even be used in any of the UI's yet and Locon throws a key error in both Comfy and Automatic1111, but still works. I had better experience all around with LoRA but am starting to see sucess with Lycoris/Locon
I am just on normal LoRA's still as they give me what I need haha
success in what? what is it doing better than a lora?
I like Locon as it trains more with less dim and alpha from my experience. So a Lora that I use 128/128 for, I use 64/64 for Locon and my understanding is Loha would be even lower to learn the same thing
LoRA's are still treating me well
and these results are from less than 1/20th of my full dataset
from what I know, Locon uses a shit ton of VRAM, right?
Does 64 vs 128 mean shorter time, or more accuracy? (but that also means you get the same face every generation) or does it mean you get more variety that look like the same style?
LoHa too, right?
Yes I can't go over BS2-4 with a 3090 on Locon and no Gradient checkpointing
means you can fit more things into a single LoRA
Ah, I am limited to LoRA then
like more characters into 1 lora like that set you made?
yes, basically
after a certain point, it can't fit new info, so understanding more using less is huge
You have to adjust Learning rate accordingly when changing dim and alpha as well. There is also a second set of Dim/alpha for locon called convolution Dim and Convolution alpha
I'll just stick to LoRA, tho my mega LoRA I am working on is looking like its gonna have to be 2 LoRA's sadly
Cyborg Suited Female Soldiers 4x and 2x 🙏🏾🔥🎯 Follow my instagram https://instagram.com/glitchzai?igshid=YmMyMTA2M2Y=
ok so i understand the interest in lycoris now. It was just that I keep seeing single characters on some civitai lycoris so I thought, why not use a lora or t.i.
From my understanding, there is no major reason to use anything over a LoRA in that case, as a LoRA is faster to train for the same VRAM
and LORA's are just stupid fast to begin with
any good tutorial for style type lora? I read a basic one on reddit. not training for a character but an art style
These are from my LoRA I trained in less than 7 minutes
I don't train for style, but I am sure I could if I wanted to
I don't have tutorials written up but I only do styles if you want to add me and dm for help
I just read that fine tuning method is good for style type lora
Finetuning is not good imo because you have to extract and it defeats the purpose of Lora, but the results are still good
ok thanks. I am limited in some ways. I cant use bf16, only fp16 on colab
one thing to know is you should only caption/tag things you don't want the LORA to hold onto
It doesn't work how AItrpreneur says
what you are tagging are inconsistencies
if there is something consistent you want it to learn, do not tag it
Yeah the visual similarities it should catch onto from the images itself
^
I can't DM you Devin.
now if you are training on a dark skin woman with different hair styles, tag the hair styles so it can reference later, but it will learn "oh, this is what her face looks like all the time, and those hair styles are some of the time"
Sent you a friend request, crypto servers ruined me leaving DM's open lol
you also wouldn't tag her having dark skin, otherwise it will make her ligth skin when you don't include dark skin
We really should make a collective place to share LoRA training info
info that is actually good, rather than most of the shit you read online
I think @green plover and @dense tapir would be great additions
yeah, maybe we can suggest this to server. there is prompting help. so there should be model-train help ; t.i train help ; lora train help
I don't know about active help, more than just a location with good information
yes they train a lot of stuff too
I work with them closely
we can pin those important info and update as necessary
I am sharing my documentation on multiple subject LoRA's
having an open channel for training help will be cool
like the one you saw
I noticed that it is indeed better to keep 2 characters separately trained in the LoRA when using the same rank size, otherwise they both degrade
waiting in anticipation to see if blip2 is going to crash
@green ploverOh, I tried comfy UI. Its cool, but A1111 is wayyy more efficient in terms of gen speed and gross res
big bummer
Give it time
on average, it seemed to be about 20-30% slower, and used wayyyy more VRAM
Yeah, I am hopeful for its development
I spent all day looking for a card to train on that was decently priced. Doesn't exist, and moore's law is dead on youtube just showed the 4070 is DOA as nvidia is 700+ for it.
And we have captions!!! 10 per picture
nice :>
4070 is just the new 3060
Yeah IDK man. Nvidia is shitty for their VRAM cuckery
btw, comfyui actually beat auto but 25-30% for me on my 1060 gens
actually 11. Now I need to do the research paper thing like we did in grade school and make those into 1 caption per picture
interesting, it was considerably slower for me
I was shocked at how fast
I was getting about 7it/s on my 3060ti on comfy, and about 10 on A1111
the speed isn't much of an issue to me, but the horrible VRAM use is
errors in comfy when doing 8x 512x768 images
whereas I can do 8x 540x1280 in A1111 with 0 problems
I wonder if I can do native 1080x2560 in comfy
I don;t look at its, but I can say 56s became 35-40 using the same everything
yeah, the benefit may be much bigger for non tensor cards
the problem with the cards is I need 16gb min and the price for a decent one from last or current is insane
I bet so
no TC for me on 10x0 cards
for me on 3xxx series, it is considerably slower, but the kicker is the VRAM constraints
which means I can't play with the stuff I wanted to do sadly
well, he is anti 100% gradio bullcrap so I am sticking with him to see what he puts into it.
oh wow
comfy actually managed to pull through on a single native 2560x1080 image
how weird
but 8x 512x768 crashed-
btw, he is a good guy and actually programs unlike auto who merely copies and pastes so if his is slower for you ticket it and he will get right back, and/or fix it.
I think the issue is just that its not using the same Xformers implementation
I am leaning to buy an ARC790 as it trains now, and gens. About the level of a 3060 but with 24gb of ram
cause its slower and less VRAM efficient
I wouldn't do it yet. You're gonna have a lottt of problems
GUYS I HAVE AP ROJECT WHERE I NEED TO SOAK A COIN IN VINEGAR AND SALT FOR 2 DAYS AND ITS DUE TOM CAN I GENERATE LE IMAGE
I forgot who it was in this server, but they were talking about how bad generating on their arc is, as almost all samplers crash
well, yeah, but all I know is I am hanging on by a thread and not even wanting to touch SD anymore.
Seriously
I am not seeing that from the recent threads
I personally wouldn't go from having one set of problems, to another set of problems, but having spent a lot of money
yes seriously
ARC GPU's are just a very incomplete product, sadly
good luck
It will get there but the way I figure it is either them or time to wander off into the desert and see how things are with SD, and the gfx card pricing, in 2024
ty
I can't handle colab any longer
I guess man... I just personally cannot recommend such a lack luster/problem filled product
why not just save money and go with an actual 3060? No problems
I know. I am trying to see more about it for SD. DX9 it is shite. DX11 iffy DX12 it is nice
12gb
I barely have enough on colab with 14.7 and sometimes get OOM
thats just down to how you had it set up
IDK what to tell you, you don't need that much VRAM to train LoRA's lmfao
you can do it on 6GB
you needing 14+ is just down to how you set it up
I do dreambooth, lora, loha, locon, ti, HN
alright man, it sounds like you have your mind set. For your sake, I hope it works out, but based off of the real world, its is a very risky buy
no, my mind is not set but is set against anything less than 16gb, yes.
also, where does one even buy an arc A790?
Nvidia cucked 3k series for vram and 4k is stupid priced
agreed
however, I don't think buying a drastically inferior product in teh right solution here. IDK what else to offer in terms of advice
I know on gamers nexus Intel hired the Nvidia dev guy and they were already 2 gens ahead and 3 in the test lab so I suspect soon next gens
I cannot find anythinga bout an Arc A790, only a750 and a770
Oh, 790 doesn't exist I meant 770. What I meant was a 790 would be nice
wait, but 770 doesn't have 24GB VRAM
yes, it does
I am tired. yes, 790 would have 24 gb and why I mentioned it
that is why it is missing and would be nice
770 is the min I would consider due to 16gb
I am not trying to be rude, just trying to make sure you have all the information right is all
I see. Yeah, that GPU gens about as fast as a 3060 it seems. No idea how training speed is on it
and it wouldn't be able to benefit from xformers, so you would have 16GB VRAM, but have a much lower max res to generate at than an 8GB NVIDIA card
But I think you care more about training than high res generations
as I said 16gb 770 is my min card but no real verifiable info
I am wondering where the 790 is with 24gb? bit odd
They said that their 770 is around 3060 speed, verrified off of the it/s, but that they have to use OneAPI fork of A1111, and it crashes constantly (they also can't use DDIM, and they have to have a minimum of 75 tokens in the prompt to generate)
you know something I am wondering?
where the fuck AMD is in all of this?
besides that
no clue
I am wondering if the people with the most issues are AMD cpu users?
no idea
Intel has some of this in their core cpus now and I wonder if they stupidly used that so those who don't get bad performance or crashes
oh, its dan
@fleet goblet
are you here? Could you maybe answer some questions about your expeirence with your 770?
I
ever tried renting a GPU on runpod instead?
thats a770m, which uses a different arch/drivers, so be careful
not for me I want local which is one reason I have become burnt out on colab
a770m is Xe based
m being mobile?
runpod is not like colab
not Arc Alchemist based
oh
Naw, not for me renting on runpod. I know what it is and steered some people to it but not for me
A770m is the name intel uses for their beefy AF iGPU's on their mobile CPU's
see, its Xe HPG based
not Arc Alchemist microarchitecture based
they use completely different drivers
thats what lead to so many people buying 770's, cause the 770m is pretty dope, but then the 770 have massive driver issues
like 3090ti and 3090 their own set of drivers.
even worse than that
where does one find a 770m?
its legit two different architectures, different in every way.
in laptops only
intel laptops only specifically
or well, you can get them in intel iNUC's
but they are spensive
Oh, wow
The new Intel NUC Enthusiast comes with a new mobile GPU from Intel. The Arc A770M which provided some awesome gaming results. which we had to put to the test.
Intel Nuc 12 Enthusiast: https://bit.ly/3XTNi7Y
►►►SUBSCRIBE for more - http://bit.ly/SubscribeToBoored
Gear
Nanlite Pavotube: https://geni.us/jT0nA
Sony A7III: https://geni.us/ykmf
Ta...
That thing is a beast alright
yeah, the A770m is pretty dope
thats a whole PC
thats an intel NUC, micro PC
Pretty damn tight
a NUC 13 I believe
ffs
and an 11th gen i7, which are very bad
(extremely hot, low perf compared to new options)
I thought it would be like a mobile gpu soldered in since it is a chip. screw that then
the a770m is the name of the iGPU you find on special 13th gen NUC devices
its the same as intel HD760 or such, just BEEFED
basically, its a super beefy APU
CPU/GPU in one chip
and there is one more thing to keep in mind@dense tapir
A770m shares RAM with the CPU
it has no VRAM, instead using system RAM just like all other iGPU's
a mobile 3070 is slower than a desktop 3060, thats why
and the a770 is their top of the line best offering, where as you can get a 3080ti from nividia
ahhh. Well, no hope then as I will only pay a set price which Nvidia is twice more than I will or I go to pascal server based cards so slow
bad fp16 performace which is emulated like mine
they do well with fp32 though
I was also looking at the 7900XTX
of course AMD is sorely lacking as they have nothing to compete with cuda
@smoky oak This is the only card from last gen I would want but that 12gb sucks ass. TI version is when the cuckery began.
3 fans vs 2 or some with only 1 :/
I have a 3060ti
the issue is sure, the 3060 has 12GB VRAM, but its really bad
they went for quantity over quality
rather have bad than 8gb
so its 12GB, but that GPU is often times less than half the speed of a 3060ti. As slow as 1/3rd the speed in things like rendering
in a case like this, I can understand that
for gaming ti for this AI/ML more ram is king even if slower
I saw some chart and in SD 3060 vs 3060ti training was about 1it/s slower
I am also HEAVILY looking at the A4000 RTX
I think that is my only option but that is 500 bucks. Now if the 4070ti were priced right it would be about 500-600
Welp! At least I didn't pay too much for car insurance
or to put that another way, can I have a button like the Staples "easy" button, except when I hit it, it says "That was stupid"
hello team
let's go back to that comfyUI tutorial 👀
for reference, this is the end of the first chapter
sure
I'm scared of where this is going
how to do it? nidji is not free right?
They give you 25 free gens, but yeah, i have the subscription, first i make a character in Niji and then I use a Lora on StableDiff
NO FAIR
SHARE YOUR METHODS
HUEHUEHEUHEUEHEUEHEUE
im sorry i came off creepy
but anyways
Me in a bad mood? Never!
what artist are still working in the prompts these days?
Proper keywording works best, but I have a list
mind sharing?
Those are the ones in stable diffusion's vocabulary
Appreciate the lists. Just a headsup, you've got "Isaac Levitan" listed twice in the Public Domain Artists list 🙂
Thanks. Probably listed twice in the database
hi
(in automatic1111) Is there a way to use Prompt S/R without generating the grid? I do use the grid occasionally, but usually just end up browsing the results individually in a different image viewer
@odd coral
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
Deliberate
Chilloutmix
what's the prompt?
are you an expert on root beer?
got it from metadata
yes =]
its just a copy from chilloutmix prompt
oh you've got loras in here
then do you know where I can find some in Florence Italy?
you should try it why my model =]
thats what openjourney_V4 did
ive heard it's actually pretty uncommon in EU, so you may have to import it unfortunately
too poor for that
guess I'll have to learn to make my own
dont know your model
I haven't had a rootbeer in a little over 17 years
what is rootbeer
@calm canopy same again but now with sd upscale script
yea no problem. thats the power of upscaling 😄
its running
@sonic vessel i mean its not a women as the prompt says, but i think its still really cool
same seed as all.
@sonic vessel okay with random seed it crates a women
ah, its alright. not awesome
i mean chilloutmix is something completely insane
I trained it with the most formal pictures to get the best facial indentity, do anything you like with him.
no way
I'm playing with your prompt.
some of my cars are coming out looking like Hotweels and that's not what I want
its not mine and i dont want credit for it
its this prompt
with the better models you can usually get away with fewer filler words "8k, best quality, etc"
professional photograph, detailed skin texture of a (cybernetic android)+ (beautiful porcelain doll face side profile woman android)+ 150 mm, lace, (H. R. Giger style)+ on a futuristic scifi city street
[amateur ugly anime crayon scribble amateur basic draft point and shoot cartoon, 3d, (bad art), (deformed), (poorly drawn), (close up), (b&w), ...
That's my take on that prompt.
another one I like
RAW SDV2 no VAE no Controlnet poses, merge-models etc, ask anyone this stuff, this is difficult
Heisen-burnt
Hola
hounty
@boreal falcon
damn xD
What type of skills are best supplementary to AI? I think 3d software like Blender perhaps.
im working on a comic, WIP.
/chat
@torn valley
Welcome! Start by heading over to #1072220168534642768 to get yourself situated and help find the channels you are looking for! Please make sure you review our #✍🏼|rules-and-tos and feel free to assign yourself some #👥|roles as well! Answer any questions your may have at our #1072229020520947753. There are many ways of accessing Stable Diffusion, take a look at #1080946152318443610 to start your journey!
that's a strange way to post for the first time on the server
