#🏞|general-with-images
I didn't include the mouth in my mask recommendation, because it's more part of the emote itself usually, and less part of the "person recognition"
Yeah but I'm automating this on youstick.fun so manually entering "blonde hair" in the prompt is out. Unless I first do some feature extraction from the images they upload. I'm not sure how big a model that would be wrt time and cost
training a small model like that, using Colab, currently is about 10 minutes if you automate it. it will require something like 5 to 10 pictures of their face in different contexts if possible (you can put some of the quality responsibility on them here)
then you can use that model to color, it should be all right in the 1.2, it should have learned those hair by heart so it should directly use that in that step, if you use the token for the client in the prompt
this will need quite some testing to completely automate for sure, in particular the denoising of controlnet, finding the right values and prompts here and there
but it should work quite well
bro wtf
big resolution duplication shenanigans
dude why is he so happy
me and the boys
That's interesting idea
does somebody know why its not possible to outpaint upscaled pictures ?
Any tips to make an underwater scene, without visible water surface at the top of the image?
Prompts are beginning with deep sea photograph
I've tried in the negative prompt: sunlight, surface - image is now as dark as I want it but there is always the water surface showing up
Thanks a lot. Going to try this. I'll let you know how it goes
I've seen a lora for deep sea scenes. Try that
Thanks for the tip, Ill see if I can find that one 👍🏻
anyone knows how to solve this? Im using m1 macbook pro. i tried to use SD 2.1 but when i try to generate it shows this. I also added the config file in the model folder. v2-1_768-ema-pruned.ckpt this is the model and v2-1_768-ema-pruned.yaml is the config file. Can anyone help me with this?
by the way.. with other models it works fine....
Tried new lada models and wow, they are great. I used img2img on one of my old generation and new one is so much better
got that heavy metal aesthetic \m/
Hi all, this is my first post here.
Does anyone know how to keep the same elephant (color, size, shape) but keep adding more to the picture?
is that possible, i tried img2img and locking the seed but no luck
Should be possible by using 2 more tools :
1/ inpaint, a tab inside img2img, lets you draw a mask on part of the picture you want to be modified, or not to be modified. This can let you preserve the elephant as is
2/ controlnet, an extension for SD. It can let you keep the current composition and details as much as possible
I'm trying it
someone talked about how hard it is to get guns right in SD, especially 2.1 since they are sorta censored (sex and violence), not bad guns here.
here is an example, very stylized
Thank you @glossy herald that is exactly what i was looking
prompt is
a purple and white elephant in a forest with a mountain in the background, intricate, very detailed, from
and add at the end the type of reference style you want
the first one was "studio Ghibli"
ahh! perfect!
or you know, go photorealistic
hahaha. i think i am going to give up on the idea lol
when masking it, it just places another elephant behind the mask
use the option "inpaint not masked"
by default, the mask covers what should be changed
here is the mask I drew
and here is the controlnet I used, to help keep composition outside of the elephant
last important parameter, denoising, I upped it to 0.85
the higher, the more changes will happen
yeah, the model is important
and you may want to try and mask the tusk too
the canny model is what will make it respect a lot more the original composition
here are the models
Thanks!
no problem
download the ones you want (all ?) and put them in the folder where you installed automatic, in the extensions\sd-webui-controlnet\models folder
then click the blue arrow next to the dropdown of the UI and it should give you the models to choose from in it
happy to see it work like you want 🙂
I saw a problem while testing
it seems to need to specify a lot of the colors
so the mountain going purple is mainly because it's not specified it seems
not sure, I had the same drift
have a happy editing 🙂
I'm trying deflickering, with controlnet on a custom style model I made, but I would like more consistency, any suggestions?
Checking on some frames how the controlnet preprocessing performs and fine tuning its setting could maybe help, pushing down the threshold sliders in canny mode for example.
Also adding a secondary controlnet, or even tertiary controlnet, will help keep for example the same depth or normal map, making it even less likely for the image to flicker
Mh, for now I'm working with a normal net, and a canny net, at a 1k Res, for both of them, I wonder if using depth may actually help
I feel like in the first half, the main problem was things coming in front and then going back, so maybe on that yes.
both normal and canny net have threshold in their preprocessors too. I didn't play with normal yet that much, but I saw that using the one in canny can change a lot of things
Yes, I played around with both, to be fair yeah, maybe i should not use normal as primary, is kind of not super trustworthy for this kind of stuff
you can just run the preprocessor on some frames to see the normal map it would use, but yep, the main feature of your clip is movement and depth, the flat surfaces aren't the most prominent. I'm not sure if the depth model will manage to read it well though
The normal one seems to not do well for this, but the leres does way better
from this
somewhat better than the movie did
Powerpuff girl
original :
getting there
got it
COD zombies Primis crew but real
A picture of beatmakers of various races battling it out.
Similar to the style of Akira Toriyama.
The image size is 16:9 landscape.
Nope
your bot is killing its foe with its rifle? with bayonet, and immediately hosting the funeral with white flowers bouquet in hand LOL.
Well, it came prepared.
flowers to lay on ya
ok so like, has anybody ever run into massive issues trying to do concept art for weapons?
none of the SD models I am using are working
originally scimitars, bows were almost a no go without a lucky hit
I am like 100 attempts in, and SD is just ruining everything
like the results are horrible, and I don't know why
which weapons?
battle axe, battle hammer, war hammers, anything like that
it has 0 clue
bro, it tries to make axes as guitars
well, that is their nickname
yeah but like, a guitar is not a "battleaxe"
He is like the G1 megatron
ok, so the AI has no clue what bladed weapons are, bruh
Tomahawks, hatchets, axes, anything like that
no clue
probably need to grab a weapon model
looks like it doesn't know what swords are either OMG
I have even tried RPG diffusion and dungeons and diffusions, and nothing
We talked about this before sytan SD is censored on weapons/violence (to some degree)
you need to make your own loras sytan
and name them something else
dont call them swords and axes
Like our last conversation, the AI wont draw missiles and bombs on my jets either
Seriously? That is ridiculous lmao
I never talked to you about this
we had this conversation a few days ago
or maybe it was someone else
That really depends on the model, VAE, and lora. I just merged a bunch of models I like and this one gets the prompts very accurately. You can probably make a lora for guns.
but i talked about weapons (specifically swords, bows, axes) from someone here
Some one did make a gun lora
and a tank (type 99? china)
interesting, i guess I am gonna have to train these relics into the AI
my f14 tomcats carry a shit load of external fuel tanks but no missiles/bombs no matter the prompt
1 of the images carried something that looks like an external gunpod...half gunpod half fuel tank
that was as close to arming my F14 as I could get
I never knew that was against what they were doing. That is insane lmao
people got lucky with all the nsfw stuff on sd 1.5
that genie came out of the bottle and it will never go back in now that it is circulating
I am using 1.5, but I guess even that has no info on weapons and stuff?
I just know that SD 2.1 is definitely blocking sex and violence (guns/weapons)
occasionally, something does pass thru (but not completely), I've had half? a nipple show up (discolored/colored to match the skin tone and not protruding fully)
on a 2.1 generation image
Sukhois with external fuel tanks, no missiles drawn. I think they were SD 1.5 images, not even the censored 2.1
F14/18 bastard child with extra wheels despite negative prompt to avoid them and no missiles/bombs
Their censorship is obnoxious. It quite honestly feels like its against the whole idea of a widely available free and open source AI
"use it for whatever you want, as long as we want it too"
The bottom jet looks like it is carrying some ECM pods for electronic warfare, maybe it got that from some EA-18G Growler images
Yes. Thing is you can train to add to it but there is just too many weapons missing. You mean out of all the images scraped there were no weapons?
no, they chose to add none
Precisely
its quite literally censorship. Luckily its not hard to work around, but man, it would take a lot of effort
for now I am focusing war hammers and battle axes
@dense tapir Do you know if there is a way to merge LoRAs?
Like if I made a battle axe and battle hammer LoRA, could I blend them to get one LoRA that does both?
There is
they must have the same dim size
Sorry, I guess I don't know that about LoRA's. Do you have any links to places where I could find out?
omg no. This shit is everywhere and why it has taken me months to get it all down since the start of the year.
It just requires more images and more tag words, right?
Like 10 images with "battleaxe"
and 10 images with "battlehammer"
So then they only trigger when I use one of the two?
yeah, but you don't want the caption info to overlap
If you caption one with "two handed weapon with a giant hammer head"
and the other as "Two handed weapon with a giant axe head"
then saying "a two handed weapon with a giant ---- head" will trigger both data sets, no?
Love me long long time tomorrow cause now I am too tired. https://rentry.org/lora_train
It is very mundane
thank you
It is a great little guide
Can I use this I am newbie
That is what it is really made for.
@dense tapir ohhh, so it's saying to keep them in different folders? So a folder for battle axe, and a folder for war hammer?
yes
as the site shows
each caption for each goes into its respective folder
@dense tapir Wait, so this uses a completely different LoRA trainer to Kohya SS?
This LoRA documentation is going against everything you and I have been doing. It's saying use 150 images? lmao
It is for anime
they use 250 images, and tags, without blinking.
Ah, so anime benefits more because there is more of a style and less data to pull from other things
pretty much. A bit flat
like obviously if you train on a real person with their eyes always open, the AI would know how to close their eyes cause they are a person. But it wouldn't necessarily know how to do it with an anime character in a different style
Pretty much
alright, I get it now
you need more images in anime for everything from open eyes, closed eyes, mid point, frowning, happy, etc...
yep
does this rip out the style of a checkpoint and make it into a LoRA, or am I missing something?
thing I have not figured out is when you merge what then? I know merging embeddings I never could call either again.
you aren't missing anything
alright, awesome
so this merge lora, extract lora, model conversion, and auto captioning
don't think auto is perfect because it is far from that
Auto?
oh, yeah
I actually use it just to add a prefix and make the files for me. I always set it to caption 0-0 cause they kinda suck
but I will have it add like "a photograph of subject name" at the start of the file
or just "subject name"
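For anyone who wants to do the prefix-only caption step without an auto-captioner, here is a minimal sketch. It assumes one `.txt` caption per image sitting next to it, which is what the thread has been using; the function name `write_prefix_captions` and the list of image extensions are made up for illustration and are not part of Kohya or any other trainer.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to whatever your data set uses.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def write_prefix_captions(folder, prefix, overwrite=False):
    """Create one .txt caption per image in `folder`, containing only `prefix`.

    Returns the names of the caption files that were written.
    """
    written = []
    # sorted() builds the full list first, so files created below are not re-scanned.
    for image in sorted(Path(folder).iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        caption = image.with_suffix(".txt")
        if caption.exists() and not overwrite:
            continue  # keep captions that were already written by hand
        caption.write_text(prefix + "\n", encoding="utf-8")
        written.append(caption.name)
    return written
```

Calling `write_prefix_captions("train/battleaxe", "a photograph of subject name")` would fill that (hypothetical) folder with prefix-only captions and leave any existing ones alone, so they can still be edited by hand afterwards.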
I wish i could look at the full setup of like a kick ass LoRA and just figure it all myself, cause there is so much random info on LoRAs
oh, wow yes. I am beginning to hate loras
I may have to go back to embeddings since I do styles
@dense tapir Do you have any documentation on how LoRA ranks work?
ranks are those dims I mentioned
I don't really know what DIMS are
don't need to
more dims the larger the file the more info it can hold BUT that isn't always a good thing
I kinda do, so I know what to set them to 😅
what card?
ohhhh, so like its the index? The smaller the less it holds on to?
GPU?
yes
3060ti
I am not actually sure where to change DIM size
I see the resizer thing, but nothing pertaining to DIMs in the training settings
about 16k per dim so a 100 dim is about 164m
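A rough way to see why file size tracks the dim: a LoRA adds two small matrices per adapted layer, so the parameter count grows linearly with the rank. The sketch below only illustrates that scaling. The layer list is a made-up toy, not the real layout of any SD model, and the 2 bytes per parameter assumes fp16/bf16 saving.

```python
def lora_param_count(layer_shapes, rank):
    """Extra parameters added by a rank-`rank` LoRA over the given linear layers.

    Each adapted layer of shape (d_in, d_out) gets a down-projection
    (d_in x rank) and an up-projection (rank x d_out).
    """
    return sum(rank * (d_in + d_out) for d_in, d_out in layer_shapes)


def lora_file_size_mib(layer_shapes, rank, bytes_per_param=2):
    """Approximate size in MiB; 2 bytes per parameter matches fp16/bf16."""
    return lora_param_count(layer_shapes, rank) * bytes_per_param / (1024 ** 2)


# Hypothetical layer list: ten square 768x768 projections (illustration only).
toy_layers = [(768, 768)] * 10
for rank in (8, 32, 128):
    print(rank, round(lora_file_size_mib(toy_layers, rank), 2))
```

Doubling the dim doubles the size in this model, which is the "more dims, larger file" point; whether the extra capacity actually helps is a separate question.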
or does it always train at 1024 and you prune it down?
I don't use what you do so no idea what it is doing
yes
its at 8 👁️
its at 8, with an alpha of 1
Jeez, so you're telling me I can get better results with LoRA's? they already come out so good, i couldn't imagine haha
these things can go all the way to 768 but past that all kinds of errors happen I have been told
is this gonna affect VRAM or speed?
not really
yeah, I can't train locally so I get frustrated all the time
Aha, I see
Now I wonder about the mixed/save precision, learning rates, encoder rates, unet rate, I don't know what any of these things mean 😅
you should go search and find out cause there is no one size (simple button) fits all.
very interesting. Here is hoping I can find some info haha
precision make those bf16 always
mine are set to fp16
bf
change it to BF?
alright, you got it man
Any of this stand out to you?
like, anything I should change?
There is also this
oh wait, I don't want buckets, my images are precropped
yep
so, any red flags I should change?
a 3060 might require the gradient checkpointing
isn't that to save VRAM?
batch size
as in the number of images? Sorry if I am missing something obvious
oh, my dumb ass
I use 2
uses about 7.7GB of my VRAM. 1 uses about 6.1
alright, cool
so, out of everything else, any red flags to you?
nope, try it and see.
I am gonna retrain a LoRA that didn't do too well to see if there are any major improvements
sounds good :>
@dense tapir Ok, my bad, I just realized
I didn't load the config file that Aitrepreneur gave
I didn't like his config
true haha, just wanted to see if there were any major changes you would make, cause this one is his preset
worst case it has a high loss, or worse, a NaN for loss.
alright, cool
the automatic1111 extension is a bit flaky though
I am using Kohya SS
yep, I know
Oh, I thought you meant something different cause this one is its own separate thing, not an extension
yep
there is an extension (or was) and it was crashing on people and all sorts of shit
ah, alright
@dense tapir Thank you again for all of your help, I appreciate it greatly
You are most welcome. We are all in this LoRA together, but the sad thing is there are a few who gate keep the info we need.
Yeaahhhh, i try to share as much as I can. I have already recruited 3 friends/family members to running SD themselves, one of which is my uncle who is using it to conceptualize his characters he makes for VR chat commissions
Cool. My nephew is all up in this but his pc can't run it.
COME ON
gimmie 2
WOO
@dense tapir so I tried the 1e-5 speed that comes default with Kohya, and it gave me huge loss (.2 after first epoch)
after going back to .0001, I am at .08 after the first epoch (at 100 steps instead of 150)
so 0.08 loss at 500 steps instead of 0.2 at 750
oh wait, I also changed the base model lmao
ok, maybe that was a big contributor lol
yeah, normally 1e-4 for unet and 5e-5 for te. Just remember what you see at the beginning should not be anything to judge by. A lot of this doesn't lower until almost done.
remember loss is it being spanked for getting its answer wrong so it will not do it again.
to compare, on the other it was 0.2 after epoch one, and 0.17 after epoch 2, now its .09, .071, .053, 0.335, and now its at .025 (thats where it is now)
I wanna see how much benefit each epoch gives
yeah, the LR is more inline. also I don't use those tbh
LR?
learning rate
ah
I also prefer constant with warmups
this?
only good if you use constant with warmups as the scheduler
like this?
oh
yep, that one
so what kind of difference does it make?
now warmup make 5 from 0
yes
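For reference, "constant with warmup" just means the learning rate ramps up linearly from 0 over the first few steps and then stays flat. A minimal sketch of that shape is below; it is not Kohya's code, and reading the "5" above as 5% of the total steps is an assumption.

```python
def constant_with_warmup(step, base_lr, warmup_steps):
    """Learning rate at a 0-based `step`: linear ramp from 0, then constant."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    return base_lr


total_steps = 500
warmup_steps = total_steps * 5 // 100  # assumed: warmup set to 5% of all steps
unet_lr = 1e-4                         # the unet rate mentioned above

for step in (0, 5, 12, 25, 499):
    print(step, constant_with_warmup(step, unet_lr, warmup_steps))
```

With these numbers the rate only reaches 1e-4 after step 25, so the first few dozen steps train more gently than the rest of the run.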
failed?
Was messing around with overclocks, looks like I went too far lol
it returned a non-zero exit value
first time its ever happened, and it happened about 60 seconds after I changed the OC lol
oh well, still got 5/6 epochs lol
Did you make this? From which api?
automatic1111
prompt - walking through the ocean
nice !
@dense tapir Quick question, do you do your captions as like "woman, tall, pretty, dress"
or "a tall and pretty woman wearing a dress?"
first
ok
and if you had a subject specifically, how would you start it off, out of curiosity?
first is tagging (.txt) second is captioning (.caption)
wait what
don't get the question
tags and captions?
yep
WHAT
I have just been using .txt, thats all I have seen
ohhh, i see what you mean
my bad
the styles are captioning and tagging
people/subject/object too
so people subject object, do you have an example of what you mean?
well, class people would be a man, a woman, a child, etc...
actually, from everything I have read, do not name the person
oh really?
yep
I wonder if that's why I am having problems with my current LoRA
might be. Remember I haven't really tried (except once) to do a lora person
ah, I see
don't name the item so use generic terms
woman, man, boy
I tag so 1girl, 1bot
*boy
oh wait
I upped the LoRA strength a lot, and now its a lot closer
with the others, if I went over like .3, it would mess up
show me one of your training images (pvt is fine if needed)
this one is still getting some artifacts
I lost my link to SD anyone want to help me out?
thank you but I was meaning the one for the web browser
Edit your webui-user.bat and add --autolaunch after COMMANDLINE_ARGS=
The URL is also:
http://127.0.0.1:7860
Anybody know how to install extensions while SD is in share/listen mode?
thank you but I dont understand how to do that
You right click the file and edit it
Where it says COMMANDLINE_ARGS=
you wanna add --autolaunch
this will automatically launch when you start it
shit-
That deleted my previous arguments
Nevermind, got them back
now to figure out how to add extensions while IP hosting
Disable the hosting, add extension, enable hosting
Ah, thats gonna change my local IP/URL to access stable diffusion, but that does work
I'm trying to generate images for Thri-kreen (Dungeons and Dragons race) and I'm failing a bit. They are an insect race, but in human form. I've tried the [praying mantis|human] notation to try combining them, but it doesn't work. Is that because there's not enough overlap in the categories?
though I would much prefer to just be able to do it when I am authorized with my login
there should be models that actually have info on that race
with the --
one moment
yes
--autolaunch
no spaces, just like that
then just save and start
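Since hand-editing the line wiped the existing arguments earlier, here is a small sketch of adding a flag without losing what is already there. It assumes the file has a `set COMMANDLINE_ARGS=` line, as the stock webui-user.bat does; the helper name `add_flag` is made up for illustration.

```python
def add_flag(bat_text, flag):
    """Return webui-user.bat text with `flag` appended to COMMANDLINE_ARGS.

    Existing arguments are kept, and a flag that is already present is not
    added a second time.
    """
    prefix = "set COMMANDLINE_ARGS="
    lines = bat_text.splitlines()
    for i, line in enumerate(lines):
        stripped = line.strip()
        if stripped.upper().startswith(prefix.upper()):
            args = stripped[len(prefix):].split()
            if flag not in args:
                args.append(flag)
            lines[i] = prefix + " ".join(args)
            return "\n".join(lines) + "\n"
    raise ValueError("no 'set COMMANDLINE_ARGS=' line found")


example = "@echo off\nset PYTHON=\nset COMMANDLINE_ARGS=--xformers\ncall webui.bat\n"
print(add_flag(example, "--autolaunch"))
```

To use it on the real file you would read webui-user.bat, pass the text through `add_flag`, and write it back, ideally after keeping a copy of the original.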
There is a model trained on thri-kreen
New version 3 is trained from the pre-eminent Protogen3.4. I found that training from the photorealistic model gave results closer to what I wanted than the anime model. This model is fantastic for discovering your characters, and it was fine-tuned to learn the D&D races that aren't in stock SD. Works best with 768x512 and other resolutions larg...
This is the closest I've come. That's just a tiny insect with a big sword.. 😄
thanks!
no problem :p
I am a fellow DND diffuser haha
it does illithids very well
tortle too lol
already love what you can get with the standard network (not sure what I'm using)
I found the right command, it was --enable-insecure-extension-access
its password protected, so its no worry
ok I have to sign in or up to make changes
really sorry to be a bother but where do I have to open the bat file? do I need to download everything and open it with notepad ++?
Having some trouble getting thrikreens to be generated with 4 arms. Anything I can do with the prompt or should I hope for a model update?
its what I use
I'm still trying to learn this programing stuff
all good, I am shit at it, I just am a very fast learner lol
ok then how do I download it from GitHub I can never find the link in all this mess
oh, and I was wondering what is sytan sound design. do you make music with some kind of software or are you working on the software for making music itself?
new problem python is not launching
Good morning Internet. Today I tested "Illuminati Diffusion v1.1" and I do not like it, it is so dark and it feels very limited. But there may be something wrong with my Stable Diffusion and Automatic1111, I often get ERROR written in the UI and I get a feeling it has not been updated even though I do a git pull at start.
I do not like "Illuminati Diffusion v1.1" but I think that is a model folks should download and test, it may be very creative.
do you want to install Automatic1111 webui?, come to #🤝|tech-support for help
weird gen , but I kinda like it
I have been trying illuminati diffusion without much success in making dark scenes. I have tried putting lamp and light in the negative prompt, and also edit the textual inversions that was suggested by the page. Not sure if it's working
I suspect "illuminati diffusion" is trained on contrast with much dark and light and few mid tones, so you will get a dark room but then there will be some lights that take a bit of the focus from the image,
ship welding repair
does illuminati diffusion use the offset fix that i've seen going around? i guess base SD and all the training algorithms have a bias towards averaging the brightness of a pic to 0.5. Someone released some math that helped the training process learn to create images that have higher or lower than the average for general contrast
I feel like that's just over hyped nonsense...
They are claiming it's SD noise algorithm problem, but...model, just trained on other images or Lora fixes it, even tho it doesn't change algorithm itself?
Am I not understanding something or they are saying dumb things , using smart words?
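For context on what the "offset" trick changes: as it is usually described, it does not touch the sampler at all. During training, the per-pixel noise gets one extra random value per channel that is shared by the whole image, so the model also has to learn to shift overall brightness. Below is a dependency-free sketch of that noise under those assumptions; real trainers do this on latent tensors, and the 0.1 strength is only a commonly quoted default, not something stated in this thread.

```python
import random


def offset_noise(channels, height, width, strength=0.1, seed=None):
    """Gaussian noise plus one random offset per channel.

    Returns a nested list indexed as noise[channel][row][column].
    """
    rng = random.Random(seed)
    noise = []
    for _ in range(channels):
        # One value shared by every pixel of this channel: the "offset".
        shift = strength * rng.gauss(0.0, 1.0)
        plane = [[rng.gauss(0.0, 1.0) + shift for _ in range(width)]
                 for _ in range(height)]
        noise.append(plane)
    return noise


sample = offset_noise(channels=4, height=8, width=8, seed=0)
print(len(sample), len(sample[0]), len(sample[0][0]))
```

That is also why a model or LoRA trained this way can change the results without any change to the generation algorithm itself: the difference is in what was learned, not in how sampling runs.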
attempt at making a model based on the simcity 4 art style
Oh this is fire
fantastic work :>
Don't you just love when a LoRA just doesn't work no matter how hard you try lol
My hamster always have same or similar pose and the composition stay similar with small variation when use Illuminati Diffusion v1.1 or SD v2.1, what do I do wrong?
man, no matter how I train this LoRA, it's always bad
@smoky oak Send it to someone to test, for some reason I have the trouble that most LoRAs look a bit bad and grainy, it may be some setting or so, I do not know, I have no luck with LoRA.
Most of my LoRA's come out damn good, but something is physically fucking with this LoRA. Its consistently giving her details she doesn't have across several models, and I don't get it
The biggest being that it keeps giving her a dot on her forehead thats not there in any of the data set images
I really just don't get it lmao
its insane how consistently it makes somebody who looks nothing like her
it only looks like her if I specify short hair, which she doesn't have in most of the data set images
What is up with my Stable Diffusion and Automatic1111 today, after running "Illuminati Diffusion v1.1" I get a feeling that SD v2.1 and SD v1.5 are messed up, even the same prompt under "Deliberate" looks wonky.
What am I doing wrong?
https://civitai.com/models/14605/howls-moving-castle-interior-scenery-lora-ghibli-style new LoRA for scenery style of Howls Moving Castle
Trained from the brilliant Ghibli movie Howls Moving Castle, this LoRA is capable of creating many different interiors and landscapes in the style from the film. example use: howlbgs, scenery, <lora:howlbgs:1> https://www.patreon.com/nucleardiffusion Note: This free work is intended for art and illustration purposes of emergent technology and not ...
Using ControlNet to generate images based on my sketch drawings.
I may have been writing the prompt, but LoRA made it into "art"?
LoRA's are crazy, and I have no idea who this is, but to use that LoRA made everything a bit creepy, this is a combination of two images.
@dry crow Just a couple examples. They are very similar, but there are loads of detail changes.
Flick between the two
No matter how many I generate no two are alike
That could be the Upscaler too
Thats causing this
Try without one
Same issue
is same seed expected to generate exact same image?
For me, it always has tiny changes
It's supposed to. Disabling xformers arg and generating multiple images produces the exact same thing every single time.
Yea xformers can cause this
exact same?
Yep
hmmm
Down to the same byte
img2img or txt2img?
You want exact because you're upscaling?
Exact because its the exact same parameters and prompt surely? Is it not a bug in the xformers library?
I dont think its a bug
This is the --xformers arg removed from the bat file. All 3 here are separately generated images.
if xformers introduced randomness and eliminates reproducibility, I dont see why this wouldn't be a bug?
those are surely the same
yeah i can't tell any difference
i just checked it online
if i move the slider the images are the exact same, literally zero difference
not even a minor diff
That's the point, without xformers they are exactly the same image, with xformers they differ a lot
yes
that's what happens with xformers
it literally states that on the automatic1111 wiki
Xformers library is an optional way to speedup your image generation.
This optimization is only available for nvidia gpus, it speeds up image generation and lowers vram usage at the cost of producing non-deterministic results.
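If you want to check "down to the same byte" yourself instead of flicking between images, hashing the output files is enough. This is a small sketch using only the standard library; the file names in the usage note are placeholders. It compares whole files, so any metadata embedded in the image counts too.

```python
import hashlib
from pathlib import Path


def file_digest(path):
    """SHA-256 of the file contents, as a hex string."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def all_identical(paths):
    """True if every file in `paths` has exactly the same bytes."""
    return len({file_digest(p) for p in paths}) <= 1
```

For example, `all_identical(["00001.png", "00002.png", "00003.png"])` on three generations with the same seed and settings should come back True without --xformers and, going by the wiki text above, may come back False with it.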
When AGI takes over 
just sydney carrying on Satya's legacy
I haven't trained anything in a while and I would like to train something cool for 2.1, is there a specific style or concept someone wants?
(I do embeddings)
I hate it in Deforum when I get what would be an awesome render using 200 steps and 50 scale but it’s too distorted because those are so high and if I lower them then the render is different and way less detailed
Im thinking about stylized low poly/game asset embedding, but I don't know if I should do that for 2.1 or 1.5
I guess why not both?
⬆️ControlNet + OffsetNoise + LoRA = Stable Diffusion 3.0!
Read my Reddit Post below to learn more!
i think i fed it with too much image to train , gpu goes brrrr
flowers in space
Generated
Scaled out the way it's supposed to be
I love it when a plan comes together. 16x9 ti works! 😄
cute Asian girl, long hairstyles #🏞|general-with-images
https://imgur.com/du0EDPW lil better, dog was going schizo in the beginning tho unfortunately, i used canny and the hed CN models
o and yknow what, i used a random seed for each img :/
yep, that would explain it :/
i wish the pose model didnt take so long, can be so useful 😦 like 2x longer than hed and canny combined
https://imgur.com/TQzXHX7 @glossy herald any tips? I think i just gotta let it run overnight with openpose, has odd limbs a lot
first, go on and check the setting "Allow detectmap autosaving" next time, and you'll be able to keep all the openpose preprocessed frames, and drastically reduce the next try's time
It would also help you detect which openpose detection was buggy, and fix it, by making the openpose preprocessed image for that frame yourself using openpose editor
then, is that using the same seed ? you could gain a lot of consistency through that, on the trees and the background car for example, not sure.
I find it quite good to be honest already though
I really need to start doing one like that too
ah ok thank you ill run it again, it is the same seed, 75 denoise tho. I used guess mode on both hed and canny, just had to put "watercolor, guy, brown pants, striped shirt" as the prompt, using the deliberatev2 model
its done with img2img batch and controlnet. I find a video on pexels usually, i use "free crop video" to crop to a square (if ur card can handle it then there isnt a need), then i use ez gif to turn the video to frames. I take the first img into a regular img2img page and get some decent settings with controlnet, then go to batch, link the files and an output folder, then let them generate. Then I put it back into ezgif but the other way, jpg to gif 🙂 make sure to use the same frame settings for both or itll mess with the speed
it almost understood handstand lol, using canny, hed, and openpose
Wow thanks for the explanation, never tried animations yet, maybe a project for the weekend ^^
its super simple once you do it a couple of times, that last clip i sent took maybe 45mins-1 hour on my 8gb gpu
Nice i have 8gb ^^
Negative prompt: Popular opinion. -I think Illuminati Diffusion v1.1 just now is one of the most boring models/checkpoints.
dmn :/ https://imgur.com/mLVl6ZH
I tried it out too, the images it can make are cool but it's way too specific and repetitive for me, even with my embedding.
Guys, do you know a way to save the parameters of an image generation? Because the thing at the top right (for me anyway), only saves the prompts, it doesn't save the resolution, the CFG scale, the seed, the steps... etc.
Do those 3 look like one person in 3 different stages of his life?
That was same for me, can do some good and folk should test it, but it so limited just now. We hope they develop it more.
https://imgur.com/CyMhzaU ill have to work on post processing tmrw, been a while
and i think the CN depth leres model would help with the background
mw2 type shii
It looks cool
Looks dope
lol nice experiments. openpose model isn't trained for a lot of poses i think, a lot of refinement will go on there most definitely. particularly hand stand type styles. i discovered the limit when i was trying to generate olympic break dancing shots
frfr
Hmm... Coincidence? (1st is AI generated image, 2nd is a real photo)
mute from r6 but with camo
Trying to make doodlebob 3d
i'm starting from far away it seems
I need to discover blender soon it seems
more and more things benefit from that knowledge
Have you seen the videos showing you can use blender to get depth, canny, and open pose to fix hands and feet?
yes I saw that
I did some 3D like 20 years ago, but never touched it again since
I need to just get back on that horse
controlnet creates a real use for me right now ^^
but I may start with real life video input, I wanted to try that too
such a beast that controlnet
yeah, I want to get into using this for videos
If you could do that with the offset extension for night with car lights making a neon effect swosh
like this?
That looks cool, not sure why i put neon, but how the lights look on hyperlapse videos
see without too much blur in
earth genasi
not sure Im a caveman level right now. put hyperlapse lights see what it does. a lot of us are experimenting
this

Tried updating one of my old generation with new artstyle mix, how does the style look
I know, but I mean details, I tried mixing Art Nouveau with pop art
and bit of impressionist at the top
looks good! its low res, but for that res, its clean
its high detail for the resolution, is what I mean
What about this one?
That one looks sick
yeah, thats a cool style for sure
I've been flipping between hot men and wallpapers
the duality of man lol
This is impressionism, mixed with slavic pagan realist art
don't ask how I got that mix of artstyles
alright haha
but it does look very cool man
I have been focusing on wallpapers like this
It was a fix of this generation I made months ago, so I am currently fixing my old generations to make them better with new models
high res generations mixed with upscaling
nice
oh yeah, new one is wayyy better
Here is another fix
Oh I like that one as well (she kinda looks like a kid tho IMO haha)
phone wallpaper I made
I mean, on the bunny girl nothing is supposed to be nsfw, it is just supposed to be a cute painting
True, for some reason the model I am using has that problem, I sometimes have to specify adult so it doesn't look like a kid
Man, I am looking at what I used to be blown away with when generating with midjourney, and I can't believe how fast AI has grown
This was from less than 6 months ago man
and it progressed from just a couple of smudges that barely resembled a person to this
I want to try with new model, try to again do futurist artstyle
wow
might I add one was $30 to run, the other was free lmao
I love stable diffusion man. I am genuinely passionate for the art of image synthesis
I wanna see if I can somehow use SD to generate musical visuals for my album I am working on
I mean, the entire album is around raw sound synthesis, and synesthetic/chromesthetic reactions to music
Remember early MJ faces
yeah haha
This was once taken as peak of face generation
yeaahhh haha
now we can make basically photo realistic faces consistently lol
This was my first megabatch
and every single one is good. I just find that insane
Yeah, even fingers in Lada models became less of an issue
for being a single generation across 64 seeds, every single one is unique and properly captured the look of Lady gaga in just 4 seconds an image. Insane
I'd go as far as to say I get pretty reliably good hands
not always. Right now it is like 50/50. But it is so much better than the 1 in 1000 chance of getting a good hand before
The first one is an image I tried my best to generate for hours, and I needed a lot of work to get barely human anatomy, but now I used it on a new model and it is wow
I wonder if there is a way to interface with SD to do frame by frame auditory visuals that respond to specific notes and chords. Like sample the sound at the section, and associate specific colors with specific chords/notes. That would be an incredible way to have synesthetic visuals
I think some of the more complicated LADA models are using that strategy
Only issue is I have no idea how to code or anything, so I have no idea how I would even try to bring that vision to life
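For anyone curious what the note-to-color idea could look like in code, here is a rough sketch, not a finished tool. It assumes standard tuning (A4 = 440 Hz) and makes up its own mapping of the 12 pitch classes onto an evenly spaced color wheel; `pitch_class` and `note_color` are hypothetical helper names, and nothing here talks to SD itself.

```python
import colorsys
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def pitch_class(freq_hz):
    """Map a frequency to a pitch class 0..11 (0 = C), assuming A4 = 440 Hz."""
    midi = 69 + 12 * math.log2(freq_hz / 440.0)
    return round(midi) % 12

def note_color(freq_hz):
    """Assumed mapping: spread the 12 pitch classes evenly around the hue wheel."""
    hue = pitch_class(freq_hz) / 12
    r, g, b = colorsys.hsv_to_rgb(hue, 1.0, 1.0)
    return (round(r * 255), round(g * 255), round(b * 255))

# Example: middle C, E4 and A4
for freq in (261.63, 329.63, 440.0):
    print(NOTE_NAMES[pitch_class(freq)], note_color(freq))
```

The resulting color could then be turned into a word in the prompt or a tint for an img2img starting frame, one frame per sampled section of the track. Getting the dominant frequency out of real audio is a separate step that is not covered here.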
wish you luck
I really wanna release my album along visuals that fit the message, but man thats gonna be soooo much work 😅
I just finished, with the new models, the first of my collections of gods and goddesses
Just this single sound right here would probably produce some insane color responses just off of note association, if that was a thing
yeah
(here it is in the little micro demo for my album)
The whole thing is based around alternative synthesis and audio scapes/expressions
oh, I forgot my icon is animated lol
I thought I was tripping balls lol
nice song 
Thanks!
i have 2 more demos (much older and lower quality, but hey)
A-rythmia is 3 concepts for a single song together
and the intro is the placeholder album intro (its LOUD at the start)
I am sure you can hear how much lower quality their sound design and mix downs are 😅
Truthfully, me getting into stable diffusion was cause I wanna make art for my album

Pixar style, a super cute and happy white fairy bunny, pink headphone sports car, bright big eyes, wearing a small yellow flower skirt. Sweet smile shiny snowflakes fluffy, fluffy tails, fairy tales, cherry blossom forest background, flying cherry blossoms, bright colors, natural light, ultra Clear, ultra realistic, ultra fine, Illusion Engine Octane Rendering 8K, 3D, HD-g 2-v 4-ar 2:3
how
how what?
I was just showing midjourney 6 months ago vs stable diffusion now
I did, i used 7th layer anime v3-a
Was my first ultrawide generation heh
now I make more realistic things
like this
just made this one too
Here, have a free GTM_v3 generated Mustang. Great model by the way.
@tired nymph charming Polish village, complete with colorful traditional houses, cobblestone streets, and a lively town square
trying now with another model
damn thats well done
decided to try something
256x256 to 1024x1024
368 to 1024
512 to 1024
768 to 1024
and finally, RAW 1024
same prompt, same seed
crazy how base res affects it
and I would say they all have their own strengths at 1024 final
tho speed is not one of them
how long does 1024x1024 take? takes forever on my 1070
256x256 took 4.33 seconds
368x368 took 4.39 seconds
512x512 took 5.12 seconds
768x768 took 9.34 seconds
1024x1024 took 18.02 seconds
256 to 1024 took 15.91 seconds
368 to 1024 took 18.56 seconds
512 to 1024 took 16.77 seconds
768 to 1024 took 20.54 seconds
not sure why 368 to 1024 took so much longer
weird
using 30 base steps, 20 upscale steps, CFG 8
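A quick way to compare those numbers, as a sketch only: it assumes the base pass inside each upscaled run costs about the same as the standalone render at that resolution, which the timings above do not actually confirm. The dictionary and function names are made up for illustration.

```python
# Timings in seconds, copied from the messages above.
direct = {256: 4.33, 368: 4.39, 512: 5.12, 768: 9.34, 1024: 18.02}
upscaled_to_1024 = {256: 15.91, 368: 18.56, 512: 16.77, 768: 20.54}

def upscale_overhead(base_res):
    """Extra seconds on top of the base render (assumes the base pass costs the same)."""
    return upscaled_to_1024[base_res] - direct[base_res]

def ratio_vs_raw_1024(base_res):
    """Total time relative to rendering 1024x1024 directly (below 1.0 means faster)."""
    return upscaled_to_1024[base_res] / direct[1024]

for res in sorted(upscaled_to_1024):
    print(f"{res} to 1024: +{upscale_overhead(res):.2f}s for the upscale, "
          f"{ratio_vs_raw_1024(res):.2f}x the raw 1024 time")
```

By these numbers only the 256 and 512 bases finish faster than raw 1024; 368 and 768 come out slightly slower.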
with what nvidia card?
so yeah, 512 to 1024 seems like the best option
strange, does not seem much faster than my ancient one
its more about VRAM speed than core speed
I can send the exact gen settings, if you wanna compare
oh, I am sure the model matters too tho
512 to 1024 took 1m16 so yeah it's a lot slower
oh, ok yeah haha
76s vs 16.77
but still, thats not too bad
my very high res gens take longer than that, granted at much higher res
Still only 8 GBs of video ram. Haven't really kept an eye on it. Is that standard?
I have 8GB VRAM as well
I mean, my 1070 also has 8 GB
don't know what that is
you use a1111?
yes
let me do you a massive favor
xformers made my GPU way faster and I can go to absurd resolutions now
you know the bat file you launch?
you wanna edit it and add --xformers
after the COMMANDLINE_ARGS=
Make sure to add no space
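If you would rather script the edit than do it by hand, here is a minimal sketch. It assumes the launcher contains a line of the form `set COMMANDLINE_ARGS=...` (check your own bat file first); `add_flag` is a hypothetical helper that only transforms one line of text and does not touch any file.

```python
PREFIX = "set COMMANDLINE_ARGS="  # assumed form of the line in the launch bat file

def add_flag(line, flag="--xformers"):
    """Return the line with the flag appended, leaving unrelated lines untouched."""
    if not line.startswith(PREFIX):
        return line
    args = line[len(PREFIX):].split()
    if flag not in args:  # avoid adding the flag twice
        args.append(flag)
    # No space between '=' and the first argument, as noted above.
    return PREFIX + " ".join(args)

print(add_flag("set COMMANDLINE_ARGS="))
print(add_flag("set COMMANDLINE_ARGS=--medvram"))
```

To apply it, read the bat file line by line, pass each line (without its trailing newline) through `add_flag`, and write the result back, keeping a backup copy of the original.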
you should generate faster, and be able to go way higher resolution
this 1280x540 to 2560x1080 just took 69.34 seconds
yep, xformers is a must-have option, just pure positives on vram and speed
before I had xformers, my highest res I could do was 1024x1024
only for upscaling or also for generic 512 exploration?
now I can go way past 1080p
everything
yep, it helps on everything
why is it not enabled by default 🤔
its faster, and uses less VRAM/allows using shared graphics RAM
it's just an upgrade on the cross attention thing
new tutorials tell you to enable it by default 😅
its fairly new
compatibility issues with some hardware maybe?
the base cross attention is maybe easier to run
I followed an old tutorial and was stuck on 1.5 for two days until I discovered 2.1 was out 😓
it does only work with NVIDIA, I know that
does SD work on amd cards as well then?
and it benefits different NVIDIA cards differently
for me it went WAY higher res and faster
both have their uses. right now, there is also the "controlnet" extension that does magical things, but only works on 1.X models
I use only 1.5 as of now. No reason for me to go to 2.x
Combining things (like earth and skin) seems to work MUCH better on 1.5. 2.1 seems to get stuck on real faces
2.1 is a mess TBH
should I try 2.0?
removed nudity, trained with inconsistent resolutions, blegh
I use strictly 1.5 as of now
don't care for nudity
and don't think 1.5 can't give the same quality as 2.x, cause it very much can
All raw 1.5 results, no in painting or editing
anyone excited for 3.0?
the version matters much less than the quality of the model
realistic vision v1.3 on SD 1.5 will still blow basically all 2.x models out of the water, cause of how well it was trained
all images above are raw, unedited realistic vision results
I generated them myself :>
@charred crystal question, do you use just like.. SD 2.1 base release?
I'm liking the animal/human hybrids more 🙂
ahem
Random McLaren incoming.
as in for D&D, not nsfw
haha no
Oh, here is one I made a few days ago, Tifa. No, this is not from the actual game either.
no, 1.5 works better for weird stuff like Earth-Genasis
I tried the D&D thing for 1.5 but then you get bog standard stuff
What, are you just using the official release?
for now
Oof, the base versions are not very good haha
I haven't even touched SD 2.0 or 2.1 after trying them the first time. 1.5 is just....better for me all around otherwise.
I've been using stable diffusion for less than 3 weeks, and I can honestly say that every version 1.5 model I have used has been better than every version 2.x model I've used
same
Now that we have Noise Offset for better contrast and darker images, total freaking GAME CHANGER to where 1.5 can directly match to even surpass Midjourney.
Stable diffusion already beats Midjourney in a lot of places, like accessibility, granularity, confidentiality, diversity, pure resolution / detail output at the high end, the ability to train on whatever you want, and way way more community support/new features
But now that we can reasonably expect weighted contrast sometime soon, I don't really see what the benefit of Midjourney would be over stable diffusion anymore, other than the fact that you don't need a powerful computer to run it
I mean, just the Auto webui alone with how much control we have is more than enough to kill it in my honest opinion, but yes absolutely.
haven't tried midjourney and now I won't 😉
Things like control net also just bitch slap mid journey haha
ControlNet is next level shit for us.
I was one of Midjourney's original supporters, I supported it for three or four months, and racked up about 15,000 generations, granted it was very early in the beginning, before stable diffusion was released, and then I left about a month and a half after stable diffusion's release
I paid for it once and after SD came out I ditched. lol
Mid journey was fun back then, before they decided to kill it by censoring the fuck out of it
I tried stable diffusion when it first released, but it was so fucking trash back then, that I just went back to Midjourney, before I couldn't afford to continue with Midjourney and I ended up leaving image generation
Sorry, I'm using voice typing so it's a little off
I don't support the views of the CEO of MJ regarding censorship at all....
Yeah, it's completely bullshit. I don't know if it's as bad as it used to be, but when they first started cracking down on censorship, it was game ending for me
It was so unbelievably censored that you could do barely anything at one point
Hold on a second...I have just cooked up something amazing for you....
Wanted an image of a minor in any capacity? Against TOS, want anything related to bruises, looking rough, torn clothes, grungy styles, all against TOS
Anything referred to as blood red? Against TOS, short skirts? Against TOS, gym clothing? Against TOS. Men not wearing a shirt in any capacity, even if it makes logical sense like them being at the beach, or being professional weightlifters? Against TOS. It was fucking horrible lmao
I canceled my subscription shortly after
It was a lot worse than just that, but I don't remember any other examples off the top of my head
Damn, I forgot how rough that actually was.
Though I will say that I think it is absolutely bullshit that stable diffusion also censors content
Like on one hand I get the message, but on the other, it's just as annoying to deal with as mid journey at times
You talking about Dream Studio, yeah?
No, I'm talking about stable diffusion in general
There are significant parts of it that are censored, and while I can understand parts, some of it's just ridiculous
Oooooh, you mean like what they did to 2.0 and 2.1?
Nope, even before that
Try and generate a sword with stable diffusion
It won't do it, because they removed any and all weapons out of the data set
A gun? Nope. An axe? Nope. A tomahawk? Nope.
Hell, they went so far as to remove the guns on military vehicles, as well as the missiles on military aircraft
My friend was going to pay me to generate some concepts for cyberpunk/steam punk battle axes and warhammers for a character that they're creating, and I spent probably a solid 2 hours trying to do it, and it just would not do it
Oh...now I see what you mean. Yeah...you're right on that one again. Weapons are a no-go.
Like I can understand them censoring illegal stuff like inappropriate underage content, and stuff like that, but a sword?
That seems ridiculous, good luck generating any convincing looking knight lol
I was trying to generate buff lumberjack men cutting wood, and it would always remove the axe
Granted, you can add it back in by manually training it yourself, but who the hell wants to do that for all weapons?
hmm
We can barely even get lightsabers to come out decently. lol
That alone is a one in a million.
Exactly, I feel like them censoring such a light content as a hand knife or a crossbow is just very much against the open source nature of stable diffusion
It becomes basically useless to generate any form of melee-based D&D character, unless you want to generate them without their signature weapon
Thank you for reminding me, I want to see if I can find a way to generate hot cyborg men lol
Well, at least I got this image back in January based upon one of my favorite visual novels. lol
Oh wow, that looks dope
Think any of that will improve with later versions, or is just up to the community?
It's not likely, stable diffusion seems to be taking away more than they're giving
However, we should all know by now that it's the community that makes stable diffusion, not stable diffusion itself
So I'd likely say that as more people find out about stable diffusion, and higher-end hardware becomes more accessible, we can expect the quality to go up just on community support alone
Alright, this is what I've been waiting to put up here for the last few minutes. My last and favorite image of the night.
That looks very nice
Thats my Car in Saints Row 3
I would hope that after some of the hype, attention, and controversy dies down, the official models become a bit more broad. I have certainly run into the mangled weapons.
They will for sure, just think about when stable diffusion 1.5 came out, and how much better stable diffusion 1.5 is now. That wasn't because it's stable diffusion, that was because of the people in the community putting in the time to train the AI better and better and better
Frankly, anything that someone holds is hard to generate. I tried to get a janitor holding a broom, and only the smallest part of the handle would show up.
The Huracan didn't even exist until 2014 funny enough. Might have been based on the Sesto Elemento which predates it while being similar. XD
Yeah, I'd have to say that my biggest complaint with stable diffusion is that it doesn't generate what you ask for, it just generates the general idea
That's something that Midjourney still excels in, its text encoder and understanding of what you're asking is significantly better than stable diffusion
However, that comes at the expense of a ridiculous monthly payment, limited access, limited image rights, and supporting a company that frankly I don't agree with
So I'll stick with my higher quality stable diffusion generations, at the cost of them not looking exactly how I ask them to be lol
Oh don't you worry...one day we'll wield enough power to make this damn AI give us what we want without question. Then we take the world.
Yeah, prompting with SD is an art in and of itself. I have noticed that sometimes it is fine generating something by itself, but not in combination with other things. Like, heaven forbid you want only a partial cyborg in a few different models. I had to use inpainting with a different prompt to get one.
But I am just a noob, so maybe that is just flaws in my prompts.
Haha, yeah
Oh yes, one of my weaknesses, hot cyborg men haha
Something I have struggled greatly with as well
It also doesn't help that there is very limited content of anatomically mimicked cybernetics
Yeah. I am only trying to get a specific limb each time, then match it with a particular outfit. I have been learning a lot about inpainting, img2img, and upscaling. Most of the original image is gone by the time I have something I like.
Johnny Depp?
That's fair, I only use in painting for specific things
yes
I wish it did not need to have a source for that. Just smash the two topics together please. This in the shape of that, or with the texture of some such
I wonder if I could train a LoRA on that NFL robot mascot. He's so hot, and for what lmao
Sorry, I am just too into men
😩
You can't tell me the man that made this robot does not have an eye for the male body lmao
Missing a cybernetic member?
Ha, over in the anime channel people had a tendency to make the toned gals tonight. A lot of guys seem to be into ripped chicks these days.
These people know what they're doing
They're cyborg baiting as hard as Disney furry baited with Zootopia lmao
Zwierzogród
Not even sure what language that is
polish?
it's Polish for Zootopia 😄
yes I'm definitely going to use this knowledge
Also, complete sidenote
I love how different the tiger designs are between sing and Zootopia
Both look really good, but they are very different in a unique way. Just props to the character designers
haha omg


