#💬|general-chat
1 messages · Page 146 of 1
ladies and gentleman
after 12+ hours of fixing my SD
i have finally done it
for all those who have helped, i love you.
use forge and reactor
hello
does using reg images mean you don't need to use captions anymore?
Hi everyone, I'm new to Stable Diffusion. I think I succeeded in installing it with automatic1111 through Google Colab. But once I disconnect my session, do I have to do the whole process to reconnect again? Or is there a way to connect to my account in an easier/faster way, knowing that I already have all the files on my Google Drive? I appreciate your help!
What is forge for?
And this can change the fairstyle too
?
I also want to generate video based on the user selfi photo and the text prompt. I don't know if it's possible and how to do.
If you are an expert, Please help me , please let me konw. Thanks. 
forge is a fork of A111
im not an expert tho but video is not very good
were not there yet
Can you suggest me do perfect video?
If the standard is perfect video, probably more like a decade
@fleet jungle imo a111 is pretty versatile and should suit most of your needs, easy to use and setup
thx
https://github.com/invoke-ai/InvokeAI is pretty good as well from what I hear. Personally, I only use comfyui
is there any downside to using forge
my friend recommends it as "literally a1111 with some preinstalled things and optimizations"
Hello, does exist any explanabiltity/interpretability like DAAM (https://arxiv.org/abs/2210.04885) for inpaiting?
any unique features these two UIs have over a1111?
invoke has a really cool new regional prompting thing that looks pretty simple to use. it's like a layer system basically. a1111 is still good, it's just kinda dated and is typically very slow about adding in new features.
isn’t regional prompting just masking over an image and inpainting? what is the extension doing that webui’s can’t do natively?
In inpainting the expected behavior if you don't put a prompt is to delete the selected object?
"A new release of pip available: 22.2.1 -> 24.0
[notice] To update, run: H:\webui_forge_cu121_torch21\webui\venv\Scripts\python.exe -m pip install --upgrade pip"
When installing forge
Should I do update this?
Grab it before it gets deleted: https://www.reddit.com/user/No_Dragonfruit_5472/comments/1chdemx/tradingview_premium_pack_crack_2024_version_free/
In your official model for inpainting: https://huggingface.co/stabilityai/stable-diffusion-2-inpainting says "and trained for another 200k steps" this finetune is to handle the inpainting task or just a general finetune?
Btw everyone - DuckDuckGo is becoming better and better -> you can now use Llama-3 70B & Mixtral8x7B on their site without creating and account / logging in / whatever. It's absolutely bonkers and I love it!
what is duckduckgo
A duck that ducks and goes
Not sure but switching to forge has fixed all my problems
oh
honestly comfy is big boy/gal image generation
i might look into it then
everything else is amateur hour
what flexibility do you mean though
😄
for insdtance i can generate with cascade or sigma then refine it with sdxl then run c-light then upscale with supir
all one go
u can even throw ella and sd15 in the same workflow
what model and program do you guys use
Comfyui as UI and models / workflows depend on the image i want to create. Could be either two SD 1.5 models (one regular and one as tile upscale), Pixart-Sigma with SDXL on Top, ELLA, ....
You can use any A.I you want, as long as it’s Stanle Diffusion
for a beginner i can say a1111 was great for me to get into it
im using forge right now which is just a1111 with some extra optimizations and some things pre installed
so if ur trying to get into it, i can recommend this
can anyone help me? i dont know how to animate with ai. you know
custom trained ai to draw over animation with your on style?
Topaz Video AI what are the 2 best models for 480p and 720p 1k to 3k bitrate videos from 2006 to 2012??
Hello, is it possible to apply only LoRa style to a photo (generated image X)
For example, I have already completed images and I want to apply Lora (sketch style) to these images. ps. in ComfyUI
Look into inpainting
Also look into reactor for face swaps
But if you want the same image just different style, send the image to img2img tab, write the prompts, and adjust denoising strength until youre satisfied
There are probably better methods but this is how i did it when i started out
Hi, how come low weight values don't change the image at all? For example (Butterfly:0.6) and (Butterfly:0.1) seems to have the same effect
Hello, How can i use a automatic111 extension from command line without the UI. Need some guidance. I am using this extension https://github.com/numz/sd-wav2lip-uhq . I can generate from UI. i Can create and add a new API but i'm not sure how to run the script to run extension.
I still don't entirely know what webui is
web [user] interface
the thing you interact with to generate stuff
1
Could someone point me in the direction of a good A1111 tutorial? I don't know really how to change/add models or anything. A general stable diffusion introduction would also be useful because I don't really know what a lot of things are. I know that lora are a thing but that's about it. It's all very overwhelming
sebastian kamph on youtube worked for me
Hello!
Hello!
How are you?
What do you do?
Hello everyone. I'm new to Stable DIffusion. Is there a way to make 2d pixel art animations? (I'm interested for a video game I'm making)
Hi everyone, I am familiar with Stable Diffusion.
I wanna help you
If you have any project, let me know , I will defintely help you
Is it possible to make it so sd automatically upscales images it generates as opposed to manually doing so?
Yes, It is possible
How would that be done?
This is a spanish channel but it got the best guides, turn on subtitles https://www.youtube.com/watch?v=qbK5_-lVmt8&list=PLDEJW5aR0tLvvf4Jg2GwTwRfT6xtGDByo&ab_channel=AcademiaSD
what is cosxl? I don't understand
https://huggingface.co/stabilityai/cosxl
using a workflow in comfyui
Hi, I'm doing well. I'm a master's student in a medical physics program
hi
https://youtu.be/kqXpAKVQDNU?si=EHs5JZaQmE1yTi1Q
This should get you going
So how does stable audio open work in terms of commercial use? Can I use the outputs in songs I release to spotify or is a membership needed?
Stable Audio Open is trained on CC0, CC BY, and CC Sampling+ audio data.
missed opportunity Stable Audio Diffusion (SAD)

hey guys whats the auto1111 of SAO?
doesnt really seem useful until we can make it sound along to a midi file or something
Any intention to support stable audio in comfyui?
When using the api to generate images I get finish_reason: 'CONTENT_FILTERED',
seed: 2950283743
}
It blurs the image. Any way to say that it should not filter content ?
Is there some audio generation, audio editing with AI that can be run locally? I run some AI trained program a while ago that allowed to extract stems from voice and instruments. I guess there should be more and better.
I posted this in #🎵|stable-audio
Stable Audio Tools (Main Repo):
https://github.com/Stability-AI/stable-audio-tools
DionTimmer's Gradio:
https://github.com/diontimmer/audio-diffusion-gradio
But you have to train your own model?
You don't have to train your own models. The weights just dropped.
In fact, you guys have had the capability to train your own models since last October, but there wasn't weights to fine tune off of.
link please?
great thanks
stable audio can be used on colab? @forest trout
I haven't tried it on colab so I don't know the answer to that.
What node?
did you say nerd? (no one gonna get this reference)
Anyone using nodejs to interact with StableDiffusion 3 using image-to-image?
Hi guys. Can someone tell how to use more then one lora in SD-webui?
And onto the redeeming arc SAI goes. 😛
nvm figured it out
are there any channels where I can fulfill LoRA requests here
How can I make it so that a lora's trigger words are shown somewhere in comfyui?
you need yet another custom node for managing the metadata of a lora. many loras have metadata already. kohya-ss gui has a way to edit it. i dont think one-trainer does any metadata.
safetensors does seem to have a spec for trigger_phrase metadata field, but none of the tooling authors seem to care about it
Happy pride month everyone 🇪🇪
probably better ways but ive used this https://github.com/jitcoder/lora-info
not sure what that is, you mean automatic1111? forge?
if auto1111 you can use as many loras as you want, you just keep adding them to the prompt (there's a shortcut of clicking them in the lora tab and it adds them to the prompt. you may have to play with the weights to use multiple, since they may conflict with each other and cause defects in your image
civit is going to seriously need some additional filters, seeing like onlyfans, twitch streamers, tiktoker...it's getting silly
foooooocus got an update
was that the right number of o's
otherwise I have no idea what software you're referring to 😛
Biggest feature is playgrond 2.5 model support, I haven't test yet
pg2.5 is a pretty solid model to play around with, i use it a lot in comfy. cool to see them work it into fooocus finally, for people that don't want to deal with noodles
God bless homeless vets.
and yellow flowers, and bees, and things that make us smile
what is better : to train a Lora with only one character or train a Lora with many characters?
and being on topic
is there anywhere I can use stable diffusion 3 right now outside of the API?
i have tried everything i have seen to make a logo based on text templet in control net using depth, lineart, canny, all and nothing seems to work. any idea what my prob is?
Stability used CogVLM to synthetizes captions for their image database. Is Stability planning on releasing the captions as well?
last release date i heard is 2024-06-12
those are not even made by Stability lol
they are open sourced
https://github.com/THUDM/CogVLM
I know, but that's only the VLM. I mean the captions for the images.
uhhh...
I'm running on 10s per image here. If there is already a dataset with captions for LAION for example, that would be cool
yeah not sure about that
Anyone know where the "copy info to folders" button went in kohya_ss? Now it says "copy info to respective tabs" and it doesn't work
SD3 when
What are the best prompt generators these days?
hh
last release date i heard is 2024-06-12
😮
That US, AU or international date?
guys how do you use sd xl in automatic1111? I always get washed out red images everytime, even with vae set to none. i am using this model https://civitai.com/models/133005/juggernaut-xl
even trying to replicate the image here https://civitai.com/images/10895925 its all washed out green and red
Thanks you for your supporting
I should reply sooner but I want to research by myself first before making stupid question
I mask the original image and run with a simple prompt like "wall yellow paint"
but the result is not good https://imgur.com/a/Xxaj8FG
Could you give me some advises so I can continue researching?
Thanks a lot
How can I use Stable Audio Open
I have a workflow for it i can share if you use comfy UI
Its not exactly stable audio
hmmm
It uses tacotron2 text-to-speech & musicgen text-to-music + audiogen text-to-sound
Its available on GitHub i can share the link for it when i get home
what?
it is difficult like this XD yo could try force with ip adapter or try with cosXL, was launched by this days I think could work
you use A11 or Comfy?
I'm testing with both A11 and Comfy
But I think A11 is easier to use for newbie like me
yes
i think that you can do it with ip adapter and cosxl
idk if exist cosxl for a11 yest, if youdiscovery you can notice me?
@viscid tiger
Comfy ui???
I want a website version
A cloud version
@low moon
@viscid tiger hey cosxl existfor automatic1111! you can make with that! 🙂
I'm looking to make sound effects for an ai game I'm making
Basically a demo
But yeah
that looks promising
let me try
Thanks a lot
you are welcome
i don't know if this is the place to ask this, but can anyone merge two checkpoints for me? 😦 I have slow internet and my attempts keep failing.
Is the Zenbook 14 OLED UX3405 laptop suitable for using Stable Diffusion?
Hi, I am new to this group...
can someone please advise me what should be the minimum requirement for stable diffusion to work on my laptop? or vice-versae?
if dont run on our pc ou can run on google colab
Thank you, but still want to run on local. Meanwhile, I will try on collab too
just install and try it, is not difficul to install
you need a gpu
Currently, in my laptop I have 4GB will that be sufficient to start or do I need to upgrade to either 6 or 8GB?
idk =/ mabe is sufficient, you can try
Thank you 🙂
you are welcome 🙂
Either /s or you are out of the loop.
(a bit less than) 1 week.
this is not SD3 this is fake
and you will see it in a week
Dude... you got no clue what you are talking about. One of the Devs ( @finite cloak) is a very active member on the discord - and he was very open about the state of SD3 2B & 8B, possibilities and limitations.
You will see it in a week.
yes, we will see a version of SD3 cropped 1000 times
💀
copium in action
we will just see then
Can someone tell me what was that extension called that detected faces/hands and attempted to fix them?
is there a way to make stable diffusion make pics of small resolution? i need a 160 by 90 picture but it just outputs random colors and stuff like that
It cannot. Need to generate a larger image and downscale
SD dont support super low res image generating because it is simply too much
try 512x512 for SD1.5 or 1024x1024 for SDXL with/without aspect ratio
how would i down scale, i dont see an option
trying that now
Using any image editor in existence
Stable diffusion wont listen to my prompts. Im using image2image to bring my handly drawen sketches to life but for some reasons it keeps keeping the white color from sketch even when denoising strength is at 0.75(even with white color written in negative prompte ) . Any tips ?
strange, put denoise in 1.0 to test
can be the model (checkpoint) so
I will try changing the checkpoint thanks
you are welcome 🙂
changing super low frequency data like background color requires either doing it yourself (eg in an image editor) or setting way higher denoise.
For what you're doing (converting sketch to realistic image) you want a controlnet, not image2image
what are controlnet ?
we will need I2I and controlnet
@atomic gull you will need controlnet, @finite cloak are right
is that basically allows you to control the position of the character/human?
any tutorial i can watch because it sounds like it would help me a lot
there's this thing called youtube
@atomic gull for what porpuse?
it's owned by google
dont berude pine
yah, dont berude me
but yeah you can found on youtube artiom
first tutorial I just google doesnt even explain you how to download -_-
pine is in a bad mood today
thanks I will check it out
you need to take the extension on github and install on extension
people dont want to do basic research a lot of the time
if you use Swarm, controlnet is one of the parameter groupings list on the left, it installs itself
he never eard about was jus a question
for example
the extension youget on github, the controlnet are composed by two partes
preprocessor (that generate the maps and models thaatgenerate the image)]
fwiw I offered a bunch of information, but anyway, I'll let you take over
you will need to download the models i think too
call me contronet
controlman
i'm on control
kk
u didnt say shit bro why you talking like ur the one helped me 😂
ok, sure
thank you just found THE perfect tutorial
good luck, any question you can @ me
Are you using the latest Controlnets by Xinsir? They are better than the old Models.
(for SDXL)
man Swarn have google colab?
its like horde from LLM, can I use by credits?
i dont have gpu I use colab, any way to I experiment swarm on colab or other way?
@static cape
I never used Collab.
yes
nice I found, thanks ^^
man
sorry my ignorance, I'm wathcinga video about swarm
but
a lot of questions
is better than a11?
is really a way between comfy and a11?
what can someone talk about swarm?
damn china release a sora clone with controlnet
god, will be porwerfull
but is a a good thing, will have competition
hey
i never use refine this things, is it yet used on sd? is necessary?
i will have to test it...
Hey everyone has anyone made the google lens translation image to image?
You do not have a valid subscription. Please login with your Discord account while signing up to https://stability.ai/stable-artisan#choose-stable-artisan-plan.
Important: Make sure you click 'Continue with Discord' at the login screen!
Excuse me, what is this question
I dont see any question
what is wrong with you people
o/ - I’m new here and just starting my AI journey. I actually wanted to create my own model, but with so many underlying models it’s insane!!!
I have 109 images and I am using 5000 woman reg images, how many epochs and repeats should I use? batch size 1
it's a character in SDXL, likeness is important. realistic
is this discord bot with SD3 able to make images in a private DM channel the same way midjourney can
i want to try this as opposed to stability assistant but without all the clutter of a public channel
Is it normal when using control net it takes way more time to generate images ? (normally I would get 3 images under 20 secondes now I have to wait 20 minutes)
I hosted stable diffusion of my pc https://15ac549072ee441cf05935fe91e7a8dc.loophole.site/
20 minutes doesnt make sense
you're out of vram
but wait, he didnt get an error then? he was staring at his screen without error for 20 minutes? lol
Not 20 but 12
it caches vram into page file/ram
Makes it slow as shit
Rip 12 isnt enough
what pc specs you have?
3060 12gb
huh
i mean i guess depends exactly what kind of workflow you were doing there, but idk, that is crazy
Ill check tomorrow what's wrong
are you using a1111? cause a1111 has some bad memory management
Whats a1111 ? 😅
automatic1111
Automatic 11-11 the webui commonly used to generate images Stable Diffusion images outside of Comfy UI
After googling yeah I use automatic webui
So basically if I switch to comfy ui my problem wil be fixed ?
it's hard to tell what exactly is going on, but you can try with comfy
No that's unfortunately not how it works, what you are experiencing appears to be a hardware limitation, comfy UI can be used to test to identify the issue though. Unfortunately Comfy UI is ANYTHING but comfy
Ill try tomorrow tweak with settings cuz I just downloaded control net
Bit of a segway here, but I'm trying my hand at training a SDXL LORA again for Pony, when it comes to tagging/describing I'm curious to know some of the best practices. Is there are limit as to the description length or specificity? I am willing to manually tag a hundred files to get better accuracy than the auto tagging interrogators provide, never had much luck getting accurate results with them. This would be for a particular style and look as opposed to a singular character.
is 1.49it/s good for 3090 on kohya?
i installed stable diffusion web ui in macos sanoma after 24 hours i have logs who wants to see?
I don't live in China but heard how the water falls in China are truly MADE in China
Not great at training models unfortunately so I don't have much of an answer for ya sorry
Sry I don't get ur point
So u are a ai? Traning a language model , is it?
Yuntai Mountain Waterfall had a scandal that went viral
This my first time to hear about this place . I live near Taiwan. I wanna swim to Taiwan. BTW I can't swim so long time.
How do u know about that , and u know ccp will ban this news
Generally we think that is fake news
Videos went viral on chinese social media, to the point CCP affiliated news stations had to comment on it and park services had to make an announcement. By that point though the videos surfaced on western social media and took off. South China Morning Post also did a segment.
Back to your question though 1.49it/s for a 3090 seems a bit low from what I found; but it depends on what your batch size is, typically I stick with 2
Hello
hello, sorry for my english, i need help, i dont know why prompts/outputs LoRa has effects, i need help ty 😦 i speak spanish
Lora's have specific weights you need to add at the end, what are you typically typing? Example: lora:ExampleLora:1 - typically each lora creator also has a descriptive guide up on Civitai to help you as well as each model can have different optimal settings.
im doing all of that and doesnt work TwT
What Lora and base model are you using?
嘎?
can i show u with a really quick screen share? 5 min
Is the Zenbook 14 OLED UX3405 laptop suitable for using Stable Diffusion?
Sure I guess
okie diffusers 2
Yeah diffusers 2 has its own chat at the side move your window so I can review a bit
there?
Yeah. It's likely due to the fact your base models are mismatched, the Iron Man Lora I believe is trained on SDXL 1.0 ; meanwhile the model you are using is trained on PONY XL
perfect
it is not a scandal per se ( as it is somehow not hidden from the tourist )
the operator said the pipe are there because of lack of water in dry season
Do you guys prefer to generate images one by one, or just set a number you want it the generate at once??

which make sense if you know some part of China have some heatwave happening in these recent year
again it is still raising a debate inside China, as which one part of the people said " this is literally cheating, you can make every cliff a waterfall with pipe like this literally" and other saying " nah this is fine, they just don't want to make tourist feel disappointed during dry season "
environmentalist... yeah you know it
I made a ComfyUI with Cosxl
the result is not bad but still need improve
room and new floor pattern image: https://imgur.com/a/4v0I4UO
Result image: https://imgur.com/a/ABh3YmB
as you can see, the floor pattern is not straight, could you give me advise to improve that?
Yeah I understand the reasoning behind it - like many other rivers it can dry up during certain seasons. The reasoning behind it though is a bit more political in nature than just sheer good will in helping tourists. Tourism contributes to the local economy, and tourist sites including ones of natural beauty reflect the nation on a global scale. Politicians with big egos dislike anything that can bring them shame so they go through hoops to make something look good rather than just let a natural cycle play out and keeping tourists informed. Unfortunately for them when it falls apart it ends up being way worse than had they done nothing at all; and for some in tourists I'm sure, while yes it's understandable, it does take some of the magic out of the moment, at least it certainly would for me.
woah
you got the potential to get everything political.
seriously you also know that the fall located RIGHT at the border between provinces
Shanxi and Henan
so yeah
upstream was Shanxi, downstream was Henan
Shanxi built a dam on their side so it can't flow into the original fall
and then Henan just feel like "what the hell" and decided to build the entire pipe system on their own
I have experience in politics and do a lot of writing lol, it's pretty much the same everywhere with slightly different formalities. Whenever it comes to money being put to a system like that, there's bound to be a secondary reason. Addendum: If a dam was built that would also be a secondary reason in that it was a dispute between provinces
the fact that Henan authority doesn't even make anything to disguise it and just straightaway say "yes this is for the fall", thing is pretty obvious whether it is political reason or environmental issue
The diverted water is all mountain spring water, ensuring there are absolutely no other undesirable water sources involved, and it will not damage the natural landscape. The scenic area is doing this solely to ensure the viewability of Yuntai Waterfall during the dry season and to enhance the visitors' experience
From the operator
never expected to talk about local Chinese issue here lol but seem it get quite an attention from SCMP
If it was environmental they would have installed a larger pump system up to a reservoir up top to ensure the river banks remained stable. If it was for display they would have installed the pipe at the end of the waterfall. That said to me this isn't a major issue, it's funnier than anything else and ultimately harms no one. It went viral because it subverted people's expectations and people around the world have seen that vid now lol
Hi @finite cloak re: infinity grid, on auto, is it possible to set the generated image name like one is able to in "Images filename pattern" of auto settings?
Remember: It's not art. It's technomancy.
Good morning, everyone! How are you all today?
do reg images add to steps? So like, would I get 5000 extra steps on top the steps on the training images?
embed failure
no bad, how you put it in? with inpaing or upload mask?
maybe try ipadapter to make the floor follow the direction of the photoreference?
you already use comfy or was the first time? I dont use yet
https://openart.ai/workflows/congdc/material-transfer-for-room/5NUUyIbVeqF6dQJIM4ft
I follow this template, update a little bit
cool
could you explain more about this
ip adapter use a image as reference, you can try take the directions of the original imagem, or some controlnet like normal, or canny, somethink like this
What do peeps think of the new Stability CosXL model?
Supposedly it has better bright & dark image generation capability
looks interesting, i dont use yet but looks possible to chanche any material keeping the form of object
really cool! i dont know that can change the light
remember me IC Light, do you know?
I'm fairly new working with Stable diffusion, does anyone know of any good programs or services for viewing / comparing the grid images from Matrix xyz scripts. I know that it also spits out individuals, but Id rather keep a single massive image grid with labels and be able to easily zoom in and view. Thanks in advance
Id be careful with that, also only having hentai models to select from is kinda 
oh yea they came with the instal....

I can defend myself it uhh gives them better anatomy....
also the v13 one functions with non hentai prompts
Sd 3 will release to the public june 12 right ?
Your negative prompt is longer than the bible
Im sure youd do similar with a shorter one
most likely
but my pc can handle it

as long as my pc still breathes
then i can take atleast 1000 words more
also im downloading a non hentai model
tho its based on anime too
can you try it out when ive downloaded the new model
What GPU are you running?
rtx 3060 12 gb vram
image genration is wonderfull with it
text genration is also good
like llms
i can run an llm and stable difuson at the same time
Is there
A nsfw channel
Been using juggernaut xl
But I can't get it to generate any nsfw
Content
Wrong Discord.

Honestly it's too late to be relevant. All good XL checkpoints have already been trained on XL and "darker & lighter" is not enough incentive to start over on everything. (you can add CosXL to existing Models though... but since no one does that it's probably not relevant / worthwhile either).
Probably any other SD discord
oi can u try genrating somthing from https://745cc7e5b82302a5e4.gradio.live
doesnt work
does it give an error?
nvm it just took a long time and didnt show a progress bar

oh
its ussaly slow the first time
then starts going faster
also my new sfw model is slmost installed
can i see the image?
I closed the tab already sorry
Doesnt save it locally?
I didnt want it tho 
i do 8000 word negative promts
btw it seems like it lets me install extensions
Id be really careful with that too
What do you mean
Thnku for telling me
You pay with money
Midjourney is free?
Then tell me a bot which can ans with images
Not a lot of free options on the go. Unless you wanna set up SD locally.
Bing image generator is one
Kinda limited
With it's capability
bing cant genrate nsfw images
Please keep it sfw tho
oh yea
all models are nsfw,at work u should be working not generating imgs 
But its my job 
rip
oi whats the best image widt hight and gfg scale
depends on the model
If I am using dual gpus, should the batch size be 2?
Yeah I saw that, looks promising
achieving better performance just means the original unet for 1.5 is very undertrained, right?
either way, this probabl decreases required memory a lot?
or no
Id say so, yes
Im not tech savy enough to comment on that lol
me neither but thats what usually happens
good word
lets say I had 50 training images in kohya, and 5000 reg images... if I put the repeats of the reg at 1, will it automatically use 50 images? or still 5000?
helllo
Whats the difference between canny and lineart in controllenet ??? (I cant tell the difference in result)
Hi, I have an idea how to improve tiled upscaling but unable to create workflow.
Problem with standard tiled upscale:
- You can't describe details of the image because it will try to generate them in each tile
My idea:
- Use some kind of segmentation to caption every part of image (something like regional prompt), so model will know exactly what is in each tile.
But I don't know how to do that in comfy, I know segment anything, but it is only for specific thing. Maybe somebody has an idea what I can use for that?
I think canny (edge detection) is more general and works on anything, while lineart (line detection) suits better for drawing, sketches or anime
Think of lineart as more of an edge detect.
#🏞|general-with-images message
These are the result of the preprocessors for canny and lineart (realistic model). You can still see a difference
basically like you said canny for more details and lineart for more "simple" things
To be honest, i would say it is more or less based on the preprocessing. So if you want to keep the fur of the squirrel you might need realistic line extration, for landscape maybe other preprocessors work best. At the end both deliver primary information about the form and texture of objects.
I have a generated image and I just want to see what It would look when using another checkpoint should I use image2image or controlnet ?
to compare?
the same method and parameters I think
the same workflow just change the checkpoint, but must configure then how is asked by creator
@atomic gull
or putin i2i and choose one controlnet to generate a new one with your new checkpoint
Not like create same exact image but with different checkpoint (my bad should have said it better)
Thanks
^^
/imagine:logo PCO
spotify gives you 1 month free, making 3 months cost $23.98 USD.
Lol, spambots can't even spambot right
What is currently the best method for regional prompting?
With comfyui I would say ipadapter with attention masks.
I really want to get SV3D to work but it seems I cannot
Something specific to SV3D and SVD is not liked by ComfyUI for Intel Arc
how abt diffusers?
does someone remember where to put commandlines when u get an error?
like for example
--disable-nan-check
oh this is general my bad
🇸 🇩 3️⃣

“BitsFusion compresses the UNet of Stable Diffusion v1.5 (1.72 GB, FP16) into 1.99 bits (219 MB), achieving a 7.9X compression ratio and even better performance.”

Not just soon tm, but 5 days tm.
alegedly
that seems nice on the surface, but im really curious about the actual results tho, would love to compare. and then if it's really good as they say, would be nice to have that for sdxl and sd3, etc @gray fern
i think it's good research that could be used for future base models. its free research and soon source code too. might not be usable immediately by people but it is a solid proof of concept
true
so which is faster Hyper or LCM
I paid them for a month of premium, and they are notcrediting my account, and their help led me to a long chat on another site.
From the sounds of it, the model they are releasing is the most trained so far but still may require more training
it has to go to the gym still
I hope it becomes easier to animate still images
Can anyone recommend me a good model for feet? Trying to work feet inpainting into a workflow and getting basically no luck with the results
My ZenBook 14 OLED (UX3405) is using the CPU for rendering instead of the GPU. How can I adjust it to use the GPU for rendering?
any1 tried Story Diffusion
e
hello need someone help who know how to generate a good pixel arts with stable diffusion can someone help me plsae

Hi Chinese friends. Gracias por el -- the very good video generation models.
Find some checkpoint

promt
any chance to use stable-audio-open-1.0 with cpu only?
i mean how long does a prompt generation take with a average 4 core cpu?
this is surprisingly a good video explaining how to use stable diffusion even if its only about Loras
Has anyone played around with nVidia Tesla cards? They're headless, but I'm seeing a 24GB refurbished K80 GPU going for eighty bucks? Will this kind of hardware work with SD?
I've been waiting 5 months for this, but it turns out to be shit.
base model is shit what do you expected
if you dont want it to be shit just finetune your own
finetune is shit
So if base is shit, finetune is shit? I think you haven't use base SD1.5
i use pony
yeah sure Pony user
did you just expected Stability to put Pony stuff into SD3
I was expecting some progress
it is some progress if you did see the papers
even the quality of the image is not there
it won't be in the cut version
dataset cuts, I will not named it right away anyway
but architecture cuts? nah bro
plus 16-channel VAE and support of T5
I have seen other models on this new architecture
There are still other thing differentiated themselves from SD3, like tokenization, generalization, settings, etc.
Pretty sure what you wanted is community finetunes that it can just show what you wanted. But you never wanted to judge the book by its cover and dirty stuff nearby it.
We will see whether SD3 is as good as community expectation or not 4 days later
I've been doing this since SD 1.4
Hello. I was looking for a "I just find SD interesting role" but could not find one
the only one who has created progress here since 1.5 is pony
i dont think we have a role for that
sorry
Nah it's ok. Just pointing out that I might ask some stupid ass questions 😅 . I will google before asking tho
they didn’t offer anything new, it’s still absolutely inflexible
wait do you mean this server or?
I mean, is this the place to like ask, learn, find models, etc?
I just googled SD Discord and it brought me here
Good morning everyone! How are we all today?
not the best server to do that as this server is... technical and often filled with direct communication with SAI devs and discussions ( since this is official server )
I suggested you to look for Civitai Discord server, they are extremely active and you can ask for some models recommendation there too
Ohhh. Thank you. Yea that's what I am looking for. I just find SD interesting and want to mess around, nothing serious. Imma go to civit discord and leave here. I really appreciate your help friend 🫡
Oh, one last question before I leave
Civit is like 99% nsfw 🥲 I'm not looking for that. Do you have any discord server recommendations?
For normal, SFW SD promts, models, etc for newbies
ah no right now they have exclusive NSFW channel for their Discord server since... honestly their NSFW channel is much active than their #1072236442463518882 lol. But nowadays you can't see any NSFW part showing around in all of their others channels anymore.
Their website is still filled with the problem and the dev try working their ways with filtering system and stuff
Thank you sooo much friend. Appreciate the help.
Yeah, CosXL came out kind of late
CosXL is what SDXL should have been
Zero SNR, Cosine schedule, v-prediction; All things missing from the initial SDXL
what is CosXL, am failing big time trying to use easy difussion.. I made a clear sketch of what I want done and I would like AI to turn it in to a water colours painting , but I am getting pure nightmare fuel
would someone kindly help me in the right direction please?
Try StableSwarm UI
okay ill google it
CosXL Edit
Also controlnet could do it easily with just regular SDXL
With anyone of the edge controlnets
I feel like I need a little bit of more knowledge on the mater because idk what CosXL edit is, is it a tool? a model?
okay so I need to download this model and add it to my difussion right?
okay that, I got no idea what it is, so sorry for my ignorance
am not doing anything with anime though
I mean just use an edge controlnet and don't prompt for anime?
Replace it with water color
why does SD have to be so finnicky hahah
sometimes it's perfect
other times it takes so long to gen an image
frustrating!
am still doing something wrong , this is pure nightmare fuel still
what's a good amount of steps for training a person's likeness in sdxl dreambooth?
Hi! do you know a good model for graphic design logos?
What I can generate with my own stable diffusion is much worse than what civitai can with the same prompt, settings and resources.
Does anywone know why that might be?
Maybe some auto include loras or upscalers that I am not privy to?
Maybe different hardware, Xformers turned on or off, different model, different seed? And I'd think the pictures are cherrypicked ...
ya who want better, what a failure
I keep failing even with a good guide image idk what am doing wrong :C
there is like a billion ways to do things, personally ide use IPAdapter for what you are trying
ill google what that is
well i use comfyui and that a node for it, the creator has a youtube page
Latent Vision
am watfching a video about it
I named my SD temp folder "image-gen heap". This might be amusing to fans of british electropop.
Bonsoir à vous
Especially that last bit
Can you make it so prompts in only have a chance of affecting the image?
Like, "50 chance of winking"
((( )))
?
Prompt: a lady with blonde hair and green eyes, (((winking)))
Is there no way to create a TensorRT engine for non 1:1 resolutions? It's incredible how much it speeds up generation times but being limited to 1:1 really sucks
Is that like a 50%?
Do you mean 50% or better?
Does it make it have a 50% of applying the prompt?
Also do a closeup or it won't bother with face details
Do you mean at least 50%?
We are failing to comunicate
Writing "wink" makes it so the character winks 100% of the time (barring flukes)
Writing (((wink))) makes it so the character winks X% of the time
What does X equal?
Wink:0.5
what is dream booth?
i think that is 1.1x3 = 1.331
() = 1.1
(( )) = 1.1 x 2
((( ))) = 1.1 x 3
@buoyant steppe
So that increases it?
Is there another noob-friendly tool I can use for stable diffusion other than sd Web UI ?
I am trying to use the dreambooth extension but it seems broken
I installed fooocus and my sd forge and comfyui both broke and start downloading the original 1.5 checkpoint 💀 💀 💀
im more excited about the stuff people will do with it (finetunes, loras, controlnets, etc), but yes I can't wait 🙂
also, im curious about the training part too, how easy it will be, i recently got into training, so that will be cool too
Hello there 👋 Anyone know any free alternative to ChatGPT AI chatbot DeepGame? Looking for a simillar AI
Bring SD3 in the gym and train it. give it steroids so it goes form 2B to 16B
there is energy we just need to lower expectations.
probably 3-6 months after release itll be good
i dont expect miracles out of the box
remember SDXL?
it wasnt on release what it is now
will 2b even be better than sdxl 😭
Something that requires 16gb gpu! It's used to create your own checkpoints .
Everydream is what I used, and only requires 12gb gpu.
It does perfect skateboards! Also it can do photos porcupines eating pizza! (No other ai can do either inc sd 2.5)
lol
welp thats oddly specific
im more intersted in hands, eyes, and backgrounds
i want realistic backdrops
I tested it with things all the ai can't do lol
not weirdly sized gooey buildings
and deformed mutant weird crowds
even SORA struggles with floor sizes
AI doesnt get buildings
so frustrating there is always "something" wrong with every single image
emerging tech amirite
Photos of or drawings of? Or both?
im specialzing in realism lol dunnoa bout drawing styles
i did do some oil painting type stuff and looks nice
goes off to try sd3 buildings
yeah try
I've always thought pony did amazing anime style short 2 story old style apartment buildings; probably not what toy are after though
I'm waiting for sd3 pony to do apartment buildings 🤣🤣🤣🥰
nah i just mean buildings in general
lol
yes pony is for buildings
that was the whole point
How are the sd3 skyscrapers I linked?
Here's some others (hoping my links work) #artisan-3 message
Des ruines de maisons palestiniennes détruites lors de conflits, avec des familles déplacées regardant tristement les débris, représentant la perte de foyer et de stabilité.
Not sure if I should be asking this question here, but is it possible to feed stable diffusion existing real life photos and get it to generate a painted version?
why does 12 june look like ages away lol
img2img
cool, thanks a lot! is physicall or can be used on colab?
Either way for both of them. On your computer or collab etc. That is for either dreamshaper or everydream2
Checkout github for either
I will check, thanks
🙂
@frail sonnet I'm thinking of getting images of famous architects from the Brazilian architecture council and creating checkpoints to publicize Brazilian architecture
If you don't have a large enough GPU you can do loras instead 🙂
looks a good idea!
What size GPU do you have? Also how much ram?
man I have a notebook, poor gpu I use colab, i will show you on the other channel
yes
training loras sounds scary
can I take your colab? or must I take mine? how it works?
what even is colab anyways
what is this?
google colab is the cloud pc from google
You can train loras right in SD apparently! I use comfy though....
Hi, i need an help. Do you know a youtube tutorial that help me to understan ComfyUI?
i dont understand how people use comfy ai
its just so complicated!
im happy with my web ui lol
I love comfy!!!
y'all are another breed
but that's maybe bc it's the only one I've tried, and it works well for me
ok thanks
If the output is better why not learn
OUtput is the same in them all I think
I've gotten my images to look as good as the ones I used to make on mage.space, so I'm happy 😄
uh oh, SD3... comfy.....?
whats the problem?
I see complicate workflow so i thought that are more personalized
I hope comfy comes out with an SD3 version the day sd3 is released, or I'll ahve to install and learn another SD
I use A11 XD saw 2 class about comfy but dont started to use it.... yet 🙂
must come out with SD3 yes, is one of the mos important
A11 its not difficult
You can make crazy advanced workflows with it apparently, I'm still in the learning the basics stage
i used to hate comfy - i love it now
beside A11 and comfy exist other important?
i could never
like prompt writing is already hard enough
not to mention inpainting

I recognize that somethings its only possible on comfy
one fortunate thing with comfy is, you can use other people's workflows by just using their images
I mean if there is so much setting to do, i think that i could personalize the output much more
Most people don't use that much in comfy!
looks so cool!
But it's kinda awesome that you can eventually
I dont see it has a problem
Well at first, start small/slow is good 😄
yes hahaha ^^
make a sky in a bottle
or just use someone ele's workflow, either way, at first
the best thing, is you can open 2 windows, and copy/paste bits of a workflow from one to the other!!!!!!
I'm recording an A11 course for architecture in Portuguese
starting wrinting the technical information about what is SD
and I'm learning the thecnical things with it
@frail sonnet do you know some extension that i could use?
What do you mean by extensions? For which?
for comfyui
You download the ComfyUI manager, then open that in comfy, and hit the install custom nodes button. If will give a very extensive list of ones you can install.
the comfyui manager is probably on github (been a month since I installed it, so don't recall now)
For more info on any listed in the comfyui manager, they are all listed on github (but far easier to install via the manager within comfy)
Do the other SD programs have something similar to comfy's Manager that can auto install extensions etc.?
what kinda art do y'all make
Yes, for automatic1111 i have Civitai helper, tagcomplete, adetailer, aspect ratio, control net and open pose
also does anyone have a good copy and paste negative prompt they can send me? i used to have one and now it's gone
If i were to delete on thing from the world it would have to be uncomfyUI
looooool
ull make the jump
you will come to noodletown
its inevitable
If you stay in SD land - all roads lead to NoodleTown.
most popular ui is still auto1111 though. comfyui users just have this idea that a nodegraph makes everything better.
i think swarmui will eventually rise in popularity. which is comfy on the backend but it avoids the noods in the front
i just like the freedom with comfy tho, you can do so many things connection wise that would be impossible to all include "in the front".
and besides, im a programmer, so i guess i like these noodles anyway, they kinda speak to me directly :3
i think at the end of the day, it's really what you use SD for anyway and then figure out what tool to use to that end.
and even if not for that, i like comfy cause it's very memory efficient, updates very regularly, also is usually the one with the latest cool stuff
(plugins, etc), so when you combine all that, it just makes it a better candidate tool for me
I mean I started with automatic1111 as i guess most people, and there is nothing wrong with that, but eventually you want to play
with some complex workflows or try the latest toys 🙂
and there are also custom nodes that remove the noodles
and make it very compact
lol
says the person who didn't floss to begin with
they just need to ask 🙂
idk but maybe some of the dataset can be collided with yours
Does anyone here actually use swarmUI and is it any good??
I tried only a pretty short time, it confused me a bit ...
is there any easy way to change clothes in video with stable diffusion or any other tools?
hi boys and girls, im looking for AI that will make my still images have a little bit of motion so they look like they are shot with video camera
good morning everyone
I am failing at getting an output at all
I would like to crop my dataset very tightly to the subjects, but that'd create a lot of really weird aspect ratios
in fact, I used to do that a lot, but started cropped to 1:1 and 3:2 in order to reduce distortions
but in SD3 will that still be necessary, or will it be able to handle training on weird ARs?
isn't that what bucketing is for?
yes, but some people claim that you should limit your buckets, because the model will get confused and start stretching things.
I wonder if SD3 will still be confused by that
some people say that images in the 3:2 ratio can only help the model when you're rendering at 3:2, and that the model will only "draw from" images of the same bucket that you're currently (||what's the correct term, not ||rendering).
when am training am missing a lot of deps appearently wtf
its failing one after the other prompt asking me for some other dep
yes sir
its been like 3 python modules missing in a row
ModuleNotFoundError: No module named 'einops'
this is the last one
am using Lora Training in ComfyUI
ModuleNotFoundError: No module named 'cv2'
hohoho
// input
pip install cv2
// output
ERROR: Could not find a version that satisfies the requirement cv2 (from versions: none)
ERROR: No matching distribution found for cv2```
now am defeated
"RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check"
I'm using AMD's RX5700XT, and this error occurred. How should I fix it?
edit the batch file and add the line it tells you in the commandline args
But it uses the CPU for rendering, and I want to use the GPU.
What software are you using?
likely wrong torch version
are you using AUTOMATIC1111 or ComfyUI
Comfy UI has a module that allows you to train
oh right, AMD card
AUTOMATIC1111
@slim thunder
Follow this ( AUTOMATIC1111 )
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-AMD-GPUs
Yes, I am based on this article
I tried adding --use-directml, but a new error was reported, "AttributeError: module 'torch' has no attribute 'dml'"
Uh, checkout CS1o's solution
did you manage to train lora with comfy? I am currently struggling on it and I can't seem to find people using it outside of yt tutorials, its not like corner cases are in the tutorials...
Hi together,
I am quite new to all that stuff around AI. I am more of an infrastructure guy and that's probably why I have a server with 2x Tesla P40 here for my AI projects.
Beside problems like that I am not really able (yet) to properly train an embedding, I am struggling on my setup.
I'd like to use both GPUs for one sd instance in order to speed up trainings. I think, that DDP might be the best approach for it, but I cannot figure out how to make sd automatic1111webui use both GPUs.
Does anyone has some experiece in that multi-gpu topic and is able to help me out?
Oi
I need to create images where should i go
Hey anyone??
Uhm
Bro i need help@vale steeple
You can start by installation of application and requirement
I also need to create images
There is no specific chennal to do that ?@vale steeple
#🤝|tech-support literally
I can’t wait for SD3, it’s taking forever!
2 weeks
am failing to install the Kohya SS GUI it finds no module torch, anyone had something similar? do I just install the pip globally?
I haven't installed that one yet, working on it though.
Did you do all the steps including creating that one windows file?
I did everything on this tutorial even commented on the tutorial hold on
this one, I also helped people with similar earlier issues to mine
Not yet, I have to figure out which of the 40 files goes where. Fortunately I used github soltware so they are all on my computer, waiting...
I'm nearly out of data, not watching vids until Wednesday
got it
Are you creating images on the discord and/or via the api with SD3 for now? 🙂
A few days for SD3, then a couple more days for some models to pop up...
I'll show the page to Gemini later on and get them to walk me through it
whos gemini ? o.o
I use SD2 local
But but, SD3, new shiny fancy fun (also it "listens" better
is he like someone super knowledgeable here?
It's the newest google chatbat, 2-6 months free trial. It helps me with coding, since I don't know crap about coding lol
That’s why I want SD3 local

SD4 better
I want SD3 local to I can create backgrounds with SD3 Pony 😉
It's a joke from yesterday, anime does the best buildings as backgrounds
I just like the unlimited image creating, the faster image creation.
And the random prevention of image creation not happing.
oh well... about that
I've found my randomizer prompt doesn't work on SD3 discord 😦
no SD3 pony for us 😔
only SD3 horse
SD3 XL
How long does it generally take the model creatores (checkpoints and loras) to create them after the weights are released? 🙂
realism models like 1month,good anime models take longer but sd3 can do anime already so prob less
I will take the bet they will in 1 day for LORA
Though SD3 is pretty awesome on its own as it is, but...
if they release LORA code at Day 1
if its censored,then adding back the removed stuff is probably gonna take like 3months or more
finetunes obviously take longer
SD3 wouldn't let me create "2 anthrogurry men kissing", so the wait is going to be loooong lol
nah that probably due to online generator censor
local would be much open that this
kissing is fine,its the other stuff they remove
... to be honest i also think your prompt is pretty sussy
The kissing didn't even go over well, has to resort to SD2.5 lol
I wanted an image for gay pride, usually 2 people kissing is fine....
It probably didn't like the anthrofurry aspect
no offense, you can try switch it to normal human
everyone does normal human though, yawn
u could try with something like "man with horse legs and brown furr covering his chest"
its all good, I got my image with sd2.5 , but I have concerns about even more spicy prompts with a local SD3 install
we will see at July 12
I'll just have to make my own loras if need be 😄
I'll test it out to its limits July 12th 😄
honestly people here and actives in Civitai don't typically like NSFW
especially furry
but what can I say
As long as 8gb GPU is enough to run it!!!
that is the point of open source model anyway, you can't turn against people who use it to train something they like
minimum requirement is 4GB
civitae and don't like nsfw, in the same sentance???!!!
got confirmed by a SAI staff
actives in Civitai
SAI?
Stability AI staff
Anyone online good with some trouble shooting? May need some help ://
SD3 needs loras for fingers and toes! (as in the correct number of)
If prompt based maybe, if code based I recommend Gemin lolol
Check the tech support chat, raised my issue over there 🙂
Just a few more days. This feels unreal. 😄
2 more weeks and we done
flowers
looks like the randomizer got stage fright
hello can someone help me do my first model with kohya ss
Is there any open source image model recommendations?
Other than sdxl 1.0 and pixart
yes 1.5
2 weeks 
2 weeks for the good models to come out 
no
there will be a good model right at release
where is it so i can test how good it is
I don’t remember the name, but Stability gave someone model for finetune
so there will be finetune on release
real and true 100
I downloaded Kohya GUI UI V24 but theres way too many options I got no idea what am doing
theres like thousand tutorials all UIs look different
Forge will be EoL for normal Users.
Its recommended to use Auto1111 or Comfyui or Fooocus:
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/801
yes you can find the sd3 download link in the announcements channel
Well I'm a bit confused
I go there and it takes me to a docs page instead
Also, I'm not seeing a download link, just a sign up link
Looks like it is only available via API and not for download
none of that is for me right?
Oh ok thanks
we are not the 12 of june yet what are you talking about
I'm already there and it's dangerous
this sort of stuff always happens when people are anticipating a release. Where questions are asked, trolls will show up and spread intentional lies. Just for their own sadistic amusement. Looking at you @onyx elbow . Pretty typical low effort troll. Unimpressive really
lol, i thought i missed something, thought maybe the put the early release link on discord
I have an email alert set up since i'm on the waitlist. Emails will probably go out when it's released. The waitlist i dont think is for anything else at this point. Only a few got selected to preview that signed up. The free preview plan was scrapped
Heavily depends on type of checkpoints, Pony for example takes 2-4 month to train, but there are also months of preparations.
say, can you set SD to add more generations in queue?
Damn I forgot I was in this server
Hi can someone link a youtube tutorial to comfyui?
Do you mean the SD install on your own computer? If so, most definitely. I can screenshot the comfyui version if you like.
Thank you, is it noob friendly?
WARNING: Someone has been infecting comfyui users with malware via malicious nodes: https://www.reddit.com/r/comfyui/comments/1dbls5n/psa_if_youve_used_the_comfyui_llmvision_node_from/
The malware tries to stealing credit card and banking info
It's damn good ... if you start with part 1 it should be noob friendly and you will learn a lot of basics
unless you're anti-ai and anti-open source, attacks like these are a bad thing
so generating images here is not free anymore?
why
like how it used to be with @radiant meadow
because they aren't pentesting/security research projects to encourage better security. Its literal malware trying to steal your money
he just wanted to earn money, he's a person like the rest
so stealing is the move ur saying?
this is a problem of states, not ordinary people
btw is the main comy ui compromised or just those nodes
this is still safe right
https://github.com/comfyanonymous/ComfyUI
thats the one i see utubers donwloading in tuts



