#💬|general-chat
i've got 16gb and do alright
8gb here, been strongly considering just going for a 24gb card
if it fit in my case and my psu could handle it i would've already lol
shit 3090 then
orrrr. riser cable and a second psu
just strap that 4090 like baboon heart hanging off the side of the case
probably safer for the power connector anyways
heohhhh
but seriously wtf is SD 1.6?
all my google dives bring up information about automatic1111 version 1.6
WHAT?
I can make money doing that? :D
Hey
I get an error when I try to install dreambooth and run the webui. It says "CUDA... bitsandbytes", something like that. I use Windows 11
Does anybody know how to solve it?
Yes, I do.
Yeah I mean....in other communities there's usually a jobs channel or similar where you can post job offers.
Can you help?
what do you jerk guard
Yes.
OK what should I do?
Ok?
Okay what should I do? You said you can help.
That’s correct.
How?
What?
Okay bro
Hey
I get an error when I try to install dreambooth and run the webui. It says "CUDA... bitsandbytes", something like that. I use Windows 11
Does anybody know how to solve it?
Try adding --xformers into your commandline args ^-^
me too
It makes me uninstall everything if it doesn't work, I’ll try
Useless. I've tried
This error appears as soon as you add the dream plugin.
Bitsandbytes did not support Windows before, but my method makes it work on Windows. (yuhuang)
1 Open the folder J:\StableDiffusion\sdwebui, click the address bar of the folder and enter CMD
or press WIN+R, type CMD, press Enter, then run cd /d J:\StableDiffusion\sdwebui
2 J:\StableDiffusion\sdwebui\py310\python.exe -m pip uninstall bitsandbytes
3 J:\StableDiffusion\sdwebui\py310\python.exe -m pip uninstall bitsandbytes-windows
4 J:\StableDiffusion\sdwebui\py310\python.exe -m pip install https://github.com/jllllll/bitsandbytes-windows-webui/releases/download/wheels/bitsandbytes-0.41.1-py3-none-win_amd64.whl
Replace J:\StableDiffusion\sdwebui\py310 here with your own SD venv directory (the folder that contains python.exe)
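For anyone who didn't follow the steps above, here is a minimal sketch that only builds the three pip commands for a given venv folder and prints them. The helper name `bitsandbytes_fix_commands` is made up for illustration; the wheel URL and the example path are the ones from the message, so swap in your own folder.

```python
from pathlib import PureWindowsPath

# Wheel URL copied from the steps above.
WHEEL_URL = (
    "https://github.com/jllllll/bitsandbytes-windows-webui/releases/download/"
    "wheels/bitsandbytes-0.41.1-py3-none-win_amd64.whl"
)


def bitsandbytes_fix_commands(venv_dir):
    """Hypothetical helper: return the three pip commands as argument lists."""
    python_exe = str(PureWindowsPath(venv_dir) / "python.exe")
    pip = [python_exe, "-m", "pip"]
    return [
        pip + ["uninstall", "bitsandbytes"],
        pip + ["uninstall", "bitsandbytes-windows"],
        pip + ["install", WHEEL_URL],
    ]


# Example path from the message; replace it with your own venv folder.
for cmd in bitsandbytes_fix_commands(r"J:\StableDiffusion\sdwebui\py310"):
    print(" ".join(cmd))
```

It only prints the commands, so nothing gets uninstalled until you paste them into CMD yourself.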
I saw such a solution but I didn't understand it
+source
Anyone know how to make SD stop putting umbrellas in rain scenes? Tried umbrella in neg prompt and no umbrella in pos prompt...
why not giving me the right result????
#1100170312106127410 message
Hello!
So I got a job offer for illustrating a kids book with AI and they asked me for a price per 10-20 illustrations. Since I do not really know the prices from that field, what do you think that are some good prices?
Apologies for jumping in here... but I've only just started with SD (using A1111), and I see quite a few posts about SDXL/SD 1.5. May I ask, why 1.5 is better for you?
I can guess; It's a more mature product.
While SDXL is still very new so its addons, plugins, etc aren't as developed yet.
For me personally my favourite checkpoint still uses that version
Is this free to use or not?
hey guys
do you maybe know a way to remove metadata files from an already trained LORA?
it's a bit important and urgent
I love SDXL. I use it for everything except controlnet and animatediff.
That makes sense. Thanks!
Are you using SDXL via A1111?
I moved to comfy for SDXL.
Learning curve is low because you can just borrow json from others. Find a working flow and tweak it, instead of starting fresh.
Lol I personally can't see how comfy with its geo-node-like interface is supposed to be easier to use than A1111 :D
As someone who has installed A1111 for some friends it has basically no 'learning curve' aside from the usual prompt crafting and such. Just having a prompting box and a big 'Generate' button is pretty intuitive :D
I must admit, I'm not a fan of nodes... I never know which one I'm supposed to use and where.
I can recommend A1111 :D
Because prompt crafting and knowing stuff like CFG and Denoising is already 90% of the image, and I think the node noodles would only distract from that
Having a proper prompt is like already 80% of a good image
Why do people like kitchen sink models so much? :O
What I love about Stable Diffusion over any other project is that you can change the checkpoint to be for the specific subject you would want ^-^
Good morning, everyone! How are we all today?
Moin ^-^
I will be checking out ComfyUI just because I don't like being biased without at least trying it out :D
But I have my doubts it's the interface for me
Waves
Well, you can always use #🐝|swarm-ui
It's a combination of both
Currently I am using A1111 with great results. But I'll be trying it out :D
There's no harm in at least seeing if they fit my needs ^-^
Can somebody teach me AnimateDiff vid2vid on stream?
I've tried but there are a few issues in the output
Hi guys, I have a little problem, maybe some of you had it too. I use ComfyUI and every time I start to render something, the PC freezes right before the KSampler step starts. It freezes for about 5-10 seconds, then everything is ok. What is the problem? Is my RAM too low? I'm on 16gb.
Is there a function (that I can use on google colab, not a webui script) to merge checkpoints?
so whats everyones opinion about sam
How do I hires fix in that :D
https://github.com/Stability-AI/StableSwarmUI/blob/master/docs/Basic Usage.md You can also find the tutorials we did on ComfyUI here if you want to get more familiar with it: https://www.youtube.com/@stablefoundationai/videos Scott also has a lot of great videos on ComfyUI: https://youtu.be/CxB47DMEyYQ?si=d27J_HPDSLP8eWT0
As far as I got with my own research it's replicable with a 2 pass text2img, which requires a custom Comfy workflow :D
Thanks so far for the infos ^-^
In stableswarm I am always getting the warning that my embeddings do not exist :O
Even when I select them right from the UI
To elaborate on that: It seems in the prompt they can be loaded, but not in the negative prompt :O
Np! If you have any questions, or problems, I'd refer to #🐝|swarm-ui
hello, i saw today that this server has text to video. does anyone know if there is any image to video with a prompt?
Hi! Has SDXL recently become more heavily censored? I'm a romance author and use it to create bare chested men for book covers. I didn't have any problems before but now it's not allowing any bare chested men to come through. If this is the new reality, SDXL had just rendered itself useless for my needs.
Never mind. I've found what I need with SDXL using Night Cafe (and I have a ton of credits with them from before). They have an SDXL model. They've upgraded a ton of models, adding their own secret sauce.
Hi, I am trying to apply AI to replace a series of steps I need to do manually in Blender. I start with a mesh representing a "sand trap / bunker" (golf game). What I want to do is alter the edges of a specific mesh without changing the border itself, just the vertices inside the border close to the edge. I am hoping to create variability in the edge of the bunker by doing this. Is this something that Stable Diffusion could be adapted to do? https://drive.google.com/file/d/11P3CQ1N8ro14jqWPI9l6CyiWNmmHTOAB/view?usp=drive_link The two matching faces (silver and yellow) would need to stay fixed. I would be interested in extra rows inside this "border" and altering z values etc. to make the edge appear to be a bit "wavy"
if you're using it through a service like the bots here, yeah. it will always be censored. It's not smart business to create liabilities.
What is everyone using now? automatic1111 hasn’t been updated in months
TypeError: Accelerator.__init__() got an unexpected keyword argument 'logging_dir'
guys what is the problem
I'm using the cloud website version. Thanks!
No problem! :-)
A1111 works really well :O
And are you sure about no updates?
weird bots are weird...
automatic1111 gets tons of updates on the dev branch. The community started getting pissed because all these dumb 1 button install scripts would put "git pull" in their launch, so whenever the repo had any changes, they'd get pushed to everyone's installs and break everything.
both sides to blame here. Automatic should've had it set up with proper releases, but also 1button install writers shouldn't have put "git pull" everywhere because most people have no idea how to manage python dependencies and discrepancies. Troubleshooting is a huge time sink on a community and it stalls momentum out. Those developers who wanted to create something easier actually caused a lot of damage.
So Automatic1111 started releasing a proper release cycle. 1.6 stable. Everything else is pushed to dev branch. Development still occurs daily, but most users don't catch it until major releases.
Extensions also always release, which are considerable updates to the UI
https://github.com/AUTOMATIC1111/stable-diffusion-webui/commits/dev actually, it does look like it's been a couple weeks for the dev version. but i'm also not sure what new versions or features need iterating at this point. there's tons of other branches people are working on.
hello where i can find good prompts examples
I personally like browsing civitai on occasion. You can check the images that use your favourite checkpoint and see what kind of prompts people use ^-^
i did and didnt find anything good
i want weird type of style
basically a mixture of realistic characters and beautiful background(nature, buildings, sky, effects) and ancient medieval like style
like in movies related to medieval arabic wars
or the vikings series
https://discord.com/channels/1002292111942635562/1011743094309396631 can help, also experimenting, the good old trial and error, that's how you learn
Lol help
What do you need ^-^
OSError: Can't load tokenizer for '/content/data/realistic'. If you were trying to load it from
'https://huggingface.co/models', make sure you don't have a local directory with the same name. anyone know why it's happening on dreambooth
you tried downloading it from hf through the api once, but didn't let it finish and it closed early, leaving a partial corrupted file. so now when it tries to load that file, it can't. delete it and try again. it might be saved to the appdata folder %localappdata%\huggingface but i don't know how the db extension does it
i just load the models manually myself. i got my own folder
civitai. seriously.. maybe not what you want but what you want isn't the only thing under the umbrella term of "good"
instead of just copy pasting prompts, look at how they're doing it so that you can figure out the language and get the prompt you actually want
Hello guys! What's the best (and preferably fast) method to do regional prompting in Automatic 1111 at the moment? E.g. if I want to generate a picture with two distinct characters.
HOLY SHIT, nevermind that. don't use civit to learn prompts. i just opened it up and it's all hardcore porn on the featured top 50
beyond hardcore.. like woah buddy.. we're talking about socks like how they turn inside out. top post this week. wtfffff
There is a filter available at the top-right part of the screen, though some lewd art still slips through
i opened the filter and it said "videos or images" and that was it. so i turned on only images and it's all teenagers showin boobs
saw a girl showing her ankles there even with filter
donnnnnnnnnnt send people ot civit owahhhwoahwoah i have not prompt explored there for some time
i'm not a fucking prude or nothing but WOAH
lots of butthole closeups
wonder if openart will last now that altman's ousted
No, there is a Browsing Mode button next to your account icon in the top-right corner.
plus ppl who arent logged in cant see nsfw
It has an icon of an eye crossed with a single line
oo openart.ai moved away from openai long ago and is focused more on other systems now
@pale latch I have just found a picture of you on CivitAI: https://civitai.com/images/3587104
Lol
problem with filters is they'll block softcore and hardcore and extremely hardcore indiscriminately. 13+ 17+ 18+ on the blurs, none of it is meaningful. trying it that way and it's a total fucking minefield. so you don't show the content at all and then you get zero babes at all because you're either into XXXXXXcore or nothing. no between. i've used civit with the filter off for a while. it was never like this
her legs are showin
dude i got a pornstar model trained on civit. stop clowning yourself
no the panties part
you know why i got this profile pic? cause i'm too sexy for this shirt. so sexy it fucking hurts
hammer time / / / / / / ...... \ \ \ \ \ \
you know what isn't sexy? fucking goatse
True. Unfortunately, these age ratings don't matter much. Often absolutely vanilla pictures (even without actual nudity) get 18+ for some reason.
i bet theres a lora for that thing 😬
#1 thing I don't want to google for today
https://i.imgur.com/xBfjy3K.png for the training data. risky click of the day
The keyword (duo) works great, especially with landscape aspect ratios ^-^
Kek. Is this really in Bioshock?
I'll try that, thanks. But as far as I remember, it still tends to heavily bleed the traits between the characters. Gotta try Regional Prompter, as I remember it working fine for me. But I thought there are some newer methods.
Might be because I use a furry checkpoint, but the posts at least can be inspiring for a few key words :O
But over all I would try to look into the basics of prompt crafting. Lots of the prompts on civitai are just badly constructed on top of the other issues
Yeah its doing that alright :D
SD is not actually good for more than one character, so its great that you can do it at all ^-^
it is but i think it's just a design coincidence. another gem from the vaults. https://external-preview.redd.it/iheHchyBdzhQtGlu68za10uKhV878xc7Qm3iC6IGocQ.png?auto=webp&s=e93853cb441468fa0a9e0d210af570c22295e69a
"Gem"
😳
As for this "regional prompting" (or whatever you would call it), I've seen a pretty interesting workflow using the IP Adapter and ComfyUI: https://www.youtube.com/watch?v=vqG1VXKteQg
But I'm not sure if it can be 100% applied to Auto1111
Gotta look into that
what can i write to make the image show more content
i dont like zoomed in portraits
(far shot) is a great keyword. Other than that look into photography perspectives ^-^
I usually use something like "full body view". Also, if you are generating a character, it often helps to specify the body part that is not getting into the picture. E.g. if you describe both footwear and hair of the character in your prompt, it's more likely that you get the wider shot as a result.
Haha I also use (full body view), but often times I specify (feet wide apart) to draw attention to that ^-^
i want more than full body
i want to show the whole scene
with environment
and buildings
should i write 16:9 in prompt
Which turns out to 800 width and 450 height before upscaling ^-^
Whatever UI you are using - there should be a direct way to control width and height
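The 800 by 450 numbers above are just the 16:9 ratio applied to the width. A tiny sketch of that arithmetic, in case it helps (the function name is only for illustration):

```python
def height_for_aspect(width, aspect_w, aspect_h):
    """Return the height that gives the requested aspect ratio at this width."""
    return round(width * aspect_h / aspect_w)


# 16:9 at 800 pixels wide, before any upscaling.
print(height_for_aspect(800, 16, 9))
```

You then type those two numbers into the width and height controls of your UI instead of putting the ratio in the prompt.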
it still give me portrait haha
i dont want the face to be zoomed in
i want people to be in the background
Which UI do you use?
automatic 1111
Yeah that has easy Width and height controls :O
Nice
they still give me portraits
Oh boi. I hope that's the ratio after hires fix?
check dms
What model do you use?
colossus project xl
Anyone know where i can find a guide to make lora?
Yeah, sorry, cannot really try it. My config doesn't work with XL models well. But in general, if you want to see something, you should describe it in the prompt. I know it sounds pretty obvious, but it works most of the time.
So if you want to see a city landscape, describe it as a "far shot of city landscape", or "wide shot of city buildings", or something like that.
Satya: Will you return if I fire Ilya?
Sam: Yes.
Satya: Hold my Masala chai
HI everyone
I am new to the field of Stable Diffusion.
I have seen that you can take an image and change the outfit or style and so on. Let's say I have my image and I just want to give it Captain America clothing in the same image, or let's say
change the background or look.
Please guide me on what I should check or learn, what models I should use, and what prompt I should use.
/dolphin
That's inpainting I think. You can do that in the Image2Image tab in A1111
Good morning, everyone! How are we all today?
If you want to learn more about prompting, I suggest checking out #📝|prompting-help as well as #1080946152318443610 !
Moin ^-^
I am bursting at the seams with enthusiasm
Me too! I'm just having some fun with the bot and so forth. What's up?
I am currently unraveling ComfyUI and StableSwarm and using some of their benefits ^-^
Nice! How are you liking it?
ComfyUI is almost brutalist in its refusal to have an intuitive UI. It's technical to a fault in my opinion
StableSwarm does a great job in providing ComfyUI with an actual UI. It's aaaalmost beginner friendly :D
All in all I am grateful that A1111 exists as an entry point into the technologies
Yup, A1111 is still unmatched for me
I'm sure though that ComfyUI is great for complex workflows, e.g. if you are using different models at different steps during the generation
And stuff like that
Also, it is a great and fast backend for other projects
Right now I'm simply amazed by the Krita AI Diffusion plugin
Regular generation is a bit too slow for me, but live painting with LCM sampler is absolutely great
Except for the very specific case of using two or more models during multiple pass text2img... I am struggling to imagine any workflows that actually benefit from all this technical stuff instead of focusing on better prompts and such
Well, for me it was also interesting to get under the hood of Stable Diffusion and learn how it works (even just a bit)
But yeah, A1111 is more than enough for 90% of ideas
Ahoy
I'm working on my master's degree thesis and I'm researching the stable diffusion method
Currently I'm writing about sampling methods
so, are there any resources where I can read about them more?
and, yeah, I've seen this one https://stable-diffusion-art.com/samplers
but it seems too casual and too much of just an author opinion so I'd like to read something with more formal/professional attitude idk
ofc, bc that's a beginner guide 💀
whats the best way to use sdxl nowadays?
ofc, that's what i've said
"of fucking course" why be hostile? forget it. gonna have to doubt that you're a phd candidate at this point. just weird flexing
bro, chill
naw
I literally said that stable-diffusion-art is the first thing that popup, yet I can't really use it
let's keep it civil please
they called a mod here because i revoked my advice after hostility? weird
nobody called any mods, I was just keeping a look on the channel and saw all this
Ofc is an abbreviation for of course.
I had no clue that it could mean other things tbh
@silver inlet there's tons of documentation and info about samplers online, a simple google search could help you get started
yeah, but I thought that asking for some help here could be really useful
that's fair
it's a somewhat complex subject and depending on what you are doing, it's best choosing ones over others
check graydient.ai guide
ofc has always been a hostile way of saying "no fucking way?!" in my vernacular and experience
let's get past this
yeah, but as I've seen it, there r a lot of literal developers here so yeah
thank u tho
got u, my bad
just chill, I didn't mean nothing like that
we are past it. lets stop doing weird mod flexing. i was explaining where my understanding came from. now we understand . there wouldn't be that if i never posted it.
I didn't call for mods or something 💀
if you are asking about specific samplers or combinations used on the bots or apis, that's proprietary information that's not disclosed publicly, but as I said, guides like the one at graydient are good for understanding the differences
ok, thank u, buddy
you can also check the guide made by one of Stability's QAs https://www.youtube.com/watch?v=N5ZAMa3BUxc
is there a tag to tell the bot NOT to do a video?
yes, if you press tab after prompting, it will let you choose image or video
thanks 💕
at least will be useful for better understanding
lolo
How can I get more roles in discord?
How on earth a gay couple kissing on a bot room not considered as nsfw not been blurred out? 🫨
how is people kissing nsfw
It's the idea of tolerating the general age here
literally never heard of "ofc" being anything other than ofcourse
is there a particular model that's best for generating things like store interiors?
Good morning, everyone! How are we all today?
I suggest that you learn more about prompting--You can get help in #📝|prompting-help We also have #1047197565365538826
does stable diffusion and stable audio has affiliate or referral program, i want to promote these amazing ai products.
xD
Wae. Since I can now do multiple model two pass text2img my images are waaay better quality than before. So thats a plus ^-^
My floofs are way more floofy
Do tell! And fur=always good! I love drawing it, and creating more of it!
Im gonna send one or two examples on #🏞|general-with-images :D
Awesome!
Hello. I just started using Stable Diffusion 1.6, specifically text to image. All my images are coming out blurred. How do I address this? Any help would be greatly appreciated by this newbie.
Are you using A1111/ the general web UI?
Not sure, I just used the prompt field that first appeared when I logged on. I am not a tech person; I was using Night Cafe but was frustrated by their limitations. Do I have to know how to code to use this program? Thank you for the quick response.
Hey, where do you use it?
On which website or program
I did a search through Microsoft Edge, clicked on the Stable Diffusion link which took me to their home page.
I am using a Macbook Air.
can somone explain me how this works again?
What's the site URL? The official stable diffusion website for generating images is
https://beta.dreamstudio.ai/generate
If you're on
https://stablediffusionweb.com/
That's a non-official site that pretends to be from stable diffusion.
We don't recommend using it as it's false advertising
You can use the bots in #1100170312106127410 for free here
its kinda a lot
yes
Check this guide for the Bot usage:
#1100170312106127410 message
Thank you for the tip on dream studio, this is what I was expecting. Great help!
No problem 🙂
Also here is the #1025467151206854736 channel
My recommendation is to check out the FAQ here, where you can find the information on support directly here for StableAudio: https://stableaudio.com/faqs You can find more information about contacting Stability here: https://stability.ai/contact
Hi folks 🤗
For using SDXL you only have to download any XL model , nothing to install right ?
Hey, to use it locally you have to install a webui
I already use A1111 with 1.5 (I forgot to mention it). Do I have to install another one for XL?
Nope. Auto1111 works with sdxl but you may need to edit COMMANDLINE_ARGS in your webui-user.bat, depending on your GPU
For 8gb - 10gb vram use:
--xformers --medvram-sdxl --no-half-vae
For 6gb vram use:
--xformers --medvram --no-half-vae
sure, I'd suggest hopping into the https://discord.com/channels/1002292111942635562/1025266140445933648 channel. lots of AnimateDiff users there
thank youu, that makes total sense
Ok thx I have 12gb
Then you only need --xformers --no-half-vae and if it takes multiple minutes for an image then add --medvram-sdxl
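The VRAM advice above can be written as a small lookup. This is only a sketch of what was said in chat: the flags are the ones quoted, but the exact cut-off points between tiers (for example what to do at 7gb or 11gb) are an assumption, and the function name is made up.

```python
def sdxl_commandline_args(vram_gb, slow=False):
    """Return a COMMANDLINE_ARGS string following the advice in the chat.

    Assumption: tier boundaries between the quoted VRAM sizes are guesses.
    """
    args = ["--xformers"]
    if vram_gb <= 6:
        args.append("--medvram")
    elif vram_gb <= 10:
        args.append("--medvram-sdxl")
    elif slow:
        # 12gb and up: only add this if an image takes multiple minutes.
        args.append("--medvram-sdxl")
    args.append("--no-half-vae")
    return " ".join(args)


for vram in (6, 8, 12):
    print(vram, "gb:", sdxl_commandline_args(vram))
```

The result is what you would put after `set COMMANDLINE_ARGS=` in webui-user.bat.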
Last question bro (I swear) :
I heard XL is superior to 1.5 in prompt analysis, with better output (these MJ Niji ripoffs look awesome), better consistency and less need for controlnet, but the cons are higher power requirements and less versatility due to far fewer LoRAs. Is that correct?
@warm junco hi can I ask if you know the solution for this
cv2.error: OpenCV(4.8.1) D:\a\opencv-python\opencv-python\opencv\modules\highgui\src\window.cpp:1266: error: (-2:Unspecified error) The function is not implemented. Rebuild the library with Windows, GTK+ 2.x or Cocoa support. If you are on Ubuntu or Debian, install libgtk2.0-dev and pkg-config, then re-run cmake or configure script in function 'cvDestroyAllWindows'
I did the solutions from google but I'm getting the same error
this happens when I try to use facefusion in stablediffusion web-ui
good morning! I hope this is the right place. Is it possible to dl resulting images from the blackmagic websites that keeps the prompt text as the filename?
""Photon"" teaser https://www.instagram.com/reel/CxNu93cMlDq/?igshid=MzRlODBiNWFlZA==
Written and directed by:
@anya.koshka.neon.
@koshka.neon production, together with @osvaydercinematics, @brian.mitro (Spirit view) and @valentyn_grosu for the project @whitemirror_xyz, created a teaser for the upcoming film "Photon" using Unreal Engine 5 and artificial intelligence tools including the program @move_ai_ for animation and @stablediffusion for textures.
The film, which is currently being worked on, tells the story of Jeremy, a young guy from the USA. His strange visions give a special understanding of the nature of people's desires and motives and, through fear, lead him to discover the deepest mystery of life related to photons - elementary particles.
The teaser was presented at the 80th Venice Film Festival in 2023, and the film is scheduled to premiere at one of the major festivals in the United States in 2024.
Ive always seen it as the snarky sense especially when it's in a one sentence post with no added context.
Witnessed a misunderstanding once before someone thought it was the calm version but their boss who said it wasn't calm at all. Ahhh text . That company was a shit show though.
Nope sry, you can still ask in #🤝|tech-support but include more info like what webui and extension you're using
Sdxl is very good at all that but as you say it has fewer loras and it needs more hardware performance
So like, is stability going to hire some of these ML experts from the openai firesale?
seems like it's an everything must go situation over there
How to add Stable Dreamer to my Server
you can't, it's only for use on our Discord Server.
hey there, so I used to use the Stable Diffusion Photoshop plugin on mac and had a style nailed I didn't intend to modify, but I lost access to Stable Diffusion 1.4 and 1.5 due to updates. Is there anyway to access the older versions or are they no longer available anywhere?
can we do img2img on here?
they are deprecated for anything that uses the API (like a plugin), you can still download and use them locally
thank you
You can also find models in #1047197565365538826
guys I have a problem with my trained dreambooth model that I converted to a lora. it's creating only face photos, even when I put standing, full body shot etc. in the prompt. What could be the issue?
overtrained?
hi guys, i am a new fan of stable diffusion. i trained my model with an embedded name, how do i trigger it in my prompt?
it still shows someone else face
Hi I have a request. I want to upscale my old videos (noisy and dark) but I can't because I have an Android and it requires software like Topaz Video Enhancer or HitPaw. please help me if you have one
what is the latest stable diffusion model ?
how to prompt to stop video generation?
#💬|general-chat hi folks, new to stable diffusion. i have installed that in my local server. need to train that for one purpose. can anyone guide me ???
hi brother, i just want to train stable diffusion on my local machine for this kind of result, can you guide me? https://i.pinimg.com/564x/9f/b5/83/9fb583b48f7c171388738c8497627586.jpg https://i.pinimg.com/564x/69/23/4c/69234c51b2b433405f774e5f4ad0ed65.jpg
The Drama at OpenAi is interesting: https://workflowpedia.com/openai-mass-resignations
hello
can i send an image to stable diffusion on discord and then tell it to take inspiration from it?
SDXL
Hey there, can anyone help me with a prompt? I can't make it look like how I want... :/
on the bot? tab after the prompt box and set 'format' to 'image'
there's a whole channel about that! #📝|prompting-help
Not sure what you mean?
models are released when they're ready to be released
sounds like overtraining yeah. You might try generating without the lora and then inpainting the face with the lora?
or retrain with a better dataset
I believe you should be able to just switch it to using a more recent model and it'd be fine, right?
You seem to be the first to not immediately reveal leaks about the capabilities or insides of the model, but only after the model is completely ready for release. I thought you also show some leaks.
Where did controlnet go in the bot? I want to share my assessment that he did his job well.
it's disabled for now. May or may not come back in the future (hopefully will). The bot's used for testing things, so its most likely return date is whenever we next have something involving controlnets to test
Could you do variations of an image ?
Anyone here got experience using CharTuner?
what is the best model rn?
Something getting released today or just hype here?
Emad (@EMostaque): What is being released tomorrow? Guesses go here 👇 (1:29 PM · Nov 20, 2023 · 61K views)
why does the bot generate videos?
You have to set a format, otherwise it will randomly decide to make a video
what idiot came up with this?
No need to insult the people behind this. This is mostly happening to gather data as far as I know
how to specify the format?
ahoy mates
anyone knows how to generate multiple images varying a single thing? something like
"a man's portrait at (KEYWORD) at night, epic, good composition blahblah" and the keyword points to a .txt file with a hundred different words like "the eiffel tower" or "barcelona" or "the mountains" or whatever? i KNOW there's a way but im not finding it on google because i forgot what this tool or option was called
im talking about A1111 btw, i don't use online generators or whatever
Dynamic Prompts extension maybe
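The idea behind the question, one prompt per word from a list, can be sketched in plain Python. To be clear, this is not how the Dynamic Prompts extension works internally and it doesn't use that extension's syntax; `expand_prompts` and the `(KEYWORD)` placeholder are just illustrative names.

```python
def expand_prompts(template, placeholder, values):
    """Return one prompt per non-empty value, substituted for the placeholder."""
    return [
        template.replace(placeholder, value.strip())
        for value in values
        if value.strip()
    ]


template = "a man's portrait at (KEYWORD) at night, epic, good composition"
places = ["the eiffel tower", "barcelona", "the mountains"]

for prompt in expand_prompts(template, "(KEYWORD)", places):
    print(prompt)
```

With a real word list you would read the values from your .txt file, one per line, for example with `open("places.txt").read().splitlines()` (the file name here is hypothetical).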
In one hour according to Emad. My guesses are:
- text 2 video
- semi-agentic text-2-3D scene builder
- stable audio V2 with V1 going open source
From https://fxtwitter.com/MysteryGuitarM/status/1726753676885377071 looks like another small LLM?
But I'd love to try Stable Audio on my own hardware
who is experienced with proxies and multiaccounting?
hello
I have been under a rock the past 4 months
I have noticed some new regulation about AI
And I didn't see the release of any new groundbreaking image AI program, other than DALL-E 3
could someone explain to me those two topics? thanks
@signal coral could u check ur dms pls 😅✨🙏🏽
Hello
I have been trying to upscale one photo, without success.
I have done many attempts with different upscaler and settings.
Does anyone with good experience want to give it a try ? And test his skills 😉 ?
And let me know what method he used
You can DM
Thanks
Have you tried the Clipdrop upscaler? https://clipdrop.co/image-upscaler
Oh wow, 40GB of vram
wait list 404s
🥱
research only 😢
epic moment
same question
thought it was gonna be 1.6 though 😔
Need to get them workstation cards
Im an idiot they already published the weights
is stable audio available to download and run locally? why not?
#✍🏼|rules-and-tos please keep it tame
While things can be possible, we don't facilitate this server as a place to discuss the details of those things, as it can spiral very quickly
Thanks for understanding ^^
The model being used with the stable audio website is trained on audio provided by audiosparx. Stability.ai has a revenue split with them so that model isn't getting released to the public.
i set format:picture or format:photo... voila: it's video
(adds format:video after my format choice)
**horned demon in mist, red eyes, cursed forest, poison grass, night time, cinema scene, volumetric lights aspect:21:9, format:photo** height:576 format:Video
Did you do it like this? #💬|general-chat message
I just tried the free option, doesn't change anything, but thanks
I would have prefer an option with stable diffusion
does anyone in here know if something exists to show me the command being run when i click generate, so i can use it to build that in python?
do you mean you want to build a plugin that shows the command?
no im wondering if there is one, because im using so many extensions, i cant figure out what to send to the api
is this in a1111?
yeah
if so i believe you can check their api docs
i can see the api docs, but im trying to see how the whole command is built. like if i use inpaint and controlnet, and im using a specific checkpoint, how do i see how to format that in python?
or in api
you could try a library like this to make it easier https://github.com/mix1009/sdwebuiapi
so this repo makes sense, maybe i am asking the wrong question 😬
if i select all my options on the webui (inpaint, mask, controlnet w canny, etc) and i click generate, is there a way to see how that call was constructed so i can then build or use that same formatting when i'm submitting that through the api? or can i export that as a python script?
the normal calls in the ui dont run through an api. maybe there's an extension or something that you could use to convert the commands, but i dont know of any conventional way to do that.
ok cool tyvm 🙏 so if i get something on the webui setup how i want it, i then need to figure out how to build that in python or with the api calls on my own, it sounds like?
yes
there's no verbose command for python maybe that will show the generate task in the command line?
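For anyone reading this thread later: the webui does not print an equivalent API call, but when A1111 is started with `--api` the JSON body for `/sdapi/v1/txt2img` can be put together by hand. Below is a minimal sketch, assuming a local server on the default port. `build_txt2img_payload` and `send` are hypothetical helper names, and the ControlNet unit fields are passed through untouched because they depend on the extension version (the server's `/docs` page lists what it accepts).

```python
import json
import urllib.request


def build_txt2img_payload(prompt, checkpoint=None, controlnet_units=None, **overrides):
    """Assemble the JSON body for POST /sdapi/v1/txt2img (hypothetical helper)."""
    payload = {
        "prompt": prompt,
        "negative_prompt": "",
        "steps": 20,
        "width": 512,
        "height": 512,
        "cfg_scale": 7,
    }
    # Any other txt2img field can be overridden by keyword, e.g. steps=30.
    payload.update(overrides)
    if checkpoint:
        # Per-request checkpoint switch; the name must match one the webui lists.
        payload["override_settings"] = {"sd_model_checkpoint": checkpoint}
    if controlnet_units:
        # Extensions hook in through "alwayson_scripts". The unit dicts are
        # passed through as given; their fields are an assumption to verify.
        payload["alwayson_scripts"] = {"controlnet": {"args": list(controlnet_units)}}
    return payload


def send(payload, base_url="http://127.0.0.1:7860"):
    """POST the payload and return the list of base64-encoded images."""
    request = urllib.request.Request(
        base_url + "/sdapi/v1/txt2img",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["images"]
```

Usage would look like `send(build_txt2img_payload("a cat", steps=30))` with the webui running. Inpainting goes through the separate `/sdapi/v1/img2img` endpoint, which is not covered in this sketch.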
daily outloud wondering about what stable diffusion 1.6 is
if you've got apple or an nvidia gpu laptop
Is stable diffusion something that can be affected by what's happening at openai?
6 months ago I would've shamed you for proposing such an abomination, but Dall-E is surprisingly okieday, recently.
is there an api or anywhere i can use image2vid online
dall-e 2 was pretty good. like it or hate it it would've been good to have it open still
https://www.pika.art/ their discord server kinda
stability's app has a wait list
Hate it.
are repeats important in kohya when using max steps?
because i think its just calculating steps, but when you give max 3k steps for example it doesnt matter?
or nah?
Repeats help, to a point. Kohya is on my shitlist just now and I want my damn A11 extension working again.
kohya is ass
for real
so repeats are important or nah? Because when im giving 50 repeats, or 20 repeats or 10
it always 3k steps when im setting 3k max steps
oopooh. Too many brah. How many images you training?
and repeats don't influence this i think
31 images, 500 class images
and 3k steps
maybe 5 is what's needed?
5 repeats?
Yeah, easily. For 30 images?
I did 100 on 2. Fine results.
and how many steps
max
so idk how it works, if im changing the repeats it doesnt matter for the steps bcs its always 3k
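The arithmetic behind this can be sketched quickly. This is a simplified model of how the step count is usually described for kohya, not the trainer's actual code: it ignores class/regularization images and gradient accumulation, and the function names are made up for illustration. With a fixed max step count, repeats change how many epochs fit into the run, not the total number of steps.

```python
import math


def steps_per_epoch(num_images, repeats, batch_size=1):
    """One epoch shows every image `repeats` times, split into batches."""
    return math.ceil(num_images * repeats / batch_size)


def epochs_for_max_steps(num_images, repeats, max_steps, batch_size=1):
    """How many passes over the repeated dataset fit into `max_steps`."""
    return max_steps / steps_per_epoch(num_images, repeats, batch_size)


# 31 images, 3000 max steps, as in the conversation above
for repeats in (5, 10, 20, 50):
    per_epoch = steps_per_epoch(31, repeats)
    epochs = epochs_for_max_steps(31, repeats, 3000)
    print(f"repeats={repeats}: {per_epoch} steps/epoch, about {epochs:.2f} epochs")
```

So under these assumptions the total stays at 3k no matter what, and the repeats value mainly matters when the run is limited by epochs instead of max steps, or when balancing several folders against each other.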
Are you using the dreambooth trainer? I've gotten better results with the LoRA trainer.
kohya
but im training model dreambooth
then lora
without captions
https://rentry.co/lycoris-and-lora-from-dreambooth look at this, and go to script (train_morgan.sh)
is it possible to replicate these settings in a1111 extension
just to copy this config or something?
Kohya SS is incredible, right up until it can gobble a whole-@ss bag of d1cks and eat my @ss with a spoon. Just saying. Crashed compy harder than I've seen a compy crash just now. That was a whole world of BSOD bad.
the world of pytorch can really bsod the shit outta compies
make sure all drivers are up to date. all overlays are closed. even a bios update helps
not just gpu drivers but anything touching the system
Why did he generate a video for me???
Does anyone know why those models don't exist anymore? Or if they've been replaced?
- Deliberate
- Reliberate
- Lawra (LORA)
Thanks
Where is this? They still up at civitai (i think), could be that they made newer versions and those got deprecated?
that's the question because they are not there anymore
is there any colab for stable video diffusion?
please follow https://discord.com/channels/1002292111942635562/1176560067961696276 they posted a bit up one
thanks
hello. is there a way to use the bots in this channel for in-painting and out-painting? Thanks.
I'm running A1111 and plan to transition to SDXL for some things... Should I switch completely over?
Good morning, everyone! How are we all today?
Afternoon! 🙂
How's it going?
All good here. Still trying to wrap my head around all of this... but I've had some great help from @visual pier
I have been summoned
Moin ^-^
Whatcha all up to?
idk , it works for me
Still puzzled about this...
Heading home from work and trying to set up SDXL after a nice shower
Nice! You can always use #🐝|swarm-ui
just use it additionally
SDXL can do some things sd1.5 cant
So, SDXL can be used with A1111? Or you need a different WebUI?
is it stupid that I joined and already have applied for mod
I've been active with Stable Diffusion based AIs for ages so im not new to it
just never knew there was a discord for it!
i've never understood masochists but i guess someone has to do the job
also how are you all doing 
I have a LOT of moderation experience
thats my problem too. maybe i was moderating wrong communities though. (gamers)
I don't think it's stupid. Keeping communities safe is a great volunteer effort
I mostly did gaming and pride besides leonardo(an AI server im greeter for) but I did just join but then they did place the application on the first channel I see
i'm not a fan of leonardo but things change. Their mistakes are past the tide line at this point.
they haven't kept licensing exclusive deals for community models since they were doing that when sd2 came out
they're kind people tho
and I use invoke too
heck I started with invoke
I'm sure many business school grads are but they also were signing exclusivity contracts with model trainers. Mistake in hindsight and they seemed to have stopped that behavior. it's been a while since that initial "wow i don't like these guys" happened
sure! my issue was they were approaching public community model authors and signing exclusivity rights to the already released models. if you release a model from the start and ask for $, i'd pay if it was interesting enough
i bought px8
mine is based on pictures from my father
who rised the world and knew steve jobs and bill gates
and also was a lead of the karamapa foundation
he passed away
due to the source images its very variable
careful about doxable info. that's just my inner spy persona being like "woah tell them to chill out on that!"
looks like pxl8 isn't being sold on the main shop site anymore. just the free version is available now. https://devilismyfriend.gumroad.com/l/drzhn
hence me using filtered images
and many people
even if most are dad and his GF
both deceased
so its not even doxable
you'd know best. i'm just projecting i think
its fine
welcome to the server experienced vet!

look in #🏞|general-with-images @pale latch
I am currently training 5K+ of images
it works in auto, just a little wonky and lower performance vs comfy-based uis like swarm
So is Stable Swarm good as a standalone UI? In terms of user interface, extension capability, etc. Of course I'm comparing it to Automatic 1111.)
I always thought its main purpose was to use this GPU network, or whatever it's called
Which I'm not really interested in
yes, yes it is
the multiGPU is just the feature that inspired the naming
(and is a really cool feature, but swarm is a lot more than just that one thing)
that's the default role
Sounds good! And looks good too (watching some overview at YouTube right now).
What I want to try is to optimize some XL model for my 1060 6Gb, lol.
Don't wanna dive too deep into Comfy again
swarm uses comfy as the engine, so you'll get the full speed of comfy's optimized core without having to dive into the noodles
Well, actually it worked kinda fine when I tried it on Comfy, especially with LCM LoRA. However, LCM only speeds up the actual sampling steps. So it worked fast as long as I only changed the sampler parameters.
But on my setup even a simple change of prompts caused a very noticeable slow down in performance. And don't even talk to me about changing the model...
When it took about 9-10 minutes to load some random XL model, I decided to drop that idea for the time 😅 But now wanna try again.
Sounds awesome
So as I understand, Stable Swarm takes all the power and customization capabilities of ComfyUI and streamlines it into a more comfortable and easy to use interface, with the ability to add additional custom workflows via Comfy.
Now the only logical step I see is to integrate all the common workflows into the base version of Swarm and get an Auto1111 with a better backend engine 😅
oh
how do I get the devs role or is it staff
unsure if devs is for all developers or just developers working here
I guess its a staff role
Devs in yellow is staff yes. You can get other roles in the #👥|roles channel
ah I applied using the forum in #1072220168534642768
what role will I get if accepted or does it depend
sorry if I ask too much 
if you applied to be a mod, you'd theoretically get a mod role
I'm not involved in the decision-making on that, but I don't think it's likely you'll get accepted if this is your first day here and you're unfamiliar with the channels & roles. You'll probably want to wait off on trying to become a mod until you're, yknow, familiar with the discord
they can always delay my app
im not here for moderator im here to work with stable diffusion
I am willing to help ofcourse

The main swarm Generate tab auto-generates workflows on the fly rather than using pre-set ones
trying to premake all common workflows would be pretty difficult just from the mix-n-match capability
(eg want to add a lora? that's a whole new workflow. Want to add 2 loras? that's also a whole new workflow. Thus, swarm auto-generates on the fly based on your params)
Depends what software you're using to train, but generally your best bet is per-image captions
stable-diffusion-webui
As in, Auto WebUI? Are you... training a TI Embedding? that's the only trainer built into auto
(well, and hypernetworks, but, lol)
If not, you're using an extension not the webui itself
wondering about a training thought i had. often i use generations in training data. what if i throw the prompt i used into the caption? thoughts?
pinged you showing the settings
What if I want to add some more complex stuff like ADetailer or Roop/Reactor though? I guess I would still need to dive into the noodles, or find a relevant workflow online and run it once in Swarm?
oh, yeah, that's making a TI
I hope it will work my input is very variable
You... uh, should not be feeding 5000 complex images into a TI
its not one type of image
You'll want to train a checkpoint model or a LoRA

yeah TI won't work at all
lora going to be better. that kohya like i was saying
welp I am training it so better finish and see what it makes
im used to LoRAs but my LoRAs are always one artstyle
Swarm has ADetailer support built in (you just need to load the comfy extension to the comfy backend and download the models - in the near future that part will likely be automated away), you just drag an image to the prompt box and then select adetailer model on the left
these are many different ones from irl photos to buddhist art
roop not yet but probably should be added. For now, yeah, noodles
well it seems to be working
i wonder what i should make for my next published lora. maybe another celebrity face
LoRAs are commonly one thing at a time, but can be several. TIs have to be only one concept
Cool, thank you for answering. I will probably try it today.
oh well most are IRL
wait sorry i had a brainfart lol - I just said "ADetailer" and described IPAdapter lol
v2 of madball lora, i'm going to split it up into multiple concept folders
Lol
most of the images are photographs as my dad took them
Swarm actually has a better implementation of ADetailer than anyone else, and it's purely built in
Huh?
i was about to say "didn't know adetailer was in comfy now!" but then i assumed i just didn't know as much as you and shouldn't question things
just add to your prompt: <segment:face> a perfect face or whatever to use "ADetailer"
but you can also change what object it's matching
oh i like that
(which adetailer requires training a classifier model for. Swarm uses clipseg to allow text specification)
ONLY 24 HOURS OF TRAINING TO GO! WOOOOOO!
I see, that's actually clever. I just was thinking about something like that today.
one feature i'm dying for is lora autocomplete in prompts. i've got an extension to do autocomplete in auto1111 but its so horribly buggy and has bad ux that i keep it off
well seems my model works I will keep training it
type <lora: and get a little drop down of them all that'll filter as you type. just like visual studio or any other autocomplete would.
Swarm has an actual interface for it
but yeah tab-completer is on the planned features list
already have that on the grid generator interface
squeeeee
Haha, I just realized you are the dev of Swarm (if I understand that correctly)
i'm very liquid with UI's lately. i been using stability matrix to try different ones, but i'm going to abandon using that. i have more flexibility without it, and the launcher itself will always crash and close the process AND the browser
did not expect a model with 80% photos to make actual art
TI's aren't models. They're textual embeddings
Regarding that "ADetailer++", can I use something like segment:face without any additional prompts to use the main prompt? As it's done in ADetailer itself.
you're turning all those images into one catch all token that represents them
I never really used additional prompts for ADetailer's face fix, just its capability to quickly inpaint faces at a higher resolution
yes
oo, not currently (other than, yknow, just copypaste the prompt text) but I could probably add that
That would be cool, as I suppose I'm not the only one who uses ADetailer that way, lol
But using the CLIP Seg is absolute fire. Wonder why nobody has implemented this into some extension for A1111 yet. Or maybe they did, lol
Hey, can we no longer use images as base for generating?
In what context? The bot here? #🗣|artisan-support-feedback ... the Stability API? #1042896447311454361 ... a local UI?
Yup, here.
if the bot here, then #🗣|artisan-support-feedback
Ah, thank you!
watching your own AI model grow each training epoch is truly amazing
slowly but surely its getting better
it already is making some amazing environmental pics
it also is good at text and UIX
I forgot to filter out the ID cards and the AI is trying to make fraudulent IDs now 
fortunately it failed 
and fortunately only 1 of them was resembling an ID card

on the bright side its getting better at faces and finally figured out how to make a server
Jesus loves you all
he hasn't returned for over 2000 years. exemplary father figure huh
He bin hurr bruh.
whats a good resource to get more knowledgeable about SD stuff? samplers, refiners and stuff?
Do you mean practical guides or more indepth technical stuff?
For the former I've used the tutorials from https://stable-diffusion-art.com/ for quite a few times
For the latter... Well, I would also like to know that, lol. The articles and videos I find on the Internet are most of the time either too complicated, or, on the contrary, too shallow (again, delving more into the practical side of things).
well really anything, my knowledge about SD at all is quite poor
will check out sd art com tho
There are a couple of pretty cool YT channels I check from time to time, though they also mostly cover the practical use of Stable Diffusion. Look for @OlivioSarikas and @sebastiankamph.
@Not4Talent at YT is my most recent discovery (though I've stumbled upon his earlier videos before). One of his latest videos deals with the topic which a lot of SD enthusiasts tend to literally sweep under the rug. Or, well, at least hide as much as possible. https://www.youtube.com/watch?v=oPcQzhhwsGU
Yeah, it's about generating hands
Hey 👋 just wanted to say hi and that I'm happy to be here. I started looking into stable diffusion recently, and playing with it and it's so fun. I love that stability.ai made it open source too. 💯
Hi! Yeah, working with SD could get a bit frustrating at times, but it is incredibly satisfying to finally produce the picture you had in your mind. Even better is to produce a very good picture that you least expected, lol.
out of curiosity, I'm planning to try run it locally and do some fine-tuning. Are most people here doing fine-tuning as well?
Yes! I'm blown away by the output. I also looked around in this community - the art work is pretty amazing.
I just started and haven't experienced the frustrating part yet, that will probably still come lol
i finetuned some madballs this week
Just so I know what is awaiting me - what can get frustrating?
out of memory errors
Honestly, with the sheer amount of LoRAs out there, and with stuff like ControlNET, IP Adapter, Reactor, etc. I don't really need any finetuning most of the time.
What is madball?
collectibles from 80s
oh wow
Or a hardcore punk band from 80s
Lol
lol unrelated to the band
there were series made recently i know about them cause a friend collects a lot of stuff like this
Well, I'd rarely get OoM errors @chilly furnacecold has mentioned, as I mostly use SD1.5. And it works perfectly even on my 1060 6Gb card (albeit a bit too slow if you start to increase the number of ControlNET models or other processors).
ah.. is there an ideal hardware setup for finetuning -- one that's still within reasonable budget? Not sure if there's been discussions about this before. I'm looking to renew my equipment.
turn that batch size up and you'll find them. training especially
I saw the minimum reqs on the website, but they seemed a bit low to me
don't want to run into too many frustrating moments lol
I was talking mostly about the times when SD doesn't want to generate the picture you want it to, heh
fine tuning is usually what we call the deepest levels of model refinement. you need 24gb to unfold all the layers and refine it all.
Dreambooth and LoRA are other methods that we use to refine a model, and they require less memory
Like completely ignoring a pretty simple pose you are trying to apply.
attention problems are a big deal too. While these models take natural language inputs, they're REALLY stupid and don't pay attention
or if you're using sd15, a 512x512 pixel trained model, and generate a 1024x1024 image, it'll only give attention to 512x pixels at a time.
oh okay.. I wasn't planning to go that deep. I was more thinking of making the outputs a bit more personalized to start with. Good to know that 24gb is required to unfold all layers though.
centaurs and siamese twins
Yea, I haven't tried that myself yet, but I saw some examples from other users.
sdxl is trained at higher resolutions, so its base attention is much higher
do many people go that deep btw?
You can look into the names I've mentioned in my reply (IP adapter, etc.)
Chances are it would be more than enough for your purposes
most people do loras i'd think. thats all i stick to. if i were to do full models i'd want a 24gb card. i've got a 16gb one
There are quite a few guides and overviews on YouTube
I'm sorry, I might have missed it -- which reply are you referring to?
Sorry, this one
Gotcha. The requirements for lora seem quite reasonable too 😄
Difficulty: easy-hard (less likely to affect the base model)
Style training: good quality
Subject training: good quality
GPU cost: medium (8GB or more)
ControlNet and IP adapter are basically a bunch of additional models that allow you to better control the generation. E.g. you can use depth maps to control how far the object in your scene should be located from camera, or use OpenPose to control the pose of the character you are generating.
Ah thanks, I still need to look up what these all are. I'm only just getting more familiar with A1111
aaaahahaha sam altman's trollololol video making rounds omg that's a properly timed remix for this meme
link? 😄
IP Adapter lets you replicate the style and/or composition of a certain reference picture. It also has some models for face replication, but if you need a fast and reliable face swapping tool, then definitely look into Reactor. (I'm not even talking about deepfakes, it just really helps with character consistency, lol.)
LOL 😂
personally, if i want to do a person, all those guidance models don't compare to a lora
All of the things I've described exist as extensions for A1111, so yeah, get acquainted with the basics, and then you can dive deeper... and deeper... and deeper.
i wasn't able to get madballs from guidance models either
ah that's interesting, didn't think of using face swapping in that way before
True
the face swapping models are nice too, but they're based on 256 resolution transformers and then fixed with codeformer
there's a lot of limitations there
sounds like it's going to be a deep rabbit hole 😄
use reactor on anime generations for instance
a lora will make an anime version of the subject trained. reactor will face swap their photographic face onto an anime face
Also true 😔
and with reactor, because it's 256, you can see the pixels. so codeformer fixes it , but then sometimes what reactor swaps and codeformer fixes, don't line up and you get a big bunch of pixely hair
I think I might go with 16gb as well, feels like something safe to start with. It will probably be a while before I will think about upgrading, but why would some people do full models though - for what kind of use cases?
yeah 16 is nice. i want 24 but i've lived a lot with 16
Ooh I would like to try that out 😁
I don't think you do, haha
It looks... weird
Do you have any experience with not doing it locally but renting servers?
Definitely not me, but I know a lot of people are using rented GPUs for training
Maybe other people here could elaborate more
Am thinking that IF I really get that far along, and my hardware isn't sufficient anymore, perhaps I can rent GPUs too, but not sure how the UX is.
Or I don't buy new equipment now, just get familiar and play with things while renting GPUs -- and decide later what equipment to buy
Do you have an idea if more people here rent GPUs or run it locally?
Nope, I have just joined this Discord a couple days ago myself, lol. Though I have been playing with SD for quite some time already.
Cool, we're both new here! Although I'm much more of a novice yea
Have you played with other models before joining this server?
I want to try out so many things, but after hearing you both talking about SD, I think there will be enough for me to learn and try out here before I go exploring on hugging face again.
I think I tried Midjourney for some test generations, but decided to stick to SD in the end. Just because of how configurable and modular it is.
@finite cloak Hey man, sorry for a pretty basic question, but where does the Stable Swarm UI keep the server logs? Cannot get IP Adapter to work, as it says the following: No images were generated (all refused, or failed - check server logs for details).
I heard something about Stability updating SD1.5 to 1.6?
Can anyone tell me if they know what the differences are yet? 👀
Curious.
noone knows /guitar
its on the api. some comparisons were shown. that's it. thats all we know. some have said it won't be compatible with any of the loras and controlnets is why its not released
Ah. I see. Thanks for letting me know. haha! xD
okay the model is okay enough
time to work on my woman/anime/dental LoRA
why those things? AI fails a lot at putting teeth in mouths
and I wanted it to be better at faces and have anime style options
I also included NSFW but I have my doubts those images make anything good, there are like 3 of them in the 1K
but well while making a LoRA for humans it would be interesting to include some questionable ones
as I did try and AI has failures in naked pictures as well as when it holds items
so might make a LoRA for hands

oh also my LoRA has a bunch of PD paintings
so im truly wondering what it will make
also making one with all of the pics from dad
after this one
so one for mostly woman faces and cartoon
and one with anything my dad stored on his drive
https://www.reuters.com/technology/sam-altmans-ouster-openai-was-precipitated-by-letter-board-about-ai-breakthrough-2023-11-22/ oh they mean this clickbait nonsense exclusive being passed around recently
first claimed that altman was fired over q* then backtracked it because they rushed to publish something saucy
Hello, I am new to all of this, is there a specific subsection to discuss and receive tips on LoRA training?
is there any youtube video to get started on Stable Diffusion along with some sample prompts? I am a newbie wanting to learn
can I use Stable Diffusion to create a new image from a base image?
is there a subscription needed and what prompt do I use to start with a base image and then alter it into a new image?
I am finding lots of very high quality Gifs made with Prompt Travel in Animatediff -- but no one mentions how to get that crisp quality. Does anyone have a good guide?
Hi, does anyone have experience with the AnimateDiff connection lost issue in ComfyUI?
ERROR:asyncio:Exception in callback _ProactorBasePipeTransport._call_connection_lost(None)
handle: <Handle _ProactorBasePipeTransport._call_connection_lost(None)>
Traceback (most recent call last):
File "asyncio\events.py", line 80, in _run
File "asyncio\proactor_events.py", line 165, in _call_connection_lost
ConnectionResetError: [WinError 10054] An existing connection was forcibly closed by the remote host
Prompt executed in 3839.24 seconds
ERROR:asyncio:Exception in callback _ProactorBasePipeTransport._call_connection_lost(None)
handle: <Handle _ProactorBasePipeTransport._call_connection_lost(None)>
Traceback (most recent call last):
File "asyncio\events.py", line 80, in _run
File "asyncio\proactor_events.py", line 165, in _call_connection_lost
ConnectionResetError: [WinError 10054] An existing connection was forcibly closed by the remote host
is this an official server?
I think it is
Can we generate videos where the characters move? I generated a video but it's only the camera that's moving.
stablediffusion is an IMAGE AI. it is possible to make a gif from the images but you've got to do that manually
wdym?
you can however use adobe animator on AI characters but they're still images
but i've heard of stable video diffusion
no videos, as videos are not easily fully made using AI
it may make videos but only without sound
and only if you make a tool that makes it frame to frame from images
ik, but how can i access stable video diffusion here?
and due to AI being unpredictable this might fail
my bad
I forgot there was stable diffusion videos
but is there a stable video diffusion bot here to generate them directly?
I only use the image stuff

I recommend doing it locally anyway
if your GPU can handle it
my colab isnt powerful enough and im broke 
I feel your pain
my 3080 barely holds up
Do I only have the free version since the dream bot Channels are locked for me?
and that thing is pretty beefy💀
the only thing I know better than a 3080 is a 3090
Will there be a svd bot here? plssssssss
idk if there is even one for images here
there are for imgs in bot channels
oh well it wont take my custom LoRAs anyway

what are custom LoRAs?
LoRAs are like plugins for the AI
they make it better at specific things
they're like plugins to models but unlike plugins for applications they're trained using AI like a model
so like mini models that get injected into for example SD2 or any other model
so you can see LoRAs as expansions for AI models
Even a 4090 won't cut it for Stable Video Diffusion locally... you need 40GB of VRAM.
im talking about image gen anyway. unlike LoRAs, finetuning is making a new model based on existing ones
This is all very new to me. I'm still trying to understand what does what...
it's fine if you compare AI models to games
LoRAs are DLCs and finetuning is a new game in the same engine
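To make the "DLC" picture a bit more concrete: a LoRA stores two small matrices per adapted layer, and at load time their product is added on top of the frozen base weight. Below is a toy sketch in plain Python of that low-rank update, W' = W + (alpha / rank) * B·A. The function names and the tiny 2x2 numbers are made up for illustration; real implementations work on tensors inside the U-Net and text encoder.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(base_weight, lora_a, lora_b, alpha, rank):
    """Return base_weight + (alpha / rank) * (lora_b @ lora_a).

    base_weight: d_out x d_in, lora_b: d_out x rank, lora_a: rank x d_in.
    The base weight itself is left untouched; a new matrix is returned.
    """
    delta = matmul(lora_b, lora_a)
    scale = alpha / rank
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]


base = [[1.0, 0.0], [0.0, 1.0]]   # frozen base-model weight
lora_b = [[1.0], [0.0]]           # d_out x rank (rank 1 here)
lora_a = [[0.0, 2.0]]             # rank x d_in
print(apply_lora(base, lora_a, lora_b, alpha=1.0, rank=1))
```

Because the matrix shapes have to match the base model's layers, a LoRA trained against one base model generally will not load into a different architecture, which fits the incompatibility mentioned in the chat.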
im currently testing my LoRA
but first im making a normal gen using SD2
its sad to see that my old LoRAs are incompatible

What is a normal gen?
with that I mean just the base model
no LoRAs or control nets(extensions)
I forgot I put way too much anime in my LoRA
need to change the prompt to even make it resemble the same thing
Ah... OK. This may be a dumb question... but how do you make a normal gen? I don't mean the specifics (I'm sure that's fairly complex?), just the outline of what you'd need to do.
just select a model and do not touch the LoRAs or control nets
idk if it is the actual wording for it

I think that's half the battle for me. I mean, remembering the names, and what thing does what to an image. Haha. The bit I really don't get when you say you're making a normal gen is that if you're using a base model (in this case SD2, is that right?), and you're not using any LoRAs or other extensions etc. then how do you influence the base model?
And apologies for all the questions!
I know the basics
like controlnets and LoRAs
controlnets are code based extensions LoRA are expansions of the model
Right... I think I understand. So what are you doing to make a normal gen then?
I made a anime LoRA by mistake
OK... I think I'm getting this. So you want to 'de-anime' your LoRA? Or you are starting from scratch? If it's the latter... what process do you use?
im making a new one
So you're making a new LoRA for SD2?
I made one but Im making a new one for 1.5
I do wonder about AI copyright
especially seeing that this new SaRA model is made from images I do have rights on due to the one taking them being my deceased father (I inherit his copyrighted work as a child of his)
not to mention most humanoid ones in it are of him and sari, both of them are deceased and I can guarantee if dad was alive he would 100% approve of me doing this
I know dad he loved tech and he would love AI
and most are environments
this SaRA LoRA will be amazing in variations
dad took so many pictures of different environments, people and even some stuff normal people do not have access to like sacred buddhist statues and server rooms
That sounds amazing!
it is
and legally I own the images. most are of humans that passed away, who wont mind me training AI with their faces (monks would not mind), or of stuff that would make the taker of the picture the holder
in the case of dad the only copyright holder is me due to his passing
let me check just in case
Like any other property you own, what normally happens is that ownership of your copyrights is transferred to the heirs of your estate.
I guess this includes photos otherwise copyrighted by dad
That makes sense. And I'm sorry to hear about the passing of your dad.
its tragic especially as he passed away suddenly
but AI can make him not die in vain
also im training the data and already see an avr_loss of 0.12
likely as its extremely variable
AI LoRAs/Models love data of different stuff
can you apply q* to stable diffusion
what is Q*
also SD is just a model
so you can in theory use other AI to train it
if you make a image(and own the rights to it) you can train AI to remake it
for example
if you draw anime and cram that into a model
the model is yours to control
but if I cram all of highschool DXD into a model that is infringing
as it can remake all of the characters and uses one style
this is ESPECIALLY a danger if the person is real
I say its okay to use people and anime art but it has to not be one style/person
it makes for better AI models
im also making a LoRA for hands
Hello!
Are we allowed to promote our services on this server? Because I couldn't find a rule against that. It's about generating images with stable diffusion on cloud GPUs.
so if you hide the theft it's better. got it
no its still recommended to use copyright free or your own images
as soon as it's recognizable as something, it is automatically infringing its rights
for example if you ask a AI to make homer simpson its the same as overdrawing homer from a screenshot
but if you draw a bunch of stickmans
and cram those in a model
then ask it to make a stickman
then its still yours, same if you use Picasso paintings or other PD art
same with Voice cloning
If I use my voice to cover Adele's Set Fire to the Rain
and I have obtained lyric and beat rights (and the consent of Adele to cover her song)
to cover the song
then I can
even if I use an AI version of myself
as my voice = my voice
but if I take Adele's voice and make her sing bumble bee then I can get DMCA'd by Adele
the waters are murky with dead people
same with mixed work
hence why I'm using royalty-free content and dad's images
dad's images are mine copyright-wise due to his passing
How do I unpack a model? It's a safetensors file.
I have a model that I created a while back and it worked quite well but I want to make an updated version but would like to know what images I used to create it in the first place to avoid any kind of duplications.
you will need its original dataset
with description and images
else you can only retrain/finetune it
or LoRA it
Ok. No problem, I was wanting to check which images I used as I deleted soon after. Nevermind
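For anyone else landing on this question: a .safetensors file holds tensors, not the training images, so there is nothing to "unpack" back into a dataset. It does start with a JSON header that can carry free-form metadata, though. A minimal sketch of peeking at it, assuming the standard layout (an 8-byte little-endian header length followed by that many bytes of JSON). Whether anything about the dataset is in there depends entirely on the trainer that saved the file, and the function names are made up for this example.

```python
import json
import struct


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file as a dict."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned little-endian 64-bit length of the header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # Next header_len bytes: UTF-8 JSON describing the tensors.
        return json.loads(f.read(header_len).decode("utf-8"))


def training_metadata(path):
    """Return the optional string metadata block, or {} if there is none."""
    # Hypothetical helper: "__metadata__" is the optional key the format
    # reserves for free-form string-to-string metadata.
    return read_safetensors_header(path).get("__metadata__", {})
```

Calling `training_metadata("my_model.safetensors")` (placeholder filename) returns whatever the trainer chose to record, which may be an empty dict. File names or folder names might show up; the images themselves will not.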
Hi guys
I'm kinda trying to get some overall knowledge of the SD models and the generation process
Could you correct me if I got it wrong pls.
The trained SD model is a complex model which consists of multiple components:
CLIP - NN that ensures the correspondence between images in the training dataset and their text descriptions
U-Net - NN trained to define the details in the latent space representations of training dataset images, and to estimate the noise in that latent space representation
VAE - consists of an encoder and a decoder. The NN encoder converts images into their latent space representation, and the NN decoder converts the latent space back into a pixel space image
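A rough way to see how those three pieces hand data to each other at generation time is to follow the tensor shapes only, with no real networks involved. This sketch assumes SD 1.x numbers (VAE downscale factor 8, 4 latent channels, CLIP text embeddings of 77 tokens by 768 values); other model families use different sizes, and all function names here are invented for illustration.

```python
def clip_text_shape(max_tokens=77, width=768):
    # The CLIP text encoder turns the prompt into one vector per token slot.
    return (max_tokens, width)


def vae_latent_shape(height, width, factor=8, channels=4):
    # The VAE encoder compresses a pixel image into a much smaller latent.
    if height % factor or width % factor:
        raise ValueError("image size must be a multiple of the VAE factor")
    return (channels, height // factor, width // factor)


def unet_noise_shape(latent_shape, text_shape):
    # The U-Net sees the noisy latent plus the text embedding and predicts
    # the noise; its output has the same shape as the latent it was given.
    return latent_shape


def vae_decode_shape(latent_shape, factor=8):
    # The VAE decoder maps the cleaned-up latent back to an RGB image.
    _, h, w = latent_shape
    return (3, h * factor, w * factor)


latent = vae_latent_shape(512, 512)
print("text embedding:", clip_text_shape())
print("latent:", latent)
print("predicted noise:", unet_noise_shape(latent, clip_text_shape()))
print("decoded image:", vae_decode_shape(latent))
```

The takeaway is that the U-Net never touches pixels: it works on the small latent, guided by the text embedding, and the VAE decoder only runs once at the end.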
I feel like I have some grasp, but there are still a lot of dumb gaps
I have read a lot of things, but they cover different aspects and don't give the whole picture
star wars is bad
star wars the force awakens is the best movie
anyone of u wannna doooo my homework......
Hi, I'm trying to git clone ComfyUI, but it keeps giving a password error. I made an SSH key already, and I tried to paste it into GitHub, but it says invalid key
I don't think you need passwords or ssh keys, are you on the real comfyUI repo?
yes, i tried to use my personal token, but it still says fatal error
why do you need a token?
you just need to git clone this: https://github.com/comfyanonymous/ComfyUI.git
onto whatever folder you want
it is asking for my password, but when I typed it, it asked me to switch to the currently recommended modes of authentication.
I tried to git clone, but i need to authenticate first before i can clone it
let's start from the beginning, where are you installing? OS?
Comfy UI, mac
Ok I understand, macOS might be asking for your admin password in order to let you install through the terminal
that has nothing to do with comfyUI
you need to have permissions
have you installed other things like that before?
no it is not my admin password it is asking what is my username on github
usually i use github in colab. Occasionally I use homebrew
it shouldn't ask for a password to let you do that
go here in a browser: https://github.com/comfyanonymous/ComfyUI
click on the green "code" button
and download the repo as .zip
unzip it wherever you like on your HD
and then follow the instructions on that page to install the rest of the stuff you need on a mac
pytorch etc
I thought about just downloading the zip, but the directions told me to install it manually
don't worry about not cloning it, you can install the manager and update everything to the latest version from the manager
download zip as https or ssh?
yeah I need an SSH key for SSH, I tried to add one, but it didn't work
Lol it is 1.3mb
yeah it's a bunch of text files
you have to download the models and controlnets etc later
How do i run it?
there's a mac section on the github
you need a couple more things for it to work
pytorch nightly
without that it wont run
i have pytorch nightly
did you install the dependencies?
the models not yet
no, the dependencies
yes
pip install -r requirements.txt
it doesn't matter, you can have it wherever you like
I tried to put the ComfyUI folder into the terminal, so I can run main.py
but it says access denied when i tried to put it into the terminal
do you have experience with the terminal? it's unix based
you have to type cd and then the path to the comfy folder
sort of, i forgot a lot of the stuff already. I only use it sparingly
once you are in the comfyUI folder at the terminal you just run this: python main.py --force-fp16
It is not working, i tried " /cd/User/.... " and "cd/User/...
ls will list the contents of a folder, cd will change the directory
depending on where you are you need to navigate, or give the path from root
cd / takes you to the root
cd /Users/xx/Documents etc
I use /ls then
no such file or directory it says
I used to know this stuff lol, but i forgot. Years ago I learned it.
do this: type "cd", then a space, then drag your ComfyUI folder into the terminal window
it should give you the path
then you can ls
and should show you the content of that folder
still says no such file or directory:
let's please take this to https://discord.com/channels/1002292111942635562/1002602742667280404
ping me there and we'll try and figure it out
Are there discord servers that are more general purpose about AI art tooling and techniques?
Seems the AI is struggling again with simple stuff like faces and detail
My images are getting harder to get the way I want as the algorithm seems to have either forgotten stuff or been reprogrammed.
doesn't work like that
the model is basically frozen outside of the training process
But, surely it changes as it learns?
digital neural networks aren't like the brain at all. People lie all the time about that. They're a mathematical construct. Not a biological brain
Oh ok
training is a very intensive process and you won't be doing it accidentally
But it draws upon experience, no?
no. it's frozen. they only learn when someone runs training code on it and saves a new weights file
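To make the "frozen" point concrete, here is a toy sketch with a two-number "model". It is not Stable Diffusion, just an illustration under made-up names: running the model (inference) only reads the weights, while a training step is a separate piece of code that computes new ones.

```python
def predict(weights, x):
    # Inference: only reads the weights, never writes to them.
    return weights["w"] * x + weights["b"]


def train_step(weights, x, target, lr=0.1):
    # Training: one gradient-descent step on squared error.
    # Returns a NEW weights dict, like saving a new weights file.
    error = predict(weights, x) - target
    return {
        "w": weights["w"] - lr * error * x,
        "b": weights["b"] - lr * error,
    }


weights = {"w": 2.0, "b": 0.0}
snapshot = dict(weights)

# Use the model as often as you like: nothing changes.
for _ in range(1000):
    predict(weights, 3.0)
print("unchanged after inference:", weights == snapshot)

# Only an explicit training step produces different weights.
new_weights = train_step(weights, x=3.0, target=9.0)
print("changed after training:", new_weights != weights)
```

Prompting a UI is the `predict` side of this. The `train_step` side only happens when someone deliberately runs training code.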
with ChatGPT, you'll see that it draws on a particular session: the weights act on the previous text, sort of like feeding the output back into the same algorithm
doing that on stable diffusion is img2img.
So if you bias a conversation with suggestions toward what you want to hear, it will tell you what you want to hear?
user feedback is used in training, but what happens is it's a spreadsheet that an operator collates and then provides to the training algorithm during a training run.
Yea ok
so me spamming the same thing over and over and pressing image a or b as good isn't affecting it in real time?
Oh yeah. LLMs (ChatGPT) love predicting what you want to hear. That's their whole jam. Sort of like Stable Diffusion is a denoiser trained to predict what should be where the noise is. LLMs predict what the next token in a sequence will be
yeah that's right. It won't start picking up that you love apples because you prompt for them a lot. you might think it is because you start seeing better apples, but you probably were just tweaking knobs towards better results progressively over time
Yea ok.
automatic1111 and comfyui and most UIs aren't set up for training. Well, sort of. A1111 has a training tab and some extensions. Either way, you need to run a specific script for the model to change, and then you get a new file
I see! I made some amazing stuff before, but doing the same thing now seems to produce poorer results. I am using the same image filename, though, which has been truncated and so has lost some of the details I inputted.
Ok
you might've had different sampler parameters before
So it is far from self awareness yet 😄
Yea
more steps. different cfg. something like that
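A toy way to see why the settings matter so much: with frozen weights, the result is just a function of the prompt, the seed and the sampler settings. The snippet below is NOT a real sampler. `toy_generate` and its "denoise" rule are invented purely to show that same inputs give the same output and that changing steps, cfg or seed gives a different one.

```python
import random


def toy_generate(seed, steps=20, cfg=7.0):
    # Starting noise is fully determined by the seed.
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(4)]
    # Pretend "denoising": each step shrinks the values by an amount
    # that depends on cfg. A stand-in, not how a real sampler works.
    for _ in range(steps):
        x = [v - (cfg / 100.0) * v for v in x]
    return x


same_a = toy_generate(seed=42, steps=20, cfg=7.0)
same_b = toy_generate(seed=42, steps=20, cfg=7.0)
print("same settings, same result:", same_a == same_b)
print("more steps changes it:", toy_generate(42, steps=30, cfg=7.0) != same_a)
print("different cfg changes it:", toy_generate(42, steps=20, cfg=9.0) != same_a)
```

So if an old image can't be reproduced, the first thing to compare is the saved generation settings (seed, steps, cfg, sampler, model file) rather than assuming the model itself changed.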
that's right. Super intelligence might come before self awareness though. We just don't know. Engineering awareness is a tough concept

