#💬|general-chat
1 messages · Page 81 of 1
Hello everyone, we are an AI startup team from Silicon Valley, and we are currently developing productivity tools for students, and we hope to collect your valuable opinions! [Rose]
We will randomly select 3 users who participate in the questionnaire and give them the company's peripheral gifts or the same amount of cash ($30)!❤https://docs.google.com/forms/d/e/1FAIpQLSechz007ja4AoARJaxV-q15QRoIEH_Kd7gbmHiUucrlbtfWpg/viewform
Hi, no
damn only thing in general today is a scambot 😭
so i switched to NVDA GPU, fuck you AMD
Hey yall, i was just wondering if there is a thing similar to stable diffusion that allows you to train a voice locally with audio files. Is there something like that?
what is interesting is I can see the stable audio october event channel but not the stable audio channel
and I cant add an image here to show you
but it's supposed to be right above general chat correct?
when I click on the link above that @sudden ruin sent, itll show up for a sec then disappear agan
voice ai. it requires the voice of the user, for example 25minutes
but thats a voice changer. i would like something where i can input text and have it give out the text in speech of the voice i trained it on. this jsut seems like a real time voice changer. but i am not trying to change mine exactly
if that makes sense
dont know
fair, was just asking around if others maybe have
cause some german streamers have managed to get it to work with some colleagues
ask them in comentaries or stalk them to their discords
I need image generation reviews for my fine tuned sdxl model with Loras
(running on nvidia A100) estimated time ~~ 15 seconds
Can someone help, you just need to create images and give reviews everything is running on my own resource
Do you know how some people can make videos without flickering and with a very good consistency?
Because the tutorials from youtube are not good
I mean there are some gifs/videos on civitai which are high quality, but I think that there's no way to use the reverse engineering to find out what settings were used for them.
is there any way to put a text on an image?
Thats weird, it works fine for me 
Dudes. one can think that there woild be milon of clothong models on civitai
Deforum has it's own discord #1034941531762733167 message
Could someone point me in the direction of how the bot-1 is doing videos?
there are
look at that creator's profile, he makes a crapload of clothing loras
ebsynth
guys is there a possibility to add text that is not deformed on sd, not sdxl?
like for example "sprite" text
its always deformed on normal sd, prompts doesnt matter
You mean besides controlnet?
it can be controlnet, is it possible in controlnet? To generate text
SDXL doesn't do text good either, maybe you'll get lucky 1/10 times. You need to add it after the fact or with a controlnet unless you want to fight RNG
like character holding a sign with some text that im giving in prompt
always its giving me deformed text
how can i do this on controlnet? I only did poses in controlnet
Here's a good guide on that
https://civitai.com/articles/2746
TLDR depth and softeedge on low weights
White text on black background
Either that or add it in post
Method 2 is generate image without text
photobash text with above image
then use IMG to IMG or Inpaint the area with the text, low denois or with a controlnet
Okay now I have a question:
What is the best extension to 1111 for viewing your output images after a generation. The standard one sucks dooty booty for a lot of images as a gallery
I haven't touched SD in what feels like ages ever since DALL-E 3
I wish we could have something with the power of DALL-E 3 but the freedom and openness of SD
.
It's a great tool but belongs in the hands of someone better than Microsoft
Hi guys,.. I have SD installed (Automatic1111 WebUser),... it's on my Brave browser, runs fine. However, earlier tonight, I installed Opera browser,. (NOT as default browser),.. but,.. BUT,. now my SD won't load in Brave,. I closed (shut) Opera and it still won't load,.. any ideas what I can do other than remove Opera? Ok, I got it working,.. I restarted PC,.. launched the .bat file again and a pop-up asked which browser I wanted to open the URL with, I chose my Brave browser and hey presto, it loads.
You can set up which program you're using for each type of file in windows, but it's different in each version
so...you can just google how to change default file type for x in win x
So, if that pop-up hadn't, erm, popped up,. what could I have done (like, how, in Windows?).. ? Thanks.
Oh, in Settings,.. default file types?
It probably set itself up automatically...apps do that sometimes after installation
if it's win10 - yea
It didn't when I installed it (Opera),, but after a restart and lauched the .bat file, the pop-up came up to ask which browser I wanted to open the URL in
oh I see
W11
no idea about win11, just search how to set up apps for different file types in win11
it shoudn't be too hard
Yeah, ok,.. for future reference. Thank you. 🙂
Thing is,.. I doubt windows would know I had SD installed,. since it's not an actual .exe file
So it may not know a SD URL needed to be opened. Which in turn may mean there'd be no way to set a default file type..
You should be able to set app of your choise for any kind of file...bat, txt, rar...whatever
Stable diffusion isn't a file type,.. it's a specialist browser installation 'thing',..
technically it opens link...it should just open your browser
I just looked,.. and I can open a link type,.. HTTP,..
It's just a link to your localhost
So,. I guess that's how I'd do it in future.
the videos the bot is making. What kind of GPU do you reckon you need to run something like that locally?
Is there like a site to use while still be able to have workflows and get consistency cause I tried using comfyui and my laptop is just way too slow for it.
Really don't wanna be forced to buy a prebuilt pc or build my own just for stable (I don't touch pc games at all)
Just get a crappy pc with a good video card
And remote desktop into it
You can run sdxl on a 1080 ti for like 350 incl pc
Thaaat's gonna be slooow...
hey dudes idk if this is the wrong chat but i just had to completely reinstall my stable diffusion anyone able to quickly give me a refresher how to install cpkts?
or like how to install different things? idk im trying really hard to remember how this thing was used
i just cannot for the life of me generate anything anime
Im no expert so i might not be of much help but doesnt the model change art style the most?
i havent touched stable in like a year so im honestly just trying to remember half of the things
so do i just go to hugging face for different models?
I use civitai personally. Gives me a preview of the model results on the page and makes downloading simpler than Huggingface
but do i have the right idea?
ABout the models or the cpkts thing?
Yep
thanks
im trying to generate some art for a dnd game im going to be a part of and for some reason i thought to steal my character design from some anime game
another fun thing to do is IMG2IMG generation
I feed it mediocre art and it spits it out as professional art.
before i get on a tangent can i also drop in .json files into models or do i have to do something different?
Hm... im actually not sure i havent done that yet.
Sorry to be unhelpful
nah no stress im not upset
i have no idea either :p
last thing then you got any reccomendations for anime esque models then?
Yes
I use one called Manmaru mix but be advised it's more of a studio ghibil style than say a demon slayer or tokyo ghoul type of "modern anime" style
im going to keep it real with you i have no idea what any of those are
i am the last person who knows what anime is
Oh lol
Studio ghibli has a unique style that looks almost like a mashup between Pixar animations and Anime
very famous in the 90s
It's a great genre but it's also a very unrestricted genre
There are some I consider peak fiction and others that are so weird and odd that I would not reccomend any human ingest it
When you simply dont have cancel culture, censorship, or really any restriction on what can be done, sometimes weirdos are allowed to make stories.
i saw berserk i liked the big sword guy but it was messed up and i saw some nasty things i didnt much like
Berserk is a very mature and dark story. I would not reccomend it for a beginner, or anyone who is offended by 18+ sort of topics.
ah nah i liked teh idea of being a cool hero guy with a heavy sword im not trying to buy your pity but ive struggled a bit in my life and seeing a guy who keeps on kicking even when bad stuff happens is cool
but ive also had a couple of friends who are really into the stuff reccomending the most obscene things and i dont really want to be associated with it
Well it's not pity, everyone has what they can and can't handle. I like dark stories but there are a specific couple tropes I can't withstand no matter how good the story is.
kudo to you if you like the stuff just aint for me
he wanted me to watch some sort of healer type dealio and i almost threw up
it was the most vile
stories that put children at risk just to make you stressed, I can't stomach it. I dont care if it's a horror or action or whatever, its just sad.
Oh goodness, redo of a healer? Your friend hates you.
that is one of the only anime I rated 1 and wish I could rate -10 but the scale only went from 1-10
i dont like seeing evil people be evil im sort of an odd type of fella
that show is the most degenerate, most disgusting, most vile thing I've ever seen on TV.
i like the over the top dramatic hero good guy type beat up the bad guy because i like it when the good guys win
that could just be my hick nature coming out tho
oh yea i did see one other one that i liked
it was like a boxing type one where the guy didnt have robot arms when everyone else had robot arms
I'm an avid writer for a trendy website called "Webtoon", I'm not sure if you've heard of it since you havent watched anime you likely haven't.
im faintly aware one of my coworkers like that thing but id prefer actual books
what ya make?
Its basically the equivalent of Youtube but for comic creators instead of videos. A bunch of people make their stories avalible for free, in exchange if their popular enough they get an instant route to fame and money
I've made a couple quite popular comics. But just the storylines
Thats actually why I'm here, I'm using AI to give me examples of how to improve my own art so I can release a comic with 100% my own art for once
My self-rule is basically everything I publish must be done by my own hand, and AI is only to inspire and help teach me
I dont wanna cheat and just publish ai generated stuff
neatio i just have odd dreams i could make into stories that or i do a little dumb thing where i collect them
I've tried the dream route before... Unfortunately my dreams are really dark and morbid. To the point where I cant really make a storyline out of it cause its just too sad
this is extremely personal so dont you feel any need to awnser but is there a story that you want to share with the world that youve never told someone NOT because its a super private secret but because its almost been forgotten
thats the neat thing you dont gotta follow them 100%
heck i could give you some ideas for you to try and make
itd be nice to share some of my dreams
Hey, if you listen to some of mine, I've got no problem listening to some of yours.
neat
My current main project is funnily enough inspired by a Micheal Jackson song... or at least that's where it started, and once the ball got rolling it evovlved crazily and wouldnt stop rolling.
I was tryna think of a story that was inspiring but also made the reader feel empathy and thats when I was listening to Billie Jean and it frickin clicked
I thought to myself "Wow a legendary guy who is just tryna convince the world this aint his son... Whatabout the reverse side of that? How would the son feel, truly believing a legend is his dad, maybe even wth prooof, but the guy just says no?"
sounds interesting enough
i aint have enough talent to do the art part but i can write a heck of a story
I feel ya there, for sure. My arts getting there but ive got a lot of work to do
seriously friend if you want to ever talk about writing ideas id love to share some
Absolutely.
i just have aphasia cant see things in my head
Oh my.
That sounds difficult....
imean it can be but its also got its upsides
Though I will tell you what, it's somehow not impossible. I did know a fellow with aphasia who wrote and drew amazing things. I have no idea how, that sounds really tough to me.
oh no its actually quite easy for people with no mental imaging to express their talents other ways
My writing and drawing proccess is 100% just fantasizing in my head with images and emotion. So i have no idea how the buddy did it.
if you dont focus on HOW it looks you can spend more time focusing on the why or cramming more details into it
you know "flow state"
Wow
That actually is the exact advice I got for improving my imagination-drawing
i can sort of turn it off and on if i get into a rhythm most things happen by instinct so i may not be able to draw faces but the posing or bat crap creative stuff works wonders
i do believe we should take it to dms to not gunk up this chat if thats alright with you?
Yes
but yea for the rest of you how do i use a json in my stable diffusion
Ah fk did I miss the backdrop event
question, could I photoshop an image then use SD to correct the light and the color?
Why wouldnt you do the task in Photoshop? 😄
There are also auto settings for matching colors etc
Oh, light
Depends what you mean by that
I use gpt to translate, im not good at english. my question is " Can I use Photoshop to composite an image and then use Stable Diffusion to make the composite image appear less abrupt? "
What do you mean by less abrupt?
I choose a bg photo and a portrait, I want to make these two blend more smoother
I mean you could load a photoshoped image into A1111 and play around with a certain plugin for it for that but Photoshop is much better and efficient for that
Like the light source, and color correction
Yeah, Photoshop is best for that, there is even auto color correction amongst all for this
You can try it with SD but its really not near efficient as PS and i work a lot with PS
I havent try it with SD yet. I'm worried that SD would change the detail of the portrait or the bg
I saw an article talking about models for fine tuning and 3d model generation. Are there dates set for those releases?
Hey can I use the beats that I create with the stable audio and make a song and upload it on streaming platforms?
With PS if you have experience with it you can work easily with layers
Remember seeing minecraft build diffision a long time ago so this is not some new stuff
Hopefully, I can post this resource as it's currently free and I've put about 3 months into it.
Made this 2D Character Animator course that shows how to clean up AI-generated characters and then Animate them in After Effects.
Have a video that covers it in more depth here:
https://youtu.be/g1vBjfHYrGY
Udemy lets me do an initial free discount for 5 days to promote it. Figure it might be of interest to you guys.
The direct link with the coupon already applied is here: https://www.udemy.com/course/2d-character-animator-photoshop-after-effects-midjourney/?couponCode=A42E2BAB1066AA982B8C
There are a couple of pre-set stock assets that I've made in there too.
first time working with sd, wanted to try face swap/morph
wanted to use reactor, i tried installing it twice, even in console it shows as reactor running but i dont see it when i run stable diffusion though, can someone help
this is an offical model released by stable diffusion though
Does paper space allows uncensored usage?
can we use custom lora or models in dezgo?
Anyone seen the Deliberate v2? It seems to have vanished
@karmic brook did something get messed up with my perms?
Should work fine! Restart client
my image been generating for 18 minutes
no idea it was working before alright probably just bugged out
AttributeError: 'ControlNet' object has no attribute 'label_emb' - I have updated all my installed extensions,.. problem persists. 😦 I need to outpaint but the controlnet refuses to play Cricket. 😦 Image is a tasteful NSFW image but her foot is cropped out of the image. It's 1024 x 1024 but I think 1280 might be just enough (ideally 1800 x 1800 but hey, that's asking a lot). Anything I can do?
Can someone help me on Lora´s in SD? I cant see them all, I know there are tricker words, but if you cant remember that, what do you do?
hey guys ! how can i use the models from A1111 with comfy ui
i dont want to duplicate SDXL twice since its huge haha
Hey, yes in comfyui there is a file you need to rename. There you set the path to the auto1111 webui and it will use the models and upscalers etc
don't ask me how, but i managed to get AnimateDiff running on my 3gb card. my pc is not happy
hmmh where is that ??
thanks for your answer man !
I showed it in #🏞|general-with-images
ahh thanks man !
do you work in that domain ??
damn i cant find the config file xD wiat
ahhh i see the folder
and lots of yaml path
mann its complicated haha
ohh do i have to create a YAML file
No you just need to rename it
So that the file will be a .yaml
and im using a mac m2 pro and damn its slow af when i use comfy ui haha
Ah well. Did you tested auto1111 for Mac ?
yeah but not with SDXL
i added you btw !
Couldn't download any attachments for awhile but it works now
hiya
am here because im hoping amd gpus may have gotten a bit of love since my last attempt at getting stable diffusion to work
but i doubt it...
i remember looking into it and basically getting "yeah you can do it but you need to do some absurd roundabout command prompt magic and you'll never have a webui for it so good luck having any kind of accessibility"
Are there any fixes for the errror: "RuntimeError: mat1 and mat2 shapes cannot be multiplied (154x2048 and 768x320)" -- When trying to use Adetailer! on SDXL?
Someone told there's some other site to use that doesnt use your own gpu so you can actually have reasonable generation times.
i want a local installation though, so i can use my own data sets
my friend sent me a stable diffusion data set for my favorite video game character and i wanna use it but i have an amd graphics card...
hey quick question, how do i change like format and style, cus im trying to generate stuff and it randomly adds formats when i dont want it to
Or well.. let me ask about the root cause of my issue. How can I get perfect hands on SDXL?
What do you mean by format? PNG, JPG?
"TakeRep" command returned an error: strconv.ParseInt: parsing "--": invalid syntax
webui wont run, it has been running all day, then this happend, it loads Python once and it starts but nothing more, I had it running one more time after this, but now it is the same. Does any body else experience the same?
Could it be some extensions I just installed?
Hi guys, it's been a few months since I last used stable diffusion. Would y'all say SDXL has progressed to the point of replacing previous versions? When SDXL models first came out I wasn't too big of a fan of the quality, curious if it's improved a lot since then
SD needs a dedicated help channel,.. I've asked 3 important questions in here and only got help once. Edit Also, I see so many questions from other users, all going unanswered. SD, this is how you lose people using your product.
its a free product so you wont get costumer support here unless its from their paid services,only support u will get here is ppl who volunteer to help other ppl but they are not here 24/7
Good morning, everyone! How are we all today?
From where can I get set of styles ?
Hey, on windows there is the auto1111 directml fork for amd cards
the number of detailed prompts i lose because i forget to copy it before the "GPU's are on break" message yeets it into the abyss, never to be seen again 
wait so theres actually some level of support for amd now? 
whyyyyy does it keep randomly doing videos
am I misunderstanding how "negative prompt" works? I'm entering "torso, full body, arms, legs, hands, belly" into the negative prompt and it's still generating full bodies
You can change it from video to image under format
If you want to learn more about prompting, I suggest checking out #📝|prompting-help
Yes!
Its usable
What's your GPU?
rx 6700 xt
Ah nice. That will work. Tested the same card
woohoo! i looked it up on my phone earlier and saw it had lora support?
Yes loras work!
Yep.
Here is the guide:
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-AMD-GPUs#windows
For any questions feel free to ask in #🤝|tech-support
Is there like another site to use so I dont have to use my own gpu while still using workspaces? The generations just take too long.
hey all, is anyone using SD with objects like soda cans? ive been able to get controlnet and use it on people, wondering if i can do this with objects
PLEASE , I NEED a HELP . Does Anyone knows some PROMPTS that I can to use correctly for to create some images of the female characters " WITH STRECHED ARMS TO THE FORWARD " more or less " as if wish to be embraced " on Stable Diffusion ?
idk maybe this lora will help https://civitai.com/models/43366/conceptoutstretchedarms
Hello, why do I get a video and not the image?
So what is a Lora? Like a mini model? Is this like the difference between niji expressive cute scenic, or more for specific stuff that models have trouble with
How can I register on stability.ai?
Hi all. Sorry if this is not the right place to ask, but how do I go about training a model with my face? I’d like to generate images of myself but I am unsure where to start. Is there a way for me to train a model easily? Any recommendations on how to train one with my face?
Search for lora, lora training, that's what people usually do if they want to add someone's face
Thank you friend
Guys, anyone specializing in models modification? I need to ask.
Also, where is the artist channel?
What
Hello here
Good morning, everyone! How are we all this morning?
hello everyone, is there any better suggestion for command line parameters for M1 mac than those I currently use "--skip-torch-cuda-test --upcast-sampling --no-half-vae --use-cpu interrogate --medvram --opt-sub-quad-attention"
sorry, i am talking about automatic1111
I would actually ask this over in #🤝|tech-support !
thank you
We've come to this. With the latest update of A1111, SD no longer generates images. It runs for a few minutes, and then it throws a tensor error.
i had that error a few days ago (i am a total beginner) but I fixed that without a problem. i can't remember if it was on windows or macos. which one do you use?
and what was the exact error message? developer in me complains about the incomplete question, but a fellow user in me understands your pain
windows
nvidia with 6GB or less VRAM and SDXL or 2.x model, I presume?
"NansException: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try setting the "Upcast cross attention layer to float32" option in Settings > Stable Diffusion or using the --no-half commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check"
just add --no-half in command line
Why would I have to do that now, but didn't have to do it before?
https://generativeai.pub/how-to-solve-stable-diffusion-errors-performance-tips-included-fb1fb1a03ad6
this article might answer some of your questions. for technical details, you can ask in support channel
hey, with my friend we've been building Social Network for AI-creators. You can create art, music and text together in public chats. And share it on your Public profile.
And we've just launched Early Access here https://aiphoria.world
If someone want to test and be one of the first, just DM me or apply on site! good luck
Does this line go in webui-user.bat?
--no-half
you can add it here in that file here
set COMMANDLINE_ARGS=
and run webui-user.bat
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=--no-half
call webui.bat
export COMMANDLINE_ARGS="--skip-torch-cuda-test --upcast-sampling --no-half-vae --use-cpu interrogate --medvram --opt-sub-quad-attention"
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=--no half
and just run webui-user.bat
this is what i have on my mac
do not use those
OK, now it won't run at all
error?
Hold on, I will check. This is when I did:
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=--no half
do the test manually from cmd
liek this
webui.bat --no-half
then check the errors and fix one by one
when you are satisfied add them all to webui-user.bat
launch.py: error: ambiguous option: --no could match --no-half, --no-half-vae, --no-progressbar-hiding, --nowebui, --no-gradio-queue, --no-hashing, --no-download-sd-model
you missed the - in the middle
Yes I did
--no-half
btw, please go to tech-support, i will need to go soon, and someone there will help you even better than i can
check out this too https://stable-diffusion-art.com/install-windows/
Sure, but I've been using this for months
if you have nvidia with at least 6GB like 1660 for example add this --xformers
these issues began after the last update
it will make it work faster
but you will probably need this one --medvram too
than forget about --medvram
and be sure to add --xformers
webui.bat --xformers --no-half
run this
is it better?
pls dont use --no-half with a 4070TI
I have to noiw
its not needed
without it, it crashes with a cuda error
this is the reason i told him to ask in support
thanx @warm junco
your card only needs --xformers --no-half-vae
the reason you get an error is you probably try to load a lora as model and that wont work
i didn't know which card he has until a few second ago, i thought he has something like 1660 that needs --no-half (based on the error)
true for that!
i am using --no-half-vae on ma mac m1, but i had to use --no-half on my sons 1660 super
yea the gtx1660 need it unfortunately
it needs almost the same time as m1 to generate SD 1.5 image 🤷🏻♂️
yea macs arent that good fo SD either 🥲
yesterday i helped someone with an M2 chip
he generates an SDXL image in 1:30 like my old GTX1080
thats okay imo
for an chip that doesnt have dedicated vram
please take a look at my command line parms in support channel, and tell me if you have any advice
I am using a model I've used in the past. It''s a model.
well this didnt go as expected
which upscaler is best for photos?
Guys?
@austere marsh may need to ask a quick question.
What happened to the artist channel?
It was archived, i think due to low activity and a need to make room for other channels but i don't entirely remember
if you react to the community archive access in #👥|roles you should be able to see the archived channel if there was a particular post you were looking for
Hey guys. I'm wondering what the current state of AI is in regards to consistent character generation. Is it at the moment possible to use a dataset to define a character look, and have SD generate all kinds of scenarios and environments with the character remaining the same?
If you want more information on how to go about this, you can check the #1080946152318443610 and #🔧|finetune channels!
Thanks!
np!
Okay, so I've tried getting a good grasp of the different approaches after looking into the resources etc. I'm just gonna ask this:
I want to create a workflow where I can block out 3D scenes in Blender and output a render, then run it through SD to add the desired style, and replace the rough character with a trained consistent character. Is there certain methods that would be obvious to use of this process?
I've been looking for near months now trying teh various incarnations / frontends for SD/SDXL and i'm not really sold on the ones i've found. I started using the standard method of an installer script to install a1111 gradio interface which served me well for a time. But then I started poking around the various other interfaces like invokeai, SDG, and most recently stablematrix which in-theory sounded great, and it has lots of ease-of-use features which are attractive / convenient.. but what with the inference module within the GUI, the discrete launcher concept, with multiple shared models/checkpoints it all gets fairly unwieldy IMO. I've now begun running into issues involving directories not being synced and LORAs / checkpoints not loading within the various frontends. It all just seems like a huge mess to me. and I'm wondering what is the BEST, most straightforward, least-sprawling local option to use? A large part of the problem is having to go into the various envs / packages in order to modify extensions, or forcibly remove things that have caused the app to break and/or not launch a111 in particular. a headache. and another thing is that i was having many issues saving generated media locally. Like I could not drag from the image browser/gallery to my hard drive, could not select multiple images at a time and save as... etc etc. But this cannot be the final solution it touts itself as .
thanks
Hey, Stability Matrix isn't a webui by itself. Its just an installer for the different webuis. (With some additional features).
I wouldn't recommend it exactly because if the issues you got. Its just another point of failure in the chain.
Better to install a webui from its github source. And then install extensions for additional features.
The best option to use rn is auto1111
Good android games like mc and geometry dash pls
Can we disable the video and stick to static images on the bot image generation?
It is clearly having an impact on performance
sussed it thanks for the prompt help lol
So, auto1111 is probably where I should have stayed, then. What is the best way to install that locally? I have been very lazy using installer scripts. And they're mostly failsafe it seems. But it might be good to have something that's not just a floating ENV in a directory, right? Should I Start all over , wipe my miniforge lib, etc? And what is the best place to install would people say? Probably cloning the git to the users folder?
curious if defocus is the only option for animating? I imagine not..
does anyone know if stability plans to release a diffusion model beyond SDXL 1.0 soon?
Why does sd keep trying to download the default checkpoint when I already have a bunch of .safetensors in the folder?
How to fix? Im on linux if that matters
Just follow the guide for your system/GPU https://github.com/AUTOMATIC1111/stable-diffusion-webui#installation-and-running
How to change model ?
Can I ask a 'noob' question. Using Auto1111 web UI, Is there a way to go back to previous generations and prompt settings from a previous images generated in a single session, or is it once you make changes, and generate a new image, anything gone before is gone? (other than the save of the images)?
The main way I know is to load the image you want into PNG Info and send those settings to the UI.
If you use a certain set (or sets) of settings a lot, you could also try the Config Presets extension which allows you to save and load settings. It's highly customisable too, allowing you to save exactly what you want, not just everything. It's one of the best extensions IMO.
https://i.etsystatic.com/24542731/r/il/3507f8/3885724701/il_794xN.3885724701_t9xa.jpg create a similar, childish,Simple image (╯°□°)╯︵ ┻━┻
/promt
/prompt
hi, guys can anyone recommend any creators or courses where i can begin learning about SD? Im new so would be helpful to hear from experianced people)
i just found this can someone explain me what this is i comparison with midjourney ?
and is this free t use ?
can you also do all aspect ratios ?
and based on which models is this trained ?
is there any website ?
so many questions im sorry im hope someon ca answer them
Anyone ???
anyone got a jupiter lab i can run for vast ai?
I don't understand your question!
I asked multiple but the first thing what Is this server is this the same as midjourney?
What's the difference ?
@topaz parcel
Stable Diffusion vs Midjourney - Side by Side Comparison https://www.youtube.com/watch?v=Ytzs7bauQj0
Is this opensource ? Thanks !
Are there any colabs where I can train my own models with this engine ?
@topaz parcel
Stable Diffusion is licensed under the Apache License 2.0, which is an Open Source license. Although Stable Diffusion is open-source and has a permissive licence, there are some restrictions.
thanks
@topaz parcel thanks but can you use this model on your own server ?
And are there any documentation about this
And can I finetunen with my own hardware ?
@topaz parcel
And if I can finetune it are there any Google colabs that I can use to try and research it ?
@mild ferry <@&1112843154362740796>
<@&1025179534330433656>
@still glacier @elfin dome
@bleak matrix
@fervent thunder
Pinging everyone in here is probably not the best way to get people in a mood to help you.
Also, I don't use google colabs
@wind glacier that's really lame. i would offer to help you out, but you seem to think everyone is at your beck and call. So if i start helping you seem like one of those people who would latch on but then also argue because you know better
colab bans people trying to abuse their services anyways. it's meant for students of machine learning to learn the concepts and understand things. It's made for experimentation. dropping a premade script someone else produced into it and expecting it to push button go, is against the spirit of colaboratory
So Easy Diffusion is frozen - no new features will be coming unless someone other than CMDR forks it and takes up the mantle. I will miss it greatly. I know it will continue to work, but I'll miss the excitement, remembering back when updates were almost weekly and full of new goodies. CMDR recommends Fooocus or Invoke. I've got them both, but theither seems to fill the void. Any suggestions?
i just stick to automatic lately. invoke is nice for sure but they don't make their ui too extensible and they seem to be late with updates. I can't use animatediff with invoke still
hey guys, I'm trying SDXL for the first time after using SD1.4. It's taking several minutes to do an image with my 3070 with 8GB RAM. SD1.4 was a few seconds for me, is that normal switching to SDXL? I'm using automatic1111.
@pale latch I'm sorry battery died in sorry I'm just very impressed and I need to know everything about it I really want to fine-tune my own models an I hope this is the place ?
I don't abuse the colab it's just to learn then I will run on my own hardware
It really own but my AWS servers
It = not
@zinc turtle
You can learn about finetuning, etc, by going to #1080946152318443610 Please do not keep pinging people. Thank you!
@bleak matrix thanks sunny
np!
Hi, is it possible to download all controlnet models for SD1.5 and SDXL from one simple source? I used google but not found all-in-val source.
Just noticed that I only have 2GB VRAM memory.
Is there any way to get around this restriction, maybe exchanging it with slowdowns instead?
(I would prefer minutes of waiting actually to not getting pictures at all)
what python version does stable diffusion use?
3.10.6?
because it's not working somehow
What is the error message?
D:\stable-diffusion\stable-diffusion-webui>git pull
Already up to date.
venv "D:\stable-diffusion\stable-diffusion-webui\venv\Scripts\Python.exe"
No Python at '"C:\Users\Metability\AppData\Local\Programs\Python\Python312\python.exe'
Press any key to continue . . .
I downloaded 3.10.6 and i checked the folder and it's saying 310 instead of 312
do I change the name?
do i download 3.12.0?
@regal meteor i fixed it
delete the "venv" folder and run "webui-user.bat" again to create a new "venu" folder, then the problem is solved. It took some time, but it worked
here
All the license stuff is confusing to me.
Is there someone here willing to answer a few questions for me?
(Which license in #1080946152318443610 covers the bot-channels?)
How accessible is stablechat to write blender addons for example?
could i pay someone here to make me a perfect prompt for a realistic girl?
Where can I find a guide on how to use the sdxl code download?
With google colab (not webui), is there a way to draw a mask inside google colab, or must I use separate program and upload it manually?
Hi guys, question,
is it possible to have a style and a character(lora) in one checkpoint?
there's an error when running webui-user.bat
You can post the error in #🤝|tech-support
For WebUserUI A1111: Has anyone found (or got that you'd recommend, obv') a SDXL (not 1.5) Checkpoint which is really good for producing period piece imagery,.. classic style,.. medieval, renaissance, French revolution, Battle of Waterloo, Napoleon, Marie Antoinette, WWI, WWII, etc.
I know about the Renaissance Checkpoint but that's not for SDXL AFAIK.
I currently have: JuggernautXL, CopaxTimelessSDXL, Cyberrealistic, Dreamshaper, NightvisionXL, ZavyChromaXL, SVDN6RealXL, PirsusEpicRealism, Reliberate, Retromix, SDXLV1VAEFix, RoXL, RaynaeVisionXL. Photon.
Good morning, everyone! How are we today?
I created a GPT that allows you to use SD in ChatGPT: https://chat.openai.com/g/g-oCMGQZu95-stable-diffusion
Just dandy,. my dog and I found an empty cave of gold, my wife's bringing 4 of her best friends over for drinks later and I found the lottery results for December 2023. I'm winning. lol
Dandy like candy, then!
I'm eating my bowl of cereal, ready to plot my way into whatever it is I'm doing today.
Oh, so plotting for world domination,... after your cat gets there first of course.
Not working, tells me it's not ready and to go back to my ChatGPT.
I am pretty sure I know which one is going to take over first
You better get ready then, your cat won't let you use its favourite seat again. 🤣
I don't think it's the seat that's the issue. Rather, the bathroom sink, or the tub. Hahaha!
lol 😄
You know, one of those cats that likes water.
And I suppose the resident mouse hates cheese?
Unfortunately, I'm afraid this is also the case.
Mahahahaha!
The land of opposites, doncha know?
lol. I'm so sorry, my deepest condolences.
Hi uhh
It's all good! It's entertaining.
Hello!
How do i make consitent character ?
How? Does the mouse give rousing speeches to tell the world (read: lounge) why he hates cheese?
A google search would have provided some help: https://www.youtube.com/watch?v=WQsUeyzq-VA
Oh Yes thank you, ive searched video like that, but it is not show up in my page 😅
Thank youu dude
No worries. 👍
You can check the info in #1080946152318443610 for more information on this as well as #🔧|finetune
I imagine so! Beatrix Potter style, with an immense amount of watercolor, really.
Do you have chatgpt plus?
No. If I did, then I assume your CGPT would work. Sorry, you might want to specify that when you post a link to it. 🙂
Thanks for your response. I have no idea what OpenAI's plan is, if they will allow non-plus users to use custom GPTs soon. I really hope so. Or that I can pay a fee to make it publicly available.
AFAIK, the company operating ChatGPT is losing SERIOUS money every day to run it.
Interesting, I am not familiar with that. They were however the history's fastest growing app. So I guess they are ready to take a loss.
For WebUserUI A1111: Has anyone found (or any that you'd recommend) a SDXL (not 1.5) Checkpoint which is really good for producing period piece imagery,.. classic style,.. medieval, renaissance, French revolution, Battle of Waterloo, Napoleon, Marie Antoinette, WWI, WWII, etc.
Which bot is the best one to create a sorrowful image having proper teary eyes
It never creates tears
how to get sd without my pc explode
sdxl blurry
What should i do with all generated images. Publish on website, blog, children's book, book. Any suggestions?
nightvision
I'm using Nightvision and I just cannot get Paul-Emile Becat's pencil and watercolour style. *Note,. singular. Style.
I also can't find a Lora for his style either.
Yes it can be hard to tweak exactly as you want. But i find it to perform quite good with my historical images.
I want to imagine how it looked like in old times. photostyle
I find it incredibly hard to GET a BASIC style and tweak it toward what I want,. there's no guarantee Nightvision has the data it needs to reproduce his style (not content, that's something different).
i see. think i answered wrong. you meant historical artist styles 🙂
Yes,.. though some of the figures and interior design also (to a degree)
Most Checkpoints are portrait oriented, so tend not to grasp I want a small sketch of 3 characters, created in his (or other artists') style/s.
agree. its hard to not get a portrait style of picture. but its possible with a lot of work of course.
This is why I think SD needs to do away with singular downloads and have 1 massssssive database of ALL the datasets so that we can use it like Midjourney
sounds good
You can never be too sure what data went into a checkpoint,.. and,. let's face the obvious,.. 1 file,. is,... 1 file,.. that leaves out a TRULY EPIC amount of data that ISN'T in the 1 file.
It's like trying to build a house with ONLY 20 matchsticks,.. no glue, no more match-sticks, no nails, no metal, no glass,... etc etc etc.
If I asked Midjourney to render me a beautiful woman in Picasso's or Frank Frazetta's art styles,.. it's almost guaranteed I'd get an accurate result for each.
For a lesser-known artist,.. in SD,.. no guarantee whatsoever.
i see. still i am amazed by the output i get but haven't tried Midjourney
Midjourney, will, I admit, likely blow your mind.
It's a million times better than SD for its data.
*if the only thing you want to do - generate image and...leave it as it is.
True. Editing the images in it is,. well, basic, at best.
And time consuming too.
what plan is best= 10 /30/90 a month?
MJ is good at generating...
SD is good at tons of other different functions, and community content
90 I presume is the best for getting the most out of Midge, but damn that's expensive.
I was on the 60,. but they've increased the prices. 😦
thankss
£60GBP per month... I don't know if you meant $ or £.
Dreamshaper,.. ha,. almost all I get from that is cartoon-like and very rubbery ugly looking figures.
Will look at Dynavision (is it SDXL or SD 1.5?).
Ahh,. it's not SDXL. Bummer.
Dynavision SDXL 1.0
Nightvision is persistently giving me its own 'oil-painting' renders,. despite my best efforts to make it into a competent and expertly created mature (as in skill, not content) pencil sketch
On Civitai.com?
yes
Author says it's: " stylized rendered anime-ish".. I definitely don't want ANYTHING anime in my work.
dynavisionXL try : eastern bloc hydraulic gopnik, zulu background, rococo watercolor painting, highly detailed,dreamy, a busy street in Paris, a cafe, people in high hats
That's a big download to just test lol
nightvisionXL. or : RAW photo,full body shot of a drawing of a man walking to his cabin door to open it and a bird on his shoulder, storybook illustration, a storybook illustration, serial art, digital Aquarell painting
yes it is 🙂
I can't see a Lora I downloaded,.. it's a pencil sketch (Anders Zorn),... in my list of Lora,.. any particular reason why that might be?
*Yes, it's in the correct folder.
not compatible with sdxl?
The civitai page says it is
i get that some time with loras. works with one model but not the next
can i ask? what do you do with all your renders?
I only 'accept' about 1%,... the ones that are really good. Only about 50% of thoese get printed and only about 50% of those get put in my soon-to-exist portfolio.
nice. so you print ( tshirt?) the ones you like
Prints, on paper.
square on A3 'square' format to go into the books (2 books).
Most are midjourney renders, some newer stuff is SD
Well, thanks, yeah. It's a damned sight cheaper than photography which is what I had hoped to do but can't
why not do both?
a artist did a exhibition with photos taken with mobile phone
Photographic portfolio,.. done properly,.. several thousand pounds,... MJ or SD,.. a few thousand.
Cool. 🙂 Cya. 🙂
👍
what model is the bot currently using? and is it the same model for video as well?
(assume a different model, or something else it does by generating multiple images on the same model?)
is there something like reposer for facial expressions? or what would be the best way to handle that?
Hey everybody, would anyone be able to help me out? Ive been trying to make an AI pic of "a kobold wielding a scorpion on a stick battling a beholder in a cave" in that popular 1970s dark fantasy style, and I can't get it to work. If you have to the time / want to help can ya DM me!
And also, are there any tips or specific AI models that you can create a detailed description of multiple characters, or like a recurring character that can then just be kind of name-dropped?
What is reposer?
hm i thought it was an extension but looks like it might actually just be a comfyui workflow, my bad
@left star Might be off-topic, and I never used it either, but you might like to use GFPGAN
I happen to need similar feature, but for anime faces, do anyone know about it, or does GFPGAN also work for that too?
that looks interesting i'll check that out, thanks. https://github.com/nerdyrodent/AVeryComfyNerd If you look at the reposer images there you can see the workflow that shows how it takes the face from the first image, the physical pose from the 2nd image, and the outfit from the third image. I'd like to figure out a way to incorporate a facial expression into that too but I'm still kind of new to all this
Oh, I thought you want to fix faces
not exactly but it's related and could be useful
hey guys, I used to use stable diffusion 1.5 locally in my computer, then took a long break. what is happening right now? What's SDXL? and where can I find more news and how to install?
I greet everyone
where did deepfloyd go?
Uhh...I don't know where I should bring this up so please don't ban me, it'll be like pulling off a bandaid here goes: SD doesn't like gay kissing
it just makes the people each kiss the opposite gender
like, you want 2 people but you get 4
any ideas on how to uhhh...compensate for that prompt wise?
uhh..you also can't put men in lingerie...
there's probably models or loras for that
Hi folks I don't know if I am in the right place so sorry if I am not.
Can someone help me identify the model/Lora for this style ? Thank you so much .
o/ how to train SDXL with 24GB ram?
good morning! How are you all today?
can i add @radiant meadow to my server?
Sdxl is a new model, trained on images with 1024x1024 resolution.
You can simply download the model and put it in the same folder as the others.
But before using you need to update your webui and if you have 6-8gb vram you need to tweak your webui-user.bat
You find help for that in #🤝|tech-support
Anyone know any ai that can listen to a song and tell me what piano notes are being played? I've tried a couple but it's only the chords not the fingering for the notes
Hi, can someone please explain to me the relation between hires fix and base model? Say I want to render at 512x512 and hires upscale by 2. That works good, but if i increase hires upscale to 4, things go weird. From my messing around i found out that for upscale 4 you need 256x256, so what is the point? I'm kinda new to this.
Lower denoising of highres to 0.49 or lower. If you go too high (like default 0.75), too little info from low resolution image goes to second AI pass and lower base resolution of model than rendering resolution will cause repetitions of prompts in image.
ahh, ok, it looks good now, thanks
░░░░░▐▀█▀▌░░░░▀█▄░░░
░░░░░▐█▄█▌░░░░░░▀█▄░░
░░░░░░▀▄▀░░░▄▄▄▄▄▀▀░░
░░░░▄▄▄██▀▀▀▀░░░░░░░
░░░█▀▄▄▄█░▀▀░░
░░░▌░▄▄▄▐▌▀▀▀░░ This is Bob
▄░▐░░░▄▄░█░▀▀ ░░ Copy And Paste
▀█▌░░░▄░▀█▀░▀ ░░ in Every server
░░░░░░░▄▄▐▌▄▄░░░ So He Can Take
░░░░░░░▀███▀█░▄░░ Over Discord
░░░░░░▐▌▀▄▀▄▀▐▄░░ (don't spam him)
░░░░░░▐▀░░░░░░▐▌░░
░░░░░░█░░░░░░░░█░░░░░░░
░░░░░░█░░░░░░░░█░░░░░░░
░░░░░░█░░░░░░░░█░░░░░░░
░░░░▄██▄░░░░░▄██▄░░░░░░
yo
Hi
I don't know if it's the correct channel to ask this question
I'm a total newbie to stable diffusion
I'm searching for an api solution that takes as input an existing image and a prompt and change detail of the image, like change the hair color, or the background
Tried the sandbox but it return only GRPC request failed message
Is SD the correct solution for this job?
is there somethings new to know about SD or it's still the same since fews month ? there were any update or something ?
aside from 1.6?
1.6 is tha bomb diggity. Never looked back.
ReActor also launched not long ago which is a godsend
I mean sd is still the same since i use it, there had been never news things since ?
Does anyone know some ai that find images on the Internet with a similar Strukturen like Mine?
i'm new too but i think using controlnet with SD would be able to achieve this
I've seen some services that are made to do that...but I never seen one that's actually good.
https://deepai.org/machine-learning-model/image-editor
You'll probably have to make your own thing if you want it to be good.
just got stable diffusion to run on my macbook
Hey guys! If you have any time could you please check out this user study I made for CVPR? I'm trying to get some human-evaluation results for our new dataset, and we're doing a last-minute user study by the submission deadline Friday lol https://forms.gle/wKPFtTm8178adL1r6
It's a new massive image generation dataset specifically made for generating transparent images - we're hoping to release it to the public soon
Right now I'm hoping to get at least 20 people to do this study...it's not going to be very large so every person that fills it out helps tremendously
If anyone wants access to the dataset early please let me know!
https://chat.openai.com/g/g-BSGm5ucqy-visual-prompter-version-a
https://youtu.be/hXXFiw8QH9k?feature=shared
This will give you a variations of image prompts and also let you test the prompt in DALL·E
This will all so create a promp from your Description.
Make sure you log into Chat GPT-4 FIRST then go to the link, you may need to refresh the page TWICE there's a slight bug. any question go to the invoke discord #1050123398342250526 message
Anybody here using Fooocus?
i there an easy to use ai or model to touch up images? i went on vacation and I have several images where i'm not looking at the camera that i would like to fix.
anyone know how to do clothing change or undress on a image
hey guys, someone have a link for the dreambooth discord server please ?
Hi. Longtime NMKD user. I'm wondering what are people's favorite GUIs?
I tried Auto1111, and it was ok . . . but I'm looking for something even deeper and more comprehensive.
Join over 10,000+ readers to become smarter about AI with new tips, tricks, and hacks - all in a free bite-sized 5-min newsletter. Read by subscribers from Amazon, Anthropic, Cohere, and Google.
Subscribe and invite friends to access leaderboards for top AI courses, research reports, events, stocks, investors, grants, repos, AI tools (including this one), and AI tech stack companies.
hello guys a few months ago i was using SD webui automatic 111 on google colab...now i tried again but it's not working i tried running the last notebook from "the last ben" but some models do not work , for instance sdxl and protogen and dreamshaper xl....they all give garbage output...only dreashaper works...what could be the problem? anyone using automatic 111 on Colab here?
Anybody here using Fooocus?
Besides using Linux, is there any way to use AMD GPU on A1111?
I'm currently running off my CPU
directML but its about as slow as a cpu. you should just realize that linux is another OS, not something scary, and just use it. Don't let microsoft indoctrination control you, well, completely anyways. microsoft is still pre good
do it. use arch. go full linux. make windows the dual boot and linux the main. dooo it
/whispers do it
kde plasma, the windows wobble when you drag them. wibble wobble windows! i'm not even joking bruh
https://youtu.be/UJtAnZ917O8?t=43 lookitthis
@craggy tartan ||do it||
Just tried SDXL 1.0 1024x1024p and it took like an hour
I am trying to understand if this is my fate running on an AMD Ryzen 5 3400G with Radeon Vega with 32 GB RAM
Or if it is due to me having the wrong torch or xformers installed, or something else
Anybody who knows if this is to be expected, or am I doing something wrong?
guys how can I generate image using 2 different lora's (celebrities)?
the trigger word for both is sks person
cant make it work
I did DirectML 😈 It is way faster than CPU liar
use comfyui
it's faster
might take some time to learn how to use it though
using sdxl as a base and img2img sd 1.5 takes about 1 min per gen
Does inpainting require specific LORAs, or you can use any LORA during inpainting?
compared to linux its like , the little dml that could
might as well be cpu tier
get on rocm you'll be like "woah!"
i was generating 1 image per 7 minutes on CPU. I'm generating 1 image per 12 seconds now
i did
Would this be the right channel to ask a question related to creating custom models?
If not, where would be best?
Latelyy I've been getting the word "arafed" when I do Interrorgator. Can't seem to find any solid definition via google etc. Anyone know what this word means in relation to prompts?
You could try the directml for for amd
Anybody here using Fooocus?
Boys went from directml in window on my 6700xt, then Rocm in Linux and that was a big jump, but prefer windows then I got a 4070 today. Good god so much easier and so fast.
And all my generative ai stuff I’ve been learning for work…. So much easier. AMD needs to step it up
what model are you guys using on invoke AI?
Not using invoke or whatever foocus is
a1111, comfy
is that the best?
results shouldn't be affected too much by ui anyway
that's most commonly used I'd say
where can I ask a question about comfy ui
is it worth to switch to linux for the performance increase?
fooocus here
it should. invoke use different grammar and results will be out of order when copying civitai prompts/data. also i find images from invoke in most cases less saturated than a1111 even after I double checked vae things. besides, fooocus use inner GPT to handle inputs and no matter what i type, the results are always not bad
idk about getting exactly same results, but while I was trying out invoke it wasn't that bad, I noticed weird thing tho...
Maybe it was old version or maybe I had something bugged out, but it used DDIM nomatter what I chose
Which is fine for testing things out, but as for getting actual good image...nah
sd models have too many magics behind different impls - but what is clear is that if i just copy the prompts from civitai into invoke, it should not work since they use different grammar system and most existing civitai posts are using a1111 grammar like (red:1.1)
I found out that the term "execution" is on the list of prohibited words for use with the Stability AI API. Could anyone please provide me with the complete list of banned words to ensure I adhere to the guidelines when making requests?
Hey! What do you think about my design for a 2d game made in AI? I would appreciate your feedback!
[iOS: https://apps.apple.com/us/app/cat-journey/id6471379757]
[Android: https://play.google.com/store/apps/details?id=com.yaskravicontent.catjourney]
https://youtu.be/tTceryLuPVE?si=PxvdEff2q_u3w0J3
Good morning, everyone!
My image generation is taking too long
like hours
do you have any idea why?
Will Stable finetune be a FOSS application?
how do I check if my stable diffusion is auto updating?
I keep a site on AI/SD latest is LCM speeds up image generation many times some links to it here https://start.me/p/xb4Npa/ai
if it says "already up to date" in the first lines after starting the webui-user.bat
or if you edit the webui-user.bat and find git pull in it
ok thanks
Greetings everyone, Pardon me for my intentions.
I am a software engineer in pursue of producing a AI based application
And I am deeply in need of a part time designer ( 2d )
If anyone is looking for a part time job, My DM is always open.
( You do not have to be a professional designer nor programmer, since all aspects of QT designing envrironments are user-friendly and made easy. )
https://youtu.be/0pGR6JJk2n8 Guys I asked AI to analyse Interstellar. Some really dififcult science concepts broken down so did a great job. Let me know what you think
With what tools can I create a virtual assistant that answers my questions? or who has experience doing it? It's for a job.
Where can I download models?
which version? i just now used 803 it is good and fast
2.1.803
are you using 20xx or 30xx or 40xx cards
https://youtu.be/0pGR6JJk2n8 Guys I asked AI to analyse Interstellar. Some really dififcult science concepts broken down so did a great job. Let me know what you think
you mean gpu?
Im using 2070 super
idk, but maybe a victim of nvidia driver, try studio driver 531, and a1111/comui also need it
Can someone direct me to a place where I can ask questions? I have just installed the software and the batch file says press any key to continue and then just disapears
https://platform.stability.ai/docs/release-notes#stable-image-v1-release
so does anyone know if/when we will see model weights released for this SD 1.6 model?
There's a 1.6 model? News to me...
on civitai.com
stay tuned
What about hugging face?
there too
but on civitai youll find more stuff and better visualised/sorted
Am I missing something or is the model a1111 and comui a completly different program or models you need to add to your existing program?
Auto1111 and comfyui are both Webuis (Programms) to use Stable Diffusion
The models itself are .safetensor files
These need to be downloaded to be used in a webui
I found a nice suggestion from someone on Reddit on how the next SD could beat DALL-E 3:
Just in time, please reserve at least a million for this:
How a new SD could beat DALLE-3 at the base level without any extension
A full training on the biggest LAION instead of a smaller LAION subset -> (1M$)
A new CLIP -> (*LLaVA or even better)
RLHF training using vision feedback loop concept -> (idea2img https://arxiv.org/pdf/2310.08541.pdf
Sounds like a nice idea for @wise stratus to consider when SD3 is being made IMHO.
oh
Do you know what's the best model for Invoke AI?
I am using that atm
actually what's the best model to make a logo
can anyone explain to me how to train it via this guide? It's so messy but results from this guy are great
lmk
I'm using Fooocus in Colab. I'm new to all of this, so just trying to find my way around.
Hello I need help with using SD on CPU
I'm getting "Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check"
i setup automatic1111 and it worked okay, but when i installed control net, and installed the models for it, and placed them in the models folder in its exetension folder, and opened the web ui, i can see it, it shows me the preprocessor and stuff, but when i click generate its like its not using controlnet at all, and when i open settings and go to control net, not all the settings are there, 2 are missing which are the config file for adapter and control net models location.
Hmm anyone interested in AI image generation models
Looking for collaborators 🗝️
Did you add --skip-torch-cuda-test to the commandline arguments? My full command to turn on automatic1111 with cpu uses --use-cpu all --no-half --skip-torch-cuda-test
Does anyone have any additional information about SD 1.6? https://platform.stability.ai/docs/release-notes#stable-image-v1-release
WHEN CONTROLNET FEATURE WILL BE AVAILABLE
What do you mean?
One message removed from a suspended account.
Any info on SD 1.6 beyond the release notes here? https://platform.stability.ai/docs/release-notes#stable-image-v1-release
Strangely quiet!
Is there any realtime sketch2img app somewhere ? Where you have a context prompt and then draw a sketch for controlnet ? (a kind of img2img)
is clipdrop paid now?
anyone know how the taylor swift x oscar the grouch NSFW image was made like what website cuz it was really well done but every website ive tried is bad
trying that
add that to which place exactly @potent frigate
an do you add wuotes
quotes*
type into your command line "/Path/To/Your/webui.sh" --use-cpu all --no-half --skip-torch-cuda-test
And that should work. Maybe.
hi all I'm just wondering if these tools like stable diffusion, midjourney, etc generate high enough quality / high resolution images that I would be able to take to a print shop and print in a large size for wall art, like 30x40 inches or greater?
that fixed it, thanks
Do we need to install the CUDA toolkit to benefit from performance or stability/compatibility (using comfyui or a1111 on win11 or WSL2-Ubuntu)? And are there any benefits of manually replacing the 7 cudnn files, under the site-package/torch/lib path, with updated releases?
Of course. It (SD) has the potential to upscale to any resolution, just depends on your computer's horsepower. You can also use other upscaling methods like with Photoshop or Gigapixel to get to your desired resolution.
100%, there are very good upscaling solutions.
2060 super is still a decent gpu right?
The video features is a little disappointing I ask it for a car with flicking headlights and all I get is a simple static image that pans left to right
I have a few hundred images I need to run through png to text2image with a hirespass on. Is there any easy way to automate that with auto1111?
Img2img with batch folder input would help
Аааааа
I trained my lora for 2400 steps using 34 photos and i used it. man, my pictures looked worse with this lora than without. What should i do, train more?
Hello everyone,If I want to generate some 3D game scene concept, what should I do?
pick model and prompt what you want on scene
Good morning, good morning! How are we all today?
como cria imagem ?
depends on your needs tbh, getting into SD was what got me to upgrade from my 2060. one image takes 1-3 minutes to generate atm, but i will also say that i didn't try to optimize performance a whole lot
Hello everyone! Where can I find an overview about the frameworks and tools, like Stable Diffusion A1111 / Stable Studio / ComfyUI / ... maybe a table or so?
I once read about an option where you can generate an image where a prompt tag is only applied to the top half of the image for example. Rings a bell for anyone? I'm trying to find some info about it
Edit: I found it, it's called "Regional Prompter"
cc
who speak french here please ?
salut
comment on fait pour utiliser stable difusion stp ?
hi can i specify seed in the bot?
there is no bot parameter
-seed in the prompt?
also no outpainting here yet?
Salut, je parle Français, par contre pour utiliser SD faut savoir utiliser son terminal un minimum
How difficult would it be to prepare a large dataset like LAION but with LLAVA-generated captions? 🤔
Tried the online demo for LLAVA, it's awesome and seems to outperform BLIP
thanks!
Is there an extension in which I can make cumulative changes based on the resulting image?
Like, let's say I got a simple full body image of a man and I want to make his hands bigger. I send the pic to img2img, inpaint his hands, then write in the prompt something "bigger/larger hands"?
Few days to participate at the first AI film festival organized in France.
It s your chance to win an award and have your AI film show in a theater!
https://x.com/aiffmontpellier/status/1724672577480622232?s=46&t=uB2SUao7bmOQCqJZgl18LA
HEEEEEELP please ;(;(
No module 'xformers'. Proceeding without it.
Warning: caught exception 'Torch not compiled with CUDA enabled', memory monitor disabled
2023-11-16 11:45:05,216 - ControlNet - INFO - ControlNet v1.1.417
ControlNet preprocessor location: /Users/stable-diffusion-webui/extensions/sd-webui-controlnet/annotator/downloads
2023-11-16 11:45:05,276 - ControlNet - INFO - ControlNet v1.1.417
Loading weights [6ce0161689] from /Users/stable-diffusion-webui/models/Stable-diffusion/v1-5-pruned-emaonly.safetensors
Deforum ControlNet support: enabled
Creating model from config: /Users/stable-diffusion-webui/configs/v1-inference.yaml
Running on local URL:
To create a public link, set share=True in launch().
Startup time: 5.0s (prepare environment: 0.2s, import torch: 1.7s, import gradio: 0.5s, setup paths: 0.5s, initialize shared: 0.1s, other imports: 0.6s, load scripts: 0.5s, create ui: 0.5s, gradio launch: 0.2s).
Loading VAE weights specified in settings: /Users/stable-diffusion-webui/models/VAE/vae-ft-ema-560000-ema-pruned.ckpt
Applying attention optimization: sub-quadratic... done.
Model loaded in 8.0s (load weights from disk: 0.5s, create model: 0.7s, apply weights to model: 5.3s, apply half(): 0.4s, load VAE: 0.8s, calculate empty prompt: 0.3s).
Restoring base VAE
Applying attention optimization: sub-quadratic... done.
VAE weights loaded.
Restarting UI...
Closing server running on port: 7860
2023-11-16 11:45:31,729 - ControlNet - INFO - ControlNet v1.1.417
/Users/stable-diffusion-webui/extensions/deforum-for-automatic1111-webui/scripts/deforum_helpers/ui_right.py:38: GradioDeprecationWarning: The style method is deprecated. Please set these arguments in the constructor instead.
with gr.Row(elem_id='deforum_progress_row').style(equal_height=False, variant='compact'):
Running on local URL:
To create a public link, set share=True in launch().
Startup time: 0.7s (load scripts: 0.3s, create ui: 0.4s).
0%| | 0/20 [00:00<?, ?it/s]2023-11-16 11:45:57.324 Python[10468:375542] Error = Error Domain=com.apple.appleneuralengine Code=6 "createProgramInstanceForModel:modelToken:qos:isPreCompiled:enablePowerSaving:skipPreparePhase:statsMask:memoryPoolID:enableLateLatch:modelIdentityStr:owningPid:cacheUrlIdentifier:aotCacheUrlIdentifier:error:: Program load failure (0x5)" UserInfo={NSLocalizedDescription=createProgramInstanceForModel:modelToken:qos:isPreCompiled:enablePowerSaving:skipPreparePhase:statsMask:memoryPoolID:enableLateLatch:modelIdentityStr:owningPid:cacheUrlIdentifier:aotCacheUrlIdentifier:error:: Program load failure (0x5)}
2023-11-16 11:45:57.325 Python[10468:375542] ANE load failed!
2023-11-16 11:45:57.325 Python[10468:375542] Issues were found in compilation, falling back to GPU
Trouble with SD from the start. Initially, faced NAN errors, and now nothing is happening in the program. Need assistance 
Hey what's your GPU and what's inside your webui-user.bat ?
Pls answer in #🤝|tech-support
ok merci mais c bon j'ai trouvé sur tiktok comment faire
post it also in #1092446741984444416 Thibaud!
hi, what's the last version of SD and SDXL and where do i download them?
i have 1.5 but nothing more
Anyone know of a discord that talks specifically about LoRA training? They're all so scattered
I'm finding a lot of success with training sdxl but I'm doing a project with sd1.5 and they are turning out horribly
hey guys quick question
for SDXL training
what model do I use? what's the best one? because there are different ones out there
could you send me hugging face link please?
Any news about SD3?
can anyone recommend me some free windows software that will allow me to realtime(no delay) preview still jog as a video? I used to have a software like this but can't remember the name
i installed a model and try to make a picture but it seems like no matter what model i use stable diffusion uses its own model and the output is bad, can somene help me to solve this thank you.
Hey guys, how do I add an additional conditioning to a stable diffusion model? Has anyone does it before whose script I can refer to?
hey all, I am downloading control net 1.1 and came up with two pages to download models from, one has 14 models and the other has models for sdxl. i am using sd 1.6. which page do I use?
SD 1.6? You are probably mistaken there. Do you mean 1.5? that's the one with controlnets (and sdxl)
stable-diffusion-webui-1.6.0 this is what it says
oh the webUI okok, thought you were referring to the models
the webUI is one thing and the SD models another
yeah the model are 1.5 and sdxl
For SDXL I'd suggest the Control-LoRAs https://huggingface.co/stabilityai/control-lora
for 1.5 there are quite a few, depending on what you are doing (there's ones specific for anime for instance)
where does the images i make save to?
it says it autosaves them but idk where to look
guys, if i want to train model using kohya dreambooth, I should save it as diffusers, safetensors, or ckpt?
i want to later convert to lora
when I uninstall SD do the images stay?
NotImplementedError: Cannot copy out of meta tensor; no data! anyone?
anyone can help, knowing kohya?
Hey
Hey guys, currently I'm using stability.ai rest API, so what I want to achieve is, I put image, and regenerate that image with new aspect ratio, is it possible? anyone can help?
outputs in your wbui folder
hello
Hey fam! I'm new in this. Dumb question, how safe is to install SDxl locally in my pc?
Good morning, everyone! How are we all today?
If you are interested in safe and easy way to install, I would recommend that you check out #🐝|swarm-ui as it is super easy to install! If you have any troubles, you can ask about installation there, or #🤝|tech-support
I mean, there is no risk about any malware or malicious code that could be hidden in this installations?
If you want to ensure safety, I would use official repos! https://github.com/Stability-AI/StableSwarmUI Installation instructions are on this page. (You can also check the pinned comments as well.)
If you want to learn more about SDXL, you can also use any of the #1100170312106127410 channels. (Pins have the info.) I use #🐝|swarm-ui every day, almost, lol. I love it!
Heya ^-^
Im new to this discord. I have a local Web UI Stable Diffusion and since I started there is hardly a day where I not engage with it ^-^
It's super fun, isn't it? Haha, lots to learn, lots to explore, lots to do! What do you like the most about it?
I like that it tickles both my tech brain and my art brain. And that even though I have a terribly old graphics card I can still use SD ^-^
And obviously I love the images that it generates
Oh and since then I helped 4 people install SD on their own devices! Since once its set up its surprisingly easy to use I feel at ease helping my friends set it up
Haha, that is what I love about SD! You can meet in the middle right there!
I also love the part of helping people---that's what's really fun about SD as well--having a community of like minded people, as well as being able to help family and friends enjoy this new technology.
What kind of art do you like to do?
does stable animation allow to get gifs from absolutely nothing except prompts and a base image? basically img2gif
Furry art ^-^
I love being able to get furries into settings that are currently unusual for them (Cyberpunk for example), and also trying out a lot of poses and clothes you dont see often
damn your messages feels like chatgpt answers xD
I mean thats a good question since you can do so much with SD :O
bruh animate diff is so slow though
it takes several minutes to generate some 3-4s 24fps animations in low res
even 4K upscales are faster
I figured from your icon! I do the same thing, hahaha! Cyberpunk is really fun. I also really love doing furries as well. If you haven't seen my guide, you might find some useful stuff here: https://docs.google.com/document/d/1BxdWqfBJ3QPggHnBCBx3QIkUpHnBWShd_s3zEt2dTLM/edit#heading=h.po8dv7lcq48s
You might find more answers, etc, in #🎥|animation
Whats your favourite checkpoint?
i personnaly prefer hybrids (cute girls with cat/wolf/fox ears)
I also have my animation+Blender playlist that I've been working on as well here: https://youtube.com/playlist?list=PLbJAuQIEql3YuacjyEedREPnwgljH7CST&si=_CijBmZPmVB8uIhy
no need to send so much stuff xD
i only asked to see if stable animation was as good as animate diff
i just last night installed Comfy (after using A1111 for a couple weeks) and in the process of doing some research heard about Swarm for the first time. now I'm wondering if i should try out Swarm instead.
you're asking a question that most people can't answer lmao
i have less than 20 models but i cannot choose
even tho cuteyukimix and akkaimixV2 are in the top
I love the indigo furry mix v 90 :O
Its super versatile and has supreme anatomy and clothing skills. And weirdly enough furry models are already really well with backgrounds
anyone knows where i can put a bunch of my best generations?
I wondered the same :D
It's very easy to use--if you have any questions, too, you can always ask if there's issues.
As for me, I don't have one in particular! I like to use a lot of different things, and I'm always trying out new ways to use SDXL!
You can use #✨|sdxl for showing off your work, of course!
There's also #🌠|show-and-tell
Idk if its SDXL tho :O
I am still figuring out where the difference even is
Between it and...?
What about stuff generated without sdxl ?(so with models based on 1.5, such as cuteYukiMix or meinamix)
There are many different versions of SD that have come out. If you want to learn more, you can check out the #1080946152318443610 section, as it has more information on that. You can also check out my guides/info here as well: https://atypicalconsortium.carrd.co/
You can always use #🏞|general-with-images
sdxl basically is more stable with large images such as 1024x1024 (the basic SD often get bad quality pictures in large resolutions)
i don't want to spam 50 images lmao
We also have our Dreamer communities, which you may view #1073085702927024128
Isnt that what hires fix is for?
yeah
BUT sdxl can generate in large resolution without hires fix
well i'm just dumb lmao, i can simply post in #🍥|anime
But I love hires fix :O
It fixes faces and fills in more background details
Is there a fitting channel for furry art?
the main thing i am trying to figure out is what UI will best let me develop and tweak repeatable workflows. i've been using A1111 and it's been incredibly fun, but it's really frustrating to lose my progress between sessions.
i know i can drag the image in, but then i need to go back and format my prompts, set a bunch of settings that aren't in the image data, etc.
Theres that blue-ish arrow that you can use to retrieve promts and settings
yeah, but often i will want to go back and forth between several gens that happened in different periods of the previous night's work
so i want to be able to save point-in-time versions of my workflow to iterate on
As soon as you paste the image into the prompt and hit that blue-ish arrow it should also set the settings
oh interesting. i thought that only loaded the most recent image. you're saying if i drag an image into the prompt then hit the arrow it'll load other settings from the image like adetailer/etc?
most of the time i don't use it, because i can't see the image before hires fix is complete
I generate without it, and if it's good, i put the image in an upscaler
Yesh. And you can also share your settings and prompts with friends that way ^-^
that's very cool, thank you for the information!
Thats also my workflow since my graphics card is sloooow. And I dont see the issue ^-^
Your welcome. Its kind of a hidden feature isnt it :D
well the issue is that i don't want to waste time upscaling bad images. I check if the image is good at first, and then i upscale to 4K only if it's good
I upscale it with hires fix. Although not to 4K
the other thing i was running into with A1111 is that i was having problems sometimes replicating an image even with exact settings. i did a little research and it seemed like it might be a bug that can creep in when you're switching models and reuse a previously cached version
And also check your embeddings and seeds. Sometimes those can be different ^^'
embeddings have seeds? hmm..
Nope. I just meant that while replicating an image I didnt notice the seed was set to something other than the seed I wanted
oh the seed was definitely correct, i was double-checking that.
i saw a github issue that talked about the problem, but didn't dig into it too deeply yet
i was getting frustrated and decided maybe it was a good time to try out SDXL since i'd only used SD1.5 so far
Are you using Euler A or any other 'A' sampler?
which led me to Comfy because SDXL on A1111 was juuuuust a smidge going over my VRAM (3070 8GB) and i'd seen that Comfy was more efficient (proven since, i can fo SDXL gens on Comfy as quick as 1.5 on A1111)
There is #1071935886800982108 among others!
pretty much alw2ays DPM++ 2M Karras
That's why you want ComfyUI, etc! Which #🐝|swarm-ui has--you can load workflows from other people, save your own, etc. People often share them in #✨|sdxl so I really do recommend checking that out, cuz you can literally just switch between whatever workflows you want.
okay yeah that's what i was thinking, thank you!
Do you really need preview?
I disabled it completely to make gen faster
There's literally so many effecient workflows that people have made that it's super super easy to just...find what you need.
i just installed Searge last night and have been messing with it a little bit. i did not know about the new prompting styles for SDXL and it seems like it may help me with the styling project i'm pursuing
We also have stages on them, so: https://youtu.be/zxjGLa960_I?si=-JEeyX8gRD97MRCI If you want to learn how to use ComfyUI in general, there's already tutorials here.
I wish we had like a library of most basic workflows we can adjust...tutorials are good , but kinda takes too much time when all you need is just to take one little thing
For sure~! I would check out the prompting guide that I posted earlier if you want to learn more about very specific things.
I'll be updating it when I have time, but there's enough info for you (and others) to get started!
oh this is great, thank you. already learned something (double-click to get a node search box)
We actually do have the prompt book! #📣|announcements message
As well, and there is a ton of information here as well to get you started--there are tons of resources available to you to learn!
oof i am just skating below my VRAM with basic gens on SDXL. i think my xmas present to myself is gonna have to be a new card
I uploaded some of my best ofs on the character channel ^-^
How do you do that :D
Since I have an old graphics card I figured out how to generate high res images while using as little ressources as possible ^-^
this just doing upscaling in post or do you have a secret technique?
Kind of. I first generate a 512 x 512 or 800 x 450 image that I like the composition and promt stuff of with as little steps as necessary (around 25). Then I pin the seed and generate it with hires fix. Upscale it by 2 and around 25 steps are enough for most images. If theres lots of background details I crank it up to around 35 hires steps
Hello. Completely new to AI generation! I've installed A1111 and this is running SD1.5... but presumably there's a way to install SDXL? Are there any guides for this?
That's not when I meant, I mean like a...
basic gen workflow
- file
img2img - file
upscale - file
e.t.c...
Ah but upscaling with anything but text2img bars you from using hires fix :O
I have no idea how most things work in comfy, and not sure if I bother, I just want to use something that already works without spending hours going through tutorials on something I'll need just once and then just save workflow and gen gen gen
gonna try this out, thank you!
Your welcome ^-^
i just last night started messing with Comfy and after booting it up and messing around with it i picked up this workflow that is set up to give you most of the stuff you'd use 90% of the time in one organized area https://youtu.be/_Qi0Dgrz1TM
has made it a lot easier to understand going from A1111 to Comfy (and SDXL)
im only trying out the move because i've got to the point in my current project where i felt like i was missing a deeper understanding of stable diffusion's behavior and wanted a better view into the guts
Apologies for jumping in here... but is Comfy better than A1111? I'm a complete newbie with this stuff, so it feels as if I'm just stumbling around in the dark. I've installed A1111 but haven't installed any models yet, so my results are pretty rubbish at the moment. I want to generate scenery/backgrounds/environments so that I can comp my 3D stuff in... but all models seem to be aimed at character generation, which I don't want/need. Can I just install a suitable/relevant model and then use a negative prompt to ensure that I don't get any characters... or is that not how this works?
Funnily enough I can recommend furry models for backgrounds that are drawn :D
For whatever reason furry model backrounds rule
i wouldn't say that one is better across the board, and also i've only been using stable diffusion for a few weeks now so i'm far from an expert so take what i say here with a grain of salt..
A1111 was great to get into everything and it's super powerful and easy to work with. what drew me to check out Comfy (and again, i've only been using it for less than 24 hours) is that the way Comfy lays things out you can see (and fiddle with) all of the elements in the pipeline.
also fully half of why i wanted to try Comfy is that on A1111 with SDXL my 8GB VRAM card takes 3-5 minutes to gen but Comfy is more efficient and it takes ~20-30 seconds, which is similar to SD1.5 on A1111
Ah... I'd be looking for photorealistic stuff in this case, but thanks for the tip!
so my impression so far, and again i'm still less than a month in, is that most models aside from the base tend to be trained for particular character looks
if what you want is more landscape, i might suggest looking for models specifically aimed at that. i have been using civitai for that: https://civitai.com/models/85137/landscape-realistic-pro
hello guys sorry to interrupt, can anyone explain again the use of the little blue arrow (is the one next to the bin?)...thanks
it does two things that i know of:
- if you have no image loaded, clicking it will load the settings from the last image you generated
- if you drag an image into the prompt field and then click the button, it'll load settings from that image (i just learned this)
the second one which could be super useful doesn't work for me i just tried....i mean are the parameters saves somewhere inside the png file???
saved
i always have a myriad of images i want to go back and rework but i completely forgotten the parameters...
my understanding is that the setting are written into the png metadata chunks. you might need a setting in A1111 to control how much info is written in. i know at one point i had to check something to for instance add adetailer prompt data
interesting,,,,i wonder where can i find more info about this setting that adds the parameters inside the png ....
i pretty much learned everything i know about A1111 from reddit
Yeah, because i want to see what i got before upscaling
I'm not going to upscale if it's Bad, it just waste time
Might be that your images are compressed somehow. I recommend fetching them from the output folder if you dont already do that
Thanks! Yes, Civitai is a great resource... though, as you say, most models are aimed at character generation. I've seen some amazing images that are just scenes without characters... buildings/cities, landscapes, you name it. So I know it can be done. I just need to figure out what/how?
i just realized for the first time there's a "png info" tab!!! but unfortunately my png s have no paramaters saved for some reasons even though in the settings is said that the parameters are saved in png chunk...also in the settings there's a
Create a text file next to every image with generation parameters
i will try that from now on
huh i feel like at least some info like the prompts, model, vae, hires fix properties should be saved in the png by default
if you don't have them super personalized, might want to try resetting your settings to defaults
Just try a few models that are built for characters. If you dont add a character it sometimes works out well enough
i confirm that automatically saving a txt file with the parameters works...
I will, cheers! Actually, presumably I could add a negative prompt, to exclude any characters?
You will have to see if thats necessary :O
What's the best way to generate a character into a background image using ControlNet Pose? I know you can just inpaint the area where the pose image is, but I'd like something that doesn't requiroe guess work as to where the character will be and not be cut off or change the backgroun too much
Are you on Comfy? You can do a pose using one image, then a depth/canny using the background image. It should mix the two together.
Could you give me an example workflow?
I don't want to change the background at all though
hmm. that's a good question. I saw Krita just got a plugin - you'd just overlay it and have the image generate. I haven't played with that too much yet, but I have a drawing tablet, so I am considering putting in the work to see what happens.
otherwise you'd make the character in openpose, then cut it out, and have img2img clean it up a little bit with the lighting, etc.
but you could also try img2img and mix in the controlnet into it, with a low setting. I feel like there should be some way to make it work.
All the latest AI & SD news and tools plus https://start.me/p/xb4Npa/ai no ads
is not a bot? hahaha
Is not this one the way to install and run SDxl? https://github.com/AUTOMATIC1111/stable-diffusion-webui
Does anyone know of a good model (Lora, checkpoint, anything) or good prompts to use to make pictures look more realistic? As in like look like a photo taken on a smartphone or cheaper camera? I want things to feel more authentic in both composition and quality
Yes :O
Am I the only one who thinks SDXL kinda sucks?
I personally think there is a need for good XL checkpoints before we can properly judge it :O
hello
Moin ^-^
German?
Yes :D
nice to now
I'm thinking because staff were altering the prompts in post on the discord bot that it was causing some issues with people trying to generate images out of the box on their own machines.
I like SDXL, but man, controlnet for 1.5 is far superior in every way.
Is there a job board somewhere? I don't see any channels to post SD related jobs
What kind of jobs are you talking about :O
Sounds like commissions to me.
is there any workflow using a1111 or comfyui that can reproduce similar effects as runwayml gen-2 (image+text)-to-video? if so, can anyone point me in the right direction?
what is sd1.6? anything about it other than it just existing on the api?
you want animatediff motion modules
and you'll need lotsa vram since it's all the frames generated in one big batch