#💬|general-chat
i got tons of xl checkpoints too
tried MeshGraphormer before, it didn't even realize that there were hands there
they might be too small idk
i post a picture in the other channel
I'm having a ton of issues getting extensions like roop, reactor, etc. to work because I can't install the Visual Studio dependencies.
If I dual boot my Win PC with Linux and use A1111, do you think the new install will resolve the problems?
I'm assuming the same extensions run on Linux and a dual boot would mean installing the necessary dependencies
I keep getting this error on my A1111, anyone know what it is or how to fix it?
RuntimeError: The size of tensor a (153) must match the size of tensor b (140) at non-singleton dimension 3
It only seems to be happening currently when I use "Custom Hi-Res Fix" on img2img inpainting
It won’t let me generate
dont bother with any of that compiling. i got a thing for you to do instead. hollup
Linux has a lot better support for all these various python libraries i think. i haven't used it for over a year though.
just my deck
My requests have failed for multiple bots. Any insight?
Is bot no longer available
bot not working?
I am not sure where to put this question, but is there a ai that can take an existing melody and chords and create a variation? I.e. add more instruments, add parts that follow after the melody, add a counter melody?
Has anyone had this error?
ModuleNotFoundError: No module named 'numpy._utils'
Do those who use amd gpus have issues with extension compatibility for webgui?
Folks, please don't shoot the newbie. I love Emad Mostaque and write about him often on LinkedIn. Can you point me to the page to generate an image? I have it all in hand on GPT-4 and DALL-E but have yet to do Stable. Thanks, peace
Thank you
Tried that what am I doing wrong ?
There are some rare extensions that rely on CUDA and won't work. (I don't know any other than TensorRT rn)
I'm using an AMD GPU and don't have extension problems, but for example you can't train models on AMD on Windows right now.
On Linux it should be possible. If you're going mainly for SD, Nvidia is the go-to. But if you just want to create some nice images in whatever resolution and primarily game on it, then AMD is perfect too.
On Linux the AMD compatibility is much better right now. (Much faster)
You can also ask me if I should test something on it (extensions etc)
They finally fixed it, i was already going crazy 
Its down right now, they are working on it
Should be back I think
ok
anyone knows a way to install sd/sdxl locally for AMD gpus?
an up to date method
Hey, sure. Check out my install guide for AMD, it's in the Pinned Messages of the #🤝|tech-support channel
alr let me see
The extensions I use typically are multidiffusion, cd tuner, animatediff (mostly just for the LCM sampler), dynamic thresholding, and adetailer
Multidiffusion, Adetailer, works as I use them too.
Dynamic thresholding should work too.
I haven't tried AnimateDiff but I can later
Don't know what cd tuner is
It allows for color controls, detail controls, etc. Accomplishes something similar to certain loras set out to do but with a bit finer control.
Do you find that functionality such as hires fix work properly on amd?
Depends on the card and operating system.
On windows you can use hires fix upscale by 2 from 512x768 or 540x960 (for FullHD)
Without problems as long the card is a 6700xt (12gb) or newer
On Linux it could be higher res.
For higher upscale you need to use SD upscale script in img2img
Looks really interesting will try that later too
are you using rocm or the directml fork? What platform and gpu?
Right now I'm using the Directml Fork on Windows with a 7900XTX (24GB)
On windows directml you get 4it/s
On Linux with Rocm you get 18-20it/s
Or if you use Shark on Win or Lin you would get 20it/s
But yea shark needs easier usability with model usage and more and better feature support
ai act text got leaked https://fxtwitter.com/echo_pbreyer/status/1749745628496748552
You can try my install guide for AMD GPUs.
Its in the Pinned Messages in the #🤝|tech-support Channel
Ok, I'll give it a try. Thanks.
Do I have to install it in the C drive or any drive on my PC?
Works with any drive
You can also install it on C but store the large models on another drive
Does anyone know why SDXL takes so much time to load a lora in a1111? also, when generating the first image using the lora it can take more than 4 minutes with a 3060 12gb, but after that it takes like 20 seconds 🤔
could be because you're storing models on a mechanical hard drive, or because you're running out of ram while loading the models
Do you use xformers?
ROCm is AMD's counterpart to CUDA. It's a compute API layer on top of the drivers, sort of like DirectX or Vulkan are what games use to talk to the GPU.
my guess is your models are on an old hard drive. Pulling 6-7gb off a platter disc takes a few minutes
Yes
The models are in a sata ssd, but maybe I should create another a1111 installation with just sdxl
i dont see why that would help at all
That won't help
Oof
I have 16gb of ram, and about 45 free space on C drive
Maybe I should remove the --medvram command, I used it when I had the 1060 3gb
i always start to sweat whenever my free space is less than 100gb
only remove medvram if your new gpu is over 8gb of vram
Windows always takes much space somehow 😭
yeah it has 12gb, its a 3060
I´ll try
nice upgrade
i always try to upgrade to double vram if possible. that one is a double double leap
I just tried loading a lora with all the tabs closed, no lora = 22.8 seconds, lora = 33.3 seconds, changing the lora weight from 1.0 to 0.6 or other value = 55.7 seconds
wtf somehow it works and uses less vram than with --medvram
you don't need medvram on a 12GB card, i have the same card and whatever you've done that makes a generation take 23 seconds, you need to adjust your workflow.
i make images in 5 seconds, 2 seconds with an LCM model.
I gotta download the LCM for sdxl, does the quality drop that much or is it not so noticeable?
that totally depends on the model
personally i don't do much with SDXL, i find the 1.5 models have been refined so much over the years, some of them match and even surpasses the SDXL models.
medvram probably turns off a whole bunch of optimisations that help grease the wheels.
i can't get consistent quality with the lcm lora. honestly have no idea what i'm doing with that thing. when i do get quality images out of it, they're sd15 quality , not sdxl
no, it splits the process to reduce the vram usage, which slows the process down.... useful if you don't have enough vram but a slowdown if you have.
seems all these model speed up releases always have huge trade offs
F
i just confirmed. there's a lot of other switches turned on when medvram turned on. its not just how it manages models in memory
then don't use the lora, get an LCM model.
i find 4 "trained" SDXL 1.0 LCM models on civit.
might as well just do trt then, but i don't, since i don't want to convert every single model i want to use.
lcm models, quality is very inconsistent too. seems good if speed is what you want. i'm not sure it benefits quality or prompt comprehension one bit though
speed is king
and honestly, sd 15 lcm and sdxl lcm are very comparable in quality so just use 15
medvram breaks the generation process into three stages to reduce overall vram usage
i've got a 4080 so i got pretty fast generations already
love the homunculus man. actually now that i think about it, weird that no one's done controlnet on him as a popular post yet
idk why some ppl set up ComfyUI workflows making 4-8 images that take minutes to make and then realize they're not even happy with the composition.
meanwhile i can make 40 images and pick a handful with compositions that i actually like and easily fix with an inpaint.
Also people essentially just recreating webgui within comfyui
i have perfected a custom 1.5 model which i don't even need negative prompts for.
and its name is 51f... 51 cuz it's my 51st attempt, and F cuz it's the 6th revision of that attempt (f being the 6th letter in the alphabet).
dang... i must have made over 300 models to get here xD
trains and merges, lol
Help me please, because I don't see where I can find preprocess images While using train

the ones that frustrate me the most are elaborate multistage workflows that do a ton of quality sampling on the first stage, just to throw it all away with an upscale and denoising stage right after it
i just download a popular base model and throw a standard negative at it
after cleaning my models directory, i still have over 500GB of models.
reasons like that why i'm not about to start on optimizations that require me to make copies of existing models
i was tired of models giving half-assed images with simple prompts, or models requiring the fancy title of being a "prompt-engineer" to make something decent.
now i have something that's perfect for me.
so, in my opinion the journey was worth it 😛
Noob question! Looking to buy a new GPU; but I can't find an exact answer. What does VRAM do? I get that more might be better, but just to confirm. My understanding is that VRAM only allows me to produce higher resolution images. For my own workflow, could I not just stick with 8 gb then upscale? Thanks for your responses!
more vram = higher res
CS1o is a helper now, I remember when he joined
long time since I came here
😐
hows things
around
where are the researchers
Vram is the memory of your Graphics Card, its needed for Gaming and Ai stuff. More vram = better.
But 8gb vram on an nvidia card can be enough for most tasks. Upscaling works easily too.
Best would be to have 12gb or more if your planning on a new gpu.
hi
what auto1111 extension have I been seeing that lets you set a lora for lora:strength:BACKGROUND (I've seen :CHARACTER, too)
I've always used Regional for that, but I've been making complicated sets of boxes.
i like that syntax. would be nice to know where it comes from
never seen it before but it makes sense
maybe its from a website service. Where did you see it before?
Not sure. Image of a girl with the demon version of herself lurking behind/above her
Maybe some kind of scheduling plugin?
I use SA and Regional, and I can kinda make SA do something close but it dies at the end
Anyone using amazon aws server ?
quick question can i train a model to generate images with the exact object images i feed the model?
theres multisubject prompting too. i like regional prompting plugin. i wanna learn about how to use its new masking mode
i know theres an extension that renders a background scene separate from the characters in it. i can't find it though. might've been multi-subject, but i remember it having specific background prompting
Did you end up trying those extensions from earlier?
Nope sry, hadn't enough time this evening
Do you also use ONNX on your directml windows setup ?
Nope. As onnx/olive doesn't support every model, Extension or loras. Its fast but its bad.
Shark webui uses the same approach as olive but works much better, much more advanced, but both need converted models.
For AMD, Directml has the best support for models and Extensions on windows rn.
How do you find vram usage?
They're probably off by now. But I can answer for them, it's bad.
If you want proper support for all models, all extensions and correct vram usage while using AMD, the best route is still to go Linux + auto1111/sdnext/etc
Any way that I know of to get full performance on AMD + Windows involves compatibility issues and other caveats.
Man I finally got a lora to start training in 1111 it's on step like 134 of 2000 just training a model of my own face to start only getting like 11 seconds per 1 iteration but I'll take it
SD launching with wrong GPU (i have two). Can I do something to "select" which GPU should use?
I think there may be a command argument to specify which device you want to use, but don't take my word for it. I remember it being a command inherited from stable diffusion itself rather than a specific front end.
Hi, Anyone has a running sample of Stable Video Diffusion?
On windows display settings>graphics settings , you can add the python.exe from your sd installation and choose the GPU that you want to use
You need to add --device-id 0 (or 1, or 2) to the COMMANDLINE_ARGS= line in webui-user.bat, depending on the GPU number shown in Task Manager
As aryetis said, on DirectML it's very bad. Using too high a resolution, too many hires steps while upscaling, or having other programs open in the background can cause an out of memory error.
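For anyone scripting their setup, here is a minimal sketch of building that line. The helper name `commandline_args_line` is made up for illustration, and it assumes the GPU number from Task Manager matches the index the backend expects, which may not hold on every system.

```python
def commandline_args_line(device_id, extra_flags=()):
    """Build the `set COMMANDLINE_ARGS=...` line for webui-user.bat.

    Hypothetical helper: it only formats the text, it does not check
    that the GPU index actually exists on the machine.
    """
    if device_id < 0:
        raise ValueError("device_id must be 0 or greater")
    flags = ["--device-id", str(device_id), *extra_flags]
    return "set COMMANDLINE_ARGS=" + " ".join(flags)


# Example: pick the second GPU and keep --medvram alongside it.
print(commandline_args_line(1, ["--medvram"]))
```

Paste the printed line into webui-user.bat in place of the existing `set COMMANDLINE_ARGS=` line, then restart the web UI.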
Do you consider amd worth it given all the tedium?
Depends: If you want to do mainly AI stuff, for YouTube, work, job, etc., or you're not too techie with PCs, go for Nvidia.
I switched from Nvidia to AMD even knowing the problems AMD had with SD before, as I don't like to pay Nvidia prices or support what they did with their GPU releases (low VRAM amounts for high prices, every half a year they release a "super" series so that you feel like you've been betrayed with a worse GPU, not doing crap for community or open standards).
If you primarily want to game on your PC, or you're a programmer/coder, tinkerer, and also want to use AI stuff with maybe some hiccups here and there, I would go with AMD. Much better prices, more VRAM, better software, best Linux support.
If you go for AMD, everything below the 12gb vram of the 6700XT is not that usable imo.
AMD is working on bringing ROCm (which is used for SD on Linux) to Windows in the next months. That could change a lot.
For me who uses AI Stuff as hobby, its totally fine even if its slower or require some tinkering on Windows. I could switch to Linux anytime if i want to. Some people also Dualboot so they can use SD on Linux, and game on Windows.
does anyone know why my adetailer eye quality is so much better when i do img2img instead of txt2img? i have the same settings on both
i cant figure it out
it's not possible to port 1.5 loras to XL right?
could use some help with this check #🏞|general-with-images message for examples
Hi, are there any quality models you would recommend? I'm mostly working on sd1 but sdxl is also accepted
Hey everyone. I'm looking for someone who speaks Portuguese and is quite good at using Stable Diffusion. Do you know anyone like this? If so, please let me know! Very much appreciated. Thank you guys
what the actual fuck
i just discovered that i can literally upscale to 16K at a 9:16 ratio without problem (it takes years but it actually works)
yeah but it's more like i was dumb as shit
before that, when trying to upscale from 4K to 8K, i got out of vram errors because i was upscaling with tiles of 3840x2160
but if i actually take a 4K image and upscale it with tiles of 540x960, it works without any problem
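A rough sketch of the tile arithmetic behind that. It assumes an 8K portrait target of 4320x7680 (9:16) and non-overlapping tiles; both are assumptions for illustration, and real tiled upscalers usually add overlap or padding, which this ignores. The function name `tile_grid` is made up.

```python
import math


def tile_grid(width, height, tile_w, tile_h):
    """Return (columns, rows, total) of non-overlapping tiles covering the image."""
    cols = math.ceil(width / tile_w)
    rows = math.ceil(height / tile_h)
    return cols, rows, cols * rows


# Assumed 8K portrait target at a 9:16 ratio.
target_w, target_h = 4320, 7680

big_tiles = tile_grid(target_w, target_h, 2160, 3840)    # 4K-sized tiles
small_tiles = tile_grid(target_w, target_h, 540, 960)    # 540x960 tiles

# How many times more pixels the sampler handles at once per big tile.
pixel_ratio = (2160 * 3840) // (540 * 960)

print(big_tiles, small_tiles, pixel_ratio)
```

Small tiles mean many more passes (hence "it takes years"), but each pass only has to fit a 540x960 tile, which is why the out of memory errors go away.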
Good morning, everyone! How are we all this lovely morning?
Hello
I can't seem to be able to use hi-res fix
is it fixable with tiling?
it's just x2 from 540x960 causes OOM on 8 G VRAM
hey, whats your GPU? and whats inside your webui-user.bat?
RX 7600
didn't have any options (which seemingly good for performance)
I'll try medvram
you need --use-directml --medvram --opt-sub-quad-attention --opt-split-attention-v1 --upcast-sampling --no-half-vae
I'm not on windows
hey everyone, do you know if there is a way to save the prompt text of each image in the filename in ComfyUI? Actually I found a module called "save image with prompt data", it saves the image but not the prompt... is it necessary to do anything else?
Hang tight, you're still dreaming! 🙂 - - It's been there for two days, how do I get rid of it?
okay, --upcast-sampling seems to have no effect, --opt-sub-quad-attention improves speed however, --medvram seems to fix OOM issue
Guys, I'm not finding a ComfyUI workflow for "content aware" fill similar to Photoshop, not inpaint, but rather to expand the image to the sides. I downloaded the "multi area conditioning" node but I still don't know how to use it. If anyone has a JSON file for it, that would help a lot
how to fix this error guys? ImportError: cannot import name '_TORCH_GREATER_EQUAL_1_12' from 'torchmetrics.utilities.imports'
can you post a full error log in #🤝|tech-support ?
hello! what are considered the best open-source image+prompt-to-video open source models out there right now?
"video stylization" to use google's term for it, from the lumiere page https://lumiere-video.github.io/
How can I do Hires. fix manually using only img2img tool?
Manually?
In img2img you can use the SD upscale script or the ultimate upscale extension
I'm just not sure about settings
it seems to completely erase detail I added previously in inpaint
is it denoising?
should I repeat the prompt or leave it empty?
You can add quality tags
And also negative tags
Lowering the denoising strength can help, or using another upscaler
Hello, I'm trying to make the background blurry, but I can't do it more than a little, my intention is to make it moderately blurry, I already eliminated words similar to "blurred" from the negative prompt, but I still don't get the result I want, what could do? thank you
Hi! I want to use stable diffusion for 2D game development. Currently I using AUTOMATIC1111 with the base model, but its obviously not very efficient. What models and extensions do you guys recommend?
refined models like juggernaut. loras for purposed styles. xformers should be enabled if you've got nvidia hardware. latent control models are worth looking into. LCM-Lora too
don't expect stable diffusion to do all the lifting. Take it into photoshop and process it out for your game afterwards.
Hi, are there any really safe-for-work tutorials that don't show inappropriate pictures of females? I don't know, but I have a feeling that people just use Stable Diffusion for the sake of this kind of thing under other excuses. Every time I say to myself I am going to master a new thing today about Stable Diffusion and this technology, I end up stopping after I realize that, oh man, it is just another video that shows how to generate an image, animation or God knows what of "females".
For example, I got interested lately in how to make animation using Stable Diffusion, like "AnimateDiff", and most of the time when I search for a video I just find videos of dancing girls.
if anyone knows safe place to learn about stable diffusion use cases without being exposed to inappropriate content, it is much appreciated .
Hi
Gigabyte GeForce RTX 4070 Ti Super 16GB Windforce OC is 1000 dollars
damn
I remember when 1000 dollars meant a full medium-end computer
No I'd rather save up for that KatVR
Or if I can figure out a way to build my own platform and only utilize their shoes
Dude, be quiet
how do i use a LoRA model?
Anyone know how to make the streaming structure better? Right now im getting no bullet points or anything until the very end of the stream. (Using gpt4 turbo api)
Hi folks,
I'm using A1111 with SD 1.5 with custom checkpoints. I have a question: If a LoRA or a checkpoint gets updated, will the model get updated automatically or do I have to download and update?
imagine buying a 4070 over a 3090
and a Ti at that lmao
Based on the bot channels, I can tell some training is going on with colorful vs matte/dulled down color images.
You have to download the new version then
Hi, does anybody have a practical method of managing lots of similar image results? For example, I have a result folder with over 9000 images but do not want to browse through and cherrypick manually based on sampler used etc.
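One way to avoid browsing by hand is to group files by the sampler recorded in their generation parameters. A minimal sketch, assuming you already have the A1111-style parameters text for each file (the kind that ends in a line like `Steps: 20, Sampler: Euler a, CFG scale: 7, ...`). The function name `group_by_sampler` and the sample data are made up, and a prompt that itself contains the text "Sampler: " would confuse this simple regex.

```python
import re
from collections import defaultdict

# Matches the value after "Sampler: " up to the next comma or line break.
SAMPLER_RE = re.compile(r"Sampler: ([^,\n]+)")


def group_by_sampler(params_by_file):
    """Map sampler name -> list of file names, using each file's parameters text."""
    groups = defaultdict(list)
    for name, params in params_by_file.items():
        match = SAMPLER_RE.search(params or "")
        sampler = match.group(1).strip() if match else "unknown"
        groups[sampler].append(name)
    return dict(groups)


# Made-up sample data for illustration.
sample = {
    "a.png": "a cat\nNegative prompt: blurry\nSteps: 20, Sampler: Euler a, CFG scale: 7",
    "b.png": "a dog\nSteps: 30, Sampler: DPM++ 2M Karras, CFG scale: 6",
    "c.png": "",
    "d.png": "a bird\nSteps: 20, Sampler: Euler a, CFG scale: 7",
}
groups = group_by_sampler(sample)
print(groups)
```

From there you could move each group into its own subfolder, or swap the regex for `Steps`, `CFG scale`, or any other field you want to sort on.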
hey, one of my friends is unable to join this server, and when i send him an invite link it says that he is blocked. can anyone help me with that?
There is an extension for A1111 that rates images by aesthetics, if you trust an AI to decide what looks good 
Just checked that, how to start the aesthetics ranker?
Just a random side note SD very often uses females because of the users, ppl who learn about certain features of SD enjoy generating females and in turn their guides would be tailor to such
If this is a use case of educational purposes like in a classroom it be easier to write your own guide on w.e the topic is to make it sfw/pg13
Otherwise if it's personal well....I mean, that can't be helped as a vast majority of humans enjoy making females
the latest feature will often be used with some pretty lady because that's what a lot of ppl like
The Little Explorer's Magical Journey《小探險家的奇幻旅程》【AI LOOKBOOK】【Children's Picture Book】
This is the first children's picture book I've tried to make. It's actually quite difficult. The difficulty lies in fixing the characters and costumes.
https://youtu.be/uu0v-sY73YU
Not sure, ive never used it myself
is there a way to save both the image, and all of the prompt and settings, in AUTOMATIC1111. ?
You can do it with the metadata
any more info ? process ?
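A1111 writes the prompt and settings into the PNG itself as a text chunk keyed `parameters` (when that saving option is left enabled), and the PNG Info tab reads it back. If you want to pull it out yourself, here is a minimal standard-library sketch. The function name `read_png_text_chunks` is made up, and it assumes a plain `tEXt` chunk, so compressed or international text chunks are skipped.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data):
    """Collect tEXt chunks (keyword -> text) from raw PNG bytes.

    Sketch only: CRCs are not verified and zTXt/iTXt chunks are ignored.
    """
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length
    return chunks
```

Usage would look something like `read_png_text_chunks(open("image.png", "rb").read()).get("parameters")`, where `image.png` is whatever file the web UI saved.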
it is if you use budget options like intel arc or an amd radeon. thats what full $1000 pcs were in the past. Nvidias at the high end and you're looking at the low tier of their high end.
alwayshasbeen.gif
AMD is poop for SD for the record.
Just as an fyi, came from there myself not long ago
was using a vega64 with ROCm for a few months when i began. linux is a better environment for amd. those windows drivers are balls
Yeah you're correct
went back to windows though when i got nvidia card. wanted to take 11 for a spin.
tabbed explorer! sold
Easily best feature
Am I allowed to post SD animations here or do they strictly belong in showcase?
Good sc reference
i post stuff like that in genwithimages often enough. theres a showcase here?
I'm super new
Umm there is an animations channel
Lord knows if anyone goes in there but I'm addicted to posting them.
And there ain't no pic posting in this channel which makes me sad
hi
Anyone know if there's an auto1111 fork of this lurking?
2024 meme potential aside, I mostly want to make pics of my kid riding a T-Rex
Maybe randomly replace people in the family group photo wall with celebs
Has anyone ever dealt with trademark stuff? Our startup is: https://pilot.io/ (we've been using this domain/brand for over 9 years, well before Microsoft co-pilot etc..) - we're talking to a trademark lawyer, and it sounds like they might advise us to change our name/brand, but I can't help but think about "Mike Rowe Soft" and how we were using it first - anyone been in a scenario like this before?
hi i put a model to run and said "model failed to run", somebody know how to make work?
What GPU? And what perf would be expected compared to achieved?
AMD 6800XT, it took me about 2 minutes to generate 1 SDXL picture on windows.
and I couldn't use comfyUI
Linux the performance is supposedly better, but still eh
If you decide to take on Microsoft's 10,000 lawyers, can you post before and after pics?
more than "take", try to make a deal and sell them the domain
I switched to a 4090 so it's not fair to make a direct comparison at all
taking it means going to court, I bet they prefer to do this privately between parties, but i'm no lawyer
i know personally of a case (friend) that took a celebrity name URL back in the early days, and they resolved it privately, sold. but those were other days
I genuinely hope you do/can
But don't write any checks against that money until you have it
has anyone ran into the issue that if you change the adetailer model in the adetailer extension, suddenly it stops working? i start getting red squares over the faces of the people it should be fixing, or black squares etc, just not working
and i dont see any error in the cmd.exe
thank you
Dear all, I am a newbie to IP-Adapter and I am confused about the following:
IP adapter, PhotoMaker and InstantID .
Can anyone tell me what is the difference between all of these ? , you will find below a link for each one of them .
https://github.com/tencent-ailab/IP-Adapter
https://github.com/TencentARC/PhotoMaker
https://github.com/InstantID/InstantID
hello, why is my permission removed to generate in bot channel?
Hi all, I have a question for everyone: in the Stable Diffusion Discord, are you also blocked from generating pictures with "You do not have enough rights to send messages on this channel"?
or how do I find the admin?
since the stabledreamer is down without an eta on coming back, is there any other similar AI you recommend for generating videos? this one was until now the only one I managed to understand lol
you can use your own GPU, if you have one with more than 4GB.
That's great, do you happen to know any video/document that explains how I can set this up?
In the videos about Stable Diffusion local hosting I saw that they didn't have the option to generate video
I'm kinda new to this so sorry if my question is stupid
you can watch some tutorials from Olivio Sarikas, Sebastian Kamph, Aitrepreneur, and/or join the Unstable Diffusion discord, they have tutorials too (not just for nsfw).
Thanks I'll do that then
I thought this was the stable diffusion discord O.o
Do I need to download the stable dreamer model or something?
Unstable is not the same, it's another project NOT related to Stable. It was meant for nsfw but now it's pretty much a universal community.
no, ppl download models from civitai.
it requires some work to get started, first time might seem tedious, it's rly not though.
I'll give it a try and see how it goes, thanks for the help
I don't have the budget to buy one. Not everyone has the budget to buy a NVIDIA GPU 4GB.
Hopefully one day I will.
Are there any other servers that are as good as StableDiffusion for image generation that you can recommend?
Hi
Kandinsky telegram bot
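Why does it say in bot channels 1-10: "You do not have permission to send messages in this channel"? It used to work, why not now? Is anyone else seeing the same thing?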
Russian bot
IP-Adapter can do more than just faceswap.
The IP-Adapter FaceID models are comparable to InstantID and Photomaker
In the Pinned Messages of #🤝|tech-support you'll find an easy install guide.
It's probably bot maintenance, the announcement says it will take a while
Still not working?
The Bots are down for maintenance:
More info here: #1047610792226340935 message
hi
hi
bot not working?
look up just 3 messages above yours #💬|general-chat message
or check the #1047610792226340935 channel
or read the last message of every single bot channel #1100170312106127410 message
We can't make it more obvious that the bot is down for now.
there's no deadline at all?
No estimated times for having the bots back sorry, this has been posted many times and available at #1047610792226340935
Any good anime models that are based on SDXL? I’m still using Anything v5 for 1.5
Try this at #🍥|anime they know their stuff!
there aren't too many anime Loras for XL
What about models?
Or are there no SDXL models, only loras?
there is various SDXL anime models
what i mean is that you cant use like tatsumaki Lora on SDXL
you have a pc with more than 4GB of memory?
Yes, 8gigs
DDR4
Thanks @terse plume
BTW, I found the place: https://huggingface.co/spaces/stabilityai/stable-diffusion
Also thanks for that as I am a python developer!
I was also about to ask that!
@fervent thunder
does someone have an invite for the Unstable Diffusion discord server?
Prompt: prompt:prompt:Vulcan, star Trek, a female Vulcan, tanned skin, elf-like vulcan ears, arched eyebrows, skinny, slender, femenine body, sensual pink sci-fi dress, in a vulcan white alien mountain, Vulcan robe, fisheye camera, fisheye effect, retro, selfie, summer, transparent dress, sexy dress, 90's, long hair, very long hair, beautiful, exotic, sunkissed
Negative: long dress, covered dress, modest dress, ugly
hello
Sup
I was downloading a repo from nocrypt and this popped up (Misleading:Linux/FRP!MTB). What is it?
What Operating System are you running?
What the hell? Unstable * laughs *
Windows
Oj
Ok
Can you tell me the repos url
maybe its this: https://github.com/AGWA/git-crypt
Can you please explain me what the repo is about for better understanding?
I am helping as a Windows 11 Home user.
Oh, I think I've got this thing figured out a little bit. I think it's just another stupid Windows nag
Thanks for your responsiveness bro
Theres a lot of bugs in windows to watch out
Thanks! Enjoy your dev journey (I guess). :)
Its windows man it happens....
If you need any further assistance then please convey me!
hi, What happened, are they getting spare parts? Will this still work?
is sd cooking or did they fall off
Hmmm… I think they released Stable Video, Sd Turbo Xl and 512, Zero123, a LLM and a Code LM if memory serves well, not quiet at all : )
because I get: you do not have permission to send messages on this channel
yes, Why do I get that?
Its down, just read the latest message in that channel
Good afternoon. I haven’t used this server for a long time, but today I discovered that I don’t have permission to write to bot-1 and other channels. I couldn't find a reason for this in the rules. Perhaps it is no longer available for free accounts? If anyone knows the reasons and how to fix it please tell me. Thank you.
Thank you very much, I only understand Spanish, I write this with Google Translate
Please read the message above yours
any recent announcement ?
Doesnt look like youve read this message then #1047610792226340935 message
Hi everyone.
Please dont be about the bot is down question 😜
Where i can ask about SD model, that can understand and transform schematical drawings into interior maps\schemas?
Nope. xD
Maybe look for checkpoints or Loras trained on those, Civitai is a good bet
Models don’t “understand”, they’ve been trained on images and you can infer new images based on that training
There must be LoRAs for technical drawings for sure
Yeah, I understand that. I used it to mean "trained".
Or you can even train yourself one
Not as far as i can see, through stability matrix. Will try to go through hugging face later.
I want to create encounter maps from simple schemas, as i cannot draw at all. -_-
Oh, darn, there is!
Yay.
There u go!
Thanks. For some reason I thought that Stability Matrix uses the same search as the site itself does when you open it in Chrome.
Second question - is there any other opensource picture generating AI aside of SD?
here's image research by one of our researchers released a few days ago https://arxiv.org/abs/2401.11605
We present the Hourglass Diffusion Transformer (HDiT), an image generative model that exhibits linear scaling with pixel count, supporting training at high-resolution (e.g. $1024 \times 1024$) directly in pixel-space. Building on the Transformer architecture, which is known to scale to billions of parameters, it bridges the gap between the effic...
isn't that the same thing as Stable Diffusion or the other sites that generate images?
there is a expert
i bet you just need to get your priorities straight, like, look at how many subscriptions you're paying for... netflix, world of warcraft, patreon?
there's a lot of ppl paying for more than they need, there's a lot of free alternatives for many things.
you don't need an Nvidia GPU, just 4GB of vram, the first 4GB GPU came in 2008 which should give you plenty of options on a wide variety of prices.
Unstable Diffusion has regenerating free currency for the AI on their website but i don't use it much so can't say anything about its quality. it's nothing fancy tho, you probably make better images here.
why ppl dont just invest in a basic ass gpu is beyond me
they rather struggle with these colabs that keep banning AI tools over and over and over and ppl keep asking "wHy caNt I gEn oNlinE"
keep bending over for Google g. Ima be here chilling multitasking while ur begging Google for credits
Well, tbh, if you want to work with decent models and don't start rendering and then going to a work, because it would take a whole day - you need decent GPU.
Like i had 6GB 1060. Let's say i was not satisfied with speed and results.
meanwhile AMD https://github.com/ROCm/ROCm/issues/2689
Yeah..
Is there a way to fix the "killed segmentation" error messages?
It errors out after generating for like 30 minutes and I've tried a completely fresh install and it still happens.
dumb question, where do I put character and training data downloaded from Civitai
you get what you pay for
nvidia vs AMD is exactly like coke vs pepsi
one is clearly superior but people choose to save a few cents by going with the other brand... then you get what you pay for lolz
cuda is the true future of AI, and with 50xx series being specialized for AI, my body is absolutely ready.
Well, yeah, in the AI sphere Nvidia is clearly superior.
How can I train a character lora? Like the technique comicsmaker.ai uses for example
you gonna have to put pictures in there
Guys, does anyone know any web app or library or API to convert images to vector for free that gives good results?
I managed to find what i meant, textual inversions
loras and textual inversions are two different things (albeit similar). Generally in the modern world you almost always want lora rather than a TI
I'm mostly just intrigued by the coke vs pepsi hot take here lol
I don't even know which of the two you're trying to imply is the clearly superior option
even for something like learning a character so it gets represented well when I reference him?
would like a guide honestly
one thing I liked with the site I mentioned is that it got pretty good results replicating the inputs
hello!
is there any benefit training SD 1.5 LoRA with 1024px images?
you'd have to shift the entire model to support 1024 first. Just use XL
where do you find loras and embeddings? i can't stand amount of nsfw and asians on civitai
Okay, I did a dumb thing! Enjoy my choices here! I was going through art styles and I got up to Pixel Art. Using this, I found an upscaler I wanted to try out. So yes, I upscaled a picture that is pixelated with the Pixel Art prompt.
I would like to know what can I use to convert an image into a Stable Diffusion prompt
good
6
Why is that dumb ^^ it can work with the right upscaler
Is there a discord server dedicated to training loras?
Or maybe training models in general
Hello everyone, I'm looking for someone who speaks Dutch please
The five stages of accepting the inevitable - denial, anger, bargaining, depression and acceptance
I pay for the work
can i make a video when i have 1 image as reference, and a sequence of open poses (like 50 or 500 open pose pngs)?
i know how to make video2video and img2video and text2video, but i dont really know how to combine one image with a series of open pose images to a video
any hints?
I put a question in #🤝|tech-support that I couldn't find a helpful solution for yet. I hope I'm right asking for help there 🙏
yeah, you posted that in the right place, hopefully someone would help you figure it out : )
thank you : ) yes, i have most of my issues sorted out of the way thanks to CS1o, thanks again
how can i write a prompt in a way that the main object is a bit further away so there's space for other elements on the image. I want to generate a human face surrounded by patterned circles but the face is always very close up and ignores the circles
seems like they are trying that too! would be interesting if they manage to do it
you can also try it, there's the code and nodes, you can run it over comfyUI
where do you see any code there?
in my experiments, it works, sometimes it fails too, depending on your image and openposes you can get good results
ahh thanks!
have fun!
it's interesting, this is using a "pose guider" and no controlnet with openpose, also there is an "animateanyone" sampler, and no ksampler
uhmm my first try failed spectacularly lol its just a garbled mess
don't know what's under the hood on the nodes since I'm no developer but you plug your openpose animation in the loadvideo and the pose guide will work it and make latents for it, which will guide the animation
at least its good in producing completely disturbing bodyhorror movies
i think the input image and the video/openpose sequence have to match exactly
Where to check for docs on pipe() method?
In this code I want to see what are the parameters that I can provide to the pipe(), such as height, width and others?
import random
import sys
import torch
from diffusers import AutoPipelineForText2Image

pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/sdxl-turbo",
    torch_dtype=torch.float16,
    use_safetensors=True,
    variant="fp16",
)
pipe = pipe.to("cuda")
prompt = "a photo of Pikachu fine dining with a view to the Eiffel Tower"
seed = random.randint(0, sys.maxsize)
num_inference_steps = 4
# This pipe(??args)
images = pipe(
    prompt=prompt,
    guidance_scale=0.0,
    num_inference_steps=num_inference_steps,
    generator=torch.Generator("cuda").manual_seed(seed),
).images
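One way to answer the question above without leaving Python is to ask the object itself. In diffusers, calling `pipe(...)` runs the pipeline's `__call__` method, so its signature and docstring list the accepted arguments (for the SDXL pipelines these include `height` and `width`). A minimal sketch using only the standard library's `inspect` module; `DummyPipe` is a hypothetical stand-in so the snippet runs without downloading a model:

```python
import inspect


def call_parameters(obj):
    """Return {name: default} for the named arguments accepted by obj(...)."""
    sig = inspect.signature(obj.__call__)
    return {
        name: param.default
        for name, param in sig.parameters.items()
        if param.kind is not inspect.Parameter.VAR_KEYWORD
    }


class DummyPipe:
    """Hypothetical stand-in for a diffusers pipeline (not the real class)."""

    def __call__(self, prompt=None, height=None, width=None,
                 num_inference_steps=50, guidance_scale=5.0):
        return None


if __name__ == "__main__":
    for name, default in call_parameters(DummyPipe()).items():
        print(f"{name} = {default!r}")
```

With a real pipeline, `call_parameters(pipe)` or simply `help(pipe.__call__)` should show the same information; the full descriptions are on the pipeline's API page in the diffusers documentation.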
I've been working on a little web app to help cut down on copy-pasting when writing prompts by taking a "card-based" approach to prompt building:
I would love it if people could give it a try and provide some feedback 🙂
Good morning, everyone! How are we all today?
Please see the #1047610792226340935 for more info on that!
Hey, hoping I can get some help with a fresh automatic1111 install on ubuntu 22.04 (virtual machine, gpu passthrough). Automatic1111 installed fine, loads fine. Now I'm trying to set up xformers and local network access. I've added the line: export COMMANDLINE_ARGS="--listen --xformers --reinstall-xformers" to the file webui-user.sh, but it doesn't seem to have any impact. I'm not new to this, I had to bang my head against the wall a long time to get it working before and I don't want to do that again. GPU is 3060, is recognized by other things in the VM. Any help would be appreciated.
Oh, jesus.. I ran the webui.sh inside the stable-diffusion-webui directory instead of the one the install instructions say to run that's outside that directory and it's now installing. That's an awful oversight and I'm sure I'm not the only one running into that mismatch. Maybe someone will read this and fix that. Almost surely not.
hello
If anyone has a spare second to help in #🤝|tech-support 🙏 trying to reproduce an image but can't
Hey I am new to stable diffusion and I am looking for some help. I am having issues reproducing the style and quality of an image even though I have the exact prompt that was used to generate it (at least according to the creator of the image). I am using Draw Things, and the particular image I am trying to reproduce is https://civitai.com/images/1742145.
is rocm fully working now on windows?
are you using everything down to the seed, embeddings, clip skip, etc? you should get something similar
I don't know how to set clip skip on Draw Things.
Nor embeddings
It only lets me set seed, steps, cfg scale
of course I am using the exact model it was generated with. It said nothing about LORA.
he's using specific embeddings and a lengthy negative prompt, I think if you have all that, it should generate something really close
Sorry what is an embedding?
clip skip 3 is a must
and the embeddings seem to be easynegative, ng_deepnegative
also, I don't think they're using Draw Things, this seems either ComfyUI or Auto1111
Alright, maybe that is the issue.
By the way thanks for the detailed explanation.
no probs, there's a lot of moving parts and you have to match them close in order to get similar results
By the way, how did you figure out what embedding it was using?
it's on the negative prompt: extra digits, bad eye, EasyNegativeV2, ng_deepnegative_v1_75t, NSFW, nudity
Oh i see.
Are those embeddings just automatically included in the checkpoint model? I am guessing that is why he had to remove them with negative prompting right?
those are also files, like LoRAs and checkpoints, you need to have those at inference time> https://civitai.com/search/models?sortBy=models_v5&query=embeddings
that's one, you can search for the other one: https://civitai.com/models/100191/ti-easynegativev2-textual-inversion-embedding
they are textual inversions, another training technique
I have a quick question I hope someone can answer. I have very little SD experience because of my crap gpu, I'm looking at a 4070ti with 16GB vram. Question is: is there anything that a 4090 with 24GB vram can do that the 4070ti could not in terms of resolution? I assume if I were to ever max out 16GB vram (if that's possible?) it would start using my regular ram.
4070ti should be fine for most applications
I assume I might see issues when training neural networks aside from SD
resolution is not only limited by vram, also by the models, a 1.5 model can't go too far on its own, etc... you need to upscale, tile, etc
training is where vram gets really taxed, training sdxl checkpoints or LoRAs, depending on how many images you use, will eat a lot of vram, sometimes even maxxing out a 4090
if you don't plan on training heavy sets.. a 4070ti 16 GB is fine
the new 4070ti super too, it's basically a 4080
oh sorry yea I meant 4070ti super
okay that's the confidence boost I was after, thanks for the input I appreciate that
no probs! that's a really solid card
oh you will have fun!
nope, pytorch needs support for it
quick simple question I reinstalled comfyui and my yaml file for extra models path did not have .example and the type file will not change over to show YAML. How do I fix this?
`#Rename this to extra_model_paths.yaml and ComfyUI will load it
#config for a1111 ui
#all you have to do is change the base_path to where yours is installed
a111:
    base_path: path/to/stable-diffusion-webui/
    checkpoints: models/Stable-diffusion
    configs: models/Stable-diffusion
    vae: models/VAE
    loras: |
        models/Lora
        models/LyCORIS
    upscale_models: |
        models/ESRGAN
        models/RealESRGAN
        models/SwinIR
    embeddings: embeddings
    hypernetworks: models/hypernetworks
    controlnet: models/ControlNet
#config for comfyui
#your base path should be either an existing comfy install or a central folder where you store all of your models, loras, etc.
#comfyui:
#    base_path: path/to/comfyui/
#    checkpoints: models/checkpoints/
#    clip: models/clip/
#    clip_vision: models/clip_vision/
#    configs: models/configs/
#    controlnet: models/controlnet/
#    embeddings: models/embeddings/
#    loras: models/loras/
#    upscale_models: models/upscale_models/
#    vae: models/vae/`
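A likely cause, assuming this is Windows: Explorer hides known file extensions by default, so the file may still really be named `extra_model_paths.yaml.example` (or `extra_model_paths.yaml.txt` after saving from Notepad) even though the visible name looks right. A hedged sketch that does the rename by full name from Python; `comfy_dir` is a placeholder you would point at your own ComfyUI folder:

```python
from pathlib import Path


def activate_extra_model_paths(comfy_dir):
    """Rename the example config so ComfyUI picks it up; return the final path."""
    comfy_dir = Path(comfy_dir)
    target = comfy_dir / "extra_model_paths.yaml"
    if target.exists():
        return target  # already named correctly, nothing to do
    # Names the file may be hiding under when extensions are not shown.
    for candidate in ("extra_model_paths.yaml.example",
                      "extra_model_paths.yaml.txt"):
        source = comfy_dir / candidate
        if source.exists():
            source.rename(target)
            return target
    raise FileNotFoundError(f"no extra_model_paths file found in {comfy_dir}")
```

Turning on "File name extensions" in Explorer's View options and renaming by hand achieves the same thing.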
Dont buy 40xx unless its a 4090
Otherwise buy a 3090. Preferably pre-owned
Vram is everything in the world of AI
not necessarily true, remember that we're moving to fp8 and even fp4. Not everyone has budget for 24GB vram, he's fine with 16 GB for most things beside training heavy sets or heavy animation (lots of frames)
A 4070 is more expensive than a 3090, for less vram, and small performance increase
Ideally, buying a cheaper 3090 and saving up for a 5090 is the best plan right now
Otherwise , a jump from 4070 to 5090 is less cost effective than 3090 to 5090
power consumption is a bit of a concern here, 3090 is closer to breaker tripping 😆
with fp8 it will halve... 16Gb will be the new 32 GB
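The "fp8 will halve it" point is just bytes-per-weight arithmetic. A rough sketch that counts model weights only (activations, the VAE, text encoders and framework overhead are ignored); the 2.6 billion figure is an illustrative parameter count, roughly the size usually quoted for the SDXL UNet, not a measurement:

```python
# Bytes needed to store one weight at each precision.
BYTES_PER_WEIGHT = {"fp32": 4, "fp16": 2, "fp8": 1, "fp4": 0.5}


def weight_memory_gib(num_params, dtype):
    """Memory needed to hold the weights alone, in GiB."""
    return num_params * BYTES_PER_WEIGHT[dtype] / 2**30


if __name__ == "__main__":
    params = 2_600_000_000  # illustrative, assumed parameter count
    for dtype in ("fp32", "fp16", "fp8", "fp4"):
        print(f"{dtype}: {weight_memory_gib(params, dtype):.2f} GiB")
```

Each step down in precision halves the weight footprint, which is why a 16 GB card at fp8 holds about what a 32 GB card holds at fp16, for weights at least.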
There are always efficiency updates, but there is always also more powerful technology coming out
Just wait until SD 3D comes out
Then you'll be begging Bill Gates for an extra gig of vram
I want to wait for the 5000 series, but I've already been waiting since the 3000 series
there will always be incoming a new generation
yea it never ends, gotta pull a trigger at some point
at least I can get a card now post covid/scalper shit
post crypto
the 4070ti super you mentioned is a cheaper 4080 with almost the same performance, it's a good bet
it's frowned upon for what it's offering at it's price point, but it really is perfectly aimed at me for how old my card is and how badly I need it right now
of course there will be 50xx cards, and then 60xx, you just need to balance what you need right now and give yourself extra room to be future proof
this is also a good plan, and I'll def consider it
nervously looks at 80A breaker panel
if you can get your hands on a cheap 3090 it's a great card but as said before, not everyone needs 24GB ram
I was perfectly fine with a 3080ti 12GB until I got into training and running longer animations, that is where you'll make use of the extra ram
and never underestimate the power of good developers nailing down efficiency, when SAI released Stable Video, we needed 40GB cards, in about 3-4 days they got it down to 12GB
and it's happening on every new model
I've come to terms with the fact that I might outgrow 16gb vram, but I think I'm willing to take a little financial hit to upgrade some day.
Also take in account if ur a multitasker
I can game while training a model no sweat bc of the extra vram
haha that's funny
If power consumption is an issue you can always under volt
that's right
my rx570 pulls 150w max right now, couple hundred more watts is ok but I can't run the coffee maker if a 3000/4000 series is full load 😧
the 4070ti super is also great at consumption, I think less than 300Watts if I remember correctly the launch data
less than a 4080 with same perf.
I have extension cords running from other rooms into the living room just to sustain this nonsense in some arterial fashion
can't upgrade panel or run new circuits cause no certified electrician would touch the wiring without a full $$ upgrade
285 W
3090 is around 450W
It was funny reading all the 4070tiS reviews cause almost everyone shit all over it, but it's actually a smart move for a lot of ppl.. I think jayztwocents touched on that
well it's funny, just the name 🤣 4070 ti super
and the vendors will add more shit: extreme OC
ti super master extreme oc aorus elite limited edition
not 4070TISMEOCAELE ?
rolls off the tongue
intel cpus suffered a similar naming fate
Hello
is there a smarter way to check/see all the a1111 themes? enabling one and restarting everything seems quite the chore
im torn just like everyone else but i was thinking upgrading my 2080 TI to 4080 Super, but ide really like to know timeframes for 50 series....
I think all nvidia has said is maybe this year
from what I read it's a card they're keeping in their deck until it's a smart time to play it
in other words, milking consumers for the maximum amount of money until then
but also watching what amd puts on the table
it's like a trillion dollar game of crazy eights
Hello everybody! Can someone help me install Automatic1111 and Comfy UI locally on my machine? I would greatly appreciate it!
i would not upgrade from a 2080 lmao
Those 2 are supposed to be the best right?
how do you not cry when you see the price for a 4080...
like 10 years ago you could buy a really good computer with that
inflation
still getting paid the same
Guys, anyone can help me?
theres a million tuts out there on how to install those
wonder how the hell I can get OpenVoice to work on Windows
seems like it's either pay for their service or use Linux
I had a bad experience couple years ago following a tutorial, I need someone to guide me
or use applio
or supertortoise
try this also at #🤝|tech-support those guys rock and can maybe help you get started or point you to documentation on how to do it
or literally the storm of audio apps that have been released lately
ok thank you
the instruction on git are very simple
the only thing most people dont know is to edit the batch file with notepad
OpenVoice is the better one
on gitpage the first line is Make sure the required dependencies are met, and I'm not sure what's that even
are you talking about something other than OpenVoice? I would assume so
yus
until when will this not work? or will it no longer work?
I want something that can replicate voices at high quality, near realism. That can do T2S or V2V
yeah I open a repo and see .ipynb, thats an instant nope and move on
maybe I'll just pay to use a service instead of fucking around
nah dont feed the machine
at least then I'll have an interface etc
applio. mangio-rvc fork. audio-webui
all of those are local and work pretty good, although I havent tried applio
if you have samples that show how good they are at cloning voices
I do
would take a lot to convince me
want to hear my widowmaker clone voice
I dont know what widowmaker is
thts a problem then
just take a celebrity and clone them
if u got the samples
hey, until when will this not work? or will it no longer work?
I'm also in too much pain atm to navigate into each and every github
and investigate it
helpme
ima share you my true-tested audio apps gimme a mo
so many of the voices that are recorded by randoms and are free aren't made in a professional setting so they have background sounds, heavy room noise, reverberation etc
making them horrible to clone
why would you want to clone with someone else's samples
the whole point is to train ur own models with ur own samples
or u can download premade models
Because I need placeholder voices, so I need to try and mix two or more voices together and get that into the clone
and create a new voice
this is why Im saying that the paid versions probably are much better because they allow these things, they also let you use their super computers to generate and train
or just use coqui or elevenlabs for that
no point in downloading a trainer repo if u just want generic voices
I don't want generic voices, I want to clone voices
like resemble.ai
then there's descript
there's quite a few and until there's a relatively intuitive installation of an opensource AI Voice app with T2S and S2S, with an interface and all...kinda like how easy it is to get Automatic or comfy etc to work
supertortoise G
super tortoise
i was looking for the link but am multitasking like a lil bih
just gimme amo
works out of the box
he calls it ai-voice-cloning, I call it super tortoise cuz its a fork of tortoise on steroids
the only ish is that it doesn't produce .index files (afaik)
but might be just what ur looking for
never support cloud services. power to the people
Does anyone know how to connect newreality to fooocus
Or how to use a LORA in general
LoRAs extend models, they are trained on smaller datasets, depending on where you work, you load them different
like auto1111 or comfyUI etc
Fooocus
I use Fooocus, and I downloaded a LoRA.
Best stable diffusion for AMD gpu?
i haven't done much Fooocus but I think you have an advanced tab or window, where you load the models, you have LoRAs there
Best performance on Linux:
Automatic1111 on Linux (best usage and fast)
Comfyui on Linux (also good)
On Windows:
Auto1111 Directml (best support, but slow)
Comfyui with directml (also good)
Shark webui (the fastest for AMD cards, but lacks features/usability)
Yeah dude that also depends on if my computer is fast enough to handle the workload I require and if the thing has stuff like formant, pitch band, hertz etc
Where would I install auto 111 direct ml
for that checkout my install guide in the pinned message of the #🤝|tech-support channel
Hey guys, I'm trying to learn fooocus AI, could someone tell me if it is even possible to take an image of a dress and inpaint it into another image of a woman?
I have no idea but I tend to run all my questions through the youtube or google search bars, try "inpainting Fooocus" on YT : )
Thanks
not really, most projects use cuda toolkit 11.8 to keep compatibilty
I was thinking that too. thanks
I getting my comfy and stable back from overdue hiatus
what is your main/fav driver Comfy or sd?
its still automatic1111
easy to use, good extension support
I have a video clip that I would LOVE to remake using stable diffusion if it can. Everything would stay the same in the video, except for switching out the person, including the clothes they are wearing. They would do the same action as in the original video, keeping the object the person is holding (a pin being clipped) and also keeping the sound effect.
mj community is so toxic imagine being ganged on because you won every live prompt battle competition 
this is why SD is always 100% better
@warm junco followed all of your instructions. now how do i get it to work with a lora i downloaded?
Download the lora file. Place it into the models/lora folder.
In txt2img select the lora tab. Then click on the lora to add it to the prompt
Make sure to use a model that is based on the same SD version.
1.5 models will work with 1.5 loras,
okay
You can also adjust the lora strength by changing the number behind the lora name in the prompt
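In A1111, clicking a LoRA in the tab inserts a tag of the form `<lora:filename:weight>` into the prompt, and the number at the end is the strength mentioned above. A small sketch of building that tag by hand; the LoRA name `myCharacter` is made up for illustration:

```python
def lora_tag(name, weight=1.0):
    """Build an A1111-style LoRA prompt tag, e.g. <lora:name:0.8>."""
    return f"<lora:{name}:{weight:g}>"


def add_lora(prompt, name, weight=1.0):
    """Append a LoRA tag to an existing prompt."""
    return f"{prompt}, {lora_tag(name, weight)}"


if __name__ == "__main__":
    print(add_lora("portrait photo of a knight", "myCharacter", 0.8))
```

The name is the LoRA's file name without the extension; lowering the weight makes the LoRA's effect weaker.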
with A1111 how do i know what version of SD its using?
or what are most people using now days?
how to make anime ai images? any good models that could get u the results similar to novel ai?
Hey everyone! Just wanted to let you know I'm donating a lot of images to AI research/use--you can find out more about it HERE in this document: https://docs.google.com/document/d/1n7c2M6FNMUR5oxb2-f59lFyKXnko_byDIHg_tOiUGFI/edit?usp=sharing
I'm headed to bed, since it'll take forever to upload everything--but will update, hopefully, throughout the week. ☁️ 🌙 Good night, friends, and enjoy!
Im using invoke ai. It says its a standalone, so does that mean I can remove stable diffusion?
And then how do I get a SDXL LoRA to work properly with it?
i see your point
Why cant i use the bots
Hi, i am trying to install stable diffusion locally, but i keep getting error code "ERROR: Could not find a version that satisfies the requirement torch==2.0.1 (from versions: none)
ERROR: No matching distribution found for torch==2.0.1" anyone got any idea how to fix this?
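A common cause of "from versions: none" is that pip found no torch wheel for the Python it is running under, for example a too-new or 32-bit Python. The sketch below checks for the setup the A1111 README recommends (64-bit Python 3.10.x); treating exactly 3.10 as the only "good" version is a simplifying assumption for illustration, not a statement about which wheels exist:

```python
import struct
import sys


def python_looks_compatible(version=None, bits=None):
    """True if this looks like 64-bit Python 3.10 (assumed-safe setup)."""
    if version is None:
        version = sys.version_info[:2]
    if bits is None:
        bits = struct.calcsize("P") * 8  # pointer size: 32 or 64
    return tuple(version) == (3, 10) and bits == 64


if __name__ == "__main__":
    print(sys.version)
    print("looks compatible:", python_looks_compatible())
```

If this prints `False`, installing Python 3.10 and deleting the `venv` folder so the web UI rebuilds it is the usual next thing to try.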
Anyone good with loras....know how or suggest how magazine cover LORAs are trained to get the magazine cover style and even the text style?
Like this one: https://civitai.com/models/78559/mad-magazine-cover-1980s-style
This channel used to be jam packed, lively. Now, mostly dead.
all the sd channels are like this now - just a few comments each day
how many people know about models being used as a vector for malware?
Im unsure of sites like civit ai
as far as i understand there was/is an exploit to have .ckpt files to load executables, but the .safetensor versions cannot which is what civit has
safetensors protect from malware
what is the main difference between a model and safe tensor?
I think safetensors may be a kind of file or data format for an SD model.
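For context on the answers above: a `.ckpt` is a Python pickle, and unpickling can run arbitrary code, while a `.safetensors` file is plain data: an 8-byte little-endian length, a JSON header describing each tensor, then the raw tensor bytes. So "safetensors" is a file format, not a different kind of model. A minimal sketch that reads only the JSON header with the standard library (no tensors are loaded):

```python
import json
import struct


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file as a dict."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))  # u64, little-endian
        return json.loads(f.read(header_len).decode("utf-8"))


def tensor_names(path):
    """List tensor names, skipping the optional __metadata__ entry."""
    return [k for k in read_safetensors_header(path) if k != "__metadata__"]
```

Calling `tensor_names("model.safetensors")` (the file name is a placeholder) lists what is inside a checkpoint without executing anything from it.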
Hello, I installed Stable Diffusion yesterday with some checkpoints and loras and played around with it. I have seen that you generally get into NSFW quickly (not intended). Now I want to show my daughter the tool to create some cool images. But I wonder what I should do to minimize the possibility of getting NSFW content. As far as I understand I can use the negative prompt. But what should I use to make sure that there is no problematic stuff? Also I use some generic checkpoints like DreamShaper and Realism; which would you use for generic purposes (I thought about some images of teddy bears or spaceships etc ...)
This is an excellent question, you can do many things. First having your daughter only use a user/instance that has zero nsfw specific checkpoints and loras. The second would be preparing that instance so it has all that in the negative prompt (nsfw, nude, naked and so on). You can also load embeddings/textual inversions specific for that purpose, like> https://civitai.com/models/99890/civitai-safe-helper
Something's wrong with my lora. I try putting it in the prompt and it doesn't work.
Thank you. For my learning, how do I import embeddings/textuals? I have managed to install loras, but not embeddings yet
It’s similar to a LoRA you put them in a specific folder and refresh, usually “embeddings”
The checkpoint is based on the SD version. Most are 1.5 based models, 6gb models are sdxl ones mostly
I have tried that with that https://civitai.com/models/99890/civitai-safe-helper and put the .pt file in the embedding folder. But after restart and refresh it is not shown in the Textual Inversion tab.
For auto1111 there is the Rembg extension
I use Stable Diffusion Web UI
Good morning, everyone! How are we all today?
你


There anything special to be done to use sdxl with a1111?
when will Stable come back?
Excuse me, the bot is unavailable, so we can't use stable diffusion on discord anymore , right ?
hey everyone! yes the bot is currently offline, you can always check https://discord.com/channels/1002292111942635562/1047610792226340935 and https://discord.com/channels/1002292111942635562/1100170153829871686 for more information
Hey guys, I joined this server recently because I was curious about the current state of generative AI and Stable Diffusion.
The last time I tried out SD in a serious capacity was... I think 9-12 months ago.
Has anything changed? For example, does it still take a lot of VRAM to run models? I'm on an old NVidia card with only 6g of RAM so I could only go medium VRAM without having to close a bunch of applications... hoping some improvements have occurred in terms of memory usage.
I also saw SDXL on CivitAI. What's the situation with that, and can pre-existing LoRAs/etc work with it?
you download an sdxl model and put it into the models/stable-diffusion folder
if you have a low vram gpu you need some additional cmd args in the webui-user.bat
ask in #🤝|tech-support for any help on that
Oh I thought you meant like, use comfy ui
sdxl works in auto1111 and comfyui
jnjjjfcc
how can I create videos with automatic1111?
yes, a lot has changed in the last 10 months,
a 6gb gpu is now more usable than before.
follow my install guide in the Pinned Message of #🤝|tech-support
there are also the best performance cmd args added.
SDXL needs sdxl loras
Why are all the bots down?
hey guys its been forever since ive done SD is automatic 1111 still the one to use?
def. the most popular and more extensions, but depending on your needs, you have now a lot more options to choose
i got a 1650 super so im just tryna get anything thatll run faster than 2 min per gen
Fooocus is a lite UI with most options
Big thanks
hi
Didn't touch ai image creation for 6 months
and i see from stability matrix that there are a lot of interfaces
can someone give me a quick rundown what is the best way to use it now
Don't use Fooocus with your gpu
It will be extremely slow
Auto1111 would be the best for your GPU
For an install guide checkout the pinned messages of #🤝|tech-support
There are the performance settings covered too
oh thank u sm
apparently i have to like redownload auto1111 cause it like failed to download torch or smthn
Checkout my guide 🙂
Maybe your python version wasn't the right one
Does anyone have a link to that very handy database with i/s per gpu please ?
does anybody know anything about Magnific AI?
I know they use Stable Diffusion
but what has changed?
what s theb est message to create a constant AI model with a similar face and body in different situations?
best method*
they have a pretty solid detailing/upscaling system with tons of parameters and up-to-16K output.
I see
Hello everyone!
I am not Sure where to write this question, but I will just ask it here:
Regardless of which UI this would work in: is there a way, if I have a picture of a character as a 3D model, to import it into SD and just turn the picture into a digital drawing?
Is this possible? Just changing the style of the picture without major changes in clothes and face?
I've tried various approaches with controlnet and so on, but I think I am missing something...
I am currently using the automatic1111 UI, so it would be neat if it is possible in this. But if there is a easier approach in for example the Comfy UI then I would like to hear that too
Thank you in advance
Best regards
Hey, by using controlnet you're on the right path for that
Best would be to use the IP-Adapter model for that
Maybe with canny and openpose too
So I would have three controlnet with each of them using these models? Should I have a specific setting for each model or just the general ones?
But okey I will try it out thank you!
It depends on the input image
If you want the exact same form then use canny.
Same for the openpose for the pose
IP-Adapter is for getting the style
IP-Adapter face is for the face
I would start with IP-Adapter + openpose
Okey! Would you like me to keep you updated if you are interested?
Sure! I've been doing a lot with IP-Adapter and lately I transformed anime images into realistic images. Keeping the consistency.
Didn't tried it the other way round.
But if you have problems or questions feel free to ask
Okey! Thank you very much! ☺️✨
Np, if I have time tomorrow I will try 3d to digital art. You can also send me an example if I should try something specific
That would be terrific! Sure I would gladly send you my example picture tomorrow! If this would work I guess many people could take advantage from that 🤣
Sup boys, random question, is there AI to create sounds?
Hey, there are some web services to create music and sounds. But as far as I know there isn't something you can use locally
You can try out Stable Audio. Its limited but still pretty good
Ok, another random question. I have fooocus, but struggling to understand on how to restore/improve quality of my old pictures, that possible?
Locally?
No only online afaik
you can run it locally but there's no opensource model released, you have to train your own with the Stable Audio Tools, or look for models people have trained> https://github.com/Stability-AI/stable-audio-tools
Excuse me , can i know why i can't use bot to do anything.....
You can use it to read the last msg on each bot, or #1047610792226340935 or #🗣|artisan-support-feedback 😁
I would like to start creating some animation, and have already installed plugins with controlnet for krita. Was wondering part from animate diff, if there are other tools to create pc-based Ai videos without having to use paid for cloud services
Is there any way to take, for example, a flat t-shirt image as input and have it generated on a person?
I also trained a lora on a product but sadly the written stuff isn't good at all. So the product was kind of the same but the writing wasnt good. Is there a way to make it better?
Guys, how do I cut off unnecessary light on a character? It keeps generating some shine and sunlight on the character. I'm tryna create a webtoon sort of thing. I add special effects (light, atmosphere, shadow etc) later, depending on what's going on in that cut. Even when I write a negative prompt it doesn't seem to work.
hello everyone
I have a question, which is better to use? Fooocus or AUTOMATIC1111/stable-diffusion-webui?
I am very new to this
Fooocus is simple UI for quick great img generation, A1111 is for doing custom stuff
how about comfyui, I am doing research, and saw that it allows you to make videos aswell, and that's something that interests me along with custom and advanced cool stuff
which one would you recommend ? I am fine with it being complex and hard to learn
Depends on your GPU too. Fooocus needs more than 6gb vram to work fast.
I would recommend auto1111 as an easy start
What are the requirements for video creation
Having 8gb or more vram.
But in fact videos are just frames so you can also create gifs and videos with less
What about using control net pose and generating tons of frames
Yea thats doable
Img2img also features batch process so you can easily run through a folder of images
Controlnet supports that too
Why can't I chat in bot?
Its under maintenance #1047610792226340935
Hello, I have a question. Can I read out the parameters I used for the picture later? I used the Stable Diffusion Web GUI and as I understand it saves the information in the PNG-Metadata...
Hey, yes you can. In the webui there is the PNG-Info tab where you can drag and drop images in to see the Meta data.
Other PC tools like Exiftool can show these too.
Ah, thank you. Haven't seen that tab.
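For reading it outside the UI: the A1111 web UI writes the generation settings into a PNG text chunk under the key `parameters`. A stdlib-only sketch that walks the PNG chunks and returns that text; it assumes the plain `tEXt` chunk type (prompts with non-Latin-1 characters may be stored as `iTXt`, which this sketch does not handle):

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text(path):
    """Return {keyword: text} for every tEXt chunk in a PNG file."""
    texts = {}
    with open(path, "rb") as f:
        if f.read(8) != PNG_SIGNATURE:
            raise ValueError("not a PNG file")
        while True:
            head = f.read(8)  # 4-byte length + 4-byte chunk type
            if len(head) < 8:
                break
            length, chunk_type = struct.unpack(">I4s", head)
            data = f.read(length)
            f.read(4)  # skip the CRC
            if chunk_type == b"tEXt":
                keyword, _, text = data.partition(b"\x00")
                texts[keyword.decode("latin-1")] = text.decode("latin-1")
            elif chunk_type == b"IEND":
                break
    return texts
```

Something like `read_png_text("image.png").get("parameters")` (the file name is a placeholder) should return the same text the PNG Info tab shows, as long as the image has not been re-saved by a tool that strips metadata.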
that's a good idea
does anyone know why my loras appearance changes so much after inserting a break?
i would like to keep the appearance but also have the benefit of the break system for color bleed
Stable diffusion and also llms in general are so gpu hungry, literally need 3k build just to get them running. 5k+ for a solid performance and trying advanced features
Hello, another question, is there some kind of commenting sign (to deactivate a prompt without deleting it)?
No there isn't
Morning. I'm trying to improve my understanding of how everything works. Is this a fair definition for sampler and steps?
Stable Diffusion starts a generation by creating a fully random noise image (using the seed - which itself can be randomised). The model then figures the amount of noise in the image before removing it. This is repeated many times (steps) until eventually an image free from noise is created. The process used to calculate the noise and produce the next iteration is described by the sampling method.
not exactly, it's a dual process, forward and backward: the forward part injects noise into images in steps, the backward part denoises back to images. all this is done in a kind of compressed space called the latent space. in all that there are a lot of components that encode, decode, deal with text prompts, etc.
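To make the "steps" idea concrete, here is a toy Python sketch of only the backward (denoising) loop, run on a single number instead of an image. Everything in it is an assumption for illustration: `toy_denoise` is a made-up name, the "model" cheats by already knowing the clean value, and the update rule just stands in for a sampler. Real samplers and the latent-space model are far more involved.

```python
import random


def toy_denoise(seed, steps, target=0.5):
    """Start from seeded random noise and walk toward `target` in `steps` updates."""
    rng = random.Random(seed)
    x = rng.gauss(0.0, 1.0)  # the seed fixes the starting noise
    history = [x]
    for i in range(steps):
        # Stand-in for the model: "predict" how much noise is left in x.
        predicted_noise = x - target
        # Stand-in for the sampler: remove an equal share of the remaining noise.
        x = x - predicted_noise / (steps - i)
        history.append(x)
    return history


for value in toy_denoise(seed=42, steps=5):
    print(round(value, 4))
```

More steps means smaller corrections per step; swapping the update rule is roughly what picking a different sampler does in this toy picture.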
Why have all the bot channels not been working for so long? #1100170312106127410 #1103708504142925824 #1100170365604483202 ..... #1101178553900478464
Bot is offline... migrating
Hi everyone! Could someone please help me understand the pricing for fine-tuning an SDXL model for face features via stability ai API? I'm also interested in the cost of accessing the fine-tuned model after training. Thanks in advance!
Thanks. I just need a basic understanding at this point so I know what I am altering when I make changes! At the moment I'm in the dark! 😁
don't worry about what's happening under the hood, focus more on the stuff that will change/alter your look: steps, samplers, schedulers, cfg, denoise... a good place to start is this: https://www.reddit.com/r/stablediffusion/wiki/tutorials/
here you have the info for the dev. platform/api pricing: https://platform.stability.ai/pricing
getting rid of mental aphantasia with good nutrition
maybe silly question... can I use sd1.5 trained Loras on an img to txt with base prompt of SDXL?
no, you shouldn't mix sdxl with 1.5 loras/controlnets, they are trained differently.
thx @lavish lake
people have multiple versions of SD? how does that work? do you just make another directory?
Anyone have any experience with animateanyone?
I need some assistance with a generation. I feel like it's really difficult to achieve this. I'm trying to create an image of a man sleeping on the balcony of a home in the snow outside, with his window open and someone else sleeping inside his home. But nothing I put comes close to this. Any suggestions?
The death of the artist - https://www.youtube.com/watch?v=WxGOtwOZCm4
what kind of AI products, assuming anything is allowed to be used, that would help a political candidate in winning an election the most? i mean very specific use case, for example deepfakes for what specific use case, in technical sense
you might be able to cover this so that people are more aware for safety purpose?
ya figures, they have that new production to offload..
why i cant create image ?
Most of the models are trained with female characters. Do you know any good models that are doing great on male characters? Preferably anime style
I don't know what counts as low or high in the advanced settings. What numbers are low or high for prompt strength, generation steps or seeds, and what numbers should I put in to get which results? Also, what is the difference between the models SDXL v1.0 and Stable Diffusion v1.6? What does all of this mean?
Disclaimer: I'm also a beginner so there may be some false information
SDXL is for higher resolution and detail. It takes longer to generate images with SDXL.
SD 1.X is the regular version most people use. It is for daily usage and there are many more models and LoRAs for this version.
There's also SD 2.0 but don't bother, literally everyone had a problem with this version, and almost nobody uses it.
You can't tune the seed to get a specific result. The seed is like a starting point that determines how the denoising process will turn out. So start with the "random seed" option, generate a couple of images with the SAME PROMPT and random seeds, and if there's a specific image you like, you can reuse that seed and alter the prompt to get results similar to that one.
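A tiny Python sketch of why reusing a seed works: the seed only fixes the random starting noise, so the same seed gives the same start and a different seed gives a different one. `starting_noise` is a made-up helper, and a short list of numbers stands in for the real noise image.

```python
import random


def starting_noise(seed, n=4):
    """Return n pseudo-random values; the same seed always yields the same list."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


same_a = starting_noise(1234)
same_b = starting_noise(1234)
other = starting_noise(1235)

print(same_a == same_b)  # same seed: identical starting noise
print(same_a == other)   # neighbouring seed: unrelated starting noise
```

Note that a seed one higher is not "slightly different"; it is just a different random start, which is why seed numbers have no low or high meaning.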
There is no SD 1.6. You might be thinking of WebUI which has had a version 1.6. There is SD 1.5 and SDXL (plus turbo editions). Just use 1.5 models if you dont have a powerful machine.
Guys, is there any anime fine tuned SDXL turbo model?
Hi, I could not buy credits. It says: We are unable to authenticate your payment method. Please choose a different payment method and try again.
What is prompt strength?
Hey , is anyone using VMs on google to train and generate?
dream A woven labels of Galaxy
are people in SD working on any humanoid bot project?
just curious
something on the lines of tesla's optimus?
hi, is there an equivalent of canva/invideo on local pc?
as all these services are paid and stack up to quite a hefty expense
You can prioritise certain words in your prompt to make them more prominent. You do this by adding brackets for example with: cute fluffy cat
You can make 'fluffy' more prominent by typing: cute (fluffy) cat
Each set of brackets multiplies the emphasis by 1.1 so ((fluffy)) would become 1.21 strength. You can also just type in the number like in the following: cute (fluffy:1.5) cat
It might depend on the model, but whenever I tried to go over 2 the picture started devolving into a non-sensical mess of noise, so I tend to stick with 1.1 to 1.4.
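The bracket math above can be sketched in a few lines of Python. This is a simplified illustration, not the webui's actual prompt parser: `emphasis_weight` is a made-up name and it only handles a single token written as `word`, `((word))` or `(word:1.5)`.

```python
import re


def emphasis_weight(token, step=1.1):
    """Return (text, weight) for 'fluffy', '((fluffy))' or '(fluffy:1.5)'."""
    # Explicit form: (text:number) uses the number directly.
    explicit = re.fullmatch(r"\((.+):([0-9]*\.?[0-9]+)\)", token)
    if explicit:
        return explicit.group(1), float(explicit.group(2))
    # Bracket form: every pair of brackets multiplies the weight by `step`.
    depth = 0
    while token.startswith("(") and token.endswith(")"):
        token = token[1:-1]
        depth += 1
    return token, round(step ** depth, 4)


for example in ["fluffy", "(fluffy)", "((fluffy))", "(fluffy:1.5)"]:
    print(example, "->", emphasis_weight(example))
```

So three sets of brackets already land at about 1.33, which is inside the 1.1 to 1.4 range mentioned above.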
Admin please?
You might want to @ them.
Hey there, I really don't get through this.
in the free stability AI membership it states that its not for commercial usage. Does this mean that SD is no longer free for commercial use?
true only for new models, if you are using 1.5 to sdxl nothing changes.
bot always down ?
since it was migrating yes, check #1047610792226340935 . we should get it back soon 😎
Hello
So SDXL is still free for commercial use, but SDXL Turbo and everything after that is/will only be commercially available for ~$20/month.
Hello
correct for turbo, svd, zero123 and everything new
💐Good morning!🦄 How is everyone this beautiful day?
Where does n00b go to learn about stable diffusion?
youtube too: olivio sarikas, sebastian kamph and a lot more!
Question about adding quick settings and whether they need to be adjusted per check point.
For example: I was copying another image prompt which used SD VAE and Clip Skip. This article about Clip Skip suggests it's only really relevant on 1.x models. So my question is: should I disable Clip Skip if I am experimenting with, say, picX or another checkpoint, and does the same rule apply to SD VAE?
the vae will affect the look and colorspace, so having the correct one always helps
about clip skip, if the models have been trained like that, you need to use it correctly as stated on the model/lora
most models/loras have that info (at Civitai, etc...), they also state if the VAE is already baked in, so you don't have to choose one
Hey guys! Does anyone know how to better avoid the tiling issue when using the Ultimate SD Upscale node?
Sometimes it works by decreasing the amount of noise in the sampler, but when I do several upscales to increase the definition I usually get that tiling issue with colors and bugs between the tiles.
Hi guys. How can I use this bot?
right now you can't, it's been migrated and will take a bit to be operational, please check #1047610792226340935
Can anyone point me to some workflows/tutorials for getting a person's body type into stable diffusion? My faces look pretty good with ReActor but for the bodies, I'm just winging it trying to adjust prompt weights on different body parts until I get something close enough to trick my brain.
@lavish lake Hi, I need to talk to you privately if it's possible! Thank you!
please use the ticketing system https://discord.com/channels/1002292111942635562/1010934719455707218
Yo
yo!
I usually throw in a couple of full-body images in the data set when training a face in SD. Hopefully a good descriptive prompt can help you get the body type you're looking for, or try with a LoRA.
I'm jonesing to generate some videos ,like when will the bot be up????
When you train a face from a set of photos, do you use reActor at the end? Do you merge your trained face with an existing checkpoint to create a combined custom checkpoint? Can you recommend a tutorial on training on a face?
anyone here who captions pics for lora training, do you caption the emotions as well? Like if the person is smiling, angry, etc, should you caption the emotion? I go back and forth on what's better.
Is there an Auto1111 specific discord server?
Hello everyone!
I have rich experiences with sfw, nsfw image generation platform development.
My experienced image generation platforms are DALL-E, Stable Diffusion, Midjourney, Randomseed.
Also custom chatbot development using OpenAI is my major.
If someone needed to develop something like this, I am ready to work for it.
Best Regards.
Thank-you.
for questions about the usage or for development?
usage mostly
then there isn't a specific server, but for technical questions feel free to ask in #🤝|tech-support
or for getting specific things in images ask in #📝|prompting-help
Whats an appropriate channel for the controlnet face-id model?
#🤝|tech-support would fit for that
Thanks
np 🙂 see you there
I got into SD about six months ago and had a lot of success with lora on my 3070. I have been out of the loop since and will have a 4090 soon. What direction should I be going?
- SDXL?
- a1111 or comfy?
- LoRa or something else?
- Kohya for LoRa?
My plan was to just go back to LoRa with Kohya and SDXL. Play with Comfy. Is A1111 a waste of time with SDXL?
Is sd 2.2 going to be coming soon?
hey, nope, sdxl works with a1111 as well as with comfyui
having both doesn't hurt in any way either.
Would also recommend using kohya_ss or the lora scripts:
https://github.com/Akegarasu/lora-scripts
for lora training.
A1111 or Comfy:
Depends on your interface preferences. A1111 is more restrictive but everything is all set up for you to use out of the box. ComfyUI uses a module based system which is less intuitive unless you are accustomed to modular plugin interfaces.
SDXL supersedes the SDv2 line
first version of what became XL was called sdv2.2 until it was understood as a whole new beast
Thanks, both of you. Yeah I plan on playing with both. It sounded like SDXL was not supported on A1111 at some point, but ideally I could stick with what's familiar while I get back into LoRA training.
Ohhh i see
When was xl released
And where can I read about it
Also does xl need better GPUs than standard?
only slightly, I think 6 GiB VRAM is the minimum to run normally iirc? with 8GiB preferred
nvidia 20xx, 30xx, 40xx desktop versions all work fine
laptop versions are more mixed
2.1 runs totally smoothly on my 6gb 4050 laptop version
I'll try and see if xl works too
xl launch news => https://stability.ai/news/stable-diffusion-sdxl-1-announcement
more recently, xl turbo => https://stability.ai/news/stability-ai-sdxl-turbo which is a bit easier to run
note that you need a good or well configured ui, like comfy or Swarm
I see, checking on that
.
BTW, how does stability AI profit from all of this if everything is free and open source?
is there special channel for developers to ask dev questions in?
Ah I see, makes sense now
Wow sdxl turbo is only one step wow
Could you clarify what ✔ Enterprise features means? It doesn't specify that anywhere.
for any questions or inquiries regarding the enterprise membership, please contact here https://stability.ai/enterprise
hmmm, how do I get permission to use the bots ?
in my old install of a1111, it wrote a log.csv for images... i can't seem to find the option to enable that now. anyone know where it is?
i dont know what I did wrong, I followed the steps in the stable diffusion art guide but my sdxl turbo images are like they need more steps
cant upload pictures here
they are all grainy like when you set a normal model to a single step
can anyone help
i literally turned it on and off and its fixed lol
what is the word to make the images look like real photos?
"A photo of" and add some photographic prompts like "35mm f2.8" "Kodak Gold" "Canon 5D"
I also like to neg out "painting/drawing/cartoon/airbrushed"
hi … i'm new here and search for info's for the app draw things … any hints are welcome
hey, what info exactly? DrawThings is an iOS app that runs Stable Diffusion locally on your device
that's what i know currently … want to know the general usage of this app, the meanings of the parameters, what happens when I use this or that … means the general usage of the app …
q
werty
A
i search for general infos on usage drawing things … the meaning of the parameter and so on …
Hello world! 🙂 Do you think it would be possible with AI to generate a drawing that looks like a panel of Little Nemo in Slumberland, drawn by Winsor McCay? 🤔 (Or even a page.)
then maybe check out some basic automatic1111 usage tutorials, because most of the features (txt2img and img2img) are the same in DrawThings
Steps, cfg, hires fix, resolution, models etc
thx … i will start with that 🙂
I also saw draw things has its own discord. So that would be a good source of information too.
Its linked at the bottom of their website
great … thx so much …
How soon will the bot be back? before Saturday?
hello. I started getting the message "you dont have permission ..... " in the server rooms of bot-6 and up. What is that? why?
#diffuse-together diffusers, thank you so incredibly much for helping me get a start here! Just won a monthly “special stuff and ai” on International Underground Music Video Festival and will be in the annual!
time to transfer 400GB of SD 1.5 models to my server. 2TB nvme is full, so i can wait 40 sec for spinning rust for 1.5, then 7 sec for SDXL from the local nvme lol
Does anyone have experience generating assets with SD? Looking for tips and resources
why do you have so many models?
I tried sdxl with 6gb vram, even with lowvram setting it doesnt work...it puts out half assed poor quality images
worked fine for me with 6gb vram
What api do you use? I tried with a111 and comfyui same result
i used a111
what gpu
@vapid tangle
Why don't I have the authority to send messages in stable diffusion
you can't run it with just 8gb of ram. it's slow af on my pc because i only have 16gb of ram. you can do like me and increase the pagefile, but because it's using the pagefile as ram it will be even slower
Hi guys, is there any leaderboard for text to image models? I prefer the kinds like chatarena which is ranked by human comparisons.
does anyone know of a way to insert a comment into a stable diffusion prompt that will not be rendered or affect token count etc
why why why
cmooon its nearly been 2 days
?
Why is this ai a 100 times slower if I change the width and the height?
?
Higher resolution means more pixels to "calculate" so it takes more time
Also, if you go beyond the resolution the model was trained on, it has no data for the extra resolution, so it struggles (hard) to come up with something it doesn't understand, which is way slower. For SD 1.5 that's around 512x512 px, for SDXL it's 1024x1024
the best way to have more resolution is upscaling
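Some quick arithmetic on the "more pixels" point, as a Python sketch. The helper names are made up, and the 8x downscale into latent space is my assumption about the usual SD VAE; actual generation time does not have to scale exactly with pixel count.

```python
def relative_pixels(width, height, base=512):
    """How many times more pixels than a base x base image."""
    return (width * height) / (base * base)


def latent_size(width, height, factor=8):
    """Latent grid size, assuming the VAE downscales each side by `factor`."""
    return width // factor, height // factor


for w, h in [(512, 512), (768, 768), (1024, 1024)]:
    print(f"{w}x{h}: {relative_pixels(w, h)}x the pixels, latent {latent_size(w, h)}")
```

Doubling both width and height means four times the pixels to work on, which is part of why generating small and upscaling afterwards is usually the cheaper route.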
Good morning, everyone! How are we all today?
hello
Good afternoon. I'm looking at the JuggernautXL model and wondering what is meant by "HiRes: 4xNMKD-Siax_200k with 15 Steps and 0.3 Denoise". I'm guessing its something to do with high resolution but thats as far as I get. Can someone explain it to a simple soul?
the hires fix is using an upscaler, those are models trained on big images. that 4x_Siax means that it's best for 4x scaling
if you give it 1024 you can get 4096px out
you can learn about steps and denoising levels, sampling etc more here: https://www.reddit.com/r/stablediffusion/wiki/tutorials/
Hey everybody, enjoy your creativity day
Thank you
that's probably the problem
with 16gb ram it runs fine for me
ofc fine is subjective lol
Hello everyone, I just started with stable diffusion and I am currently facing some small problems. I am using fooocus and I am so far satisfied with how it works. Now I want to change the size of individual body parts and I can't get it right with inpaint. Does anyone know how to enlarge or shrink the breast size, for example?
I’m looking to fine-tune SDXL with the **LoRA** method for my new project. I’m wondering if I should go with TPU or GPU for this task. Can anyone tell me which one would give me better performance and be more cost-efficient?
Hey, totally off topic of stable diffusion, but I've paid 3 different companies for ai headshots and they are bad. One had my head like way too big, one of them turned me asian and the 3rd just looked cartoonish. Does anyone know a good viable solution for this?
Maybe hire a freelancer? Any corporation in 2024 is just out there to scam you, no matter the field.
I wish there was a paid project request channel here, that would be a perfect solution i think. I assume the founders dont want the liability of that though.
What was your desired outcome of your headshots anyways?
Just a nice linkedin photo that looks professional. Perhaps pretty up my ugly mug up a bit LOL
Sounds like you want AI to do what photoshop does
pretty much. I just liked the idea of spending 35 bucks and getting 40 shots... It would take me hours to edit that much in photoshop... plus while I'm a graphic designer I'm not well versed in faces and that type of thing. I create logos and marketing graphics for companies.
I DM'd you
nice! i guessed you only did nsfw girls, judging from #🎥|animation
Another fun fact is I'm only attracted to males too 💀
hey guys, I'm looking for a better open-source image generator model. currently I'm using PixArt-alpha, and sometimes it generates low-quality images
Yes I was not impressed with it.
hi guys
can someone help me figure out how to utilize more VRAM in fooocus
I find the documentation lacking and I can't get my 7900XTX over 1GB util
?
no idea, but there are additional steps in installing for AMD cards on the github page if you missed that https://github.com/lllyasviel/Fooocus?tab=readme-ov-file#windowsamd-gpus
I can do that
If you're looking for a photorealism option anyways
Hello, is it normal I have generating speed about 2 it/sec on SDXL 1024x1024 with RTX 3060Ti? Or is there some problem?
hey, make sure you have --xformers --medvram-sdxl --no-half-vae in your webui-user.bat
Forgot to mention I have ComfyUI, so no webui there.
