#💬|general-chat
1 messages · Page 74 of 1
About latent space. To keep it extra simple (if any AI scientists read this.... spare me please) the usual stable diffusion workflow goes like that :
"prompt in actual english words" -> [CLIP phase] -> "token" (math values) -> [VAE encoding] -> [latent space neural network complex math iteration 1] -> [latent space neural network complex math iteration 2] -> ..... -> iteration xxx -> [VAE decoding] -> "pixel values"/image
When doing hires.fix it goes :
prompt in actual english words" -> [CLIP phase] -> "token" (math values) -> [VAE encoding] -> [latent space neural network complex math iteration 1] -> [latent space iteration xxx for pass 1 at low res] -> [latent space iteration xxx for pass 2 at high res] -> [VAE decoding] -> "pixel values"/image
That won't be in the test. Don't worry
I switched to Comfy and like how I can visually see the flow, makes it a bit easier to understand.
basically what it means is when moving from pass1 to pass 2 we're staying in latent space. Which is good because it's fast and it avoid the loss of information caused by VAE decoding
Glad that it helps you then.
Is there ever reason to use any mode besides Bicubic?
I'd say it's an aesthetic choice.
If you play video games. Think of it kinda like the different Anti Aliasing options. AAx2, AAx4, FXAA, SMAA, etc
each has its own strength and weakness.
I started using bicubic heavily as it seemed to produce the most accurate highres/upscaling, I really haven't played around with it since.
@potent jungle Thanks. I managed to do it in automatic.
Hey guys, sorry to interrupt you… I am pretty new and exploring the amazing landscape of stable and its ai. When I choose an image from my files with controlnet, it doesn’t use this as orientation - how can I change that ? Sorry again guys and thank you so much
Glad it worked out, I like Auto's inpainting tool for quick ease of use.
that's a topic for #🤝|tech-support (precise what controlnet model you're using and/or screenshot your webpage)
I am a noob sorry, I thought I could just use the command and then everything would be working here in discord and generated 😭
I'm pretty sure you can't use controlnet with the discord bot ^^". You can but only in #1100170312106127410
we all start from scratch at some points
You can use controlnet on the bot, but only in #1100170312106127410
thanks for the correction
I was double checking :p, I'm never using the bots channels
Anyone get this error with SD XL?
OSError: [WinError 193] %1 is not a valid Win32 application. Error loading "X:\ai\ComfyUI_windows_portable\python_embeded\lib\site-packages\torch\lib\nvfuser_codegen.dll" or one of its dependencies.
Proof: #1100170312106127410 message
that would also be a question for #🤝|tech-support
Legend thank you !!!
Thanks
Appreciate your help as well
Just keep the original image sfw since they display the original image.
Could you pls explain that ? Because as soon as I send it to the bot, I just get the image I uploaded, not changed
Wait a second for the bot to generate the results.
Wooooow now I understood thanks man !!!
Sorry to disturb u again, can I chose picture that was created with promted with controlnet, but in a completely new angle and everything? So for example a logo that I created but on a trophy for example that is lift but an athlete ? Just to give u an example
Depends on the controlnet model. I am guessing this controlnet model is either canny edge detection or it's a depth model (most likely this one) so you would not be able to use it to create variations of your given image.
The controlnet depth model creates a 3d depth map of your image and then feeds it into stable diffusion to diffuse an image that confirms to that depth map.
Appreciate it a lot ! Thank you, was a huge help
Here's some fun homework for you: https://stable-diffusion-art.com/controlnet/
Well maybe it's a little drier than I was expecting but it explains what it's doing.
Yo how do u use the bot
@gaunt pendant
Currently, there is a public bot on the server that generates images available as a research beta for SDXL, you can find the current status of the bot in #1047610792226340935. There are plenty of ways to use Stable Diffusion such as the official https://dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware - check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
Meh the faq message isn't that helpful actually when it comes to how to use the bot.... Check the bot channels pinned message for more information
From how it was explained above, HighResFix uses the model to make multiple passes of the base resolution image it first generates, upscaler then takes the highres image and upscales that completed image to an even higher resolution.
High-res is using a latent image, upscaler is using a completed image.
@still glacierHoi, do you know how i acquired the gamer role?
hi
What is Lora?
Lora is that thing which is trained usually on 1 specific thing, star wars uniforms, somebody particular. Or tilt shift for example.
Control LoRAs https://huggingface.co/stabilityai/control-lora
I'm curious - why is the bot set to generate square images at 1048px x 1048px? Isn't it supposed to be 1024?
I am new to learning stable diffusion
should I go with automatic1111 or comfy?
or learn both😂💯
Anyone does model training commissions?
Hello everyone,
Has anyone had experience with generating flyers/posters using prompts that have actual text in them (like a lineup)? I can't seem to get the text to look normal in the generated outputs; it all comes out resembling ancient Egyptian hieroglyphics.
it is very hard to get it to generate intelligible text, it only works if the model you use is trained expecially on text (e.g STOP road signs)
your GUI does not matter that much, you will get all the things eventually
i see. thanks.
hi
Anyone have pfp like my they can send me?
I want to see different pfp I can use
hello what is sdxl? If you know a video that will help me or can you help me how to download and use it? now i have sd installed
No idea, I know what sdxl is, but xdsl might be something completely different
sorry i wote it wrong i meant sdxl
I think I found the download, but can we only produce realistic pictures with this model?
Figured 😂. It's the latest model developed by stabilityai, it's a text to image model
I want to create landscape and anime pictures. When the sdxl is loaded, is it possible to use anime models in addition to it?
It can produce a wide variety of different types of images, but in general that will grow as the community releases fine tune models/loras based on it
hello! is there any resources to teach deploy the stable diffusion on local machine? thanks!
There are many on YouTube, search for secourses, he has had a bunch
okay thanks again
Windows only, if you need Linux there are others out there
How can I find customized models with xdsl build?
There's a site called civitai
Curious how people land on this discord and don't even know about the model, like how did you find the discord in the first place?
I knew the models and how they work. I just couldn't understand the developments that took place in 3 4 months. I thought sdxl was a new program
thanks for your help
Lol, what's the significance of the voice channels if there's jsut gonna be one person alone in each
Of the weeks i've been active here after SDXL came out, i've wanted to hop in one that has 3-4 people in it, but i've only ever seen many in them, but one in each
Good question, the release had a bunch of people, I can't imagine wanting to join one at random personally
is this for those who use SD with python working or like anything, cause i use a1111
GM
Is legal to monetize SDXL tool?
The bot does not seem to understand the concept of a tower that you can’t see the top of, no matter what I type. I’m trying to get something that’s really tall and disappears into the clouds, but it just keeps terminating the object in the image at a really short height.
Had no idea there was a discord, yaay for discovery
Hi everyone, i have a question:
im trying to recreate the process on the video below, and i was wondering if there are any free alternatives to midjourney where i could create my avatar with a similar level of quality
https://www.instagram.com/reel/CujzvcXuMGO/
true
barely anyone and if then still not active
Hey guys, I've got a really weird issue. Haven't used SD in a while now, and now it suddenly became REALLY slow to the point of being unusable. Additionally, the Dream Vinfinity that I had as my base model.ckpt fails to load now. Nothing in my setup changed, though I did update my AUTOMATIC UI afterward, but that didn't fix it. Anyone had any similar issue? Any pointers on what to do?
I had a similar issue and reinstalling fixed it for me
with lora embeds, where do I download them to in the stable diff file structure. on civitai I see people using <loraembed> but not really much other info on how its used
Where do i go to create an image and how?
You do not want ckpt. Redownload the model as a .safetensor. ckpt is insecure as they allow for malicious code to be executed within memory
Gotta be more specific, in what way? Website? Or one you run locally?
As I noted to knaapje, you do not want ckpt. It's insecure. Read my reply to him 2 comments up :)
This will use runpod which I've also used. And you can rent all from simple 3090 for what.. 37 cents was it per hour, to highest end Quadro 80GB a100 workstation card. Which I used
but costs 1 dollar 75 per hour iirc
@floral umbra I created one. Unfortunatly their is no ai platform that can create images that are able to communicate with textile machinery.
Where do you guys find models for Stable Diffusion?
Anyone into the following
I am a sock manufacturer.
I am looking to have AI create image Files from images given by customers and utilize AI image creativity.
Image files created are used to transmit data to a machine to engage functions.
Data transmission or Data signal designators to the machine are represented by the RGB colors located in the file. Machine capability is limited. RGB colors that can be in the file must be limited. Currently Ai image generators use shading, gradient, etc.. in creating images. you also can not designate the image size more specifically image size in Pixels.
Example
168 pixels wide 400 pixels height.
168 represents the 168 needles that are in the cylinder of the machine.
400 represents how many courses are in the sock. or how many times the cylinder has rotated picking up different colored yarn at its yarn intake points.
RGB colors in the file are used by technicians to designate fixed yarn takeup points on the machine.
Transmitting the data to the machine is not what I am looking for. I am just looking to create images
Me and two other educators and prompt engineers are putting on a set of masterclasses, paid prompt engineers.
Instructors:
-
Harris Terry - 30-year educator from USA, works as a prompt engineer with a startup and has given lectures and masterclasses on various AI topics including Midjourney with more than 60 attendees.
https://aidreams.tech/ -
Geoffrey Mollet - Prompt engineer, web developer, and AI social media specialist from France with videos with 38m+ views.
https://linktr.ee/singularitydiffusion -
Tanvir Hafiz - YouTube AI educator and tech specialist from Bangladesh, AI freelancer for corporate clients and video production.
https://www.youtube.com/@TanvirsTechTalk
This course will be Sundays at Noon EST, starting August 27th via Zoom.
The dates are:
Aug 27
Sept 3, 10, 17, 24
Oct 1
Course Summary
Session 1 - Basics
- txt2img settings, models, prompting, styles, consistent characters, upscaling
Session 2 - Images and Inpainting
- img2img settings, inpainting basics, inpainting for upscaling, Adetailer use
Session 3 - Deforum
- installation, basic features, scheduler use and frames, movement controls, prompts, Init and Final tab settings
Session 4 - ControlNet
- installation, model download, basics of UI, types of models & uses, usage in txt2img and img2img
Session 5 - Textual Inversions & LORAs
- basics of Textural Inversions & Negative Embeddings, LORAs and various uses (characters/styles/hair/accents)
Session 6 - Additional Extensions
- installation and use of:
- ROOP - allows face swapping
- Inpaint Anything - faster inpainting alternative
- Tiled Diffusion / Multi region prompt - different prompts for different areas of the screen
- Tag Autocomplete - see popular terms and use wildcards
- Cutoff - allow separation of colors and descriptors by using commas
If for some reason a class is postponed due to an unexpected emergency or technical difficulty, the following Sunday will continue where the last session left off.
Upon sign up, you will be emailed a single-use link to join the class Discord server, and will receive a Zoom link for the class before the first session. Sessions will be recorded and available via Discord for members. There are no refunds for this course, and distribution of any Discord or Zoom links will get you removed from the course without refund immediately.
You are expected to have some sort of Stable Diffusion software (A1111, SDNext) installed before sessions begin if you want to follow along and try what you are watching. The Discord server and YouTube have install videos to walk you through the process, and the instructors are happy to help you during non-lecture time. By signing up, you agree not to distribute or share videos or materials from the course.
Let's become masters at Stable Diffusion!
https://sowl.co/s/1HsPX
Sample pics
thx for reading 🙂
@solemn blade
Example
168 pixels wide 400 pixels height.
Step 1 - have a prompt template, that will create images in a style you're happy with
Step 2 - Have a setup that creates images locally (SD1.5, SDXL, or a custom checkpoint) or pay for an API key, and use that to create your images (clipdrop, or similar services)
Step 3 - Use that to generate images with with 50:21 aspect ratio (cause that can later be resized to 168:400px)
Step 4 - Take those images, and send them to your custom script which then:
• matches colors to the closest color available in your machine, based on a modifyable template.
• has a few filters added, so that contrast gets adjusted, so it still looks good on socks with a low resolution (though this can be skipped, if your prompt is good enough)
• resizes images to your compatible resolution, so that it matches your machine input
you did get permission from a mod, to self promote, right?
No, I had no idea where to put this, and guessed. If need be, mods can take down. We are tying to help new prompters, and it really makes us almost no money, I don't know if you read the post, and did not think it was a problem on such a huge Discord. @earnest lichen
I could speak to a mod, or if they wish can delete it and I apologize for any inconvience.
just get a quick ok from @bleak matrix , so it's in the clear
I teach AI and am looking to help new prompter
omg there are like 10 sunny names ROFL
oh you tagged ok thx
will probably be fine ^^
OK, if I broke any rules I apologize. Should I wait around a few?
to speak with Sunny?
Let me take some time to review; thank you!
civitAI
no one?
i love SDXL i just have no computer to run it properly atm
then how do you love it?
It is hard to use. If I make a prompt to give me a good character, it ignores things like clothing and situation. If I make a very short prompt to get a generic character, then I get control over clothing and situation. It's hard to get both, often impossible.
Could somebody tell me if StableDreamer bot is capable of private messaging or are we required to sit in a bot channel to generate images?
I think you have to be in the channel
anybody know how to get Digitigrade leg style robot prompt?
If you want the bot, you gotta be here. But you can use ClipDrop or DreamStudio without the public display if you prefer (costs a couple bucks to register for the privilege of privacy)
huh?
just want to know the fps setttings ..how can i change fps of video in gui settings
any tips on how to solve RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!.? I am not a computer expert
maybe you have all your tensors spread out on multiple drives
is there a way to check or solve without being programmer? and more important is it just error msg or really stopping because i can generated image ok
not fast but ok
Come to #🤝|tech-support with a screenshot of the error and your GPU name
super thks
далбоебы бля ахахах
send the info there now
does anyone know of any good discord servers for ai voice model stuff? like is there something like Unstable Diffusion but for ai voice models? plz @ me if you do, thanks in advance
Imo the best way to work with AI imagery is just like with actually doing art: Working with layers and piece by piece
please use english when communicating on the server, also please be respectful to others and review our https://discord.com/channels/1002292111942635562/1040299965647437905
I think jiafei is cringe
Serious question if I have the hardware is MJ better than SD?
sounds like a subjective topic
Do you know pros and cons of each?
Just curious
literally never tried midjourney ever
not even curious about it, tbh, so that disqualifies me
In StableSwarm, when running ComfyUI, using the Load Image node results in a "400 - Bad Request" when trying to load any image other than example.png--even after complete re-install. No errors or diagnostic info in cmd. Running locally on Win 10 Pro with RTX 3090 FE. Anyone?
Hi, I found a 3090 and a A6000 at a similar good price, which one would you choose? They look similar to me
i used Midjourney for pretty much 6 months and SD for a shorter time and now im using Firefly. Midjourney and SD are quite different. The question is what do you want or need? Do you generate images just for fun? Is high customization important for you?
MJ is great for one-offs, things you don't need perfect consistency for. SD I use, as I need to correct pictures with inpainting, use different models, use ControlNet for poses... MJ IMHO is more beginner software. For example, the seminar I give on MJ is 3 hours and teaches you... basically 100% of the program. My SD seminar, on the other hand, is 6 hours and really just gets you from install to almost intermediate, there are so many features.
Really, MJ for me is total fail because of no inpainting. No correcting? Can't use it for my job lol
@alpine shuttle
MJ users partially use Adobe products including Firefly as well tho, at least those that want more than just ready to go images, also inpainting is apparently coming today or these days to MJ as well.
But MJ on itself is very limited compared to SD or Firefly but many of the users dont want or need more than that
Thats why i asked the user questions above
There a way to speed up model loading?
asking myself that as well... and I've got a VAE installed as well
You can set how many models to cache in settings
also, using --medvram with this helps
Mmm heyo, is there any place that I can find models to download ? Kinda new here D:
Models: https://civitai.com/
Thankies
Hi has someone else seen that the tensors are cleared with the new clora workflows in comfy? I need to reload the base model every time
they actually solved inpainting this week
it's a feature now
oops i went to get the link and they only mentioned it's coming quite soon this week
I will have to see how well that works, as I often use PS and minipaint to go back and forth with inpainting, ie paint part of a sky with crap blue, denoise at 0.2
can you load a pic into MJ back?
anywyas. no big deal. sdxl is free and is better
lol true
MJ does lots. i wouldn't use it but they are staying competitive
if it had more models it might interest me
what we're doing with revision now, MJ users have been doing all year
it was like the one thing i considered paying to play with
lol, just canceled it today how ironic
SD has its target audience, MJ theirs
With interchange ofc
Not a lot of people are willing to jump to SD even as it is free

MJ is much more user friendly. Different target audiences really. One is casual, one allows for more control.
That's why I posted this, to try to help people get into it #💬|general-chat message
Its really not all that hard, once you understand you have to work a little harder
I just am not into making 100 iterations and picking out 3 that are ok lol, corrections and ControlNet are a must for me
I used MJ a bunch, but I found it lacking in making things like my comics, and for my job no way I could possibly use MJ
but i see different target audiences
sup homies
MJ seems to be more for those who want to make something that looks pretty or interesting and that fits a general theme. SD is more for extreme control. SD also allows for bypassing MJ filters.
i mean with custom models and checkpoints, loras, custom scripting, multiple passes
isnt mj just a product?
some folks struggle with accuracy and realism but with the proper setup in a backend program like comfyui you can do anything
the sdxl models by themselves are wonky, but there are many ways to work around (or with) them
like checkpoints, loras, controlnets
Base models are intended to do almost everything but not be perfect at anything. Fine-tuning (LoRA, TI, etc.) is intended to train the model to be better at certain things.
Checkpoint is just a fancy way to say base model.
@south flame part of them uses it in company with Firefly and Photoshop tho
for me personally, i dont want to pay for MJ anymore
and SD doesnt suit my needs and workflow either
how?
wdym how? 😄
i think SD is limited only by the user
waiting for the day when text, voice, and image generators merge into one
eventually image generators will have to reach the point where they become text generators as well
once they learn their alphabets
nah, they all are limited compared to doing it yourself but i prefer Firefly (especially in the future) and SD could have been a company for FF for me but especially atm its not worth it for me
plus i have enough to play around with customizing my art softwares, i dont need the art generator that is serving primarily a idea and reference generator to be unnecessarily time and/or learn consuming as well
well learn not so much because i have to learn to code etc. anyway
not sure what you are trying to do, but SD supports masking and controlnets for advanced usage, as well as clip functionality for advanced merging/blending
you don't need to learn code if you use a program like comfyui
just have to learn node workflow
i picked it up in a day
though i have experience working with nodes from my blender days
yeah but for my workflow and workspace SD is very inefficient, especially with current specs as well and on top of that i already have the tools or better said softwares that give me the maximum possible freedom and control at the cost that i have to learn art and editing skills myself to reach that point
that's fair and admirable that you're willing to improve your own artistry
thats where Firefly fits me by far the best since i also happen to do my stuff in Adobe ecosystem when it comes to 2D especially
just be aware that ai can always suppliment your endeavors
no matter how great of an artist you are
i love AI
i think it works best for artists really
not meant to replace traditional methods but to provide a new way of enhancing and dreaming up new possibilities
well skilled artists can maximise the potential of generative AI the most
I'm planning on wiring up my photoshop edits and pen sketches to SDXL and see where that goes
SD suits well certain people, but for me it doesnt work well with my pipeline. And then my specs make it even worse to use SD seriously as a side tool
they just released some kind of experimental control model that turns line art into fully colored/shaded illustrations
though if you're clever enough you can figure out how to do all of that yourself
oh, you mean sketch to image?
not sure what it's called but it was just released (this past week?) along with a handful of other tools
tried that with Alpaca, didnt try it in SD. Firefly will have it as well. That feature is amazing.
including two new control nets
controlnet itself is amazing
yes it's basically an automatic photoshop tool
there are a plenty of tools im waiting for Firefly to release, some of which are already existing in SD or Alpaca plugin for PS themselves and some arent known anywhere as of now
interesting times are coming
even Autodesk brings now again some generative AI stuff
the only one remaining is Maxon to do it as well
i mean we went from disco diffusion to midjourney beta to dalle2 to early SD to modern midjourney to SDXL
all more or less within a year
😁
i had dalle2 access around that time
and i was deep into some other generative notebooks
like disco diffusion
I've been doing this since image generation was a blurry and indistinguishable mess of pixels
u are using Blender?
you'd ask for a bird and it would give you a pizza
i used blender for 3D art unrelated to ai
Blender has that SD plugin
i didn't know that
even for Maya one is developed
but i use comfyui, which is a node based program like blender
for some reason Autodesk brings some sick generative AI for Maya but 3ds Max (which i use) isnt mentioned...at least not yet
it's limited only by how clever you can be with the nodes
and has custom scripting for advanced usage
i can imagine that although im not deep in programming and scripting area yet
but i hopefully will one day
because i will need it
other people will develop the tools you need by then
except, adobe...gag
if you want them now you're gunna have to be inventive
i mean the tools already exist you just need to assemble them
I'll take open source any day, gimp over photoshop, etc
professionals often dont have a choice, but as a hobbiest
photodemon is cool too
and inkscape
blender also does just about everything these days
including photo editing
but personalized will be yours to do
blender and gimp, they are amazing really amazing
well FF is limited as well obviously but it fits my workflow so much
What's holding back the release of inpainting xl models?
you cant inpaint them? I wasnt aware
I am already inpainting? Though not via controlnet.
Hi
there are many ways to inpaint
if you're looking for a program to do it for you rather than assemble the pieces yourself, I've seen a couple paid services
VQGAN+CLIP?
ah yeah that was the other one
i loved the "animated" frame compilations
made it feel alive
Is there a way for me to input an image of a item and have it used in the image i want to generate? like I upload a picture of a pen that i want to be used by the character writing a note?
Hello everyone! the latest update to my Deforum-Helper app brings some new features. Your control options have expanded, now encompassing camera FOV, near and far. Plus, you can fine-tune the Deforum diffusion strength. I'd love to hear your thoughts on any additional settings you believe should be included in the editor.
I've also introduced a bar visualization in the 3D live mode for more effective control and monitoring as you work. Your feedback on these updates is highly valued!
Enjoy and thanks for using my app 🤘
https://deforum-helper.vercel.app
Released now! https://civitai.com/models/127923/xl-yamers-realistic
hey
any one around ?
can some one explain the :1.5 in prompts
and how i would take advatage of that in otehr prompts i make ?
hi
hi
if it is (snow:1.5) it put more attention on word snow and therefore higher probability to have snow in picture. It can also be used that way ((((snow))))
Not sure if you mean this, but probably yes. @lunar pier
yes ty 🙂
hey! you can also try this on https://discord.com/channels/1002292111942635562/1011743094309396631
Hi, can i combine both an image of mine and a prompt to create a new image or ideally an image using the pen of my first image?
I tried to make "Rinoa Heartilly nude" but Dreamstudio gave me a message saying "Something isn't quite right with your prompts". Are NSFW prompts prohibited or something?
can i use this bot for nsfw prompt?
Does anyone know how to change the pose of a character image without changing his/her outlook and style? Tried on Automattic 1111 img2img and controlnet but it doesn't work. It changes completely the outlook
Yall trying to use public bots for NSFW images... 👀
I'm talking about openoutpaint specifically, that's usually all I use
tiger
you cannot, please follow our https://discord.com/channels/1002292111942635562/1040299965647437905
Why not just use SD? It's inpainting is great, why reinvent the wheel? Inpaint Anything extension makes it really amazing too
From what i have seen and played with its not as great as some other ones
But not bad at all
Please, I love suggestions and need inpainting for my job, any suggestions?
What kind of suggestions are you looking for?
@potent spire said there were other inpainting extensions, personally i just use regular inpainting, or Inpaint Anything
I also use Adetailer
but I use inpainting for work
ah ok, not sure, those are what I use
Generative Fill but due to Firefly i think its not for commercial uses yet
Or Alpaca plugin if you have PS at all
Oh my bad, it doesnt really have advanced inpaint yet
How soon? 
Hi, who can suggest what models and promts are better to use to make 360 landscape panorama like Blockade Labs did? They're actually very good but now after they made subscription it's only 15 generations/per month if free (and 48$ for "unlim" is not that small amount).
hi, i am creating a lora adapter on top of stablecode-instruct and am running into a specific. is this the right place to find help?
do you want to just use or can you finetune?
To make 360 view (so you can sphere in Unity, put that as texture on it) add camera in sphere and "kinda" make 3d view. Like certain (CENSORED) Labs did.
i understand your usecase, have you tried stable diffusion or any other tool with those keywords added to your prompt?
is a1111 working with sdxl now?
anyone know what the best driver for the 4080 for training loras?
has been for weeks but there's a lot of people saying it don't. just go use it. it'll work you'll see
i think automatic had support for it during the beta period even if you used the dev branch
Whats the difference between the base and the refiner model?
tbh, i don't know. The refiner feels like a step child. it was the original version of SDXL but then they changed directions but decided to keep that original model as a second stage.
I honestly don't even think it's important for end results. It just complicates things for little benefit. Especially if you're upscaling, all the refined details it adds are just smoothed right out.
The Super Stage event for the SDXL 1.0 release, the devs have a big round table discussion about all the ways its not needed
i dont understand the refiner at all. It's purpose seems elusive
maybe midjourney uses something similar in their pipeline
I dont know why. Great results are had without it. MJ would probably rather maintain one model than two.
Model LatentLabs360 with promts to use it. But it wasn't 360 (borders didn't match).
No controlnet on a1111 or sdnext. Not a good thing.
most people aren't using controlnet workflows in comfy either.
for most reasons, a1111 works. i think the extension got updated by now too.
Not sure why not.
Controlnet is very good. It''s still only limited static posing, but we had a big step backwards when SDXL released
static posing as in it helps get images with objects and people posed in what looks like a static pose they could hold, rather than a frame of action from a real action
Hi i have a question. Can i use stable diffusion model in my app / platform
well, in this case i dont have a better tip than retry until it works. checkout https://civitai.com/ for alternatives
weird thing to say
sdxl is a leap
Hi
It's not a leap in controling poses without controlnet
"huge step back" is hyperbole. Maybe i'm just too right minded for this discussion.
It's not. We lost the control of controlnet, the ability to pose somewhat, to copy certain images in color and composition. We still aren't back to that with all the popular UI's. It's been a step back.
So MJ finally has inpaint too
SDXL is definitely a leap forward. It just doesn’t have all the tools 1.5 had months to acquire.
1.5 is a modified Toyota with a custom engine. SDXL is a stock corvette straight off the lot.
I kind of think that when people just refuse to learn and use the tools that are available, choosing to wait for someone else to make it available to them in their preferred form, that's more of an individual preference dillema than any kind of industry wide step backs.
It’s open source. Your basically going to lose familiar functionality between versions every time. The development of tools isn’t some centralized project.
On day 1 of sdxl, i saw people putting guidance onto a 1.5 latent, passing that off to the xl model after x number of steps, and cooking it into it's final form that way. A little janky but pretty great results still. Of course, i try to find that now and it's buried in the feed.
Is stable diffusion unstable with memory? I just got a bluescreen cause my memory filled up
sd 1.5
with that logic is some parts "botched"?
For SDXL? Likely not, I don’t suspect it will do anything worse than 1.5 save for resource management once the tools catch up.
Yes, whatever interface you use is up to you.
I suppose hmm
I use comfyUI but I recommend Auto1111 for beginners.
well I'm having issues with Auto1111 but I don't want to drop the tools it gives me
current version of auto1111 is known for havin some memory issues
I see
Auto1111 very recently got some extensions to help with SDXL, but as I don’t use it, I don’t have more exact info.
i feel like everytime i generate images for a while then close stable diffusion, i lose more space than i should
? I mean, my workflow saves three images per generation at high resolution. So hard drive space is eaten up, but I have so much I don’t notice.
you can look in stable-diffusion-webui\outputs\img2img-images and stable-diffusion-webui\outputs\txt2img-images
windows can be slow at updating space changes
I am having problems with my SDXL install. Is there a Tech Support channel somewhere on this server where I can ask my tech install questions? Thanks in advance.
question... does the model loaded changes CLIP or DeepBoorou interrogation?
this might be a beginner question, but is it possible to run the "dreamshaper" model in deforum stable diffusion?
I managed with warpfusion so I just assumed I would be able to in deforum too...?
Look in the list. There is tech-support
Is there a way to use the Portrait Depth Estimation feature, like in the Clip Drop API, through an existing web UI?
Why do I feel ashamed of using stable diffusion after being a few minutes with some acquaintances?
huh?
blud was doing a public event generating sexy girls 😭 💀
So MJ inpaint is kinda like DALL E
How do i use the bots to generate some images
I tutor. When the students are working on a question, I tell them I'm staring at their hands to make sure they're really humans.
where are the cards of all the bitcoin miners?
I see nothing on the market
best price for a 3090 is $700, is this a joke? like the housing market? will never crash?
buy them while u can,now with the ai craze it and when ai goes more mainstream prices will go up again
i bought them all
@ $525
what channels are these?
heya, I have the --no-half problem and all that stuff about nansexception
the thing is that all the tutorials to solve it are from the files folder and none of that has helped me because I use colab
Hi Everyone,
I recently started contributing to open-source projects, and I'm very eager to begin contributing to Stable . Any guidance or assistance would be greatly appreciated as I embark on this new journey. Thank you in advance for your support and insights!
Hi someone now, the size of the SDXL model (in Go) and the amount of ram memory required to run it ?
bitcoin uses dedicated asic hardware for mining, you cannot use GPUs or commercial grade cards, perhaps you must be refering to ETH or other crypto
Ya
I am not sure where to start looking for someone to do some NSFW commission work, any suggestions?
fiverr
Guys, I run automatic1111 with --xformers --api --medvram --no-half, on a 4070ti and get 1it/s on a batch of 4 512x512, that's not normal right?
Medvram on 4070ti ??
shouldnt be normal. What model are you using?
better @undone garden ask in #🤝|tech-support probably
Absolutereality. I'm on it
@undone garden I use it on a 3060ti 12gb with only --xformers and --api and I do around 6.25t/s
but not using no half.. so
hi guys, can someone explain to me how SD works? do I buy credits to generate the artwork? or is there a plan?
Welcome! You'll find everything in #1072220168534642768 & #1080946152318443610. Many ways to use SD depending on your preference.
Join us soon! https://discord.gg/cTxNV2HE?event=1143268524031234152
Am I able to have the controlnet options in stablediffusion just like in warpdiffusion? I am running on Google Colab Pro; not local
might be a stupid question....
hey there, what resolution should my images have to train a model in sdxl?, I don't use this since v1.5 and I don't know if it's still 512x512 
Is there anywhere I can find proper documentaion over the different options present in Stable Diffusion?
Or is that just not something available atm
ty guys for hosting this 🙏 my first time joining and very glad I did
happy you could make it! ❤️
Is there a link to the recording?
In what channel can I make the bot generate my ideas?
Is there a link to the recording?
Hey, I´m totally new to this. Where do I start to understand how creating these images work?
I'm wondering about this too, I was listening live but had to shift focus from time to time
https://github.com/ashawkey/stable-dreamfusion
anyone used this?
Does anyone know any web based Stable Diffusion image generators? someones is asking on another discord but i dont know of any
I'm actually doing web development, logo designing and I would love to help you out is anybody interested DM me
I missed it 😦
Hi Everyone, I made a CV video for myself with AI art. Enjoy watching..
https://www.youtube.com/watch?v=twxzOu0YDM8
A111 and SDNext, let's help them get Controlnet working, do what's good for the community
Heyo :)
I've had struggles with getting SD to add scars on my characters. Any hints on how to solve that?
How do I stop getting notifications from the "events" channel in this discord? I've already set the notifications to none, but I still get them anyway…
can i use stab
Options for what? Which UI?
whoops
is it possible to add the dream bot on my own ds server?
A1111. Honestly a good documentation would be cool but asking for too much. But I wanted to ask about the inpaint mask related options. Mask Blur. Masked Content. Inpaint Area
give @delicate oxide back owner
can i run stable diffusion with 4gb of ram?
Whether or not it's good is a subjective matter for you to decide, but https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features
goddang thanks, I tried their wiki link but didn't think it was hidden under the features list 😓
is the alpha mask option new?
and I saw a mention of a model made for sd2.0 but in a 512 base - so like, can sd2.0 different from sdxl?
im goin back to youtube tutorials
whats the best SD model to train a lora of a humanoid with a non-standard head/face
The bots don't allow for negative prompts, do they?
if any getting error running stable diffiusion just use "--skip-torch-cuda-test --precision full --no-half --no-progressbar-hiding --opt-channelslast"
hey anyone here can helo me?
i'l tryint os install a controlnet model but can't find the download button on the huggingface page 🥲
Okay, so, generally... why does stable diffusion refuse to generate scuba tanks? 1.5, XL... none of them do. lol
do they accept [] as deemphasis?
they do know negative
tab from your prompt, type "negative", you'll get choices, and one is negative prompt. Then you can type the negative prompt in that new box
Ive made two Textual inversions, now how do I get them both in a single render together? still dodgy?
Why can't I use public bots to make NSFW images?
They aren't allowed here, but what about on Dreamstudio.ai?
There are NSFW filters in cases-- I recommend checking it out yourself and or installing locally, as we don't support NSFW on the server and the bot here
How do I enable NSFW prompts on dreamstudio?
lmao
Use SD
How? Do I switch the model from SDXL to Stable Diffusion? Or do I need to go to another site?
You have to run it locally
the sites that let you create images are set so you can't change it
they aren't going to let people make NSFW - porn - stuff
NSFW is a bad name. If you worked at Coke a Cola, a Pepsi logo would be NSFW
How do I run it locally? Is there a download button?
Hi, I have downloaded SDXL and have ComfyUI set up, but I want to create some kind of digital art based of my own portrait like the following: https://imgur.com/a/T1XCXsw
Is it possible? It should still look like me, but have some of the digital art looks to it (like multiple colors etc) and not have a background (or solid color background I can easily remove in photoshop)
Do you guys support Easy Diffusion?
Late reply but thanks! 🙏
I would like to ask you, why I use controlnet to select my local image, will always generate no image, and then in the generated command can also see "width:0 height:1", regardless of whether I manually select the image ratio is so, the complete command I generated is "/dream prompt:UHD,highly detailed,professional,bokeh,chain saw man,senior high school student,asian,fighting_stance,white shirt,sneakers,tie,black jeans,blonde hair,tokyo,street,cinematic angle,(((full body))),rain style:Comic Book width:0 Height: 1 seed: 192892105 ", I downloaded to the local image, the url is https://i02piccdn.sogoucdn.com/a77f145ac4c1576a
Good morning ☀️😃
is it possible to add this bot to my own server or not yet?
👍
unfortunately, same here, did you manage to sort it out?
In case this is needed! https://youtu.be/X6kOgUJTePw @last parcel
👍
Invoices? Anyone getting invoices from Stability.ai please guys?
newbie here, where can I find a document or tutorial that show meaning of setting value? Like strength, CFG, noise. Thank you.
Hi. Are there channel moderators here?
I am looking the whole time for faqs where i can find the general prompting rules/infos. But under ressources and faqs its just so general. I want to upload a picture and the the picture should be modified by SDXL. Can you help me? 🙂
ugh, civitai is always having problems :/
I liked it before the redesign. The search bar seems annoying to me now.
is there a way to see what kinda added words are being add when you add a style using the bot?
Yes! Stability does it again!
Is there a setting between lowvram and medvram?
im not even sure if i can run sdxl local on my laptop
Its been a year already? God damn
i know lol
Here i was thinking its been like 7 months
its still absolutely wild to me how its only been 1 year since SD was released. Its ridiculous how quickly its been improving. The world of AI is just built different
I tried sdxl but 30min for a 1024x768 pic is ridiculous on lowvram mode
Honestly its scary how fast these ais are improving. Arent there text to video ais being worked on now?
Text to vid looks wonky now
yeah its still kinda mid
Maybe in 2 years
but its getting there
2 YEARS
lol
thats the equivalent of a decade in the world of ai
I remember seeing report about bad looking ai text to img pic some 1-2 years ago so I thought 2years would be a fair assumption
Text to video looks good on Pika but its not something to be jumping up and down about yet
It feels like a dream now, not very sharp or detailed
Like how you can see stuff in your mind but the details break ddown the closer you get
its getting close
I hope nvidia makes some cheap AI card for general consumers
We are moving at lightspeed when it comes to ai
i look forward to many of the online platforms only because we do not have the tech available at a reasonable price
lol nvidia wont be the one to do it
I really would like to see technology to be able to rewatch our dreams
amd is catching up to nvidia fast in terms of software support, any other competitors will be very close after them
im using an amd laptop but i heard that they can not run well atm
Unless amd or intel can somehow convince devs to make stuff for their hardware. I can only nvidia come out with a cheap AI card
"somehow convince them" bro Nvidia being ridiculously expensive is already doing all the convincing
nvidia just has a massive lead
its all about who controls the chips
now we see why the big uproar came about a few years ago when Trump was in office.
I wonder how more powerful pc can get when we are approaching 1nm
idk faster and more memory is more important for the most part when it comes to AI stuff
My air cooled ipad pro is already 2x more powerful than my workstation laptop from 2016
Does anyone here use the power of ai chat to help fine tune prompts?
i dont see many
Many are underestimating the power of them
I used bing once
bing is ok i think they use chatgpt right?
one thing i saw people do is use claude which has a massive 100k context window, dump in a file with a bunch of prompts and then ask away
Yeah, bing is just gpt underneath
Oooh id also like to know what ai they use for prompt help
Try bing, its free
Niccce okay. Thanks
I just tried that and its just giving me a copy pasted list of tips on how to structure my prompt. Not actually helping me
Whats claude?
stability has beluga 2
Hello everyone
How are you doing
quite good my fellow
wanted to flag this open letter from generative AI artists to policymakers, calling for them to have a seat at the policy table too -- read, sign, and share (privately) if you like: -- feel free to ping me with questions: https://forms.gle/5gHAUeyPztQKBA5G6
why would they be there?
Jolly, ol chap! And yourself on this fine day?
to ensure that their interests are represented - ie, the way they use and benefit from tools like Stable Diffusion
whats next a change.org petition?
i highly doubt a controversial community of "artists" would be invited there. Its between artists, developers and companies and policy makers (+ eventually AI expertes or whoever)
and the representative of Stability AI was at the hearing about 2 weeks ago or so?
and btw. your letter is set to fail imo because actual artists actually used AI before generative AI was a thing
Wait what
I thought Stable Diffusion existing for so long like many years???
It's just 1 year old?...
The perspective of people using a technology is no less important than the people making it. You wouldn’t make traffic laws by only consulting car makers.
Did I just witness the event of humanity history?
even if they succeed the us congress has no jurisdiction outside of the US,so idk how you will force ppl like for example in asia to stop posting or training models
and come one, why do those AI "artists" always blaming the corporations, its them that are aligning with companies like Stability AI itself
plus THEY are the problem as well
come on*
thats also a thing, yes
How do I turn off getting the red 1 in the corner whenever a random discord server wants to @ everyone? I turned off notifications and @pings but it doesnt stop the red 1
its also up to the providers like Artstation etc. to take a stance
they could but sites like pixiv are japanese,unless the japanese also sign a similar law,just the us passing a law wouldnt be enough,and even if both countries pass it ppl can still train and post models on other image sites that are located in south east asia or china etc..
yes
but it can make a significant difference
its not about the "the cat is out of the bag" crap, because the technology wont just disappear, but there are definitive things that can be done to at least reduce the damage
I don’t think draconian laws on model training are going to reduce the damage at all. If anything, they will make it worse. If AI is actually going to make manual art obsolete (which it won’t), you want the models as open as possible so that people can adapt instead of die.
Anyone know a way to make specific words 'legible' in a prompt for a logo?
It does a pretty good job at coming up with the unique images, but specific words requested from the prompt are not legible. It might not be possible but I thought I'd ask 🙂
disagree there
Draconian laws just mean that the largest IP holders replace their own artists with AI reaping massive rewards in throughput and efficiency. Everyone else is left out in the cold.
they can try to ban it , just like when they tried to ban torrent and software piracy, are both of those things dead?
wont work that way
Artists still lose. Artists lose more. Everyone else also loses.
they wont try to ban the technology
its about regulating
both from state as well as sites
I trained my first decent Lora today but something strange happened. It started great, I got perfect portraits with no prompt at all apart from the Lora, batches of 8 were 90% workable or so, it stayed consistent even with longer prompts and inpainting. Then I messed around with different Loras and models, different prompts, and one hour later when I went back to trying that Lora it was as of it was broken
A no prompt batch of 8 gave me botched faces, full body shots, weird positions, a wall a cat and a bus
well they can try just like when tried to regulate ppl copying movies and music
put one of the good images in png info or drag it into comfy to get the exact settings you used for generating that image. maybe you are missing something
I think the current system where training isn’t subject to copyright restrictions but the outputs are also uncopyrightable is quite good. It limits the ability of AI to compete using scraped data, but it ensures easy access to it as a tool.
If I forced it through prompts I started getting decent portraits but nowhere near the quality
The outputs arent copyrightable?
yes
they are if u can afford to go to court and prove u made it
Is it possible that I should maybe reset the webui every now and then with a new console and everything to "flush the vram" or something? I'm pretty sure the settings were the same
If your taking about the court case that happened a day ago or so, they rules that the AI itself cant copyright it.
Not unless they are edited or composed or something else beyond just the initial output.
because you din't make a " great picture of a landscape", you just gave it into production by a programm. You didn't make it
if i print it and then paint on top of it while recording the whole process i guess you could win in court
Did you mess with the clipskip setting?
you can try it. But try my suggestion first. Would be way easier
appreciate the back and forth, and if the letter ain't right for you, nbd. I'll just add there are tons of precedents of grassroots efforts making a difference in internet policy - e.g. net neutrality, and stopping harmful copyright legislation like SOPA.
Frankly you could win with less than that. The threshold of where it becomes yours is still somewhat nebulous.
yet you can effectively do something
But why would you want to?
i just saw, Artstation finally at least made the filter available to exclude AI generated stuff from their marketplace
to waste more tax money on bureucracy
Wonder if a lora is copyrightable, i realize the licenses and such might get in the way but its technically your own creation, you chose how to train it.
you ask me why i want to combat cancerous/malicious usage of generative AI?
but the image it creates then is not again
No, I am wondering why you want to do it in a way that will barely affect the malicious usage but would cause a large number of unintended effects.
Maybe if it its way over trained to only create 1 image? 😆
this, 1gb lora that can only produce 1 pic
Well the issue with things like suddenly making copyright/training regulations after the major companies have done it is making fees and various hoops to jump through then becomes anti competitive for the more open source AI models, for big companies they could even campaign hard for draconian laws because anything like that would just be an overhead to them like an electric bill, but also a handy barrier to keep out anyone else.
You can probably make a better case for controlnets and similar techniques. You have to edit the AI image, but I wonder if the editing can take place before generation.
Exactly this.
I thought by now we'd have celeb likeness's being officially licensed in loras and the like
We do not want AI to be monopolized by only the existing IP holders. That is a great way to get all the downsides of the technology and none of the potential upsides.
exhibit a for me of this kind of "big company does good thing for really bad reason" is amazon campaigning for an increase in a minimum wage. I'm all for an increase in minimum wages for example, but they are because 1) thats pennies to them 2) thats a whole lot of money to mom and pop stores.
which sounds good to me from my perspective
in what way exactly?
even if u guys pass that law,it will probably benefit our ai companies in asia,since their competitors will be more restricted in the things they can use for trainin so i guess go for it
This is the thing, sometimes a good can be done because it consolidates someone elses grip on something.
What do you see as the downsides of the use of AI? I almost guarantee that a few corporations monopolizing them do not remove most of them. It certainly constrains other people’s ability to benefit.
and thats another issue. its why you have people against reducing emissions to the point of denying its an issue. its fine to say in one half of the world "nah this is actually a bit bad, luckily we're in a good place now." but lol if you expect other nations to just lag behind you because YOU decided to add in regulations or say industrialization is a terrible idea. you just cant expect other countries to give up advantages.
yea while u kill each other over dumb things there will always be other ppl ready to pick up the scraps
the largest one to me, is completely monopolizing opinion on the internet. if people start asking AI over checking out websites what the best car for them is, and honda paid the most, 90% of those people will barely hear of other cars.
and think about political campaigns. theres like 15 billion with a B put into advertising with the media by politicians. if you can essentially bid for the AI's favour, everyone else is screwed.
downsides of using AI? There are several ones but thats not what i have issues in general with. Or do you ask me in general about AI and not restricted to generative AI?
For example, artists losing their jobs. That will still happen because the companies will no longer have to hire artists, nor anyone big enough to license the models. Even independent artists lose under a glut of mass produced work. If the AI is relatively freely available, anyone can add it to a workflow and compete if that is what is necessary.
We are talking about generative AI here.
artists will only lose their jobs when AI becomes superintelligent
above human level
anc conscious
and
I agree, but some will lose them before that. It is a good example for illustration, but if you have other issues, state them.
you cant make AI do certain things that all the elite level artists can do unless that AI gets to human level and above
what? no! you can pad out 90% of a fulltime job with AI and just hire people for day gigs to do the other 10%.
thats literally one of the biggest issues the writers strike is over.
@tough bough one of my issues with generative AI is more like what people do and are potentially or actually allowed to do with it. Generative AI as technology isnt something i want to get rid off
Not really. It is over streaming residuals. AI just got tacked on at the end.
for example people spamming generative AI on artists platforms
yes, i dont consider AI art people as artists
and never will unless certain circumstances
I said one of. 🙂 But part of the residuals hollywood accounting issue is breaking a job for 1 fulltime person into 10 day gigs with no major contribution. if you can replace even 1/3 of that with AI thats a lot of real jobs gone and turns artists into zero hour contract workers hanging around outside supermarkets for someone to pick them up in a van in the mornings for work. 🙂
It would still happen whether there is a monopoly or not. It is more of a platform issue than a societal legal issue.
in a authoritarian state that would be different
but yes
it took Epic Games long enough to filter AI stuff on their platform
what i think should be done by the policy makers is to let people have to pay licenses for LLM for example
just like Sam Altman proposed
Pay 2 Win
It would have to be a totalitarian state, more likely.
Anyway, you would be as likely to see that under a monopolistic outcome as not. If the platform decides that they like their AI, it will be all you see with no recourse for seeing anything else.
i would rather have a monopoly by certain companies/corporations at this point
That falls foul of original position fallacy. You are just as likely to get an authoritarian state that disagrees with you and suppresses your opinions as one that agrees and elevates them.
does chargin ppl for software stopped software piracy?
we are speaking about theoretical or ideal stuff anyway
reality is more complex ofc
making things marginally less painful to pay for than jump through hoops to download stops piracy
nope
Netflix Technical Director, AI/ML - R&D Tech Lab <--- look this up lol. Hell, apply if it floats your boat. But saying that AI is just something thats tacked on and not an existential threat is silly.
@wild steppe Wow! I totally overlooked this! The main thing that makes me sad about ChatGpt and Claude is the censorship, I can not even make a cheetah hunting a gazelle because it is deemed as harmful.
it'll probably makes games better and movies worse for a long time. like cgi initially 🙂
generative AI?
As soon as I am home I will be trying stability chat, I would love to test out how my prompts run on there.
I really feel like many underestimate the power of the chat bot and its ability to be an extension of your brain personally
Each person working with and training their bot will have it specifically tuned for them and their needs.
you might be interested in looking up AI alignment/superalignment. censorship isn't quite the correct word and has too much placard waving and colour wearing baggage attached to make arguments about.
When the LLMs want to be humanized and you can not ask the AI to create a prompt based around a cheetah hunting a gazelle we have an issue with how the algorithms are coded. To even consider that a roadblock is not a very good look on the behalf of the company itself. @outer crescent I understand what you mean but this is logical for any ai chat that wants to have long term success.
If this is the case we are not being scaled up, We are being hindered and will be moving backwards or wherever the coders of this algorithm will allow you to go.
The success of AI coexistence with humanity will teeter on our ability to be free while using this technology.
Netflix has been using AI for classifier and recommendation engines for a long long while. It's how they got big before the website. They even held a contest for it.
oh this is their games division specifically after buying up some studios and its specifically to work with ongoing advances in AI/ML. theres been machine learning everywhere a long long time 🙂
Man, crazy to think that Stable Foundation has only 1 year of life
Hello, May I ask how do I optimize my 4090 on windows 10 to get as many it/s as possible?
been playing with it for over 1 week now and still getting 5 it/s
my laptop with 3060 mobile gets close to that
What are my options on amd windows if I have a ton of ram but not enough vram for comfyui controlnet
hello, I need some help plz
is there a way to tell the AI to not touch a specific area in case of doing image to image ?
for example: not changing or touching the head even slightly, not changing it at all, but the rest is ok to be changed
or not changing/touching the sword
like fixing a specfic area of the image
you might call it "reverse inpating"; changing everything except a specifc area
ya thats still inpainting
wait, really ?
oh wait i misunderstood
you could probably crop your image and then outpaint
outpainting is exactly what you described
hmm, I don't want the image to drastically change
in case of image to image, I want to fix/lock (making it not change) a specifc area, but the rest can be changed slightly according to how much I set "image strength" (%)
yo guys, i am new to this stuff, while installing that web-ui.bat file, my pc crashes. Is it a problem with my files or is my pc weak to handle it. (there are other issues sometimes as well where it says git bash could not be foudn etc.)( PC specs, i7 3770k, 12 gbs of ram, gtx 1650 4 gb)
lmao can we start a petition to bring stable diffusion to geforce now
Do I need to be expert at english in order to type some prompt for my checkpoint?
I am not good at describe some people like policeman in usa usually do
wait, i need to have amd for that?
does anyone know why when I want to download an image I get it in webb format and not in png format?
No, just type in words
Ex: a man sitting on a bench the beach at noon could be written as:” 1man, sitting on bench, beach, noon,”
Just do an image search or use snip tool
when I generate an image and try to download it I get it in webp format
anyone know what's the difference control-Lora and controlnet sdxl 1.0?
Yes its still inpainting. But you mask the part you dont want to change and set to inpaint not masked
Hey, you if you have only 8gb of RAM that's a common error
Can ne fixed by adjusting the windows pagefile.
For more help ask in #🤝|tech-support
Hi, I want to buy large amounts of credit, it only allows for $1000 increments. That's a no-go. Who can I speak to about this?
hi is there any way to generate img2img in this discord bot
(I hope this is the appropriate channel, there's a lot) Hey everyone, I've gotten to the point where it's about time to upgrade my PC again, and I'm wanting to take SD into consideration for my parts. Specifically for the GPUs I'm wondering a bit about what would be sensible. 3090s (24GB) are going for 650-700 euros used in my area right now, looking around a bit that seems like a pretty good bang for buck and I'm not that scared of used cards. Any opinions on that?
hello everyone
is rtx 4090 good for stable diffusion ?
i have budget of buying rtx 4090 but is there any other good choice ?
i am going to install this gpu in my server and primary intention is deep learning and no gaming at all
AUTOMATIC1111 1.6.0-RC just released
Has a crap-ton of new features.
Anybody tried it yet?
Basically higher vram = better
then wouldnt 2 3060 be enough ?
Hi guys how are you all?
I have question
why my generated picture is not clear? like if i create picture it will come up uncomplete like his/her hands are uncomplete facelook uncomplete or strange, i don't why its happend. anyone knows?
hi guys, is it possible to generate seamless images through the bot?
@cobalt tulip cant it be missing VAE?
I dont understand what you say
check what VAE is, or post your picture, what issue it has.
What is the best model at the moment to generate UI / UX designs?
what is the best img2img model for colorizing pictures?
if you install locally, the generated image is in the webui folder inside the output/text 2 img-images folder
it saves everything you generate
guys why is sd 1.5 inpainting not giving nsfw?
depending on model
I mean I had an sfw image and wanted outpaint the clothes but it just puts another set on clothes
if all of those can do nsfw, just type in naked or something
my prompt:Naked,human body,no clothes
try (naked:1.5)
dude, take a look at civitai, most of them are just nsfw
ye
try with no lora
let me
how so, how is a1111 bad
automatic 1111 is so bad man
after the sdxl integration update they broke it
Welcome to the world of Generative AI
Use colab lamo
my pc cant do SDXL fast enough, 30 min for a 1024x768
i7 6820hq, 16gb ram, quadro m3000m 4gb vram, very old by pc standard
Use google colab
they offer a 16gb gpu for free
scroll down
and click the "open in colab button"
does google colab even allow nsfw
you just need 4gb vram to run locally, what gpu do you have
Basically the processing would be done in google's supercomputers
also make sure once you are in the webui
I dont reccomend comfy to beginners
make sure the resolution is 1024x1024
click the play icon on
looks like it have 8gb vram?
Bro why is sdxl so hesitant to generating nsfw!
alright back
just slap some embeddings and ((nsfw)) onto the negative prompt
Ive been able to install a standalone version of SDXL in the cloud with the text to video extension on a dedicated cloud GPU (a6000 in my case). ive also been able to integrate this with twitter's api and automate a process. if anyone is trying to do anything similar, reach out and i can help.
post your result here man
I have seen stability.ai took a page out of midjourney's book
and made sdxl less realistic and more dependent on cgi
They really want to beat midjourney
Hello everyone. Could anyone please tell me if there's any way to upload loads of pictures of me, make a model of me, and prompt the model so I can generate ultrarealistic pictures of me? If so, please tell me how.
search how to make a lora
runwayml offers 1 free plan
to make a model on yourself
What's the best way to access Stable Diffusion that allows you to use a lora online?
Thanks a lot
Good mornin', everyone! How are we all today?
Pretty okay, working and recovering from a wesp sting on my throat lol
How about you?
Ugh! Yeah, I'm still recovering from being sick, rip!
Haha, but doing better
Good recovery!
I didnt see the sting coming
I thought a vene is "dead" lol
Wasps are not on my friends lists
R i p allergies
Is there a limit to how many images you should use to train loras? Because I'm about to go all in on this mf with thousands of faces autocropped from various videos
Hello, does anybody know the steps to get automatic1111 to work with a self-signed certificate, and then be available through a No-IP dynamic dns domain and have SSL?
I need to know how to properly generate the certificate, how to configure the COMMANDLINE_ARGS for it to work.
I have attempted it a couple of times, I get certificate errors, automatic1111 straight up crashes, it says that my dns name does not match "localhost", etc.
it is normal that generated images can only be downloaded in webp ?
Do you have "webp" set in Settings -> File format for images ?
Oh and you're running automatic1111 right?
okok i'm going to automatic1111 i'll come back if have more questions, thanks for help !
hello everyone
you can download them from the disco directly
is it possible set how much strenght the inpainting will be? so I can retain some of the old image (the area that will be inpainted) look?
denoise value (higher value means more changes)
is there a way to point towards a directory for my models
i have them all saved in a different location
if so please someone point me in the right direction
Yes there is the --ckpt-dir COMMANDLINE arg
For the webui-user.bat
any great tools to get started with stable diffusion you all recommend, either generating or training,
So am watching those corridor crew videos on youtube where they make the animes using SD and a bunch of other tools, they keep mentioning "reverse noise" or something, describing it as the noise not being generated by a seed but instead being generated by the input image used for img2img. Can someone describe what that is exactly or how to use it?
face palm now i see the tools channeL:)
Does anyone have the upscale 4x-ultramix balanced?
thank you man
is there a similar config for loras, embeddings and VAEs?
nvm found it, @rotund wharf you should also bookmark this page
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings
is there a way to perfectly swap faces of same styles ?
like swapping human to human, or 3d to 3d, 2d to 2d etc
Just want to share a thought. When I visit civitai or huggingface and think about downloading one of the new sdxl models, I constantly think to myself "how do I know SDXL base can't do that already?". It's a good problem to have..but hard to tell without A/B comparisons.
Hey everyone, looking for places to get some exposure for my idea, to get some opinions/advice/questions. It’s a Transformer architecture, that is faster, cheaper and performs better. You can even get 100K+ sequence length on one Colab GPU. https://www.reddit.com/r/MachineLearning/comments/15zzft9/r_elita_lineartime_attention_done_right/
Quick question. I have SDXL model in automatic1111 based on the github model provided. As well as I used the sample on clipart. It seem like clipart produces much better quality results? What could be the explenation for the difference in quality? Is the model I am running locally not working the same as the model on clipa
In Google Collab which GPU accelerator is meant to be used for image generation? TPU or T4?
Does anyone know of an existing workflow to apply hi-res fix to a number of images after the fact? I'm currently passing them through the PNG info tab in auto1111, sending the parameters / prompt to txt2img. I have comfyui but still learning.
trying to figure out a batch solution
Is stable diffusion censored?
definitely not
you have to be specific with your prompts if you want to add or avoid nsfw images
It's not creating a few images ,like a car crash
Maybe I have to work on prompt writing
that could just be your model or your prompts. Not every model recognizes every prompt
Yeah, thanks
ive seen some models have an excel spreadsheet with the prompts it recognizes, and theres a tag autocomplete extension that helps with that
And what is seed?
ever played minecraft?
A Lil bit
think of it like the seed of the world
Idk abt that sry,
if everything including the seed stays the same, the image will generate the exact same. if you have "-1" as the seed its random within the bounds of the prompt
Ohk
No
you can also keep the same seed and change the prompts to adjust the image, as long as they stay mostly the same
So the more seeds , the more random img from prompt will generate
each image has its own seed
its either randomly generated when you generate the image or you pick a specific one(which i wouldnt recommend unless youre redoing something)
There still a random seed ever seed has defined image
For randomness try playing with the guidance scale
each image only has one seed, unique to that specific image and those prompts and model
Got it guys, thankyou
Tpu doesn't support stable diffusion
you dont really need to worry about it, id say just leave it at "-1" and let it work
Ok
I'm not sure if this is a good place to ask, but does anybody know some sort of ai voice over program? Like where I enter words and it'll read it back to me.
ive tried some basic programs online but they tend to be pretty bad in my experience
You don't really need to write long prompts
Unless you want absolute control over your images
@clever shore
Yeah, I tried a few prompts with 3-4 words and it works pretty well
Python is general language
Learning that would make you a ml engineer
But libraries like pytorch and tensorflow will
Numpy,panda
C
For basics
I'll delete this conversation because it is not related to this group
Java,cpp in my opinon are not good
Does anyone know why when I generate an image in discord and when I go to download it comes out in webp format?
Quick question. I have SDXL model in automatic1111 based on the github model provided. As well as I used the sample on clipart. It seem like clipart produces much better quality results? What could be the explenation for the difference in quality? Is the model I am running locally not working the same as the model on clipa or their playground in terms of quality.
hi does anyone know how to load the civit ai lora model in python code?
for example i want to load this
https://civitai.com/models/112902/dreamshaper-xl10
i tried
pipe.load_lora_weights("./dreamshaperXL10_alpha2Xl10.safetensors")
but it returns
ValueError: None does not seem to be in the correct format expected by LoRA or Custom Diffusion training.
I am trying to find my dream post forever still cannot fjnd it
Good morning
I am looking for a French community on Stable diffusion to answer some of the questions I have about using the software.
My level of English is not top, it would be easier for me to exchange in French.
Do you know of such a community?
Where does it exist on this discord?
Thanking you
bonjour haimday, je suis Aizen, tout d'abord, qu'entendez-vous par "communauté", votre communauté ou cette communauté
Une communauté au sens large.
Celle-ci m'irait très bien si je peux échanger en français de temps en temps.
Ok, donc la communauté de diffusion stable existe pour fournir des connaissances sur l'outil génératif open source, sur la façon de l'installer, de l'exécuter, de demander des conseils et plus encore ! Pardonnez le français, j'avais l'habitude de sauter mes cours de français au lycée
ah comme moi avec les cours d'anglais ^^
Ah, de toute façon, si tu as d'autres questions, n'hésite pas
ok merci
aucun problème
its a pretty quick read
The Clipdrop website does not provide information on whether purchasing a Clipdrop Pro subscription gives users commercial rights to the art they generate during their subscription. However, Clipdrop empowers teams of all scales to create better visual content for a fraction of the time and cost
The Pro subscription includes features such as Stable Diffusion XL, which generates high-resolution realistic images with AI, and Uncrop, which allows users to uncrop photos to any image format
The Clipdrop API also allows users to integrate best-in-class AI to their apps in minute
While Clipdrop is an AI-powered collection of tools for image editing and generation, it is unclear whether users can sell or auction the art they generate using Clipdrop Pro
But according to US
If you just modify the picture even with 1 pixel its yours to keep
also you can always run local for free......
ahem lets assume you dont have a powerful pc
then it wont work
but google offers you a 16gb vram gpu for free
everyday for 12 hours
@leaden vessel refer to this ridiculously long chat--->#🏞|general-with-images message
Apologies if this has already been posted
https://www.theregister.com/2023/08/24/hugging_face_big_tech_investment/
$235M is peanuts
for traning LlaMa-2 meta already spent 6 millons
Openai got pumped $10billon
Google has plans to pump $100 billon into generative Ai
Except webui now violates the TOS. Paid tier is fine though. https://www.reddit.com/r/StableDiffusion/comments/12t8tc7/is_colab_going_to_start_banning_people_who_use_it/
Warning: Google only bans Webui which have webui in the name
however Hugging Face is a Repository and isn't training its own models so comparisons with money spent on Dev isnt an apples vs apples comparison 🙂
Understandable point
Ok but I wouldn't risk my Google account over that amount of money
you can always make another
besides runpod,paperspace,kaggle
besides they only ban things like detected nsfw images,nsfw chat models like pygmalion
because if they were to ban fucking everything
than colab would be point less
Good morning, everyone! I hope you're all doing well!
Good morning? man its night in asia
When I'm training a Lora, I'm keeping the portraits at 512x512 but what about the full body shots? Should I resize them to something like 300x512?
Does it make the difference by itself as empty space or will it go apeshit
I picked up one of those Coral USB TPU accelerators
is there any chance of running stable diffusion through it?
hey guys, can you use temporalkit/warpfusion in comfyUI?
Hello everyone, I can't get it worked
hello guys, I have one question. How can I integrate a Space of huggingface on my website?
hey guys, i'm trying to use control net to upscale a image via tiles and i'm getting the "CUDA out of memory. Tried to allocate 42.00 MiB (GPU 0; 4.00 GiB total capacity; 2.48 GiB already allocated; 0 bytes free; 2.56 GiB reserved in total by PyTorch)" message even with the Low VRAM option ennabled.
I'm running in a laptop with NVIDIA GeForce 1650 with Max-Q Design.
My command line args are the following: --lowvram --xformers --no-half --disable-nan-check --upcast-sampling.
Am i missing something or my system really can't run this upscalling?
(i'm using the automatic1111 webui)
it has to load the control net model in addition to everything else into memory. Seems like 4GB is really too low for that. Can't you do it in two seperate steps?
maybe google colab is a good option then?
Is it possible to control the mouth gestures of a generated image?
I think thhe current version for 1.5 has full face control
Could you please explain how it's done?
https://www.youtube.com/watch?v=GF2vIgyn4Qo you can watch it here 😉
Is it possible to control teeth and lip position?
hello
Anyone in here that can hop in vc to help me out real q
I am getting images do not match error but the images are 1 to 1 the same size?
API error: POST: http://127.0.0.1:7860/sdapi/v1/img2img {'error': 'ValueError', 'detail': '', 'body': '', 'errors': 'images do not match'
with that you can pose it yourself https://www.youtube.com/watch?v=vvrH1Lcx-jo . but I don't know if it's updated to control the face or just the body
pls?
Hello I would like to know if you have any advice for me to optimize this prompt
A captive galaxy floats within a sphere of red and silver-tinted glass, engulfed by an enigmatic blue mist. In the background, an apocalyptic cyberpunk world takes shape, with dragon silhouettes as well as wandering zombies adding a captivating touch. This fusion of a galaxy and post-apocalyptic horror is captured with hyper-realistic precision.
which best version of python is recommended?
there is this thing called ultimate sd upscaler. it is nicer to use than control net default upscale. perhaps you should try it
use inpainting
What?
SD doesn’t do compositions very well
just paint in where you want your object to be
first you make a sphere then inpaint a galaxy then some blue mist then the other stuff
Ok thanks you
Is there any way to generate a picture of a model biting their lip? It seems to be a complex task for AI similar to what generating realistic hands used to be
Generate realístic hands still complicated in very specific ia images.
that one is easier than getting perfect hands
Anyone have an animatediff workflow to share for comfy?
Obv. There's no such thing, it's subjective
Do you know how it could be done?
something like portrait,face only,bitting own lip.
Yes, it's like a selfie so you can see the face, neck, shoulders and upper torso
upper body works too and u could add full body to negative
Could you please explain step by step? I've only been using Stable Diffusion for 2 days
try asking in #🤝|tech-support
i would show u an example but im not on personal computer rn
Take your time bro, there's no rush. Feel free to ping me when you are ready.
What version of Stable Diffusion are you looking to use?
Hi.
Is there a prompt to tell to the ai to think outside of the box ? To have some creativity and add stuff we did not necessarily think about ?
Please :3
Try turning up the temperature and down the guidance scale it will be more random and creative but it is not recommened
is it possible to do img2img with sdxl and comfyui?
how do I switch my pipeline in VLAD1111 from backend to the original pipeline again? I've been trying to get VLAD working again but I can't for the life of me get things to work since I tried messing around with SDXL
I thought it was in settings, select ORIGINAL
However, Im having an issue with VLAD that all the menus are doubled. For example in settings, it shows Original, Original and Diffusers, Diffusers
When I try inpaint, it gives an error
others claim it isn't happening to them, but I've tried fresh install and even install on another computer. Vlad right now is not working
For stable diffusion, can I give a color prompy with HEX Code Color?
like...
#204d52 hair color, 1man, male
basically stable horde?
This is a discord robot of sd-webui, which supports roop, controlnet and other plug-ins.
Support multiple sd-webui node deployment
so it uses a cluster of gpus?