#💬|general-chat
1 messages · Page 57 of 1
Seems like the first time i ever used stable diffusion to me personally
Hi! I think I asked this yersterday, if there's a way to paint lineart images with SD in google colab?

If Colab has Controlnet, then yes. I dont use colab so not sure but if it does you can use this as an example https://www.reddit.com/r/StableDiffusion/comments/11cukhp/mlsd_controlnet_preprocessor_options/
Yes the one i'm using have controlnet, I saw the inpaint options, but usually i only get the same image (not painted)
for example, I want to paint a character with a certain coloring style
but it doesn't work
use your line drawing for the contronet section. Above in the normal img2img section insert an image that has the colours you want to use
the style will mostly come from your prompt
But how the ai will know if i want a certain color in the hair
from your prompt
hm, ok I'll try. Thank you very much!
How many max steps do i need to train properly a model?
hola
s
How to create image?
Is there are reason for all these deleted channels? I am new here
I don't see any deleted channels and I joined today
Can you see all the channels in art-movements?
This post in particular just shows a lot of deleted channels for me
Yeah I see all of em
Weird, for me all of them are deleted except for the anime one and the modernism channel
Here, you'll find a plethora of threads awaiting your art submissions. Unleash your inner artist within our new forums and share your creative endeavors with our stable community!
Welcome to Dreamer Communities!
Anime
anime - dreams of a world where animation transcends its limits and becomes an immersive experience that offers an escape from reality!
Art Movements
art-movements - dreams of vibrant and dynamic art movements, a celebration of individuality and artistic expression!
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
modernism
#deleted-channel
#deleted-channel
Humans & Animals
humans-animals - dreams of character creations, be it the complexities of humanity to the wildest creatures of your dreams!
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
Landscapes & Buildings
landscapes-buildings - dreams of the harmonious balance between the beauty of nature and man-made structures!
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
Objects & Vehicles
objects-vehicles - dreams of three dimensional objects that evoke feelings of nostalgia, comfort or joy, and can serve as reminders of the people, places, and memories!
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
Genre & Styles
genre-styles - dreams of the whimsical world of cartoons, from the mystical realm of fantasy to the spine-chilling genre of horror, from the stunning art of photography to the trippy realm of psychedelia!
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
#deleted-channel
Send exceptional posts into Starboard!
Have you heard of our Starboard? It's where we collect all of the best posts in our server, and it's curated by the community!
I cant share images so I copy pasted the text
skill issue
good morning
you must be using some kind of discord extension that keeps it's own server logs or something. the server got reorganized a while back to threads instead of chcannels
better discord often just screws up discord. there are others and they're often malicious and send your login tokens to other users.
Don't run scripts on your discord client. it's a really bad idea.
I dont use a special discord client though
those channels are long gone. They're threads now
if your client is remembering them, it's modified
yeah. modifying discord often breaks it. i said that
if you can not see this message then I guess there is something wrong with my client
I did not modify my discord though?
i see that message. all those link to threadded topics. not channels. The channels are long long gone
Weird
discord is definately broken in a weird way where it's remembering shit it shouldnt'. that sounds like it was modified to me. friend did it for you. you did it months aog and forgot. Or you just don't want ot admit it because its against tos. i don't see how discord could believe they were still channels other than it was modified to see stuff that moderators had changed and other hacky discord stuff.
as a fix i would uninstall discord 100%, delete all registry keys and folders and anything associated with it, then start over
I'll look into it later, I feel like there might be some other solutions since neither of your described scenarios are applicable to me
I have a small question to ask
why this generating speed seems very slow. the generating speed of my computer is 3.59it/s,(4080 laptop)
it's definitely client side for you. try force restarting with CTRL + R and see if that fixes it!
Thanks that fixed it
Glad to hear it! 
is stable diffusion xl free from copyright issue? (like gettyimages watermark)
1.5 is even if your creations are transformative works
wow. i assumed that restarting it had been tried already. mb
civitai is down?
Yes
Do you know When can it be restored?
I have no idea, but it keeps getting shut down every now and then so next time it goes up just download as much as you can
@marsh hareok ,thank you
this is their latest news, about 4 hours ago, an announcement on their discord :
We've identified the issue: Our database replicas are failing every 6 hours or so, and we're not 100% sure why. We're doing our best with our limited knowledge of Postgres but its slow going. To that end we're currently looking for a Postgres expert to consult with us to help get some longer term fixes in place. If you or someone you know is interested, apply here:
We have what we think is a more stable solution going out now, (Thanks 💬︱general for the feedback!) no eta on that yet but hopefully not more than 4 hours.
As always we appreciate your patience and understanding as we work to get your waifu's available once more ❤️
I'm quite excited for it, and looking for any little news on it, but nothing more for now
(well, civit is back down already)
where can i create my images
Anyone here versed in python/pytorch that can help a brutha out?
why do people use civitai? its so hard to search for anything and it mostly just shows images of the same style
and every category is just flooded with pictures of girls
venv "C:\Users\Mooi Kin Heng\stable-diffusion-webui\venv\Scripts\Python.exe"
Python 3.10.6 (tags/v3.10.6:9c7b4bd, Aug 1 2022, 21:53:49) [MSC v.1932 64 bit (AMD64)]
Commit hash: 22bcc7be428c94e9408f589966c2040187245d81
Traceback (most recent call last):
File "C:\Users\Mooi Kin Heng\stable-diffusion-webui\launch.py", line 355, in <module>
prepare_environment()
File "C:\Users\Mooi Kin Heng\stable-diffusion-webui\launch.py", line 260, in prepare_environment
run_python("import torch; assert torch.cuda.is_available(), 'Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check'")
File "C:\Users\Mooi Kin Heng\stable-diffusion-webui\launch.py", line 121, in run_python
return run(f'"{python}" -c "{code}"', desc, errdesc)
File "C:\Users\Mooi Kin Heng\stable-diffusion-webui\launch.py", line 97, in run
raise RuntimeError(message)
RuntimeError: Error running command.
Command: "C:\Users\Mooi Kin Heng\stable-diffusion-webui\venv\Scripts\python.exe" -c "import torch; assert torch.cuda.is_available(), 'Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check'"
Error code: 1
stdout: <empty>
stderr: Traceback (most recent call last):
File "<string>", line 1, in <module>
AssertionError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check
Press any key to continue . . .
hi anyone know this issue? need help
i'm thinking about time to create a new alt civitai. or people don't see the need to upload to civitai anymore. images too easy to create on your own? versus seeing what others are generating?
you have an nvidia card right? @lapis nova
it's not easy to create on your own and there's plenty on civit, but the big names have it all handled @weak kelp
fair but the content seems to have become stale on civit
A bit of their own fault cause they changed their TOS to be harmful to creators
oh. didn't realize that
yeah how to fx this problem
did it work before or this your first time using it?
this is my first time download
Hi guys,
Is anyone experiencing this when trying to login dreamstudio?
Hmm, looks like logging in didn't work. Please try again.
so python installed and then you ran webui-user.bat? Assuming you're using automatic
Tell me, in which folder to download negative hints from sites? For example https://huggingface.co/embed/EasyNegative/tree/main
Stable-diffusion-webui/embeddings
Thank you.
@warm juncoI downloaded and saw that one model was added. But where is it? How to use it?
You can click on the third button below the Generate button. There you find embeddings, loras etc, then select the embedding
Found in textual inversion.
Guys, I post pictures of anime/game characters on twitter. I don't do nsfw and I don't promote any products. In short, is it possible to file a lawsuit against me over this? I don't have many followers right now, but who will know in the future...
No, if its not against the ToS of twitter then your safe
I've been researching a lot about copyright, but this is a headache
I'd still be careful if, say, you trained a lora on some very recognizable style and are posting images that some reverse image search files as that artist... 
can i write prompts in google sheets and generate images there instead?
is it possible to link google sheet and stable diffusion tgher?
wow that's not a sentence I thought I'd ever see
I guess the correct questions is.... why?
also l0l at discord's sad attempts to get people to buy nitro.
literally effects that we've had since the days of msn messenger
i saw those habit tracker thing template, random thought of generating images for visuals maybe characters to "lvl up"
just a random thought
Am I the only one who can't access civitai now?
I'm getting in. Quite slow tho
Anyone can file a lawsuit against anyone - it’s a matter of whether they have a potentially winnable case. It’s pretty costly to file a lawsuit you won’t win
could someone pls share a bit of info about models with me? i am currently using v1-5-pruned-emaonly and wondering how to find new models and how to choose the best models.
Hey you can find more models on civitai.com or huggingface.co
does anyone know how to fix the hires fix that is in Automatic1111? I have the latest version and for some reason, the Hires fix is adding geometry. It's horrible.
hi all
been messing around with AI art casually since nightcafe early days, but I'm finally using it offline
Nice! How are you enjoying it?
yeah it's been really fun so far
how can i get help about stable diffusion
what do you need help with?\
why cant you ask in here?
im trying to send image and i cant send it here
I'm glad you're enjoying it!
What kind of help are you looking for?
If it's technical support, there's #🤝|tech-support but generally speaking, you can learn about SD by going to #1072220168534642768 followed by #1080946152318443610
If you need help with prompting, there's things like #📝|prompting-help etc. I recommend taking a look around the server, as there are a lot of great places to get help
@analog mango
do more people use 1.5 or 2.1 here?
I wouldn't say I have any particular numbers, but there are many people using 2.1
I put "too many hands" in the negative"
and it literally just created a person to fill in the hand
🧠
The EU's AI Act seems to be moving towards making it illegal for individuals and groups to train on copyright content: https://slashdot.org/story/23/04/14/142246/european-parliament-prepares-tough-measures-over-use-of-ai
This will mean the megacorps like Getty Images will have monopolies, and won't help small time artists
It has been a while since i been active in this subcategory in the internet. Has Stable foundation figured out the issue with fingers/hands yet?
all this preemptive legislation has to end. No one knows for sure how AI will effect us at large. Greed is pushing all of this.
hey guys , noob there, what make that a lora work with some models but not with others? even when sometimes they look similar style like anime for exemple.
shouldnt it also be illegal for artists to learn from copyrighted works too?
For the EU, its a bunch of private lobbyists representing the white old men running legacy industries who are making decisions for the politicians in secret. Then they plan to share the finished result with a simple 'yes or no' answer to the public (that the MPs will vote yes on due to political pressure).
Nope, stable diffusion is terrible with small details like that.
Hi all!
Is this the latest version of Automatic1111: Commit hash: 22bcc7be428c94e9408f589966c2040187245d81
Git pull does tell that I already have the latest update
whats the most realilstic ai model
firefly
How to disable showing the image in automatic1111? I want to reduce the downloading of pictures and only stock them
Depends how you define solved but it is by far the best. Maybe not the best at first gen but between inpainting and controlnets using poses there are many ways to get good hands.
there is an option in the "Settings" tab in the "User interface" category named Do not show any images in results for web
thank y
no prob 🙂
what happened to thispersondoesnotexist.com?
anyone recommend a discord where i can generate some AI Pictures?
I recommend that you start with #1072220168534642768 and #1080946152318443610 but if you're looking to generate images, you can actually do so for free right now using the demo: https://huggingface.co/spaces/stabilityai/stable-diffusion
hi frens. is there a good api or protocol for creating videos on social media? seen a bunch of reels in the past week that are morphing images with movement, character based. I created a good one with an app, but I'm looking to use the actual beta instead of relying on a third party app.
Lebon james
I assume I should just slowly increase CFG scale the more tags I put in
Especially if I am getting my tags ignored randomly
how do I make a picture?
https://laion.ai/blog/paella/ is it this one?
all there is
Where do people share their art?
You can share them in many places, the "dreamer communities" category is where you should do it (look for the forums for a category that fits what you made), or #🏞|general-with-images and #1045349359044280360 (for 2.1 images)
Hello friends
When I want to generate a photo in the automatic 1111, I receive this message: NansException: A tensor with all NaNs was produced in VAE. This could be because there's not enough precision to represent the picture. Try adding --no-half-vae commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check.
Time taken: 31.30sTorch active/reserved: 3324/3910 MiB, Sys VRAM: 5565/6144 MiB (90.58%)
It used to work fine, now it's like this, what should I do to solve the problem?
Edit the webui-user.bat and in the line
Commandline_ARGS=
you add --xformers --no-half-vae
After the =
Then save and restart
Thank you very much🌹 🌹
My problem is solved
Can this error be fixed for high quality image processing and weak hardware?
error : OutOfMemoryError: CUDA out of memory. Tried to allocate 768.00 MiB (GPU 0; 6.00 GiB total capacity; 4.45 GiB already allocated; 0 bytes free; 5.22 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
Time taken: 2.86sTorch active/reserved: 4953/5368 MiB, Sys VRAM: 6144/6144 MiB (100.0%)
You can add to arguments (same place)
--medvram - it will make gen a bit slower, but you'll be able to go for higher resolutions or gen more images at once
or
--lowvram - same idea, but alot slower
I'd assume to increase speed 
thanks a lot 🌹 🌹
It became obsolete
i still enjoyed using it tho i liked all the faces it made with no interaction
You can literally create realistic looking faces in two seconds flat with what we have
Yeah but i didn’t want to make them i wanted them made for me so i could just. reload the page
When are the XL safetensors coming out?
How do I make NSFW images?
Has anyone installed sd cn animation? I cannot figure it out
You need a custom model. Default sd models exclude nsfw content.
people really ask anything
1.5 doesn't exclude afaik, but yea...it's better to take custom model anyway from https://civitai.com/
ok I keep forgetting but I also see it everywhere... what does afaik mean?
as far as I know
anyone used controlnet 1.1? How is it?
@alpine bear ,I have small problem in SD,might need your help

my notebook computer use rtx 4080laptop version as GPU. but its generating speed of SD is 1.5-2it/s

Hehe, aye
which is very slow
Ya
🤔how to fix it?
Read through my whole conversation with this guy #🤝|tech-support message
Cudnn.dll files, and python 3.10.9
Use xformers too
OK I will read them carefully
@long wind I'm using AI for my graduate thesis project and since I'm using your models for most of my output, I wanted to credit you properly with creating them. However, as I was working on my paper, it occurred to me that saying "models created by Sir Veggie and uploaded on HuggingFace" might raise some eyebrows, amusing though it is.
Is there a better way I can credit you for making and releasing models like wlop_100?
Veggie's got their name on HF. Could just credit 'Veikka Pulsa' but I won't speak for em 
I wasn't sure if that was their real name or another internet handle 
I'm not sure either 
But I think its real
Veikka Pulsa is indeed my real name, you can use it
Thanks!
good luck with your thesis
Not sure about where to ask but : Does anyone here have a program they use to make gifs? I want something that makes high quality gifs that also tells me what the file size will be.
is there some ai powered program that can output gif vids?
ezgif is okay but I want something better
ffmpeg is best.. its all command line options. might have a way to only output the potential filesize
hi
ah i see
ty for the info
@zenith thunder if you allow users to upload GIFs to your site then making them autoconvert into MP4 as background jobs is ideal (Twitter did it some 10 years ago).
You would probably need Ruby On Rails to make this nice and easy though.
Is there a bot that i can use for txtto image in this server?
POV: feeling __ having MX130 stressing itself for 8 min for 500x800 pic.
I do have it loading on my phone, and on my computer in firefox, but computer + chrome seems to give me a blank page right now
caught SyntaxError: Invalid or unexpected token (at auth0.f9e84e44.js:1:8532)
not loading on my ios phone either
or safari desktop - so I think broken from a deployment I guess 🙂
I'll try to make sure the devs can take a look
NansException: A tensor with all NaNs was produced in VAE. This could be because there's not enough precision to represent the picture. Try adding --no-half-vae commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check.
how can i fix this
their github directly https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/new/choose
But they do have currently around 1800 of those open so... not sure what to say
I'll just leave it then. it's a pretty niche issue that has a relatively easy workaround
im trying to install atomatic 1111 where i can ask my questions ?
here or in #🤝|tech-support
I think I'm doing pretty good this morning, so far, all things considered! And yourself, Guiz?
What are you up to?
Also, now I just want cereal because you said that lol
I'm quite good ! productive. I have mowed the whole garden once again lol
and I'm working on the PoW megamix model currently. I published a preview from a test I did last night, to see if I was on the right track
#🚀|pow-discussion message
I'm trying to finish captioning but...
it's so long ^^
I'm at 80% of the dataset done now
Oh heck yeah!
Yeah, captioning takes such a long time to do right
I did the test by auto captioning the rest of the dataset
it works, but I'm doing it by hand for real
To me, it seems that it's like.......it gets it half right, or almost right, or really good.
Depends.
"Mowed the garden"
Okay guiz 

He runs, but what will happen next?
On this episode of general chat
I just realized that my characters in Unreal are GIANTS compared to the character height
yeah... it's the 1 thing I do outside. it happens
Perhaps my joke has flown over your head 
Does anyone have any suggestions for a music ai, maybe one that takes samples for reference?
Check out our Harmonai server!
Thank you ☺️
what is "baked" in a model's name suppose to mean?
already include vae or lora inside the model file?
why don't you just copy some from civitai
It means the vae is already included
Hey guys, does anyone here use LORA models with Google Colab?
Hi, If I wanted to create an image based on both tex prompt and image, what method in python should I use ? does such a method exist? Does Stability offer sucha functionality? @vast ingot Last Question I swear xD
ummm controlnet?
I'm not sure how you are planning on using stable diffusion here. it seems from code directly from what you say. it's a way I don't really know how to control to be honest.
But yes, this is called "image2image" and is one of the 2 main ways to use Stable diffusion
it takes an image input, as well as text conditioning, and modifies your picture
Look what I found
thank you that really answered my question!
how do i change which browser webui1111 opens in?
It opens in your default browser
You can change it in windows at standard apps
i remember being able to select a browser SD opens in without changing default browser, but i haven't gotten that option again.
I'm checking the current available options https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings
and it seems to only refer to the default system browser currently. if it was there before, it seems it's not here anymore
Hello Morning. Is there a Moderator here that i can possibly reach out to.
Thanks 🙏🏻
@vast ingot hi 👋🏻 are you by any chance the moderator 👨⚖️
First come first serve 😆
if you need a private channel you can open one if needed, using #1010934719455707218 by the way
May I DM you I wanted to ask something in private thanks
I can't DM directly, nope, sorry, I get too many
but I'll see you in the ticket in a second
Done
"Guizmus , The Moderator" 
we'll do a movie soon
I want to see what words I may have overused in a dataset captioning, what will be trained enough and what could need a change before I start.
I would need to dump my file names in some way and analyze the words, but somebody may have encountered this problematic already ?
Wassup edited, because I don´t want to get banned
Bruh
The auto mod is kinda bad here
What's the relationship between deepfloyd and SDXL?
is there a way to force SD to render subject only? - with 100% black background. I did try "character sheet", "empty black background" etc. Nothing seems to work well so far. It will do characters in this manner, but i'm trying to get some vegetation.
(transparent BG would be even awesomer)
i think you'd have to train it on pictures that constantly have black backgrounds
but thats just my opinion
makes sense
try (minimalist), (simplified character design), (line art)
Regarding #dreambooth I wanna ask if it is a sensible idea to train my first set of photos based on , say, chilloutmix first. Then I use another set of photos to train based on the checkpoint that I have already trained by the first set of photos ?
lets try asking an ai
be right back
im using bard
the one that's supposed to be bad?
i pinged you
in general with images
i signed up when it came out
there's a waitlist
like dall-e
somewhat i guess
not really reliable for coding though
i pinged u in #🏞|general-with-images with the response
it gave
so u can look at that for reference
can it code acheong?
no
probably not
Google has just revealed in an interview they're rolling it out slowly
so it can do much more, but they're letting everyone get used to the tiny features
so it can do more than this
Hi all 🙂 I have a question. I am using Deforum in Stable Diffusion. I have an automag 1111 installed. Version 1.5 (on the screenshot).
I heard that a new model is out. Please tell me - does anyone have a tutorial on how to install a newer version?
P.S. I am a newbie, sorry for the stupid questions.
v1-5-pruned-emaonly.ckpt [cc6cb27103]
hey there
you just drop models into "stable-diffusion-webui/models/Stable-Diffusion/"
@runic igloo
well, there is a new model since 1.5, there has been lots.
"base" official models go up to 2.1 currently, XL is in beta on their website but you can't download it yet.
But civitAI has thousands of models for you to choose from
string some models together ? WHat do you mean ? merge existing models or train a new one ?
I'm no python dev though. I do a little but chatGPT is mostly there for me.
I think there is an extension for automatic1111 and civitAI
nope I'm not going voice, lot too tired
you'll need to explain more clearly what you want to do, I don't understand you there. I thought you wanted to download models : civitAI or Hugging face are the place to go
even through code
HF mostly because you can clone
you'll need to build the library by hands, or web scrappers for each type. there isn"t a common place for each, it's all over the web
downloading and experimenting with the models is really the only way to learn them. there's no standardized way of defining what a model can do and what it is meant for
hope that the author's description is reliable
often times it's "This is the greatest model that fixes hands!" though
you should ask the #1011228667659178055. I can't be your help on this there. I have too much on my plate to do, you're asking me to think on things I have no idea about there, and I'm far from the best person.
Start with a spread sheet to build your notes out. in time you could convert that into a DB
I would love to talk about modellinga DB for these purposes but again, there's no standardized way of defining a model yet. i don't know how we'd get there though. (DB Model in this context, not an AI model)
i'm not sure what string models together means still
but why call it "stringing models together" what does that mean?
a database of notes doesn't really mesh with that phrasing imo
i think you need to work on defining your goal better.
before asking for help
Can you share the link where to download the latest model for SD? Official)
i contextualized notes here yes
You really need to define your goal a lot better sorry
i obviously don't understand you
that sounds liek notes, but you're insisting now that i made that up
I want to learn where from can I download models, because civitai only gives me hashes
I did but you're saying you're not making a db of notes now
latest model (2.1) https://huggingface.co/stabilityai/stable-diffusion-2-1
last model of the 1.X generation (1.5) https://huggingface.co/runwayml/stable-diffusion-v1-5
so i don't understand, obviously
now i'ts suddenly an argument
you don't need to bash me for not understanding. i've told you a few times we're not getting there
Let's stop there
This is not a place for personal attacks, nor responding with more
I'm not gonna sort who is right or wrong, you have the right to differ opinion. block yourself, move this to PMs or I don't care
those aren't the talks that are OK on this server
Hello all. I'm new to SD and have a couple of hardware questions. I have older system with an Intel Core i5 4440 and a NVIDIA Quadro K2200 with 4GB VRAM. I have A1111 installed and have been running SD locally. It's been working to generate images, but it can be a bit slow depending on what I'm attempting. I can't go larger that 512x768 at 20 steps, and I can't upscale. It's fine as I learn, but I know I'll need better resources to do more and grow. I've been looking at systems to upgrade to, and from what I've read something with a Nvidia 3060 with 12GB of VRAM is what I'm aiming for. Outside of the overall implications of have a faster processor for all computing use, is there any other key factor I should look out besides getting a Nvidia card with 12GB or more VRAM for using SD? Is there any consideration for AMD vs Intel, or if I go with AMD consideration between a Ryzen R5 or R7?
I think I have said my part on this : if you want to continue discussion on this level, take it to PMs with them. I warn them too on exactly the same thing, this ways explicited for both of you, not just one
idk about wiki, but you can get models there
https://civitai.com/
I think generally there most popular upscalers are
- ultimate SD upscaler
- 4x-UltraSharp
Can't help with link on 1st, never used it.
Your link has link to ultrasharp, but it's...uh...not for "everything", let's call it that
I don't do upscaling , probably someone else can provide more info
Is there a comparison of different fine-tuning methods? I have a custom domain-specific data-set of ~10,000 images of items that I want the model to be good at generating variations of.
How can I find out if I should train using lora or regular continued training of the SD model?
I would try Lora with very low learning rate 🤷♂️
in theory it should be comparable to full fine-tuning, although some people report they had problems with it
Hi all, new to the community!
Just wondering if there was a YT resource for prompt tips and fine tuning basics for auto-photoshop SD plugin specifically?
at the moment it feels like I'm just throwing paint at the wall and hoping it makes what I ask for lol
I should say that I looked through the 'start here'thread' too but couldn't see any video tutorials or anything?
How to use the bot
Usually authors tell in which folder you should put it
(if it's not in description, I have no idea )
I have only 1 upscaler downloaded manually and it's .pth
we don't have a bot, you can install SD locally on your machine or use services to host it
greetings, how do i manage to create a seamless pattern?
How do LAION, Google, Stability AI, etc create labels for billions of images for their diffusion models?
They aren't human labeled right? That's too many images..
https://ai.googleblog.com/2022/05/image-text-pre-training-with.html tons of papers on this subject. thick reads. i don't fully get it, but this is the last one i was poking at trying to understand.
What is the best way for me to learn how to make photorealistic images? My people keep coming out looking like monsters with lazy eyes.
Interesting.. does that really translate well into prompts?
One thing that doesnt' explain is how Stability AI and Midjourney have solved the problem of creating prompts for billions of images
I've just started using stable diffusion with automatic1111 yesterday and I was looking into inpainting. Using version 2.1 I generate some image and then send it to inpainting but results are really poor and I was wondering If I must install or configure something or maybe I need to use another model for inpainting. Any help or tips ?
🤔 How does that help you with descrbing images though. As far as I know, the input to training diffusion models is an image and a description, not a classification.
I.e "a grey cat in a yellow rain jacket" How does classification help turn that into a training sample prompt for a diffusion model?
theres pretraining too. clip in stable diffusion 1.5's case.
ahoy
Can you "paste" an object previously generated inside another image?
There used to be a channel where people would generate textures
I thought there might be a thread for it, but I cant find it if someone can link it?
dont know were to ask this but, i've been searching for the embed:bad_quality": "c1c5471862. Can't find that embedding, please help to were to download it.
There's a thread for it here: #1072018553768779837
and the old archived channel: #1019350146619818055
You'll need the community archive role from #👥|roles to view the 2nd link
Does discord delete metadata on an image when uploaded? If I download some AI art will it fit into PNG Info?
yes, its deleted by discord
ty i need to make some
Im looking to upgrade my GPU currently Nvidia 750 with 2GB vram, stable diffusion 1.5 is pooping when I push past 1024x768. But leaping into the RTX 4090 with 24GB is a bit painful. What benefits (perhaps resolution?) besides speed that I can only unlock with the RTX 4090 that I can’t with the RTX 4070 with 12GB?
for stable I dont think anything
quite a bit for games I suppose
hard to top a 4090
you guys seen any place to make ai music thats on the level of stable for images?
go for 3090
4090 is overkill unless u got money to burn
at least if ur serious about diffusing, cuz XL is probably gonna be a vram hog, at least during the first stretch
If you have other models you like better for SD is it safe to delete v1-5-pruned?
I'm trying to save space where I can.
Yes, and no
base 1.5 is way too valuable to delete IMO especially if 1) it gets blown off the internet and 2) it's a good idea to use it as part of a blended model if u ever make ur own merges (which you should)
I'm pretty confident I downloaded someone's png's and png info was still there, but some people upload without info
(including me in most cases)
This was the case before discord updated it and now every meta data gets removed at upload/download
Is Platform API of https://stability.ai/ broken? it's can not open the page,
works for me, try ctrl+F5 on the page
is there a way to pause rendering?
hello everyone! first post here so sorry if it is in the wrong place....
You guys know is ConfyUI is on discord? looks like an interesting project but I cannot seem to find any community for it.
hey there Niconico
well, I do love comfyUI, and it gets talked a little more rencently around here
but I don't know of their dedicated community for it
you can find quite some workflows around though
They only have a Matrix Space it seems
I come from vfx background and the idea of a procedural production of ai art is very tempting the node workflow could really be a big step onthat direction.
there are specialized models for inpainting. However, for 1.5 I never observed a big quality difference between using a specialized inpaining model or a normal one
привет
Good morning, everyone!
why there suddenly come out a downloading in shell when I generating a image and it stuck on 98%?
shell prompt:"Downloading: "https://github.com/sczhou/CodeFormer/releases/download/v0.1.0/codeformer.pth" to D:\code\stable-diffusion-webui\models\Codeformer\codeformer-v0.1.0.pth"
I am confusing, why would this downloading happen?
You activated restore faces it seems
sorry, I don't quite understand.
this is my positive prompt:"a 19 years old girl, best quality, masterpiece, (realistic:1.2), 1 girl, brown hair, brown eyes,Front, detailed face and breast, beautiful eyes, small breast lora:koreanDollLikeness_v15:0.1, white jacket, jeans, fashion, smaller head, fashion girl, wrinkle t-shirt, hair covers ears, hands in pocket lora:japaneseDollLikeness_v10:0.2, real face, real skin, realistic face, realistic skin, rough skin"
oh it's an option
I see.
I will learn more from google. thx you so much
You're free to ask questions here. There's plenty of info in #1080946152318443610 and plenty of community guides, etc
hey guys, what are your favorite LORA models?
Hello everyone, how’s it going?
The previous versions of sd run perfectly on 3090s
guys
what prompt should I use to make the skin of the character like literally black?
can someone help me understand how to find appropriate inpainting models for sd 2.1 ?
#📣|announcements rip, videos over 100mb don't embed
what's up with that link in the announcements? https://cdn.discordapp.com/attachments/1002292398703001601/1097900421584007318/DiffuseTogether_-_Peter_Gabriel_Twitter.mp4
it doesn't even load in browser for me
wget works tho
damn fail
Videos don't load in browser, they just download
But the file was too big to embed in the message
here's a re-encoded version if you wanna view without downloading https://i.thomiz.dev/Ccj9gkBa.mp4
Sweet
this one is 35mb
Can anyone give me a link how to make a photo od multiple characters in it?Im trying to make my dnd party.
Are there any benchmarks comparing Nvidia GPUs vs AWS Inferentia for StableDiffusion-like inference tasks?
fffff
It's the thought that counts 
so true 

you can try composable diffusion. Otherwise, I would try to conpute each character individually, copy them into a picture together, then use img2img on it
trying to find SD 4.5
what channel would it be in? or just link pls
Hey folks. My name is Vatsal. I am a Computer Science student. I have received a research fellowship in deep learning and a data science internship at a top vc backed startup. I’m currently doing ML research at Stanford.
I am looking for people with experience fine tuning stable diffusion for Text2Image with excellent results. I have a custom dataset with image-caption pairs that I built readily available so a decent amount of the work is done already. Didn’t have time to do proper fine-tuning cause I was so busy.
I am really looking for anyone with significant experience fine tuning stable diffusion that can help me wrap up this project with good results.
Ideally we would also package the product and ship it as a web app.
guys, is 378 words too long a prompt?
damn I forgot. Does anyone remember what the name of the extension for the one that checks your model for CLIP deviations, numbering etc?
is the local url down rn?
guys. what happens if you use a Lora as the checkpoint?
you can't
Are there any photoshop plugins for comfyui? I'm having a hard time finding it with search engines if there is
is it possible for me to find a pic of five friends then scan it via control net and use their poses for the characters?
Would any one be willing to help make a picture for my Kick Starter. I want good art and this does take practice and skill for good results.
I'm currently running A1111 off of Colab because my current mac is from 2018 with a intel chip. I'm considering upgrading to a new Macbook pro with the following specs so I can run it locally. Would this be enough? Any limitations with running it locally on a newer MB Pro?
Apple M2 Max with 12‑core CPU, 38‑core GPU, 16‑core Neural Engine
64GB unified memory
2TB SSD storage
Guys, is there any colab link that loads the lora models from google drive?
But you'll probably have to launch it - upload what you need - resstart it
It creates folders with Embeddings,models, loras
Could it be that the lora models I cannot save in the folder: models/lora/positive/?
when i do it manually it works perfectly my lora models.
What do you mean?
Folders should work
I am interested in that 2 week challenge
checking out the songs to see if I can do anything cool with them
I need to resonate with these songs in order to try to do this challenge, but the signs are not good 😅
Updated my Installing Automatic1111 guide, in case it's of use to anyone, feel free to suggest improvements: https://rentry.org/installing-automatic1111
I created it for the 4090 Grotto community on Discord. If you wish to join feel free to DM me 🙂
the lora files I usually put manually in the path: models/lora/positive/
I just listened to all of the songs in the challenge and unfortunately I am not feeling these songs at all. As much as I would love a shot at $1500, I just can't sink the proper amount of passion into songs I don't get anything from, personally
Best wishes to all of those that compete. Would love to see what you all do!
I get the artistry of the songs, and they are very well made from one music producer to another, just not my cup of tea
Why tho?
a1111 have docs on installation and other things
Yeah, that seems like an extremely oversaturated thing to document IMO, but who knows
hey guys
I had a question concerning SD
is it in any way possible to generate images using negative prompts only?
@drifting lagoon
All you have to do is prove/show that your prompting as a human determined the expressive elements/details/style in a prompt. Attention/emphasis, break padding, negative prompts, etc. You can refine your prompting in SD to exercise control over what's generated and how, not letting the AI do it for you.
And really, even if prompting alone wasn't enough, and in most cases won't be due to people just typing in a single line and pressing generate... people that actually work on and refine their AI art beyond just prompting, such as... inpainting/outpainting details, or any other kind of human controlled influence, they'd be able to copyright it.
This is the same document we were reviewing before I see nothing here that makes any change to what we already discussed.
So does legislation in that document not read that as long as you have sufficient human influence on the overall work and not let the ai handle it, you can copyright the work?
Because the reasoning why simply prompting alone wouldn't work is very clearly defined.
Which means all you have to do is make it fit the criteria. And you can do that with prompting.
To be more specific. The documentation reads that you prompting doesn't qualify in the case of the AI deciding on the elements of composition for you.
I don't really see how they can make it any more clear than #27 here: https://www.federalregister.gov/d/2023-05321/p-46
The prompt could be protective if it is creative enough, but not the outputs themselves. They view writing the prompt in a similar way to giving instructions to an artist for a commission. It doesn't matter whether you tell the artist you want them to "draw a girl with pink hair and brown eyes in anime style" or you write an entire book describing every detail of the image. In both cases the artist created the image and so they have the rights to the image, not you. You could copyright the book that you wrote to describe the image, but the artist would still have the rights to the actual image. The only difference here is that the "artist" in this case is an AI and therefore can't hold rights to the image so nobody does.
Is there a way to put an image sequence (from a video) and have an sort of animation as a result?
yea thats how animations work even without AI
i mean with stable diffusion xD
"While some prompts may be sufficiently creative to be protected by copyright, that does not mean that material generated from a copyrightable prompt is itself copyrightable." The actual words. It doesn't exclude new copyright either
like i have an image sequence, how can I have it exported with some prompts, for example? Like you would do with a still image, but in this case several images that would result in a sort of rotoscope
imagine reading the cited link
How do I install loras in google colab
And nowhere does anything say that any/all material generated from a prompt can't be copyrightable.
The office gave an example on prompting not being enough because of how their understanding of how prompting works, specifically stating that due to a lack of control of how the AI understands and generates prompts, it's not sufficient enough to say that a human had influence and creative control on the output.
They specifically state that the material generated from a theoretical copyrightable prompt is not itself copyrightable.
27. While some prompts may be sufficiently creative to be protected by copyright, that does not mean that material generated from a copyrightable prompt is itself copyrightable.
It says it doesn't mean that it is. It doesn't say that it can't be.
Because people needed a guide, I don't know what to tell you 😄 🤷♂️
Go on? What does this have to do with what we're talking about?
Because my whole point in this conversation/the overall topic is that the more human input/influence is involved, the higher your chance of being granted a copyright. And the more the AI does, the less your chances. Because it's case by case right now.
It's due to their current understanding of how prompting works that has them use prompting as an example.
You can be granted a registered, documented legal copyright. That is necessary in order to go to court for your copyrighted work.
You have a general copyright for the things you make. Yes. But from a legal standpoint, the copyright has to be registered on a legal record with documentation.
This still doesn't have to do with what we're talking about. I knew what you're saying already.
You can't even go to the court and sue if you don't have a registered copyright.
This conversation isn't about the prompts
It's about the AI generated material from the prompts
Being registered for a copyright or not
And nubby said that the work cannot be, when nowhere in the documentation does it say that.
Correction, edited
I understand where you're coming from, I'm not disagreeing with what you're generally saying right now
But purely in this documentation
It's saying that just because you prompted something, it doesn't mean you can register it for a copyright
But what I'm saying, is that it's not saying you can't register it for a copyright
I'd be happy to be proven wrong. For now, the documentation put forth by the US Copyright Office states that the product of a generative AI requires additional human authorship in order for a copyright to be registered. It's quite clear as far as I can tell, and I have not seen any specific line showing otherwise.
If I had seen a single line saying that AI generated work from prompting alone cannot be registered for copyright, it would've been case closed.
But, one. The offices understanding of the technology is new, as is almost everyone's.
"Based on these developments, the Office concludes that public guidance is needed on the registration of works containing AI-generated content. This statement of policy describes how the Office applies copyright law's human authorship requirement to applications to register such works and provides guidance to applicants.
The Office recognizes that AI-generated works implicate other copyright issues not addressed in this statement. It has launched an agency-wide initiative to delve into a wide range of these issues. Among other things, the Office intends to publish a notice of inquiry later this year seeking public input on additional legal and policy topics, including how the law should apply to the use of copyrighted works in AI training and the resulting treatment of outputs."
So things could be subject to change, and likely will be. For better or worse regardless of what side anyone is on.
And two, again. Nowhere does it say that AI generated work itself cannot be registered for copyright.
Yes, hopefully things will change in some ways. I'm not talking about how things should be, or how they may be in the future. I am only talking about what the document we have says right now.
It just said that it doesn't automatically mean just because the prompts can the generated work is.
I have 0.6 credits left and in dreamstudio I do something which says below takes less than that, but it still tells me "not enough credits". What could be going on?
No idea, haven't used it before. Can't help you with that, sorry.
The issue is that they do not consider you to be the creator if you prompt a generative AI for an output. You created the prompt and so the prompt could potentially be copyrightable, but the AI created the product and so it is not copyrightable. Modify the output enough and you can claim to be the author. How much you have to modify it is the only real question left.
I mean. It seems they've been "generous" with allowing some AI works to be registered at all. So I think they just need further specific guidance and understand on exactly how the process works. And even then, most things will have to be a case by case basis and defended/proven/"argued" in court.
Yeah, but the thing is... it's not the AI outputting it that's the issue. It's how much the AI is doing compared to the human.
Yeah, fair use has pretty much always been a thing, which is why we can use models trained on other people's work legally to begin with.
I refer you back to my example of giving instructions for a commission to a human artist in the #🍥|anime channel from earlier. I have nothing further to add so I'm going to excuse myself from the rest of the conversation.
When did this happen? I remember the ruling of the US Supreme Court and US Department of Copyright
I linked it here #💬|general-chat message
The SC and DOC ruled that AI works cannot be copyrighted right out of the Ai itself, as the process can be streamlined to be pure Ai to AI.
They said that all things AI generates are public domain, as their data sets were public domain. However, if you can prove transformation of the original content, you are still eligible for copyright.
Ie, a drawing, a hand edit, an input of your own creation, really anything that would lead to a normal public domain image being copyrightable
So an AI image cannot be copyrighted unless you had some provably human input. For example, that includes using a photograph you took as an input.
Link an official government document/source.
but its also a great way of saying that the average person on some sketchy AI generation app on their phone won't be able to copyright low effort crap they just happened to spam the keyboard to make.
Its saying that the general masses of people who put in no effort cannot copyright
It says it right there in the document @drifting lagoonlinked
We had a whole breakdown of it less than 24 hours after it was ruled in this very chat
Nowhere in that document did it say the generations can't be.
i actually have the document as a PDF on my PC
It used prompting as an example, and the example isn't even completely accurate/always applicable to every program.
Whats the best way to train a model on realistic people? My friends want to do ourselves as fantasy character portraits. I tried using Lora, but I keep getting stuff that sort of looks like us or is cartoony. Is dreambooth better, or Everydream2? Or are there some really important lora settings to get percises results?
So far in my test Lora < Dreambooth
But I dont know between Everydream or Stabletuner which is best.
does anyone know if in camenduru's colab, we can change the code to make the lora models be loaded from our google drive folder?
exactly, I shared it on google driver access for those who have the link. so, I put the link in place of the higginface url, but when running the code, it gave fatal: repository not found
hmm... will only git repository work?
ok
I replaced the line of code:
!git clone !git clone https://huggingface.co/embed/lora /content/stable-diffusion-webui/models/Lora/positive
for this:
!git clone "the drive folder where the models are" /content/stable-diffusion-webui/models/Lora/positive
I understand, is it the beginning there? I need to change !git, is that it?
and if I would change it to: "!wget"...
ok, thanks bro!
Thank you for your help
Guys, is there a website that can generate SD with chilliout model and korean likeness lora?
ty
Thanks, will try it
And yet 🤷♂️
The installation instructions there are good enough, but I've provided some improvements in my guide that will lead to a performance increase and better maintainability. Hopefully people will find it useful. If it's not useful to you that's totally fine.
hi
Thanks for letting me know. 🙂
<3<3<3
Whelp, tis almost midnight! 🌙 Off to sleep I must go. Have a wonderful evening, everyone!
Is there an official Stable Diffusion API from the Stable Foundation, like the way other gen-AI companies have commercial APIs (e.g. - OpenAI/Dalle-2 API? I did some web searches, and it seems like the top results are just ML service providers (HuggingFace, etc.) that have Stable Diffusion instances available publicly.
Are there a few websites where people upload the pics with metadata?
Hey I'm new to Stable Diffusion and I'm not sure how this works where do I learn about that and how I can use this program. Also I heard Stable Diffusion is powerful enough to make ultra realistic looking humans is that true as well. Thank you in advance for answering my questions.
i think the #1072220168534642768 and #1080946152318443610 channels have a bunch of good information
i learned all by myself just by installing automatic1111 and trying it out myself tho, theres really not that much to it
theres instructions on how to install it if you scroll down
@broken smeltmy output pics are less saturated than the preview, what do you think is causing this
not sure but i'd test around with trying different VAEs if the outcome isnt to your liking
i wonder if olivio ever made that VAE explaination video
SD first makes images in a 64x64x4 square, the VAE is whats used at that point in the process. the base model installed with a1111 comes with one inside. you can change it though, so look for fun ones to try out
Its needed for Color and Detail correction.
Most models have it already included.
But most anime models and merges with anime models not
its just a trained model, your question is the same thing as asking how a phone works which isnt really necessary to use the phone tbh
unless you really want to get into the ML field of research lol
Its a finetuned diffuser it seems.
Here is more Information on the official sd vae:
https://huggingface.co/stabilityai/sd-vae-ft-mse
put another way, if u were to inpaint an image and chose the "latent image" option only, the picture would be a big colorful blob, since that's the latent image.
a vae decodes that and turns it into what we know of as an actual image and not those blobs
like i said interesting stuff but in terms of actually making art, not really necessary to know
I've been using midjourney and running into a million problems. Also the extreme banning of words like shower, petite, bikini, bloodred, and etc.
Okay so this doesn't work like Midjourney on a discord but on a computer program instead? VRAM is what exactly. So I'll also have to get the latest python and something caked Automatic1111? This sounds so different from Midjourney...
Okay. So in comparison Midjourney is crappy compared to Stable Diffusion?
ya the very first iterations of ai art gen was all done in google collabs i believe, midjourney does something similar where you dont run it on your computer. you dont have to pay fees though if you have a decent computer and run it yourself, which is why automatic1111 is so revolutionary imo
So download Automatic1111 and I need Python and what is PyEnv?
yep you can just follow the instructions there, i think youll need python and git
yes
Git? So I'll be making Ai Generated images on my computer instead of my phone and the program is Free?
yep thats correct
git is just how developers pass their code around pretty much
the instructions go over how to install and use it
its fairly simple but if you need, i believe there are some step-by-step videos too that you can google for
@glad pond before you search and follow wrong guides try this tutorial:
https://m.youtube.com/watch?v=VXEyhM3Djqg
If you have any Questions come to #🤝|tech-support
heheh, thats the same guide that i followed that made me download 3.10.6
we come full circle
xD yea because on Automatic its still the "recommended" version, he should update that to 3.10.9
I'm hoping going to school for computer programming helps.
Thanks I will be looking at that.
How is Stable Diffusion with Hands and Feet?
I get tentacles pretty much with Midjourney.
Its difficult at first but not impossible. It depends on the model you use, the negative Prompts and if you upscale
I've seen people make almost flawless characters with this program and I want to make allot from this. My goal is to make 3 Anime Novels turn 1 into an app game and all 3 into Animes with a program that can give me free range. Midjourney has soft bans on words and when I tried to create a character with a Bloodred Dress on it let me create the character then erased it once the program discovered the word BloodRed...
yeah censors are lame
I was trying to show the evolution progress of a Slime to a Slime Humanoid male and female. And even though they had no private body parts showing, just because they had Humanoid forms and the body was pretty much the same color the creation was erased.
Yea in SD you have the ultimate freedom to do such stuff. It has a Learning curve and you need to learn a lot but its very powerful and fun too
That is great to hear. One of my Anime Novels should be rated R'ish bering an action superpower Story. I was wondering how I would accomplish that with Midjourney. Does Stable have an easy way to recreate the same character in different positions also is Stable good with multiple subjects in a scene, for fight scenes and army type scenes?
I felt that way.
one technique i can think of is creating what's called a lora of that character, where you train a new smaller model on specifically that model. so then anytime you prompt "charactername", you will get back that character.
Wow by name... That sounds great...
True
yes like @broken smelt said a Lora trained on a specific char would be the best option, you also can use multiple extension to get poses right (ControlNet) or multiple characters (Latent Couple)
Y'all got me excited. Now I can't wait to get home and work on this... 😁😁😁
So that link you gave me will teach me the basics i need to know about Stable?
That link will show you how to install Stable Diffusion on your local PC
It will not teach you how to use it xD
Just the basic get ready stuff
Okay that's what I need to... Lol...
Here is a little promptbook guide:
https://openart.ai/promptbook
For using SD you should first try out and play around before watching "step by step how to get 4k images" stuff on yt
You learn more by Experimenting with it. For Questions feel free to ask in this dc
yeah imo the best way to learn is to just install it, throw a few runs and see what different stuff do
kinda similar to learning photoshop for the first time
you could watch a whole video on everything but thats boring, id rather just click on everything and see what they do first for a few mins on my own
in my experience, that first learning session helps you learn from the videos better
but mabey thats just how i learn
I'm that way to.
Double the efficiency.
Generate while you watch videos on how to generate ✅
Can I friend you here?
You can do that?
Why not?
I mean...youtube doesn't take alot of resources
If you have 8gb vram and use highres fix up to FullHD you need to close every other tab xD but tiled vae fixed that
✅ watch videos on your phone then
Idk , I always have browser on, if what I have after that - not enough for SD - That's SD problem, I'm not turning youtube off for it 😄
evening...
any suggestion on which model best use for horror, lovecraftian image generation ?
or morning, rather. Anyway I had a question - is there some way to exit A1111 webui on Linux, without hitting Ctrl-Z? I always end up having to terminate the dead process, after that
maybe Realistic Vision 1.4, at least for more photorealistic imagery?
Probably something that can do dark images...?
yeah, there are horror themed loras and checkpoints
for anyone curious, I may have just discovered that you can use walmart eyes in 1.5 and give that extra weight to give people blue eyes without making their outfit and everything blue. may put people inside walmart if you aren't specific. do with that what you will
tried a single image across multiple models, then spaghetti demon trump and a lovecraftian monster and walmart eyes consistently gave blue eyes without making everything blue

Okay I'll be doing just that...
hello which chat can i ask questions about the installation of stable diffusion
#🤝|tech-support is the best for that
Hey Guys, I created a resource on an introduction to what is Diffusion Models and a basic intuition behind them! Do check it out!
https://www.youtube.com/watch?v=l6-B3bzQzb0&t=21s&ab_channel=Jovian

checking it. but you used a link that skips first 20s 😉
sry for late follow up; but got any specific models in mind?
hi
hello
hi
Hello everyone 🙂
Noob here in deforum 2d how do I control the heads and objects from getting cut off the top screen wich axis do I use ?
the github doesnt work for StableLM
repo gives 404 lol
link ded
yoo they just dropped a language model
nice github 😎
lol did GitHub rug?
404 moment
but did they really? github is 404'd
404 pog
lol
stable diffusion but text
404 lol

looks like it's not public yet https://github.com/stability-AI/
i think they didn't realize it's still a private repo
1st of April yet again
coming up shortlyyy

interesting to have a new LLM to play with. lots of LLaMA stuff going wild lately
yes guys it's working!!!!
are you guys able to run llama on your personal rigs?
or are yous using cloud?
also how much vram does this require?
Did they forget to make the repo public lmao
coming up shortly
how much vram 
github link is dead
yup
Ugh. So many channels here. I don't remember where I spotted that announcement ...
yep
it alive now
awesome
huggingface link in repo still private though lol
The notebook seems to be still private
so like
Why did they make an LLM based on the terrible pile
precisely WHO is going to be able to run 65B?
live now
wouldn't this take like an A100.
live nowwww
i don't think even a 3090/4090 can run it.
HF repos are private too 
hugging face down?
2 3090s in 4bit but it doesn't matter the pile is trash dataset
unless ya'll did some wizardry with vram costs.
Maybe the announcement was made in haste?
so how to download and use it?
now to see if it runs in oobabooga
Wait for them to make the HF repo public I guess
And all the other models are equivalently as bad as gptj and gpt neox
4096 context window!
What architecture does it use?
excellent
NeoX?
It is the equivalent model as gptneox and gptj it's the exact same dataset but with 4000token context
NeoX was pretty alright
is the language model thing like chatGPT and can I run it locally? maybe use it in my game?
HF will be public soon, please be patient! Our GH repo is live: https://github.com/stability-AI/stableLM/
It's the exact same model as that
it's not the pile it's a 1.5 trillion token dataset
Is this coding focused?
I'm talking about the architecture, not the dataset
is that one still a buzzkill? as in it wont respond to controversial things
But a 3B model on 800B tokens sounds promising
I wouldn't recommend trying to put models with billions of parameters in a game
but yes, Large Language Model, same as GPT
neox is definitely not a buzzkill
Excited to see how it turns out
can I use this localy? and how
I would!
i am strongly doubting anyone can like
ITS UP
Yes up
How do I run LM? 👀
just heads-up HF link is broken
isn't this supposed to be pip install accelerate bitsandbytes transformers instead of pip accelerate bitsandbytes transformers https://i.thomiz.dev/TxPp7H5x.png
in the notebook
But gz exciting all!
the HF space seems to be working now https://huggingface.co/spaces/stabilityai/stablelm-tuned-alpha-chat
yeay I saw
I'm running llama65 right now
more competition is better
Where can we submit feedback for StableLM?
Don't we need this to be public to use the model locally? https://huggingface.co/stabilityai/stablelm-tuned-alpha-7b
how much vram is it taking?
46-48 GB
I'm reading 0.8t tokens?
brooooo
still waiting for my answer xdxdxd
not many people have dual 3090s
the 13 B yeah
or an A100.
Why go smaller than llama ugh
like for all intents and purposes just about no one is running 65B
i'm not about to drop 5,000$ for it.
Redpajamas is doing 1.2t on 7/13b
3B and 7B sound good, especially if we can quantize them to int4
it will be in half in the next month and in four at the end of the year I reckon
it is public now
is the chat on hf using my gpu?
StableLM-Alpha models are trained on the new dataset that build on The Pile, which contains 1.5 trillion tokens, roughly 3x the size of The Pile. These models will be trained on up to 1.5 trillion tokens. The context length for these models is 4096 tokens.
int4 quantization will truly democratise it
Even on 7b?
and also CPU CPP ports which use RAM instead of VRAM
can only hope they reduce vram costs of LLMs
because right now they are totally bonkers
not sure, the docs don't seem consistent
Yep, I can see 4 versions https://huggingface.co/stabilityai
https://huggingface.co/stabilityai/stablelm-tuned-alpha-3b
https://huggingface.co/stabilityai/stablelm-tuned-alpha-7b
https://huggingface.co/stabilityai/stablelm-base-alpha-3b
https://huggingface.co/stabilityai/stablelm-base-alpha-7b
no idea what "tuned" means yet, just here are the links
niceee
Emad come clarify this ':c
tmwr is event
too many things tomorrow :0
it's taken so long to load 🥲
starship launch as well
hopefully Stable model will be less woke
what are the differences just sizes?
175B planned too ?????
I guess if DreamStudio or something offers it as an API I guess 🤷♂️
no idea
dev says "1.5 T. Wait no let me double check"
"We would love to port llama.cpp to work with StableLMs"
That would be awesome! 👍
The Tuned model seems to be referring to instruct finetuned from various open source datasets
how do i use Stable LM im not very intimate with coding so can somebody give me a guide?
Sadly it's with alpaca dataset as fine tune instead of gpt4-llm which is alpaca but with gpt4 outputs instead -_-
It seems we'll do an ama about this <t:1682019000:R>
https://discord.gg/stablediffusion?event=1098267150726221824
Still no answer from the AI yet XD
Everything they've done is like two steps behind llama 😭
I believe there’s legal issues with using gpt-4 outputs as a dataset, I’m not a lawyer just a guess as to why.
SD literally used gpt3.5 outputs
it would have almost single-handedly fixed the open source LLM issue, as I personally think Llama is one of the BEST models
So that same applies regardless lol
Exactly, we can't use it to train models
because ClosedAI 🤷♂️
They used alpaca dataset to fine tune, which is gpt3 outputs, while there is a gpt4 version of alpaca by Microsoft called gpt4-llm
Stability fine tuned on gpt3 outputs @charred mesa
I'll try and port the 3B to ggml. With its small size, should be able to run on about 2GB of RAM. Though the context length worries me a bit. Might bump up the requirements to 4GB
It's literally part of their fine tuned stablellm model on hf
also, we opened #1098025024541167646 channel for any discussion/feedback about it !
have you seen OpenAssistant?
Yes why
imo that's better than gpt4
No, the fine tuned model stability put on huggingface used gpt3 outputs dataset
To fine tune with
When they could've used gpt4 version of that dataset that's my sadness
Hey guys, is their a way to invest in this project?
Can someone clarify if they used EAI's V1 or V2 Pile?
I disagree but also it's a 30b llama model powering it sadly not 3/7b size so not reproducible from what was released so far by stability
Yeah, if you have questions, or would like to discuss, please feel free to head over to #1098025024541167646 there to give us your feedback, etc!
A portion of pile v2 it looks like
Where did redpajamas get their dataset?
Okay, cool. Cause someone reminded me the original Pile was 300 B
Oh boy I bought an RTX 2060S 12GB a while ago and I hope I can use the 13B model
lmao good luck.
6.7b typically already takes around 12GB on most models.
yeah that's what they said when SD was released and only high-end customer GPUs could run it.
unless they pulled some serious wizardry with this i wouldn't get too excited.
but hey, i'd love to be wrong.
Guys, hopefully 4-bit GPTQ support comes to StableLM
would be nice to see 65B be possible to run on reasonable hardware.
which will drastically decrease VRAM requirements
and not dual-3090s or whatever
it is already possible on LLaMa, OPT and BLOOM and it's great
I bet we are going to see some insane optimizations from the community and medium range consumer PCs will be able to run 13B models in a few months.
next best thing would be the CPP ports (like llama.cpp), which have quantization by default which are good enough
it's slower, but it uses RAM instead of VRAM, and I suppose a lot of people have more RAM than VRAM 🤷♂️
Man, with all these AIs I hate myself for buying RTX3070 instead of 3060 thinking that despite having less GB it would be better for gaming...
And then SD was released 2 months later
(little reminder since it scrolls fast and lots of new people arriving)
An AMA about StableLM will happen <t:1682019000:R> ! prepare your questions !
also, #1098025024541167646 is the new hype channel about all this if you'd like 🙂
https://discord.gg/stablediffusion?event=1098267150726221824
why is the download so slow ? for sd language model from hugging face
i am getting like40 kb
8hrs to download bruh 💀
well
possible that, given the 4 models just got released, HF has a hard time following the huge demand
hey Flex 🙂
I'm cloning it too, I have 8MBytes internet usually, and it goes at 600k right now
but I started downloading quite fast after it just got released, so not sure how their "queue"works
I'm just happy it hasn't crashed yet
looks like we gotta wait tomorrow
cuz even tho ama is finished peoples gonna be rushed to download again
I still need 3 files of 9+GB so... yeah, tomorrow seems right
would i be able to load this with alpaca electron ?
I haven't used alpaca electron. it's a UI for LLMs ?
yep lemme show u where do i show u
it seems people are running it OK using oogabooga in #1098025024541167646 chat
ohk
so I guess it should also work in there
still, the guy that said that, also said it was using 18GB VRAM
Thanks. So I take it by your reply that the Stable Foundation does not have its own cloud based API service, and that you have to install your own or someone else's custom instance?
we do have a cloud base API, yes
dreamstudio
but you are also able to download/run an endpoint yourself, if you so choose
Sorry if this a repeated question but, can someone explain me why during training my model, it doesn't mimic well my drawing style and just draw a buch of abstract lines? Is like it doesn't recognize well the images or something
Hi all.. is this discord just for StableDiffusion, or can I chat about other related things or potential ideas here?
I note with interest some new language models, and the page for them linked me to this discord.
well, we announced our LLM just a few hours ago 🙂 join the chat in #1098025024541167646 !
Yes. And I already managed to get one issue on the github resolved almost immediately 🙂
https://github.com/Stability-AI/StableLM/issues/3 -'free' means 'free'
Unlike some other LLM's (not mentioning any names)
I'm delighted at the strength of responses 🙂
Now to move on... The StableLM relase is just a model, what clients can it be used with?
I was following the development of something called GPT4ALL, and (when I get a new computer) if StableLM can be used with it?
it also released a codebase to run it, but I got it running on oogabooga, and some other users were talking about other tools already
Are there plans to enable StableLM to be tied up to StableDiffusion directly?
What's needed to run StableLM locally?
just to be clear, I'm as much in the unknown as you on all this currently. the AMA tomorrow may help a lot
Can someone help me in VC
I don't see how anything would stop this now
I want to create a LORA model but don't know how to create one.
Can someone help?
sorry Jack, I have almost not the time to even come here right now, I can't. I'd love to when I get the time, but this is a rush day for me because of StableLM
maybe other people in #🔧|finetune
Can anyone else be able to help?
it seems to run on GPU correctly. i run it on 16GB but I am not optimized, people seem to report lot less in the other channel
I'm also trying to figure out if a sutibaly trained StableLM like model could be used on Wikipedia (as an assistant) , Using LLM generated content in actual articles though seems to be not allowed.
you checked the model cards ? they say quite some things in those already
Can anyone help me please?
Thanks! Would you happen to know if images generated with that API are eligible for the Peter Gabriel contest? Or do the images have to be generated with a self-hosted instance?
its 4/20 where deepfloyd
they are. the API is using Stable Diffusion there 🙂
Yudkowsky is holding a Q&A. You can join the livestream on Youtube here: https://www.youtube.com/watch?v=3_YX6AgxxYw We're just waiting on him.
Can stable diffusion be run on an RX 5700?
Hey everyone
I wanna apply for stability ai program and I've got a great idea I'm just unable to find the link
Could someone just guide/redirect me
sometimes i feel like southpark got it right with how we'll interact with AIs. Awesome-o 5000 nailed it
Is this tensorRT pipeline uploaded an hour ago big news? https://github.com/huggingface/diffusers/blob/main/examples/community/README.md#tensorrt-text2image-stable-diffusion-pipeline
I know Lsmith existed beforehand so not sure about the differences
i think it might be good for python2 stuff but maybe and also i have no idea
i'm trying to fine tune a model on 15 images of my friend, using dreambooth and v1.5, but during inference it doesn't actually preserve what she looks like that much.
- cropped the training images to center on her face, 512x512
- 0.000001 learning rate
- 100 steps per img
- used "zkz" as instance prompt (also tried: "a photo of zkz woman"; also tried w/ regularization images from https://huggingface.co/datasets/ProGamerGov/StableDiffusion-v1-5-Regularization-Images)
- 8bit adamw, fp16 mixed precision, xformers memory attention
- everything else default
- i am using "Restore faces" during inference
using prompts like "zkz pro headshot 8k linkedin", it doesn't produce results which preserve her face
also just prompting "zkz" it sometimes just gives an image of a car?!
any ideas what im doing wrong lol
this is a lora right?
no, just fine tuning w/ dreambooth
ah okay
i did use LoRAs for this use case and it was ok but i was told i could get better results by fine tuning
some people achieve some crazy good finetuning i agree
Thanks again. 🙂
that's kind of what deepfloyd is doing
Okay what license is it using?
(Aside: If Stable Diffusion ever decides to opt for a Creative Commons License... (cough cough) )
hello everyone, sorry if my english is not very good, I'm a french director/technician who work in french film industry, I would like to know : Dreambooth doesn't work anymore ? Who replace dreambooth ? With colleagues from the profession we trying to create a movie with stable diffusion for character and gen-1 for landscape
we saw "Rock,Paper, Scissors" and corridor's tutorial and they don't use controlnet and new plugins, we are late of the technology that's why I am here tonight
big
if someone can help us it would be great x)
dreambooth is still a viable option, why do you think you can't use it anymore? Are you having issues with it?
Today, I was trying to generate some large images using Stable Diffusion. Instead of directly using the hires option, I first created smaller images and then used the img2img feature to enlarge them to more detailed larger images.
My question is, are the sampling steps adjusted automatically? I always chose 150 steps, but earlier the command prompt window displayed 76 iterations, and just now, the same image showed 91 iterations in the command prompt window. Could anyone please explain what's going on?
In reality we have install dreambooth but it's still possible to have the same results than corridor ? Because interface is very different no ?
That has to do with your denoising scale. The higher the denoising scale the more steps it will take. You can disable that auto-adjusting in the settings though.
You can get the results corridor got, and possibly even easier with newer tech. Try searching up a tutorial on Lora training. It's faster and can be used on multiple models instead of it being a model in of itself. Or textual inversion is a great option as well. I assume you are trying to keep a consistent character/style. If it is a style, maybe what you are looking for already exists.
for the style we have our own references, we would try to training style and characters for characters have the same style, like corridor you know
I see. Try looking up a lora tutorial. Actually i can find one for you
It would be great ! 😄
thank you for answering my question.
So that's how it is. For the 76 iterations, my denoising setting was 0.5, while for the 91 iterations, it was set at 0.6. In this case, would a higher number of sampling steps lead to better results?
thanks man ! And that its for training charachters and style if we want ?
Typically that is the case, but after so many steps it no longer improves the image.
Yep! You can train pretty much anything with it!
I am trying up upscale a 5761024 picture to 11522048
Incredible, I think i'm gonna love this discord community !
576x1024 to 1152x2048
Glad to have you here!
is there a tiled upscaler available for txt2img (besides highres fix) for auto1111?
it also depends on the sampler..The stochastic samplers like Euler A might get even worse with high number of steps, while deterministic samplers like Heun are just converging and don't improving anymore with many steps. In general I would rather use less steps (like 20-40 steps is often enough), but that might depend on your images (photographies might need a bit more steps than animes or comics)
Not in txt2img but in img2img there is the sd upscale script
may I ask where the "disable auto-adjusting" is in setting?
So Euler A is not good?

