#💬|general-chat
When will the bot be Fixed?
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS= --opt-sdp-attention
set TORCH_COMMAND=pip install torch==2.0.0 torchvision --extra-index-url https://download.pytorch.org/whl/cu118
set "TRANSFORMERS_CACHE=%cd%_cache\huggingface\transformers"
set "HF_HOME=%cd%_cache\huggingface"
set "XDG_CACHE_HOME=%cd%_cache"
set "HF_DATASETS_CACHE=%cd%_cache\huggingface\datasets"
call webui.bat
#1047610792226340935 it's not broken, just disabled, no ETA on return
ok cool, just eliminating a change of a1111 version causing the issue
Both lora and lyco have the same issue
for some reason --medvram has actually made it go faster
up to 2 it/s from 1it/s
taking off --medvram and adding --xformers put it up to 2.8it/s
why do you have all that stuff in your webui-user.bat ?
Trying everything I can to fix my generation issue
copy pasting random stuff from google without understanding it is probably not a good idea.
For sure
I'm at the end of my rope
I know I don't know enough so I can really only go by trial and error at this point
start back with a clean .bat
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=--xformers
call webui.bat
delete your venv folder
and launch
Finished reinstalling after deleting, 2.78it/s with loras
9.53it/s without lora
let s move to #🤝|tech-support
I can't generate images, I get an error: "Something went wrong, try later"
It was like this yesterday, and still is now
Have the bots still not been resurrected?
I have encountered an error with DreamStudio while generating an image. The error message displayed is 'Something went wrong, please try later.' I have been experiencing this issue since last Sunday, up until 15:00 UTC. Could anyone provide any suggestions or status updates? #🤝|tech-support
CC @sour lintel
Having a problem again with creating a new face with InPainting and ControlNet Depth (midas and Zoe). For whatever reason SD creates an odd shadow over the face, and this seems to happen every time, regardless of what settings I use. Ideas? Tried a couple of different checkpoints, no change. I'm using the SDXL pipeline all the way, including ControlNet models.
Can you show an example in #🤝|tech-support ?
Please help ✌️this has been happening for 5 days now 😩
Good morning! How is everyone?
hi everyone, is there a tutorial somewhere of how i can run stable cascade locally on my GPU?
I am looking for that too!.. the sample code on huggingface did not work for me..
Is anyone aware of a machine learning model that is capable of generating a Stable Diffusion-compatible prompt from an image description?
Pretty decent! Any idea who may be able to help out those dreamstudio users among us that have been unable to use the tool since Sunday? Everyone who tries to use it is getting hit with a “error on our end” message
Actually, it appears to be working again lmao
Yup! DS is working! 🙂
If you have an issue, I would recommend that you e-mail support via the website!
Anyone have a guide for altering python code from Stable-diffusionXL to stable-cascade?
Does anyone have any good material on getting started with the newer version of dreambooth? All the tutorials i can find are a year old and seem a little outdated
Try ChatGPT
Dreamstudio seems to be working. Just tried it and no errors
Are there some decent web based stable diffusion text to image generators?
I am checking some, pricing etc
ok found some Xd
Why almost no one talks here? 🙂
Wonderss
When will stable diffusion be back on discord?!
Same Dream is working
someone said comfy UI now has support for Stable Cascade but i don't see it working
it throws error trying to use official workflow
has anyone found a way to use it in Comfy ?
sd fills up my vram sometimes and hangs... interrupt button doesnt do anything i have to kill the cmd
it doesn't work
works for me.
produces errors
make sure to update
hello guys i have a question
do stable diffusion ai plugins, like the one for krita, run online? so like i connect to a big ass server and then the server makes the image and sends it back to me. so like all images are made off my pc and just sent to me?
oh i see those are different files
also why it doesn't use stage a ?
stage A is baked into b or c or what ?
A is baked into B
A is the VAE and i dont know why they gave it a ridiculously non descript name like "stage A" this time. they made the naming convention SOOO much worse
Are you asking about plugins for krita, or krea.ai?
yeah the github repo. like is all that downloaded stuff just doing a connection between me and a server
and it isn't even an intuitive order. like, stage c loads first and a is last? what?
Which github repo?
the one from acly
Ah right - the krita one
yeah my bad
Immediate results from this research model reveals that the naming convention is just SOOOO bad and they shouldn't do that anymore.
So theres two options with that. You can either use it locally which means the number crunching happens on your pc and nothing happens in the cloud; or you can use runpod which is a cloud pc and have it do the number crunching and it returns you the result
i just did the first option which said krita ai will download locally onto your pc rather than the second option which said id manually do it. so did i pick the cloud option lol
local server managed by krita
Yep - it creates a folder on your pc in the krita extensions area, creates an interface for doing the number crunching (its comfyUI under the hood) and uses the web server from that interface to do the work
damn it man comfyai going to be storing all my data
is it hard to do it locally?
comfyui is local
Comfy is a local install - there is no comfy server collecting anything
oh right but u said it uses the web server
haha local web server
It makes its own web server on your pc
i can see how that is confusing
damn im not knowledgeable at this
why would it create a website
that is local
isnt a website meant to connect with the internet
Nope - website is just a way of accepting commands and returning results. In this case the commands are local instructions from krita and the results sent are the images your pc generated
#oversimplified
so if i didnt connect to internet, can i still generate images?
Only need to connect to the internet to download models
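To make the "local web server" point above concrete, here is a minimal Python sketch. It is not how the Krita plugin or ComfyUI is actually implemented; it only shows that a web server can live entirely on your own machine. The server binds to 127.0.0.1 (reachable only from the same PC), receives a request, and replies, with no internet involved. `GenerateHandler`, `ask_local_server`, and the `/generate` path are made-up names for illustration.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class GenerateHandler(BaseHTTPRequestHandler):
    """Hypothetical handler: accepts a prompt and echoes a fake result."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        request = json.loads(self.rfile.read(length))
        # A real backend would run the model here; this sketch only echoes.
        reply = json.dumps({"status": "done", "prompt": request["prompt"]}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(reply)))
        self.end_headers()
        self.wfile.write(reply)

    def log_message(self, format, *args):
        pass  # keep the console quiet


def ask_local_server(prompt):
    # 127.0.0.1 is the loopback address; port 0 lets the OS pick a free port.
    server = HTTPServer(("127.0.0.1", 0), GenerateHandler)
    port = server.server_address[1]
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        request = urllib.request.Request(
            f"http://127.0.0.1:{port}/generate",
            data=json.dumps({"prompt": prompt}).encode(),
            headers={"Content-Type": "application/json"},
        )
        # An empty ProxyHandler makes sure the request never leaves the machine.
        opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
        with opener.open(request) as response:
            return json.loads(response.read())
    finally:
        server.shutdown()
        server.server_close()
```

Calling `ask_local_server("a cat")` works with the network cable unplugged, which is the same reason generation keeps working offline once the models are downloaded.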
so i downloaded 10gb of models. which is enough to generate any image i like in the whole universe on my pc based on that model?
this is interesting man
i thought 10gb wasnt even enough to download a movie
Well you can throw any combination of words at it and it will try and render what it thinks that should be, but some models dont know things just like certain people dont know things
You can get a movie in 700mb these days fyi
The model isnt a collection of images, its a collection of numbers used to do calculations
and its all local so theres no filters or anything etc. why doesnt everyone just get this then instead of paying like 30 dollars for krea ? lmao
like an instruction manual i guess?
Because unless you have a great gpu its pretty slow
oh so they pay for the cloud i guess
Think of it as a really giant maths equation
damn lol thanks man youre really clever at this
or a little brain
that only the computer can access
all this time i thought these models worked with like massive gpus and you couldnt even use them on the best consumer pcs
They used to be that way until about 2018
what changed? the models got very smaller or something?
GAN algorithms happened
oh i thought these used lcms or whatever they were called lol
Turns out math and IT bachelors are not just solving 2nd degree equations and delivered something
Will Smith is entering such a new comedy phase of his life. he imitated that spaghetti video! ITS SO GOOD i thought it was AI!
whats the spaghetti video
yo whys this ai making like 240p photos lol
nvm i think im meant to upscale idk. idk any photo terms
im a little bit new to SD i have sd-v1-4. how do i get multiple versions
I got baited by it too 🤣
go to civitai.com and find a newer one
Genuine question, for those running higher end GPU's what's your average wait time with SD? Rather new to AI art generation so curious lol.
is there a prompting trick to have the character face a certain direction?
seems like it only vaguely understands when i put something like 'facing left' or right or away from the camera and everything by default faces forward
will bot ever return?
Might depend on model? Side view, facing away, usually consistently gets basically not front face
Or looking away, that usually turns the eyes not the face
But that's for models I use so idk if these keywords r universal across all models
i'm trying in cascade. i think it depends on what i'm asking it to do. if its standing/walking it does better in different directions
@novel ocean
I doubt the discord AI system will return, I think they turn it off to force money payments.
Yea idk about cascade, that works for sdxl and SD 1.5 based models
epic shit but how do i join dreambot discord fr
you feel me
thanks. i just tested out the helloworld xl model and it does follow what i want a lot easier with the same prompts - seems that model was maybe focused on that type of thing
Hmmm interesting 
force lol. it was just a pain to maintain i'm sure. they got an api anyone can build their own bot off now.
the sdxl bot was only for helping select which model remained
the bot would have been costing them $$$ to run, and the gpu has probably been swiped to train newer models
It's also shared with multiple other projects they are working on currently: llms, audio models, biomedical projects, etc.
I wish it was possible to share some "gpu power", something like nvidia folding home to train open source models using the community gpus
does anyone know how to set up a good face swap? willing to pay $
u can prob do that if u ask nicely to people
"can i use ur gpu to train this ai"
could anybody explain the difference between faceswap and lora training
faceswap pushes the face in as the generation is being done, lora training teaches the model that if I put the persons name in the prompt it will put out an image of that person
Thx
Does hi-res work with animatediff? it always just crashes for me
hey guys can i ask if anyone here's on forge webui - and got cascade extension to work in there?
Midjourney can produce that trendy "Dark Fantasy" style.. I can't post pictures here sadly.
Anyone have any ideas how to replicate this with stuff from Civitai on Auto1111 or ComfyUI?
I'm trying to use this https://platform.stability.ai/docs/api-reference#tag/v1generation/operation/masking, but it generates a completely different image and it seems to be ignoring the black masking. Any ideas?
What's the best smaller 9:16 ratio before 2x Hires. fix upscale?
dang what's with half the loras requiring a civitai account to download now
even ones that were free to download without one before
It's crazy but I actually get faster speeds after I'm done gaming then switch to SDXL.
It's like it's warmed up or something
like it's annoying as heck
did something change recently?
ugh, it's looking to be more than half the loras now
out of like 12ish I tried only 3 were able to be downloaded without needing a civitai account
what's the problem with making an account?
half the ones that now require an account didn't like 3 months ago
also I don't want to have to sign up just to download a few loras
whats wrong with signing up?
i love how he didnt answer the question whatsoever
Does anyone know what's coming after cascade? According to Emad's comments on reddit stability is cookin' something good
Is access to generate images now gone? Logging in after 9 months!
I don't want to have to sign up just to download a few loras
I already said why
womp womp
hahah
try to search the name of the lora on google, sometimes ppl reupload them to huggingface
thanks 👍
but bro, I literally answered your question
how is it "unreasonable" to be annoyed that it requires signing up to download half the models that could be downloaded without doing it just a few months ago
since you seem to be obviously confused i will help you out
i asked you what the problem was with making an account for a few loras. the answer is that you are blowing this minor inconvenience wayyy out of proportion. you put more effort into whatever this tantrum is
also this is the end of our conversation consider yourself blocked
are all the people in this community like this?
not all of them but a few are one of those redditors that like to poke ppl for fun
ty for the gold kind stranger 🪙
btw is there a reason why most of the loras require login now?
did the community get spammed or something
is it an anti-scraping defense?
the reason I am confused is it wasn't like this a few months ago
yea some lora creators dont like it when ppl upload them to other place so they lock them
ok
thanks for an actually straight answer
yeah I will probably just not use those loras then
this convo basically proved my point why I don't like to interact with the community that much
Sure, I'll send an image there
Why is the bot down and how long is it gonna take to start working again?
gmgm
Heyyy is anyone awake?
any idea when the bot will start working again?
and which other image generators can i use for the time being that are as good?
IDK but i am pretty new... I dont even know how to download sd :///
Can u help me about it pls?
Hey guys i have a question about stable cascade.
The website says following about the licensing:
Today we are releasing Stable Cascade in research preview, a new text to image model building upon the Würstchen architecture. This model is being released under a non-commercial license that permits non-commercial use only.
But the GitHub repo is under MIT license.
So what is correct?
Hey guys! You might want to checkout Picasso on practal.ai if you want to use another UI.
Hey there it's me, mister "non legal advice". To me it seems like the training and inferences scripts are MIT but the model itself is still non commercial / research only.
Hey man, thanks for your fast reply.
I overlooked the weights license on git. Agreed, the model seems to be non-commercial only.
model seems to be research only
i got this error when i get first sample of train
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 252: invalid continuation byte
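A hedged sketch of one common cause of this error: byte 0xe9 is "é" in Windows-1252 and Latin-1, so a text file saved in one of those encodings fails when it is read as UTF-8. Whether that is what happened in this training run is an assumption (the file involved isn't shown), and `decode_caption` is a hypothetical helper, not part of any training tool.

```python
def decode_caption(raw_bytes):
    """Decode caption bytes as UTF-8, falling back to Windows-1252.

    Assumption: files that are not valid UTF-8 were saved by a Windows
    editor in cp1252, where 0xe9 is the letter "é".
    """
    try:
        return raw_bytes.decode("utf-8")
    except UnicodeDecodeError:
        return raw_bytes.decode("cp1252")


# b"caf\xe9" is "café" in cp1252 but is not valid UTF-8.
legacy = decode_caption(b"caf\xe9")
modern = decode_caption("café".encode("utf-8"))
```

If this is the cause, re-saving the offending caption or config file as UTF-8 in a text editor is usually the simpler fix than changing any code.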
does anyone know where I can get that standalone tag manager from?: https://www.youtube.com/watch?v=RT2jj-5t8x8 @ ~5:00 min
I can only find is https://github.com/starik222/BooruDatasetTagManager and that doesn't explain how to use it. There is no .exe or anything to run it
Is there any folder where my generated images are?
ahh found an exe: https://github.com/starik222/BooruDatasetTagManager/releases/tag/v1.6.3
doesn't work with wine T_T
Hi guys how can I use stable diffusion pls
yes
The code on the github repo is MIT license, the weights on HF are under the research preview license 😁
where is stable diffusion v1 channel?
Does anyone know a tag manager for images that works in linux? Something like: Stable Diffusion Tag Manager or BooruDatasetTagManager.
wdym
explain
there are couple of them available
u can use fooocus, there are plenty of style models integrated
just check youtube for tutorials
which UI are u using? there must be an "output" folder in there
what are alternative websites of civitai?
if u cant find what u are looking for there, u are doing something wrong
oh you are one of those.. okay
Ull need the models from the site to run them in a external site or SD
wdym? there is literally everything available - i am using the base model, without any refiner or anything else on top and i can create whatever i like. stop getting offended, explain what u are trying to do and we will help
U could use paying sites if that’s what you are looking for
Same
^^
@teal osprey explain what u are trying to do and we will help mate
In terms of stable diffusion models, civitai is THE repository. Huggingface has a few but its very limited
thanks. also, do i put everything that i download from models to models>stable diffusion folder or are there any other exceptions? iirc someone was saying that if the file is lower than X gb we should put it somewhere else but i might be remembering wrongly
models- and then "checkpoints"
there is no such folder there. i put mine to stable-diffusion and can select them on checkpoint selection dropdown menu
which UI are u using
Fooocus? ComfyUI?
i have no idea
what did u download to use sdxl
link me the tutorial or whatever site where u followed the instructions to setup sdxl
check this out, everything is explained there, if u still have any issues, come back
and it seems like, u are kinda "new" - i can recommend you, using "Fooocus" instead of automatic, there are a lot of styles and models integrated and easier to use
@teal osprey https://www.youtube.com/watch?v=zIhODzEVZqg&t=456s
i would recommend you, to switch, right away
if thats easier to use why not
easier and faster
i want to learn to use this program for trying out different facade designs on works of architecture and also to create videos with deforum that are also architecture related
I can recommend not using online image generation tools...stick to something SD
wdym its just using the browser for the UI
i tried learning controlnet for those facade renders but looks like i need to learn alot more
believe me, u can do whatever with fooocus
OK. Because Gemini is engaged in heavy censorship, erasing history.
was there a backfire? i havent checked the news today yet
what do you mean backfire?
like "roasted"
No idea
k
I tried generating an image of George Washington - it won't generate the image : Gemnini, that is
all censored anyways, yeah
I doubt there will be mainstream backlash.
they think they can "censor" something. lmfao, they forgot the people running anything on their own machines
Im going to try fooocus. I have been trying to get controlnet to work in SDNext, with no results
SD or any run at home can decide to censor too
for now, it isn't
imo - and i am prompting shit 24/7 (not even kidding) - fooocus is the best way to go and its faster than anything else
u can prompt previews, to know if u are going in the right direction, 32 images in seconds
SDNext claims to work with control. However, when I try it, it just zips my GPU to 100% and hangs
Im using A1111, but I can't get controlnet on there beyond pose, depth, and canny, and other than pose, they make weird images with weird colors, etc
dont want to offend any A1111 user, but its shit
It's been a nightmare since SDXL released. It was miles backwards, and it still hasn't caught back up
go mate
u'll have it set up in minutes
I am DLing now
lfg
can I select the models I want for it
just check the video, there are shit ton of styles integrated
yes, but can I choose the model to use
ofc u can
can i use controlnet and deforum with focus installed?
it depends.. if its a full model (likely to be 2gb or ~6gb) then its in StableDiffusion folder, loras need to go in the lora folder, and upscalers go in the ESRGAN folder
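The placement rule above can be sketched as a small lookup. This assumes the default A1111 folder layout (`models/Stable-diffusion`, `models/Lora`, `models/ESRGAN`); other UIs such as Fooocus or ComfyUI use different folder names. `MODEL_FOLDERS` and `destination` are hypothetical names for illustration.

```python
from pathlib import PurePosixPath

# Assumed default A1111 layout; adjust for other UIs.
MODEL_FOLDERS = {
    "checkpoint": "Stable-diffusion",  # full models, typically ~2 GB or ~6 GB
    "lora": "Lora",
    "upscaler": "ESRGAN",
}


def destination(webui_root, kind, filename):
    """Return where a downloaded file of the given kind should be placed."""
    return PurePosixPath(webui_root) / "models" / MODEL_FOLDERS[kind] / filename
```

For example, `destination("stable-diffusion-webui", "lora", "style.safetensors")` points into the `models/Lora` folder.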
also whats up with seed option? do i need to write any number there or keep it at -1? or do i spam random numbers
A seed is the number that determines the underlying noise the system starts its calculations from. if you use the same seed number for the same shape and resolution (e.g. 1280x768) then it will start from the same point, allowing you to recreate old images/explore how the prompt changes the image
but not identically on another computer 🙂 seed random is super similar but doesnt translate generally.
my computers fuzz is similar to yours but not the same.
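A toy illustration of the seed explanation above, using only Python's standard library. It is an analogy, not the actual Stable Diffusion sampler: the real starting noise comes from the GPU/ML library, which is where the machine-to-machine differences mentioned above can creep in. `starting_noise` is a hypothetical helper.

```python
import random


def starting_noise(seed, width, height):
    """Build a height x width grid of Gaussian noise from a seed."""
    rng = random.Random(seed)  # same seed -> same sequence of numbers
    return [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(height)]


same_a = starting_noise(1234, 4, 3)
same_b = starting_noise(1234, 4, 3)   # same seed, same shape: identical noise
other_seed = starting_noise(1235, 4, 3)   # different seed: different noise
other_shape = starting_noise(1234, 3, 4)  # same seed, different shape: different grid
```

Leaving the seed at -1 in the UI just means "pick a random seed each time"; typing a fixed number is only needed when you want to reproduce or vary a specific image.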
can i use a guide image for inpainting rather than prompts? i want a specific type of image on the painted area to be stitched. is that possible?
yes
what gpu has the fastest training time for dreambooth fine tuning?
fooocus doesn't let me select a model
just search for fooocus inpainting tutorials on youtube
activate "advanced" on the bottom
then on the upper right, models
and be sure having the model in the checkpoints folder
I did, it only has one model despite me adding models to the dir
should i download each model file for controlnet on hugging face? the ones that have 1.5gb for each model or are there any other way to get the models for it
Only download em as you need em..
mates, i really love to help, but most of your questions are solvable by searching on yt - "fooocus WHAT U ARE SEARCHING FOR"
xoxo
good grief, foocus is sooooooooo simple
that is the idea of it.. take the complexity out of the system so that you can fooooocus on the prompt to make good art
add a prompt, hit generate, after a minute ERROR
that sounds like any interface tbh.. 😛
I have been away on holiday for a bit and just returning now - I noticed the old BOT1-10 channels have disappeared. Is that hidden under a new role or a different channel?
what holiday...
a personal vacation
Looks like the bot channels are still down for maintenance... #1047610792226340935 message
Got it to generate images. Foocus doesn't have negative prompts. The output without certain negatives is bad, at least for the images I am generating. Negative is needed. For example, SD tends to make men big muscle men. If there's a prompt for athlete - I am trying images of tennis players - they look like body builders playing tennis unless it has a negative like "beefcake", which removes that big muscle magazine model look
Yeah fooocus not best option once you understand whats going on with stable diffusion - too limiting
the same prompt, with negative, gives me tennis players who look like athletes in A1111. In fooocus, I get topless women who look like bodybuilders
To Foocus, I say Hells to the Naw Naw Naw cus: https://www.youtube.com/watch?v=PB4Nby2Ai-g
Hi! Im new to using Stable Diffusion and i have a couple of questions/problems with the GUI. Is it ok if i ask them here?
whats the worst that could happen
Hello all I need some face swap help please can anyone help me please just some doubt
of course
absolutely nobody
how do i report shopify accounts?
Hi where is the channel for use / dream please ?
#1047610792226340935 read there
anyway, gui questions
So what exactly is dropping on “22 02 - 24” (the date in the Xeet by @wise stratus )? Commercial-use weights for Stable Cascade? Stable Video XL?
can someone tell me what the score_9, score_8_up, score_7_up, score_6_up tags do?
ponydiffusion model*
In the dataset they had ranked the images by aesthetics, so they have a score_9 if they were in the top bucket, score_8_up if they scored 8 or 9 and so on.. essentially the tags make sure that you get a high quality image
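A small sketch of the tagging scheme as described above. It is based only on this explanation, not on the model author's actual training code, and `quality_tags` is a hypothetical helper. The thresholds 8, 7, and 6 come from the tags named in the question.

```python
def quality_tags(score):
    """Return the score tags an image with this aesthetic score would carry."""
    tags = []
    if score >= 9:
        tags.append("score_9")  # top bucket only
    for threshold in (8, 7, 6):
        if score >= threshold:
            tags.append(f"score_{threshold}_up")  # "this score or better"
    return tags


top_image = quality_tags(9)
decent_image = quality_tags(7)
```

Under this reading, putting all four tags in a prompt steers toward images from the top bucket, since only those carry every tag.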
i would assume since your model is based on a 4chan type board's images* those are tags to denote quality.
also thats the second time someone asked that exact question, is it an actual question or astroturfing
actually, almost word for word. i AM going to assume astroturfing
hello, i'm wondering if anyone can point me in a good direction for figuring out how to implement or generally use the generative-model?
what would you like to do?
Thanks! My questions are these:
you know... make an image? or is this the wrong project?
1- Im having problems with automatic1111
specifically this:
venv "C:\Users\Usuario\Downloads\Stabble diffusion\stable-diffusion-webui-directml\venv\Scripts\Python.exe"
fatal: No names found, cannot describe anything.
Python 3.10.6 (tags/v3.10.6:9c7b4bd, Aug 1 2022, 21:53:49) [MSC v.1932 64 bit (AMD64)]
Version: 1.7.0
Commit hash: 601f7e3704707d09ca88241e663a763a2493b11a
Traceback (most recent call last):
  File "C:\Users\Usuario\Downloads\Stabble diffusion\stable-diffusion-webui-directml\launch.py", line 48, in <module>
    main()
  File "C:\Users\Usuario\Downloads\Stabble diffusion\stable-diffusion-webui-directml\launch.py", line 39, in main
    prepare_environment()
  File "C:\Users\Usuario\Downloads\Stabble diffusion\stable-diffusion-webui-directml\modules\launch_utils.py", line 560, in prepare_environment
    raise RuntimeError(
RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check
Press a key to continue. . .
No names found and torch is not able to use GPU
pls post that in #🤝|tech-support
Ok thanks
ill meet you there thanatos 😄
Are there any good pixel sprite models other than retro diffusion?
plenty.. heaps on civitai
https://civitai.com/models/277680/pixel-art-diffusion-xl probably the best one
I would use this instead: https://github.com/AUTOMATIC1111/stable-diffusion-webui-pixelization
depends on the use case, but that is a cool extension 🙂
I'm just saying it's an extension with models you can download to pixellize the image in post.
Understood.. I'm just wondering if there is a difference trying to generate it with a model trained to make blocky art, versus one that is trained to take art and make it blocky
I'm guessing the former doesn't try and throw as much tiny detail in
whereas the other one would be trying to pixelise that detail and may not end up with a better result... dunno - never tried comparing them
From my experience if you try to do pixel art, the blocks usually aren't square so you might want to put it through that extension anyway. The style is likely different though.
We had a pixel art contest at one point and some people used models and some used that extension but I can't seem to find the thread anymore.
they got rid of the pow channel
but yeah, I was here for that and made a lora from the resulting images 😛
I used the huggingface space to achieve the same effect
Rip pow channel.
All we got is dailies now.
I think I'm currently winning this daily 🙂
I've done pixelart with models, can work good. Upscaling is tricky.
I also tried the Extensions, but I prefer the direct image generation
I mean it's good at pixel art for sure, the grid does have a "homemade" look to it since it's not pixel exact. It's fixable with that extension though.
Yes the generated pixelart has multiple square resolutions while the extension makes them all the same
Last time I used the extension you could adjust the pixels resolution too, but I haven't used it in months.
stable diffusion
that is why we are here yep
thank you
xbox please make me a elon munk making a sandwitch on the twitter, best quality, 4k, nsfw
xbox... ?
I have been trying to understand the various policies regarding image generation and have run into a problem. I want to depict events that happened but were not recorded. Some of these depict war or catastrophic events and they don't seem to make it through filters. Here is an example prompt "scene of a roman crucifixion where everyone is going about their business as if this happens every day". That is blocked by Bing. Or "Adolph Hitler and Winston Churchill seated at a table playing a shell game while Josef Stalin and Franklin Delano Roosevelt watch". That never happened obviously, but it's purely metaphorical (you could imagine it as a sketch in a newspaper in 1943 accompanying a story). Suppose I want to tackle serious topics as an artist? How is this usually done?
Any text to image application will apply all the text to all of the image so usually more complex processes would require you to assemble the image more precisely through tiled diffusion: https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111/
Basically it works like this: you select a part of the image and apply local text to that area (frog, pitcher, flowers, man reading book, etc) and this will blend the sections together into a single image. This is an extension for automatic1111, but there's ComfyUI versions of tiled diffusion out there.
ok. I will look into this. Thank you!
is civitai being slow for anyone ?
it's slow for everyone....
Hello all, new here, hoping someone can point me in the correct direction. I am looking to do some AI generated images for my niece's birthday, but am a complete noob when it comes to AI image manipulation and generation. Where is the best place to get some advice on how to go about this? My PC has already proven inadequate to do the images myself using stable diffusion, so i am at a loss as to where to go to even figure this out. Any help would be greatly appreciated.
I was looking for a gen room myself ,been away from creating stuff lately everything changes so fast I have a hard time keeping pace
Hello everyone, does anyone know of a good tutorial or workflow they can share/recommend for using stable diffusion to generate videos? I have seen some tutorials where videos are converted into png files and the files then transformed using img2img, but I have not come across a video that explains the process in detail. I would be very grateful if someone could help me or recommend resources for this. Thanks. I am using stable diffusion automatic1111 on google colab
I see, thanks
No worries
Does anyone know how to make the last frame of an animation in deforum the same as the first frame
Deforum, animatediff, and stable video diffusion are the tools you are after, the latter two are good for looping gifs
@ionic locust as well: options for online generators (some are free tier, some use token systems) = civitai, clipdrop, dreamstudio, pixai, tensor.art, aiscribbles.com, tinybots.net/artbot, leonardo, mage.space, aaai.eu, shakker.ai
ahh bugger was hoping to loop it in deforum as I've already set everything up with prompts and keyframes
There are also cloud pc options to run your own ai generator in the cloud - runpod, vast.ai, rundiffusion, thinkdiffusion, paperspace, aws sagemaker, google colab
and practal.ai
Thanks
Never heard of it 🙂
And as you can tell, Ive seen a few in my time - oh forgot happyaccidents.ai as well
Dont know enough about deforum myself to help sorry. There is a really good tutorial series on civitai doing an intro to deforum by Harrowed
yeah I've seen that one, it's a good one!
Please check it out 🙂
Will do
Any ideas how playground.ai does its inpaint masking?
Probably same way as a1111 or fooocus does it?
Any suggestions? SD's REST API masking seems to be a little quirky and returns a completely different image every time, ignoring the mask.
it returns this kind of results https://imgur.com/a/DMoypaZ 1st image: mask 2nd: init_image 3rd: result
Oh - you asking about using the mask with the stability API?
Id have to look at the docs later to see if its even possible
gmgm
There are also cloud pc options to run your own ai generator in the cloud - runpod, vast.ai, rundiffusion, thinkdiffusion, paperspace, aws sagemaker, google colab
Thats what I said - are you a bot…
Hello
somebody can help me to draw a scheme diagram about macromolecule material?
Artistically or accurately, this is an art sub??
I've trained a LoRa model for logo on Runpod, but I'm not sure it learned properly. Sometimes the model generates the exact logo, but it seems more like luck. I trained it using 7 high-resolution images. Here are my parameters:
Steps: 20
Epochs: 10
Batch size: 1
Rank: 128
By the way, I also did captioning and I use the SDXL base model. What could be wrong? Is it the dataset or the parameters?
More steps? Try with 70 or 140?
#🔧|finetune may have tips for you as well
How do I get rid of Shiny plastic skin in the Anime styled pics I make?
Can you show an example in #🏞|general-with-images ?
please fix the bot already im bored
Bot is offline so the gpu can be used to train better models - there are plenty of other generators online - start with the one on Civitai.com - it's free at the moment
Is there anyone who can build me a Workflow in ComfyUI? I will pay for it to make it happen, please DM me <'3
Good morning, everyone! How are we all today?
bring back up please
🙏
If I use stability.ai's REST API for text-to-image generation, will adding LoRA placeholders to my prompt text help, or are they just ignored, such as "lora:AndyLau001:1"?
Hey how many nodes you need?
If you like I could create you a custom one
Puh i dont know. As many as the workflow needs to be perfect
I think they are ignored, that is the automatic1111 syntax
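If the tags really are ignored, one option is to strip them before sending the prompt anywhere that isn't automatic1111. A minimal sketch, assuming the usual `<lora:name:weight>` form; `split_lora_tags` is a made-up helper name, not part of any API.

```python
import re

# automatic1111-style tag: <lora:name> or <lora:name:weight>
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::([0-9.]+))?>")

def split_lora_tags(prompt):
    """Return (prompt without LoRA tags, list of (name, weight) pairs)."""
    tags = [
        (m.group(1), float(m.group(2)) if m.group(2) else 1.0)
        for m in LORA_TAG.finditer(prompt)
    ]
    cleaned = LORA_TAG.sub("", prompt)
    cleaned = re.sub(r"\s+,", ",", cleaned)      # drop space left before commas
    cleaned = re.sub(r"\s{2,}", " ", cleaned)    # collapse doubled spaces
    return cleaned.strip(" ,"), tags

cleaned, tags = split_lora_tags("portrait of a man <lora:AndyLau001:1>, studio light")
print(cleaned)  # portrait of a man, studio light
print(tags)     # [('AndyLau001', 1.0)]
```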
Heya, I am curious if there is a specific thread for self-promotion or offers for commissions and such here guys
SD V3, whos excited?
Need more examples of what it can do 👀
seems to generate text pretty well though
Yeah but the "safety" part is concerning
If it's just some check in the code like it was in 1.x then it's fine. If the model believes nothing exists under the clothes it's a different matter...
Though that also could be a sort of conspiracy that if the base model can't do NSFW it's impossible to fine tune it into it
It probably is possible, just needs more work
I think that's the worst aspect of "AI safety", the utter vagueness of it. Because if you say explicitly what you consider unsafe and will do your best to prevent that unsafety from happening, it might cause even more damage.
So excited !!
i wouldn't mind if it couldn't do NSFW
SD 3 will eventually be open sourced right
what i'm alarmed about is the closed initial release + the trend toward centralization + censorship
one can pray
It's not about personal preferences, it mainly hurts adoption
They say so, yeah
none of the models could do nsfw, that was trained in later by users
im saying in general, i really think it would be better if it couldn't by default
Well, same happened to SDXL though, closed beta release first, then it leaks 😅 and then open for everyone
Yeah, that's the funny part. I don't know why people say in unison that 2.x is censored and can't be trained for NSFW, is that just a rumor that was repeated enough times for everyone to believe it's a fact?
fair point
I never did full scale training, only loras, so I don't know how actually hard it is to add fundamental novel concepts such as breasts if the base model doesn't know about them at all
I'd love to run a huge model that takes up all my 24 Gb and the output quality is just as increased
where has that been posted?
Nowhere, it's not open yet
Not out yet, there’s a preview waitlist
the waitlist seems a bit bugged btw, there is no feedback if it actually got submitted or not, so hard to tell if you actually signed up
why isn't sd3 in #📣|announcements
i got feedback
what did it say?
i cant put an image here?
Thank you for joining the Stable Diffusion 3 early preview waitlist. You'll be notified by email with an invite to our Discord server when you've been granted access to the preview. To learn more about Stable Diffusion 3, visit our blog here.
surely i get access tmrw for being so quick
multiple people didn't get any, weird
it blocks you from submitting the form again, but can probably refresh and try again
yeah try deleting cookies
fine letting others be the early adopters, took forever for xl to get controlnet, community contributions, etc, it was painful for a while
you mean you wont bother with waitlist?
yah, why bother?
fair enough
Still don't have inpainting and tile CNs for SDXL btw...
It's been 84 years
I did it within 6 minutes of announcement 😀
There's a waitlist?
Diffusion transformer eh
https://pixart-alpha.github.io/ Pixart-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
This just got released as an open model with weights on Hugging Face + diffusers support
SD3 be like "Our safest safety minded safe model yet, here at Stable Diffusion we really care about safety, and so we're so happy that we can finally introduce a model that has safety in mind. Please be safe"
a
I think that's a great thing to say
We want Stability.ai to thrive and prosper
And if statements about safety are what it takes, then they should say it 100%
hello newbie here, what are the best settings for an rtx 3060 on stable diffusion
lmk when u got it we'll try to compare
wdym best settings, and which model
i.e. do u want performance or quality
Hello everybody, i am looking for a free uncensored image generative AI, is anyone aware of one? what happened to unstability.party ? Thank you all
The new model looks fricking awesome
And why are yall complaining about the safety? It took like 5-6 months at most for SDXL to be able to generate similar stuff to 1.5.
The main question I have is that it seems we are going to have like a whole bunch of models at different parameter sizes.
Stable diffusion is uncensored
I don't know what lmk means 😕
That will probably split the community up too much. I wonder if the LORA's or CN's trained for one would be usable on the other ones.
Breaking News https://stability.ai/news/stable-diffusion-3
Half of the community still cannot adopt XL due to hardware limitations
Yeah, stable diffusion is releasing their models for free
yes, but i dont have a computer to run it locally, i was using unstability.party but the website is down now, cannot find a similar tool...
What more do you want? The super duper censored DALL-E and Google Bard generators?
At least with open weights you can fine tune
can any model create a cyberpunk cat?
https://huggingface.co/spaces/PixArt-alpha/PixArt-alpha
Type cyberpunk cat and generate
You can try it here, select Realistic Vision because it is the only uncensored one
https://fumesai.web.app/img
Type cyberpunk cat and generate
omg stable diffusion 3
ikr
.
This scares me
the biggest SD 3 will be bigger than SDXL
How can I access it?
Sign up for beta
i want it so bad
where
I wonder what the spec reqs will be
well its between 800m and 8b params
so like anywhere between 6gb to 24
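A back-of-the-envelope check on that range. This only counts the diffusion model's weights; text encoders, the VAE and activations come on top, and the per-parameter byte sizes are the standard fp16/fp32 widths rather than anything Stability has announced.

```python
def weight_memory_gb(num_params, bytes_per_param=2):
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9

for name, params in [("800M", 800_000_000), ("8B", 8_000_000_000)]:
    fp16 = weight_memory_gb(params, 2)
    fp32 = weight_memory_gb(params, 4)
    print(f"{name}: {fp16:.1f} GB in fp16, {fp32:.1f} GB in fp32")
# 800M: 1.6 GB in fp16, 3.2 GB in fp32
# 8B: 16.0 GB in fp16, 32.0 GB in fp32
```

So the 8B weights alone would sit at about 16 GB in fp16, which is why quantization or offloading would matter for 16 GB cards.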
I was thinking about purchasing a 4070tisuper
Signup here: https://stability.ai/stablediffusion3
gn
Huge news, congrats team!
SD 3 is a transformer diffuser, someone else also just recently open sourced weights + demo here: https://huggingface.co/spaces/PixArt-alpha/PixArt-alpha
For also a transformer diffuser
Now I am either gonna wait until the 5000 series or gonna purchase a 4090, depending when the model drops.
Go big or go home, gotta have that 24gb
I only have 16gb vram
8B model is like 20% bigger than SDXL
unfortunately
SD3 8B model*
Yeah I just wonder if Nvidia will have 24gb vram with any of the lower models
Ooh, then even the 8B model should be runnable with 16gigs given enough optimizations
Maybe
It is unclear if the transformer part of diffusion transformers will mean higher memory use
As transformers have quadratic memory cost w.r.t. resolution
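To put numbers on the quadratic part: a sketch that counts tokens and attention-matrix entries at two resolutions. The 8x VAE downsampling and 2x2 patch size are assumptions borrowed from typical latent diffusion transformers, not published SD3 details. Memory-efficient attention kernels avoid storing the full matrix, so this is closer to a compute estimate than a hard memory floor.

```python
def num_tokens(height, width, vae_factor=8, patch_size=2):
    """Tokens the transformer sees for one image (assumed latent + patch sizes)."""
    stride = vae_factor * patch_size
    return (height // stride) * (width // stride)

def attention_entries(tokens):
    """Entries in one full self-attention matrix (per head, per layer)."""
    return tokens * tokens

for side in (512, 1024):
    n = num_tokens(side, side)
    print(side, n, attention_entries(n))
# 512 1024 1048576
# 1024 4096 16777216
```

Doubling the side length gives 4x the tokens and 16x the attention entries.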
has anyone been granted access to stable diffusion 3?
Hold your pants love it was just announced
I hope they watermark the models this time, so we know who leaks them
That aint happening in this economy
NEED
5xxx might offer 32GB if the rumors are true
I'm just hoping for a 24gb 5800
Would be awesome
Yeah it is slow
It's mostly for the LLM stuff, RAM offloading becomes a bottleneck
Damn this week has been exciting
Ahahahahah for what?
down the drain? No, time enjoyed wasted is not wasted
why lol
For a dataset
Yeah
Ooh that is bad tho
Rip
I've collected 300 thousand Stable diffusion prompts from this discord
it will still take a few months for sd3 to release
Every single prompt that made it to showdown
And every single prompt that got voted #1/2/3 in showdown
300 thousand from last july until Jan this year
I mean you still can train that model
Is there a private discord for subscribers by now?
I think it's the "Stable Builders" one, but doesn't seem very active
Yeah. I don't know why but my payment bounced and they don't even seem to have a page where you can manage your membership
ah thx
already signed
Biggest change is
- from conv diffusion to transformer diffusion model.
- Biggest SD3 is roughly 20% bigger than SDXL
- will be open weights
And a slightly different flow + score matching training objective
wow SD3's prompt adherence looks fantastic in the demo images
ikr!
hey sandbit! 🙂
Hello masslevel!
the text is crazy
holy sh! its sandbit
Transformer diffusion models should be cheaper to train. Pixart-alpha https://pixart-alpha.github.io/ claims just $26000 training cost to get to SD1.5 ish quality
still using your awesome sdxl prompt list. such a treasure
Has Stability abandoned this Discord? SD3 announced but nothing in here??
https://pbs.twimg.com/media/GG8mm5va4AA_5PJ?format=jpg&name=large
“Photo of a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat”
That's really good wow
Ooof I pray it isn't that good.
We have a paper on compositional diffusion
Where you have controlnet not restricted to the segmentation labels from ADE20k
Pixart creates mostly deformed images of humans,
There's a reason, due to safety they used a dataset with all human faces blurred out
Oh
... it is what it is, I'm not an author of PixArt, but know the authors
"""Safety"""
Due to "safety" they couldn't train on too many pictures of human faces
☔️ one day they will realize safety is a myth
To clarify, some of the dataset has human faces, just a large part of it is blurred out
Safety is indeed a myth
I only thought it was interesting due to SD3 also using a diffusion transformer, and PixArt is the first open source diffusion transformer model with weights
Will SD3 be released as an open source model? Or will it be closed?
SD3 should be open-weights hopefully. Otherwise I have little care for it
Any non-open AI model is useless to me
We just had a giant meeting about SORA yesterday. Discussing the likely architecture based on what we know in public and private details
50+ people, our conclusion was that SORA likely spent less than a year training, as one of their top 3 authors only joined OpenAI within the last year
And we agreed the likely architecture was diffusion transformer, likely with a hierarchical and first image conditioning like NUWA-XL
NUWA-Infinity from Microsoft: https://nuwa-infinity.microsoft.com
Not to sound rude but whos "we" in this case 👁️
A bunch of people working in machine learning professionally that know many people from openai personally
You've likely read our papers at some point in time if working on image synthesis
Snowden fails to not post absolute truth. Amazing 🔥
I don't believe the crap about SORA training on unreal engine, we think most likely is movies + short clip dataset, likely they labeled themselves
Do you agree that the claims of SORA being a "life simulation" are marketing at best? 👁️
I could be wrong but I swear I've even seen first parties saying something similar to that
are there any downsides of merging similar contextual models with each other? i dont want to switch between them frequently
Yeah the stuff about world simulation (aka world model) is likely marketing only, I suspect they are just doing video generation
To be a good world model, you need object and spatial consistency, SORA doesn't have that (yet)
Something about the SORA captions looks very GPT-4 Captioned. It has GPT-isms (utterly and hopelessly verbose)
Well, I think there are probably many teams globally with the talent and compute to do what OpenAI did with SORA, probably 10+ teams globally.
But.... No one has the OpenAI video dataset lol
I believe sora has gpt4 prompt enhancer
Yeah that was my take too but so many seem convinced SORA is somehow more than just text to video (very good text to video of course)
perhaps, but I think marketing is marketing
I doubt they'd make money with a world model (for robotics)
If they do text-to-vid, they can at least charge subscription fees
It sounds more exciting for sure. But at the same time people are also convinced ChatGPT thinks it's people because of some quirky behaviour
My 2 cents, if it can fool 50% of humans then it is what they claim lol
Very good point. I keep forgetting that most people don't spend enough time at their computer to notice patterns in how GPT writes, how t2i has that AI sheen or anything like that
The way of the programmer with too much time 🙏
I beg my clients not to send me emails expanded by chatgpt.
Use ChatGPT to summarize the expanded emails
Use their weapon against them 
you're so right! 😄
https://github.com/microsoft/NUWA/blob/main/NUWAInfinity.md#image-animation-hd what do peeps think?
been using SD1.5 and then SDXL for some time, few days ago I have finally subscribed (more to support them, I never expect to earn anything, since I don't nor plan to sell anything). didn't like that their later models weren't open in open-source sense (even if I mean only weights and ignore content limits), but they are only, well not open, but probably something like weights-accessible capable image models. after reading about SD3 and how important is the censorship for them, and discovering how Stable Cascade doesn't understand chest size (even for sfw images), I am disappointed. I read from several guys who train (both loras and finetunes), that excluding this kind of images is detrimental to how model handles poses, clothes, legs and arms in majority of images, not just nsfw. and sufficiently censored model might be very hard or impossible for community to uncensor. I am starting to consider unsubbing 😞 I don't like using discord, so just leaving it here
🙏 without finetunes SD would be useless
I just subscribed too, just to support them (well, via clipdrop, which is owned by stability.ai)
not anymore
What's the VRAM req for SD3?
what about ByteDance
I am assuming that fine tuning will get over the censorship bump. Stability are just protecting themselves here considering the growing wave of political and social pressure building up around deep fakes
They actually sold clipdrop, that announcement is right under the SD3 one 😉
Good riddance i think, clip drop never made sense and their outputs did SAI more harm than good
sd3 is bigger than xl right?
24gb 
thats a big boy
no😭
Cannot wait to see how the new architecture supports video.
gonna have to wait a few weeks/months until the community manages to make it run on 10-12gb
without having to wait 1+ minute per image
it would be awesome if we can add data to it. For example, give it images of my dog and make it do all sorts of stuff
That would be fine tuning either the foundational model, or better a LoRA or similar tech. You can do it now with existing versions
well, that's what LoRAs are for, and controlnet
never doubt the will of some people out there wanting to make p@orn with it. they will remove the censorship from it in a week
It's sad there is no other competitors that actually care about open-source and no-censorship
we'll just have to stick to sd 1.5 forever it seems
Or SDXL
or wait until training an AI becomes cheap enough a small group of people can do it
release the weights 🗣️
SDXL is censored too, isn't it? You need to use another model on top to generate NSFW images and even then the anatomy is usually worse than in SD1.5
Howdy, ya'll. I was deep into SD for a while and tapered off around SDXL release. I was mostly exploring ControlNet, Parseq, Deforum, and building a workflow of animated texture sequences into Ae and Blender.
Anyone know a good place that curates updates and new releases so I can catch up on what's happened since?
Not just in SD, but the tools, addons, and utilities as well
I use foundational models based off it. You get anything and everything you like. Look at civitai for both FMs and LoRAs
This is why I think SD 3.0 will have as its base model a censored product, but it will be fine tuned the hell out of to make it actually useful
Yeah, but finetunes don't work as well as it simply not being censored
SD 1.5 still generates better anatomy most of the time
even if SDXL's overall image quality is better
Controlnet doesn't work as well with SDXL either
There is not really a place like that, not that I'm aware of at least. I usually inform myself by visiting the StableDiffusion subreddit and looking for posts tagged "News" or "Resource Update"
That's what I figured - I appreciate it! It still feels like a wild west where you gotta do your own research
Thankfully there are lots of places to search
You might want to have another look again, I can generate what I like from SDXL and while there are fewer specialised LoRAs than 1.5, they are being developed every day.
I suggest trying to look for every "News" or "Resource Update" post that has at least 200 upvotes, since those are usually the important ones
You can definitely do good images with SDXL, but that doesn't change the fact it struggles with anatomy more often and using Controlnet is harder than in SD1.5
also it struggling with anatomy more often doesn't mean it can't do anatomy, just that it's more likely to fail
Anyway, we have to suck it up that 3.0 will be censored - the political environment will not allow otherwise unfortunately.
So, we will have to rely on others to create the foundational models we need.
It should not be censored 😑
Would that be the case?
I did not know it may not be open source
That would kill it for me
In theory it is, but there's open source and "open source"
Hmmmm
how exactly will they censor it
if the code is open source but the weights aren't then it's not really open source
Removing NSFW images from the training data
By filtering out the inputs, not including nsfw content with training, and filtering the outputs are all options
yeah but if they release the model weights we can just remove all of that
finetune it for nsfw and stuff they didnt include
filtering inputs and outputs can easily be disabled when running the AI locally, but there's nothing we can do about poisoned training data
Bingo!
finetuning isn't as good as just training it with the correct data from scratch
even if we need to retrain the model entirely there are people willing to do that and have the datasets already
for example there are companies like novelai that dont really censor anything
So let them censor it (as long as it's released open source), then let’s encourage them to create a video service off the new architecture (similar to Sora). Then … we fine tune it
I think sora is a lot bigger than 8b though
It will be via fine tuning and LoRAs of course
so video diffusion based on sd3 will be bad
Nice fractal
thank my gpu
I feel inclined to buy the nvidia 5000 series after investing in them so much lol
Sora is going to be a massive gpu sink - just have a look at Open AI purchasing from Nvidia
yeah even dalle 3 is probably losing them money in chatgpt
they had to reduce the generations per prompt from 4 to 2 (or 1? havent used it in a while)
i don't doubt nvidia will be a bunch of assh*les again and have RTX 5000 series have little to no vram
yeah probably
Vote for politicians that will regulate AI generation the way you want it to work. Until then there is 0 chance for base models to be uncensored, especially as they gain more and more capabilities
just so companies and enthusiasts are forced to buy either the 5090 or AI-focused cards
Hi there!
finetuning will fix it though
Doesn't really matter since even if the US or Brazil or Canada or Mexico or whatever allows it
I do see a future where you have a separate gpu that only does AI, like a commercial ai accelerator
another country won't
No chance - too many stories about revenge porn and kiddy shit evil twats are making
the EU probably is the one that will always be insufferable related to that
oh no you can generate a naked cat that's criminal
always gonna be a problem with open source sadly
$2B profit tho
Yeah, but I see it as a neutral tech. Since it is used for evil, we either accept the base models will be censored, or we embed tech to trace who created what.
Prefer the former
OpenAI makes MSFT billions, just from the stock valuation
they can lose all the money in the world, and still make the MSFT stock go up
it's all about the image/perception
also Google has way more compute than MSFT/OpenAI
Just disappointed I did not buy stock in Nvidia a year or two ago
im up like 50% so pretty good
That's not good for StabilityAI then
for the government and authorities they have a bad image because they are against open source AI
for the community they have a bad image now because of the censorship
interested to see what they are gonna do with that, maybe its just me but feels like all their ai products are lackluster
products are just a show
internally, I bet Google's models are very close to OpenAI's internal models
beating a year old product
From what some ex-Google employees say, their researchers were frustrated
I wonder what openai has internally
that Google would never release anything
Goddess Lilith
but apparently now that's changing since they don't want to lose any more researchers to Meta and OpenAI
Gemini 1.5 was apparently done all this year
yeah google sucks at actually releasing stuff
since some of the papers were released in 2024
where did you hear this?
makes sense
I doubt it
AI Explained has a video on Gemini 1.5 Pro
Gemini looks like outdated research
he shows that some of the cited research papers used on Gemini
are very recent
and they are fundamental to how the AI's context window works
I would believe if you said Gemini just finished up being "aligned"
so they couldn't just be "attached" to the AI later
research that's outside of Google?
yeah
interesting
I still believe the leakers
At least for OpenAI, the leakers are saying that Sora is very old
nah they were being intentionally obscure
probably the initial model was first trained about a year ago
when gpt 5 tho
Any predictions on SD3 release dates?
3.5 months ( it took this long for stable 123)
from Emad: Some notes:
- This uses a new type of diffusion transformer (similar to Sora) combined with flow matching and other improvements.
- This takes advantage of transformer improvements & can not only scale further but accept multimodal inputs..
- More technical details soon
sounds super exciting
Hopefully it releases sooner than 3 months from now so the community can make SD3 useful
I wonder if this makes it harder to finetune or train in general
multimodal sounds very interesting
Yes
I'm conflicted with SD3. Either it's going to be totally awesome or another SD2.1
Hopefully there's no forced censorship
https://x.com/andrekerygma/status/1760676074491687310?s=20 that's pretty impressive prompt following, right!!!
how big is sd3?
im trying to use this repose workflow i found but it says I need anything everywhere
I cannot find that anywhere
in the manager or on hugging face
Hopefully for a quick release
there are a few bits I am struggling to find tbh
Yeah, the prompt following abilities of SD3 are insane
Has anyone here gotten their hands on SD3?
I wonder how we're supposed to train it and if poor training ruins the prompt following abilities
" The Stable Diffusion 3 suite of models currently range from 800M to 8B parameters. This approach aims to align with our core values and democratize access, providing users with a variety of options for scalability and quality to best meet their creative needs. "
800M ≈ sd 1.x
8B ≈ 3 × sdxl
@wise stratus out of curiosity with the model prompt ability, could you please generate for example " a clown balancing on a hug red ball with one hand, and two monkeys standing on each of his feet juggling green balls, and on the left side of the painting a tiger dressed in a velvet red tuxido playing on a jet black piano, and on the right side of the painting a baby elephant balancing on a small red ball wearing a playing the trumpet " .. this is for research commercial purposes obviously
Or just generate two female hands but each individual nail has a different color or art, "1° nail is painted blue, 2° nail is painted yellow, 3° nail has a rainbow drawn on it, 4° nail has a drawing of vines" and so on
but i doubt any current ai can do that without it looking like a monstrosity
anyway to get fast access after sign up
i can do it... dont need no ai for dis
That would be testing it to the extreme... even a regular pose would be great. Like two fighters correctly holding their two-handed weapons while fighting.
Given the high degree of prompt comprehension, I wonder if they have a specific way to organize training data when training SD3.
Or a boy playing correctly with a rubik's cube.
i think my prompt is perfectly reasonable
Except the hug red ball part. Why hug the red ball?
they didn't post will smith eating spaghetti nor a horse riding an astronaut, so I guess it's on par with D3 but not that good at prompt adherence
i mean huge red ball... but IF THE MODEL IS SOO SMART IT SHOULD HAVE KNOWN

Or better, the triangle, sphere, cat and dog example was already better than D3
Isn't that because they rewrite the prompt? Such a layer would "guess" hug is huge
fuck it lets keep it as hug then
(or they would have banned you for daring to ask for someone hugging a non-consenting object.)
"SD3, generate an image of will smith eating spaghetti."
- Unfortunately, we cannot generate images of real persons like Will Smith or the delectable Ragu™ pasta sauce. Ragu, it's delicious!
thats how you really know if its censored
when you ask it for van gogh style and it says noh 😦
@wise stratus yo bruh. Got a question if you don't mind. Is SD3 gonna be as responsible as Goody-2 is? If so then count me in!
seriously about van gogh? Omg...
Does each copy of SD3 come with a helmet and a pair of shin pads? I can never be sure if it's safe enough
More seriously, I hope the censorship is like SDXL where it can be trained out
can it make a sandwich with specific layers?
why are people pinging the server owner
GOODY-2 is the end-stage of Anthropic and OpenAI
because emad sometimes is cheeky and he likes to drop in sometimes
8 billion people in the world, ill take my chances
its better than dalle 3
I meant parameter size
800m-8b parameters
its really better than dalle3
sdxl + refiner is like 10b
T.T where download? :c
I think the main thing for the community is if SD3 is censored or not. If it isn't, goodbye SD 1.5. If it is, we'll stay at SD 1.5.
if its censored we will uncensor it
just finetune it
The Coomer Community
It's not just about porn. It is about being able to generate gore, "disturbing" imagery, "offensive" topics, and a variety of other content that were also censored heavily in post 1.5 releases. They filtered out artistic styles, images of specific celebrities, etc which is why image quality sucks for so many topics. Censorship = missing information = bad image output
Look, let's stop beating around the bush. Does the wider audience really, truly bother using SD if it's not for pornography? Let's not kid ourselves here and see how the internet uses it. What are you trying to achieve if SD doesn't have it? Dalle is literally better in every aspect so what gives and why bother trying if you keep denying your purpose lmao
I came here wondering the same thing re: censorship. I really hope they release an uncensored model and let us decide how to implement guardrails ourselves in our own applications
Honestly, customization of the image through img2img and inpainting in ways D3 can't. D3 produces great images if you're lenient about the content. When illustrating my RPG campaign logs, NPCs are already detailed and I need the image to match every aspect of the description that was given during the game. I can't do this with D3, I need the framework of tools around SD. All my NPCs are clothed.
D3 can make impressive images of a towering orc leader on the battlefield wielding a 2-handed axe, but if I can't have him wear the crescent-badge of his clan, it's useless. I prefer having to work harder on SD and be able to inpaint the badge.
So, there are use cases (maybe niche) outside of porn.
nsfw is a single application, there are many others
Also, where I live, it's not illegal to make caricatures of political leaders. D3's censoring is appalling in regard to this.
its going to be open source soon why are you freaking out lol
D3 opensource ? I doubt it.
its confirmed
SD3 certainly, but D3 I don't think so
oh I thought d3 was a typo for sd3
We're speaking about dall-E 3 (as to why we need alternatives to it outside of porn)
my bad
Is SD3 posted in Discord announcements yet?
what I want to know is if the smaller models are distinct models trained from scratch, or if they're distilled models
Imagine BCI (Brain Computer Interfaces) with SD3 and SORA like models ... that would be wild 🧠 🧙♂️ ✨ ✨
"...We've got some really exciting experiments running in the coming months using fMRI and ECoG, and then plans for multimodality + hopefully applying to new BCI tech..."
https://x.com/humanscotti/status/1755021725077504027?s=20
what is D3? 🙂
: )
dall-e 3
My rank in League of Legends 
Generating all that gore
Seems bad for your mental health

I might want to generate imagery of a combat scene. I might want to generate a picture of a surgery.
And again, it's not just gore and porn and racism and whatnot. I literally can't generate pictures of celebrities, or emulate artistic styles because these fools removed everything that could potentially offend/upset anyone.
I'm not pro censorship
from dall-e?
Perhaps, Bytedance does very good research in image synthesis models
Oh wow, happened today
what about blatant misinformation designed to incite civil wars and riots
No, we were talking about censorship in SD 3 and hoping that they release an uncensored model.
its going to be open source
did I stutter? 
im saying you're wrong
Free speech doesn't prevent accountability. You can yell fire in a theater. It's not the act of yelling fire that should be prosecuted. It's the fact that people were stampeded to death because of your false alarm.
the only censorship I'd be okay with is the obvious stuff
i.e., content involving hurting humans (which is not to do with reporting/news)... especially hurting minors
Who gets to determine which content is harmful? What if the censors themselves are a bunch of rich warmongers that are actively killing people all over the planet?
Anyways, I digress. If censorship is going to happen it should be by application developers, not baked into the model
if it's AI generated, I say allow literally everything imaginable
literally
the law. stay within the law, or be punished
the law is a funny thing
What if the people who write the laws are a bunch of corrupt, unethical monsters seeking to write laws for their own personal enrichment?
thats why we have the masses decide on who makes the law
Not too long ago, segregation and slavery were legal.
point is, will we have nipples in SD3 or not
If you're in a dictatorship, you're in a bad place. If you're in a democracy, I'd suggest not voting for them
Eugenics was legal
it's like you've never seen the modern world and are just spouting the most aggressive stance you can because you think you're edgy and cool
Discussing this topic, inevitably brings us to Elon
What did Elon do after saying he is a free-speech absolutist? What did he do after saying he'll "only follow the law" (re-iterating this multiple times).
It was legal because most people were OK with that. Then we progressed in our collective enlightenment.
can anyone help me?
What Elon did, was constantly perma ban people on Twitter/X, who never even came close to breaking US law.
It's like your views are inconsistent, and you're a naive little child that thinks lawmakers actually care about you lol
And we successfully voted out eugenics.
I am struggling to understand why when I clone a repository only the frame is copied and not the files
thats not an argument...
why is this not working like it works for others?
Slavery, too, though I have heard that some countries had a little fight over that
just name calling
lol I'm just responding to your personal attacks bud. Don't dish it out if you're going to whine if you receive it in return.
For example, Elon banned people who criticized the Turkish dictator. Elon also banned people who criticized some local Indian governments. Elon also banned people who correctly stated he grew up rich (Elon was lying that he grew up poor, then people fact checked the motherfker). So much for free speech absolutism eh. I have screenshots of all of this, and more, as proof.
Never trust someone who says he is a free speech absolutist.
Never trust anyone's word. Only trust people's actions, not words.
you joined this server TODAY so I'm just going to assume you're a troll then. my message contained facts, yours was pure name calling. blocked
oh no! blocked ... I'll miss you terribly ...
if free speech absolutism can get me an invite to SD3 faster, I hail my new Elon Musk overlord.
why are yall comparing a base model that doesn't have nudity to slavery, eugenics, dictatorships and segregation? 🤨
moving on now ... are we going to have an uncensored version of SD3? Or will censorship be baked in again?
It literally would
Your reading comprehension is pretty weak. Nobody said that.
why are those things being brought up at all in this context
Let's not devolve into ad hominem, people.
@vague pond Someone made the claim that ethical choices should be made based on the law. These things were brought up as examples of things that were unethical but legal.
that's the problem with democracy. Quantity becomes quality. Segregation is still legal in some places like ... Israel. As JKM once said, unlike some tyrants, the so-called "majority" is always wrong.
Ah, the genius that brings politics into it. There's always one
Because democracy basically means, government by the people, of the people, for the people, but the people are ...


Model censorship (I don't know about SD3 yet) is eminently political. I guess a model made by a Chinese company would be much more open than OpenAI's on some aspects, and ruthlessly forbidding on others. So it's an outcome of the political climate.
Well, say no more!
Let's get you in: http://tinyurl.com/5n8h2j2c
shh embed
don't spoil it
I'm not sure what you mean. The large company which releases a truly uncensored model would get in trouble.
I do want to try SD3, but it's not going to be usable for a few months because of the censorship
I can already do better with midjourney or Dalle
Ever heard of Wuhan's animal meat market?
Freedom isn't always a good thing.
lol
I mean China does have certain extra freedoms.
Like selling random ass animal meat. (Yes I know they've started to police this a bit more.)
"..enables 3D.."
wonder if that means input control or output, since they say SD3 will come with a complete suite of tools for everything? 🤔
https://x.com/EMostaque/status/1760661179444219951?s=20
The Chinese government is very scared of AI though. For many political reasons you can think of.
Yes, but it would be censored differently depending on where the company is. A ban on depicting US-based companies' notable IPs will certainly not move the Chinese censor. Drawing Xi Jinping would (just examples)
Enables video, 3D & more..
Pogchamp face
I'd be scared of what the Chinese government could do with AI, if I were there.
Don't worry, your friends at the NSA/DARPA can do better.
Hopefully AI will teach people not to trust anything on the internet, as they should have been doing for years
But I know people will still be taking twitter posts' truth at face value even as the world around them is generated with a prompt
It took centuries for people to realize that anything written wasn't necessarily true (OK, most people were illiterate)
but something must be true, so how to see the diff?
Surely my boomer parents will get used to AI
The truth is the vast majority of people have no idea what the truth is, just what their governments and their governments' enemies want them to know
Trying to get my Discord ID to register for Stable Diffusion 3. Any ideas how from the phone?
I'd suppose they would but they're currently investing a lot into it with all their state-funded research institutions
My take is that critical thinking toward news has declined in recent decades
From what I've seen they seem pretty confident they can censor things that they don't like with filters on their inference platforms.
Both are true
I think they use it for automation of a lot of things
You can be scared of nukes while building your own nukes