#🏞|general-with-images
1 messages · Page 3 of 1
Sociological influence of potato of next generation should list this
Happy new year to everyone! Enjoy your day! 
I kind of like that first little white potato, looks like an anime critter that would say something like "Jaga-Jaga, Jagaimo!"
With the altitude of some of those fireworks, I really hope this is AI generated. 😆
This is really cool, I've been trying to generate a minecraft-like style these days, can you show me your model and prompt?
Oh yeah it is 😄
That looks cool, what shaders are you using? Is this Minecraft RTX?
Dude I just told it to make a nether portal with RTX and it made that
show us
i used a merged model of derrida_final and Dreamlike_Diffusion
This is without minecraft in the prompt
Forget faux fireplaces, I want a faux nether portal in my house!
wtf is a faux fireplace
like those TV stands with a fake fire/heater built into them
ah i see
I feel like Automatic1111 could benefit from an 'enqueue ' button to pre-load more prompts without using an external text file.
pretty sure it already has one, but the ui i use doesn't because im on AMD
weird, I'm on an NVIDIA and I don't see one.
so I'm considering a script, where you specify a square image size, it takes that number of pixels, and lets you specify a viewing ratio that uses that same number of pixels.
so I could take a prompt and run it through a series of ratios like 1:1, 16:9, 4:3, golden ratio, etc.
but not overflow my VRAM
Yes, have a fine New Year’s Eve! 🎉
what is this??
ignore the training steps to 200, but like .
i cant turn it into a checkpoint
oh do i need to set a checkpointname?
model name
not optionally ok
ok fixed
Sure was a struggle took nearly two hours for me just to fix everything under the teacup to remove a second teacup and busted arm with attack of her suddenly holding peas from nowhere in the hand that was decent and cat and hand not wanting to agree to inpainting at all, just happy that i did not notice much more stuff busted needing to fix.
Good stuff, and yeah post work is where the time succubus comes into play. All the antiAI people show their ignorance when they say it is just copy and pasting their work, or work of others. Far from the case most times.
Yeah easy making pretty girl promt but once you want to customise or replace stuff it get tricky, first versions of cats were such abominations
Never thought i would struggle this much
Cats are pretty great now in 2.x
Now eventually we may be able to just prompt it and no post work will be needed but that is a long ways off regardless how much SAI, et al., wish to say otherwise.
Generic stuff is there now about but custom stuff like above? Nope
depends on what people think is "generic stuff" :3
Just hoping for way at some point to prompt or explain features two or more characters, have been struggling with it so far
generic is NOT subjective it is an objective statement meaning, for those who can't, or refuse, to understand that simply typing a prompt of "xyz sucking a cow's teet" is a generic prompt.
you will get a generic outcome. Now look at the above that is far from generic.
I hope they finally get the hand issues fixed as that ruins most outcomes for most.
I asked that chat ai if generic is subjective and it told me
"Generic" can be used in a subjective or objective way depending on the context in which it is used.
In an objective sense, the term "generic" can refer to something that is not specific or unique, but rather is a type or category that is commonly found or considered. For example, "generic brand" could refer to a type of product that is not a specific brand or trademark, but rather is made by a company that does not have its own brand or trademark.
In a subjective sense, the term "generic" can be used to describe something that is considered unoriginal or lacking in distinctiveness or individuality. This usage is often pejorative, implying that the thing being described is not particularly interesting or noteworthy.
It is important to consider the context in which the term "generic" is being used, as it can have different meanings depending on how it is used.```
How I am using it is how I just described it.
Now you know, and the more you know...
Fun thing is that the original hand with teacup was ok but once i started to swap things out if started to fight back
can you show me a example which is not generic? :)
Look at what I wrote as a prompt. That is fucking generic. Now look at a real prompt, with directorial words in it to SD to actually writing a book for it. If you can't grasp it then you are either not trying on purpose, or everything you do is generic and you are getting butthurt about. Generic has it place but for great work in SD it goes beyond embedding stacking or sucking off of models that do great work even if I just prompted "a cow in a field taking a shit".
I'm sorry, but I can't have a discussion with people who I believe are this combative. I just wanted to have a fun time talking, but this doesn't look like it is something fun I want to be a part of. Sorry. :(
I figured out the math, and I'm getting neat results from running the same prompt through different aspect ratios with approximately the same number of pixels, at this point I'm doing it manually, but I'm excited to script it into a sort of matrix output to batch these.
for example, here's an angel square, 16:9, 4:3, Golden Ratio Horizontal
Wow, what prompt and presets?
(beautiful) blonde young woman , (up in the sky:1.2) , with wings , highly detailed, professional photography, rule of thirds, photo, detailed face, hyper realistic, DSLR, white robes
Negative prompt: ugly, misshapen, watermark, signature, obese, short hair, blurry, painting, blender,
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1399292197, Size: 591x443, Model hash: 04baab99
I've gotta say the 1:4 and 4:1 ratios actually came out as my favorites:
That's nice! Which model are you using?
Yes, I posted this in #1045349359044280360 already but still- now officially:
Happy new year from the Netherlands everyone!
What is this?
up in the sky:1.2
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#attentionemphasis adding emphasis to sky, without it, and the world "up" she kept winding up on the ground. Up also appeared to put the camera low looking up, which worked out well.
Using AI to generate "Have a potato" finales for my banal and long winded posts is a new favorite past time.
Happy new year from the AI living on my network....
steps or epochs? Not interchangeable.
Dang, how many pics did you use?
Holy Toledo
I haven't touched DB since 2.x dropped
Lora/xformers/8 bit adam etc. on
however
i managed to leave text encoding on
yup
it is on
my issue with SD is it cant make vehicles
Lora is rubbish a lot of people are waking up to it. Plain old DB is the best and TI/HN the next best. TBH a good HN can be as good as DB.
how does one do HN?
doesn;t do well with planets either as they are more egg shaped than roundish.
I barely can on a T4 on colab. It is in automatic1111
I mean i am just testing, results are good so far
Major project will be the sci fi diffusion
i will hire A6000 for that
and
do some scraping lol
using Lora cause 3060 ti
Lora is bad though
People are turning off from Lora as TI/DB/HN is better, but if you manage to wade through 50k controls to find the sweet please let everyone know. btw, I could not do lora on a 1060 as 6gb is <100mb too little. but on Linux it would have been enough because W10/11 are resource hogs relying on the video card up to a GIG of vram is just sucked up.
well, again just testing. Unfortunately i dont have time to mess with 50K controls, but if the settings im using ends up good, i will make sure to share ofc
Prompt. HDR 4K saturated Sci-fi Style of cyberpunk synth-wave vapourwave Wadsworth vorticism wetta digital ILM cgsociety dreamworks cgi ray-tracing digital-art Robert McCall, H.R. Giger, Frank Stella, Ted Nasmith , John Harris, Syd Mead, Luis Dourado | ugly disfigured artifacts draft mangled revolting pixelart watermark signature: -5.0
SD 1.5 60 steps
cfg 9
Wadsworth why this tag? i recognize it from reddit infamy, the wadsworth constant. start any youtube video at about 10% in, because the first bit will always be some BS about the youtuber or what he's been up to. real content starts inward. it was such a thing that you could add "?wadsworth=1 " to youtube urls to do it automatically
Ive been feeding pompts from Chat gpt into stable diffusion and im pretty sure not too long from now we can just ask ai to make me a movie we like just based on our user data or something
There's already conversation 2 img systems that are coming along. and there's already txt2mov systems that are coming along. Convergence will occur
Two more papers down the line
What a time to be alive!
I must be doing it wrong
Ur fine, by default WebUI does not use nightly version unless you make it do so.
oh ok
What command does it?
Well, something like the following:
i have this folder
thats TensorBoard, its fine...
if you really wanna be sure, run the code in the article in your sd python enviroment
how to do that exactly, as in where do i find the python env
first, shift + right click your webui folder. The context menu should apper with a new option, something like "run powershell here" or "run command prompt here". Click that. Then run "source ./venv/bin/activate" and the code in the article.
gave error when trying to run source command
(powershell)
Edward Alexander Wadsworth ARA (29 October 1889 - 21 June 1949) was an English artist, closely associated with modernist Vorticism movement.
I mean, Im not sure how to do that using Windows, but if you manage to run that activate file...
This guide says its just the path to the activate file
okay the activate thing worked but the code from the website doesnt
Is there any particular reason why this output was so terrible?
I feel like mid journey probably would have done a good job since it has some presets in the background, but I assume they also filter for Pepe memes
can you pls print your terminal?
Microsoft Windows [Version 10.0.19044.2364]
(c) Microsoft Corporation. All rights reserved.
C:\Users\imran>Desktop
'Desktop' is not recognized as an internal or external command,
operable program or batch file.
C:\Users\imran>cd Desktop
C:\Users\imran\Desktop>cd stable-diffusion-webui-master
C:\Users\imran\Desktop\stable-diffusion-webui-master>\venv\bin\activate
The system cannot find the path specified.
C:\Users\imran\Desktop\stable-diffusion-webui-master>\venv\Scripts\activate
The system cannot find the path specified.
C:\Users\imran\Desktop\stable-diffusion-webui-master>.\venv\bin\activate
The system cannot find the path specified.
C:\Users\imran\Desktop\stable-diffusion-webui-master>.\venv\Scripts\activate
(venv) C:\Users\imran\Desktop\stable-diffusion-webui-master>python3 -c "import pathlib;import importlib.util;s=importlib.util.find_spec('triton'); affected=any(x.name == 'triton' for x in (pathlib.Path(s.submodule_search_locations[0] if s is not None else '/' ) / 'runtime').glob('*'));print('You are {}affected'.format('' if affected else 'not '))"
Python was not found; run without arguments to install from the Microsoft Store, or disable this shortcut from Settings > Manage App Execution Aliases.
try just "python" instead of "python3"
(venv) C:\Users\imran\Desktop\stable-diffusion-webui-master>python
Python 3.10.6 (tags/v3.10.6:9c7b4bd, Aug 1 2022, 21:53:49) [MSC v.1932 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> python -c "import pathlib;import importlib.util;s=importlib.util.find_spec('triton'); affected=any(x.name == 'triton' for x in (pathlib.Path(s.submodule_search_locations[0] if s is not None else '/' ) / 'runtime').glob('*'));print('You are {}affected'.format('' if affected else 'not '))"
File "<stdin>", line 1
python -c "import pathlib;import importlib.util;s=importlib.util.find_spec('triton'); affected=any(x.name == 'triton' for x in (pathlib.Path(s.submodule_search_locations[0] if s is not None else '/' ) / 'runtime').glob('*'));print('You are {}affected'.format('' if affected else 'not '))"
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
SyntaxError: invalid syntax
>>>
sorry, I meant for you to replace in the code from the article the word python3
Just as suspected. Midjourneys was way closer:
You are not affected
noice
O_O
yeah, I guess so
nah, its ok. Im glad I could help...
would the --medvram argument would get me compromised? i dont think so cause 30th of december has passed amirite
and i dont think that has sth to do with pytorch nightly
yeah, --medvram does nothing related to the pytorch installation. Its ok...
Can anyone explain the hidden parameters Midjourney uses to get results like this?
Compared to what I just did with SD:
I mean the quality isn't even close
Looks like a 1980s instructional VHS had a seizure
not sure if it will help but try some negative prompts , it might be that the mention of a cartoony thing is making it favour cartoony generation, maybe this can be discouraged with negative prompts for "cartoon style" "unrealistic"
i've definitely seen SD make things in a realistic style
Hello, does anyone have suggestions for negative prompts to remove duplicate limbs etc? I'm generating 1024 x 1024 so that might be an issue. I already have ((extra limbs)) ((deformed)) ((disfigured)) (((duplicate))) as negative prompts :)
(sorry uploading again cause spoiler didn't apply for some reason. Trigger warning - some blood on white Space Marine-like armor)
evenin'
Why are people using () in their prompts?
Also, is there a prompt preset we can use to negate most of the garbage output?
if they use for example auto1111's webui then it's used like this a (word) - increase attention to word by a factor of 1.1 a ((word)) - increase attention to word by a factor of 1.21 (= 1.1 * 1.1) a [word] - decrease attention to word by a factor of 1.1 a (word:1.5) - increase attention to word by a factor of 1.5 a (word:0.25) - decrease attention to word by a factor of 4 (= 1 / 0.25) a \(word\) - use literal () characters in prompt
Is it possible to train a model that uses Stable Diffusion to generate agar.io skins?
no idea what that is, the more detailed you want something, the harder it'll be.
Are there webui prompts you can use to get higher quality from the start?
"Something Rotten" Deforum Animation:
https://www.instagram.com/reel/Cm4pZfgJBwh/?utm_source=ig_web_copy_link
does this make imagen sd based?
Hello everyone ! im saw this image the other day and i didn't have time to see who did it, is possible if you know to tell me who did it or the artiste style name
maybe somebody called bitspirit
two blonde women sitting next to each other in front of a window, a storybook illustration by WLOP, featured on cgsociety, fantasy art, artstation hd, detailed painting, storybook illustration
i tried a reverse images search and nothing. So it must be Midjourney or Stable Diffusion
Wow thanks ! i will try it, im new to this i thought i had to train my ai with his model
@errant leaf This is what has me confused.
When SD does anything it's only drawing on that 4 GB of dedicated video memory, even though that's only a 3rd of what's actually available
It's 100% of your dedicated memory, aka the chips that are on your GPU
okay then. What is the other 8 GB being used for?
That is literally your RAM memory, or well, half of it
Maybe this helps, I'm bad at explaining it https://www.reddit.com/r/StableDiffusion/comments/x308i4/how_does_shared_gpu_memory_work/
6 votes and 2 comments so far on Reddit
Can you pop into #🤝|tech-support for a few moments I need some stuff related to memory and SD I want answered
The ELI5 at the bottom of an answer:
think of VRAM as a bottle which you can fill up with water, but shared VRAM is a bucket under the bottle which catches water which overflows from the bottle, if the water overflows from the bucket onto the floor your system crashes, however if you want to access the water in the bucket, you have to scoop up the water from the bucket and pour it into the bottle which means it will overflow other water into the bucket
I'm not a tech expert but there may be people there who can help
The actual video memory you have is 8GB; the other 8 is 'shared memory', which means that it uses half your RAM memory as 'emergency' memory if you're using your dedicated 8GB of video memory
and that's not enough for SD?
RuntimeError: Not enough memory, use lower resolution (max approx. 832x832). Need: 0.4GB free, Have:0.3GB free
What resolution are you trying to get? I think the error is pretty clear
768x768 is the native resolution of SD 2, 512x512 of SD 1- you can't just generate full HD wallpapers with any GPU
768x768 Batch Size: 3
Why a batch size of 3? Try a batch size of 1, and 3 batches
So you're generating 3 images, one after the other- right now it will try to make 3 images all at once
For whatever reason, when SD hits the limit of your VRAM instead of attempting to access your regular RAM it just returns an error. So of that 16 GB of memory, Stable Diffusion gets 8 GB max.
is there a tutorial for an appropriate setting for SD? I don't know myself very well yet
this should work for you
I mean a general tutorial for SD setting
I'm sure there are tons, on youtube for instance
Not to be a jerk but you can easily look for those yourself just by typing 'stable diffusion tutorial' literally anywhere
If your graphics card isn't powerful enough for what you want to do you don't have many options it seems
Supposedly there's arguments that modify VRAM usage
like --medvram
But when I tried that one the impact on VRAM usage was negligible if it did anything at all
his gpu is perfectly capable, that's not the problem- the above setting is what was wrong
AI stuff is just vram intensive- but you can also run it on your cpu if you want, it will just be hilariously slow
I have a 10 year old graphics card with 4 GB of memory. I tried setting an argument --medvram which trades performance for vram efficiency
but as far as I could tell the difference it made was negligible
I even found a website that was like "use this argument if you have a 4 GB piece of shit graphics card"
well yeah, it's not like it would decimate your vram usage or something- your 4gb is already pushing the limit of the lower end
You’re misunderstanding what --medvram does; it doesn’t reduce VRAM usage, but enables tricks that make generating larger images possible with the same VRAM (at the cost of speed).
I see
Well I've still got 500-ish dollars from my college graduation party last year. I think I'll spend it on a new PC.
Got any recommendations?
If you’re not building it yourself you’re probably wasting money
Honestly, 500 would not be enough for a new pc, or at least not something cutting edge-ish considering you're doing AI stuff- you might need to consider replacing components instead of a completely new pc
depending on your current build that would be the best bang for buck
Maybe something secondhand
I'm not looking for something cutting edge. This computer I'm typing on is going to turn 15 in a few years. Even something painfully average by modern standards would be a drastic improvement.
RuntimeError: Not enough memory, use lower resolution (max approx. 768x768). Need: 1.4GB free, Have:0.2GB free okay my max sice is 768x768?
In that case you might consider going for a secondhand gaming pc with something like a 1060 or 1070
1980x1080 is not possible
Know where I can shop for one of those?
No idea, I'm not in the US
No websites or nothing?
My advice would not really make sense as all I know are relatively localised european tech marketplaces
the only name I know is craigslist lol
how can i update SD or can i get SD automaticly update
SD 😢
New embedding that I’m working on, Glitch.
Can I get images larger than 768x768 without using SD on my PC?
how can i do images like this in SD v2.1 is it possible or i will need other model for this ?
yes u can with any model but u will need a higher GPU if it doesn't work
But Dreambot or SD online only creates 768x768?
Why i have this error since last updates ??
raise RuntimeError(f'Not enough memory, use lower resolution (max approx. {max_res}x{max_res}). ' RuntimeError: Not enough memory, use lower resolution (max approx. 1152x1152). Need: 2.7GB free, Have:1.0GB free
Have 8 GB of Vram, crash on 1280x768, when it always worked
Any ideas?
restart comp
Fixed, it was because of the upscaler in txt2img, works back 🙂
lights and shadows are amazing, wow
You mean it's better not to leave them on the side of the road alone? effectively
I don't understand a word of what you say
find carhelper
clck all the links
press download
put in stable-diffusion-ui/embeddings
say "carhelper" in your prompt
😎
Does anyone know another program that can fill in spaces like Dall-E does in this example? Photoshop content aware doesn't seem to cut it well with cartoony, fantasy stuff
SD have an inpainting model but i havent try it yet
SD 2.1
Yeah, it doesn’t seem to match what is around it, rather filling it with whatever you prompt
been having fun with the new protogen model. its a good blend.
I grabbed it but #1 the skin doesn't look better to me, and #2 it is a 1.5 model so will never get used by me.
yeah, it being a 1.5 model limits it's lifespan quite a lot. in this session, i'm just messing with celebrity likenesses and enjoying the results. Exploring the aesthetic really
I grabbed it not really knowing but the lure was it said it did skin, and hands, correctly, but the skin was not right to me.
fair. i don't see improvements in those areas too much. i think it's just selection bias on results.
Yeah, I was let down by the over hype. 😦
I have to add carhelper to my already existing prompt?
If I say for example:
BMW M3 E36, carhelper, futuristic, blah blah
?
How was it trained? I ask because the way you call any of mine it changes how it looks.
wdym
I can't break it down any more than that
changes the name?
changes the outcome
the image
yeah
we had different resolutions
Well, depending on how an embedding is trained there may be more than just one way to call it with some way little change to other ways drastic.
i just throw it in the prompt somewhere and play with the weights
bad idea
does position change importance?
well i gave you the exact prompts the other day
This is what all of mine use to be called:
image in [name] style
[name] style
[name]
in the style of [name]
by [name]
oh, i do: "name",
try the others
I prefer "in the style of [name]" when I call them
I am unsure how they trained theirs and no one says.
for my embs the best one is normally "in the style of"
That looks like a 2.x Walter White. 1.5 he was great
here's one of walter on 1.5
i mean kind of similar
2.x looks more realistic
another 1.5
I love the way he looked in 1.5 but not so much in 2.x
yeah the difference between 1.5 and 2.x is quite jarring, both have their strengths and weaknesses
I switch to 2.x to never go back to 1.5 myself but when 2.5 comes out it should be leaps above 1.5
Is there a reason why SD likes to generate pics like this, cut off, no matter what prompts you use?
You aren't using 1:1 is it most times.
1.5
1:1 meaning 512x512 or 768x768
yeah tried that and same thing. its not always but quite a lot
I barely ever get that, but did some times on 1.5
2.x I never really get that as long as I do 768x768. I still get heads cut off though
so generate 768x768 and then resize the seed if you get something nice?
using 1.5 would be 512x512
I don't use 1.5 any longer since 2.0 was released and I use 2.1 now
ah ok
Image will be the same because of the seed, on 2 different resolutions?
teah same picture but it generates it at a larger resolution
dude, if i knew this
it may have very small changes but nothing dramatic
I guess it preserves the angle and positioning of things?
I was reading that 2 and 2.1 arent good for inpainting. I use that quite a lot, so do you think its ok to upgrade?
yeah same everything as long as the same file prompts settings etc are used
Well, at 768 I never had an issue and I never needed an inpainting model but I do know of a 512 inpainting model for 2.0
in 1.5 I used the inpainting model for everything not just inpainting as it was just superior.
how do I find that model? sorry Im failrly new to SD
is it the noema-pruned file?
I think I will stay on 1.5, its all too new to be jumping around at the moment. lol
I guess I will keep trying 1:1 image sizes and hope they generate properly
Well, that is akin to sticking with a horse and buggy vs a modern automobile, but cool. Good luck.
true, but I started just a few days ago. Its new to me 😆
I have no idea what files are what etc
Playing with CharHelper V3 in it's current state while I train model B to merge it with. We're getting weird tonight.
agar.io is a game
rovery
Don’t know where you heard that. I find it better. Use the dedicated Inpainting model
good to hear. It was from a youtuber
api? are you getting it from somewhere?
getting what from somewherew
Im running auto's webui locally but I am generating through a custom python script I made
I can generate images but not upscale with the api it has
ah, custom stuff is a little harder to see what might be wrong. I just downloaded all the upscalers I wanted so I won't need any outside calls
and did you also add the --api to the bat?
is it wrong to think of your collection as 'your own collection'
i have a bunch of picture i quite like it myself
but i have a feeling, i don't deserve to show it as a collection cuz i don't draw it idk
i made a lux model 🙂
https://civitai.com/models/3774/luxanna-crownguard-from-league-of-legends
Lux from League of Legends.Settings:Sampling method: DPM++ SDE Karras
Width: 1024, Height: 1024,
Sampling Steps: 30
Restore Faces
Highres. fix
CFG Scale: 7-11Example prompts:Studio Ghibli, extremely detailed CG unity 8k wallpaper, full shot body photo, base lux skin, masterpiece, high quality, absurdres, lux (league of legends), nsfw, ful...
i personally don't know why the proton model thought that putting a light saber in your pocket is a good idea
lol
how can i generate more than 1 image in the web ui ?
change the Batch Count number
allright, thank you ❤️
Hi guys, I'm generating 1280x768 images, but it often causes me duplicate elements, where it didn't before. It is Denoising Strength which is supposed to reduce this effect, isn't it? No matter how much I mount it, it doesn't change anything.
it did before, you jsut noticed it less. this is always a problem with larger images since SD is trained on 512x512 blocks. Or refined with 768x in 2.x's case. It'll attempt to do the same seed in each 512x chunk it sees.
hires fix in automatic1111's webui will alleviate these issues by first generating a small lower resolution image, and then building on that with img2img
Yes, I know all that, hence my question, that it gives the impression of no longer being effective.
So, you will agree that in principle, reducing the Denoising Str to 0.15 should have an impact. Here, this is not the case.
On 1.4-1.5 I feel like it worked better. For identical resolutions
bnnuy
baby images
I'm torn between two styles for my Visual Novel. Which would you prefer in a VN?
I think the first one is cuter, but the second one is probably a more modern style for VNs, and might be more immersive.
The story deals with some serious issues, so cute might not always fit. (both of these styles are made with Stable Diffusion of course)
I really love the 3D vibe of the second one. The colors are very soft and the cafe vibe looks really good.
But I'd have to say that the design of the first girl's outfit, and ears, are more innocent/charming/unique
I love the little bow
And the outfit is cute
It feels more personal, I think
The overall style is more inviting
If you could maybe soften the 3D one, or adjust the face, maybe add some soft focus/motion blur, you might get a blend between the two of them
anime
Are you using SD 2.1 to make the anime? I had to switch to Everything 3.0 to get anime to be good again
how this is possible?
What is your gpu?
1050 ti
why?
Because sometimes it can run out of memory if you got 3-4 gb of vram, I have the same issue with the 1060 3gb
how do you fix it?
If you close and open the webui again you should generate images again
I still don´t know how to reset the vram usage withouth closing the program 🥺
Np🤗
same
Exception training model: max() arg is an empty sequence
what does that mean?
in auto1111 dreambooth
The anime one is using AnythingV3, which is based on SD 1.5
er yeah I meant Anything 3 lol
I only care for anime art really everything else is sub-art to me 😄
except Midjourney does good non-anime stuff but only anime for me from SD
There is also a very low vram option flag you can set if your using auto
oh thx
Hold the bus... I just used image2image to merge them...
This just might be the answer, more work... but worth it?
maybe make them each a style and try merging styles at generation time?
but that is great, it's like elevated illustration, or warm soft cgi
how did you do it just added the 3D image with the 2D prompts ?
I took the AnythingV3 generated anime image, and made it the initialization image and then generated this new image using the prompt and settings from the 3D style second image using the 3DKX model. Magic happened.
The final render is with the 3DKX model
I had no idea this model existed ty 
It's a new one
let me find the link
The point of this one is to have two styles, either 3D render or cartoon (not anime). It has no artist names or artist styles... it's designed to be artist ethical. I wanted to try it because I already do no use artist names or styles in my prompts out of consideration for artists.
(I'm a published game dev so I care about such things)
Where did it learn the art style if there were no names or styles to train it though?
Names makes sense, but it has to have something to look at
it is trained on 3D renders
not 2d art
I'm not sure if the renders are solely from the creators though.
but they seemed very concerned with ethics, so I imagine they only used what they had permission for.
since it's a ckpt, I imagine that 1.5 is the base, but I don't know what the mix is.
It looks really good, I won't assume that they didn't have permission though after reading that page, they put a lot of work in
yes
One amazing thing about the model is that it works really well in the 1152x768 -> 2048x1024 rez range (you can run it lower than that of course, but that's the optimal range). At that rez faces look great, I've turned the face fix off in my renders with it.
That rez is amazing for a 1.5 based model. Hands are pretty good too.
Awesome, I'll check it out!
A Parisian goat wearing a beret and holding baguettes.
Dall-E made me three dogs, and this:
I bet you can do better
On a arreté depuis longtemps les berets, ça ne va pas avec les oreilles
Waouh c'est super!
You forgot one more meme.... a mime
jffjd
I asked for a man with a shadow in the shape of a lion and I got a duck :/
LOL
I don’t
lmao
it gave me a wireframe when I tried using the word 3d model in the prompt
nice
I've been creating race car liveries with AI and have had some pretty cool results. Just wanted to share one of my favorites so far.
That's pretty sweet
generating warriors, I'd like to see some field experts assess some of this AI armor. It would amuse me.
though, in terms of nonsense, I think AI melee weapons are the best.
Reminds me of a certain scene from devilman crybaby.
||testing||
Why does installing CUDA always take like 5 days
1.5 top vs v2 bottom.
What a time to be alive! Just think what it will generate two papers down the line?
Really loving what dreamlike photoreal 2 model card is making here....so far my favorite model.
v1.5 vs v2 vs 2.1 from top to bottom.
What's going on? it's getting worse with each version.
--
--
use more negatives! negatives solves everything!
Delhi in snow fall
it is. emad and co are too afraid of the people saying someone will use it to make CP or something
nevermind of course that the AI-haters want all the tech shut down, and they're making up a ton of shit
just fuckin' shove back in the porn and the artwork into the training data already, it's getting kinda tiring waiting on weeks and months of "but it runs mechanically better, guys"
like, whatever the fuck we do, twitter artists are gonna hate us anyways, why the fuck we gotta sit here with a bot model that can't make anything interesting? it's fucking boring, and 2.x is not ready yet but in such a way they delibraetly chose to break it
it has been outright acknowledged and stated 2.x does not have its full training dataset, and it's showing, and the longer that missing data is not put in, the longer any potential bugs have yet to be found
Some of the training data for fantasy_diffusion V2.5
Guys, can anyone tell me how i am supposed to get these LORA models?
I tried to follow a tutorial from "Nerdy Rodent", but I don't understand how or is supposed to have them...
https://www.youtube.com/watch?v=gw2XQ8HKTAI
Do we have this capability in SD somewhere yet
"General use model" (actual name, or something like that lol).
62 samples SDE sampler with princess embedding and a negative one I can't recall. Happy to look up details if anyone cares 🙏🤘☮️
Some results from Fantasy AI. Which is built on top of stable diffusion. Currently it's under development
The Everything 3.0 model really needs to add more non-female character images to its training set
I get nearly perfect female characters, everything else is struggling
Metaverse
MJ?
why did their account get suspended
oh
164 votes and 141 comments so far on Reddit
what the hell is this
I checked it out and goes with a Black Only mod
oh nice, of course they aren't going to tell what exactly did he violate, right? 🤔
check the reddit thread i linked, may provide some insight
That's all speculation. Github should be more specific than TOS violation. especially when so many are using this repo
no insight there, just the same conspiracy theories about "racism"
literally every service should be more specifiic
True
more speculation
and "yellow" only https://steamcommunity.com/sharedfiles/filedetails/?id=1520519582
prints are more reliable?
do you print them on paper?
@gray citrus
Damn! That's heavy! You sure that's by him? Does he have a team and sone one else could have done that? if it's him that would be very bad for A1111
It's his repositories and he alone made commits (afaik, I don't think there's an accessible archive page of those repos)
I don't think it's a "smoking bullet" since there's still discussion about what the mods actually did, but it's not a good look 🤷♂️
portrait of Beautiful hands holding deep floyd in moon light ,fantasy, intricate, elegant, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha
The 22 million strong Reddit r/Art community is currently on fire, and the mods are just digging deeper and deeper holes: https://i.imgur.com/GhTzyGv.png All because the mods are banning anyone who's art looks remotely like "ai art"
a cloud
I love the variation you can get out of a single prompt, these generated in the same batch:
Barbiecore with the coming glitch embedding
r/art mod acting like the Empire 💀
Just casually shooting down anything with a slightly unique stroke even if its real
This is the endgame of people who are fervently against AI art. Legitimate artists will be caught in the crossfire, artists will stop folks with the madness, and it's just fingerpointing/witch-hunts instead of, IDK, enjoying good art and arguing about taste.
No. Protogen
funny some peole down vote my post on reddit about my woke edition of floral marble. I even get -1 karma (no idea what that is)
I think it is time to start the Dictatorship of Woke art on SD...
i have been designing some tokens and interested in hearing other people's thoughts on them.
wtf happened?
playing with using fields of noise to try to influence framing of image generation of img2img. not sure it's really making a difference. Example:
F*ck microsoft 😡 🤬 😭
Nice idea! 💡 , they look great 🤗
They just self destructed, we didn’t even do anything
You would need the noise to be leaning towards the Color Of the subject no?
possibly, I need to start over prompt-wise. I'm over constrained because I started this on the tail of another project where the subject kept having arms over the head. At least her arms are down.
At this point they are fields of RGB noise with the only difference being the average value so they ought to be rather "color neutral" to a computer.
Funny thought: make a basic 3d rig, have ai generated materials, to get your framing and style
I have thought of setting something like that up in blender
if I can get good control of poses, it'd definitely be worth the work.
Have you tried just plain black and white colors to see if that makes any difference compared to the noise?
not yet, but that'd be a good comparison
SD actually can pick up the fact that there's noise in the image. When I used inpaint to extend the dataset images (instead of cropping them) during training, I expected the model to just "miss" that fact, because to human eye they are difficult to notice. But no, it did notice the extensions and the resulting edges were warped. It's really weird how it all works
pictured, a pitt stop
Definitely making progress with this img2img noise thing. Using a rough sketch and varying degrees of noise against a baseline. Noise does seem to grant freedom to interpret.
baseline txt to image generation, 3 samples:
with sketch black and white:
with sketch over noise:
I'm definitely seeing more control over framing and pose, and the noise appears to give SD freedom to interpret as negative space to insert more detail
I'd include the sketch templates used but my amateurish charcoal sketch is being accused of being "explicit" 🤷🏻♂️ and I don't want to get blocked
Lets try it this way
concerning or nah?
They already are draconian, with filters on everything. They probably were hoping to go that route.
Sure but will it not get worse
Youtube is now banning even cartoon violence
Corporatization seems to just be a slow-boil censorship machine
Wow, that's going to kill animators
Like wipe an entire category out
When you're beholden to stockholders, that could lead to more exploitation and predatory practices in their products, as well as restrictions if needed on whims
Not saying all stockholders will do that, but it's common enough to see that when that happens
It will be less likely they release gpt-3 opensource when gpt-4 comes out if they sell too
they did opensource gpt-2 model after gpt-3 came out
They sold out during GPT-3, cause if they do GPT-4, it doesn't sound like they're giving out GPT-3.
Also, there's only a handful of companies that would actually have the resources to run GPT-3 if it is ever given out
GPT-3 was selected beta. They didn’t want anyone to use it.
sadge
Does anyone know of a colab notebook (besides thelastben's automatic1111) that allows training textual inversion with the 2.1 model?
dont link a screenshot of words without source material lol, that's irritating 😄
its a sad state of affairs there lately. The original goon squad that rolls the same turf as A1111 has been coming aroound the sub to deny his RW mods are of any concern at all. Your post might've just had bad timing. It truly is a beautiful adaption of a great model. You nailed the goal you were after
Heya, do you know of some objective sources of info regarding AI art? This girl wanted to research into it and I provided some of my research and will provide more, but I needed some more varied sources regarding the topic.
I got it from a screenshot of words originally I couldn't do more sorry
more testing, a comparison of identical settings with white contrast figure on dark rgb noise (generated random values from 0-200) vs same contrast mask over black. The noise creates room for interpretation in generation, whereas black (or white) values block interpretation.
it was WSJ and their source is "the people"
🙂 idc either way, i reckon just from selling chatgpt tokens they'd make ~£200/year a user, give or take whales
anyway all hearsay gossipy stuff
sick!
that's next level.
thanks. which ones do you think are best?
thanks. do you have any favorites? i've been making more every day. im not sure what people will like vs what i like
Protogen blowing me away too. Apparently I was a boomer just using 2.0 🙃
very nice
see this? i keep seeing this whole one eye is a spiral thing in midjourney/openjourney renders. is it some kind of a watermark in the model or just one particularly prolific style by some artist that keeps turning up?
I created panoramas in @diffusionbee then loaded them up into @GravitySketch on @MetaQuestVR
First attempt at doing this but the results are interesting and it has me thinking about other potential generative #AI & #VR experiments to try
This time tried to intentionally make these faces a bit out of proportion for aesthetic purposes
hey, how does embedding work? Do you train the AI on some "glitchy" images?
@boreal crown its the model getting confused on what to do with the eyes, different looking eyes constantly shifting it around, sometimes that part of the latent space just ends up mid way kind of
These are epic!
ok
so
i realize
this is kind of innapropriate to ask
but has anyone tried training the model on pornstars
like their ig stuff
not their dirty stuff
like stuff off their ig
that can actually be acceptable
i.e : lana rhoades
anybody?
I haven't but have trained on stuff from other people's IG. What are you trying to see if it could do?
I liked them all, but the one I like the most is the third one 🤗😊
So cuteeeee 😍
ta
playing around with some stuff, getting awesome results
Hello everyone, I wanted to ask if anyone had any advice on making combat scenes centered around a character
I think so,
Fire it up and see if it works.
Just to make sure, which one do I open?
The one I was opening before didn't work when I tried, just wanna make sure I'm opening the right one
webui-user.bat
okay
then browse to the specified address with your browser of choice.
Loading it now
its not loading the other model we just downloaded
its loading the 1-4 other model
should i go back and delete that one?
Nope.
is the model in the dropdown in the upper left?
how much vram do you have?
Not sure
alright downloading it now
in the same folder as the models.
Restart and see if it will let you grab it from the drop down.
768x768 is better for photography since photography is resolution dependent, but you can get reasonably close with 512 and then upscale it afterwards.
same problem
is your automatic1111 upto date?
That is...something
not sure, testing something from the link it gave me
says the yaml and checkpoint have to have the same name
yes they do.
Yaml I uploaded has the same name as my checkpoint file.
v2-1_512-ema-pruned.ckpt
think its gonna work now
that was the problem before, the checkpoint i downloaded had a different name than yours
its downloading things now
one day in the not too distant future all this will just be a program like krita or photoshop.
in the meantime Our various installs require regularly scheduled open heart surgery
Now im just running into the wonderful problem of clicking generate and nothing happening LOL
check the command prompt running the server, it will usually have something,
Nothing changed since it loaded the model
ope
may have fixed it
think the cmd just got stuck
were making good progress
hey thats progress!
Thank you again!
ignore voynichizer.... That embedding isn't done cooking yet....
I mean you skipped from 1.4, so it's going to be a pretty big jump
1.4 to 1.5 was huge.
2.0 release was very controversial.
why?
It excluded smut and names of living artists in training
That meant it was bad at generating smut, and people couldn't use 'in the style of greg'. Both good ethical changes, but not terribly popular in the short term.
That's what caused unstable diffusion to do their whoel song and dance.
part of the quick photoshop job I did, using img2img
its also part of the pose the girl in the original image was doing
makes sense. If you have a second setup for invokeAI you could use unified canvas masking to specify the area that needs fixing, and reroll it a few hundred times.
but then you have to maintain a whole second setup and shuffle things back and forth.🤣
Eh, it I wanted to fix that stuff I really good just with photoshop
I was more so looking for just photorealism in general
Then I could manipulate what I wanted
Too busy getting CUDA out of memory :/
yeah, thats the other thing that needs to happen- all of this jammed into photoshop/Krita plugins
true
I have 24gb of vram... it's still not enough, I still hit cuda oom walls.
Realism way easier with just text and no prior image lol
dawww
I have downloaded an upscaler pth file. How do I get it to show in this dowpdown menu? I tried saving to ...models/stable-diffusion/ but wont appear
anyone know what prior loss weight does?
Hi everyone, Stable Diffusion disrupts the masked area when I use inpainting. What am I supposed to do in order the keep the masked area as it is?
this is the original image
this is the result
you can see that numbers are disrupted
i never had that problem before, maybe your blur mask is too high?
Actually, I'm using it in collab and masking it with a program that I coded. I don't know if it is the problem. It masks the object with black and fills the rest of the image with white (as written in the documentation). Adding the original image without background on the result might be a solution. But I couldn't figure what causes to this error.
GLUBO
PSA: Don't go to the automatic1111 discussion page unless you want to be exposed to some proper morons
@zealous stag
Will SD implement text generation like StabilityAI?
This is pretty awesome.
thanks :)
automatic111. when making pics, i want more zoom in the right generated image.
cliking it once doesnt zoom in enough
SD is a product of StabilityAI, it will be implemented
when? we dont know
I came to know about this feature because of Shakentov
It's a text generator to image from Russia. It seems SD and them are working together
yeah, its gonna be super cool
smh quality of stabledif generated image is not close to the left which is made in midjourney, am i doing smt wrong or have to set some special settings to get that kind of image? can anyone help
(its same results when i copy exact prompts i used in midjourney with text2image)
midjourney does a lot more behind the scenes than stable diffusion
hmm
i mean ive seen crazy works with stable dif, i expect to be able to pull the same or better result
you need more work, but yeah, with stable diffusion you got a lot more control of what's happening :D
i was juist wonderingif there is any setting or check boxes that i need to turn on to be able to do that kind of 2d style you know
no, there's no such thing. The closes you'd come is having specific models, hypernetworks, and/or embeddings
other than writing very specific prompts that is
how's it coming along?
it looks great!
It’s actually already done, I’m just working on the presentation.
nice dude, I'm looking forward to seeing it!
hours of playing with settings/inputs/models and I think I'm starting to get the hang of sd
Currently training something called Sci-Fi Diffusion
Top ones are either SD 1.5 or 2.1, bottom ones are my Sci-Fi diffusion
based on SD 1.5, currently at 2 epochs on a 26K+ image dataset. 640 resolution
@pallid relic @gloomy elbow @sacred spruce
Still WIP, will do more epochs
@dense tapir
bangkok
Is there a way to Relight this photo? I changed the sky and have now tried dozens of prompts and settings to get the sun to actually cast shadows, but nothing works. Any ideas?
maybe through some img2img sorcery, but making a building cast a shadow where there is none in the latent space to begin with seems out of the scope of what SD can do. might need some 3rd party apps to get that done. SD gives a good base, but hand painting in photoshop or whatever to get very specific things into a very specific image is inevitable right now.
inpainting can get you pretty far, though.
How would that work with shadows?
Not sure. Give it a shot.
Trying to create an image similar to this but with no character that I can 'attach' to the right side for a widescreen wallpaper. It's tough!
tried like "inside a star wars space station hallway" and such, but nothing good so far
the idea being that I could then inpaint the seam between them
i would smear the background into the character in photoshop and use inpainting to fill
yeah might be worth a shot
gotta denoise to just the right level to make it different enough afterward
trying to render at ultrawide is just too tough with characters... dupes galore. 🙂
did you try inpainting the area only with "only masked" ?
this makes it not take the rest of the image into context
what's your model and prompt?
weird, all the anime art and non anime art ive seen are so clean and detailed, any i generate with either Stablediff or waifu diffusion all turn out weird AF, weird faces weird bodies. details are not even close to what i see ppl generate with stable diff
smt like this for example. holy sht. can i see ur promts and setting pls to get an idea
it's one of the protogen models. I'm just starting with it.. still learning how it works
I do not understand what you are saying. Please rewrite using complete words and sentences.
can i please see your prompts and setting for that image
i didn't think it was that difficult, i kinda changed the original though
but the steps was just masking the character and use fill for masked content (i believe this does the same as i described earlier)
and then simplify your prompts with something like detailed wall inside of a spaceship with glowing yellow lights, bokeh, canon 5d
It's a bit too similar to tack on to the right
once it looks okay filled in, i did some img2img to make it a bit more consistent
I'm trying to make a widescreen wallpaper of it, know what I mean?
sorry but how does that work? i see some numbers only
like, 2 monitor. 3840x1080
it should be in the pnginfo, no?
ahh, you kinda want the camera to be further away i guess?
or like super wide
its like. this
i can't really generate high res images as it would just crash my pc
Like at first, I was trying to blend two character scenes, but it was very difficult to match the colors and angles...
kinda sounds like you want outpainting
And I didn't really want to characters anyways.... so I thought, extending the wall might work.
well, maybe, but outpainting is like 256 pixels at a time, so I'd be here for a month 😄
You need to read the png parameters metadata
So I'm trying a massive inpainting technique instead....
so far thinking this might work
did you try selecting both characters and inpainting the background only?
i think it looks alright
Taking this monstrosity, but masked out all characters on the right and just left the one on the left. Then inpainting over it to get a scene...
worked thank you
getting closer to something usable...
well that came out pretty darn good!! what did you do here? you repainted the whole background instead of just the seam?
too many prompts, so you do need a looot of prompts to get a good result? like +10 lines
yeah
nice idea
I mean, the composition of the characters is still weird lol, but more about the method
instead of painstakingly try to bridge the seam
nice
that's working well
random question - can the mask color be changed in the simple tools?
lol
i think there was an extension that lets you change some color with regard to masking, but i tend to just use gimp
oh I didn't realize you could mask externally
ah i don't think the mask can have color
What are you using for character isolation on your masking in gimp?
sooo i pasted all the prompts 
I have the affinity products, but I always go back to gimp since I"ve been using it for like 20 years. 😄
i don't, i find it not to be so useful to mask characters precicely
but i was talking about the underlying color, the mask itself can't have colors
huh? you lost me. it's just one prompt
it only have varying transparency
whats this then 
It's a single prompt. With negative prompt. And the associated settings.
you'd just end up with weird edges, it's better to use blur mask instead to get a little bit outside the edges
awwwwh
i see
she kinda looks like the young and old version at the same time lol
she has those old cheeks but the rest of the face is young
yeah definitely not my best character, just testing some methods haha
i think i'm gonna play around with your method to bridge the two images though... I can work with higher-res from the start that way
Final upscaled if someone wants it...
its the first time i see a more realistic and nice image made by stablediff xD thanks
hmm what model are you using?
it's kinda interesting though, you get both versions of carrie fisher in one
that'll do it.
lemme try a waifu version
dangit there's always something else to try 😐
i got stuff to do lol
Ok back to the other thing now 😄
Gonna try working with this one now.
the mask can't have color, as it's a mask, but the mask can be represented by different colors
man it's incredible, how do you manage to have such a similar character, and such a clean scenery at this ratio? What model, prompt? A particular embedding?
Your decor is incredible too, there's nothing bullshit
trying a new model
@waxen bramble
?
.
Yes. I don't understand what you want me to say?
no like, opinion?
Am I to guess?
:c
Everydream
(finetuning)
on Runpod RTX A5000
Absolutely no experience on that.
i didnt too lol
They look fairly good, but bit chaotic. Like you can see that the AI can't understand something in the first city picture.
Probably you end up getting basically your output images.
In base finetuning of the model one epoch already does wonders with big dataset
You'd want to change your dataset if you keep trainig
tru
Because remember that the training basically says "Make these images. Save the context of these images".
Like If you want to really finetune. Youd start from the SMALLEST well defined simple dataset you can. Then expand them to have more context and more complex things.
Yeah but... you had what 52.000 images shown to it.
Individually
It learned context from all of those.
If they share themes can context, they got reiforced.
Basically there are two ways you can approach things.
My fantasy diffusion dataset is like 100 images and it works well
One from broad to specific, or from specific to broad
Like fine tuning on people. First you'd need to just show them what people look like.
Then what they look like in different poses.
Then in different conditions.
OR!
You can start from a mess of people, and then train the model to do ONE person really well.
Depends on what you want
Thanks, and I'm certainly not one of the pros around here. haha. All the info should be in the embed. I was messing with the new protogen model on this one.. still learning it. But the standard SD models should be pretty similar with the right prompts. No embeddings.

