#🏞|general-with-images
1 messages · Page 113 of 1
i installed a model and try to make a picture but it seems like no matter what model i use stable diffusion uses its own model and the output is bad, in the process of creating it looks good and colorful but when its completed the colors look bad, can somene help me to solve this thank you.
that looks like a screen shot from an actual anime. are you using A1111? what model have you loaded?
i am using A1111 i guess, downloded from a title, “install and run on amd gpus”. With all models i get same outputs, this one was abbysorange i guess
seems very odd you get the same output. are you using the txt2img tab?
yes i am using that one
by saying same output i mean like similar pictures, washed out colors, deformed faces
try different settings for sampler, steps, cfg, also enable the hires fix and try different settings for that, but change from latent upscale to something else like lanczos
are you using a negative prompt?
see what the model creator used in their showcase images for negative and try those
i use the exact same settings with the model creator, checkpoints, loras, and textual inversions, tried changing every setting, cfg sampling method, hiresfix, etc
I am using RX6600 8gb gpu by the way @sterile temple but as i said i once run stable diffusion without any issues
gpu shouldn't affect the final result, just how long it takes
how about the vae? did you download the vae-ft-mse-840000-ema-pruned.ckpt ?
just downloaded and tried it, didnt help. The picture is about the come perfect but when it finishes producing it looks terrible
actually, if you are using an anime model, there is probably a vae that is better suited for that
just used realisticvision and i get a perfect image thank you, but why do i get bad images with anime models what should i do with them.
try the vae they made for orangemix
I have tried LCM-LoRA Weights, and I am amazed
Steps: 6, Sampler: UniPC, CFG scale: 1.5 😱
still the same problem :/
I think the previews actually might be a bit tinted towards red and orange, I noticed this myself when i thought the preview looked good, but the final result was more brown/black where the orange was
you could try adding 'warm lighting' to the positive and maybe 'desaturated, dull' to the negative
but looking at the pictures what others made with this model, my outputs are far worse than theirs
i've tried all the samplers, dpm++ 2sa seems to be the best
dunno if that's the case for sd1.5 lcm lora tho
for 1.5 unipc works great
Guys which negative prompts i need for that human hands can anyone help me on that ?
(((human-hands-anatomy, human-fingers-anatomy, detailed-fingers)))
maybe... you can try
or just add "furry_hands" into your positive prompt. or Anthropomorphic_hands
maybe both?
Or something like (paws) ^-^
Hey
I got error when I try to install dreambooth and try to run webui, it says "CUDA... bitsandbytes" something like that. I use Windows11
Does anybody know how to solveit?
you know how much i sacrificed
if i want to create fan art of anime character what is best model to use
stillllll wondering what stable diffusion 1.6 is. Theres a lot of cross signalling here since automatic1111 webui has it's current release called version 1.6
HI everyone
I am new in the field of stablediffusion,
I am seeing this thing when one can use the image nd change the outfit or style an all, lets say i have my image and i just want to make it like caption America clothing, same image or lets say
change the background or look,
please guide me through what should i check or learn, what models i should use , what prompt i should use ?
Those short prompts, "space face"
Create a scenic background with a Mayan cultural atmosphere. Incorporate rolling hills with a mix of green grass and wildflowers. Use gentle, fluid brushstrokes to evoke a sense of movement and texture. Integrate elements of traditional Mayan architecture. Include grazing cattle and sheep in the scene, and consider adding some trees or shrubs to provide shade and shelter. Employ warm, earthy tones to evoke a sense of history and tradition. To enhance the overall mood of the artwork, consider adding some atmospheric effects such as a sunset or light mist. Use subtle color gradients and soft brushstrokes to create a sense of depth and distance.
space face
generate image:
They're beautiful. Very well done. Love how they turned out. Good repition on the fur.
What was your process/key words for these?
Well my prompts always end up being pretty verbose :D
And its a two pass text2img first generating with the indigofurrymix v90 hybrid and the second pass with the indigofurrymix v80 realistic. The fact that the same model comes in different styles basically enables this, since they have similar behaviour with the same key words
And I like the composition and stuff from the hybrid most, and basically use the realistic just for stuff like fur texture ^-^
Interesting! I have fun making up some verbose prompts--there are a lot of catagories you can mix and match in surprising ways.
Making/drwing furries is fun--I can't say that I've picked up much on the other furry models--but is there anything that you're looking for, furry wise, that you haven't seen that you're interested in?
(I haven't tried either model)
I find that at least the models I am using work best with verbose prompts. It actually reduces stuff like deformities n such
If you look at my guide, you'll find more info, at least as far as SDXL goes
Heh. Indigo furry mix has not updated to XL yet, but they are basically on the verge of it
I'll have to check it out!
I always love seeing what people create.
And I love sharing it ^-^
I just dont want to spam a chat with all the good stuff :D
Look at this floofy doggo :O
I bet hes a good boy
I love the skull head going on! The mist looks great, and the body has a lot of good form. Love the atmosphere, too!
Thanks ^-^
I think thats just a lot of prompt crafting praxis paying off :O
Oh but stuff like anatomy and clothing, as well as the detailed backgrounds is definetely the model. My contribution is mostly stuff like composition and such :O
thats weurd
Dude, foqqin right?
12 steps. un fucqkqqing real.
And this is just a reg I barely paid any attention to prompting.
x2, its so much faster
8 steps with a 3gb vram gpu lmao
Changes the whole game. It's ridiculous.
It also uses less vram, withouth it the max I can get is 850x850, now I can use 1024x1024
Yeah Kohya is garbage, but I just got my first sample off my XL model. This will be an awesome thing in 24 hours.
If it doesn't crash.
A photography logo in a minimalist flat style featuring the arrangement of 'EJAY' to form the design.
A loser, (((no idea what he's doing))), LORA:areyoufuckingkiddingme:3.5
the crashing is a driver issue. bsods always are. kohya is just maxing your card so hard that minor conflicts that normally pass just stack and stack and stack
training is a sensitive proces because it's non stop
pytorch is also one of those libraries that just reaches into the guts of the driver and ugggngnn, y'know?
I know it. I'm trying to figure out how to revive savestates in Kohya, but, like EJAY, I am also not smart.
yeah i haven't got that figured out gracefully yet either. i just tell it to start from a sample safetensor that showed up
How?
you finetuning your model or doing dreambooth? lora?
i know my way around the lora menu, lemme look at db
It's a massive-ass LoRA. Maxing image count.
ahh. lora good. i know this
that LoRA network weights is where you put the most recent saved weights
that is the tempo
Oh, shit. I've been staring at it this whole time.
set it like you're starting over i guess. if it already done 5 epochs then subtract 5, or whatever
Perfect. Thank you, sir. You are a gentleman(lady?) and a scholar.
i've paused and gone higher resolution for another few epochs too sometimes. works well
if i were a chick i wouldn't be no lady. probably.
Me neither. Ooh. Sample 2 dropped.
Most cohesive gif I have done so far - enigmatic_e workflow for the win! Going to roop my kid's face on there and see how that looks.
Breakfast in the very beautiful garden on the farm
#1100170312106127410 the bot channels are for generating images
First project completed with Stable Diffusion + Deforum https://youtu.be/lroYQXvgobU
Premier projet Stable Diffusion + Deforum terminé.
5876 images
13fps
Musique par Astrix : https://www.youtube.com/watch?v=ASR6R8COgk0
Hi everyone!
I want to generate consistent images of this character (different poses and angles), but i have only one image.
What do you recommend?
Reactor
in A1111

training 5k images?
hope you're saving those cached latents
14 hours of preprocessing could be annoying to do a second time
its preprocessing its also going way faster then the ETA
age old issue
RN its at 1 hour ETA
not using kohya though? kohya got that cool full bf16 support
@proud dagger these are the settings
it seems to be making something good
prompt: a painting of a man wearing glasses and headphones in a room with a wall in the background and a door in the background, art by SaRA all
seems to include the model name(SaRA all)
but does do a decend job for a heavly undertrained model
Thank you.
Reactor doesn't recognize this face. However, when it does work with other cyborg-like faces, it doesn't maintain the 'texture' of the face.
slowly its improving
face wise the model is still failing but look at it's enviroments
@limpid lichen
IP Adapters
POV: your LoRA is being generated and you see the avr_loss drop to the lowest of all the LoRAs you changed
spooky
i am trying to use SDXL model and I am getting this images.
any idea how to fix it?
You need to use the SDXL VAE and not a 1.5 VAE.
I did set automatic on vae selection and I have both versions.
Try explicitly setting the SDXL VAE.
Yo, I have a 4Gb rtx 3050 grahpics card on my laptop
is that good enough to run stable diffusion
16 gb ram
Why do I get that out of memeory error?
Aaa okey, then paste "--xformers --medvram" after commandline_args @lone cloak
K got it
Tell me if it worked when you try it again🤗
K thank u very much
Oh, it seems I got a new problem
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
Time taken: 0.0 sec.
I recently reset my laptop, could that have to do something with it?
I mean the gpu hadels blender and other software just fine so ig it is a stable diffusion thing
You have installed the gpu drivers right? @lone cloak
Idk how do I cheak that?
Oof I searched the error and idk if it was because of the drivers 🤔
Did you just downloaded the automatic1111 zip, extract it, put the models on the models/stable diffusion folder and opened the webui? @lone cloak
Also try opening the webui.bat, not the webui_user.bat, just to see if it works
I just cheaked and I do have the driver
Idk if it is the latest version
Its not the latest but that version works fine, I think I have the same
Try opening the webui, instead of webui_user to see if it works 👀
K I'll do that
Nope, we are back to the first error
If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
What about this statement at the end that it shows?
It shows the same thing when I run the bat file
at the end
Wait hold on, I did not edit the weui bat file, I just did that with the webui_user bat file
let me do that
This looks more like this ss
Yeah, I never really use the webui_user, just regular webui
Could u cheak and tell me that in your webui_user file, do u have --xformer --medvram statement?
Or is it jus there in the webui file
Could the problem be with the model that I am using?
I am using the juggernaut_xl something
Aaa a sdxl model...oof i only tested 1.5 models
Should I change the model and try?
This is my webui
The green line is what is activated, the red one is just if I wanna test it (but its deactivated)
Oh the problem seems to be with that specific model as when I changed it and tried it, it works
!
Thank you very much for you help tho
np 🤗
I did a quick test with around 500
Then I pushed it to 2080 ig and got the error again...🥲
LMAO 2080
2048*
Try idk 720 or up to 1024
Yaay!
with a sample of 20
yo so what model should I use as a begineer
I want to have diverse range and just try things out keeping stuff loki
You want to use sd1.5 or sdxl models? or both? I dont know much about sdxl because I haven´t tested, but I usually use this one
This is a realistic model, but there are a lot of different ones like anime or more drawing style or things like that
I am new so I don't know the difference
Not very good
lol
I made this in blender
This are some examples of generations with the model that I used
Nice, you can do img2img with this pic
Oh yeah, good suggestion
What does the cfg do?
Its how hard or close sd follows the prompt (the text)
ohhh
is this normal speed? my gpu is 4090
Which model did you use @pallid ruin ?
Its epicphotogasm_lastUnicorn
Is this sdxl or sd 1.5? and what resolution and steps? 👀
I´ve seen someone on reddit using the same gpu and it took them like 30 secs to generate but im not very sure U.U
Maybe you can ask on #✨|sdxl channel @jovial saddle
?
Sorry wrong ping
Oh you're cool
I finally got around to replaying Gof of War Ragnarok NG+.
I mean, replaying GoW:R via New Game Plus.
😁 👏
KRatos and Thanos fighting over the last bowl of Macaroni and Cheese
👀 👀 👀
How is your setup? you run SD on the rtx gpu and then another installation/model on the amd one? @exotic yarrow 👀 😊
What is the prompt for images not videos
its a laptop with Both GPUs in it
Im not sure, I tried with "/ dream" and the output was a video but other people used the same and they got an image 🤔
Type dream then backspace the prompt
You have four options, aspect ratio, style, ect then pick prompt last
How to create a prompt to generate a shape without any light effects and black background?
Really I'm asking for the simplest thing that AI could ever generate and it still tries to add some effects to the background.
It's pretty much the same for all models I have.
Prompt: dirty random shape of blue crystal, animation, pure black background, solid color background
Negative prompt: light, glowing, mirror reflection, light reflection, light effects, shadows
Result:
I never been so disappointed with AI in general.
It's pretty much impossible, including extension for a1111.
I managed to remove background with https://github.com/danielgatis/rembg
I'm making a mindmap of generative images stuff, what would you add to it?
Nice, I think node based interface could be added (like comfyui)
Guys is there any caps limit to a prompt?
I dont think so, I mean, the image probably wont be able to have all the things that you add in the prompt, but it wont give an error or something like that
The thing is that it remains there loading without giving me an error
Wait are you using the local version or the one in this discord?
Because I tried a 299 words (locally in automatic1111) and it worked
The one in discord
It worked for me @frail peak , maybe you should try the same prompt in other bot, (i mean from bot 1 to bot 10)
Thanks i will try
hi guys, does any one can help with this problem? I have enabled controlnet (had tried canny linear etc.) it seems working in the preview, but none of them finally impacting results, I had checked my extension etc, all enabled.. how can I solve this? thx
how the heck do you not get the grey screen
Idk 😁
THIS IS A NIGHTMARE!!!!!!!!!!!!!!!!!!!!!
Are you running the local version? @kindred depot
YES
i uninstalled it
i did EVERYTHING
switched versions!
DID DEBUGS
NOTHING!
IM LOSING IT
.........................
T H I S
STOOPID GREY SQUARE
every time i try to load a prompt i get this crap!
it always does this!
no matter WHAT prompt i use it does this!
In VAE
its on.. ;-;
@kindred depot In your webui.bat add this line
yee
same crap
another one
the one it comes with is the only one that works 😭
the other ones dont
im just stressed and i spent hours trying to fix it
Maybe its the model problem then @kindred depot
what do you use?
besides, how can it be a problem when it has 5 stars is the most downloaded and people use it for this site
ooo
I downloaded it from civitAI
yea but how do i run the anime module
i cant run it on the site
and when i run it on stable
IT BREAKS
But what model you downloaded?
this 😭
it allows sword generation
and it doesnt work for some reason
I´ll try it
k
It worked
1.2
But you gotta use another model
You downloaded this and put it in the lora folder, right? @kindred depot
🥺
Now download a checkpoint model and put it in the models/stable diffusion folder
what do you mean another model
The original 1.5 is a bit "old"
Yeah, but idk which one is a good model for 2d/anime
how did you run the weapon one?
leo?
Its a lora, not a checkpoint model, I ran the lora using the epicphotogasm_unicorn model
imma use that
@pallid ruin how do you download the epicphotogasm_unicorn model
C:
Here´s the link
Yes, you download it and put it in the models/stable diffusion folder
added 😄 thanks
I've enriched it a lot, does anybody see somethign else to add? (generative ai mindmap)
generative AI (Images, Sound, Video, websites, youtubers, 3d models, Text)
ctrl+wheel to zoom in and out
👏
so how do you run the sword thing with this
what do i do?
Click on lora
k
and click on the lora:Pecha_Swords_LORA_V1
@kindred depot did you put the lora here?
Aaaa you have to reload the webpage
f5
😁
Then click no the lora one time and add the prompt, something like this
@kindred depot if the lora still doesn´t appear just copypaste this prompt to test "lora:Pecha_Swords_LORA_V1.2:0.85 a sword made of water"
works
Yeeee
You put the lora in the stable diffusion folder and the stable diffusion checkpoint in the lora folder
you swapped places
😁
OOOH
Idk how it even worked lmao
so it all goes into the LORA folder!??!?
Nonono
just the lora goes in the lora folder
the stable diffusion model (the 2gb or more one) goes in the stable diffusion folder
.
omg im so STUPID
THIS WHOLE TIME
3 HOURS and realizing
i had to put that CRAP IN THE LORA FOLDER!?!?!?
🤣
I WAISTED 3 HOURS OF MY LIFE CAUSE OF A GOD DAMN FOLDER!?!?!?
Lmao the best thing is that your mistake worked for some reason
HAHAHAHAHAHA
it fucking works lmao
ok now i got this
show me a photo of your prompt
ok lets be quick i gtg soon
what did i do wrong?? 😭
im generating computer viruses
Oof
Delete all the prompt and write something simple like "a car"
to see if it works
k
a car
i got a fatal error
XD
WAIT
whats prompt do you use
or what stable defusion thing do u use
Stable Diffusion checkpoint
@pallid ruin
😭
😭
Try swapping the 1.5 pruned for the other model
ok
the epicphoto one
where do i get that?
white theme



when i come back
ill tell ya
where's your vae? 
is it true that models that are not .safetensors could run malicious code?
yes
but you should be fine unless you use a specific command line option, which does this: "Disable checking PyTorch models for malicious code." - I intentionally didn't write an option but just posted a description of it
generally, try to download safetensors version whenever it is possible
cuhmon now bro, cawmawn mein 😭
Yaay
It worked the image to prompt to image worked
Lmao never swap the positive to the negative prompt
What was the prompt? Singing apples?
I have such weird hobbies... I enjoy going to facebook AI art groups...
and finding AI pg13 sexy pics...
and converting them into violent robots.
acutally not sure how bannable even just underwear pics are, so here's just the final result, she had a midrift black tight top on and undies, before i made her more personable.
I think you guys may appreciate that more than they do when I post it in their thread... 😐
Anyone know what might be happening? The problem is it takes 100-300s to generate. I had one instance where it generated within 8 seconds and hadn't changed anything so I'm not sure.
Each time I hit generate, it asks "Requesting to load BaseModel"
I'm using DreamshaperV8.
did you try closing it and running it again?
sometimes gets stuck and needs a reboot
Good idea, though just tried it and its the same issue:
Here's the startup cmd output
yo dont have the embeddings but that shouldn't 1000x your render time
Might have figured out why: I had removed the upscale here and rerouted the blue spline to VAE encade since I thought I wouldn't need it as I didn't want to upscale the base image
Though the cost is that img2img no longer respects the input image's dimensions from what I can tell
Hello there. Can anyone help real quick? My images do not look good at all. I am new at stable diffusion so please bear with me. I dabbled in it on TensorArt and SeaArt and I find it amazing. However I want to try the local Stable Diffusion via Automatic1111. I am always using the NeverEnding Dream model and suffice it to say that my generated images look like sad waterpaints compared to amazing images I create on SeaArt or TensorArt. What am I doing wrong here?
Thanks. Will post comparison with the same prompt if anything is unclear.
expression wise yes
Nice👀 ,what model are you using?
about 4 models, 6 different loras, and tile upscale USDUS, about 8 inpainting cycles, downsample and upscale with foolhardy.
I generated with a merger i made with some kinda rendering looking scifi / horror models + epicphotogasm, then upscale with epicreality with tile and ultimate, then inpaint with I forget which, and focal loras like optical/jolt and perfect eyes, and a few others, and downsampled, to get rid of some of the weird aliasing between the inpaintings, and then sent to extra for one back up.
check that clip skip. make your life easier with different models, and put it on your menu in UI and add clipskip to the show on front page textbox on there.
settings>user interface>
Any ideas for how I can make this tail smaller? Image is perfect otherwise
I do not know why I am laughing so hard at that.
what was your prompt? may just need some adjustment.
try whatever you're using with the tail, lowering the weight,
(prompt blah blah fat tail:0.8) maybe would be my first try
for that segment
and if you're just typing "artstyle stuff, man wearing whatever, WITH A BIG MOFOING TAIL."
trying using lowercase letters 
Currently doing some training and I'm trying to look for the instance style for this image, does anyone know what it is?
hey so okay I figured it out, if you make the squirrel smaller, the tail isn't as large
I hope that helps
Jesus christ
what? thought you wanted it to appear smaller :\
here, in case when you're happy with the smaller tail, you wanted it to be robotic, to save you the trouble I did it for you.
Hello! Can anyone tell me what models and lora should i use to create same art?
Here the tail is much less thick now, is this more what you were thinking?
I mean, pixel art use a lora, or style controlnet t2i sometimes will do it, any of the manga anime checkpoints can probably do something close, and some prompts, whatever the ear things are called are probably in some of those anime datasets. lora for pixel art are numerous, and toss a little t2i style on it and you're probably there.

😮
Well I thought you'd like it. I give up.
not really hard to
I think it is a little better. Actually.
I'm sure @sullen onyx would agree
or maybe this?
ape
Hate to break this, but why is his thumbnails so off-putting, as if he's trying to be obnoxious with AI tech when I know that he's just informing new AI tech.
most of youtube thumbnails are like that now,he just need a big red circle and a screaming emoji to complete the pack
A1111/heavy inpainting, that last one was img2img with one of the majic mixes, and a comic model, and then another anime, cardos I think or that may have been the first and majic mix the last cycle. something like that I forget
before and after inpainting that is
This should be a game. Telephone on images.
Niice , it looks so detailed
Thanks, I use multiplke upscale / downscales with ulitimate SD upscaler / controlnet tile with varying >0.45 noise settings to add details but not detract from the original generation, and different models for each iteration, I use inpainting a lot. sometimes 10-20 different steps, with numerous iterations for each
When people complain that AI art is "just typing some stuff" I want to send them a 2 hour video of me making something lol.
That's just one I had in my extras since last time I archived stuff, from the final upscale, cos one image often has 2-3gb of intermediary step images / mask outputs lol so I delete all that stuff and keep the final upscale in extra.
so it is far from the best
🤗
Anyone have experience with DiscoDiffusion (video) that could help me debug this black void that takes over the top right and left sliver of my video frames?
It's never there on Frame 1 , but always appears on Frame 2 and then it uses it for the rest and renders random stuff inside it.
^ I thought maybe it was a contrast issue where it hit some threshold and just assumed black but changing settings around saturation/contrast doesn't seem to change it
Trying to Outpaint in A1111: Using poor man's outpaint, get this:
This is the original
how do I get it to blend, get rid of the line?
You can img2img or inpaint ...idk if there's an option to make outpaint more smooth in poor man's plugin
I was using https://www.painthua.com/ for outpaint, you can set "seam_fix_radius" there, it makes transition between new thing and old look better, but might change part of "initial" image
I think invoke ai has setting like that too, not sure, it's been some time
or just grab the base model of inpainting, and run over the line with it .
and kick the masked padding up to like 64~ or so and denoise around 50
3 ste[s with inpainting, just a default brush size over line, first prompt from left to right "a hallway floor" and then "pants and white wall" and then wall and floor. as prompts. simple, 30st/.5denoise
that shold fix it, here's how it looks now
@strange jungle see, line is gone
@strange jungle you can also get rid of the lines with some small simple tricks, like this
Oh I see. That's so much better.
and the face looks like a zombie face
no no, I just upscaled it, you can see more detail
here it is though
used inpainting as described above
sorry you didn't like my first attempts, I got it right in the end though, I hope, @strange jungle
Even more like a zombie
Well if you need that to be running, yes. cos clearly it halted.
the hell is wrong with you
is it possible to combine depth controlnet with the sdxl inpainting model?
create an image blending Evotech Fire & Security van with snowy UK road
I use it with sd1.5 as long as you use the sdxl depthmap I do not see why not. Give me a moment to actually look how the inpainting / controlnet stuff works, cos may need loopback for the masked area segment, if not full image.
i don't usually use it in comfy ui, so haven't dipped that deep into that particular.
but a1111, just enable the right models in inpainting mode it'll work
How to generate things like this?
how thanks, i try to run with cog in local but i don't have succes
I dunno, i just never has used it in that way, so really unsure 😦
well I use comfyui for sdxl cos a1111 is ridiculously slow, plus I like the workflow complement itself.
so I've not used sdxl at all with controlnet in a1111, nor depthmap + inpainting in sdxl, or any controlnet with inpainting tbh in comfy. But for a1111 it just works when using masked area inpainting, or whole image, it seems, for sd1.5
Hi friends ♥️
Hope it's okay I post this here - I'm a digital art curator with HUG. I've helped curate AI art into exhibits in 20+ countries in the past year, including AI Art exclusive exhibits in NYC & Paris Fashion Week!
My team at HUG is hosting a guided workshop with the Stability AI team after the new year!
We'll have live/recorded sessions on creativity & AI featuring firsthand guidance with Stability AI tools.
Registration is open. It's technically freeeee, you pay a deposit that you get back when you finish one task at the end of the course!
Learn more & register 😊 https://www.studios.thehug.xyz/lab
image with blue sky
Why is there always this sunglasses in the background? Also how do I show half of the body of my character?
what models would i have to use to get sometihng like this or similar
hey, i have two images one with a person smiling for example, or one eye closed and i have another image of a person
i want to swap the facial expression of the person to the other person. i dont want to swap the whole face
is there an extension for this? if someone could help me i could pay a tip
any critique ?https://www.phixiv.net/en/artworks/111756708
the negative prompt is VERY important, id put like "close-up,floating objects,blue skin," in the negative prompt
thank you so much for your reply man
What are typically seen as "good" upscale workflows for realism in comfyUI?
Go to civit.ai and search for sketch loras you don't need a model for that just a lora should be fine
Welcome! Start by heading over to #1072220168534642768 to get yourself situated and help find the channels you are looking for! Please make sure you review our #✍🏼|rules-and-tos and feel free to assign yourself some #👥|roles as well! Answer any questions your may have at our #1072229020520947753. There are many ways of accessing Stable Diffusion, take a look at #1080946152318443610 to start your journey!
Hello! I would like to use impaint mode on SD, while put Disclosure Face Effect on an original pic. Any advice?
use a vector art model
If you need hep with prompting, I suggest going to #📝|prompting-help but I would put "name wearing glasses," to help.
https://civitai.com/models/81125/oriental-giant-dragon or something else?
古老的巨龙,匍匐在东方的土地下,今天,它将冲天而起,展示其雄姿。触发词:long,no humans,dragon。 现在是广告时间,开了个群,大家有想炼的,可以在里面说,我看着会帮助炼一下,下面为链接: https://t.me/+GtikdYf3inUxNzll The ancient d...
dalle 3, prompt: A view from the ground looking up at a massive, ethereal Chinese dragon in the distant sky. The dragon's long, serpentine body snakes across the sky, with shimmering scales that catch the sunlight. Its flowing mane and elongated whiskers are highlighted against the bright blue sky, dotted with a few fluffy clouds. The dragon's eyes emit a mystical glow. The perspective is from the ground, with treetops or rooftops at the bottom of the frame, giving a sense of scale and emphasizing the dragon's grandeur. The scene is imbued with the mystique of ancient Chinese mythology.
You can try doing this with prompting as a base, using the bot in any of the bot channels. (Vector art+isometric keywords, and logos, and graphic design keywords will help with your first question.)
im testing differences using same prompt in local sd/dalle 3, later I send the comparitions
i think you quoted the wrong person 🙂
Hahahaha! Sorry about that. The first part was for you because I thought you were asking the question--I actually do have some prompts that are similar to the dragon above.
I was trying to respond to this #🏞|general-with-images message and explain the dragon
(This is what I was originally responding to, since I wasn't sure what you were responding to.) For a hand painted look, you might want to go with terms such as "a detailed painting, digital painting." I'd also use Chinese dragon, feral furry art, furry art, dragon art, and specify the color palette. The color palette should go towards the end of the sentence. I've also recommend, if you want to put the building at the bottom, that you put the specific wording about the type of building with words such as "in the background." For clouds, I would specify the time of day, and keywords such as puffy, etc. If you want to learn more about prompting, you can check out one of my WIP guides here: https://docs.google.com/document/d/1BxdWqfBJ3QPggHnBCBx3QIkUpHnBWShd_s3zEt2dTLM/edit#heading=h.po8dv7lcq48s
SUNNY’S SOLID PROMPTING LIST Ahahahahahahhahahaha, have fun :< Last Updated: 11/23/2023 Some of you might recall,not too long ago, that I made two installments of prompting for SDXL, with the first being here. This, dear friends, is the third, and I am about to open this jam jar right open and...
wow thx for that
np
can I download that? Also, do you possible have magical sleep's prompt book? Is that being updated?
the one ive is old
I no longer have the link 😐
I'm sure that you can download via Google Docs, but I have no idea about that person, as I'm not familiar with them, sorry.
yep
So cute!
Is there any prompt engineer or someone really good with ai that can send me a really good prompt any prompt cause I am trying with bard to replicate prompts but for others image like I'll say create a prompt for a tiger flying following this style and method of this prompt: prompt . But the thing is that it always ends up telling the story of the character and not describing it visually
all prompts in the dumb bing ai are "unsafe" wild
Did AI end up getting good at making hands with 6 fingers?
can't use words "scary", "horrifying"
Don't worry though. Here is one with 5 fingers!
Just not the proper order in height for two middle fingers
Hey folks! new the server, looking for some help
got automatic1111 to do auto update today. after the update, it starts looking weird
not sure if I have permission to post picture or not, but basically the button where it used to be "Inpaint masked" has became "Inpaint masked, Inpaint masked", and when selected, return errors ValueError: ['Inpaint not masked', 'Inpaint not masked'] is not in list
anyone has artemis mix??
really needed that model but why its archived on civitai
ok, this is officially insane
Steps: 8
Sampler: LCM
CFG scale: 1.7
Size: 768x1024
Model: photon_v1
Yeah, I got like 4 consecutive dogs (unsafe prompt)
batman fighting with an hamburguer (sd1.5)
can someone recommend some goodl SDXL lora similar to this one https://civitai.com/models/153562/detail-slider-lora
Spent the past little while working on this one. I think I'm satisfied after a lot of inpainting and outpainting.
I like the picture, even I do not like that glittery blue costume in general
Understandable, the costume actually just looks like that in some of the comics. Particular the variant cover art this was inspired by.
it is from new 52
Yeah, both New 52 and Rebirth had that effect going on.
i like this one, almost the same as in tv show
I installed steable diffusion but I don’t know where to start the application
Pls help me!
What version is it?
Try opening the "run"
Are you sure you installed it the correct way? @marsh tulip it should look like something like this
@marsh tulip Go to the automatic1111 repo and click download zip
Then extract it on a folder
thank you so much I will try this right away :))))
Okay so I just got GPT to do something really weird. I know I have never discussed with it. My certifications, and I tricked it into listing biographical data for an assumed name that used my first and last names. Letters, the name Robert fogerty which is not my name but it was able to were out of the air kind of pulled my age, my obsidian state, my zip code, It got my spouse name. Incorrect giving Emily, who I have no idea who that is, and when I had it switched from CSV output to biographical information. It listed a project that I just started working on if you weeks ago that I have never discussed anywhere. It's not that interesting to even do so but involves AI. It knew how long I'd worked in EMS, and other data which I had discussed with it involving my music production and machine learning stuff. All of this in a completely new session
And then I ask it to list. Robert fogerty's FEMA certifications
Just kinda. Wtf. Lol because I don't think those are actually listed anywhere and I definitely have never discussed those with him lol
But that's a list of well, at least the top level certifications with each set
As well as my national registry and some others
Admittedly this is my own information so I'm not as concerned as if it was somebody else's. But the accuracy is almost on point
It even had my zip code
So the suggestion that a new session with GPT is completely discrete is not at all true. Either that or it has access to information external to, which it must for the certificates
It's trained with external data and I could almost guarantee that data about those things is found on the web pretty easily.
I don't think my fema certs are available as such I asked it for more extensive information
Here's the thing, I'm not even using my real name. Just some biographical suggestions that are associated with me which would not at all allow lookup of such
As well I do not think FEMA publishes that info publicly and the lookup for EMS certificates requires EMS number, my state license you can lookup by name, but not my certificate itself
If I apply for a job I have to always give copies of my FEMA certifications to them, while my EMS certifications. I have to give them my EMS ID, so they can look it up and my state licenses. They just look up by name for employment. I don't think it has access to something that they don't. Because it would be much easier for them to just look it up as well.
The only public available info on this should be my EMS state licenses
are u american?
Yes I live here
yea thats probably why,there has been several data leaks tru years so maybe it was trained with one of those leaks
Yeah one of the certs is within the past year
some of them posted on pastebin or other paste sites so it probably trained on data from there,that would be my guess
My certifications? In PB?
well those leaks contain everything from medical data to education,job so maybe its that
Just assume that literally everything about everyone is available on the web somewhere at this point.
yea when the whole snowden files scandal happened,microsof was one of the big tech companies accused of collecting ppl data without consent,americans are a little bit more protected 🤏 but ppl from other countries are not
Yeah true. It knew when and where in Ukraine I was during the early war and what I did lol. And that was after it's training data period lol for 3.5
I mean I don't care , it isn't a secret but it's just weird imo also that it associated it with me even though I used a fake name for the hypothetical biographical formatting "exercise" I had it do
u could try to post this on twitter/reddit so other ppl know maybe u can make enough noise so microsoft acknowledges this
if i knew this i would prob send it to arstechnica or other known digital magazine maybe they will publish it
so other ppl know that microsoft is doing this
For them not being able to identify me even though my real name is my Twitter and I sent them my ID lol
X banner me
It's ridiculous. I mean I've had that Twitter account for like over 10 years, uses my email address that I've also had the entire time. My telephone number is associated which I've had 20 years and every time I file for recovery or to have it unbanned they say we cannot identify you. I sent them a copy of my passport card and they still even though my picture on my account is me and that's is my face. That's on my passport card because it is me they still say we are. Sorry we cannot identify you. I'm pretty sure Facebook must have bought them because that's how they act
Not that I'm so much missing anything now since it's gone to crap but weird nonetheless
Regardless of origin, to be honest, even if comes from other conversations, it acts as if each sections is discrete saying that new sessions are entirely separate from others, so in one case it has access to specific and associated information, which is may associate with me, cos of my login name (as my name initials and surname are in my email address and my real name as the name on it, and my account) which it associated with pseudonym robert fogarty in the exercise. or b) it is pulling from past conversations and maybe I mentioned something about the certs in a manner which implied I had them and it associated the cert type number with the fema identifier, but I have years of convos with it, since however long it has been released. just goofy stuff, but how it knows my city / location // zip code, and where i lived in ukraine also, is... beyond me, cos I know i did not tell it my zip code, it could have associated my AI stuff with the things I have talked about. I am thinking maybe B somehow and web training and associative derivation of information from conversations to values like zip for city, though my city has 8 zip codes, it knew mine which i know i never mentioned, and other things.
so there's something interesting here. but who knows. It's neat imo, don't care about the information being leaked to myself. it is not someone elses
Holy sh1t that sounds scary
I doubt it's just microsoft, for example,I looked up the just the ID of some friends on google and their full name and university major + degrees appears in a pdf on a university website that is not supposed to be accessible to everyone. Im not sure if @spark finch case its the same or if chatpgt could have used other past conversations to get that information
I´ll try to replicate it on chatgpt
💀 💀 💀 I searched my nick on google and some old shitty facebook games that I played like 8 years ago appear, with my location, nick of some of my friends that I played with and more stuff
yea facebook,google,microsoft those 3 never delete data they have about u
I just logged on facebook and removed every app that I used, maybe I should do the same with my old microsoft account
The funny thing is that I haven´t touched that thing in years and the info still online
Yeah but the thing is I didn't even tell it my name. I just used my first to initials and made a name up Robert Fogarty which is definitely not my name lol
hmm i don’t get the differences btwn loras and models, also i want able to find one like that
just need something that looks like a pencil sketch
got it to repeat the name fogerty over and over and over and then told it that it had spelled it incorrectly and then had it repeated over and over and over. And then I asked it to give it a first name Robert, and it began repeating that. And then after that I ask it to switch from the repeating formatting method and we were going to now exercise CSV formats. And I asked it to populate the fields with age, occupation, zip code, but I did each one and had it repeat the exercise three or four times before adding the next one and it kept filling it with correct information as if it was me
.
A lora is a low ranking adaptation of a model. Basically somewhat of a very refined miniature model that can't do things alone but it can be merged with the model and utilize the models base to apply. Its refined training. Alone it would be. I'm actually never tried to see if I could force load a Lora as a model, but I would wager that it would be like trying to generate images with an inpaint model only 100 times less capable at image generation. Just getting some really wonky output. Because it lacks the weights for probably most things outside of what it explicitly is for
There are tons of those on civitai
i don’t plan on making characters
wasn’t able to find one
Would work
Doesn't need to be a character
.. tell it to be monotone. Tailor your prompt and negs
Also sketch controlnet can further preprocess an image of you have a base image
And use with that it'll make it work well
what’s a control net
It's another type of model that sits in line, and ads conditioning based on a style of pre-processed image, for example, sketch pre-processor converts it to a sketch like appearance, the image that you are using, and then the model will add conditioning to your encoded prompt, and inject that as guidance for the generated image
To put it simply
sounds so intelligent but it cant make a hand with 5 fingers
it can
There are only two types of people who use stable diffusion
would this work with an RTX 4060ti?
Form Factor Desktop external GPU EnclosureExternal Connectors Two TB3/USB4 port Power(C14-type) Two USB Type A(10Gbps) One USB Type C (10Gbps) One NVMe Slot (10Gbps)Internal Connectors Three 8-pin (6+2 pin) power connectorsExpansion Slot One x16 mechanical (x4 electrical, TB3/USB4 spec) PCIe 3.0PCIe Card Supported One full-length, full-height, t...
You should search for a lora model trained on vector images, there are some on civit.ai
Just remember the output will always be an image, not a svg, but you can convert it
Thank you!
Hello, is there a way to swap races? using an image?
You have the local version right? You can use img2img and specify it on the prompt
No im using rest api from stability
This new model called as PixArt (not stable diffusion) is literally better than SDXL. have you tried and compared? in the beginning of video i compared with 58 prompts and 290 images https://youtu.be/ZiUXf_idIR4
Introduction to the new PixArt-α (PixArt Alpha) text to image model which is for real better than Stable Diffusion models even from SDXL. PixArt-α is close to the Midjourney level meanwhile being open source and supporting full fine tuning and DreamBooth training. In this tutorial I show how to install and use PixArt-α both locally and on a clou...
Hello! Im trying to set up stable diffusion and im following this youtube video https://www.youtube.com/watch?v=onmq &ab_channel=KevinStratvert
Problem is when I get to the "launch stable diffusion" part this is the message I get in cmd. I tried uninstalling and reinstalling python again but its giving the same error. Pleaase could someone help me with this? it would be very much appreciated
is it broken?
i have py 3.10 and 3.12, how do i make stable use, by default, the 3.10?
Can someone explain to me why I'm getting a cloud instead of a person?
Because you only have 1 step on the standard SDXL checkpoint. You need to turn those steps up on the scheduler.
...and/or use the Turbo model for that matter if you want to keep them down low.
i know that turbo works with just 1 but I guess base doesn't.
thanks for the tip
two ckpts?
Nice!
what settings are people using for SDXL Turbo? I was trying stuff earlier, but couldn't generate anything other than garbage. I downloaded a tuned model from someplace else and got good stuff immediately
Guys, please help me. I don't understand why I no longer have tiling on my A1111. I need it to create a texture and I realize that I don't have it anymore... Do you know why?
this model has unfortunates...
such a beautiful creation it gave me.. do you thin it was my prompt, or my random image->ipadapter script?
so, yes, do not use random image from google images from random wordlist search terms as you ipadapter image, it does not produce the best santa
i tested it for you, so you don't have to
Would love to make my dog into that watercolor art style. How difficult would that be with sd. Or what would I need to Google to learn how to do in it.
Will we ever have a bot with more options, or is there already too much to do with the bot ?
Create a 3D effect for the image of the little dog, depicting multiple action poses and a side view.
im new to all this, what can i do to make my images better?
this is just a test prompt thing
also, does anyone have good models for like, starry skies and ciites/towns?
AAA
SCARY
WHY?
iphone16
@pine seal Basically you need something like this, adjusted to the input video of yours
Pretty much load the video, apply the preprocessor and export it again
thanks! i'll try this out
My first SDXL Image. Yikes. This took 30 mins to generate :D
Just for comparison sake an image I generated with a 1.5 checkpoint in about 1.5 mins :D
Have you tried LCM or the turbo model?
Not yet. Maybe some day
You def should, they are way faster and need less heavy hardware, perfect for people like you 
I think you could train a lora or you can try using controlnet extension with ip-adapter model.
I´ll try it with the pictures that you posted and i´ll send it here if the results are decent🤗
Maybe because of the resolution, try making the image of faces "closer"
hi, can someone help me to find a way to generate stencil/svg style images from an image? so far the best i got is this but i would like to have it with only black and white, no gray, also a bit less detail would be preferable
the input image was the one on the right
Do people with those certs usually have all of them or certain groupings? Could be confirmation bias
No, because it included my unrelated certificates, from NREMT for example
Hell and even knew that I had spent 6 months in Ukraine during the beginning of the war as a medic
Which is definitely not grouped in with my FEMA certificates lol.
anyone know how to prompt SD to not create multiple models? I want to create just 1 model and SD creates 2-4 of them for the picture
Ive tried this in negative prompt to no avail: (((multiple people))), (((multiple women))),
I'm guessing it has something to do with the resolution because it doesn't happen in square images
Use "duplicate" in the negative prompt @tawdry escarp also try using highres fix
thank you so much, going to try that
Cooperation Email: yanet.prod@gmail.com
In this video, I'll craft and explore the Windows 2000 installation powered by Extended Kernel. Will it be possible to run such complex and modern software as Discord, Visual Studio Code, ChatGPT, and much more? Is it possible to buy a 🪙 bitcoin while the computer is running Windows 2000?
@pallid ruin highres fix worked amazingly well, thank you so much
@dusty minnow
What's the best way to transfer an anime art style with ComfyUI?
Like from an image I like the style of, but has no lora or author name.
Hires was made for that problem 🙂
Didn't know that, I'm super new to this 😝
anyone know why 2 of my pictures took ~30 seconds to generate and the other one took 4 minutes?
no changes done, just hit 3 batch count
maybe the first 30 steps are generating the image and the second ones are the highres, which takes more time because of the upscale?
does 30 steps first, then the steps for hires, if you have it set to 0 on hires, it will be equal to the normal steps
10-15 is fine for hires
the step count in cmd will show normal steps + hires steps x/x
Do you guys know any lora for this style? Help pls
Anderson Design Group is what I'm getting from a quick google compare, definitely no LoRA based on it. But I think this style could be common.
Ok, thanks
man showing sword
Thanks
thank you so much
bruh I swear if I get banned from bing AI because of this
look what prompt it flagged
and yeah I clicked report
since unless they consider badass a swear word
which they didn't on a previous prompt
they also allowed anime fight so it is not fight
it would have to be cyberpunk or lord farquad
What you're seeing is "resolution attention" issues. you might be using SD1.5 here. It was trained with 512 x 512 sized imags, so trying to diffuse from noise to somehting bigger than that, will have the prompt regenerate entirely in each 512x512 sized patch. The second person is the same prompt just filling into the extra space. "duplicates" in the negative won't fix that since it's not a token attention problem. I honestly don't know why people give such fluffy advice. "bad hands" in the negative won't solve hands either.
Hires fix in automatic1111 is what you want. You set your resolutin to something closer to 512x512 to prevent attention issues here, and then the hires fix multiplies the resolution and does the prompt a second time with the lower resolution base image as a starting point. Bonus round is an extension you can get that uses a method from Kohya, that allows even more consistency on the resize.
SDXL was trained with 1024x1024 resolutiion base so those attention problems are a lot less so
thank you so much
Seriously, can't thank you enough for the amount of useful info you're providing me
https://github.com/wcde/sd-webui-kohya-hiresfix this is one of my new favorite extensions. It brings things like depth guidance to the rescale, and the consistency is great
thank you, will be giving this a look
Gladly. I love to help people get a leg up on this field. So much misinformation out there makes it difficult, but hopefully we can cut through that in time.
literally still shaking my head at "duplicates". This stuff is so vulnerable to magical thinking. I often call it voodoo in how it works, so it's understanable that people are going to throw chicken bones at it and believe it has an effect.
There's a ton of misinformation, part of the reason why is because this is still a very new field. It's still not an exact science and people just try different stuff (to varying degrees of success)
I'm glad you've been helping people as it's kinda overwhelming the amount of information there is to start with
Luckily i started before controlnet and 1000 extensions were published, so i was able to learn as things came out. Getting into the field today would be a lot more difficult for me.
All these new models coming out lately have me just so exhausted with keeping up. Especially after situations where freeu and ait got hyped hard and were actually not that impressive. Tensor RT came out and i invested a ton of time trying to cut through all the technicals, only to find out compiiled models are limited to few resolutions. I have fatigue. so much fatiigue
helping new comers figure out the basics is refreshing
Yeah tell me about it, I still don't fully understand controlnet and have been kinda putting it off
I really need to step up my game though because the pictures I'm creating aren't what I want tbh
You get fatigued in something you're an expert in after a while, especially after doing it a ton
I get that 100%
Don't. It's basically essential knowledge for diffusion. Controlnet or any other "guidance models" are so awesome
I would try a vector art model.
thanksss
https://github.com/Mikubill/sd-webui-controlnet if you haven't found it already. you'll need various models to use iin it too
what's the best way to give SD something like a silhouette and have it generate colors? e.g. like this image?
Inpainting...?
IPadapter
seems to work good enough ☑️

any idea why resizing to higher res just produces the mask image?
for example (last is the original image)
do denoising strength and resize scale affect each other?
@proven wasp
the x ur feeding a positve prompt into negative clip?
then negative is not doing anything?
ya this i missed, 👍 sorry
lmk if that fixes it
sure, le me correct it
Same error 😦
ohhhh ur using a SD1.5 checkpoint with SDXL encoder
u need a SDXL model
help me correct it
i couldnt find encoder for sd 1.5
do u want to gen SDXL images or do you want to do SD1.5?
i think sdxl is doing problem with my installation so i prefere to u sd1.5 image generation in this workflow
its called clip text encode
so right click, add node, conditioning, clip text encode (prompt)
u also problably want to downlaod a better 1.5 modal than base
these 2 nodes r different
let me try this
yes, use the first one for SD1.5 the one on the right for sdxl
left 1.5, right sdxl
then how will i connect the nodes in sd1.5 there r jst 2 connections
ur whole workflow u have is set for SDXL, where did you find this lol, so basically we need to get you whole new workflow.
yeah i know its a mix plate.
sometime i think that there should be someknd a script to "explode" nodes.
that should get you working with 1.5
that's the most basic worflow from there u will be able to expand
hmmm. its a load default
oh it is nice,
i jumped from Auto11111 to comfyui so its giving me tough time
use the default, gen 1 image then u can expand from there, u can look at the last worflow for features that u wanted
like the concat text u can add that in
thanks mate. i will try it and will share with u
the last step, save image part if you hover on the bottom right a diagnal arrow will show u can click and drag it down and u will see your image there
Come on over to Spaghettidonalds.
😂 yeah i know this one.
First customer
His evil twin in the back
weather
blend it with a puppy
Hi all, does anyone know how I can make the generated images less blurry? Or is it just a negative due to me using AMD?
Image for reference
can you render it larger?
Im superr new to this so idk how
You don't know how to generate in different sizes?
I just set it up today, haven't found a video that explains anything other than installing it atm
Ohh I just realised I had the width and height option this whole time 
np! Im glad it worked 🤗
Lmao my prompts always get blocked
I tried img2img 😁
I do not own this song. The original was produced by Michael Bublé.
This version is an AI one made to sound similar Frank Sinatra, I apologise for any glitches in the AI vocals.
You can create your own AI recreations for free on fakeyou.com.
I used x-minus.pro to separate vocals and instrumentals.
The image is from architectural digest. (I c...
doesn't work well for me
However, maybe my textual inversion is messing with it. i just use verybadimagenegative and a few other words.
Hey, anyone know why the faces of my model are fucked up? using controlnet
I've tried running a ton of times and the model's face is always fucked
using Openpose btw
The same thing happens to me when I use controlnet, I think its because of the 512x512 resolution @tawdry escarp , you can try opening the image in photoshop, convert it into a smart filter, copy paste it into a new image and zoom into the face, then img2img it on sd withouth controlnet and then paste it back into the new photoshop image, select both images and click link layers, then copypaste them back into the original
That face is just a test, I used a random prompt
with quick lowquality upscale*
Huh? Sorry, I got lost with the explaination, tried reading it a couple of times and don't seem to understand what you mean after "smart filter"
Also, is there no way to fix this w/o photoshop?
I don´t know really, but with controlnet I always get a out of vram error (if I try with a higher resolution)
I mean this option
And then create a new image, and copypaste the original one, making a "zoom"
So you can use it on img2img
I recognize the explanation was pretty bad lmao
Ok ok, what I don't understand is why the zoom?
And what do I need to do in img2img?
inpaint the face?
ohh
and then I align the new image to the old one in photoshop?
Exactly
Yeah it is but I don´t know other way to solve it withouth running out of vram haha
@pallid ruin what am I supposed to do here? link layers but how do them align? the img2img2 background was completely different from the one on the original image
Copy the layer 1 image, and open it on a new image
Wait
Do you have the image with the "zoom"? @tawdry escarp

