#📝|prompting-help

1 messages · Page 28 of 1

tired vigil
#

thats comfy

faint bolt
tired vigil
faint bolt
sour beacon
faint bolt
#

which doesnt need anything special

sour beacon
#

Basic for which model? Pixart, Flux, Sd3, sdxl, sd1.5?

faint bolt
#

Flux now - i already did sdxl and sd1.5

sour beacon
# faint bolt Flux now - i already did sdxl and sd1.5

I would say go with the basic workflow

https://openart.ai/workflows/mentor_ai/flux-1-fp8-comfyui-basic-workflow/Vn1oDI2ofxWPjkEakk1J
If you got already another t5 model or flux model you could use these instead of download the ones in the docu

OpenArt

Created by: MentorAi: Minimum Hardware Requirements VRAM: 8-12GB+ (16-24gb recommended) Steps to Follow Download Necessary Models: Download t5xxl_fp8_e4m3fn.safetensors and clip_l.safetensors from here . Place these files in the ComfyUI/models/clip/ folder. If you have more vram and ram, you can download the FP16 version (t5xxl_fp16.safetensors)...

faint bolt
sour beacon
tired vigil
# faint bolt thanks

Ive explained here what you need to install and where https://tensor.art/articles/760080809324596639

FLUX DREAM DIFFUSION BY DICEModel can be found on Tensor Art https://tensor.art/models/759856135286068673/FLUX-DREAM-DIFFUSION-BY-DICE-V-1or all my models are also over on Shakker.aihttps://www.shakker.ai/userpage/8b0d2aadaa2a4f2592cbb367c329ea51/publishStart of with these settings in comfy to get a feel for how it runs ....Simple Prompt : a jet...

#

the workflow , the vae, the t5xxxl. and the clip-L

rich shell
#

can anyone guide me those thing

keen spoke
#

Okay so im new to this. You know when you put prompts into that box and it says (numb)/75 and continues to increase if you go above that. What is the purpose of that? Does it start ignoring prompts if i go above 75? And is there a way to bypass that?

#

Because it does seem like at least some prompts get ignored a lot of the time.

#

Lastly, are there often prompts that sd simply doesnt understand unless you download more?

surreal rose
surreal rose
surreal rose
keen spoke
surreal rose
#

people try all sorts of ways to get past that limit. none work. it's coded to ignore anything past 77 tokens

#

SD 3 or Flux have a much larger token limit

keen spoke
#

Ill have to look up what kinds work then.

But man that token limit sounds bad. Do either sd3 or flux cost money or increase it drastically? I have seen images that seemingly go past 77.

surreal rose
#

exctly 2 word prompt

keen spoke
#

ah ic ic. So far with what ive tested on certain anime styles at least, its more trynna get precise events to happen that gets finicky and negative prompting to retain quality and decent bodies.

surreal rose
keen spoke
surreal rose
keen spoke
#

Heh, i may end up being there a lot, thanks

tired vigil
#

you need forge to run flux

keen spoke
#

Uhuh. Ill just look up comparisons between the versions.

lone idol
#

hey guys so i'm making stuff of a char that just so happens to use an hairband and i'm like 99% sure all pics of it is using and hairband, putting hairband on negatives does not remove it... anything i can do?

#

maybe some way to write around it

silver valley
keen spoke
#

I have 8gb of vram. and 32 normal ram. But basically just try not to use two different words with the same meaning for the most part.

And btw, what camera prompts would you guys use to adjust the camera to fit in multiple people? Either facing them from the side or above? I tried "sideshot view" and it only sometimes works.

keen spoke
#

Yeah it often seems to vary either way for me.
Also been messing with the image to image part of the prompting system. But what is the difference between the main part and the sketch segment you can also put images into?

#

Im guessing sketch is more for just modeling and actions

fiery pecan
#

there is a bunch of type. so you can mess around and get good result from it.

fiery pecan
#

or you can use xyz plot so you compare the difference in a single view easily. though it is better for a more finer and subtle changes as it is harder to detect if you cycle it one by one

keen spoke
#

Yeah you gotta train it a bit. Also just gonna have a list of positive and negative prompts for various body part fixes. Hands and sometimes face features r a pain

fiery pecan
#

what do you mean by body part fixes?

fiery pecan
keen spoke
#

Nah not at all, entirely making images to see what i can do with it. Riiight now im just experimenting trynna do anime chars mostly women in specific artstyle loras. And getting images where different characters interact in whatever way and prolly some nsfw stuff if i get bored enough later. Just that sometimes a facial expression may be a bit distorted or hands or arms might be deformed. Badhand seemingly varies in its helpfulness.

fiery pecan
#

ahh i see. mess around with models. it always interesting to see how images change when changing model. hence why it take so much space for me lol

#

it used to be more. but i remove alot that i no longer use.

fiery pecan
keen spoke
#

Oof.
Funny enough i just downloaded T-ponynai3 but havent tested yet if thats what ya mean
Been testing stuff out with touhou mop style recently but the only model thats worked with that style is anime original so far. everything else so far messes with the colours or goes blurry.

fiery pecan
fiery pecan
keen spoke
fiery pecan
#

heck it so good. it got its own category for model that base on it.

fiery pecan
keen spoke
fiery pecan
keen spoke
#

Ah okay. Yeah ive just been told sd1.5 is generally best so im sticking with that

fiery pecan
#

also, for Pony model. in order to work normaly please read the description. it has it own way of prompting.

fiery pecan
#

SDXL has easier time with fingers.

#

on SD1.5 you may get good hands but you generally need to inpaint the hand for the good image you want.

#

especially Pony diffusion, it has way easier time to get good hands.

keen spoke
#

Read description and posts basically yeah. Yknow considering you mention nsfw earlier i am surprised theres no nsfw channel on this server for precisely that since its the greatest factor for people doing this

fiery pecan
#

and especially if you go to civitai and allow you to see all content lol

keen spoke
meager temple
#

Hey hello, sorry to bother you but I updated Stable and now it seems to me that it takes a long time to generate even simple images, furthermore the results dont resemble the model or the Lora I'm using, I fear some of my settings are wrong, could anyone help me? ;;

lone idol
#

I don't even update mine no more in fear of exactly that 💀 works well as is

meager temple
#

sighh

faint bolt
#

so i tried using this workflow to make video from this image and for some reasoni get this error - chat gpt wasnt able to help me - Error occurred when executing SVD_img2vid_Conditioning:

'NoneType' object has no attribute 'encode_image'

File "C:\programs mine\stable difusion everything\Confi UI\ComfyUI_windows_portable\ComfyUI\execution.py", line 317, in execute
output_data, output_ui, has_subgraph = get_output_data(obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb)
File "C:\programs mine\stable difusion everything\Confi UI\ComfyUI_windows_portable\ComfyUI\execution.py", line 192, in get_output_data
return_values = _map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb)
File "C:\programs mine\stable difusion everything\Confi UI\ComfyUI_windows_portable\ComfyUI\execution.py", line 169, in _map_node_over_list
process_inputs(input_dict, i)
File "C:\programs mine\stable difusion everything\Confi UI\ComfyUI_windows_portable\ComfyUI\execution.py", line 158, in process_inputs
results.append(getattr(obj, func)(**inputs))
File "C:\programs mine\stable difusion everything\Confi UI\ComfyUI_windows_portable\ComfyUI\comfy_extras\nodes_video_model.py", line 46, in encode
output = clip_vision.encode_image(init_image)

hushed berry
#

Hi all, please help 😦 . I would like to create a sticker of a Kwaii cat eating ramen. I use **Juggernaut_X_RunDiffusion **model with this settings: Sampling method: DPM++ 2M Karras, Sampling Steps: 40, width: 1024, height: 1024, CFG Scale: 3. Prompt: "manga style cute kawaii style cat eating ramen, T-SHIRT DESIGN, STICKER, gray background. vibrant, high-energy, detailed, iconic, Japanese comic style ", Negative prompt:"ugly, deformed, noisy, blurry, low contrast, realism, photorealistic, Western comic style". GPU: NVIDIA GeFORCE RTX 2070 Super. The output is that glitchy image with poor quality:

#

what is wrong? How can I have better quality stickers? Thank you

woeful epoch
#

Hey/ iv been using tsbale diffusion for a long time now. and it was working fine but now with or without loras its messing up faces. none of my settings have changed

woeful epoch
#

been asking for help and have been bounced around to 3 dif channels now. over the course of.. 4 days.

hushed berry
woeful epoch
#

liek 4 days ago. was working perfectly fine until then

rich shell
#

what should i need to make right image have bringer color like left image, what's i have wrong?
help me
thank you

silver valley
#

Use the kl-f8-anime2 vae

rich shell
silver valley
rich shell
#

thank you sir

woeful epoch
#

with and withour loras it still messes face up

#

like this

rich shell
silver valley
#

Remove them

rich shell
#

what should i remove

#

things seem ok now sir

rich shell
silver valley
rich shell
silver valley
#

Yea or 0.8 or 0.7

rich shell
#

also how shoild i fix background
i want it beliek have some white lily or Red spider lilies

#

but just bunch of unknown things ==!

woeful epoch
silver valley
# woeful epoch stilll need help with this..

Thats caused by using lora strength 1. sometimes they can be to strong.
Try set them to 0.5 or 0.7
Also don't use the pony xl 6 model. Better use an anime pony model.
And if you use Pony models you should use the score_ tags.

woeful epoch
silver valley
woeful epoch
silver valley
rich shell
#

i have some problem abou makeing good background, it usually give me unkown things btw
anyone can help?

finite isle
#

Hello everyone, I'm creating some stuff with Lora models (characters from certain animes) and I'm having a bad time trying to get some nice eyes. How can I fix it? I tried by inpainting, but it isn't that good. I'm using SDXL and SD 1.5 models

real echo
#

anyone knows of any way to specify a color on a1111 that you dont know its name? like using hex code or something like that, hope someone can help pls!

silver valley
rich shell
#

is front prompt is more important than later prompt

limpid bison
#

Hey, any advice for how I can convince MiniSD to stop cutting edges of stuff? For example the pommel here:

Detailed high quality monochrome sketch of a sharp claymore sword upright
...
Detailed high quality monochrome sketch of a dagger
...
Detailed high quality monochrome sketch of a dagger with ruby pommel
Or is the model unfixably bad?

#

Ah, some luck here:

Detailed high quality monochrome sketch of a dagger with ruby pommel contained within a square
seems to work about 50% of the time

#

Only for the dagger example, sword always gets clipped - either the blade or handle

wicked sequoia
limpid bison
#

It's very very fast - wanted to see what is available when you sacrifice quality for pure speed

wicked sequoia
#

Just use lcm, thats faster I believe and better quality.

limpid bison
wicked wren
#

Is there a way to prevent photos from being modified with img2img, similar to how Nightshade prevents photos from being used in training?

sour beacon
wicked wren
#

Thank you, I'll give it a try!

static hinge
#

Is there some most sofisticated way of using several Lora's on Comfy than chaining the Lora nodes? Also I guess the order matters

manic coral
#

or simply merge the lora in the model lol

static hinge
manic coral
#

it's the same a smerging diferent classic nosed of lora

manic coral
manic coral
#

and numerous other useful nodes

#

or look for lorastack on custom nodes inside comfyui manager

atomic wigeon
#

Since we won't have a channel specifically devoted to prompting I see no point in being in this Discord anymore. Good luck everyone!

manic coral
#

see ya

#

and good luck

near lynx
#

I need some help setting up my local Stable diffusion. I want pieces like the follow (created with imagineart ai generator) but my results are more like the other one. I am fairly new with ai art, but have IT background in case thats relevant for help. Goal is to use Stable diffusion to help me create Tabletop RPG campaigns

#

this is.. mine

sour beacon
near lynx
sour beacon
# near lynx Yes, sorry didnt make that clear. The prompts are different, and i used first im...

Ok so you could try to use image to prompt generators with the first image to check what could be a useful prompt or at least a part of it. Here the result of gpt:
Futuristic female cyborg with blue-green armor and glowing yellow accents, sharp facial features, fin-like headgear, and flowing energy-like hair. Detailed mechanical and organic textures, standing confidently with a dramatic cloudy sky in the background. Sci-fi fantasy theme, blending advanced technology and biology, cinematic lighting.

near lynx
#

I do know the prompt, its a simple "Character of a solarpunk alien world" (i created it with the imagineart ai). My point is the quality. For example urs looks absolutely stunning and what I am trying to achieve from the quality perspective, but I cant get to it.

I also just broke my SD so I cant send examples rn._.

sour beacon
sour beacon
near lynx
#

sorry, my questions are pretty... dumb i know

#

with the model dreamshape and ur prompt.

#

Its a bit better for sure (just fixed my SD)

#

Okay your prompt style definitly improved my results

#

I guess i shouldnt approach stable diffusion like other ai generators with simple prompts like mine Character of a solarpunk alien world but be more precise

sour beacon
#

Well you said you got an IT background, so i would start by installiing comfyui in a separate virtuel environment. After that i would start with gathering some models. I would suggest some SDXL models (Dreamshaper and Realvis) and actual a great model is flux.dev. Later you will create workflows were you use for example a pixart-sigma model but you keep some noise (or denoise) and run a SDXL on top of that. A good ressource for these different workflows is openart.ai

near lynx
#

I saw comfyui on quite a bit of places by now, does this only work with a VE? Would need to setup that up again

sour beacon
# near lynx I guess i shouldnt approach stable diffusion like other ai generators with simpl...

well most of the commercial tools use LLMs to generate a more complex prompts for the user inputs..
This for example would be the prompt generated from your solar alien world prompt with chatgpt:
"Design a solarpunk alien character in a lush, futuristic environment. The alien has glowing emerald skin, bioluminescent patterns, and plant-like hair with flowers. Their attire is made of sustainable organic materials, with intricate vines and solar-powered technology integrated. The background shows a vibrant solarpunk city with towering trees, solar panels, and wind turbines, blending nature and advanced technology. The alien's eyes are radiant, reflecting harmony with their green world. Warm, natural lighting, a utopian atmosphere, and soft pastel tones emphasize the peaceful, eco-friendly civilization."

sour beacon
near lynx
#

Aight I will try to use the information you gave me the following days to improve a bit my workflow and my SD. Will see if I set up a Virtual environemtn (VE), but ngl, I hate them.

sour beacon
#

Which OS are you using

near lynx
#

My IT Background is game developer, so do not overestimate my knowledge with tech :D i run simple windows

sour beacon
#

Btw. the prompt above from chatgpt with different models:
Dreamshaper SDXL, Dreamshaper6 SD 1.5, Playground, Pixart/SDXL and Flux

near lynx
#

Yep thats definitly what I am trying to achieve. Just glad that local SD is capable of that. So will need to improve my things with the help u provided, thank you!

sour beacon
timid kindle
#

How do I stop the AI from generating humans in my images?

manic coral
#

add stuff like pink skin and similar in negative

timid kindle
#

Similar such as?

sour beacon
manic coral
#

with such words, the risk is to block the gen of every humanoid form

timid kindle
#

Tried those, still around 20% of images will have humans

manic coral
#

the bias is strong

#

and the forms are not associated strongly with such words

timid kindle
#

Oh I should specify I'm generating images with no humanoid forms

manic coral
#

nah

#

the model is not trained much on such words usually

#

usually the types are beast/feral, male/female/man/girl, object, situation, color

#

models are more biased toward such groups

#

there is some basic trainig toward other descriptive words

#

but mpdel are more biased toward these

#

because booru and such galleries, use such groups

timid kindle
#

I'm a bit confused. Are the words you specified simply terms the AI is "more familiar" with?

manic coral
#

youp

#

the space in models is limited

#

so, the models are more trained toward certain groups

timid kindle
#

Uh huh

manic coral
#

pony is more biased toward feral, for example

timid kindle
#

So if for example I want to generate a picture of a landscape, and I want humans to stop generating, what would your negative prompt look like?

manic coral
#

usually starting the prompt with landscape helps a lot

#

inserting in the promp negatives like, man, girl should be enough

#

if not, add, boy, woman ecc

timid kindle
#

Yeah, I did that. By the way, the person that keeps generating is almost always a young woman with blue jeans clothing for some reason

manic coral
#

probable the models is strongly biased toward portraits

#

again, for example, pony is very weak in generating landscapes and bg in general

timid kindle
#

Makes sense. I am currently using Perchance because I'm on my mobile and frankly too lazy to install proper models lol

manic coral
#

lol

timid kindle
#

I don't even know what model they're using

manic coral
#

ai can be vey complex

#

for lanscapes is suggest soushiki

#

a very tasteful model

#

in case you want to tray local

#

you can link the local gui on a phone very easily

timid kindle
#

Wdym, would I be able to generate it on my mobile?

manic coral
#

starting the gui on a pc and linking the phone on the pc, yes

timid kindle
#

Hm maybe

#

The problem with local is I'm also too lazy to keep it up to date

manic coral
#

eheheh

#

stay away from comfy XD

#

if yo uare ok with a setup, simply don't update it

#

and always use the same set up and extensions

timid kindle
#

Oh another quick question, does the number of entries in a prompt or negative prompt make the prompts less effective or is the generator able to handle it no problem?

manic coral
#

the first tokens are more effective and the effect of the farius tokens, influence the successive tokens in cascade. the more distant tokens are less affected

#

example, red hair, white dress, the dress may get affected by the red color, genning sometimes red resses, or pink

timid kindle
manic coral
#

every token influence the result

near lynx
#

and its finally using my gpu, fast af!

silver valley
near lynx
#

radeon 6700xt

silver valley
#

Ah OK. The tutorial you linked is for DirectML.
But your card is also supported by Zluda.
Its 4 times faster and uses less vram than directml.
I have made a lot of AMD guides for every webui on my github.

near lynx
#

i have tried using zluda 3 times, it just didnt work. I am right now just happy that it works

silver valley
#

Oh okay, but did you tried my guides to make it work?

near lynx
#

nope didnt see them ngl

silver valley
#

Feel free to try them out sometime

near lynx
#

i will save that and maybe comeback someday. Right now directml is fast enough for my needs and I cant be bothered to reinstall again, but thank you!

silver valley
#

And upscaling isnt really possible (needs a lot of vram)

near lynx
#

I have quite some vram and upscaling is not needed (rly just need some picture for dnd) so for now it should work, thank you for the hints!

silver valley
#

Np have fun!

manic coral
#

for a 512x512 the quality is very high

near lynx
manic coral
#

cs1o you have experience in setting clips for a merge? suggestions about it?

#

i suppose i'll experiment

manic coral
#

the embedded python is so cheap. it can't install pakages >.<

sonic shore
#

I'm expriencing trouble to manage different characters, does anyone have a working way ?
In my prompt I use a LoRA that's supposed to add a male character with my actual char with the 1boy (description) way. I notice sometimes i'm forced to duplicate the oy description separately using BREAK 1oy (description). And doing that sometimes it works, sometimes it doesn't. It's actually really annoying

So... if you guys have a easy way to manage 2 / 3 Pov+non pov / more characters, i'm all ears

silver valley
sonic shore
#

Thank you

young lion
#

why is this workflow not working?

warped bridge
#

tjuo8i

twin yoke
uncut willow
#

lady gaga for a cat

tepid inletBOT
bold meadow
#

Tips for recreating this style in other images? Not sure if I should be prioritizing prompt engineering or if its more valuable to find a model better suited for this style. So far using base SDXL, I've had no luck.

fiery pecan
#

there is many ways to fix it. you can refer to this video. it kinda old so there could be a better way to fixing it. but hey. thats a start. https://www.youtube.com/watch?v=g4Oggft64dI&ab_channel=ChameleonAi

Noob Guide Part 1: https://youtu.be/Jya6qcokqZQ
Img2img Noob Guide: https://youtu.be/NJwg7HWiZQw
I admit, I'm not exactly the best hand fixer, but I didn't see a lot of noob oriented guides out there especially focusing on anime styles. In this tutorial, I cover hand fixing in Krita, sd-webui-depth-lib, inpainting, and ADetailer. I also go over ...

▶ Play video
modern jolt
#

“I’ve been trying for months to generate Spider-Man’s mask, but I keep getting the neck and shoulders along with it. I’ve tried specifying just the face, no body, everything possible, but nothing has worked. AI assistance hasn’t worked either. I just want a Spider-Man face to cut out like a puzzle.”

twin yoke
modern jolt
#

Yes, I would love to do that if it didn't always blur. Unfortunately, I can never get a clear and detailed head in comic style.

twin yoke
#

That depends on your model, your prompt, your settings, etc.

modern jolt
#

That's the best I could achieve so far. I don't want an original Spider-Man, but rather a homage. But the tip with GIMP is great.

twin yoke
#

Well, you have several things I'd recommend in that workflow.

  1. It's an SDXL model and your settings are for 512x512. SDXL works best on 1 megapixel (1024x1024)
  2. If you want only a head/face you would have to put some stuff in front of your prompt that says that.
  3. I also wouldn't recommend the heun/normal settings in your KSampler as that's not really a great pairing for quality on most SDXL models. (Not that it won't work, just that it won't be your best option.)
  4. If you want just line art, you should say that as well.
modern jolt
#

ty 4 your help!

#

Super, but I'll never be able to do the prompt collision thing coolcow

twin yoke
modern jolt
#

Sure, here is the translation:

"I am completely new here, where can I find the embedded workflow?"

#

to realistic not enough line art

twin yoke
#

Line art is not realistic. If you want realistic you wouldn't use the words "line art".

The workflow being embedded means you download the full image, then drag it into ComfyUI and it will open up with the workflow.

modern jolt
#

okay

twin yoke
#

This looks interesting:

modern jolt
#

Looks strange!

twin yoke
#

It's a comfyui workflow, not Automatic1111

modern jolt
#

okay i am searching comfyui

modern jolt
twin yoke
#

The workflow provided is very simple and straightforward.

modern lily
#

Hi, I'm trying to generate a specfic character from a video game but it's not generating her correctly. Is there any way I can correctly generate her?

silver valley
cedar terrace
#

pls subscribe

river fern
#

It's so hard to get a full length tube top jesus. ever with like navel/stomach in negative and tucked in in positive

dapper egret
#

Slightly afk. But if someone can ping me with the response to a question. Information says you can use stable diffusion to create a 2d character concept art. To later turn into a 3d model. Can anyone tell me which checkpoint, of the many many choices, i should use?

errant blaze
#

I'm trying to generate an image of a bottle on a table, but I need the bottle to be open. The AI keeps generating a closed bottle. Any tips on how to prompt it correctly to show the bottle opened?

elfin star
#

I'd probably just inpaint it instead thinking

errant blaze
#

there has to be a way to prompt it lol

analog gorge
#

What can you tell me about “search-and-recolor” in the “stability ai api”! It can be seen that the colors specified in the prompt are not uniform in the output for each reference image. Based on the above, is it possible that the color of the original image is affecting the output result, is this correct in your perception? In the future, we would like to make detailed color settings using color codes, but is there a setting that allows us to output the specified colors as they are?

gilded turtle
#

Hey anyone able to help me?

#

postive promt : 1girl, solo, edgAdepta, wearing edgAdepta, power armor, shoulder armor, skull emblem, red armor, ((angel wings)), blonde hair, warhammer 40k, long hair, sanguinius, rule of thirds, symmetrical, symmetry, simple background, lora:dcau-330:.6, lora:edgAdeptaSororitasPonyv1:1,

#

negative promot: score_4, score_5, score_3, score_2, score_1, score_6, ugly, disfigured, poorly drawn face,

twin yoke
# gilded turtle

Change your resolution to something that is closer to 1 megapixel (1 million pixels total). Your image is generated at a resolution that is not only strange (512x712), but is also only about 1/3rd of a megapixel.

gilded turtle
#

or what do you recomend i set res as?

gilded turtle
twin yoke
# gilded turtle how much is a mega pxxel?

As I said, it's 1 million pixels. Just do the math on the resolution:
Your 1st image is 512 x 712 = 364544
Your 2nd one is 712 x 1064 = 757568

But, try something like 768 x 1344, which is 1032192 pixels; a lot closer to 1 million.

ocean timber
#

when doing prompts, I've noticed when I start to use too many words in them it can start to get more blurry. Should prompting be as concise as possible or as detailed as possible?

#

Because I have seen people say the latter.

primal umbra
#

I’m trying to get a picture with two people in it

My basic description is like this

beautiful young woman with blonde hair and velvet dress sitting in a chair with young peasant boy kneeling next to her with his head on her lap sobbing in crusaders castle Rembrandt style

What I’m getting looks very little like the prompt…. What m I doing wrong ? Suggestions?

#

This is the closest I’ve gotten ….

#

Any suggestions would be greatly appreciated

wicked sequoia
old cobalt
#

Does anyone have any advice with stable diffusion prompts to get a subject to hold something for example "wielding a samurai sword"?
I'm getting struggling to get SD1.5 to get the subject to hold objects properly.

Prompts I'm using at the moment:

masterpiece, best quality, aesthetic, ultra detailed illust, cinematic, recent, Japanese fantasy landscape, sunset, snowfall, extremely detailed background, (((Full-length landscape shot))),(((Full shot))),
1girl, beautiful, facing viewer, geisha, revealing clothing, royal blue floral kimono, extremely detailed eyes, samurai sword in hand, real hair, extremely detailed hair, high heels, black thigh highs, intense, expression,
Negative prompt: low quality, worst quality, normal quality, bad anatomy, stretched body, long neck, nostrils, slanted eyes, upturned eyes, bad hands, deformed, bad hands, cat ears, dragon

I would post an example, but think it'd probably be flagged as NSFW.

#

thanks in advance

wicked sequoia
old cobalt
wicked sequoia
# old cobalt thanks for the reply, im fairly new to this so have managed to cobble an automat...

Sd3 will perform worse, it's incredibly bad at humans so probably not a good idea.
How much vram do you have? You can try some sort of flux anime model(I tried https://huggingface.co/Raelina/Flux-Pastel-Anime which does an great job in this space: https://huggingface.co/spaces/John6666/flux-lora-the-explorer). Even though flux models are the best models yet, flux is very very slow compared to sd1.5 unless you have a powerful gpu.

I would probably recommend something like animagineXL(a finetune of sdxl) instead since although its worse then flux, its much faster. I can get a good img after 2-3 tries. It's much better then sd1.5 though.

old cobalt
silver valley
silver valley
old cobalt
silver valley
old cobalt
silver valley
#

Ah nice so you have set it up with zluda 🙂

old cobalt
#

I'm getting this message at the moment when I fire it up:
Using ZLUDA in C:\Users\xxxxxx\sd-test\stable-diffusion-webui-amdgpu.zluda
Skipping onnxruntime installation.

| You are not up to date with the most recent release. |

Consider running git pull to update.
no module 'xformers'. Processing without...
no module 'xformers'. Processing without...
No module 'xformers'. Proceeding without it.
#

and I'm not precisely sure what it wants me to update

silver valley
#

It's an update for the webui

old cobalt
#

brilliant! thankyou so much

#

your guide was very well written 🙂

silver valley
#

Thanks!

#

Didnt know it was linked on reddit ^^ thx for letting me know

#

If you have any questions with stable diffusion and AMD, feel free to ask me anytime

old cobalt
#

I think it was linked on there in a SD post for AMD as far as I recall

silver valley
#

Ah cool

#

Its the most complete guide for AMD out there

#

All tested and working ^^

old cobalt
#

I'm very grateful for it, thanks again, ive been wanting to get into AI image generation since Midjourney came out (I had a subscription when it got released) but it was restrictive, and the AMD official one is also like that too

silver valley
#

Np, yea the AMD ones are very limited. Model conversion is inconvenient and there are also a lot of false AMD setup guides on YT out there. So I had to made my own.

silver valley
#

The "holding" problem you mentioned before is indeed an 1.5 model issue.
Better use SDXL or Pony models. They work much better

#

And they are faster than Flux

old cobalt
#

I've managed to get one decent scythe, but no luck with a samurai sword at all

#

or magic staffs

#

they're always about 80-90% there, but flawed

#

(in the case of scythes and magic staffs)

#

zero success with samurai swords

silver valley
#

Yea its not easy also Inpainting hands is not worth it most of the time

old cobalt
#

I'd love to post to get feedback but I suspect they're NSFW (not nude, just lots of leg and bewb lol)

silver valley
#

Ah yea then maybe not here. But you can send it per DM if you want

old cobalt
#

that would be super 🙂

silver valley
#

But Its late here so I'm off for now, cya

old cobalt
#

ok no probs

primal umbra
primal umbra
wicked sequoia
#

You can try sdxl, I tested base sdxl and it provides images closer but not quite correct. Sdxl can handle both tags and natural language tho and you can probably get a decent img after a few tries.

primal umbra
#

Thanks

#

I’ll try it out

quiet raft
#

What Sampling Method and Upscalers do you think are the best for Cartoon / Manga / Anime different times of Models for Stable Diffusion Auto 1111?

#

What is that you wanna do?

opaque flicker
#

Try lowering the denoising strength to about 0.2.

prisma blade
#

I'd like to take a photograph and use a model trained heavily on anime/manga to convert that photograph to an manga-style illustration. I assume I can use img2img to do something like this, but I'm not sure what options to select in the SD Web UI. Is this something I would use the refiner for?

opaque flicker
prisma blade
prisma blade
opaque flicker
#

Yeah I've used animerge before. Its pretty good. It wants clip skip 2, make sure you know how to change that setting. Make Anime your first prompt word, followed by a description and what you want/don't want. Resizing it by 2 for SD 1.5 will help. I get better results using Euler a as sampling method with karras style. I would say without an extra anime lora on top, just try denoising strength between 0.4 and 0.5, that is where the important setting for img2img things seems to be if you ask me.

prisma blade
#

The main hurdle I had, which was silly in retrospect, is that I didn't realize the checkpoint selected in the drop-down at the top affected the img2img generation. 🤦

opaque flicker
#

aahh

#

Nice and easy one for you then!

quiet raft
#

Why is it that.. When I done 1 .. I get this

#

When I do in batch like 5 .. I get this

#

Skip the prompts but this is in Img2Img

#

I don't wanna lower the Denoising strength or it will turn out to like this

#

or this

versed garnet
#

ive just noticed on Civitai they dont add that to their prompts

silver valley
river fern
#

Any advice on essential "quality prompts" when upscaling? Or is it irrelevant since each chekpoints different?

versed garnet
timid kindle
#

Is {option1|option2} possible in normal stable diffusion?

#

You can do that in perchance

wicked sequoia
timid kindle
#

Then half of the times it will pick desert and the other half forest

#

Also you could do {desert^0.2|forest}

wicked sequoia
timid kindle
silver valley
timid kindle
#

Hmm

unkempt oak
#

ok so far I'm working on making anime art like this

#

But instead I got this

#

@surreal rose

#

ok so this is the image that it generated for me so far.

surreal rose
#

or try a 16:9 landscape aspect

unkempt oak
#

Where is that?

surreal rose
unkempt oak
#

and this is what I got

surreal rose
# unkempt oak ok changing that right now.

and rethink your prompt - you've asked for the subject to be wearing coats - but then also described them as wearing armor - but the coats should be covering that and hiding the amor. only describe what can actually be seen

surreal rose
#

what AI are you using?

unkempt oak
#

stable diffusion

surreal rose
unkempt oak
#

Oh shit that's right there's other models, I dunno I think SD1.4? It just says stable diffusion or wait. webui.user.

#

That's the one I'm using right now.

surreal rose
unkempt oak
surreal rose
#

1.5 is also very good, but you really need to use loras with it unless you want to go down the prompt salad and hoops route

unkempt oak
#

I dunno what any of that means

surreal rose
#

unless you're using that on a website?

unkempt oak
surreal rose
unkempt oak
surreal rose
# unkempt oak nope just got it all downloaded on my computer. Cause I tried huggingface and fu...

i feel for you. this is not easy technology - however ... there is a solution you might consider https://pinokio.computer/ cocktail peanut has created a very nice install and launch program. you just install his program, pick the application you want to run, and it installs - and then you use his launcher to run it. rather than having to do all the install and set up work yourself. then the only issue might be whether you have the hardware necessary to run the program you're trying to lauch or not. SD1.4 will run in a lot less VRAM than SDXL will

AI Browser

unkempt oak
surreal rose
# unkempt oak Does it do the same thing stable diffusion does?

it's a launcher. stable diffusion is the AI you want to talk to - but it has to have an interface of some sort to run in. it can run in the interface you keep showing screen shots of, and a number of other interfaces. the launcher - pinokio - can install whatever you want to run, and then run it, and stable diffusion will run inside it. go to the website i gave you the link to and read up on pinokio and what it does

unkempt oak
unkempt oak
unkempt oak
# surreal rose did you read the page?

ok so far gave it a read and....well it's like the people who made it thinks everyone knows how to code the shit out of a picture. It's like they're speaking arabic, chinese, japanese, german, nordic, slavic, and fuckin taiwanese in the same fucking sentence.

surreal rose
surreal rose
# unkempt oak oh, so it's on autopilot

no. you know how if you go to a website and you see a bunch of games you might want to play, and you click on one of the games and it shows up on the window an you can play it? The website is the launcher. it gives you a menu of games, and once you decide which you want to play, it runs it for you. you don't have to worry about anything other than just playing it. Well this is just like that, but instead of running inside your web browser and being on a website, it has a window that opens when you run it to show you all the applications and programs it can install for you, or run for you. all you have to do is just decide what you want to install. and once you've installed that, you just run pinokio and tell it to run the program you've had it install.

#

then that program will run and you can use it exactly the same as if you'd spent hours setting it up first yourself, instead of letting pinokio do that for you

quiet raft
#

Hey, if I do Img2Img and use the upscaler.. Does it matter what model that is loaded in? Yes right?

pulsar moat
#

any rec on how to do like a spectator pov where the characters aren't looking or facing the viewer, i've tried just typing in looking_at_viewer in the Negative but it doesn't really give the same effect if you get what i mean. Especially for images where multiple people are involved.

versed garnet
#

any advice on realistic selfies? so far the best I can find is boring snapchat photo of a 18 year old Australian girl Sienna Calder, brown eyes, natural red hair, long hair, selfie, The photo was shot on a phone and posted in 2015 on Snapchat,low light, dark, jpeg artifacts, bedroom, night, lora:lora:2 (Noisify), lora:amateurphoto:0.6

#

Using flux btw

mellow wave
#

Hi, I work with A1111 but i can never make the Hands/fingers Right. No matter what i write in the positive or negative prompt. No matter which model/Plugin i use. I tested like 20 of them. From anime to real life and it's Always the same. Even with after Detail it doesn't get better.
I know that generating Hands is Always a Problem. But i have to render like 50 Pictures to get 1 more or less good one.
Here for en example the lasted i tired, but don't Focus to much on the promt since i changed and tested like 50 different once Yesterday.
thanks

silver valley
fast depot
#

what would this art style be called? it seems like it a bunch of styles together such as washed out water colour and anime

lone idol
#

Hey guys, I've been trying to make the hairs similar forever now, the one issue is that one of the hairs has a fade and I want it to not have a fade like the other hair

I'll share the prompt in DMS if ya want so I don't leave a lotta clutter here

marsh vine
#

How do I make a regular red fox wearing a backpack and not an anthropomorphic one

#

seems too niche

spark sentinel
#

Someone know why I got this issue?

#

Lora and checkpoint have the same base model"pony"

quiet raft
#

What model should I use .. If I have a GTX 3060 with 12 GB ram and 64 GB ram?

#

Should I focus only on SD 1.5, SD 1.5 Hyper etc?

#

I feel like XL are a bad.

wicked sequoia
quiet raft
#

For the time worth waiting xD

wicked sequoia
silver valley
#

Then it won't take that long

viral wolf
#

What is the best way of going about making an image with two different characters from two different Loras?

#

Kind of like in this example

silver valley
viral wolf
#

Thank you!

viral wolf
#

This one?

silver valley
dense maple
#

anyone have experience with getting tattoos to be consistent and in the correct spot with SDXL models?

I'm trying to make a set of pictures based on my DnD character I had a character art commissioned of, but she has a red rose tattoo wrapped on her right, (or maybe it's left but I'm trying to prompt it on the right as that's where I remember it being), calf.

however it is consistently putting either actual roses somewhere on her body, or it's putting a similar to the right kind of tattoo, on her upper or mid body.

I'm currently genning pics that don't show the calves, if I changed that for the prompt to show the calves in the pics, would that maybe help? I'm trying to follow this guide https://github.com/BelieveDiffusion/tutorials/blob/main/consistent_character_embedding/README.md to make my textual inversion embedding since it's linked in one of the sillytavern discord channels pins.

but it says to start with an extreme closeup front shot, which is why I'm not having the calves in my pics currently, as well as to generate pics that are, extreme closeup, medium closeup, closeup, medium shot, and full body. and then the angles of front shot, rear angle, side angle, shot from above, and low angle shot.

so I don't know if I should be putting the tattoo on later, or if there's even a good way to do it at all. I'm still pretty new to image genning.

GitHub

A collection of tutorials about training and generating with Stable Diffusion. - BelieveDiffusion/tutorials

quiet raft
#

Anyone in here who uses Ultimate SD Upscale and can recommend settings for cartoon style pictures?

mellow wave
#

I have a 3060 (12GB vram) is this enough for forge flux? for nf4 or dev? I don't wanna use the schnell version, cause it seems kinda dogshit for real life pictures

wicked sequoia
mellow wave
wicked sequoia
dense maple
#

well, since my question seems to have been lost on the posts, guess I'll have to ask again.

anyone have experience with getting tattoos to be consistent and in the correct spot with SDXL models?

I'm trying to make a set of pictures based on my DnD character I had a character art commissioned of, but she has a red rose tattoo wrapped on her right, (or maybe it's left but I'm trying to prompt it on the right as that's where I remember it being), calf.

however it is consistently putting either actual roses somewhere on her body, or it's putting a similar to the right kind of tattoo, on her upper or mid body.

I'm currently genning pics that don't show the calves, if I changed that for the prompt to show the calves in the pics, would that maybe help? I'm trying to follow this guide https://github.com/BelieveDiffusion/tutorials/blob/main/consistent_character_embedding/README.md to make my textual inversion embedding since it's linked in one of the sillytavern discord channels pins.

but it says to start with an extreme closeup front shot, which is why I'm not having the calves in my pics currently, as well as to generate pics that are, extreme closeup, medium closeup, closeup, medium shot, and full body. and then the angles of front shot, rear angle, side angle, shot from above, and low angle shot.

so I don't know if I should be putting the tattoo on later, or if there's even a good way to do it at all. I'm still pretty new to image genning.

GitHub

A collection of tutorials about training and generating with Stable Diffusion. - BelieveDiffusion/tutorials

crude kettle
#

Does anyone know any model which can create anime screenshots / anime screencap style images

neat goblet
#

Guys, is there any way or prompt to improve eyes?

frigid roost
cinder sky
#

Is lora clash unavoidable with current syst we have for a1111?

#

Even with regional prompting the image becomes useless with 2 character loras or more most of the time

orchid scaffold
#

does anyone have tips for getting rid of facial hair

dense briar
#

I'm new to prompt writing and image generation, and I need help crafting a prompt for the idea : coffee (and other foods) bursting out of a launching rocket in soviet propaganda style . I've tried several prompts nothing looks like a coffee bursting out of an launching rocket. I use stable diffusion 3, but I also can use sdxl.

#

I wanted to do the same idea but with potatoes and I got the result I wanted. This was the prompt: " Soviet propaganda poster, a towering rocket launching as potatoes burst out of it instead of exhaust". Other foods and coffee don't seem to work as well.

#

Here is the image:

languid radish
#

can generation rooms be used only via subscription?

twin yoke
#

yes

brittle pine
static hinge
#

or you could always use inpainting if you have a usable base

#

not great results (in terms of the pile) but another trick is you can just make a rough design in Photoshop, pasting the desired food in the desired position, and then inpaint until it blend

median bronze
#

Hey, I am currently trying to create stylized images of classic cars, with SDXL 1.0 models. I am having issues with jumbled interiors, like the seats or just random non-sensical things inside. See attached example.

Can anyone help with how to improve prompting generation or fixing the specific area after generation. I have tried using img2img to redo the area, but haven't found a way that yields decent results. Any help is greatly appreciated, as well as any recommendations for other models / programs that do what I'm looking for better. I can show the actual generation info if needed, thanks.

static hinge
#

just like I showed here

#

I have tried using img2img to redo the area, but haven't found a way that yields decent results.
well that what inpaint is. It re do the area like it was img2img while keeping and being aware of the other area. It takes some knowledge and experience to set it, and it is kind of frustrating many times, it make take several tries after tries, but works in the end.

cedar forge
#

Guys may I ask which API you guys are using?

#

From this prompt: a study of cell
shaded cartoon of
the interior of a
bioshock style art
deco city,
illustration, post
grunge, concept art
by josan gonzales
and wlop, by james
jean, victo ngai,
david rubin, mike
mignola, laurie
greasley, highly
detailed, sharp
focus, trending on
artstation, hq,
deviantart, art by
artgem

#

A paper demonstrates that graphs like this can be generated with Stable Diffusion (version less or equal to 2.0)

#

But I used stability AI API and only got graphs like this:

plucky fjord
#

when using multiple lora what strengths to use btw?

rocky nebula
#

does anyone know which art style model this is

nimble mountain
#

Does anyone know what model I have to use to get this style of art please? I made this one on bingai, and I really like the style. Learning how to use stable diffusion but really want this kind of style. If anyone can help I'd really appreciate it, thank you

#

Also keywords to make the hair similar, so far it's been no shaved side 😂

vapid lichen
#

I uh... wow... I can't get this to follow my prompts at all...

vapid lichen
#

Oh god... Ive created an abomination... it used the shape of the img2img and filled it with a human centipede lol

solid kestrel
#

Hey i'm hoping to get a few basic tips or tricks on how to get more of a style down. I'm using lora:bg3_actions_offset:1 and dreamshaper_7 to make baldurs gate 3 icons and these few have been okay. Trouble is every generation after this has been really miserable. (trying to generate things like hands, swords, blood etc). Is there certain keywords I should or shouldn't use? Any negative prompts to add?

Current is; lora:bg3_actions_offset:1, bg3 Action Icon, (hand), dripping with blood, green background, and I'm getting results like the next image's post

sour beacon
#

With the start: "felttip pen image"

wicked sequoia
#

What style and what’s your prompt, also what exact model? Sd1.5 doesn’t have good prompt following really, good for simple images tho. It should know many styles tho.

marsh swallow
#

I have a request and was wondering if there's a model that can help me with this. I need to edit some images by replacing different people's heads into specific positions. Does anyone have any recommendations for a model or tool that would work for this task? Thanks in advance!

nimble mountain
#

Hi again! I'm hoping someone can be kind enough to help me once more. I'm trying to make a warforged for my friends DND character. I used the war forged lora, but the face is very human in it and he has said that the character has more of a smooth helmet face. He sent me this image as reference. I'm trying really hard to find the right prompt for this, but everything I make ends up with a face 🙈

frozen dock
#

Hi!
Any tips to avoid white halo ? I've already tried lowering CFG scale & wrote no white halo (pos. prompt), halo (neg. prompt), sharp edges, ... Something else?

wicked sequoia
sour beacon
# nimble mountain Hi again! I'm hoping someone can be kind enough to help me once more. I'm trying...

"A majestic and powerful warforged steampunk machine wearing golden armor stands in a grand, ancient hall. The wareforged machine is cloaked in a hooded cape, their face obscured by the hood, emanating an aura of mystery. The armor is intricately designed with glowing golden accents, The setting features towering stone pillars, with beams of golden light streaming through, giving the scene a divine and mystical atmosphere. The color palette is dominated by gold and warm hues, enhancing the sense of power and divinity in the scene."

nimble mountain
#

Thank you!!

tidal flare
delicate arch
#

Been awhile since I've been here, but how does training work?

#

I always find myself struggling to get the results I want, but sometimes I'll get just the right one

#

But changing the pose and details results in something else entirely

delicate arch
lone idol
delicate arch
lone idol
#

Well try it out :))))

grizzled river
#

someone created portraits for random characters in Rimworld that I would like to recreate but I haven't been able to come up with something that creates a style like this or keeps the individual characterization of different characters

#

any suggestions?

rocky nebula
# tidal flare hentai

honestly the art work is kind of that yes that's why I cropped it out. I saw the art style and just wanted that look on my generations. I tried meinamix and counterfeit but it doesnt give that soft digital art look

chilly crane
#

Can Someone Help with improving prompt for my interior design ? as furnitures are generating small and I want to generate different interior styles automatic

Prompt: Choose random theme and make interior design for this bedroom with large furniture
Negative prompt: A distorted, low-quality room with altered walls, ceiling, and unrealistic doors and windows. Blurry details, bad lighting, unfinished areas, and mismatched elements create a chaotic, sketchy feel. Distorted furniture, oversized decorations, and fake shadows add to the bad perspective and clutter, giving it a surreal, cartoonish look.

vapid lichen
#

So, I have tried to use stable diffusion, but it seems to not really follow my prompts, and I have trouble getting the style I am looking for. Anyone able to give me some help?

twin yoke
vapid lichen
#

Think you can help?

twin yoke
vapid lichen
#

this was one of the prompts I tried

#

((best quality)), ((masterpiece)), (detailed), lora:Cps3-SF3mix_Fp:0.8 lora:Guilty-Gear-Sprites_Fp:0.8 lora:MvC2:0.8 1 Young man; Spikey blonde hair; Big black eyebrows; Anime; White t-shirt with a red lightning bolt on the chest and a black hood at the neck; Ripped blue jeans; White and red high-top sneakers; Chain hooked to his belt; fingerless black leather gloves; standing; fighting stance;

#

I even tried some img2img and couldnt get that to work

lone idol
#

having some real trouble making a character not duplicate, already got hires fix on and the res itself isn't that high sooo, kinda sucsk

delicate arch
#

@lone idol i forgot to ask, what model are you using?

lone idol
vapid lichen
rocky nebula
twin yoke
# vapid lichen did you see all my responses?

I'm just getting back to it now. You posted 3 LoRAs, but you didn't post what base model you're using. 1 of the 3 LoRAs is for SD 1.5 and the other 2 are Pony, which is based on SDXL, but is different enough that you need a Pony base model to apply it to.

On top of that, if you are using a Pony LoRA & model, you should be prefacing the prompt with the standard Pony flags, which you didn't mention in your prompt verbiage.

Also, you have 3 LoRA listed in your example prompt. Doing that when trying to extract a style will mean that you might run into some odd conflicts.

Start with 1 LoRA that is appropriate for the model you're applying it to. If using Pony, add the Pony prompt stuff that you're supposed to in order to extract what you want out of the image. Then, start simple by giving it an easy prompt to follow and then build on that if it's successful.

Plus, because of all the other parameters, you should be trying to follow the LoRA & model's recommended settings for steps, sampler, scheduler, etc. Note that on the CPS3 LoRA page you linked, they tell you that you will need to use High Res Fix to get the pixel look, but using an upscaler may break that style.

Moral of the story is to keep it simple, make sure the right things are matching, and then work slowly into what you want; don't try to do everything at once unless you know what you're doing.

vapid lichen
twin yoke
vapid lichen
#

hmmm

twin yoke
#

In any case, this has little to do with your prompts aside from the fact that if you use Automatic1111 you're calling the LoRAs in your prompt, which is fine. But I would go back to the SD 1.5 one, use ONLY that one LoRA, and try something basic.

vapid lichen
#

Im using

#

webui forge

#

but the problem is, being basic doesn't get me the generations I am wanting.

twin yoke
#

You start basic and then re-prompt to add more in once you know you have something beginning to work.

#

The more specific of a look you want, the more complicated and less likely it's going to be that you'll get it. As great and magical as AI image inference is, it's at its best when you have lower expectations.

vapid lichen
#

eh... Midjourney doesn't have that problem 😉

#

hence why I was asking for help

#

the difference is, I can't train midjourney on a set of imagery

twin yoke
vapid lichen
#

but that doesn't change the fact that I have much better results

twin yoke
#

That's an apples to oranges comparison.

#

Yeah, and if you wanted to pull a heavy trailer, you'd have better results with an F250 than you would a Mustang. But the Mustang would certainly go a lot faster than the F250 would on a race track.

#

Two different things.

vapid lichen
#

But If we are going to use this comparison, I am asking what type of truck to use, the mustang is over there being awesome on its own, I just need help getting the damn trailer pulled

twin yoke
#

You're the one that brought MJ into the conversation, not me. That's the Mustang in this case.

vapid lichen
#

Yep, and it's awesome over there on its own

#

I brought it up as an example of "MJ doesn't have the same trouble following my prompts" which is what I was asking help getting viable results in SD

twin yoke
#

Right, so I've given you information on how to work toward that. It's not going to be as automatic as get in car and drive. You've gotta hook the trailer up to the hitch, pull the feet up, connect the electronics, test the blinkers, make sure towing mode is on, then drive carefully. So...one step at a time.

#

Otherwise, you're stuck with the Mustang.

vapid lichen
#

I appreciate your attempt to help.

lone idol
#

how do i make it so there's absolutely no living beings in the picture? tryna make a scenery but there's always a person

river fern
#

Im looking for some advice on when using ADetailer+Hiresfix.
Regarding since they both have denoising sliders, what should I set the value to so its not changing the face twice when I just want to enhance the original face. TLDR Managing Hiresfix and ADetailer's denoising values to work well together

#

I usually do after-the-fact with the button to make sure i like the base image first

indigo furnace
#

Anyone know of a basic AI workflow chart? I've been toying with this since the start but it's still confusing about which file types go where and why we need VAEs and is this a model or a checkpoint? I never wanted to learn python and I'm spending much more time trying to figure out what the error 'g' means than I am making art. Also, can the various platforms (Comfy, fooocus, etc) use shortcuts to models? I just discovered I have 4 flux dev copies LOL. Hoping to have 1 folder and just shortcut everything to it.

surreal rose
#

depending on what interface you're using, you have to have certain files in certain folders - because it's going to look for them in those folders and no where else. and if you just drop them all in one spot, it won't find what it needs.

twin yoke
# indigo furnace Anyone know of a basic AI workflow chart? I've been toying with this since the ...

To add/clarify some stuff in addition to what cw said above:

  1. Checkpoints are models. There are, however, several different kinds of each.
  2. The reason you sometimes need a VAE is because not every model bakes them into their safetensors file. The VAE is essentially what does the final translation (encoding/decoding) into or out of the image format that you are looking for. (The other stuff is working in what's called latent space.
  3. Most of the time when you have an error, you either have something connected that should be, are missing a file you need, are misusing a file for the wrong thing, or have sizing incorrect somewhere.
  4. Yes. You need to look at the extra_model_paths.yaml file that exists for most of these applications. That is where you can define locations for the model files to be found.
indigo furnace
# twin yoke To add/clarify some stuff in addition to what cw said above: 1. Checkpoints *are...

While I do appreciate the answers, the answers themselves point out why I am looking for a worksheet. All of those notes do things and yes, there are checkpoints with their different kinds of checkpoints and they do different things and mean different. Things is exactly what I’m looking for a solution too. I don’t know what all those things are. I don’t get to play with AI 40 hours a week and I’m not a programmer, I am an artist.

twin yoke
surreal rose
quiet raft
#

Do anyone know why I these blue stuff appear in a photo?

silver valley
quiet raft
silver valley
quiet raft
thick hornet
#

I'm looking for help on prompting the stable-audio bot. Is there only one UI? What is the syntax for prompting the bot?

polar forge
#

Hello. Anyone knows how to mixe 2 lora chaacters to obtain a mix of a person hat has both traits from 2 characters? (we can see 2 persons in one face? )
IN FLUX?
(for example mix of obama and trumpt whatever examples)

tidal kayak
#

Hello. How can I upload an Image and then change it ?

weary comet
turbid sparrow
#

i have two gpu's one with vram 12 gb on with 16 gb. is there any possibility to run stable diffusion video using these two. its would be a great help. i am new learner .

#

i am able to run sd 2.1 but i want to try 3.0 or 3.1 or svd.

tepid inletBOT
#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

ionic umbra
#

Hello, I want to make a prompt to generate borrowers, as in miniature humans, like from the book and The Secret World of Arrietty. So far all results have yielded kids, which is definitely not what I want. Any advice or guidance?

sand wind
#

umm sorry to ask here if i shouldn't post in this channel.. I just recently began trying ai generating pictures, and was wondering how to correct use the trigger words 'random something'?

For instance, if i type 'random color hair', the results i get most often is just red/blonde hair color. I meant to let the trigger words activate that choosing from
red/yellow/orange/blue/cyan/green/brown/purple/pink/black/grey/white, but not sure how to correctly type it in the trigger words list..
Help would be appreciatedsadcat

#

i tried something like
(random hair colour choosing from red blonde orange blue cyan green pink purple brown grey white black:1.2)
but the picture results i get are still mostly red/blonde/blue hair and stuff, no varieties at all..

#

or if anyone can kindly show some sort of correct template for these type of trigger words?waow

tired vigil
#

how do I prompt for "waving hair" without making the person wave lol

tired vigil
sand wind
#

gotcha, will try comfyui sometime soon. still currently trying webui waow

tired vigil
#

corrected

sand wind
#

Looking for help,
I was just on trying the sd-dynamic-prompt plugin for webui... I made a simple wildcard including red yellow orange blue etc... color words, however when i try to implement this into generating pictures, somhow instead of randomly choosing the word from the wildcard, instead the webui generated the wildcard trigger phrase itself as a whole..(see picture)

Is there anyway to solve this (let the program correctly randomly choose words from the wildcard)? Really looking forward for help 🥺

violet glade
#

I am totally new to discord and I am interested using Stable Diffusion. I am a bit lost on my way finding access to SD. Can someone point me to the right place? Thanks

silver valley
violet glade
#

Thanks. My PC is not powerful to run SD. I heard SD is available on discord but I could not find the dreambots. Help please!

silver valley
violet glade
#

I see. Thanks for your help!

silver valley
#

Np

slow mist
#

i need help with dreamshaperXL
cawboy shot change the content and add cowboy elements instead of change the view

what should i use to get wider shots?

#

i guess i can try with medium full shot

willow flume
#

Underwater world with colorful fish, coral reefs, and sunken ship, illuminated by natural light filtering through water, in a hyper-realistic style

fleet pasture
#

Prompt: A cyberpunk city, neon lights, rain, 8K resolution, art nouveau style, lora:anime_style:1

quiet raft
#

Hey, is there a good tutorial out there that you would recommend for me going to train my own Checkpoints, Lora's etc on my own local PC with auto 1111

silver valley
#

Dont use dreambooth, It will break your auto1111

quiet raft
silver valley
#

Auto is not for training

quiet raft
# silver valley Yep

Aight cool, I will check out those two and check up some tutorials. I would like to update a checkpoint I have been using and it's like 2 years old haha 😄

silver valley
quiet raft
silver valley
quiet raft
silver valley
quiet raft
ashen glade
#

Identification request: Does anyone know the specific image model, or artist name, for this art style? I already tried doing a reverse image search and asking Pixtral to identify it, but got nothing.

dapper flame
teal slate
#

How do you just sharpen an image with img2img? I have a blurry portrait and I want to make it super sharp with like fine skin details and such. I tried running an SDXL img2img with a denoise value between 0.2 and 0.3 and prompts such as "close up photo, 8k, detailed skin, skin pores" etc, as well as scaling it up from 1024 to 1536 or 2048, but it just comes back either blurrier than the input or if I increase the denoise it changes it up too much on the macro-level, that is all the facial features etc.

#

I just want to add in the fine detail, not rearrange the entire face

quiet zodiac
#

I'd use Florence2 to generate a prompt for the original image, then use the original image as a style transfer on the IPAdapter. There is a sharpen node in comfyui, too.

limber shard
#

I just talk to Hunyuan now.

#

It's like C3PO but with uhm oriental sensibilities.

sour beacon
eternal osprey
#

Hello people, is there someone who knows how to make anime indoor scenery? what model do you use?
am trying to make AI scenery that fits this style

nova relic
#

Hey everyone, I'm trying to find a prompt for Flux that allows me to have a character with their hair as clothes, like Bayonetta, for example.

eternal osprey
#

yeah thats good @nova relic but when i try to remove the character the scenery becomes literal ass lol

nova relic
eternal osprey
#

this is what i get when i dont have a character lol

nova relic
#

Oh... What model you using?

eternal osprey
#

this is anything v3

#

idk i couldnt find a model good for indoors scenery

nova relic
#

SDXL ?

eternal osprey
#

yeh

nova relic
#

I'm trying something

eternal osprey
#

30 steps

nova relic
#

30 should be good. I always used 50 when I used SDXL

#

With Flux

#

What's your prompt?

eternal osprey
#

so basicly i want a cozy wooden shed with a nice bed, villag theme indoors

#

so something like this
Cozy Bedroom, simplistic, wooden furniture, Wooden Bed with comfy pillow, Sun shining through the roof cracks

eternal osprey
nova relic
#

it might just be that the checkpoint you use is not made for that...

#

very detailed illustration of an Anime cozy scene inside a wooden shed with a nice bed in a small village,

eternal osprey
#

could be anything, its my 2nd day learning

eternal osprey
#

i downloaded anything v3 and its vae and i run it on gui thats all i know

#

what setup do you use for these results?

nova relic
#

Setup? You mean my hardware?

eternal osprey
#

i mean model and lora and stuff

nova relic
#

The ones I sent were made with Flux. Another thing from SDXL. No Lora.

eternal osprey
#

caz i got this on anything v3
Cozy Bedroom, simplistic, wooden furniture, Wooden Bed with comfy pillow, Sun shining through the roof cracks

nova relic
#

I switched to SDXL with this prompt: Very detailed anime scene,inside of a shed,cozy bed,small village,godrays from the windows,

eternal osprey
#

maybe i really need some godrays limme try

nova relic
#

Don't use "Sun" if you're looking for lights. Use "sunlight" instead

eternal osprey
#

checkout this godrays, i copied same promt lmao

nova relic
#

It's your model, then...

#

Try mhxlAventisHorizon

eternal osprey
#

ok ill try

nova relic
#

What's your graphics card?

eternal osprey
#

rtx3080

#

laptop

nova relic
#

16Gb?

eternal osprey
#

10

nova relic
#

You're good with the model I linked. It's 6.5Gb

eternal osprey
#

so am supposed to have same result as the cartoony pic with the same promt?

nova relic
#

Really depends on the prompt. But I made the last one with the model I linked

eternal osprey
#

its very good, ill give it a go then

nova relic
#

Also... Wait... What is the resolutions you used?

eternal osprey
#

1200x675

#

16:9

nova relic
#

Try 1152x896

#

With your current model

eternal osprey
#

recommended by the author?

nova relic
#

Recommended for SDXL I alwaus use these dimensions

eternal osprey
#

ok sounds good, how long do you think it will take to generate it then with 30 steps?

nova relic
#

That depends on your hardware. What did it take for the last one you posted?

eternal osprey
#

few seconds, but its shit idk

nova relic
#

1024 x 1024
1152 x 896
896 x 1152
1216 x 832
832 x 1216
1344 x 768
768 x 1344
1536 x 640
640 x 1536

These are the resolutions recommended for SDXL

eternal osprey
#

ill use them then, but is there anything else other than that? like sampler and diffuser?

nova relic
#

If you're into it for only a few days, you might not know. But resolution is pretty important for the quality

eternal osprey
#

i do karras and ddim

nova relic
eternal osprey
#

ok ill start it

nova relic
#

That's what I use, anyways.

#

But you can try other things

eternal osprey
nova relic
#

yep, go for it

eternal osprey
#

20 minutes

nova relic
#

20 minutes?!

eternal osprey
#

14-16 fluctuation

#

is opera gx a problem?

nova relic
#

It shouldn't...

#

20 minutes for one image is... A lot...

eternal osprey
#

idk why XL models take all that time while the smaller models are like 15 seconds

nova relic
#

Even with my old 2070, I had like a minute or two max per generation...

#

Did you change model, yet? Or you're still using the old one?

eternal osprey
#

ye i did

#

top left

#

gpu going all out on it too

nova relic
#

It might just be the model loading in your VRAM that takes some time...

silver valley
#

For 8gb vram
--xformers --medvram-sdxl is required
Add them to the commandline_args=

#

Then it should take 1 minute max

eternal osprey
#

tanks cs1o

spiral narwhal
#

how do i highlight a lora and use the shortcut to raise/lower the weight of it?

real lark
#

Can I proved an image and ask it to generate images based off that ? Like a pic of a person provide then have stable diffusion make in a certain style

quiet raft
#

What is the best negative prompt to get ride of these?

#

Rig cages?

crimson patio
#

I think it's the first time I've posted on this channel but I'm not quite sure what to do anymore. Do you have any advice on how to type a prompt well so that the character points to something? “Pointing at xyz” often doesn't work, even increasing its power. Just the hand often points to something random. Any advice? / noobAI

icy geode
slow warren
still marten
#

Quick question guys. How long is too long for a prompt? (SD15)
I keep getting not exactly what I want (not a problem for now) so I keep adding more prompts and I might be overthinking but I feel like it started to get worse

twin yoke
#

For the most part, SD 1.5 isn't going to be super great about prompt adherence no matter what you do. After a certain amount of tokens, it will begin to do some math that's essentially going to be hit or miss and likely just dilute the adherence. Because of how this works, tokens might be one word or several, so it's difficult to say how long a prompt could be from a traditional sense. But if you see things getting worse and not better, odds are you've probably gone too far.

#

Each model version has its quirks about this. XL is definitely better with adherence, with longer prompts, and with proper sentence structure, but it still shows limitations, has missing tokens in lots of models, and can still visibly break down at a certain point.

#

Cascade was truly amazing for prompt adherence...
...for like 7 words. 🤣 Then it broke completely.

Flux is better for prompt adherence than any other model version overall, in my opinion. (I'm sure there are studies that you can find that will show results for this; it's pretty commonly ran every time there's a new model.)

still marten
twin yoke
#

You can, but it's likely that you'll essentially see diminishing returns.

#

The best thing to do is to change words and use smart selections in your negative on SD 1.5.

#

Consider that the prompt is not "english" in the traditional sense. Every word in the prompt has a likely visual impact. If you have something in your image you don't like, look at the words you have and see if there's something that might be misinterpreted. If you're missing something, use a different word to describe it.

still marten
twin yoke
#

It's funny you say "night". Things like "nighttime" can sometimes display a clock because of the "time" portion of the word. Don't want the clock, change to just "night".

heavy dew
#

How to describe style like this? Actually, it is difficult tho. I mean, what artists specialises on urban style like this?

oblique crane
#

Generate an IP three-view of AIDS-prevention

crude crystal
#

I am trying to make this little goblin dude ride front mounted on a mech, armored personnel unit from the matrix style. I cannot achieve this. The attached image is the best I can do with flux, pasted together in photoshop

quiet raft
#

What is a good way to not get these at the back and front on models?

#

If you watch the lower area

minor lynx
#

I’m trying to get a set up to create xenomorph variants loosely based on other sci-fi creatures, in this case creating xenomorph variants of Pandoran life forms, from the avatar movies

#

But the AI seems to keep rejecting my instructions and making the image too xenomorph like no matter how I try to prompt it

sour beacon
#

"An otherworldly creature reimagined with Pandora's lush and bioluminescent ecosystem in mind. The creature has sleek, organic, chitinous armor resembling a mix of blue and glowing crystalline textures, with faint glowing veins of blue and green running across its elongated body. The creature's head crest mimics the iridescent patterns of the banshee, reflecting purples and golds when illuminated. The backdrop features a dense, glowing jungle with massive, towering trees, floating mountains in the distance, and soft, ethereal light from the planet's moons filtering through the misty air. The atmosphere is otherworldly, blending the haunting and predatory nature of the creature with the mystical beauty of Pandora's ecosystem"

mint thistle
#

/dream:

buoyant phoenix
#

Hello hello, i've been trying to use embedding stuff for example EasyNegative, but im not really sure where I should put it.
It's a .safetensors, but i've seen people putting embeddings in the, well, embeddings folder, but nothing shows up for me in the textual inversion tab.

silver valley
stone path
#

hello there, does anyone know which prompts could I use to attempt to recreate when the sleeves cover partially the hand, reaching only half of it? like in this image

sweet rock
#

ChatGPT mentions "oversized sleeves", "partial hand coverage", "cozy fit" but I haven't tested these

robust mantle
#

how can i have the output look more like the input picture, making roblox faces for fun

noble estuary
#

Hope someone can help me on this..

The text outside of the parenthesis is the base prompt for all my characters then the inside text is distinct for each character.

The issue I am running in to is the styles of the images are not consistent. Yes the character elements are correct but the output is not following the style guidelines provided. I have tried adding a seed from an output I liked, starting with an image, a whole variety of negative prompts etc..

Any advice for me? I'm trying to create 200ish characters in a similar style.

A (digital caricature in realistic proportions:10) of ({A young man with neatly styled dark brown hair, dressed in a crisp white shirt under a soft beige
sweater. He sits at a wide wooden desk, surrounded by an open linguistics textbook, a laptop displaying phonetic symbols, and a notebook filled with handwritten notes. The background features a quiet study room with shelves of thick reference books, a chalkboard covered in linguistic diagrams, and a softly lit table lamp casting a warm glow.})
The scene creates a soft and clean illustration style. The overall aesthetic is modern and illustrative, including (flat design elements:10), with smooth lines and vibrant shading.

#

The weights are also a WIP, I have tried all kinds of combinations..

noble estuary
#

This is essentially what I am after..

topaz root
#

In my experience, the flux model can understand the user prompts because it is using the another text encoder

noble estuary
topaz root
tired vigil
#

what is the system requirements for flux

#

cuz its extremely slow in my laptop

cyan kelp
#

Hi, can someone tell me how to create looped videos on AnimateDiff Lightning ComfyUI? Its not working despite setting closed_loop to true as it does for other motion models

stark lark
#

Where can I find information about those weird prompts I always see on civit like core_6 or score_5? I don't get it lol.

slender pasture
#

needed to add a VAE 🙂 simple fix; ty anyways x

rare peak
keen panther
#

Hi guys, I want to generate assets for my game, however, the level looked like it has been taken from a far away view. While this is fine for a lot of games, my game is in the perspective of a mouse so everything is massive relatively. What should I do to fix this? Here is my prompt:

(2d side-scrolling platforming game, side view:0.15), extreme closeup, soil, dirt, hovering platforms. HD, high definition, high resolution, masterpiece, detailed, realistic, stylized, intricate, digital paintinglora:lcm-lora-sdv1-5:1

blur, blurry, blurred, distorted, characters, stone, rock, bricks, grass

peak sun
#

hi

hexed drum
#

what's everyone's favorite sampling method?

neat goblet
#

hey guys

#

Is there a prompt I can use to make the character drawing face down?
It always turns them face up 😦

idle estuary
#

Hi I am new to this space.
Trying out search and recolor feature for a project. The goal is to change color of the shoe while maintaining the texure of the material.
I am able to change color based on 1 color and the entire shoe changes the color but i want to get more control over this.
I have a specific color palette that i want to follow and i want to change specific parts of the shoe like the side stripes only or have the body be a different color and strip be another color.

Can anyone please help to achieving this.

vital depot
#

anyone know what prompt i can use in img2img to make this alleyway look like it's at dusk time

surreal bobcat
#

Create a cozy and festive social media advertisement for a company named 'EcoGlow Candles,' promoting their eco-friendly soy candles for the holiday season. The design features:

A warm home setting with a rustic wooden table.
Elegant soy candles in recyclable containers glowing softly, surrounded by holiday decorations such as pinecones, ribbons, and fairy lights.
A color palette of gold, cream, forest green, and earthy brown tones to evoke a comforting and natural atmosphere.
Text overlays that include:
Headline: 'Light Up Your Holidays, Naturally!'
Subheading: 'Shop Now and Get 20% Off on Holiday Gift Sets!'
Modern and minimal fonts, with the headline in an elegant serif style and the subheading in a clean sans-serif.

#

#artisan-1 Create a cozy and festive social media advertisement for a company named 'EcoGlow Candles,' promoting their eco-friendly soy candles for the holiday season. The design features:

A warm home setting with a rustic wooden table.
Elegant soy candles in recyclable containers glowing softly, surrounded by holiday decorations such as pinecones, ribbons, and fairy lights.
A color palette of gold, cream, forest green, and earthy brown tones to evoke a comforting and natural atmosphere.
Text overlays that include:
Headline: 'Light Up Your Holidays, Naturally!'
Subheading: 'Shop Now and Get 20% Off on Holiday Gift Sets!'
Modern and minimal fonts, with the headline in an elegant serif style and the subheading in a clean sans-serif. prompt help

fast oriole
#

Anyone have a workflow or info of lifestyling a product? Like a chair product and create a model to sit on it, or a bike and create a kid to sit on it, something this way

quiet raft
#

How can I make controlnet work for Pony Checkpoints? :/

fair dust
#

Hi everyone!
I’m looking for advice on creating prompts in SD that feature two or more characters (LoRa) interacting with each other. Any tips on structuring the prompt, avoiding issues like characters merging together, or making the interaction feel natural? Also, are there any tools, keywords, or techniques you'd recommend to improve results for multi-character scenes?
Thanks in advance!

sacred birch
#

hey how can i prompt 3 people in the same image ?

obtuse tundra
#

Ok so I have a prompting question- I'm using DrawThings on the mac, as well as Civit.ai (both for exploring models, and experimenting with creating). If I import a LORA into DrawThings, I usually forget to add a trigger word to the model when importing (as opposed to Civit.ai which automatically tells you what they are. My question is, does this matter? If I'm explicitly loading the LORA in DrawThings, do I still need to call the trigger word in the prompt?

placid pollen
placid pollen
obtuse tundra
#

Another noob question- is there a way to find out what keywords and triggers a particular model has?

regal dome
pulsar sonnet
#

does ‘full stops’ do anything in prompts? or is it just commas

fair dust
#

What tools are you guys using for Regional Prompting and which WebUI?

unreal willow
#

is there any good way to have good quality result? If i use a model that isn't anime so realistic model or a mix between anime and realistic, i keep having problem, the skin seem a little weird, the face seem weird. is there any ways to fix that?

silver valley
unreal willow
#

i should probably try that

silver valley
#

Sdxl has a good quality

unreal willow
#

i've try with pony but never with sdxl

#

i'll check these 2 model out

#

thanks

silver valley
unreal willow
#

never tried that one

#

i tried Real Dream once but it wasn't as good as i thought it would be

glad urchin
#

@silver valley Okay, so here the results of a regular img2img:

So either there's very little difference, or... it's suddenly an entirely different cat, with a totally different shape and look.

Now, when I use inpaint and draw a small streak on the cat, it doesn't do anything:

A thick streak makes it again a total abomination:

If I cover the entire cat, it again turns into a totally different animal:

silver valley
#

Do you want a different cat or do you just want the same cat but different?
Or do you want to change little detailes?

glad urchin
#

But honestly, I'd be happy with lots of different things

#

As long as the cat is still recognizable.

silver valley
#

You can't get that with inpaint or img2img

#

Its directly generated

glad urchin
#

Like... I've added presets from SDXL Styles. It gives a bunch of options. Any of those would also be nice, but I can't seem to be able to get this specific cat turned into any of those styles. It always becomes a different cat.

glad urchin
#

What options do I have then, with an existing image?

silver valley
#

Img2img is for gettinf variations of an image.
Inpaint is for changing details on images, replacing stuff or outpaint (enlarge images)

#

You could try using controlnet extension + IP-Adapter style and then input the deer image as source

glad urchin
#

Okay, I'll check if I can find a guide on that.

#

I have played around with controlnet already, but IP-Adapter will be new for me,

silver valley
#

Ah okay, IP-Adapter is a really cool thing

#

It can do style transfer as well as faceswap

sweet swan
#

@glad urchin send me a dm for guidance and assistance mate

glad urchin
#

I have all the other ip-adapter preprocessors available, but not that one. And I think it's pretty crucial it's that one, if I want to combine it with the ip-adapter_sd15 model?

glad urchin
#

Thanks for the tip @silver valley !
This IP-Adapter is an amazing tool!

#

It's still not exactly what I had in mind, but it already looks pretty good for a first attempt 🙂

#

I also think it's clear that this is the way to do it. And that what I was previously trying was clearly wrong.

silver valley
glad urchin
#

I'll check if I'm still missing some when I get home later today.

#

Are the differences that big?

silver valley
#

I need to check my preprocessors first, then I know if something changed or if you miss one

#

Will make a screenshot later when I'm at home

plain citrus
#

hey, im not sure where to ask this, but im looking for recomendations for settings to give good looking ai art in stable diffusion. these are my current settings

silver valley
plain citrus
silver valley
#

and whats your gpu?

plain citrus
#

hold on

#

how do i check the model?

#

my apologies im not great at this 😅

plain citrus
silver valley
#

And if I know your GPU I can suggest you stuff it can handle

#

Like sdxl models or upscaling

plain citrus
#

sure hold on

plain citrus
silver valley
#

In the top left dropdown

plain citrus
#

ohhh

#

its kinda weird lol, its just one ive been using since my older stable diffusion

silver valley
#

Ah okay thats an old 1.5 model

plain citrus
#

ohh oki

#

should i worry that could be making it worse?

silver valley
#

Its not as good as newer sdxl models. They have a better quality

plain citrus
#

should i look for a newer one?

silver valley
#

That or you can first try using Upscaling

#

That enhances the quality of images

silver valley
plain citrus
#

the results are better, but im finding hands and faces to be pretty rough, something im used to, but still surprising

#

originally the sampling was dpm++sde karras

#

should i have it as that?

plain citrus
twilit raven
#

Lf a checkpoint for realism and one for anime.
--> I have an rtx 3060 (6gb vram).
For realism, I tried flux nf4 v2 but it's 1,3 min (it take too much time)

silver valley
#

Or a pony model

spiral copper
kind wyvern
#

I have been very fascinated with using Trellis recently and I need help creating good prompts for SD3.5 medium. I have tried countless different combinations of prompts to try and remove shading and lighting from the objects that I am trying to make images of, but had no success.

When creating the 3D models in trellis the shadows basically get baked into the model if they are in the image. Any help would be greatly appreciated.

honest wyvern
#

For img-2-img, is there a way for me to keep the background and clothing unchanged/altered while only changing the character? Say for example, I have a picture of 2B in a certain pose, outfit and background and I want to use that image to generate an image of Raiden Shogun with accurate body proportions to her and not 2B while keeping the elements of the image I want to keep

stark lark
#

Any tips to change colors of clothing without the use of Lora?

#

I have all the commands for blue pants in my positive including stuff like (blue_jeans:1.5) and in the negative all the 'yellow_pants, green_pant,' kinda prompts in my negative

#

Is there a secret to it lol or is it because the model I use has no idea what blue even means in relationship to pants and that's why it doesn't do it?

#

Sometimes when there are characters in the background they d wear blue jeans however lol.

quiet totem
#

Hey all! Hope everyone doing good, had a question, why is it when if you try to remove a dark cloth/clothing/curtain etc. It turns the wall/body etc the same shadow? I tried putting no shadows on negative prompts but it keeps giving me shadows on walls or on windows etc.

stoic halo
quiet totem
#

Is like everything else is normal light, but that one spot has a shadow like if there was something infront of it

stoic halo
#

It has four configurations. Like: Original. Filling. Denoising. Blank nulls. You're probably using the original, so the program uses the original pixel color. Try using a stronger diffusion value from 0.7 or change the configuration.

quiet totem
#

Is the stronger diffusion value the CFG scale? Or the denoising strength?

quiet totem
topaz yew
#

Hey someone do here faceswap with A1111?

modern bridge
#

hello, you think you knows why it gives a bad quality in my stable diffusion from my pc
on right it's same parametter from tensor ia (but the vae is automatic, and i can't know which why is used here ...)
can i anyone guess ? or i didn't gave enough detail ?
thank you

torn pond
#

what to do if on my face swap ears are not looking real? i already tick options" "face mask correction, soft inpainting, denoising strenght on 0/0.11" i as well add some prompt to img2img. what i could do else to correct face swaps?

tidal spade
#

I'm trying to generate a cat girl with 2 tails. Any tips on how to get that to be more consistent? I commonly run into the issue of it deciding to do twin tail hairstyles and sometimes cats in the background. I'm using a pony checkpoint.

stoic halo
#

Guys, do you know why it generates better pictures in a built-in editor in civitai than on a computer? Both id and prompt and everything else is the same.

empty sluice
#

tlaking about lora training? they had their settings set in stone whereas you can tweak it locally

stable valley
#

hi there, is there a way with flux to save an image so its the shape of the image i.e. no background, like a png file in photoshop? or will it always save a "canvas" behind it? i want a transparent background.

bright grove
#

any1 knows how to caption image to train lora for a char

#

i want char to remain same

narrow needle
bright grove
#

how do i fix this

boreal gull
#

Is there any way i can know what style this is how can I recreate in sdxl or can find this style and create lora for sdxl and flux

vast oasis
#

Who can help me? I've tried at least 50 negative prompts, but my models always have deformed hands, missing fingers, crippled hands. Who has the right prompt so that I can realistically display hands? Thanks for your help.

iron granite
#

i've tried so many prompts but my images are always weird or off putting? I also just can't seem to find the SD 1.5. the huggingface link doesnt work. can someone give me a good negative prompt?

silver valley
iron granite
#

isnt it better to use it with that then?

silver valley
visual umbra
#

does adding line breaks within the prompt influence the output? or will it the image be generated the same either way?

#

I'm thinking of adding line breaks simply to kind of 'organize' my prompt a bit better and make it look cleaner, but not sure if thatll impact the generation

quiet raft
#

How can I create cute dogs/cat photos with stable diffusion?

#

I would like to make them as similar as possible with how the real photo looks but more cartoon ish you know?

pallid marlin
#

prompt idea to generate something like this?

#

It doesn't have to be vignetting

misty bough
# pallid marlin prompt idea to generate something like this?

Vast cosmic landscape, featuring a mysterious planet shrouded in swirling misty clouds, vibrant nebulae in shades of green, purple, and gold, glowing stars, and distant galaxies in the background. Ethereal light flares emanate from celestial bodies, creating an atmospheric and mystical vibe. Space is deep and dark, with vivid contrasts between shadowy regions and luminous cosmic elements. High detail, cinematic, atmospheric lighting, ultra-realistic, 8K resolution.

empty sluice
vestal fjord
woeful sinew
#

Hey y'all. I'm completely new to SD and don't really know where to start. I want to create a pixel art image like the one attached (I got this one from ChatGPT)
I don't really get, how I should prompt for SD. The images it creates are not good at all.
This is the prompt I used so far (with a lot of iteration and experimentation):

Pixel Art, 16 bit, penguin and ferret with headphones sitting at a round table having a business meeting with a retro microphone mounted on the ceiling, simple background light grey

I used a Pixel Art-Checkpoint (https://civitai.com/models/277680/pixel-art-diffusion-xl) as model and 4xUltrasharp upscaling because it was recommended by the creator of the checkpoint.
I set the steps to 40 and CFG to 7

If there is anything completely wrong with my approach please let me know. Also if you can recommend any ressources for a beginner I would be very grateful.

lone idol
#

hey guys, how do you do white skin? I mean literally white skin not pale skin, something more like white wall paint type white. cuz "white skin" just does normal skin

lone idol
quiet zodiac
#

@pallid marlin There is a technology called Florence2 that can analyze any image and give you a prompt to generate an approximate copy.

young horizon
#

How can I prompt for a cyborg/humanoid robot? I just always get real people, there is no hint of robot/cyborg in the images. I have tried generic "cyborg" and "humanoid robot", as well as trying to give lots of skin/plastic/electronics details

neat goblet
#

hey guys

#

i need a help

#

I'm trying to make a brown grizzly bear (all brown).
But the chest is a lighter shade.
Is there a label you can provide me with?

#

I use this prompts for the color of their fur

#

(brown_bear), ((brown_fur)), ((brown pecs)), (brown chest), ((brown body)), ((brown belly)), ((Brown fur on chest and stomach)), (brown chest and stomach), ((completely brown body)),

#

I used that, but so far I've gotten results like this

#

But the problem is that I don't want the fur to be light colored, but brown.

past quest
#

guys if i want to train a model where i want to generate an image using the art style of training images, what will be the prompt ?

thin carbon
# past quest guys if i want to train a model where i want to generate an image using the art ...

dont quote me on this but:
you need around 10 to 50 images for a character, 100-4000 for styles or 50-2000 for concepts

First add a keyword that will trigger the lora (or don't) like: NdleStyle

i didnt use noodlestyle but a non existant word so your image doesnt suddenly get noodles in it.

Then just try to describe what's in the image (Better using tags)

there are AI tools that'll do the captioning for you on civit ai if your feeling lazy

#

but you also need a lora script or use a web tool etc

past quest
#

is dreambooth good?

#

i have only 64 images for training it soo not sure if its enough

thin carbon
#

i heard dreambooth is good for styles but not so good for characters/concepts

#

not sure either

silver valley
past quest
silver valley
#

Its more practical than training a model on an artstyle

past quest
#

no matter how hard i fine tuned

#

thts y i want to make model instead

past quest
#

i basically made lora from pixai and it was trash lol

quiet raft
#

Is there anything good to use any of thse when using the Extra tab and just upscaling the images?

wind kettle
#

how to do multiple charcaters prompts with noobai xl ? can i write a reference to one character ?

#

for example character1

thin carbon
#

Hmm i have some decent results but its mostly random, ill get back to you on that once im home

#

@ work rn

obtuse tundra
#

What would I put in a negative prompt to get rid of these comic flourishes that emphasize a surprised expression? These little yellow jaggies?

thin carbon
#

oh the shocked thing. i think thats just build into the "shocked / surprised expression" if its a model trained on danbuuru

#

could inpaint it away i suppose

quiet raft
#

Currently working on some stories and making characters.. I love how this turned out but I struggle to keep her facial stuff .. How can I keep her facial to another prompt with different outfits etc? I have copied the Seed number and keep the look prompt but I only change the clothing ..

wet moth
#

Create an image with a 50 cm handle, made of plastic or stainless steel, lightweight, ergonomic and with non-slip rubber. In the 30 cm horizontal part made of flexible silicone rubber and sponge bar, with rotating head, implemented with articulation at the junction of the handle and the horizontal bar for greater mobility and ease of use at difficult angles. Integrate a small reservoir into the handle with a spray mechanism to spray water or cleaning solution directly onto the windows. The handle can be extendable.

quiet raft
#

Do anyone know a video or could help me out in DM with Kohya_SS? I have tried to find any good tutorial or like a more step by step to create my own Lora but I keep getting some kind of error and not getting it to run. I don't know how to setup the folders to make it work. Happy for any answers ❤️

neat goblet
#

Hey guys, is there any prompt you recommend for making a micro/very small character (in size)?
I'm just looking for a prompt that will help me make a single character very small in size

obtuse tundra
#

At what point do you decide to call it quits with a base model and try a different one? I find myself pounding away trying to get Pony v6 to understand a basic concept (right now trying a girl surfing), and even with a surfing LoRA, I could not get it quite right. Then I switch to another flavor of pony, and it made a drastic difference in it's composition. I feel like I waste so much time burning through generation credits trying to pound a base model into submission.

silver valley
obtuse tundra
#

Hm ok good tip

sleek meadow
#

Sometimes I'll very slightly change a prompt for SDXL/Pony in Automatic1111, and the image instantly starts looking 'overcooked', like I used a wrong VAE or something. I'll just add or delete a word, my generations goes from normal to super HDR or just broken, and I can't really get it back without copying an exact prompt that worked. Anyone know the reason for this?

silver valley
indigo furnace
#

Any suggestions to help convince Hunyuan to only make 1 shot. Longer gens always split to 2-3 barely related takes. It seems to like very short prompts in my experience.

tired vigil
#

i found this on web how to make this i have promps but dont work is there a checkpoint for it?

hot finch
#

How do I put a negative keyword within the positive prompt space? I'm using A1111 and this is for a dynamic prompt

empty sluice
#

you dont?

hot finch
#

I thought there was some way with a '-'

quiet prairie
quiet prairie
hot finch
#

Didn't think about that. I'll give it a shot. What I mean't is using it for dynamic prompts. Like if for settings say a college party and a library, I want the negative prompt for college party to be "solo,alone" because I want the party to have people there. But in a library I dont want the negative prompt of "solo, alone". Ill try that with: {college party, (solo,alone:-1), crowd | library, solo, alone}

quiet prairie
#

but if that doesnt do it then i have no idea

hot finch
jade inlet
#

How is it possible to get variations on the same image?

#

If I add something to a prompt but I want to keep the image relatively the same

quiet prairie
quiet prairie
#

Can someone give me some tips on how to improve my prompting? is there any video i coul watch?

signal sable
#

Hello, I am working on an e-commerce project and I need a text-to-image model. I want to deploy this model on Google Cloud Platform (GCP), but this process seems quite new and complicated for me. Since I have limited time, I would like to know which of the following scenarios is more suitable:

Using ready-made GitHub models: For example, pre-trained models like Stable Diffusion. Can I import and use these models on GCP? If possible, can you share the recommended steps for this?

Google Cloud Marketplace: Would it be easier to buy a ready-made solution from GCP Marketplace? If so, what are the recommended APIs or services?

My goal:
To take inputs from user data (e.g. a string array) in the backend and return output via a text-to-image API.
Since I have an e-commerce project, I need a scalable solution for high traffic.
Information:
Backend: Requests will come via REST API.
My project allows users to create customized visuals (e.g. product designs).
Instead of training a model from scratch, I prefer ready-made solutions that will save time.
My questions:
Which way is more practical and faster? A ready-made model from GitHub or a solution from Google Cloud Marketplace?
If I prefer a model from GitHub, what steps should I follow to import these models to GCP?
How can I optimize a scalable text-to-image solution on GCP for a high-traffic application?
What platforms am I asking about:
If you have experience with Stable Diffusion or similar models, can you share them?
I would like to get suggestions from those who have started such a project on Google Cloud.

empty sluice
#

a1111/forge has it i think, if not as an extension

#

comfy has a node for ot aswell

empty sluice
visual horizon
quiet prairie
arctic glade
#

hello ! i am super new to stable diffusion, but i would like to generate art like this ! would anyone know which checkpoints/lora to use to achieve something like this ?

quiet prairie
#

i would personally say illustrious is currently the best with pony close behind, but pony has more loras avaliable than illustrious

#

you can find everything you will need on CIVITAI

quiet raft
#

How do you add two character in 1 image?

#

Is it as simple to just use the "BREAK" word?

silver valley
quiet prairie
#

put this in the tech support area

#

someone should be able to help you there

#

Alos could someone help me - should i have more steps in hires fix or in sampling steps?

signal sable
empty sluice
#

who knows

quiet raft
#

What am I doing wrong?

quiet prairie
eternal carbon
#

Hello!
Does anyone have a idea on which model is used to this type of pictures?

#

Thanks!

empty halo
#

Hello best wishes to you, can someone explain to me how to use AI and in which salon it is appropriate?

daring ermine
#

Hello! I'm new to this art stuff. Is there like, a correct way to string prompts? Particularly if its a more detailed image? Thanks in advance!

neat goblet
#

hey guys a question

#

I'm trying to make two characters in one image, performing an action (which would be one hugging the other).
However, I can't find any guide to generate an image with two characters (both of whom have LORAS).
Could someone help me a little with this? :/

#

this is my prompt

#

PROMPT:

in bathroom, duo, 2boys BREAK
joneo, big male, muscular, big pecs, bara, white eye, big tail, ((legless)), sweat, (hands on hips), lora:Joneo:1 BREAK
colossus, bara, muscular, smaller male, hugging, lora:Colossus:1

silver valley
neat goblet
#

But I have looked for guides and almost none have shown me two male characters.

#

=/

#

I don't even know if my prompts are fine

silver valley
#

Shouldn't be different than two girls hugging

neat goblet
#

They both have the same size, they both have one eye (1 character thing)

#

x.x

tidal crescent
#

What was it the prompt was called, for this hand gesture?

queen atlas
#

peace sign

#

try something like "a woman in a black winter jacket stands on a sidewalk in a city giving a peace sign with her right hand"