#🍥|anime
But sounds so outdated already
yes, AI is moving at a crazy pace
you need a 40990f fast computer graphics card, says the AI here
40 fast commputer
or the 90cm long 4900?
gotta jump, cu tomorrow

@native halo you need this PC to generate the biggest longest spikier fake elf girl hair
Dragon teacher took all of her.. Personality 
Does anybody have an idea of when SD3 is coming out?
before SD4

wut is dat modal?
oof I dunno about the yellow teeth...
is this that same pony one or whatever?
also, btw, the model specification in the prompt list worked! 🙂
ran a set overnight, and running another set now 🙂
yes, it's the autismmixSDXL_autismmixConfetti, a pony mixed model
that's great 
I love that her shirt just says "emo"
i should run this through a prompt set and test it out
please do, just know that you need to follow the prompting style of pony model
basically the score tag and maybe source_anime
oh... oof.. I dunno what that is. 😄
oops, didn't mean to combine that post and reply haha
you can read the pony page for explanation
but basically, there's a set of tags that were trained on, and you should almost always use it for pony mixed model
and the tags are score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up,
link here if you wanna read more https://civitai.com/models/257749?modelVersionId=290640
^ might have nsfw pic, so take note
so images are limited to tag definitions?
seems like it might limit creativity in prompts
those tags are trained as quality tags, so if you wanna put masterpiece, best quality, then use those tags instead
but well, you can do whatever if you want
it's just that the high quality pics were trained with those tags
even with all those tags i still can't get that shiny look from the older models, seems like no one has trained that style
lick the shiny
So do you add all those score tags? Or just the one that's like the cutoff point for what you'd want? Not sure i get this based on a quick read of this description.


imagine you go on the site and can search by the highest rated art
thats what those tags kinda are
their score
when the model was training
and it was supposed to be able to do each one individually but during training the model just learned that score_9 to 4_up is one giant tag that makes good images
just a big tag that works like (high quality, best, quality)
Actually, they’re not one big tag
They are individual and you don’t need them all, or any for that matter. You tend to get better results if you use some though
ugly anime
`It's Training time
We now have annotated data and can finally train the actual Pony Diffusion. Let's keep showing the model images and our captions containing the score tags so it also learns which of the score tags correspond to which images, giving us better versions of "masterpiece".
But wait, turned out I messed up a bit! What I described above is how PD V5.X used to do things, in V6 I wanted to also be able to say - "hey, give me anything 80% good and up". But score_8 tag would only give us images in range 80% to 90%. Perhaps using both score_8 and score_9 would work but I wanted to verify that, so I changed the labels from simple score_9 to something more verbose like score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up and score_8 to score_8, score_7_up, score_6_up, score_5_up, score_4_up. In reality I exposed myself to a variation of The Clever Hans effect where the model learned that the whole long string correlates to the "good looking" images, instead of separate parts of it. Unfortunately by the time I realized it, we were way past the mid point of training, so we just rolled with it (I did try to use shorter tags after the discovery but due to the way we train it didn't have as strong of an effect).
So, to summarize - we used a model trained on human preferences to label all data with special tags and then trained an text to image model on this labels allowing us to ask model for "good" images via use of these tags
Do I need to care
Maybe, in some cases.`
what the author said
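A minimal sketch of the label scheme described in that quote, for anyone who wants to build the string instead of pasting it. The function name and the `floor` parameter are made up for illustration; only the two example strings come from the author's text.

```python
def pony_score_tags(top: int = 9, floor: int = 4) -> str:
    """Build the verbose score string from the quote above.

    score_<top> comes first, followed by score_<n>_up for every n
    below it down to `floor`. The name and the `floor` default are
    assumptions for illustration, not part of any official tool.
    """
    if not floor <= top <= 9:
        raise ValueError("expected floor <= top <= 9")
    tags = [f"score_{top}"]
    tags += [f"score_{n}_up" for n in range(top - 1, floor - 1, -1)]
    return ", ".join(tags)


print(pony_score_tags(9))
print(pony_score_tags(8))
```

The first call reproduces the full `score_9 ... score_4_up` string and the second reproduces the `score_8` variant quoted above. Whether a shorter subset works better is a matter of testing, as discussed below.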
As far as I know, training is still in progress. I suspect there will be a closed beta test of some sort before too long, and public launch some time after that. I would think maybe a 1-2 months until public launch.
this prompt is a mess,lemme clean it for u 🧹
right, but the way auto1111 and other tokenizers work, it splits a phrase into recognizable words and converts them to tokens. While the entire string may in fact be trained as a single 'token' to be recognized, all the individual parts are also known. I tested it myself and each individual tag has its own effect, along with the entire string
my point was that you don't need the whole thing, regardless of how it was trained or what went wrong
all of the images i've been posting only use
score_9, score_6_up, score_5_up, score_4_up, source_anime,
because in my testing, score_8 specifically, and score_7 to a lesser extent, had more influence on an image than any of the others, and it wasn't a style I liked
others might like the way it looks, and that's fine, but you simply don't have to use them all, or any of them
if you're using a particular artist style or character name it usually matters even less, as it's going to use the examples of that artist's actual work it was trained on for reference, which in theory is already of the highest quality
but, as always, it depends on your taste/preference/prompt/settings/model/etc
and in reality, the only reason I argue the point is the same as the whole copy/paste word dump prompts people use on 1.5.. if you have a good model, it's already "masterpiece" quality, you don't have to prompt for that
holy 
with my custom 1.5 merge i've never used those keywords, i had a couple negative embeddings, no extra negative prompts and no lengthy prompting at all. IMO, if the model needs all that garbage, it's a bad model at this point.
(perfect hands,beautiful hands,5 fingers,not deformed,perfect slim and magestic hand:2.5)
```
score_9, score_6_up, score_5_up, score_4_up, source_anime,
1girl, petite, pointy ears, devil horns, black hair, purple eyes, choker, mischievous smile, elaborate gothic victorian dress, dark cathedral, dynamic pose, smug, looking down on viewer,
```
```Negative prompt: sweat, maid,```
it does hands pretty well without prompting for them
bad negatives
You want sweaty maids
👍
lol
well, apparently maid outfits count as 'victorian dress' in pony/autism
and with the particular artist style combo i'm using, it adds sweat on her face in every image
thinking 🤔
this is without the negatives, sweat drop on her cheek and a maid-ish looking outfit
the sweat isn't as bad in this particular prompt, but i've just built it into the preset I have for that artist combo 🤷♂️
rare guy generation
someone had to do it.
grabby?
grab these gifts instead
farming in LE, don't wanna see ai anymore
You killed my eyes
model?
flatpiececore
What SDXL models generate eyelashes that are the same color as the hair?
Cuz, usually the eyelashes are black regardless of the hair color
try with just a prompt like (white eyelashes:1.6), and (black eyelashes:1.5) in the negative
my honest reaction
Why is Rem being arrested?
who is rem 
howdy, is the lora folder for comfyui different than for "stable"?
https://comfyanonymous.github.io/ComfyUI_examples/lora/
put them in the models/loras directory
tyvm
does the comfyui prompt textbox accept embeddings just like stable?
no wait, you need to add stuff like embedding:name of embedding, or something like that
but there are custom nodes to autocomplete that
could you show an example?
found it, it's confirmed, the text is embedding:name_of_embedding
afternoon, bright sunlight, in the living room, a boy (reading a book), (rocking chair)
hmm I cant create some characters in autism confetti
sadge
it doesnt have honkai starrail characters
A..nd I merged animagine v3 and autismix confetti, ehh but images are so bad for sm reason
This message was translated by AI. There are no bots here. Most of us run Stable Diffusion on our own computers.
Pony based models (like Autismmix) are not very compatible with other models, hence the separate category on CivitAI.
just need a char lora
oh wait u are on XL
👍
Oh no Yue is falling
a glaz drakona superhero armor with nuclear symbols on it, cute, no helmet, girl, cute, ideal body, metal skirt, dtailed realistic Dungeon Siege III background, cute, beautiful, gorgeous, pink hair, seductive pose
The prompt)
It's so hellish
she'll catch yue!

What happened to Yue?
I agree
the regional prompter just blending them is adorable 😄
Just reminder that Yue likes to hang out with girls so take good care of her 
she's in good hands 😄
Imagine not having a 4090 or something like that to train xl Loras
Which one
hard to imagine tbh 😏

damn stochastic rounding a game changer for bfloat16 training
man, change one word in a prompt and now i can't post anything here 
You can send to me
😄
for science
how much strength model/clip you guys use for loras with comfy?
depends on how many loras i load
i try to keep the total ratio around 90-100% and then play with each ratio to see how far i can push them without burning the image or confusing the lora
except for lora like lcm, that need a certain weight to do their job
depends on the lora and model really, some work best at like 0.3, some are best at 1.2, depends on how well it was trained
Just train a lora then
👍
Or don't gen those characters
They are stupid
loras not good when I train them
souka
Sasuke
tried to generate that purple girl (kafka), but autismmix dun know her
i typed "Deathstare"

❇️ 
Does anyone know how to get animal ears on the side of someone's head? Like is there a lora or special prompt for that?
really
I played with ideogram a bunch before they put a queue into it and a limit
and it was okay, some good text
I definitely think text is its strong point

new model doesn't know 2b
no more 2b from behind, life is bad 
Its definitely really good
miles better than before
can't say if its better than dalle3
dalle3 is so bad to use that I think it is lmao
You seen Mooey's Dalle images?
that looks cozy
Asserting dominance
that is through the api right

I'm pleb user

too smol, no dominance asserted
I believe that's just made with gpt4
making miss library from misao without a lora is going to be a painnnnn...
target and result
What's the difference between a regular model and an inpainting model, for inpainting?
Yes...
something about information in them that makes them better for inpainting I have no idea I don't inpaint I just click generate again 
I'm too
to notice a difference in them
no skills?
her forehead is big
thats the point of the meme,junior

like this? 

Im?
Me when the tea I make tastes nice

now thats a win
||`Morrigan Aensland,
1girl, bangs, black background, black jacket, black shirt, black shorts, blue eyes, blunt bangs, cutout, clothing cutout, crop top, eyeshadow, fangs, gradient hair, grey hair, hair rings, jacket, light purple hair, looking at viewer, makeup, multicolored hair, navel, open mouth, red eyeshadow, shirt, shorts, sleeveless, sleeveless turtleneck, sleeveless turtleneck crop top, solo, stomach, turtleneck, turtleneck crop top, two-tone hair, vampire`||
i need to make an emote or something for prompts like that
dunno why it triggers me so much
```
1girl, bangs, black background, black sleeveless turtleneck, croptop, black shorts, blue eyes, blunt bangs, (clothing cutout:0.9), midriff, fangs, gradient hair, grey hair, hair rings, light purple hair, red eyeshadow, two-tone hair, vampire
```
probably something to do with my IT career and doing so much tech support and customer service over the years, it bothers me when people get misinformed, and then end up spreading the same information and methods to other new people
Not saying it's intentional or anything, youtubers are mostly to blame since none of them bothered to learn anything before making a bunch of videos on it
thats a lot of assumptions. I'm just sharing the prompt. I dont recall spreading any misinformation.
im using old models, old prompts, and old workflows. idk. the guy wanted to prompt

I'm not saying you are, just that it has happened
always awesome to get pointed at when having fun...
even with older models, repeating the same words multiple times doesn't do anything
okay, thanks.
Like I said, I'm not saying you're at fault
I'm saying prompts that are structured that way bother me because someone, at some point, has spread misinformation on how to prompt
and it's not helping other people to create what they're trying to create, it just makes things harder
next time i'll let them know I'm just going to cause them more problems if I share the prompt.

ok, i guess just be offended then 🤷♂️
I never said not to share the prompt, i said the prompt bothered me
Either you learned something, or I'm just some random dick on the internet that made fun of you
doesn't matter to me really
yeah
i just hate coming on this thread because there's always someone who has to do that while I'm just trying to share. but its cool. I wont spread misinfo and ruin people's gens.
maybe you shouldn't be so quick to get offended over it, it has nothing to do with you or what you're sharing, sorry for the attempt at being educational
yeah
it would be better if you stop typing
Prompt, please.
1girl,pink hair,very long hair,green eyes,(latex:1.5),nurse outfit,thighhighs,garter straps,(arms behind back:1.4)
Tahnx
Lora? Or model. It feels like it's a style specific model
Ohh it's gradient hair😂😂. Here I am yapping about models and loras
dark scenes and backgrounds are my biggest complaints about Pony XL
I always prompt white backgrounds so I can't comment on that lol
Kohaku-XL-Delta is released now btw https://huggingface.co/KBlueLeaf/Kohaku-XL-Delta
So many xl models coming out lately
Hopefully Kohaku's training methods will catch on because they were able to train this model on 2x3090's. Pretty much all of the other big XL models have needed A100/H100's to train in a reasonable amount of time and most of them still don't handle less common tags as well as kxl-delta.
Wouldn't sharing this be better since it tracks downloads and reviews?
https://civitai.com/models/332076/kohaku-xl-delta
I suppose so. Kohaku used both links in the announcement posts they made in other servers.
Yeah I understand why some prefer hf but imo for casual users CivitAI is better for visibility
Oh it's a merge
Nice
Well, trained base model + merge
It's a merge in the sense that the bulk of the training was done on a LoKR and that was merged into the model instead of training it directly into the model which is less efficient. That's the trick that allowed them to pull it off on relatively weak hardware.
from the sample imgs it looks wobbly and the background looks less detailed prob due to the low training
Hey guys, ive been using SD 1.5 for a long time but i want to try out anime SDXL, but i dont know how to use it, so i have some questions:
- what is the refiner? Is it needed for anime
- do 1.5 loras work on sdxl?
- do 1.5 negative embeddings work on sdxl?
- is the tagging style the same as 1.5? Like booru tags
- Is hires fix necessary in all images?
- Is the VAE the same?
Tysm for the help.
- I don't think you need any refiner for the newest xl models
- 1.5 loras don't work on XL
- I don't think so
- Newest trained anime model use booru tags fortunately
- I use 832x1216 and it already looks great imo, no need for hires
- Most models already have the VAE baked in so no need to download
It's got a ton of artists trained. Want a different style? Use a different artist tag. I haven't tested them all yet, but there are probably more than 400 that work well.
yea its here,still looks wobbly
That's hardly any of the trained artists. I've used ones with as few as 500 instances with good results. Haven't tried lower than that yet. https://huggingface.co/KBlueLeaf/Kohaku-XL-Delta/blob/main/docs/artists-kxl-delta.json
well if u can gen an img that looks clean and has good lineart post it here to test some prompts with it
I have the day off i'm going back to bed
Ty! Is there a way for me to separate 1.5 loras from XL? like different folders idk
Also, is using 1024x1536 bad for XL?
I don't think you can separate the lora folders in webui. And yeah, that reso is completely fine
yep, inside lora folder, you can place the lora in subfolder
Prompt, please.
anime, cartoon style, Nurse with pink hair, japanese nurse uniform, hands on hips, smiling, detailed character sheet


Just to add on, if you have a SDXL based model selected, then in auto1111 webui Lora tab, you will only see those Lora that are for sdxl based. And also, I saw you asking about refiner, basically refiner is a concept of changing model mid generation. You can even try doing that with sd1.5 model. Example, use one model to generate good human anatomy, then use second model as refiner to change style to that model. The refiner is based on ratio, so if it’s 0.5, means at half the steps you set, it will use the refiner model you set.
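To make the ratio part concrete, here is a small sketch of the arithmetic. The function name is hypothetical and rounding down is an assumption; the webui may round differently at the boundary.

```python
def split_steps(total_steps: int, switch_at: float) -> tuple[int, int]:
    """Split a step count between the base model and the refiner.

    `switch_at` is the refiner ratio described above: 0.5 means the
    refiner takes over halfway through. Rounding down is an assumption
    for illustration, not the webui's confirmed behaviour.
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    if not 0.0 <= switch_at <= 1.0:
        raise ValueError("switch_at must be between 0 and 1")
    base_steps = int(total_steps * switch_at)
    return base_steps, total_steps - base_steps


base, refiner = split_steps(30, 0.5)
print(f"base model: {base} steps, refiner model: {refiner} steps")
```

With 30 steps and a ratio of 0.5, the first model handles 15 steps and the refiner handles the remaining 15.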
Refiner is very cool
Can make good looking... beach landscapes
👍



Only if loras have proper metadata 
what is the best webui to use with SDXL anime? I started using xl today and im using auto1111
cool! is there any config to change from 1.5 to xl? like clip skip
Yup true true
As far as I know, clipskip doesn’t matter for sdxl
To change to using sdxl model from 1.5, it’s probably as easy as changing the model and vae
tysm, i thought i needed to change a lot, is there a way to speed up the gens a bit? im getting like 3 mins per img (i have a 3060 12GB)
On 1.5? Or sdxl?
You probably need to use xformers
xl
im using it
im genning at 832x1216 no hires, i just copied some random image in civitai to test.
Forge or Comfyui/StableSwarm
Forge? never heard of that, is it too different from auto1111?
It's just more optimized a1111
Basically a1111 but faster
nice, does it have extensions like adetailer?
You can add it like a1111
It uses a1111 extensions although not all work but adetailer works
Should not be 3 minutes for that then
nice! can i transfer all my extensions and stuff to it? (sorry for making too much questions lol)
Maybe close to 1.30
yeah now i got 1 min 20 secs, i was running some apps in background
You could probably just copy paste your extension folder but haven't tested myself
I don’t think that will work, not sure tho
There’s quite a few built in extension like controlnet and such
im going to try it out, tysm!
also, do you guys have any recommendation for models, loras, or negatives i could start with? idk if my old 1.5 negatives will work neither do i know how to prompt in sdxl lol
I think webui just detects each folder inside extension folder and loads it if it has initialization script
Also neko Yue requires pats
hopefully so, but if you wanna try that, need to remove stuff like controlnet and such
maybe not since it disabled extension if it has integrated version
At least multi diffusion can't be enabled
currently, ponyDiffusion is a pretty popular SDXL based model, despite the name, it's pretty good at anime
like 1.5 model, every SDXL based model might need different kinds of negative prompt
the model im using is autismmixSDXL_autismmixConfetti, which doesnt really require much negative. The one im using is (worst quality, low quality:1.2), 3d, censored, abdominal muscle, artistic glow
i saw one called AutismMix and it looks great, im going to try it.
ohh, then might be worth trying it

oh nice, tysm!
can confirm, it's my normal model now, even when i only have 8gb vram
how long do your gens take? also what webui are you using
but since they are based on PonyDiffusion, you need to follow a specific set of tags, as it was trained on them
forge. for 768 x 1024, hires fix 1.5, 24 steps normal, 10 steps hires, about 50 seconds
damn thats fast
what tags?
They made a mistake in training, and instead of just score_9, it's trained on score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up instead. So use the latter
and there's rating_safe, rating_questionable and rating_explicit
also source_anime
oh so like quality tags
last question (i hope so) Is the prompting too different? should i still use like
1girl, blonde hair, long hair, blue eyes ?
yup, for most SDXL model, it's still really danbooru tag
and no worries about asking question
guys, i'm getting into lora training, does anyone know a good guide?
i saw this in the forge github page, is it too difficult to do this? I wanted to keep everything (sorry for the ping)
I didn't use this method, so not sure. If you install forge normally, you can opt to use models/lora/embedding and others from automatic1111 webui
by changing the webui-user.bat, which is what most of us did
thats what i want, could you give a brief explanation on how to?
once you got forge downloaded, and ran it once. You can open up forge webui-user.bat and edit it, mine look something like this
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=
@REM Uncomment following code to reference an existing A1111 checkout.
set A1111_HOME="D:\Downloads\A1111 Web UI NEW\stable-diffusion-webui"
@REM
@REM set VENV_DIR=%A1111_HOME%/venv
set COMMANDLINE_ARGS=%COMMANDLINE_ARGS% ^
--ckpt-dir C:/Users/User/OneDrive/Desktop/Stable_Diffusion_Models ^
--hypernetwork-dir %A1111_HOME%/models/hypernetworks ^
--embeddings-dir %A1111_HOME%/embeddings ^
--lora-dir %A1111_HOME%/models/Lora^
--vae-dir %A1111_HOME%/models/VAE^
--realesrgan-models-path %A1111_HOME%/models/RealESRGAN^
--esrgan-models-path %A1111_HOME%/models/ESRGAN^
--no-gradio-queue^
--cuda-stream^
--pin-shared-memory^
--api
call webui.bat
obviously use your own path to your own ckpt-dir from auto1111 and such
you might not need no-gradio-queue and below, but that's my own stuff
so basically you do --ckpt-dir "path to the model folder"
--hypernetwork-dir "path to the hypernetwork folder"
and so on?
yup
if you open up your forge webui-user.bat there should already have a format for you
but there's some stuff lacking, like if im not wrong, vae and below is added by myself
ok, tysm again!

im having a problem, when i run the "run" file, it gives me the problem of unrecognized argument
show me your webui-user.bat
`@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=
@REM Uncomment following code to reference an existing A1111 checkout.
set A1111_HOME= F:\Stable Diffusion\stable-diffusion-webui
@REM
@REM set VENV_DIR=%A1111_HOME%/venv
set COMMANDLINE_ARGS=%COMMANDLINE_ARGS% ^
--ckpt-dir %A1111_HOME%/models/Stable-diffusion ^
--hypernetwork-dir %A1111_HOME%/models/hypernetworks ^
--embeddings-dir %A1111_HOME%/embeddings ^
--lora-dir %A1111_HOME%/models/Lora^
--vae-dir %A1111_HOME%/models/VAE^
--realesrgan-models-path %A1111_HOME%/models/RealESRGAN^
--esrgan-models-path %A1111_HOME%/models/ESRGAN^
--no-half-vae ^
--no-gradio-queue ^
--api
call webui.bat
`
try without --no-half-vae
oh
and i think
since there's a space in your A1111 home directory path, you need to enclose it in " "
set A1111_HOME= "F:\Stable Diffusion\stable-diffusion-webui"
i tried and got this error:
did you try putting it in " "
yes
another thing to check, see if you have those folders in A1111
yeah i do

send your current latest one again
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=@REM Uncomment following code to reference an existing A1111 checkout.
set A1111_HOME= "F:\Stable Diffusion\stable-diffusion-webui"
@REM
@REM set VENV_DIR=%A1111_HOME%/venv
set COMMANDLINE_ARGS=%COMMANDLINE_ARGS% ^
--ckpt-dir %A1111_HOME%/models/Stable-diffusion ^
--hypernetwork-dir %A1111_HOME%/models/hypernetworks ^
--embeddings-dir %A1111_HOME%/embeddings ^
--lora-dir %A1111_HOME%/models/Lora^
--vae-dir %A1111_HOME%/models/VAE^
--realesrgan-models-path %A1111_HOME%/models/RealESRGAN^
--esrgan-models-path %A1111_HOME%/models/ESRGAN^
--no-gradio-queue^
--api
call webui.bat
I don't see what's wrong 
me neither
what's the error, can you copy and paste here?
oh no, did you play around with the code and now the code keeps playing you?
If I code a cute goth girl
Will she play with me?
That sounds like a good plan

ok, i will be sending in private just so it does not clutter the chat
just running a boatload of prompts against different models, tryin' things out

So SD3? 
after we get proper controlnet tile and shuffle in XL 👍
wtf is controlnet tile and shuffle 

We need less math and graphs
And more cow girls
i like the cowgirl 

Wolverine Miku Mouse
prompt: anime-cartoon Style, black outlined, vivid colors, soft shadows , far wide view, Anthromorphic Blu hair Girl-Mouse with Claws of Wolverine with body covered in silver cybertech chainmail, Miku Hatsune Anime themed background , extra detailed, ultra sharp, ultra science-fiction
if you haven't figured it out, you need a space between the path and the ^ character
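A quick way to spot that problem in a webui-user.bat, sketched in Python. As far as I understand it, a trailing `^` makes cmd continue onto the next line, so `Lora^` followed by `--vae-dir` can end up glued together as one argument. The function name is hypothetical and this only checks for the missing space, nothing else.

```python
def carets_missing_space(bat_text: str) -> list[int]:
    """Return 1-based line numbers where a trailing ^ touches the text before it.

    Only looks at lines ending in ^ (ignoring trailing whitespace) and
    flags those with no space in front of the caret. This is a rough
    lint for illustration, not a full batch-file parser.
    """
    flagged = []
    for number, line in enumerate(bat_text.splitlines(), start=1):
        stripped = line.rstrip()
        if stripped.endswith("^") and len(stripped) > 1 and not stripped[-2].isspace():
            flagged.append(number)
    return flagged


sample = "\n".join([
    "set COMMANDLINE_ARGS=%COMMANDLINE_ARGS% ^",
    "--embeddings-dir %A1111_HOME%/embeddings ^",
    "--lora-dir %A1111_HOME%/models/Lora^",
    "--vae-dir %A1111_HOME%/models/VAE^",
    "--api",
])
print(carets_missing_space(sample))  # [3, 4]
```

Lines 3 and 4 of the sample are flagged because the caret sits directly against the path; the fix is to write `Lora ^` and `VAE ^`.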
rare comfy post
Comfy spreading greatness of sd3
I figured it out, but ty anyways!
Pancakes for everyone
waffles
Pfannkuchen 
I just told SD I wanted goth dragon girl
I'm sure there is a pose like that
eh.. come on I need that pose... sadge
hmm it's fuzzy
there we go, a passenger on the Star Rail train
used a TurboXL model
turbo is a little washed out?
weird, when i used it on pony i didn't get too much of a difference
could be the custom model
booba

A mix mash of 3 different models in different ratios
im testing a few of em
AnimagineXl3 As the Second one with a 0.7 ratio towards it
the first and third ones can be anything
with the basic Animagine Lora
(Anime style:3) is mostly how i get my style
Honestly, needs it, theses ai's aint no chatgpt, they don't know their own weight
@lyric zealot you can fix her
Not sure if this was related to the message above it or not, but if so, I increased contrast and lowered brightness on the sdxl vae specifically for Pony and AutismMix to help with them being a bit washed out. https://huggingface.co/nubby/blessed-sdxl-vae-fp16-fix/blob/main/sdxl_vae-fp16fix-blessed.safetensors
sighs, i'm getting bored with ai art
and games
just finished a 16 book series
so .. bored
I will fix her

looks like a modded skyrim character
well that got random
ok that one style is jus random
in a good way
skeleton
kinda like bleach but nah
do i still need to put lycoris in a different folder than lora?
nope
at least for me
these are all with 4 steps and cfg 1
with sd3 might not have to bother about steps and cfg
depends.
dpm2m karras if i want the style to pop more, ddpm for more bodily consistency, euler SGMuniform if using lightning lora
so dpm 2m karras for general use?
yeah
ok tysm, so after all it doesn't differ a lot from 1.5
but sde, 2sa karras, 2m, 3m sde, all have a small different outcome
also most of the lightning models out there suggest using dpm++ sde
and depending on cfg/other things, it might not work as well
if you have forge -> try the SGM uniform samplers
yes i heard about it but i dont use forge
i updated the a1111 some mins ago, but i dont see sgm uniform added yet
because it's not in default a1111
finally xD
yeah finally lol
u can just install a script to enable the use of different schedulers like uniform on any sampler
interesting, is there anything in particular that's special with uniform samplers?
so far im getting good results with dpm++ sde or dpm++ sde karras
and those are with lightning models
i dont know i dont use lightning🗿
those are along the line of lcm turbo
low steps and cfg
i had little quality loss with lcm which is not so much with lightning
im still struggling to see whats the best resolution to use with pony
ive tried 768x1024 1.5 hires but idk something feels wrong
ive tried 1024x1536 no hires and things get stretched lol
try 896x1152
with hires?
you could also try the generic 768x1024
being sdxl, 1024x1024 is the 'best' resolution, different resolutions are going to be deviations from what the model is native to, that said 768x1024 works fairly well though
you can try hi-res fix at 1.5 it fixes some of the wobbly lines but not all of it
i tried, but some details felt like out of place and some artifacts came with it
im still figuring things out tho, i only started using sdxl yesterday, so if any of you have tips for settings and parameters
just so you know, you can even use 512x768 then hires fix by 2x
XL would parse it fine
depends on the model, for pony based models cfg 7, 30 steps, 1024x1024 using euler a works fine for me
also depends on the style and look you want
for example this is the output i get with 512x768 and hires
if u try a bunch of different resolutions,different samplers and different steps and a minimal prompt and still comes out wobbly or deformed then its the model
unless it's pony, then it's your prompt 😄
it was probably the resolution, looks fine to me now
ty! for a do you use hires?
yeah, i reutilized an old prompt from 1.5 for testing and it came out... kinda weird
i still need to learn the prompting of pony
when i use hires fix it's nothing special, 4x-Ultrasharp upscaler with 7 steps and 0.25 denoise
btw the reason i chose to use 896x1152 it kinda happens to give more content space
how?
that's in comparison to using 512x768
technically i could use fewer hire steps since i'm not using it to 'clean up' the image as much as just a clean upscale, but on my gpu the time difference is trivial
so basically no upscale just to clean up?
other way around, i 2x upscale, low denoise so it doesn't change much
evenin'
oh so you lower the resolution a lot
no..
its almos 1 AM for me lol
i gen at 1024x1024 and then upscale 2x
super evenin'?
oh ok, that makes sense
yeahhh
i have a 4090, so i gen at the full resolution, otherwise there's no real point (for me) to using sdxl
damn how fast is that?
15.9s with 7 steps of 2x hires
my 3060 gens like 55 seconds for each
prob too many steps, or you're using a1111; on forge it's faster, like 30 secs
yeah, probably because i normally have some stuff in background
im using forge with 25 steps
well lightning models are super fast which you might consider
what are those?
im using rtx 3060 and it takes me about 2ish seconds to gen 896x1152
trade quality for speed model
quality arent really bad
no its just less
yea money
the recent lightning models from community are pretty amazing as far as quality goes
lol
but any like extension or anything
just tensor RT but u need to compile some files for each model and also only works with a single lora
In early, unoptimized inference tests on consumer hardware our largest SD3 model with 8B parameters fits into the 24GB VRAM of a RTX 4090 and takes 34 seconds to generate an image of resolution 1024x1024 when using 50 sampling steps. Additionally, there will be multiple variations of Stable Diffusion 3 during the initial release, ranging from 800m to 8B parameter models to further eliminate hardware barriers.
only way to speed things up is to sacrifice something, speed, money, quality, or resolution
it works with multiple loras now, and you don't have to build a huge engine for each
but some loras won't convert properly for some reason, it's pretty buggy
im curious about the other variations that would supposedly eliminate the hardware barriers 🙂
yeah i dont really need too much speed up, as i usually use SD in background when im doing something else
not sure if @native halo will understand this one...
yea praying nvidia adds that to their drivers instead 🗿
hopefully details soon 
im just a cook
it'll be similar to SD cascade
anyways, im going to sleep now, ty so much for your help guys
just different low memory versions that have less quality/coherence
nice
is tensorRT on forge or still bugged on it?

haven't checked in a few days, i doubt they've fixed it
yup it's definitely not fixed
i opened the issue on the github for it, so i'd imagine i'll be notified if it ever gets closed, or even replied to
a few replies, but not from the devs so i don't expect much

from what i can tell, its usage on a1111 is pretty small since the installer doesn't even work properly
most people give up trying to get it working
Hey @vital raptor , are you available to help me with AI Grillz lora model training?
i like grills too
yeah so I am gonna train the lora model to convert selfie to grillz image
🤨
fat grills
Are you interested?
in grills? I like bbq, but not really..
If you can, I am willing to pay for that
(paul wall:2.5)
looks like the broken sdxl math was not fixed in 1.8 like it was supposed to be
so that's cool...
sorry, didnt realize i didn't actually answer you 😄
I might be interested if you have a dataset already, feel free to DM me what you're wanting trained with some examples. If you've got money to put towards it though you might look into training on civitai or something, it's not generally too expensive to train on better hardware than mine
I'm heading to bed now though, so I'll have to get back to you sometime between sleep and work
you want resolutions that equal roughly the same pixel count as 1024x1024 because that's the native resolution which all of the buckets were based on. here's a cheat sheet.
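Since the cheat sheet image isn't visible here, a rough sketch of how you could list sizes with about the same pixel count as 1024x1024. The 64-pixel step, the 7% tolerance, the side limits, and the function name are all assumptions for illustration, not the actual bucket list used in training.

```python
TARGET_PIXELS = 1024 * 1024  # pixel count of the native 1024x1024 resolution
STEP = 64                    # assumed granularity for width and height


def near_native_resolutions(target=TARGET_PIXELS, step=STEP, tolerance=0.07,
                            min_side=512, max_side=2048):
    """Return (width, height) pairs whose area is within `tolerance` of `target`."""
    results = []
    for width in range(min_side, max_side + 1, step):
        # Pick the height closest to target / width, snapped to a multiple of step.
        height = round(target / width / step) * step
        if height < min_side or height > max_side:
            continue
        # Keep the pair only if its pixel count is close enough to the target.
        if abs(width * height - target) / target <= tolerance:
            results.append((width, height))
    return results


if __name__ == "__main__":
    for w, h in near_native_resolutions():
        print(f"{w}x{h}  ({w * h / TARGET_PIXELS:.0%} of 1024x1024)")
```

Widening or narrowing `tolerance` changes how many aspect ratios show up; the point is just that width times height stays near one megapixel.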
candy house
even on pony, eface will find a way to do lyrics prompting 😄
pony sucks at lyrics prompting
yoo tysm, that helps a lot
what style is this?
Anyone else switched or switching to forge? 
I have been using forge for a while since it gave me ~40% speed increase with XL on my 1070 8GB
(by mdf an:0.8), kashu \(hizake\),
1girl, small breasts, long hair, straight hair, white hair, blunt bangs, azure eyes, pointy ears, long eyelashes, choker, blush, witch hat, black [dress|robe|dress], mage, (smug:1.1), smirk, fang out, exaggerated expression, outdoors, nature, standing, magic circle, holding staff,
score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, source_anime, sketch, graphite \(medium\), rating_safe <lora:sdxl_edge_enhancer:1>
Negative prompt: long hair, sweat, 3d, slit pupils, source_cartoon, source_furry, source_pony, painting, (worst quality, low quality:1.2), lowres, jpeg artifacts, scan artifacts, (abstract:0.9), chromatic aberration, unfinished, blurry, monochrome, censored
My speed improvements on a 4090 are not huge but still there. And it loads up way faster
Well start up is faster since it loads model after you click generate first time
Maybe its also cause my A1111 install is over a year old and I never started fresh 
If it works don't fix it
I installed torch 2.2.0 on A1111 a while ago and that was already a pain in the butt
So yeah I dont live up to that

seems FB is currently having major issues
cool it should spread to twitter/youtube/tiktok
shrugs
according to down detector seems to have
seems to be mainly meta services
ahh ok seems there was a leak of a database that exposed a lot of data
facepalms
well someone's having fun and it aint u
us
Almost looks as if you don't like them 
gives me ideas 😄 evil grin
facebook haha
"my facebook has no face"
Miku Book
facebook right now
hehe baby miku face book
what model/style is this?
why no meta data 
from what I know orax inpaints images and stuff to make them cooler
so no metadata 
yes
it's autism mix and the granblue fantasy lora
@vital raptor cleanest prompt ever
You might like it
Mmm... Maybe don't send prompts that are intended to generate cheese pizza.
it was a poorly written prompt that got deleted because of the type of content that it was intended to generate.
ah, i see
hope everyone is having a nice day
I'm home sick with a fever and it's been snowing for ~16 hours, but otherwise 👍
pretty good so far. haven't been up for very long yet.
bunnygirls are ok
i came here to bump @wispy canopy ~
😮
what did i, or did i not do ? 😮
how are things friend ?
I'm alive 😮
well, semi
have been on a sickness streak as of lately
doing some testing on pony models
pats received
we got sd forge, and the best nsfw sdxl in the form of pony 6
and all its spinoffs
nah, they just announced the papers, i think
i mean, for a hentai model, this IS pretty impressive
realtime high quality yes
playing a lot of helldivers 2 lately tho, amazing game
i mean real-time as-in 24FPS 😛
yeah, and then have that hooked up to a game, which will feed regional prompting to that 😄
yes plz
love the style by mdf an
@gloomy scroll if you have prompt fusion installed, you could try by [ mdf an:kashu \(hizake\):0,0.4]
you'd have to juggle the 0.4 a bit, but in that case it interpolates each step instead of sudden switch over, might even work better
doesnt a1111 do that natively?
no, it's a modification of the behaviour
the OG a1111 spec just switches over abruptly at the specified point
with prompt fusion (the difference is you need to specify 2 numbers), it will interpolate from A -> B
so step 1 = 100% mdf an
step 2 = 75% mdf an/25% kashu
step 3 = 50/50
uhh... percentage
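A rough sketch of the percentages above, assuming a plain linear blend between the two numbers in the bracket (start and end, as fractions of the total steps). This is not the prompt fusion extension's actual code; the function name, the 0-based step index, and the linear curve are assumptions for illustration.

```python
def blend_weights(step, total_steps, start, end):
    """Return (weight_a, weight_b) for a 0-based step index.

    Before `start` the result is 100% A, after `end` it is 100% B,
    and in between the weight moves linearly from A to B.
    """
    position = step / total_steps  # how far into sampling we are, 0.0 to <1.0
    if position <= start:
        weight_b = 0.0
    elif position >= end:
        weight_b = 1.0
    else:
        weight_b = (position - start) / (end - start)
    return 1.0 - weight_b, weight_b


if __name__ == "__main__":
    # Hypothetical run: 10 steps, interpolating over the first 40% of them.
    for step in range(6):
        a, b = blend_weights(step, total_steps=10, start=0.0, end=0.4)
        print(f"step {step + 1}: {a:.0%} mdf an / {b:.0%} kashu")
```

The stock abrupt switch would be the same thing with `weight_b` jumping straight from 0 to 1 at a single point.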
ah, when I read interpolation I thought of step 1 is A, step 2 is B, step 3 is A..etc
yeah
also, there's a modified version for that one too
well, i just like playing with the artist names and getting some cool stuff out that way
yeah, i was thinking of making a text file with a ton of artists and using wildcards to randomly mix them to see what interesting things i get
gotcha fam
all currently discovered WORKING artists for pony i have
some might be false positives tho
especially the japanese names
another very strong option with that wildcard file is [__knownponyartists__|__knownponyartists__] for blending artist styles
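For anyone unsure what the `__name__` part does: a minimal sketch of wildcard expansion, assuming each `__name__` token gets swapped for a random entry from a list with that name. The real wildcard extension reads the entries from text files; the in-memory dict, the function name, and the two example artists here are just for illustration.

```python
import random
import re


def expand_wildcards(prompt, wildcards, rng=None):
    """Replace each __name__ token with a random entry from wildcards[name]."""
    rng = rng or random.Random()

    def pick(match):
        options = wildcards.get(match.group(1))
        if not options:
            return match.group(0)  # unknown wildcard: leave the token untouched
        return rng.choice(options)  # each token gets its own independent draw

    return re.sub(r"__([A-Za-z0-9_-]+?)__", pick, prompt)


if __name__ == "__main__":
    # Hypothetical two-entry list standing in for the full artist file.
    lists = {"knownponyartists": ["mdf an", r"kashu \(hizake\)"]}
    print(expand_wildcards("[__knownponyartists__|__knownponyartists__]", lists))
```

Because each token is drawn separately, the two slots in the bracket can land on different artists, which is what makes the blend interesting.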
looks like one of the artists on the left has the same sweat problem one of the ones i use has 😄
i think it's kashu
throw some random japanese characters in your prompt to see if the japanese names are working or not
big brain
mdf an causes a lovely sketch style, but has kinda diluted contrast
that list took me quite a while (gpu hours and comparing images) to make
and still have 17k images to check :/
MADNESS
well -> time to go to bed
zzzzz
gremlin stealing the cookie jar
That was just something I stole from Nubby
thats what cookies are for
@gloomy scroll

Dragon doesn't use Nai

so thats Nai
This is
Something wrong is not right here
catdog
hotdog monster
do [cat:dog]
generates cat only

it does each at a different step
a monster that's half cat and half dog
in my case, more for real-time audio-reactive visual generation
SDXL will bring us there, someone been working on that project




