#🍥|anime
1 messages · Page 162 of 1
Yeah you can use it after, but you don't need it
this works well
i spent the last 3 months studying highres fix on comfy
only 105s
let's see how this node influence the work
Guys do you even know what the purpose of hires fix is/was?
arron, do you knnow the difference between gradient and not gradient node?
adding details to upscaled images
The gradient is something new he did last night testing stuff, he said it only really works on SD 1.5
would work better in my hands
No

It's to work around the fact that SD goes a bit mad and starts making duplicates of things at higher resolutions than it's designed for.
Yes
oh
It fixes high resolution images
I posted an example a bit ago of what this "fix" does.
Images are mostly trained on 512x512
If you make it create a 1024x1024 image for example, it duplicates stuff and does bad stuff
yep
And then do a 2nd pass over it as img2img
so kohya must be scaling the model or something (judging by parameter names)
He's scaling the unet I believe
what's the unet?
He's explained it on his twitter, but it's in Japanese and I'm not sure how well Twitters Translation works.
In comfyui terms. Create 512x512 image -> upscale esrgan -> downscale to what you want (1024x1024) -> img2img -> output
What does exponential mean in the samplers 
This is hires fix
It's all maths and latents
the default scale was 2, so I figured if you want 2x larger than 1024x1920, you have to use 4x
I've been trying to understand unets for half a year
I think tiled vae screws it though
Only got like halfway there
i know that feeling -.-'
Nubby please bless us
With your nubby skills and knowldege
Ok I go slep
GN everyone
nighty night
it's the noise scheduler. haven't looked into the exact math, but i suppose it probably uses exponential scaling on the added noise.
@stable crane 
the model is made up of 3 basic components.
- Text Encoder (sometimes referred to as CLIP) turns your prompt into vector embeddings. Basically numbers that can be interpreted by the UNET
- UNET takes the vector embeddings created by CLIP and does lots of funny math to find the output that best matches them (this is the image generation process). what it spits out are still just numbers though, these numbers are called latents and they each represent an 8x8 block of pixels
- VAE translates the latents into pixels so that you can have an image. in img2img it is also used in reverse to translate the pixels into latents that the UNET will be able to work with.
@native halo what is the theme for today
don't worry, nobody really understands them. it's largely a magic box that works, but we don't know why. some people understand them much better than you or I do though.
magic box
ohhh
girls with crow head hat
I've gott try this on realistic now too
the TE is the tag of the image's data and the unet is the library of data images
I just got another load of thick witches to gen, will try some crow

hopefully not alive on their heads
unet uses the TE to call the right image's data
he crossed out thick 😢 😢
not really no. there is no library of images. and unet is what actually learns the text/image pairing. TE is kind of like VAE but for text.
(Its so admins don't bonk me)

surely works
a model, is not a library of images converted in code, linked by tags?
an image is tagget with tags like green eyes
ecc
so good
it's not storing the images like a database
the model learns the relationship between the tags (more accurately the tokens that represent those tags) and the images. it should not be capable of ever recreating the images that it was trained on. this can happen sometimes, but it is not a desired behavior. that is what we call overfitting.
@stable crane 
nice, getting thicker
where the crows at
so, analyze the images, looks for relations between tags and forms and store them?

https://github.com/yl4579/StyleTTS2 this looks fun
Think of it sort of like teaching a child their colors. You show them many different objects in different colors and eventually they will figure out that red means the color of the apple, not the shape and will be able to tell you that the firetruck is red even if you hadn't previously showed them a picture of a firetruck.
and this bing net of relations and tokes is the unet, i suppose
mind numbing, but very cool
lol
sometimes I really feel like I'm using magic
this experimental node seems to crash comfy a few bit
here's the a1111 extension for it if you want to try it there https://github.com/wcde/sd-webui-kohya-hiresfix.git
ty, I might
well, probably don't dare
due to VRAM
I wish it were easier to re-use seed in comfy
or mark a node as dirty
what do you mean?
I think if you generate once with "fixed" seed, interrupt it, then change that to "randomize", it'll try to start your gen from the saved state
there's a layer of memoization built in it seems
very cool but can be annoying
in this case, it's trying to do the highres fix step again (when it should really be doing the base gen w/ new seed)
oh, I get what you mean
your crows @native halo
we need a b.a.c ( big ass crow)
😮
what is it called?
Whoa, that's awesome.
I don't see how it would fit into KSampler node
rgthree's ComfyUi Nodes
great work
ty 
Crow lmao
pure meme material. but the question is, internet needs more meme?
this node give sharp images, but weird mutations
using that node, the result is so different from the original, that looks like it ignoring some tokes, or loras
but the end result is much more polished
this gives me an idea
More crows
give her frizzy hair and she'll look like hermione
@native halo more crows
not enough

lol what is this
This is amazing.
creepy
Two ravens?
I may have sort of mixed a crow with a cow
@gentle musk A Crorow?
lot of crows sitting on heads
Am i not allowed here? 
looks like lcm don't works well, splitting the work on 2 samplers
and even when it works, the ending result laks in shaptness
https://github.com/light-and-ray/sd-webui-lcm-sampler for A1111, do you use this with the LCM lora?
Super cute
it works even with high-res fix but several details such as the crows and the pyramids dont appear on the img i guess thats the tradeoff less details but faster imgs
no pyramids or crows 😔
kohya deep shrink, totally changes the end result..... that node needs deep studies
Seed N at resolution A is not equivalent to seed N at resolution B. It might effect the output for more reasons than just that, but that alone is going to have a huge impact on the output.
yep
i think, it can be an important add, if studied well
looks like it can be used to add details to an image, with a relatively low time expenditure
result with lcm looks very dry
i wil look for another way to use them tomorrow
see ya
u can change the way it looks by either reducing the weight of the lora or increasing it
i swear i was lookin at her eyes
@native halo

Forgot to make her happy now trying again 
@dull jackal @wooden coral

freeu+sag is quite good 😮
what is sag
I think it got confused
needs a latex bodysuit imo 
she doesn't look amused
hmm no background.. odd
love this model
realcartoon
Anyone knows what this effect is called? (Not ai art, probably)
after-washing-machine-photo
negative photo ?

No, I mean like the texture thingy
Nice
goes to bed.. thud
interrogate turned it into this 😮
You should try adding au ra, avatar \(ff14\) I'm curious if it'll get any closer to the right horns or add the scales.
no cherry picking:
3x single shot 1024x2048 , 10 steps lcm, dpm++ 2sa, freeu, kohya highres, SAG: 1 minute 20 seconds render time
how do you get such good outputs with LCM? Mine looks kinda terrible mostly 
(damn, after a gazillion retries, this prompt/composition is waaay harder thani thought it'd be, not using controlnet)
freeu
and sag (self attention guidance)
and in this case, also kohya highres https://github.com/wcde/sd-webui-kohya-hiresfix
oh.
Guess breakdomain isnt compatible with all that
WHO BECKONED ME!?
which extensions did you use?
if you use LCM + freeu + kohya + SAG, you need to tweak the values a bit
I use the LCM Lora kohya SAG and FreeU as well
Maybe a problem with a lora I use or an embedding
Oh SAG throws me an error 
yeah, sag throws errors (but works)
@wooden coral if you use sag+kohya, you need to set the SAG guidance scale to 0.2
and that nets me this (single pass 1535x2048 10 steps)
Ill try that, thanks 
Also youre using SDXL?
I mean I get images now, so there is that
this looks kinda off, like just something is wrong, no idea why tho
the body too long?
or something

Belly 
2 meter tall girls also matter 
No, not really. The horns should be in place of the ears and she should have patterns of scales on her neck and around the outside of her face.
thought so. model simply doesn't know
have a dude
Yeah it doesn't surprise me. I was using a LoRA to get it to work.
the distance between shoulders and chest is too long
That sometimes happens. When you change the aspect ratio, sometimes proportions are also adjusted. My guess is that some of the training images were resized incorrectly and stretched/squished. Sort of like how this Samus is too wide.
wow, samus hit the Gim!
your model probably doesnt know samus well
almost perfect, if not for the shadowy face
i like those big trees
and the shadows, sublime!
and check out all those tiny gaps, you can even see the sky!
Can you gen in any anime art style?
For instance the art style of a few visual novels?
yes
Artist name is Fue.
wow
yea similar but not exactly the same cuz theres no lora for his style
yea this is his most recent vn
warm
Got it.
@wraith raptor those are all wonderful 
another pic s
What about Miura's artstyle?
the creator of Berserk?
Yes
yea that one is easier cause theres a lot of loras of his style
there's a really good manga artist by that name
hm?
like this for example
very good. Okay. Here's a request.
white hair, long hair, white skin, red eyes, black kimono, black sandals, black gloves, full body, blank background, fue artstyle
ok 😄
hmm didn't work out all ok.. umm
didnt gave me gloves or long hair 😔
If that's not long hair I would hate to see what you would call long
heres another bad one
I'm still not really sold on LCM
LCM?
sorry random statement, a tech
this one came out better but still needs upscaling to get more details
cx srv phn stuffs
read that as block pants and block sleeves and my brain went wait.. minecraft?
doh!
my brain being random again
.. ok.. .STILL
black pants 😔
sorta unreal anime ?
Watmer
ok using wrong model let me try an anime one 😄
last ones 😔
The bird is protecting the human.
They are quite handy.
Anime screencap model with some adult fine tuning. 
Also, thank you 
charming 
oh, that anime lol

something like, my hubby is a yakuza XD
yeah, that story was a killing, i love it🤣
in that scene where he was sharing his spices, the police was sure he was selling d*ugs 🤣
🤣
Adorable. 😮
if i have a 1024 pixel image and want to upscale it twice and end up at 2048
i could upscale once at 2x,
2 runs at 1.5x wont do it though would it
1.45x almost does it i think i just have to settle for some thing else
love that one
It was a cursed prompt trying to make a rat woman XD
If you're baseline is lower you can upscale higher :)
Depends on what aspect ratio you wanna go for ofc
Which? :)
Merging time 
the colors are fantastic O.O
i likem
i have to leave the house later today :<
if that was a game or anime id watch it
id love a honkai star rail or phantasy star online anime
in that style
damn, the song of demon slayer are seriously like a punch in the face
maybe i should gen something in that style
but first...
guys, do you know how to change the name of a reroute node in comfyui?
tries a merge :D
envydream nuElement and Azoyarpgtools
waits
watches the anime "Record of Grancrest War" while he waits
oooh ok, not sure if this is good.. no.. no.. i'm sure, it sucks... oi
oh... doh!
denoising str...sigh i'm a dummy.. or just tired
sigh
auto vae.. huh? sampler restart.. this isn't behaving like ther other models

Is this too nsfw or still fine?
“In other words, if I had friends, I’d have to start worrying about them, right? If my friends were hurt, I’d feel hurt too, and if they felt sad, I’d feel sad too. You end up with more weak points, so to speak. I think that’s the same as becoming weaker as a person.”
“…But you have fun when your friends have fun, and you’re happy when your friends are happy, so it’s not all about becoming weaker, is it? You might gain more weak points, but you’d gain advantages, too.”
“No,” I replied, shaking my head, “I’d feel envious when my friends were having fun, and jealous when they were happy.”
“…How petty of you,” she nailed me.
Leave me alone.
shrugs, dont' ask me
tho if you have to ask it prob isn't
Before even posting it here
Nah I once asked and they said it was alright
@dull jackal
This still ok? Been a while since I was active here :)
Can remove if not.
What got blocked?
can't say cl**V*ge
yeah first I've seen that
Well I'll hear it when sunny has time ^^
tho interesting style
yawns I should be in bed but meh dont' have to be in till later
back to nuElement
ah the spatial inconsistencies ><
i thought it was just the chair
didnt notice the tail
tries a new merge
oooh way better.
merged nuElement, AOM3, and hello25vintage
oh boy.. then again
witch mixing potion and stuff with his magical friend owl, in a room field with books and potions, lamp on his sidee while working, glowing runes are floating on the book, elaborate scene style, glitter, orange, realistic style, 8k, exposure blend, medium shot, bokeh, (hdr:1.4), high contrast, (cinematic, orange and white film), (muted colors, dim colors, soothing tones:1.3), low saturation, (hyperdetailed:1.2),
Might wanna tweak weights xD
replaces last part after boken with his own hi-res style
waits
witch mixing potion and stuff with his magical friend owl, in a room field with books and potions, lamp on his sidee while working, glowing runes are floating on the book, elaborate scene style, glitter, orange, realistic style, 8k, exposure blend, medium shot, bokeh, fine art, masterpiece, epic detail, elegant, cinematic lighting, sharp detailed, focused, vivid colors, multi-tonal shading, great shadows, hi-res, HDR, 3.5D, Unreal render
falls into bed
witch mixing potion and stuff with his magical friend owl, in a room field with books and potions, lamp on his sidee while working, glowing runes are floating on the book, elaborate scene style, glitter, orange, realistic style, 8k, exposure blend, medium shot, bokeh, sharp focus, elegant detail, HDR, detailed shadows, cinamatic lighting, unreal render, realistic
ok last one
hhe trapped in a jar
cast sleep on himself
does anyone have any idea which model or locon this is?
That was me ^^
!! GASP!

Where we runnin'?
Could you sfw check the image I posed a lil above the ping?
Been a while, not sure if it's in line with the channel policy :p
N e v e r! (Jk, I already did!) 👍 If there's an issue with something, we'll let you know!
Those are some beautiful eyes.
Was just for reference for future posts :B
The gold/orange eyes are really coming out well these days.
Gotta love the hair, too
crows too
crows are smarter than some humans
i saw a video of a crow
putting rocks into a bottle of water to raise the water level so it could drink
yea they know how to use tools
i dont even know how to use tools
i wouldnt be surprised if crows were on the internet
i wish i could get another one of those "animal body human head" gens with a crow
or knew how to do it consistently
yea i sometimes get them when i prompt for a girl hugging a dog
dalle is better at doing them
whatd u do to ur hand
i was talkin about girl in my gen 😔
ohw'
lol it reminded me of those warnings on like, heavy mac hinery
🚫hand warnings that show the hand being pulled into the geartrain
ive been learning controlnet more
i wanna do the whole set of the "<adjective> girlfriend" memes template
i tried with piper perry memes but it gets confused when theres too many ppl
try with ipadapter and softline
i wanna learn ipadapter
i downloaded a huge like 2gb model when i switched to it but i dont really know how to use it adequately
does this image suggest that you can extract pose data from an image?
idk i have never used that ui,im using the webui extension but yours looks different
oh thats a screenncap from the controlnet wiki
https://github.com/lllyasviel/ControlNet-v1-1-nightly
its in the openpose section
i was reading through it for tips and it gives examples and stuff for each CN model
then yes u can extract poses from images if u select openpose preprocessor
woah
they are best friends
i love kisses
this seems excessive, but ive been getting great results all day
That is one big bird. 😮
Can you gen Minato Hikaru from Full Metal Daemon Muramasa?
(I know I asked this before)
Who are you talking to?
you. Although the last time I asked this question, I addressed it to anyone interested in this channel.
I can try I guess, got a lora?
no
Remind me later, I can't refresh loras so I'd have to reboot sd :(
This is a beautiful image.
the clothes look real
close?
yeah thats what I used to try to make her without a lora 😄
I think this new merge I made did pretty good
hmmm, lcm sampler might be listening a lot better to the prompt
anyone did some major testing on that one yet?
ok was going for a mummy but got one punch man? what?
ah ha, turned steps down from 40 to 10 and cfg upto 9
lcm?
restart
reastart crazy good at low steps
@bitter latch lcm might be the new kid in town tho 😮
i mean, check out the 12 step seed 202 lcm image
that's a lot of good faces 😮
ddim 20 , cfg 12
ahh
yep getting that now with the lcm lora
hmm says Set CFG to ~1.5 and Steps to 3
really?
this i gotta see
not sure if that's true
hmm wonder how it'll work with my model
Mini Milk sis? 
hmm, that doesn't seem very convincing
unless you specifically prompted that
just testing that lora sampler. needed high quality science.
without lora
hmm then again it could be the polocromatic color
polycromatic.. oi can't spell
@bitter latch i feel the LCM sampler is a lot better at following your prompt, like ddim used to be (until other sampler just got better quality output)
even without the lora, that's pretty good 😮
21 seconds 😮
and i mean, check those hands 😮
even proper holding stuff?!
what is this black magic?!
on second thought better not
i'm sold. lcm lora with lcm sampler -> the new king in town
kohya deepshrink + freeu + lcm combo -> 1 pass highres cheatcode 😮
now do it in Jojo DIU style
now draw her paying taxes
and back
hmmm lcm gives it a soft effect with this model i made
cackles.. steps 100 cfg 30
gah
those are quite soft
umm
hmmm?
oooh
hmm LCM is just... shrugs
tho restart is good
withthe lora for weights.. hmm
lora loaded first
interesting
gm everyone
moo
seriously man, what the fuck did they do with LCM sampler, i only get good hands 
😮 magic... where is all the effort i have to put in to get even a decent image
she looks awfully happy with her science, what did she make?!
sad about right ct, i love left cat, i want a cat that looks at me like that ❤️
dunno if anime or not, but this feels like some anime shows i've seen
This reminds me of that Bruce Lee quote about becoming water, she kinda became the cat in that one
a way to skip lab trials
I just did merc and it had a golden door
ah, the most lovely activy every league start
thighs
she just has to show of her goods
u sure it won't hurt?
the hands. omg
well the tails accurate
@wispy canopy
switched to lcm
the experiment failed
Miku leopard ?
out of my reach lol
and the other one is darkalfa
i can see from the style
@steady grail using the lcm sampler and lcm lora is a MASSIVE increase in quality and speed tho
the gens are faster, but the results looks rough, to me
combine with kohya deepshrink and freeu
maybe in sdxl is different?
even on sd1.5
yeah, i'm testyng a workflow with freeu now
lol, trying on base sd1.4 XD
later maybe i try to start just with a ksampler, pass the result on a fu and deepshrink and than end the last steps with just another simple ksampler. i want to see if i can improve the quality, reducing the time gen
😮 default base sd1.4 😮
for now, i just test a basic working workflow
holy shit, the comparison with euler a, when sd1.4 was just out
eheheh
oh right, cfg is too low
needs to be 7
oof... even at cfg 7, that's rough
1.4 anime, in it's full glory!
sd improved so much
switching over to lcm, already huge gains
switching to darkalfa 😮
adding in kohya highres
warm 😌
@steady grail kohya deepshrink. single pass sd1.5 at 2048x2048
i can see the improvement
Negative prompt: bad, wrong, boring, badly cropped
Steps: 20, Sampler: LCM, CFG scale: 2, Seed: 3765791862, Size: 2048x2048, Model hash: d9538e0883, Model: darkalfav11 fp16 pruned-correctedvae, VAE hash: d31a5bb481, VAE: kl-f8-anime2.ckpt, FreeU Stages: "[{\"backbone_factor\": 1.2, \"skip_factor\": 0.9}, {\"backbone_factor\": 1.4, \"skip_factor\": 0.2}]", FreeU Schedule: "0.0, 1.0, 0.0", FreeU Version: 2, Lora hashes: "LCM_LoRA_Weights_SD15: aaebf6360f7d", SGM noise multiplier: True, Version: v1.6.0-356-gfc83af44```
ngl krita is crazy
boto, setup LCM asap
Yep you can live generate or normal generate with the plugin
Its sick
Also all controlnets
Yea while drawing
It connects with your comfy install
Getting an iPad pro later today, then I can draw and dont have to use my mouse 
just doodle with the finger and SD will sort it out? ^^
I mean thats what im doing rn
Also you can make presets and with loras, models, vaes etc
same prompt with sdxl -> 3x faster lol
sdxl faster than sd1.5, you heard it here first
Made a simple anime preset for example
boto, lcm sampler better, full stop
not just better at low steps, also at higher steps
lcm isnt good for details imo
Also SDXL doesnt seem to work properly yet in krita idk what im doing wrong
not anime 
this is all sdxl, brightprotonuke 1.2
I also need CFG steps etc if you can 
lcm weigth 0.8,
lcm sampler,
kohya deepshrink,
freeu
cfg 2
12-20 steps
science
the setting for lcm is not with steps between 3 and 6?
yes it is. but it doesn't mean you can also do more
Depends really I think





