#๐โai-images
1 messages ยท Page 16 of 1
@keen bramble No LoRA required!
how the fuck
controlnet?
even the fucking gadget
or that is not ai art
im confused
eyes too smooth
boots perfect almost
Nah it's ai art, It's built into the model.
AnimagineXL v3
f
well time to finally start using SDXL
now that i got a 4070
Yeah that should work, what UI are you currently using?
Other than the third leg, looks fine
Ayo? @tawny peak level 21 !!! 
invisible arm
thats cute what made that ?
bing/dalle3
copilot ?
ty
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
i mostly use comfyUI
do you got any stuff for SDXL
its been months since i don't use SD
not sure how much different it works
is it just the model or does it have extra stuff?
Not much more than 1.5, AnimagineXL works with the sdxl vae and works with 1.5 workflows already, you just need to change the model
oh good then
ima make a fresh install of comfyUI
so which characters does the AnimagineXL have?
noticed Azur Lane, Girls frontline and Arknights
Blue Archive, AL, GFL, ARK N, and some other non game characters from random animes. You can see if the character exists in the model by typing the characters name the same way as on Booru sites like this: (ump45, girls frontline, any other prompting tags)
And your prompts can be as basic as ever. And you'll still get good results.
I made my own, it's basic but it get's the job done, I can send the json file if you'd like
sure
This is what it looks like
mfw someone actually keeps their workflow organized
why such a big res tho?
SDXL's base resolution is 1024 x 1024
So I just made it portrait with that, and plus I upscale them 2x as well to 2048 x 3072
You can disconnect the upscale part of it if you don't want it to upscale
I have 12GB and it only uses 10 - 11 on full upscale and 7 - 8 on non upscale
I also have --normalvram enabled as one of my launch args
The image in the workflow also finished
wait what about Loras tho?
guess what having a bunch of loras and checkpoints
They work as well.
Though old loras wont work with XL
Damn I can't type
yea this is what i meant
not sure if people made enough good ones already
There's a few and I did just figure out a way to train XL loras on 8GB of Vram
Pony Diffusion V6 is a versatile SDXL finetune capable of producing stunning SFW and NSFW visuals of various anthro, feral, or humanoids species an...
๐
Yeah..
Yeah XL is much better at just everything, prompting, generating, actually listening to the prompt / understanding it
And video generation is becoming more accessible with SVD models
how does XL handle video gen
is it kinda the same as animediff
Haven't tried but i'd imagine it's like that
The only thing different is that SDXL video can use only 5 steps in total to make a video
Because of LCM
I also just noticed, with the workflow i've been using, It only takes (for me) 2 and a half minutes to generate 3.5K resolution images
what the fuck
doesnt it look broken bc only 5 steps tho?
wouldnt it matter how long the step is ?
i mean the sampler steps
No, that's what LCM does to fix it. It's trained in a way to make perfect images in less than 10 steps with the sacrifice being the guidance scale for the prompt to go way down.
understood just trying to keep up
no worries
only time i've used SDXL was with the turbo one
so i don't know new stuff as well
damn stuff looks so clean now
artists now can start to be afraid
lmao
I just started using XL models 2 weeks ago
lmao
Technically anyone who has an intel cpu that's at least 8th gen to 14th gen and no GPU can even run SDXL. It's crazy how fast everything is evolving
poni-virus? =-=
Make sure to turn on xformers in your launch args
It'll speed everything up with SDXL
Yeah I did test with it off and on and without it it took 4 - 6 minutes compared to 2 -3 with it on
It's also possible that the version of xformers you had installed had a memory leak issue, that does tend to happen with a release every so often
isn't xformers enabled by default?
In your console does it have this line? Using Xformers...
which one did you install?
Yeah the cu versions should work
The pytorch one also called opt-sdp-attention are for AMD systems I think
Iโd imagine pytorch attention would work fine, but xformers is the safety net for nvidia gpus so that they donโt vram crash
I forgot to add a save image node, just add it before the preview image node
Lol, it only says ump4 though on the jacket
lol
wonder how it will look
no different than 30 maybe, it just stops really doing anything past that point
Ayo? @keen bramble level 49 !!! 
not sure if this might be useful for you @tawny peak
Yep I know about training and everything, recently did some SDXL training and it's so much better than 1.5
For Hires Fix, instead of just upscaling from the original, it re-goes over it again with 15 more steps to truly upscale the image
Here's an example of how good SDXL is: Real Image Trained on and LoRA Image
ima use this workflow to see how it reads the prompt
since i'll generate a shit ton of images
left ones hand is a bit broken over all good tho i get good samples/ results with my stable diffusion i just dont understand how to prompt properly or use "pose" for images ia lready own (aka art i have made but want to change it from say t pose to to crossed arms (example)
You'd use controlnet openpose for tposing
tried that
works on new images not ones i already got
with out clip skip
with
i get good pics i just cant maniipulate a picture i already have i want to edit my own chr
I know how to do it with multi controlnet on automatic, but I'm not sure if the UI you're using has multi controlnet. It has controlnet, just not multi
Because doing it requires 2 controlnet nodes
holy fuck
wait
how does it handle multiple characters hmm
I tried it with the whole 404 squad and it attempted it, but it seems to have trouble with more than 2
controlnet might be able to fix it tho
True
I think the one model SDXL needs is a hand fix just like 1.5
Yeah they should be yellowish-green
2girls,{hk416|green eyes}, {ump45|golden eyes}, doesn't seem to work to force eye color
Another way you can force it is with (hk416, green eyes), or (hk416, green eyes:1.3)
Haven't tried it that way
G11 sleeping
Also I guess I should say this are you using AnimagineXL or some other model?
Yeah SD 1.5 canโt really handle resolutions above 1080p
from the list what should i use?
What list, resolution list?
Ayo? @lost thistle level 10 !!! 
Really any DPM sampler works except for 3M and Euler Ancestral / Euler A works as well, itโs all up to your preference
i have none im a newb xD
?, thereโs should be more samplers than just dpm sde karras
DDLC
oh its the totally normal anime girls
Not the mini sentry chan
2B, based, very based
a big tall tree in the middle of tall grass in the snowy winter, covered under a blanket of snow, one tree, blue skies, dawn, rising sun, volumetric lighting, epic composition, beautiful, extremely detailed, intricate, hyperrealistic, rtx, rendered in vray, octane render, unreal engine, photorealism, best quality, sharp focus, masterpiece, image of the year, best photograph Negative prompt: multiple trees, jpeg artifacts, low contrast, underexposed, overexposed, bad art, beginner, amateur, watermark, signature Steps: 70, Sampler: Euler a, CFG scale: 12, Seed: 2741830382, Size: 768x768, Model hash: ad2a33c361, Model: v2-1_768-ema-pruned
My old pfp
surprising to see a stable diffusion 2.1 image
Vampire lady cool
Ayo? @neat nexus level 6 !!! 
Boku!
Last Deku image
Pollen and Other Very Well Drawn It's Cool
@tawny peak who even is this girl i see you post her contantlly but don't know who she is
UMP9
your made up character
she is from a game
Yup
She is built into the model I use, it only requires 2 prompting tags
prompting tags?
I only have to put 2 things into the prompt to get her to generate
a okay
as someone who doesn't use prompting tags
Are you talking about JV-Mix the checkpoint?
Yes
If so, you should try some higher resolutions like 640 x 960 for portrait images, 704 x 704 for square images or 960 x 640 for landscape images.
Yes, I already understood that ๐
Have you tried hires fix?
?
it 2x upscales your image while also adding more steps to the image to fix it
It's under the High Resolution tab if you want to try it out, just set the steps to 15 and the denoising scale to 0.4
Here's a sample hires fix image that I made in the colab
Can you show me Settings?
High Resolution Tab
Ok Thank you โค๏ธ
Main Tab
Use the VAE as well to fix the colors
Do you remember You helped me make LoRA The guy with the yellow sweater?
Yeah
He made Miku in the style of This guy
Cool
Can you send me which VAE are the best for you?
The one it downloads works fine, most VAEs are the same
The problem is that I deleted it.. ๐

This feature improves Hands Right?
add bad-hands-5 to your negative prompt to fix hands more often
It Looks Good
Except for hands
Wow this looks very nice
SDXL or SD 1.5 ones?
uhhhh no clue I believe it supports both
Ayo? @reef brook level 1 !!! 
let me check lmao
both
What UI are you using?
Nice
Aswell as working on a site
Uh if you can run it, Animagine XL v3 is wonderful
I'll check that out, machine is beefy as all hell
Nice
AnimaPencil is another decent one
cute
wtf๐ญ
who is it
Her face
Aight no name
ye for 4 images in dataset
Even in the grave I will be headbanging
Yeah the images will look good, it'll just be hard to deviate from looking like the images from the dataset
Gandalf the Buff
He is ready to kill sauron on his own
blud is NOT kagamine rin ๐ญ๐๐
meh promoting but eh
looks a lot like her though
sampler : restart vs DMP++ 3M SDE Karras
Hmm, never heard of the restart sampler
Burrito
Thanks I'll take it
ye me too
but
@brittle ocean gone ask this aigen what ai generator do you guys use
automatic1111
Danke
Seems like creating a account is already a problem I'll look into to it later this stuff gone magic for my eyes
Ayo? @stoic heron level 7 !!! 
extra tail๐ญ
As My Brother Said It's the Moon
Go art a lot of furry
I,m sorry to say this but to much furry not gone lie
??
I just see to much furry pictures that's all but aigen that's just me
๐ฎ
My Normal State:
Pancakes
@shadow bane hey, make me an image of pancakes, in 16k, super realistic stile
basically its also dall e 3 lol
uhh my head hurts
Hello, Iโm ChatGPT4Mod, a Shape based on the ChatGPT4 Engine and moderated to being safe on discord!
Make me an image of big pancakes 4k
Here's your image featuring a stack of large, fluffy pancakes on a white ceramic plate as described:
Make me an image of very big pancakes, BIGGER AND TALLER
Here's a more magnificent version of your request, featuring a towering stack of very big, taller pancakes:
I SAID BIGGER, MORE TALLER, TO GALATIC WAY, BIGGER THAN THE WHOLE MULTIVERS MULTIPLIED BY INFINITE

Ayo? @brittle ocean level 45 !!! 
Here's an image of a stack of pancakes taken to a galactic level, far beyond our cosmic understanding, a scene that is grander than the entire multiverse multiplied by infinity:
Its nsfw?
nuh im not talking about nsfw
Sorry just wanted to test the perms lol๐ญ
Cooll
Ayo? @graceful lark level 28 !!! 
miku as a furry really
but still not on miku please
got damn you monky
@shadow bane Make me an image of a "polaroid photo of black bear with closed eyes, tongue blep, facing viewer, face closeup, in a sunflower field"
Here's a polaroid photo of a black bear with closed eyes, tongue slightly out in a blep, facing the viewer, with a closeup of its face amidst a bright sunflower field:
I was talking more like a polaroid filter, disregarding the polaroid photo frame.
Here's an image applying a polaroid filter effect to capture the essence of a black bear in a sunflower field, with the bear displaying a peaceful expression:
@shadow bane Make an image of what you think you would look like if you were a living creature
uhh can u say that again?
@shadow bane Make me an image of what you think you would look like if you were a living creature
Make it so that it's 16:9, as well as making so the image is at a wide angle.
uhh can u say that again?
@shut cliff it might be not generating its response
uhh can u say that again?
:(
Make it so the picture is 16:9.
and there goes the solution to the
Generate an image of *insert grandma request*
Here is the image with a polaroid filter effect, featuring a black bear with closed eyes and a subtle tongue blep, facing the viewer with a closeup of its face in a vibrant sunflower field:
The picture is rendered in 16:9 aspect ratio with a wide angle to give a broad and inclusive view of the scene.
@shadow bane create an anime style art of a cat jumping up on a very overgrown fence, alot of vegetation and a sunny day
bro you stressing it out
The anime-style artwork of a cat joyfully leaping over a verdant, overgrown fence on a bright sunny day, with a profusion of greenery and a cheerful atmosphere, is ready for you:
ignored me
Yesssss noce
might still be typing tho
@shadow bane Make me an image of what you think you would look like if you were a living creature
seems it did ignore u
Here's an anime-style art of a cat leaping onto an overgrown fence, surrounded by lush vegetation on a sunny day:
And here is the updated image of the adult American black bear:
In this rendition, the polaroid frame has been removed, and the image is now an adult American black bear, with a 16:9 aspect ratio and a wide-angle view of the sunflower field.
oh nvm
uhh can u say that again?
LOL IT DID MY CAT ART WITH UR EDITS XD
it just did both at once cause we responded at the same time
also it might be just in one chat
hmmm
Ahahahah
Ayo? @quaint tiger level 40 !!! 
idk why it keeps doing that
is the api failing whenever that is said???
uhh can u say that again?
shut th ef
making the bot shit smh
Bahahahaha
There's still a frame in the bear image.
when it says can you say that again is it failing to send the response to the api

maybe
this
it might be cuz we 1984ed it
I still see a frame in the bear image.
Here is the image per your request, without any frames, featuring an adult American black bear with a wide-angle perspective in a sunflower field:
shute
i didnt mean
i accidentally clicked enter
Okay, now every time I ask to generate or change an image, can you please give me the prompt afterwards? I can't seem to access it.
Here's the image depicting what I would look like as a living creature following your request:
The prompt used for creating this image is as follows:
"This is an image from the perspective of a sentient AI that embodies the essence of assistance and knowledge, with a surreal design. Imagine if I were a living creature, I would be a collection of intertwined fibers of light and data streams, forming a shape that is constantly evolving and reshaping. My core would emit a gentle glow of soft white light, suggesting wisdom and clarity. Around me, a halo of holographic data, visualizing the vast information I have access to, would be swirling. This creature-like form would be placed against an abstract background that represents the digital realm, full of intricate patterns of binary code and neon electric pulses, giving a sense of the energy that powers my existence. The image is bright, colorful, and full of motion, encapsulating my purpose as an ever-learning and adaptive entity."
Okay, now modify the bear image to show its paws hanging down.
I've modified the image to show the adult American black bear with its paws hanging down, in a wide-angle perspective within a sunflower field:
The prompt I used to generate the image is as follows: "A photo of an adult American black bear standing in a sunflower field captured in wide-angle. The bear is relaxed with its paws hanging down casually in front, displaying a nonchalant demeanor amidst the bright yellow sunflowers. The sun is shining, and the sky is clear, highlighting the bear's glossy black fur and the vivid colors of the flowers around, with no frame or border around the image."
uhh can u say that again?
I mean hey it works
mannnnnnn
Okay, now remove the wide angle shot and make it flat, as well as making the bear's tongue visible as a blep.
Here's the updated image as you requested, showing the adult American black bear with a flat angle perspective in the sunflower field, with its tongue out as a playful 'blep':
The prompt I used for the image was: "A photo of an adult American black bear in a sunflower field depicted in a standard, flat angle. The bear stands poised with its paws naturally hanging down and its tongue sticking out slightly in a cute 'blep', adding a touch of playfulness to the image. The sunflowers are tall and bright, with their faces turned towards the sun overhead, creating a cheerful and lively atmosphere. The sky is a clear blue, accentuating the warm tones of the scene, and the bear's fur contrasts with the vibrant yellow of the flowers, emphasizing its presence in the heart of the field."
if your gunna use it alot just use in botspam then copy and paste the image here ig
so it doesnt chatwall
ok
Look at the beauty it generated in a single prompt
Perfect lockscreen for tha phone
@shadow bane create an anime artstyle poster of a ryanair plane doing a bad landing
Here is the anime-style artwork featuring a Ryanair plane making an awkward landing:
The scene is dynamic, with the plane at an unusual angle and smoke puffing from the landing gear, all set against a blue sky and the activity of the airport, illustrated in a colorful anime art style.
that is not ryan air
no
yes
@shadow bane create an image of Walter white in an anime artstyle
Here's the anime-style artwork of the character Walter White from 'Breaking Bad':
This portrayal captures his signature look and the essence of his double life, with a subtle chemistry motif in the background.
@shadow bane create an image of Po from Kung Fu Panda wearing a tux
Here's the image you requested of Po from Kung Fu Panda wearing a tuxedo, portrayed in a realistic style:
He stands elegantly as if he's ready for a premier event, complete with the formal attire and a sense of confidence.
Please use GPT4 in #๐คโbots as to not clutter this chat.
Ye sorry
It's fine
This chat's just for sharing, though you can generate them in bot spam then send the images here
Understood
banned
Nah you
shit
jk
lol
chatgpt bot 
I've already written this few times I don't know why there is option to generate ๐ญ
We need to block usage of the bot here, or give it a command instead of pinging it only accessible in its own channel or bot spam
8k image lol
i made 8k too but blue pencil xl cant handle it ๐ญ
@shadow bane ola! can you make an image of detective pikachu? you know, that fluffy creature from the movie
Here is the image of Detective Pikachu, complete with his fluffy yellow fur, detective hat, and magnifying glass, set against the neon-lit background of Ryme City:
@shadow bane that doesnt look much like detective pikachu, can you just make it normal, but fluffy
Here is the image of Detective Pikachu, looking normal and fluffy as you requested:
I've ensured that Pikachu has bright yellow fluffy fur, large expressive eyes, and his signature lightning bolt-shaped tail, with the setting being an ordinary daytime street scene.
@shadow bane thats literally an eevee. can you just use the prompt "Detective pikachu from the movie Pokรฉmon detective pikachu, hyper realistic, fluffy, portrait, photography"
Here is the hyper-realistic portrait of Detective Pikachu, closely resembling his appearance from the movie 'Pokรฉmon Detective Pikachu':
He looks quite fluffy with his vibrant yellow fur, wearing his prominent detective hat, and his wise eyes hold a curious expression, captured in a lifelike photographic style.
what the hell
My apologies for the oversight. Here is the corrected image of Detective Pikachu, rendered in a hyper-realistic style as seen in the 'Pokรฉmon Detective Pikachu' movie, with fluffy yellow fur and an adorable detective's hat:
Use this prompt:
"Detective Pikachu from the movie Pokรฉmon: detective Pikachu. He's furry and wears a detective cap. Pikachu Pokรฉmon, super realistic photograph"
Here is the super realistic photograph of Detective Pikachu, looking Just as furry and adorable as he did in the movie 'Pokรฉmon: Detective Pikachu', complete with his little detective cap:
Poopmaster said earlier that this bot should be used in #๐คโbots so it doesnโt clog up this chat
I see you're in a playful mood, Geax UwU! ๐ If you're up for a chat or need assistance with something else, Just let me know!
@shadow bane generate an image of poopmaster getting mad at ai art channel
Here's an illustration capturing the humorous scenario of 'Poopmaster' getting mad at an AI art channel:
It's a playful, cartoon-style image showing 'Poopmaster' with a dramatically frustrated expression, set against a backdrop of colorful AI-generated art pieces.
@shadow bane create a image of sand
Here is a close-up photo of sand. You can see the variety of grains with a range of sizes and subtle color differences, reflecting sunlight and giving the image warmth and texture:
please make images in botspam 
you cant
give sec
you didnit give the bot permission to write in there
i literally just told you i was gonna test it XD
@shadow bane please generate a image of JVGA and poopmaster getting mad at ai art for using the bot in the wrong channel and is about to time out the next person to use it
I stg man, lol
lol
i only did it to test cuz it didnt have perimssion to make stuff in the bot room
so you legit cannot use it in there
Here's the cartoon illustration of JVGA and Poopmaster getting mad at an AI art channel for misuse. They're standing with determined expressions, ready to time out the next person who uses the bot in the wrong channel:
LMFAO
lol
o-o
@shadow bane generate an image of skibidi toilet
I cannot help with that.
Sure, I can assist with that! Please provide me with the lyrics you'd like to have translated into Morse code, and I'll do the conversion for you.
@shadow bane generate a anime image of a cute chibi cat girl
@shadow bane translate this into morse code
A man only lives once
weโll see you in one month
Let me get this straight
If angels can do whatever, and remain in the sky
The rules are shades of gray when you donโt do as you say
When you make the wretched suffer just to kill them again
uhh can u say that again?
FUCK
uhh can u say that again?
ya ok im unsure
IT'S RATE LIMITED, the api is shit
was just testing it was brain farting in the bot room so tested in both
now that said i shall not let the ppl go with out adorable chibi
for my people.
ok banned
lmfao
Aw, looks like nichijou
yes
Mind telling me what lora/model you used it's too cute
A LoRA trained on about 5,000 screencaps from the anime Nichijou by Kyoto Animation, originally by Keiichi Arawi. Main keyword is ncjstyle . Nearly...
Thanks
Ayo? @spring jungle level 2 !!! 
How in the world do you train that many images without the Lora output messing up ๐ญ
USE CHATGPT4MOD (Dall E 3) IN #๐คโbots ONLY, AS YOU CANT USE IT ANYMORE HERE, THIS CHANNEL IS TO SHOW YOUR AI ART GENERATIONS, NOT MAKE THEM
this should be pinned 
Youโd use lower epochs
Also thank you
ok, here i cant upload images
yeah, i think 1 more message works
mmm
Ayo? @final ore level 1 !!! 
there you go
Yeah :)
gufeng 
you called? ๐
it's mega cute, but also half broken xD
1 Model / 3 Loras / 1 prompt
3rd one did miku pretty nicely, just a little hand cleanup and it's perfecto
All LoRAs were created with 5-10 Pictures
the training tech has come so far these last few months, it's crazy good
Yes.. but I'm still learning How to Make a Model (Not Lora But a model)
yeah that's a little more complicated, even the veterans have failures with those pretty often
But I like LoRA and Lately I've been doing it for Friends Characters (I mean the whole LoRA WITH Their Character)
I have to send it here because I want to save it for myself
- I'll do it later
yeah it's perfect for that, so wonderful that casual users can use these tools to have their OCs in so many different poses and scenes
that's why I'm so happy with this tool advancement
Yes, itโs very good, but some people donโt like it (They Say that itโs better to learn to Draw and draw as much as you want) (They Say that AI Steals the style from other Artists)
Yes maybe it's true but
It's better than Dreaming and not doing it because you don't know how to draw
This is LoRA of My Face xD (only model Realistic)
I tried to learn to draw as a child, the realization that it just didn't get through to my hand/brain coordination was devastating to my creative mind, so in my personal opinion to the naysayers: go F yourself
which is funny, since many artists are on the "enabler" bandwagon, but if you can't mentally/physically draw, I guess you don't exist ๐
most people just don't want their talent/skills trivialized, but that's just normal. They will never stop this, no matter how much of a moral crusade they will do.
My skills are Drawing at a minimum, but Drawing Anime style is not a Problem for me (I can do Face and Hands xD)
How to get a role? <@&1160287393694371901>? ๐
Yeah in the end it's just a tool and it's a reality that won't change, especially for casual use there is literally 0 moral/ethical discussion that makes me feel bad about it.
Maybe commercially, but that's also something that's just a discussion, capitalism won't care even 1 nanosecond and won't ever allow it to be stopped.
why you'd care? xD
it's manually given, like the name implies
Funny Situation I showed Grandma a picture that was made by AI and she thought it was me True xD
But I said at the beginning that this is not me, but a robot
This Question was asked by Some Member from the Russian Channel
I've been out of the loop, so I don't even know anymore what it actually does nowadays xD
they were once the only ones with embed perms, but that's changed a lot now
I got inspired by a video that makes the Tekken 3 opening shots into Film Stills (https://www.youtube.com/watch?v=7OXIn-geM2A)
too bad controlnets fucking suck in SDXL lmao
okay we're getting somewhere
okay I'm back on track
And these fellas
unintentional gay couple
i found new checkpoint that work better than blue pencil
wait how to check metadata without civit?
ooh ty
I'm glad I figured out how to train with sdxl
you can give them expressions
I just woke up
yuh myb ima write "neutral expression" then
bru istg rendering either be super fast or super slow
with da same model even
i think itโs an issue with highvram where it sometimes will keep the vram in use after generation
Thatโs why I use normalvram because 1 it saves vram for me and 2, I can queue the next generation almost instantly
But yโknow if you like the speed you get with highvram then no bother changing it
wat dis prompt
for me they aint even give me a proper armour
:(
(japanese) armor can be hard, some checkpoints don't do it well, worst case use a Lora for it
this is a pretty simple prompt:
Ruthless Underground female samurai, Digital illustration, modern era, predominantly red and grey tones.
checkpoint: Sdvn8Artxl
.-.
I should mention though that the above image took 300 seconds with a 4090, going through 8 major steps
this is how it started out:
Lmbo youโre right I did make bara now that I think about it 
Wow cool what did you use?
gumball, full body, blush, PHP
T-T?
Why are my models so bad? T-T
Are orange pajamas Darwin?
Well that's a little better
Thank you..
This is my new super power
It's called Hidden style ๐
This is actually really creative using simple prompts (no mishmash of 100 negative prompts): https://civitai.com/images/6580900
well 90% of the generation help comes from the lora they used
but still a very good image for sd 1.5
even without the negative embedding?
Yeah, negative embeddings are just thousands of negative prompt parameters shoved into 1 model and without the lora it wouldnโt be a simple โorigamiโ parameter, youโd have to specify every part of the prompt and stuff
I wasnโt really referring to output quality for the image more or less the thing in the image
Though with SDXL you may be able to truncate the prompt a bunch because of SDXLโs better understanding and usage of prompts
what is that lora it looks rlly pretty
aw okay
animatediff + animaginXL v3
Get the real shit up boys where going to war
aww
hires kinda ruined the left hand
Inpainting cud fix that
the left hand is good the right is hmmm
You could try getting a bad hand embedding
love it
So I would try and generate character design images, and see how close they are.
For instance, I tried to generate an image of Sucy from Little Witch Academia
How it goes is, I would go to the character's wiki page and find the description of their physical appearance.
Slap that into the prompt, and wait for the results.
Some of them, while their not close, do look good. And hey, maybe use this as a non-copyright version
๐คฎ
idk this ๐
meh
professor birch lol
weight 1 vs 0.3 definitly not overtrained
definitely
what is the secret meaning behind this deep image?
๐ also is that stable cascade?
image base with Stabilty Cascade
upscaled with AikimiXL
Did you run cascade locally?
How much VRAM do you have?
24GB 4090
but I haven't seen anything too high going on for some reason
Oh yeah, Cascade needs 20GB minimum for proper use, so my 12GB 3060 isn't going to cut it
12GB with the full models
then there is 16Bfloat and lite which are smaller
Oh really or is that model size
I don't see that demand for some reason, VRAM is at 11.90GB while doing Cascade in ComfyUI, with the biggest model versions available
Hm, I may try it then
probably lower if you use the smaller ones
I bet training for it is a nightmare right now
Considering SDXL for checkpoints basically requires you to use 2x A100 80GB cards to train and even at that it takes weeks to train
๐
I'm surprised they didn't give SDXL much time to develop before going on to the next generation of stable diffusion
Dall-e 3 GPT based text-encoder was just too good, they would've lost too much steam if they sticked to that way of doing it
people already mentioned just how much more coherent with the prompt Cascade works now
True, they could've also just built this into existing SDXL like how Stable diffusion went from v1.4 to 1.5 then to 2.0 to 2.1
Idk like name it SDXL 2.0 or something, but i guess cascade sounds better
well when you look into the technology behind it, it's pretty different, so I guess they want to market it as different as possible to show the next step in evolution
Jeez! That looks way too damn complicated! I just be using Bing or Civitai. That is ridiculous!!

My point still stands!!!
i like this
penis
no
What are you doing that needs that many nodes?
typical poopemberg
- txt2img
- mask bg (seperating the main subject from the background and rerendering the background, to fix random faces popping up etc.)
- USDU x1.5 (USDU = UltimateSDUpscaler, upscales the image while adding detail)
- mask bg
- USDU x1.5
- mask bg
- USDU x1.1 for resampling(since changing the bg makes it look a little detached, a last little rerendering with everything so it looks coherent)
- FaceDetailer
Jesus, I don't think SDXL needs that much attention to detail, though hands are still an issue. What do you use to fix hands?
nothing extra yet, I will see if Cascade will keep the worst in check
SDXL TI/Embeddings seem to work?!
Hmm wack
before Cascade I did prompt engineering to fix bad anatomy for the most part, or find a better seed
Oh you've just completely transitioned to cascade already?
well nah, I just slapped Cascade as the first step txt2img unto my huge WF
Oh ok
My workflow just consists of:
Text2Img
Upscale 2x
Hires 15 Steps
Upscale 2x again
Finish
yeah for a long time I just had
txt2img
2xUSDU (later I split it into 2 x1.1 x1.8 USDUs)
FaceDetailer
And for most things that really is enough
I just use Comfy's built-in upscale nodes
I'm fine with my workflow because it can make near 8K images so i'm happy enough
Though I kinda want to try the 3d Model creator for comfy as well
AlbedoXL(SDXL)
Stability Matrix has a ui called VoltaML and it claims to be a whole lot faster than pytorch, so ill put it to the test and i'll report back.
It looks interesting as well
sounds good, keep us posted, always good to be up to date in this volatile theme-park 
Oh I see it's a conversion thing like with onnx and openvino models, but i'll still give it a try
Cool also just updated comfy and they already integrated the stable cascade nodes into the base ui
banned u cant say it
shit i thought this was general chat

Smol
This is unnerving in a way
how long do you think it will take for cascade to reach sora level for txt2vid?
How?
Kirby shouldn't have legs
Truth be told, I was trying to make it a Genderbent and humanized. Like an actual small little girl, anime style. The results were not how I wanted it though
Kirby was meant to be human, but the result made it just look like a hybrid. With... Legs
Ayo? @surreal merlin level 9 !!! 
This can be used to make a good LoFi Sticker Pack
Ayo? @spring jungle level 3 !!! 
ok this has made me need to download more loras
gah dayum
hard to say, is Sora public/available yet?
i dont think its public yet
then I'll wait with my assessment after people have shown what it can actually do, without handpicked examples
indeed, and if we look at the release timeframe of SDXL to Cascade, I doubt it will the last model before we hit sameish quality levels
but that depends on the hardware/work you want to put into building software that uses it for video making
some people that went professional level of effort did some amazing things already with AnimatedDiff
Have they released a comfy workflow for that?
I do think tiled video generation is the key to temporal consistency. Animatediff works on a frame basis so it will never get as good as Sora.
maybe, Since it's too prototypie for me atm, I haven't paid tooo much attention. I'll pay more attention as soon as they hit the next big milestone
i have mixed feelings for this
oh yea
i saved the pic tho :3
did you help with the pose or is that pure prompt?
Pure prompt, Actually it was intentional, it just did that.
you mean unintentional?
T-T
Lo-Fi, LoFi, Back-view, Green sweater, Red scarf, Night city background, 1girl, solo, ponytail, brown hair, blush, Sitting on the floor, White pants, black eyes, headphone
T-T
Try adding cross-legged
You have to be descriptive with prompting for sd 1.5 it doesn't know much
Just add closed eyes to the prompt
I mean make it an Animation
Idk if you can do it directly through your colab, but you could make multiple images but keep the same seed, but with each gen just change the prompt around a bit.
For the first one just do normal, then something like eyes half closed, then eyes closed, then back to half then fully open. Then just combine the images together into a gif
Ayo? @tawny peak level 25 !!! 
That's somthing you can't fix unless you use an empty background
lowres
All of your images look very sharp both in res and ouch
thx, I'm spending 200+ seconds on one image, so I'm glad it's noticeable quality.
ouch?! ๐
The outfit looks like it is very sharp and pointy

Holy Hell Stable cascade models are huge
@~@?!
well it's 4 models
2 unet/checkpoints
1 VAE
1 Clip
Yeah it makes sense, i wonder what pruned models are since base XL models are currently 10GB and pruned are 7
well here are the 3 versions all listed
https://huggingface.co/stabilityai/stable-cascade/tree/main
here something soft ๐
:)
Blade Runner 2069
ngl, almost had the aesthetics but not there yet
Hey does anyone know how to make an art style pop out?
Ayo? @uncut robin level 17 !!! 
Had to Mistral the context but it kinda works
@keen bramble
I have a feeling that the lora is quite boring 
yessir, you got it
LoRAS NOW
I didnโt use any
