#โจ๏ฝsdxl
1 messages ยท Page 141 of 1
do you orgnize them? bcz i feel that my folder thumbnail file might be like 500mb or something ๐
All of this is why I keep working on automation and scripting to get everything tagged and hosted on my lychee server.
Shameless url drop: https://lychee.soulctcher.net
๐
actually working on it right now. have a program to name them all according to date and then put them in folders according to date
windows?
do you have somethign that can strip their meta data while at it?
i always get back to it, that i want to get a list of all my prompts ever
putting them all in a pile before I sort them
You can save the images from comfy without the metadata. You can also use nodes that save the metadata into text file.
yeah, I've thought about that. but I never do it
You could setup a workflow that will do it all for you for existing images as well.
it sounds above my abilities
Not at all.
not that it's somethign that ever stopped from trying...
๐
guessing this the thing, right?
can i append it to a single file though?
to me helped or actualy made it Aimingfail
@zinc cargo what all do you want to have saved?
i want to keep a list of all my positive prompts
aiii, kid needs me. bbl ๐ thanks Bernix
save image file with prompt seems handy
@zinc cargo i have it this way. All credits to Aimingfail
The input image is someone else's so I'm not sure. The output was done in counterfeitXL
whats a good model for anime? also, how do i make img2img better?
@fading rock try ask in #๐ฅ๏ฝanime and get better in anything key is practicing ๐
CounterfeitXL is the best one I've seen for now.
Some luscious locks in that first block there
ah I see it was prompted for, and that followed better from dreambooth
We are so close to not even needing Dreambooth or LoRA anymore for somethings with IPAdapter Plus. I took one of your images from that post and put it through IPAdapter, it changes the face a bit too much, espeically when you lower the weight enough to change the background, but it's close.
They are apparently working on a version for faces on SDXL
Robot chicken
what kind of eggs does it lay
Grenades
Now you have to generate it laying a grenade
Hahahaha
lol
@zinc cargo does it work for you?
I tried to combine the Grenade and the Chicken
yeah, I can't get it
weird is when bassethound run, his ears are still down along his body
so that's what they look like defeathered. I knew birds weren't real
cool!
you see someone was just taken in as a suspect in Tupac's murder?
For real?
Yeah literally just got the notification like 20 minutes ago
Damn, that is what like 25 years ago now?
No statute of limitations for murder
damn, 27
Seems so happy to kil, err, meet you
He just wants to play a game
So nice to not see the bokeh ๐ป
lol nice try, i believe it should work with gligen, but havent used it myself before https://comfyanonymous.github.io/ComfyUI_examples/gligen/
was this made with local version or the discord bot
ok
I've been trying to squeeze the most out of this, like prompt golf
(Three quarter portrait, vermeer lighting:1.3), black space mage, dark floral throne, (millennium falcon cockpit:1.3), 35mm, ( POV, cinematic afrofuturism:1.3),(adult, ghastly dark, no sky, horror), 1983, (extremely detailed depth), style of Antoine Blanchard, highly detailed
neg: group shot, text, watermark, signature, camera, infographic, art nouveau, blur, low contrast, cowboy, comic books,bad hands, bad anatomy, (bad quality, worst quality,sketch,incomplete,cropped, cut off:1.3)
actually nailed the tacoma
Smells adverty
specific year + models has worked on a lot of cars for me across SDXL models. Very fun
Not sure that would work in SDXL. But you can use positional conditioning, but again, not sure how that would work with IPAdapter. I'll need to have a look into it.
really? any time i try to do most recent year/gen it does the previous one
def gets model years messed up a bit, but will often have both and they're pretty specific
let me know, im curious now ^^
first pic looks nice
2nd pic reminds me of that one gta sa remaster video
was by 12th hour i think
looking at the examples I don't think it will work, as it's controlling the text conditioning, but IPAdapter changes the actual model. So I think at higher weights it 100% wouldn't work. Maybe it would at lower weights. It looks like a lot of effort though, so I might look at it a bit later. If I do and it's decent I'll post it in here.
greetings, Arron ๐
(vermeer lighting)
i really wish ai was better with the close up details
Hello ๐
(close up, POV, intricate details:1.3)
why do i like that
nailed this one
I'm impressed...it knew the 2013 model year above from the 2003 model year:
Not perfect, but the main differences are known.
what is? Aimfailing's solution? we'll work on something for this purpes
which one are you using
Which what? Model?
@native knot
is there sdxl for lama cleaner?
can lama cleaner remove rain from the whole image?
I think this sounds like an IP Adapter task
ye
Current one I'm using is bluePencilXL
what GPU do you have?
Yup, just prompted. No ipa/controlnet/etc.
If you have a 4080 you shouldn't have ANY problem running SD 1.5 or SDXL on your machine...that is, unless you are woefully low on system RAM for some reason.
i love it
64gb
Shouldn't be a problem, then.
could "no module 'xformers'" be the issue
could be, yeah...are you running comfy or a1111?
I'm not too up on the latest for resolving a1111 xformers issues, but it sounds like you don't even have it installed at all. Typically if you just add --xformers to your command in the config, it'll install it for you.
These turned out so good! The icons are clearly readable and the colors and brightnesses are perfectly balanced. Very good hitrate. Generated 160, and easily over half look usable. ๐
they look great, art style is perfect.
Thanks! Been slowly working on them for over a week now.
this is the result when i use sdxl
I've been ๐ ing. The dark fantasy sparkle is super nostalgic
What VAE do you have selected?
idk looks good to me
You're using the wrong VAE. Either using SDXL VAE on 1.5, or vice-versa.
none
is that where i made a fucky wucky
You need the SDXL VAE
alr where do i select vae
I don't know where they tucked it away on 1.6...if you don't see it on your screen, it's probably in a Settings menu.
I pretty much use Comfy nowadays.
maybe ill try that if i cant get this to work
its still not working
wait
hang on
i think i figured it out
caveman discovers fire moment
that is MUCH better
There you go.
๐ฑ
Bolt nips.
works for me, lol
i really wish sdxl was good at current gen vehicles
look for vehicle loras. train your own LORAs with kohya
idk what that is lmao
haha, little models you can reference in your prompt. can be a person, a style, a thing. usually under 200mb. In general, if you want to prompt something not in a model, this is the path. not hard, a little fiddly and time consuming though https://www.youtube.com/watch?v=d4QJg4YPm1c
I had a look into it. It was a pain, you can sort of get it to make an image. But it doesn't really work properly because it needs text prompts in the final image anyway. I tried to combine unclip conditioning from them all and it just created a mess.
Maybe if you combined the 2 IPAdapter modified models together and ran that through it might work, but I don't have the VRAM for that.
is this a gen or a screenshot from a simpsons episode?
the other foot is kinda wonky
who wants me to run a dalle 3 prompt for them?
I would like to try by myself. How?
jealous. I have thousands of old dalle gens.
how to gain early access?
its random
this would sell as a all over print garment 100%.
(extremely detailed depth:1.3), intricate detail, natural land forms,Futuristic landscape with some spacecraft flying over the planets, in the style of mind - bending murals, floating structures, futuristic landscapes, (style of El Greco, style of Leonardo di Vinci:1.3)
i can run a prompt if u have anything in mind
Very glitchy I like it.
I put this into IPAdapter and tried to get it to put a wing on it, and it just turned it into a 3 Series
https://github.com/cubiq/ComfyUI_IPAdapter_plus
And for comfyUI nodes
prompting for dalle 3 is more natural language
Basically you give it an image or images and it can create new images out of it that look similar, or you can add text prompts to change the images.
For example I took your car and used the prompt "Desert Dunes"
Unfortunately it's turned it into a Merc
Alternatively I can use the same sort of background and change the car
ick, not my favorite. "Alien landscape vista with flying space vehicles leaving chemtrails. Ringed planets fill the red sunset horizon. Lighting of Vermeer, a mind bending oil painting mural by El Greco and Da Vinci"
did you all see the post about dall-e 3, pretty incredible stuff:
https://www.reddit.com/r/StableDiffusion/comments/16uynpp/dalle_3_blue_ball_on_a_red_cube_on_a_wooden_table/
We can just steal their images to use in SD
lmao
cool
Christmas card illustration of a mouse relaxing in a mug of hot chocolate with marshmallows. Beautiful, nostalgic
this dall-e 3?
yes
damn thats crazy
i can run some prompts for u
adorable aware fictional pygmy marmoset cuttlefish chimera, beautiful studio photograph. calm try a hard one lol
try this one i made in sd
Horror-themed , on the moon, dark side of the moon, space suit, dying , dark, creepy, lying down, dead, . Eerie, unsettling, dark, spooky, suspenseful, grim, highly detailed
omg lol
also how did you get access
holy shit I'm done, that's amazing. The other image took me an hour to get right lol
to dalle 3
so you just randomly get it
yes
Dpes Bing tell you if it's used Dalle 3?
welp time to use bing till i get it
no but try use complex text in dalle if it works then its dalle 3
can you give an example
idk if this is dalle 3 but it looks very good
Suggestions on removing these splits?
Sega Hologram Time Traveler
yes, exactly ๐ nice catch
i meant a prompt example
that is quite good yeah
major nostalgia
A woman walking in a street with a sign that says "DALLE 3 can do text", in cartoon style
Is this complex enough lol
this is insane stable diffusion needs to catch up
it will
at least we still have anime tiddies
then u have access
W
lol
how do I even get to bing he wondered
i only have access on chrome not edge lmao
nice
yeah its a massive leap
it might make me use dalle more
Problem with Dalle is they are never going to open source it
So you'll either have to pay or have limited generations
its free on bing unlimeted
Is it actually unlimited, last time they said that it still stopped you using it after a while.
yeah i think its just a bit slower
no pictures of the new model in the training data
they need to add them fr
wobailondy means family, and family means sharing prompts
Is that just from putting in "xbox 360 game cover" or something?
yep
dude wtf dalle actually nailed the walmart logo
with dalle 3 u can be very spesific and still get all the details
the amount of detail you can make in a prompt with this is insane
i hope dalle 3 will get img2img
context is understood so well
just imagine what dalle 4 will be like
makes sd prompting look like oonga boonga language
What's with all the promoting talk?
cursed
im getting snowrunner vibes from this
yeah
it does everything decent except the actual person
what was the prompt
same prompt on sdxl
lol
the image generation wars are real xD
we thought sdxl was good ๐ฆ
its like going from a flip phone to an iphone
for real, dalle-3 IS extremelyh good
i really hope anything "self-run" will be available soon
going from a note 3 to a samsung s23
yeah i dont think stability ai will be able to compete
cuz dalle 3 is built on chatgpt
I dont think so, I think you need a huge language model to pull something like this off
cries in nokia 2210 age
that just doesnt excist yet
for sure -> it's multiple models (llm/diffusion) models
they also just a bigger company
i'm pretty sure openai used a lot of model tech to make sure each part goes to the right agent (ai model)
art is dead?
dalle 3 is the first model that is actually good in hindsight
well Can you try some prompt for me?
yes
is this the biggest leap in image generation yet?
might be
by Caravaggio, impactful color paint of Xenomorph alien with fruits in Venice, highly detailed, vibrant colors , 8k, sharp, professional, clear, high contrast, high saturated, , vivid deep blacks, crystal clear
This is SDXL
how about realism tho?
well I prefer al lot SDXL result
dalle 3 follows prompt better
Wait until it get neutered to death ๐
yup. like chatgpt
I saw you post this earlier, it's so good
used to be good
thanks
its didint get the Caravaggio part
Is that blood on the aliens face? Open AI: Yes! Banned.
another try
cinematic photo Charlize Theron in sexyiest red Star Trek TOS uniform sitting in Enterprise , in the style of 1950 vintage sci-fi movie, Star Trek TOS, lomo color 100 35mm photograph, film, bokeh, professional, 4k, highly detailed
hmmmm
again this is SDXL
It seems pretty bad at people
The Dalle 3 ones
Does she like Cocaine a bit too much
sdxl makes great images, it's just that when the prompt gets complex it fails where dalle 3 succeeds
because of chatgpt
yup sadly
it also have big servers
the fuck, I asked for varations and it gave me this lol
again i think sdxl might have a better image but dalle follows the prompt better and more accurate
what's more likely, dalle adds to make better, or puts in more restrictions (like ChatGPT) and gets worse
i dont think their gonna change it that much more
2020
but if its free and very good can you really complain
I've got SDXL locally, I'm good
crazy stuff
ill use both
(close up, POV:1.3), cinematic oil painting by Caravaggio, impactful color paint of Xenomorph alien with fruits in Venice, highly detailed, vibrant colors , 8k, sharp, professional, clear, high contrast, high saturated, , vivid deep blacks, crystal clear
crystal clear is a good helper, ty
another : cinematic photo alien car, 3/4 view, in a cyberpunk downtown , futuristic 35mm photograph, film, bokeh, professional, 4k, highly detailed
SDXL
this is good
my favorite game
my 2nd favorite
new lego set dropped ๐ฅ
this one is insane
how
a robot painting itself painting itself painting itself painting itself
the fact that all of this is now possible is crazy
I did another variation
i did the same prompt but with a hand
wow
pretty close but not perfect
haha
lol
but dalle3 cant make porn ( oh wait sdxl cant either ) 
there is plenty of porn already
it's rated teen for the text options when they hand you the bill
is this a prompt?
Squid ward from sponge bob digging a hole with a shovel, a tombstone with the word Midjourney is next to it
i love these game ones lol
fr
damn
didnt know he was chill like that
ultra rare gta v copy for the gameboy color!!!!!@!!@!@!@ (only one was made fr)
๐ข
the text is so much better then sdxl, and that was the main selling point of sdxl lol
in all honesty it's pretty damn easy to add text to a photo in photoshop. it always blends well
i just dont get why people make porn with sdxl lol, like i just rather watch real porn
catching fortnite dubs in the 2000s ๐ฅถ
coldddd
maybe they like to generate their squidward lewds that u cant get anywhere
This looks like a 90s playstation magazine ad

Oh look an SDXL picture in the SDXL chat channel
lies
jk its very good
the sequel had better mini games
peripherals out of shot
here's the whole thing
Is mine better or worse lol
it took me a very long time to split it up into sections
Before it was just everywhere
the best workflow is the one you made
Yeah the organizing part takes a shit ton of time
lowkey fun tho
this one is better and smaller
What are you even doing with all that
whats going on down in australia on the bottom right
is this the world map
half the fun of sdxl is playing with settings
turtle duck
why is the carpet texture in 8k lmao
ok but how do i get that car out of the room
Window
ill buy you a new one fr
what if its a cliff house
big crane
what if its a flying house
People have done this before, they normally take the side of the house off and pull it out
It's Digiorno.
๐
these are really good
so good they are making me hungry
Hi, how use xl model?
Ya'll going to be pissed at me asking this but is that the paid version of GPT?
It's Dalle 3
nosejob voldemort
For sure, but how are you guys accessing it?
bing create\
it is for me
use a prompt like "a man holding a sign that says "dall-e is amazing"
and if you can read the text mostly
its dalle 3
Thanks lads, do boosts reset daily?
love the 80s cinematic look
Clearly very close relatives. I guess triplets.
Yep
I am just looking at how well all these hands are
For this distance, definitely very good
EW WTF
SDXL wants to play some complicated af Monopoly.
thats normal
brad pitt x trump ๐
brad trimp
Does anyone know what SD is gonna release after XL?
Have they said anything publically
XXXL
wym
Which settings?
You can upload an image that was generated and send it to the txt2img page. In png info tab
so they dont save
?
Yeah it's blank. Was weird to me too at first
But saving useful setting pngs in a folder works fine
Haha
hello ladies
Cornpus
anyone know what the SD Doodle drawing box was made with? language wise...
assume javascript maybe?
IPA workflow? Bing?
bing
Frontend is almost always JS,
be the change you want to see in the world
write your frontend entirely in Rust
getting there, not sure how to display the image when complete but api is good (gotta tweak control net obviously)
replit with ghostwriter is pretty dope
local comfy
Is there equivalent of a1111 where SDXL models can be used?
you mean a1111?
@restive elm probably you mean comfyUI. Or you can use sdxl in A1111 itself
hey @visual glade, dont suppose you have a free sec to look at something do ya?
Dang @soft zealot you came out with the boxing gloves on with that heavyweight workflow release on Civit... Nice job... RTFM lmao Boom!
ip adapter described in one picture (not by me):
its here to test, you gotta sign in with outlook microsoft https://www.bing.com/create
For sure got it running, do you know what they are doing with it?
heres a video i found today
how to get DALL E 3 access. Oh and did I mention it's free? HUGE Shoutout to the community.
โผ Link(s) From Todayโs Video:
โฉ Bing Create: https://www.bing.com/create
โฉ Chrome: https://www.google.com/chrome/
โฉ Microsoft edge: https://www.microsoft.com/en-us/edge/download?form=MA13FJ
โฉ Brave Browser: https://brave.com/download/
โฉ Firefox: htt...
looks like its good at text and real characters like simpsons
StarCraft: https://civitai.com/models/154238?modelVersionId=172817
StarCraft - Transforming Images into Celestial Constellations
Introducing StarCraft, a revolutionary LoRa (Text to Image) model designed to reimagine ordinary images as celestial constellations adorned with connecting stars. With StarCraft, you can effortlessly usher your visuals into the cosmos, creating a mesmerizing celestial experience that ignites the imagination and curiosity.
Use trigger word: c0nst3llation.
Trained on 3000 steps from a highly detailed by hand captioned large dataset.
- Special thanks the people who help me and who I care a lot about:
@MarkOREZ
@masslevel
@osiworx
@mix
@Thibaud
@Kamikaze(Elon Musk)
what he gonna do?
you wot m8
is dalle-3 only accessible for gpt4 users?
you can use bing create to use it
So dalle-3 actually seems to be WORSE than SDXL when it comes to fine details. Anybody got a good comfyui workflow that might be able to take dalle3 images and fix the fine details
Hello, can I find out when the new SDXL will be released? :). Let me guess by about 2024 or summer 2023?
what's the new sdxl?
bing now uses dalle3?
interesting, but you said it doesnt work in SDXL so would it work in SD1.5? or is it just because of the combination you used with the IPAdapter, wouldnt it be possible then if you load the model again without the adapter influencing it and/or having its own CLIP text prompt like in the example? i dont know why it wouldnt work looking at the example, but then again i still havent tried myself ^^
Yeah, I"ve been testing it and its pretty amazing.
Been really impressed with its text and hands vs anything else I've tried
So applying SDXL refiner to Dalle-3 images seems to work quite well
its kinda confusing
Before and after:
This is where I"ve been using it. I think you have to use Edge browser
https://www.bing.com/create
you get 100 free generations per day I think
100 boosts, you can still generate when they run out, its just slower
oh ok thanks
Appreciated. Most of whatโs in heavyweight is now in cruiserw eight in v4.2 which is tidier/cleaner to look at
Cruiserweight being my personal daily driver
Well the next generation of SDXL after version 1.0
Really?
Are there any rumors about this news?
not that i'm aware of
I'm sure they are always working on something though... these things take a lot of time to really flesh out into something amazing like sdxl
the devs are amazing
It's true
but then you have the folks that are working really hard in the after products of developement stage... such as comfy... then further down the line you have people that make amazing things with that framework like searge, and winston woof and sytan and tdg8 that understand how to really streamline it to be even better.
its a long and winding road
then you have people that use what they put together and make amazing images, gifs, videos, etc... it just keeps on going
How can I contact SD developers personally?
a lot of them float in and out of this channel. stick around long enough and you get to know who the players are around here.
Isnโt it an idea to contact the SD developer personally, like Emad?
Even Emad chats in this channel once in a while
Just take a look over to the right in the discord chat you can see who the devs are that are in this channel... You could @ one of the devs and start a conversation with one of them if you really want to talk to a dev
no, you shouldnt do that
you should bother them with dumb questions to get a free ban
most of them are pretty nice. but I agree. don't ask them a bunch of stupid questions... however if you are also a developer looking to contribute in some way you could try to talk to them to see what they develop and see if they would be willing to collaborate with you or maybe answer some of the questions about how to make something you are developing work with something they have already developed.
I need a counter to see how many times I said the word develop... lol
SAI doesnt just creat diffusion model, they have other projects so I dont think they are working on SDXL 2.0 atm
that would be why I said... something
which model is that? can we have the prompt for this?
StarCraft SD XL 1.0 lora and you can get all the prompts there
thanks
also made with StarCraft SD XL
@stone fossil train another lora, with the best images, so you can get the best images
ty for the multi channel spam
multiple different teams working on multiple different projects at any given time
so, SDXL 2.0 is under development?
there are definitely SD-related things being built, can't say what exactly tho, you'll find out specifics when they're announced
also language model things being built and audio things being built and other stuff too~
can we expect a gui where can use all SAI products in one app?
any plans to add RLHF into comfy(different release build)?
I wonder if SDXL 2 will come out before SD 3.0?
or they may be same thing...I dunno
Hopefully video ๐ค
How are you doing today, @stone fossil?
Fine u lol and u? ๐
Kinda terrible.
Oh thats a fair awnser I like that.
I'll get through it. Art helps.
I do not call it art, I call it image output.
But that might depress you even more forget that hehe.
It's all art in my eyes, regardless.
Nice warhammer there!
Now do a warhammer space train? ๐
Lol
Do a locomotive train and try to make the smoke starry
This is what is whats to be not sure or its good not into war hammer lol.
Interesting mix. Not bad, tho.
heard you felt bad, i hope my spiced up car makes you feel better
@upbeat summit as usual, your prompts are always fun to play with ๐
That's one spicy wall of peppers.
spicevalanche!
it's regional prompter + mmy songmaster, and it makes the most whacky stuff ๐ฎ
regional prompter for sdxl?
also, where's my commuinity regional meme?
yeah ๐
lol that one on the right
I don't think Gligen would work as it uses sd1.5 model for the text which wouldn't work with the clip_g text that SDXL uses. The positioning you can do with conditioning will work fine.
Well, it's not wrong.
txt2vid, need a huge dataset, and SD only uses public datasets, so it would be difficult to build one atm
Version 20 of my workflow is online - better cropping options (center, top, bottom, left, right + offset) for IP-Adapter and Revision, new layout (with previews for cropping), better documentation
https://github.com/JPS-GER/JPS-ComfyUI-Workflows
I can see an image I made in there ๐ถโ๐ซ๏ธ
๐
which one? seems i liked it :)
look at my comment again and make a guess ๐
the cloud couple
it's nice as input image for ip adapter / revision to get a cloudy look
for example the hair in this one is done by mixing the cloud input:
that's really cool
where can I watch more on how to use ipadapter and what version of it is it?
very helpful I bet for making datasets
I made the whole of my Cloud LoRA from ControlNet images
ipadapter isn't that Controlnet?
that I made myself that is
you could download my workflow or nodes. or search for ip adpater in comfyui manager to get the nodes needed for ip adapter
I am still on Auto11
๐
I haven't had time to learn comfy
I finetune all the time I have
besides family and such
it's controlnet depth for the pose, but the first two ip adapter images for the style
I just haven't played around with ipadapter as I was digging realvision
i think there is an ip adapter extension for a1111/sd.next too
there is
or if you just want some technical information you can look here: https://github.com/tencent-ailab/IP-Adapter
thanks!
example:
Anyone not using IPAdapter is missing out, especially the new Plus model
@icy brook Put one of your fire images in and just used London Bus as the prompt
or with different weights:
But can it make the texture take over the subject/object altogether? Bc what I see in these images is mostly how it becomes a feature.
Depends on the model,weights and input images really
i didnt get that lucky with the clouds
I think part of the reason JPS's works so well is because the position of the clouds matches where the character is
Try this instead
I'm interested how JPS is doing the mixing, because usually it tends to do the clouds as a background like this
maybe it tries to find commonalities in the images based on composition?
There's a few different ways you can mix the images together
ok
I'm feeding multiple into 1 IPAdapter, which can reduce the control somewhat, but with IPAdapter Plus and 2 IPAdapters it uses over 10GB of VRAM, so it's super slow
is that why with your great workflow, we can't control each picture's str?
You can control the conditioning of it via UnClip, but you can't control each images individual IPAdapter Weight
You can chain IPAdapters together with a single image in each, but each one you add uses more and more VRAM
i use the chain mode of my workflow (instead of loading all in one batch), so i can adjust the weights for each input image. and after that it's some fine tuning of the weights.
Yeah I can't do that with IPAdapter Plus Model as I run out of VRAM after more than 1
Need to get a 4090 or something lol
yes, i have 24gb vram - even that can be too little if you want to upscale and use more than 4 ip adapter images
a used 3090 should be enough if you don't also want to use the card for gaming
4 IPa!!!
Gonna flip over to comfy for a little bit to try some things, though I generally use A1111. Is there a new place folks are sharing workflows?
oh, looks like JPS posted one up there...
pepole here share and you can ask for stuff for specifics, people here are very helpful
Thanks @zinc cargo ๐
I wanted to switch over for a minute to see if I could get longer Animatediff outputs. Apparently comfy supports it quite well.
but can this be done in auto11? I really can't move to comfy just yet
I think it has a basic IPAdapter mode, but you can't do chaining multiple I don't think
Ooof, can't use JPS's... got like 50 red boxes. ๐
@south horizon links for the required nodes are in the description :)
you can load them through comfy manager
I think your workflow might actually do far more than I need ๐
... famous last words I guess
you won't find many workflows that don't use custom nodes, as many features like ip adapter require them. only most basic stuff is in the default nodes of comfy.
lmao, I tried blending the images together first and got this
Not what i had in mind
What is "IP Adapter"? ๐
what did the input images look like?
yea, but that's not in SDXL, it's all 1.5 ๐ฆ at least afaik
Yeah I think you're right, don't think it supports sdxl yet from what I'm seeing.
use image prompts in addition or instead of text prompts
interesting
Not sure why it's made this so blurry, but the dress is cool
These are one of Joachim's fire images, but with a prompt of "Woman wearing a long dress"
I feel like I need a meme template for this:
CFG < 3 = Jesus Take the Wheel!
So...which one should I upscale? ๐
here is one that works more like a texture - text promt was "nuclear explosion in the distance" - so the clouds were not in the text prompt - guess with a little more fine tuning that result could be further improved
Hey guys, I've trained a lora SDXL model on my dog and I cannot get quality pictures. I have tried many different combinations of steps and text encoder epochs and nothing works.. I must be mising something..
Any suggestions what I'm doing wrong?
What VAE do you have selected?
would you mind sharing your Comfy Workflow for IPadapter over dm?
I have not done anything with VAE
What is selected, though?
I'm using comfyUI if I add the node load VAE there is no options. If ComfyUI have some default I'm using that. Cant figure out what though ๐คทโโ๏ธ
You have to have the SDXL VAE.
Unless the model has it baked in. Which it doesn't seem to.
thank you, where do I find it?
Thanks, how would I use it ? I'm running on paperspace
I ahve not found any tutorial showing me
Needs to go into the VAE folder, then load it and decode the image with it.
In ComfyUI how do we make a new default for the load default? I can't seem to find it.
Thank you very much brother
I'll post a simple one in here once I'm back at my PC. The main one I use has loads of other extra stuff mixed in.
Where in file structure should I put the VAE folder?
There should already be a VAE folder under the models folder.
Thank you as I am trying to follow vids and yeah, not happening.
make into a gif
has stable diffusion said anything yet on dall-e 3 and it's features?
this could be either the wrong VAE or a wrong combination of sampler and scheduler, DPMs with normal tend to do stuff like that, you have to do karras or another scheduler.
you still need help, shoot me a message
hmm.. trying to figure out, is there a way to preview the first frame of an animatediff workflow? ๐
@vital ermine @icy brook Here's a simple, albeit messy workflow. It's got weighting on both Positive and Negative Text Prompts, so you can turn them off or on or anything in between.
It currently has 2 image slots that both have unCLIPConditioning that can help weighting, these are both fed into the IPAdapter. It also has a crop tool to help position the images. IPAdapter takes a small square of the centre of the image, so if it's not a square image and you don't crop it, you might not get the expected effect.
There's a few custom nodes you can get via the ComfyUI manager. And you'll need the Clip Vision H model for IPAdapter and the Clip Vision G model for the unCLIP.
great. thanks!
And the IPAdapter models as well of course, if you aren't sure about anything just ask.
is lineart ever gonna work for sdxl a1111?
Thank you
ImpactMakeImageBatch is missing but it can't find.
It's part of the Impact Pack, but if you are only planning on using 2 images, you can use the built in "Image Batch" node instead
Weird as I have that pack
try deleting it and then adding it back, it might be acting weird as the node changes depending on how many images you have attached
just delete the node and manually add it back in
loaded now
first one is sdxl, second one is Dall-e
pixar 3d cartoon in the prompt also
here's a better comparison. first sdxl then dall-e
I dunno, I think dall-e is better ๐ค
for this specific art style at least
in realism sdxl rules tho
i'm sure you can get the right one (or something close) from sdxl too
I have a good theory why. It's pixel art, isn't it?
yep ๐
Dalle is a pixel diffusion model, so all the pixels in the pixel art will be the same size
SDXL is latent diffusion, so it's expected that the pixel diffusion model will be better at diffusing pixels
ah, I see ๐
Where could I know Dalle is a pixel diffusion model?