#general-with-images
The real robot, filled with dazzling feathers, has fierce god-ray eyes and vomits rainbows and galaxies, a little evil dragon, wicked to the extreme.
rendering
roodoge
what??
ah, sry, yes
you know the diff between rife 47 and 49?
2
please generate logo on white background for Mental Health Union for Women
Promotional poster to protect marine life
sd
You are funny as hell
sb
SMH. "AI makes everything so easy"
#general-with-images Please help me generate a cartoon IP image of some cute Chinese alligators playing at the dockside
does anybody know how to manually install the image_encoder for ip adapter in a1111
Superwide angle photograph of (a wooden ornate antique animatronic funky musician machine with a mechanical guitar player automaton:1.15) at a carnival, in the style of zoltar automaton has a very uncanny valley look and porcelain skin, automaton is depicted wearing a vibrant steampunk costume including a headscarf adorned with rich patterns and beads, a dark emerald green jacket with golden trimmings, a gleaming golden belt, automaton's eyes are accentuated with punk makeup, suggesting depth and wisdom, the automaton gazes forward with an enigmatic expression, automaton's hands rest the steampunk guitar covered in an ornate gears and shiny bolts and screws raised, surrounding scene is that of a eerie carnival
An astronaut standing in the middle of a mix color of flowers field
daily/hourly reminder that bot is down, cf #1047610792226340935
Help me generate a photo of James playing in Beijing Dongdan
Realistic studio shot of James playing in a Beijing cityscape scene, photorealistic, full render, complete
Add modifications as needed
James is too vague, you need a full name or a tv show for reference, otherwise it will just give you random images that aren't consistent
LeBron James
Here is the image you requested.
Here is the image you requested.
Photos of LeBron James playing basketball in East Beijing
Here is the image you requested.
Here is the image you requested.
Here is the image you requested.
comfyui?
a1111
any tips to keep the upscale from being so messy?
i use forge sometimes but afaik all you need to do is drop the models in the right folder, not sure though cuz i don't use it for ipadapter, only for convenience when generating images on my phone
photograph, athlete walking in sport
ground on a empty ground, raining,
Film Grain, --ar 16:9
i downloaded it and put it in the right place and i get "file may be corrupted"
Looks like a wrong VAE?
dont use euler normal with 20 steps
try dpmpp ancestral, karras, 35
also take out almost all of the neg prompt, sdxl doesnt benefit from that and it can make images worse
cut cfg down to 6 or less
and delete the upscale at the end until the first part is fixed
its also possible your lora is causing problems, set strength to 0 until the image doesnt look burned, then bring the strength back up, start with 0.5, then 0.7, etc
you always want to turn off as much as possible, to simplify, then add one thing at a time until it looks fd, if youre troubleshooting
its a sd 1.5 model though
but i'll try it out
Oops you're right I got it mixed up with ghostxl
But yeah just simplify everything, if I have something that's fd and idk why, I turn off everything, remove the neg prompt, remove all quality descriptors, cut it to bare bones
Hero!
Then fix any settings issues, THEN add one big chunk of the prompt back in at a time, then start working on putting stuff back in
Wrong quote but message is right. I always mess with Comfy
Yay! Congrats!
I stopped trying ... did you use a special documentation?
4090 here ... should work
I wanted to give up and forget about it, but suddenly I saw how the picture began to change in the preview
but now i have cuda out of memory)
i change picture
Maybe you can start your GUI with a --lowVRAM or --MedVRAM argument?
Didn't write it 100% correct ...
It's always trial and error ... just don't give up. I'll try to install it, too
you do it in comfy?
Yes ... I will try ...
I think there's a special face module not for use commercial that causes a lot of problems
i have special model for face
but it not do what show me slownshark
he show me cool hybrids
He is damn good!
you do some good stuff too)
But Comfy I only do to learn a bit more. Usually I use Forge ...
maybe you install it in forge?
Not sure whether it already can do IP Adapter Plus ...
i feel empty now - i tried to install it for two days
Sometimes it can be good to stop work on something and just do something complete different. I often get new ideas that way.
nice way
i have a problem, often i cant stop
lost all energy
and if i get what i want i dont feel happy)
Doesn't sound good ... well you did another step ... maybe time to reward yourself?
She's a bit slim but it kinda worked ...
yes i need found solution for do this better)
Sure you will!
Did you select to change whole picture?
yes
I'll have to try by myself before I want to say something
Maybe the shadows in the original picture make her look thinner?
maybe i use different resolution
A.I. just gave me an animation ... better than I thought but worse at the same time
At least it's walking down the steps ^^
In transform mode
At least I still can laugh ^^
Using 1024*1024 output?
512
Maybe input and output size should fit better?
It's like the face ... slim
But worked!
2024 Best Diffusion Award goes to you
Epic tweet.
IP Adapter Advanced should now run here too
I need help finding old prompts within SD
how use it for hybrid
isn't that the drummer from the rolling stones
sweet, thx for the heads up
glad to see it - it's been the one big thing a1111 has had that comfy's been missing afaik
perfect timing too - now we have all the tools for "prompting" with an image from the conditioning side, right after the new ipadapter plus dropped
prompt: high quality, detailed. input image, then output with: reference_attn, _addain, and _attn+adain
couple ipadapter outputs for comparison
composition weight style
ipadapter? controlnet reference? something else
?
whats all the settings on that image
haven't tried it in a1111
so curious what translates over
full preproc model name, and model name?
(mostly i'm wanting to be able to answer the "can i do this in a1111" questions)
does it work for ya with the sdxl version or is that an OOM situation for you
how much vram you got
some fun results by using a combo of ipadapter and controlnet reference
4 gb
i had ipadapter with composition weights control the steps up to 0.3, then 0.3-1.0 was controlnet ref
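For anyone who wants to picture that handoff, here is a rough sketch of how a 0.3 switch point could map onto step indices. It assumes a 20-step run (the step count is not stated above) and `split_steps` is a made-up helper, not a ComfyUI function. The real nodes take start/end values as fractions of the sampling schedule, so the step mapping is only illustrative.

```python
def split_steps(total_steps: int, switch_at: float) -> tuple[range, range]:
    """Return the step ranges before and after a fractional switch point."""
    if not 0.0 <= switch_at <= 1.0:
        raise ValueError("switch_at must be between 0 and 1")
    boundary = round(total_steps * switch_at)
    return range(0, boundary), range(boundary, total_steps)


# Assumed 20-step run: the first 30% is guided by ipadapter, the rest by controlnet reference.
ipadapter_steps, controlnet_steps = split_steps(total_steps=20, switch_at=0.3)
print("ipadapter steps:", list(ipadapter_steps))
print("controlnet reference steps:", list(controlnet_steps))
```

With these assumed numbers the first six steps go to ipadapter and the remaining fourteen to controlnet reference.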
ahh dang hope you find an intact 4090 sticking out of a trash can sometime soon
i want buy 3060
but i not have bit money
4070 expensive i think
ipadapter its best in controlnet)
i dont want use it before you post your pic
yeah or faceid etc
best for preserving likeness is portraitID
but that does require closer shots
ipadapter style blending
i inpaint my real photo
My machine is rendering like a madhouse at the house
I put about 64 things in the queue before work
Now Im trying to render out a cricket, when the cricket rubs its legs… colors go flying.
I guess… alien bug works too lol
39.057317, -77.124324 I think I saw one here
more mutants i see
it's not a mutant, it just twisted her
you should post the actual output!
very cool seeing the input too though
need sleep after 4 hours on work 🤭
you need to get up for work in 4 hours?
wow
seriously... this ipadapter... gonna get one of these nodes tattooed on my forehead
hah
sd really likes to have umbrellas as hats.
sometimes it gets it accidentally right
how cool would that be. some way to have an sd rendered image, where they're holding up a sign or have a tattoo on their head, but it's the image you've loaded into a node, perfectly inlaid.
using attention masks and timestepping with a second style adapter to frame a shot
nowwww we're talking.
i drop the second style adapter during the upscale to ensure the details are coherent
Oh cool, this looks like one of those artworks you see at hotels
what i'm really liking is this is providing some ability to subtly guide the composition
might wanna drop the .json too with the image
there's also a node that allows you to embed it in the workflow image
Ill drop it when I get home
Wooow, that seems so difficult
this is what ppls charts look like when they trade stocks
Awesome, is this channel dominated by you?
niu bi
at times
It's hard to imagine what people will say when they know about this in the future. There is a group of people on the channel who answer people's questions and help people who don't know how to get what they want in their minds. somewhat great
@hazy warren
Anyone got an idea of what small lora I can train? (15-30 images)
id say if you want to train anything worth it, needs to be at least 500 images min
yeah, just saying, that was a prompt only, so you can easily do it
you don't need 500 images
i've trained some pretty good ones with 40 or 50 or so
quality over quantity
variety and captioning and quality over quantity
super sampling?
what are you referring to exactly
the thing where it can train on the same thing multiple times, I forgot what they call it
you don't need to do anything weird
loras just don't need tons of images
a couple dozen is all you need
oh I see
if you go overboard with the training too (too many steps) it'll usually start to look "Burned" and get too stiff
start only replicating the training data
I was interested in that "Train" tab, I might have to try it out some time.
Last time i trained a lora I did it with mj images but I forgot to check them correctly...and ALL of them had blurry backgrounds
these are the results, well it could be worse
train it on ya cat
this a good tutorial?
https://www.youtube.com/watch?v=yPOadldf6bI
OneTrainer Stable Diffusion XL (SDXL) Fine Tuning Best Presets : https://www.patreon.com/posts/96028218
The Very Best OneTrainer Workflow & Config For SD 1.5 Based Models DreamBooth / Full Fine Tuning :
https://www.patreon.com/posts/97381002
Full Workflow For Newbie Stable Diffusion Trainers For SD 1.5 Models & SDXL Models Training With DreamB...
there's a good chance it is
i don't watch videos lol
i lack the patience
yeah i haven't watched a single tutorial ever
so i have no idea
i can give you my training parameters though if you like
kinda like comfy... you can just import it and go
i think it's way easier to learn something starting with something that works
I like this.
do some gum tree's
yes plz
put wheres waldo in the background
now make them Kermit
ok.... you win lol
Not an apple ^^
V1 - LoRA V2 - LoHA. Feeling generous? https://www.buymeacoffee.com/generalawareness
green grass
blue sky
I made my first Lora and wanted you guys to be the first to know. I was motivated by an Original Character that i made for my manga, so I made a Box Braid hairstyle specific lora with great results. check it: https://civitai.com/models/381076?modelVersionId=425404
already working on V2
/game_stats A series of display boards and cabinets are arranged within an industrial museum, rendered in the style of Cinema 4D to evoke a sense of historical depth and the passage of time. The materials featured include wood, stone, brick, and concrete, with a color palette dominated by pale amber, red, and touches of black. Exhibits on display encompass miniature industrial models, photographs, and historical documents, among other artifacts. --ar 4:3
someone asked whether this would work ....
#general-with-images Brandon Jiang is a tall and handsome CEO with a commanding presence. He possesses deep, brown eyes that exude determination and confidence. Clad in impeccably tailored black suits, he emanates an aura of authority that is both intimidating and captivating. His long, slender fingers adorned with a sophisticated wristwatch reflect his attention to detail and refined taste. With a strong nose and a sturdy jawline, he carries himself with self-assurance and decisiveness. Despite his outwardly tough demeanor, Brandon harbors a gentle and compassionate heart, showing unwavering dedication to both his career and loved ones.
a cat looking sky
Here is the image you requested
Here is the image you requested.
I like the soccer ball foot lol
its pretty cool but the slight jitter makes it feel like the buildings are morphing a bit... you'd have better luck generating the image then using another video editing software to pan thru
้ปๆๅ็ๅฎ้่ขซๆ็ ด๏ผๅคฉ็ฉบไธญ็ๆๆ้ๆธ้ๅป๏ผไธ่ฝฎ็บขๆฅ็ผ็ผๅ่ตท๏ผๅธฆๆฅ็ฌฌไธ็ผๆธฉๆ็้ณๅ ใๆๅทฅไบบ็ๆฟ้ดๅ ๏ผ้น้็้ๅฃฐๅจ้่ฐง็็ฉบๆฐไธญๆพๅพๆ ผๅคๅบ่ณใไปไปๆททไนฑ็ๆขฆๅขไธญๆ้๏ผ็ก็ผๆฆ่ง๏ผๅฟไธญๅ ๆปกไบๅฏนๅณๅฐๅผๅง็ไธๅคฉ็ๆๅพ ไธไธๅฎ๏ผๆๅฅฝ็่ดจ้๏ผ8k
#general-with-images a space particle dispersed in a medium
Here is the image you requested.
particles not a creature 
no sir particles
yes yess this is great
can u generate a 16:9 ratio image with more particles and blurred edges of the images
Cthulhu Carwash. Awesome!
@nimble mason I'm hoping to make something like the white image in the style of the sketch image
(Not including writing)
here's the output using the sketch as a style, and basically a blank prompt (just "high quality, detailed") with a generic negative ("umbrella, NSFW, low quality, bokeh, blurry, low detail, text, watermark")
there's a lot you can do to play with it to get closer adherence, and the text prompt does remain important
Wdym by using the sketch as a style?
here's using the sketch as a style input, and the white image as a composition input
this is using ipadapter plus in comfyui
I like how the style has transferred, but is there a way to get it to recreate the original more accurately?
and is this for SDXL?
I think uploading stuff to discord doesn't make em better. Making this effect with another video editor would be the better idea. But I experiment with SVD and you never know what you will get
yeah, sdxl
SDXL
yeah, that's possible if you use controlnet as well
input sketch, output, and style reference, in that order (for anyone wondering what you can do with ipadapter plus nodes in comfyui right now. afaik there's no style transfer ipadapter available in a1111/forge, but if i'm wrong about that, lemme know!)
what svd settings are you using? at this point animatediff does a better job, although it also loses more of the original image that I loaded.
6FPS, 25 Frames, 135 Motion Bucket ... I optimize afterwards
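Quick arithmetic on those numbers: frame count divided by frame rate gives the clip length. A tiny sketch (`clip_seconds` is just an illustrative helper, not part of SVD or ComfyUI):

```python
def clip_seconds(frames: int, fps: float) -> float:
    """Length of a clip in seconds for a given frame count and frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frames / fps


# 25 frames at 6 FPS, the settings quoted above.
print(round(clip_seconds(25, 6), 2))  # 4.17
```

So 25 frames at 6 FPS is a bit over four seconds of video before any interpolation or other post-processing.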
what's your augmentation level?
Still at 0, don't know what it does
Ty for all the help, much appreciated
so that worked out well. augmentation level is how much noise it'll add. so if you say 0 , which is what it's at for this pic, it'll usually just be some kind of pan or orbit rotation, the subject won't move.
wow! nice
I never really know what it will do
big takeaway is... is that what you were looking for?
cuz if so, you can do it in comfyui, and the workflow is embedded in that image so you can just click and drag it into comfyui and do it with other images
It's exactly what I was picturing
That sounds ideal
good to know... yeah, pretty cool what you can do now
that style transfer ability is next level
I'm willing to accept that it won't do much, but it takes a long time to render and not knowing what it'll do for you is frustrating. lots of wasted time on stuff that comes out bad for 1 good one.
80% acceptable, 30% good
acceptable includes good
yeah... so much of this...
whereas the animated diff stuff comes out 80% good, but because it always has to go through an sd 1.5 model, the original image is changed a good amount. the models I used in sdxl aren't available in 1.5, even similarly
Augmentation Level 0.5 without optimization ...
Augmentation Level 2.0 without optimization ...
Looks like 0 is pretty good
haha yeah
put your pic in adapter)
Nice, too!
i use only pose biden
Knock him out!
nah, biden is a lover not a fighter.
this ipadapter thing is pretty neat. adding a little bit of prompt can make actions happen between source images.
his face very close , when i do that his face not him
did you call him out in the prompt? It wasn't close with ipadapter until I said it was biden.
yes i call,but i have not full face
Looks like I've ordered my SSD at the right moment ...
that merge really does have great prompt comprehension. didn't get that output for wading through congress in anything else i just tested
@nimble mason so this was pretty cool. took that 2 image ipadapter thing that was kind of doing 50/50 regional prompting with ipadapter and 2 images, but instead taking one and just plopping it into the middle of an existing image. so super sayan head on top of a wolf in sheep's clothing.
what's the 50/50 one you're referring to? was that one we were talking about the other day?
btw... the style transfer ipadapter with attention masking is very, very good with the otherwise weak and crappy qrcodemonsterXL controlnet
@nimble mason by 50/50, it's that one that was going around off the internet where it was just a 50% mask, so you could add one subject next to another. I changed the mask so that it was a square in the middle of the image instead of half the image
oooh that's cool.
i never had good luck with that stuff. island in shape of a text mask or whatever.
❤️ Taiwan
Gotta love Taiwan, it's where all the nvidia hardware comes from.
Thank god I already ordered my new SSD ... prices could explode ... and they are friendly people ...
btw, highly recommend using ipadapter advanced so you can toy with the other weights
i'm finding weak output to be pretty useful too now
SEGMoE
with the fully real model instead.
@tired basin here's an image I made with segmoe lol
Watching Star Trek Picard at the moment
source image for that? nice one
Hmm. The Rock is Bravestarr
One of these ... can't find original ... ^^
Here it is ...
thx, yeah that's a really cool one
My workspace
I mean if it can still pump out complex images such as this I won't be complaining
its still a massive improvement to me
hahahahah
oh i want to tell you the story but they're wrong
t5 is using its full ability
it has 512 prompt afaik
really? so it DOES have 512 context? no cap?
afaik? yes
can you make sense of "because of bootstrapped clip compatibility"
no, because they arent making sense
lmao
oh they mentioned that?
tldr: they got booted from sd3 for purely arguing that "sd3 needs to be dalle3"
they are the only person as of now that has been booted.
they also "made a mmdit implementation a year ago" that totally exists and wasnt made up
yeah, basically
epic
tldr: "noooo!!!! sd3 not dalle3!!!!!"
lol
thank you, you have once again revived my interest in SD3
also got 24GB so I can do highres with bigger batch sizes
and use controlnets or whatnot
also, the flaw they mentioned about "cogvlm changing screws to nails"?
it doesnt exist.
i just tried it
LMAO
and it did make nails
you tried cogvlm?
they trained on half cogvlm captions
yeah and THAT TOO
^^^^^ yesssss
it does
its good that it's only half of the dataset
so pop cultural stuff stay which might not be recognized by cogvlm
they could do 100% but then we'd lose out on small prompting or direct name things
I wonder if Ideogram did a higher ratio of auto captioned dataset
I just don't know why but the same prompt without the assistance of Magic Prompt makes nearly the same composition every time
yeah I saw the examples of cogvlm and its really advanced and high quality
isn't it like ~30B too?
oh 17B
CogVLM-17B and CogAgent-18B
That sounds gooooood
yeah they used this to caption SD3's dataset
Still no beta for me
I don't think anyone from the waitlist has been invited yet
They have added a few more people on twitter. I noticed at the beginning of the month that a few who have it now mentioned they're still waiting. And now they're posting sd3 pics
I hope one day a huge pile of people get access and we all start engagement in this server and on the subreddit
I just wonder if the reception will be highly negative or neutral
Well I think it's just people on the AI industry
I'll be happy with it because it means we got our hands on it, and no matter what happens with stability after that, we still have the engine
And I like cascade, people will make models for it
Unlike cascade
yeahh
I will be super positive about SD3
oh like I mean with the early access where we generate images only
then after we get SD3 Code and Weights then of course it'll be the end game
and no matter what happens with stability after that, we still have the engine
exactly this, we will be able to enjoy more complex images forever
and if we're missing something people will just make loras or massive finetunes
I just hope that 24GB will be enough to train loras
I hope I'll still have a benefit from my early training
We just need to stop our current prompting habits
its not like current style prompts don't work, it's just that they might not be as good
Superprompt or other instruct LLMs will help a lot
given that 24gb is realistically the limit on consumer GPUs and that Loras are the main way to extend the model, I'd expect it to be possible, just possibly slow
well its not stopping the big boys from training with their a100s and etc
@jovial tiger SegMOE
my first image! lol
here's my output
nah, not the right one
alright now try a prompt that includes "batman laughing maniacally with a mouth that looks like a shark mouth with long pointy teeth"
he's meant to look like this
f yeahhhh
you're gonna have so much fun with this
you'll prolly fail out of school and get fired as a result
do you have comfyui manager installed?
do you see the manager button
im 45 lol
yah i dont have it
ahh k lets get you that
this is critical
if you load a workflow from someone, you'll often not have the nodes they have
they'll be red
then you can just go into manager, click install missing custom nodes, and restart comfy and you'll be set
i typed git clone https://github.com/ltdrdata/ComfyUI-Manager.git but it errored out
errored out? as in no connection?
git is not recognized as the name of a cmdlet
oh you need git
install git?
yeah
the command git doesn't exist until you get that installed
won't be able to add any custom nodes without very clunkily downloading them manually from the repo without having git
ok installing now all on default recommendations
sweet
damn didnt work still
same error
or do i run everything in git cmd?
yah that worked
k you got it?
if it doesn't recognize it in the future, might need to add the path manually to cmd.exe
yah now do i close and reopen comfy?
yeah
and then you'll have the manager
and i can drop some workflows here for you to load that'll get you some of the best custom nodes
i closed and reopened but manager not there
are you using the portable comfy?
so if you open up your ComfyUI/custom_nodes folder, do you see it there
ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-Manager
should be in there
yah not there
k so it got cloned into the wrong folder prolly
i'd just follow the instructions directly now
launch a new cmd.exe window if you haven't
you'll have a random copy of comfy-ui manager somewhere on your hd just chillin but that's cool lol
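To spell out the "wrong folder" problem: `git clone` drops the repo into whatever folder the terminal is currently in, so the clone has to be run from inside `custom_nodes`. A small sketch that only builds the folder and command without running anything. The `C:\ComfyUI_windows_portable` root is an assumed example location and `manager_clone_plan` is a made-up helper; the repo URL and folder layout are the ones mentioned above.

```python
from pathlib import PureWindowsPath

MANAGER_REPO = "https://github.com/ltdrdata/ComfyUI-Manager.git"


def manager_clone_plan(portable_root: str) -> tuple[PureWindowsPath, list[str]]:
    """Return the folder to run git from and the clone command to run there."""
    custom_nodes = PureWindowsPath(portable_root) / "ComfyUI" / "custom_nodes"
    return custom_nodes, ["git", "clone", MANAGER_REPO]


# Assumed install location; replace with wherever the portable build was unzipped.
cwd, cmd = manager_clone_plan(r"C:\ComfyUI_windows_portable")
print(f"cd {cwd}")
print(" ".join(cmd))
```

Running those two printed lines in a fresh cmd.exe window should leave a `ComfyUI-Manager` folder inside `custom_nodes`, which is where ComfyUI looks for it on startup.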
ok manager is now in the folder
k cool now it should work
yah but it didnt
wtf
after closing comfyui completely, including the console cmd.exe window, it's not there?
its working now!
yay!!!
cool lemme get you a workflow
click on this, click open in browser
then drag into comfyui
might take a sec but lemme know if a workflow pops up
yes it's a mess lol
bunch of red everywhere lol
install missing custom nodes, then restart comfyui
it will take a bit
but these are all really handy ones i use nonstop that you'll def want
and will save you lots of time hunting
yeah, usually it downloads and installs stuff when it restarts if you just added nodes
good habit to get into... if you don't hear your comp buzzing when you run a workflow
yah i see cmd putting along
check the console window... it's likely downloading a model or something you were missing
man this is a lot of shit lol
lol
this is way over my head lol
geez lol
am I going to have to do this all over when version 3 comes out?
oh gawd
ok now what lmfao
hahahaha
now i clean up the workflow and send it to you again so you can actually understand wtf is going on kinda
yes please lol
i just tossed you what i had up so you could get it downloading
still cleaning it up? lol
yup almost there
you'll have fun with this
oh, you prolly don't have that sampler, but you can change that from res_momentumized to dpmpp_2m_ancestral or whatever too
but load some images in the load image nodes that you think are cool that have a style you want to imitate
and also try changing the weight type in the node on the top right
those were my inputs
wait u want me to do this now?
yeah why not go for it
this is a brand new feature in comfy, one of the best yet imo
node in top right i dont see weight type
so if this works you're at the cutting edge
no prob!
how long have u been using this?
a couple months
damn seems way longer than that
discovered stable diffusion about 4 months ago
i discovered it like a year ago but never messed with it until today
you can call yourself an expert in my books
yeah once you get over that first hump to be able to make shit that's actually really, really cool, it's worse than drugs lol
lmfao
yah im in a crypto project called print and I like to make images for them and tweet them out on x
i guess i'll have to photoshop text in
i was hoping v3 would do that but not sure now
prior to this it was bing and chat gpt to make the images lol
ah cool
yeah text is iffy unless you use it pre-generated and build on it
this was the input
did it make that or did u build on it?
used the black and white one as a control image
oh shit thats awesome!
anyone knows what model this is, apparently its really popular in the community rn
lik everywhere
where would i load the control image into?
on pinterest, i just dont know the name, idk if this is the wrong place to post
the first node?
oh god that one is a disaster
the cleaned up one isn't quite able to do that
here's a version of the cleaned up one that also upscales btw
should i use this one now? lol
yeah, the upscale can be a bit slow though
i generally recommend just leaving the upscale not hooked up... just disconnect one of the wires to the second ksampler
i have a separate upscaler
gotta love the pixar eyes
then if you really like an image, reduce the seed by 1, so it runs with the same seed again, reconnect the wire you disconnected, and run
so where do i load the control img?
with the cleaned up workflow?
so I was out at the grocery store when i saw the segmoe stuff. takes bits and pieces from various models and renders the result in realtime?
this discord has added a few min to some of my grocery trips...
I actually have no idea. All I know is that it's a Mixture of Experts...
I've done a couple of model merges lately to end up at this artistic/dark/slightly pixarish view of everything. it's why everything I post looks like that.
no idea what it means for Diffusion models exactly
I think I could actually merge the darkarts and proteus model to make a 2x1 segmoe
but no pixar lora though, unless I merge it into one of the models before hand
ok so the first node correct?
so I did that merge, but then I found that the new version of paincreator's ai creator artistic v15 or some such is incredibly prompt following, even beats proteus which is a tall order. so now i just use that and andrea75c's cute 3d render lora merged in ata 0.5 with no clip from it for the final output. the difference in prompt following with complex stuff is incredible.
yeah, those two images above are the inputs
so now i couple that with regional prompting, and voila.
you should be able to see where they go with the screencap of the workflow
so it's actually borrowing from both images
can you link that model please
ok i need to mess around with this youll probably see me tomorrow with a shit load of questions lol
again thank you so much man. I really appreciate you going out of your way to help me
all three of those are amazing
thanks
no prob enjoy the shit outta it
just experiment like crazy
so aicreator is the smartest?
these images that we drop into it are they just images?
don't get caught up in trying to do things perfectly, cpu & gpu time are basically free for ya so just hammer out all kinds of crazy tests and see what happens
yeah
most formats work
like how does it know to do these crazy node setups
comfyui?
yah
it's just the programming
so if i drop any image the node will change?
it's open source so if you wanna dig into that uncommented mass of code...
lol
oh, what is shown will, yeah
you can also ctl-c an image outside comfy
then left click on the background in comfy, hit ctl-v and you'll get a new load image node
mindblown
if you click on the load image node, then hit ctl-v, it'll replace the image
can be a lot more convenient... recommend cruising google images or deviantart or whatever and just pasting stuff in
seeing what ya get
if i like the node setup i just hit save then just click on the json file to load it?
save will save the workflow, yes
and every time you generate an image, if you're using the save image node, it embeds the workflow in it
so you never lose a workflow
you can recreate anything you make
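For the curious, a minimal sketch of how a workflow can ride along inside a PNG: PNG files allow text chunks next to the pixel data, and the sketch below writes and reads one using only the standard library. It assumes the workflow JSON sits in a `tEXt` chunk under the key `workflow`; the tiny demo PNG, the demo JSON, and the helper names are all made up for illustration and are not ComfyUI code.

```python
import json
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(kind: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: length, type, data, CRC."""
    crc = zlib.crc32(kind + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + kind + data + struct.pack(">I", crc)


def make_demo_png(text: dict[str, str]) -> bytes:
    """Build a 1x1 grayscale PNG carrying the given tEXt entries."""
    ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
    pixels = zlib.compress(b"\x00\x00")  # filter byte + one gray pixel
    body = _chunk(b"IHDR", ihdr)
    for key, value in text.items():
        payload = key.encode("latin-1") + b"\x00" + value.encode("latin-1")
        body += _chunk(b"tEXt", payload)
    body += _chunk(b"IDAT", pixels) + _chunk(b"IEND", b"")
    return PNG_SIGNATURE + body


def read_text_chunks(png: bytes) -> dict[str, str]:
    """Collect every tEXt key/value pair found in a PNG byte string."""
    if not png.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(png):
        length, kind = struct.unpack(">I4s", png[pos:pos + 8])
        data = png[pos + 8:pos + 8 + length]
        if kind == b"tEXt":
            key, _, value = data.partition(b"\x00")
            found[key.decode("latin-1")] = value.decode("latin-1")
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC
    return found


# Made-up workflow JSON standing in for a real node graph.
demo_workflow = json.dumps({"nodes": [{"type": "KSampler"}]})
png_bytes = make_demo_png({"workflow": demo_workflow})
recovered = json.loads(read_text_chunks(png_bytes)["workflow"])
print(recovered["nodes"][0]["type"])
```

Since the graph travels as metadata next to the pixels, anything that re-encodes the image or strips metadata would presumably drop the workflow too.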
fire!
yup, best feature imo
alright see yall later i gotta go. Thanks again!
i know myself, soul, festivalman (when using comfy, he uses a lot of a1111 with LLM bots... some fancy stuff), and others leave their workflows in their images here
later
composition, two style inputs, output
and fyi if any of your images show up in my inputs here, it means i think it's straight fire
so i'm looking at this example directory. which one is just straight style transfer? I just want one image input and a prompt.
but i want just style, not composition.
idk but i'll get you one in a sec here
hah ok that works too.
sure
input, output: "a car driving in a city"
honestly, i think the style transfer is the crown jewel of all of this
i don't see any ip adapter nodes in this one? is it something else?
IPAdapter embeds?
look at the one with the car
def recommend using res_momentumized for that first stage
i get better style transfer
yeah this is great stuff
this was actually what i was originally looking for when i hit the composition one.
gotcha
that's really cool ^^ but what really impresses me is how well it translates to different subject matter
yuuuup
that's when i lost my shit
hm, no luck using this as a style input
cool output, but not engraving style
dpmpp_2m exponential... great stuff, just not engraving style
smackalishes dick and rice
with juggernautxl instead of fullyreal... wow does that look great
curly dick fries with ketchup
spring dancing dick jennie with a chili dick chenie
dickatronisanating sexology
telepathy dicks to a higher level of teleportation sex topic
decrepit dick chooser
the omega slopethen sex creatures
@nimble mason something of note. in the one you sent, it did the render, then 0.5 denoise with 1.5 upscale. I added a 3rd and as you can see, it didn't actually become prompt adhering until the 3rd one which is weird.
that's weird... is that something that applies with other fringe prompts?
Here is the image you requested.
gonna try a different style source and see if it still applies.
one thing i'm excited about with this is it frees us from needing to use any tokens to describe style
so i switched everything back to dpm++2m karras again at 20 steps, x3. with 1.5x scaling each time. keeps the style and content of the image. r_momentum ends up with a wildly different and messed up image at the end compared to the first in the chain.
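To get a feel for what chaining three passes with 1.5x scaling does to image size, here is a small sketch. It assumes a 1024x1024 first render (the starting size is not stated above) and that the upscale happens between passes, so three passes means two upscales. `chained_sizes` is an illustrative helper only.

```python
def chained_sizes(width: int, height: int, scale: float, passes: int) -> list[tuple[int, int]]:
    """Image size at each sampler pass when every later pass upscales by `scale`."""
    sizes = [(width, height)]
    for _ in range(passes - 1):
        width, height = round(width * scale), round(height * scale)
        sizes.append((width, height))
    return sizes


# Assumed 1024x1024 base render, three passes, 1.5x between passes.
for index, (w, h) in enumerate(chained_sizes(1024, 1024, 1.5, 3), start=1):
    print(f"pass {index}: {w}x{h} ({w * h / 1_000_000:.2f} MP)")
```

Each 1.5x step multiplies the pixel count by 2.25, so the later passes get slow quickly.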
yes. res_momentumized makes huge changes
use exponential too if you want more stability when upscaling, it makes a big diff
if you want even more stability, unsample then sample with kadv with add noise off
either version of restart can make big changes
ok tried exponential with res_momentumized, now he's growing a face from his chest.
it's the added noise at each step that's the issue.
you gotta dial that denoise waaaay down with that
i've found dpmpp_3m_gpu with exponential at 50% denoise is pretty good at not letting things get wacky
I really need to be able to get those 2m and 3m_gpus working via api. they're the ones that don't for some weird reason.
that's lame
res_momentumized works though
yeah dpmpp_3m_sde_gpu is way better at maintaining composition and tone than dpmpp_3m_sde
it's the best out of all of them with exponential that i've tried
i wasn't really good about keeping records but i think i tried all combinations at some point
so that worked. res_momentumized with 0.35 denoise and exponential
awesome
mmmm....sexy
these are settings that i've found work reasonably well in general
but it's something to throw in if you want 50% denoise after upscaling
so awesome lol
midjourney has some really great styles at times. being able to drop them in on these is pretty great.
we should get a lil collection going somewhere
i've been trying to pick out images of my own that have led to particularly good outputs with the style weights
yeah, it is more than just style btw with this.
one on the left is from midjourney, one on the right, there's no water in the prompt, but it's getting it from the style image.
although I guess it's really just stuff that you didn't mention. it's still massively less than the composition workflow.
In a bustling metropolis, skyscrapers towered into the sky, while crowds bustled through the streets and traffic flowed like a river. Suddenly, a colossal Godzilla emerged in the heart of the city, towering high into the clouds, its muscles bulging with power and authority. Its skin was covered in tough scales, its eyes gleaming with wildness, and its roar thundered through the air, deafening all who heard it. Godzilla's massive claws crushed through streets, buildings crumbled beneath its feet, and smoke billowed as flames erupted. Terrified citizens fled in panic as police cars and fire trucks rushed to the scene, attempting to stop the impending disaster. Yet Godzilla remained fearless and mighty, commanding everything around it like a sovereign ruler, demonstrating an unmatched aura of intimidation and strength.
This scene portrays Godzilla's ferocity and power as a monster, as well as its conflict and confrontation with the human world.
i bet if you combine multiple style embeds, images of the same style, with different settings... that will disappear
kinda like training a lora, need variety or it'll tend to carry over more from the training set in terms of composition
do you have a MJ sub?
would be interesting to make a lil set of some stuff of a given style to test that
even if it's just half a dozen images
hmmmmm. never used it so i'm not sure how it works with keeping a style consistent. is that part of the prompt like SD, or a special tag?
Here is the Sharkzilla you requested.
@nimble mason https://www.midjourney.com/explore?tab=random
is this available to you? you can just grab the already generated ones for styles as well
ooo
hmm pretty diff though
i guess maybe the style options is something deeper than what i'm thinking of (watercolor vs. oil painting vs. photograph vs. monochrome photograph vs. engraving etc... and color palette, etc)
i'm just going through old prompts with these style images, and it's giving wildly different stuff than the original.
now you're gonna end up having to incorporate ipadapter plus into your bot haha
it's just too good
seriously who doesn't want more of this? this is with that blue-high tech shopping cart in that 10 set before